BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 016713
         (384 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|18421151|ref|NP_568500.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis
           thaliana]
 gi|14334592|gb|AAK59474.1| unknown protein [Arabidopsis thaliana]
 gi|26983902|gb|AAN86203.1| unknown protein [Arabidopsis thaliana]
 gi|332006336|gb|AED93719.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis
           thaliana]
          Length = 472

 Score =  566 bits (1459), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 273/378 (72%), Positives = 321/378 (84%), Gaps = 2/378 (0%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQR--LASLDIFRGLAVALMILVDHAGGD 58
           M+EIK E +H   L+  + D S    +  L   R  LASLDIFRGL VALMILVD AGGD
Sbjct: 1   MAEIKVERSHDQHLLEPKEDTSSSYTRRSLAGNRPRLASLDIFRGLTVALMILVDDAGGD 60

Query: 59  WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGIL 118
           WP I+HAPWNGCNLADFVMPFFLFIVGV+IAL+LKRI ++ +A KKV FRT KLLFWG+L
Sbjct: 61  WPMIAHAPWNGCNLADFVMPFFLFIVGVSIALSLKRISNKFEACKKVGFRTCKLLFWGLL 120

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
           LQGGFSHAPDELTYGVDV M+R CG+LQRIALSYL+V+LVEIFTKD  +++ S GRFSIF
Sbjct: 121 LQGGFSHAPDELTYGVDVTMMRFCGILQRIALSYLVVALVEIFTKDSHEENLSTGRFSIF 180

Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
           + Y WHW++AA VLV+YLA LYGTYVPDW+F + +KDS  YGK+ +V+CGVR KLNPPCN
Sbjct: 181 KSYYWHWIVAASVLVIYLATLYGTYVPDWEFVVYDKDSVLYGKILSVSCGVRGKLNPPCN 240

Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
           AVGY+DR+VLGINHMYHHPAWRRSKACT DSP+EG +R+DAPSWC APFEPEG+LSS+S+
Sbjct: 241 AVGYVDRQVLGINHMYHHPAWRRSKACTDDSPYEGAIRQDAPSWCRAPFEPEGILSSISA 300

Query: 299 ILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
           ILSTIIGVHFGH+I+H KGH ARLK W++ G  LL  GLTLHFT+ +PLNKQLY+ SY+C
Sbjct: 301 ILSTIIGVHFGHIILHLKGHSARLKHWISTGLVLLALGLTLHFTHLMPLNKQLYSFSYIC 360

Query: 359 VTSGAAALVFSAIYALVD 376
           VTSGAAALVFS++Y+LVD
Sbjct: 361 VTSGAAALVFSSLYSLVD 378


>gi|224125166|ref|XP_002319516.1| predicted protein [Populus trichocarpa]
 gi|222857892|gb|EEE95439.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  563 bits (1450), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 265/377 (70%), Positives = 318/377 (84%), Gaps = 4/377 (1%)

Query: 1   MSEIKAETTHHHPLIISE-PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW 59
           M+EIKA+    H L I+E  D+S Q+    +   R+ASLDI+RGL VALMILVD AGG+W
Sbjct: 1   MAEIKADIALDHRLTIAEVTDISAQKPDPKI---RVASLDIYRGLTVALMILVDDAGGEW 57

Query: 60  PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILL 119
           P+I HAPWNGCNLADFVMPFFLFIVG+AI LA KRI  R  AV++VI RTLKLLFWGI+L
Sbjct: 58  PKIGHAPWNGCNLADFVMPFFLFIVGMAIPLAFKRITSRHHAVRRVIVRTLKLLFWGIML 117

Query: 120 QGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR 179
           QGGFSHAPD+LTYGVD++ IR CG+LQRIA +YL+V+L+EIFTK  Q ++   G  SI++
Sbjct: 118 QGGFSHAPDKLTYGVDMKKIRWCGILQRIAFAYLVVALMEIFTKKKQTRELPPGWLSIYK 177

Query: 180 LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNA 239
           LY   WLM AC+LV+YLA++YGTYVP WQFT+ ++DSADYGKVF V C VR KL+PPCNA
Sbjct: 178 LYSSQWLMGACILVIYLAVIYGTYVPHWQFTVNDRDSADYGKVFTVECAVRGKLDPPCNA 237

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
           VG+IDR++LGINHMY HPAW+RS+ACT++SP+EGP R  APSWC APFEPEG+LSS+S++
Sbjct: 238 VGFIDREILGINHMYQHPAWKRSEACTENSPYEGPFRTSAPSWCKAPFEPEGILSSISAV 297

Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           LSTIIGVHFGHV+++ +GH ARLK W+ MGFALLI GL LHFT+AIPLNKQLYT SYVCV
Sbjct: 298 LSTIIGVHFGHVLVYMRGHAARLKHWIVMGFALLILGLVLHFTHAIPLNKQLYTFSYVCV 357

Query: 360 TSGAAALVFSAIYALVD 376
           TSGAAALVFS+IYALVD
Sbjct: 358 TSGAAALVFSSIYALVD 374


>gi|297812935|ref|XP_002874351.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320188|gb|EFH50610.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score =  556 bits (1433), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 258/357 (72%), Positives = 308/357 (86%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           ++  ++  +    QRLASLDIFRGL VALMILVD AGGDWP I+HAPWNGCNLADFVMPF
Sbjct: 3   EIKVERSLAGNNRQRLASLDIFRGLTVALMILVDDAGGDWPMIAHAPWNGCNLADFVMPF 62

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           FLFIVGV+IAL+LKRI ++ +A KKV FRT KLLFWG+LLQGGFSHAPDEL+YGVDV M+
Sbjct: 63  FLFIVGVSIALSLKRISNKFEACKKVCFRTCKLLFWGLLLQGGFSHAPDELSYGVDVTMM 122

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R CG+LQRIALSYL+V+L+EIFTKD+ +++ S GR SIF+ Y  HW++   VLV+YLA L
Sbjct: 123 RFCGILQRIALSYLVVALIEIFTKDLHEENLSTGRLSIFKSYYCHWIVGVSVLVIYLATL 182

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
           YGTYVPDW+F + +KDS  YGK+ +V+CGVR KLNPPCNAVGY+DR+VL INHMYHHPAW
Sbjct: 183 YGTYVPDWEFVVNDKDSILYGKIQSVSCGVRGKLNPPCNAVGYVDRQVLVINHMYHHPAW 242

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL 319
           RRSKA T DSP+EG LR+DAPSWCHAPFEPEG+LSS+S+ILSTIIGVHFGH+IIH +GHL
Sbjct: 243 RRSKAFTDDSPYEGALRQDAPSWCHAPFEPEGILSSISAILSTIIGVHFGHIIIHLQGHL 302

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ARLK W++ G   L  GLTLHFT+ +PLNKQLY+ SY+CVTSGAAALVFS++Y+LVD
Sbjct: 303 ARLKHWISTGLVFLTLGLTLHFTHLMPLNKQLYSFSYICVTSGAAALVFSSLYSLVD 359


>gi|359487632|ref|XP_003633626.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
           N-acetyltransferase-like [Vitis vinifera]
          Length = 499

 Score =  543 bits (1400), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 269/367 (73%), Positives = 308/367 (83%), Gaps = 2/367 (0%)

Query: 10  HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
           + H LIIS+     ++     KT+RLASLDIFRGL VALMILVD AGG+WP I HAPWNG
Sbjct: 41  NQHRLIISDSGFPPEERPQ--KTKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWNG 98

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           CNLADFVMPFFLFIVGVAIALALKRIPDR  A+KKV  RTLKLLFWG+LLQG F+  PD+
Sbjct: 99  CNLADFVMPFFLFIVGVAIALALKRIPDRLMAIKKVTLRTLKLLFWGLLLQGSFTQDPDK 158

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
           LTYGVD++ IR CG+LQ IAL+YL+V+L+EI TK  Q KD S G+FSIF+LYCWHWLM A
Sbjct: 159 LTYGVDMKKIRWCGILQXIALAYLVVALLEITTKKAQAKDLSPGQFSIFKLYCWHWLMGA 218

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
           CVL+VY+A+ YGTYVPDW FT+ ++DSADYGKV  V CG R KL+PPCN VGYIDR++LG
Sbjct: 219 CVLIVYMAVSYGTYVPDWHFTVHDRDSADYGKVLTVACGARGKLDPPCNVVGYIDREILG 278

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
           +NHMY HPAW RSKAC + SP +GP RKDAPSWC+APFEPEG+LSS+S+ILSTIIGVHFG
Sbjct: 279 MNHMYQHPAWTRSKACNEYSPDKGPFRKDAPSWCYAPFEPEGILSSISAILSTIIGVHFG 338

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           HV++H KGH  RLK WV MGFALL+ G+TLHFT AIPLNKQLYT SYVCVTSGAAALVFS
Sbjct: 339 HVLMHLKGHSDRLKHWVVMGFALLVLGITLHFTGAIPLNKQLYTFSYVCVTSGAAALVFS 398

Query: 370 AIYALVD 376
             Y LVD
Sbjct: 399 FFYILVD 405


>gi|242059773|ref|XP_002459032.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor]
 gi|241931007|gb|EES04152.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor]
          Length = 481

 Score =  535 bits (1379), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 246/357 (68%), Positives = 299/357 (83%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D +D  EK+  +++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPF
Sbjct: 31  DEADDNEKAPRRSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPF 90

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           FLFIVG+AI L+LKRIPDR  AV++V+ RTLKLLFWGILLQG +SHAPDELTYGVD++ +
Sbjct: 91  FLFIVGMAIPLSLKRIPDRGRAVRRVVIRTLKLLFWGILLQGRYSHAPDELTYGVDMKHV 150

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R  G+LQRIAL+YL+V+++EI TKD + +DQS   FSIFR+Y   W++A C+LV+YLAL+
Sbjct: 151 RWGGILQRIALAYLVVAVLEIVTKDAKIQDQSSSGFSIFRMYLSQWIVACCILVIYLALV 210

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
           YG YVPDW+F + N DS +YGKV  VTCG R  L+PPCNAVGYIDRKVLGINHMY  PAW
Sbjct: 211 YGIYVPDWEFRVRNVDSPNYGKVLTVTCGTRGILDPPCNAVGYIDRKVLGINHMYQKPAW 270

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL 319
           RR +ACT DSP EG  R DAP+WC APFEPEG+LSS+S++LSTIIGVH+GHV++H K H 
Sbjct: 271 RRHRACTDDSPHEGHFRNDAPAWCVAPFEPEGILSSLSAVLSTIIGVHYGHVLVHMKSHT 330

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            RL+QWVTMG  LL+ G+ LHF++AIPLNKQLYT SY+CVT+GAA +VFS +Y LVD
Sbjct: 331 DRLRQWVTMGICLLVLGIILHFSHAIPLNKQLYTFSYICVTAGAAGVVFSVLYFLVD 387


>gi|449454063|ref|XP_004144775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
 gi|449490878|ref|XP_004158735.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 490

 Score =  535 bits (1378), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 266/376 (70%), Positives = 309/376 (82%), Gaps = 1/376 (0%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
           M EIK ++T HHP  +   D SD      +K++RLASLDIFRGL VALMILVD AGG+WP
Sbjct: 22  MEEIKPDSTSHHPHRLISVD-SDALLPKPVKSKRLASLDIFRGLTVALMILVDDAGGEWP 80

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120
            I HAPW GCNLADFVMPFFLFIVG+AIALALKRIP++  A++KV  RTLKLLFWG+LLQ
Sbjct: 81  MIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQ 140

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
           GG+SHAPD+LTYGVDVR IRL G+LQRIAL+YL+V+ VE+ ++  Q   Q    FSIF+ 
Sbjct: 141 GGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRKTQSNVQPFNHFSIFKS 200

Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
           Y W+WL+ AC+LVVY ALLYG YVPDWQFT+ + +S  YG+ F V CGVR  L+PPCNAV
Sbjct: 201 YFWNWLVGACILVVYFALLYGIYVPDWQFTVTDSESVYYGRNFTVACGVRGNLDPPCNAV 260

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
           GYIDRKVLGINH+Y HPAWRRS+ACT++SP+ G  R +APSWC APFEPEG+LSS+S+IL
Sbjct: 261 GYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISAIL 320

Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           STIIGVHFGHV+IH + H ARLKQWVTMGF LLI GL LHFT+AIPLNKQLYT SYVCVT
Sbjct: 321 STIIGVHFGHVLIHFQDHSARLKQWVTMGFTLLILGLVLHFTHAIPLNKQLYTFSYVCVT 380

Query: 361 SGAAALVFSAIYALVD 376
           SGAAALVFS  Y LVD
Sbjct: 381 SGAAALVFSVFYTLVD 396


>gi|326493552|dbj|BAJ85237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326511587|dbj|BAJ91938.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 486

 Score =  528 bits (1361), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 250/376 (66%), Positives = 307/376 (81%), Gaps = 6/376 (1%)

Query: 7   ETTHHHPLIISEPDVSDQQEKS-----HLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
           E    H    +  D+ D  EK         ++R+ASLD+FRGL VALMILVD AGG+WP 
Sbjct: 17  EDPDRHRTHEAADDLDDDGEKKASRPSSSSSRRVASLDVFRGLTVALMILVDGAGGEWPV 76

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQG 121
           I HAPW+GCNLADFVMPFFLFIVG+AI L+LKRIPDR  AV++V+ RTLKLLFWGILLQG
Sbjct: 77  IGHAPWHGCNLADFVMPFFLFIVGMAIPLSLKRIPDRGWAVRRVVIRTLKLLFWGILLQG 136

Query: 122 GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV-GRFSIFRL 180
           G+SHAPDELTYGVD++ IR CG+LQRIAL+YL+V+++EI TKD + +DQS  G FS+FRL
Sbjct: 137 GYSHAPDELTYGVDMKHIRWCGILQRIALAYLVVAVIEIATKDARVQDQSSSGFFSVFRL 196

Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
           Y   W++A C+L++YL+L+YG YVPDW+FT+ N DS +YGKV  VTCG R  L+PPCNAV
Sbjct: 197 YLSQWIVACCILLIYLSLVYGVYVPDWEFTVRNVDSPNYGKVLTVTCGTRGNLSPPCNAV 256

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
           GYIDRKVLGINH+Y  PAWRR + CT DSP EGP ++DAP+WC +PFEPEGLLSS S++L
Sbjct: 257 GYIDRKVLGINHLYQKPAWRRHRDCTDDSPHEGPFKRDAPAWCASPFEPEGLLSSFSAVL 316

Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           STIIGVH+GHV++H K H+ RLKQWVTMG ALL+ G+ LHF++AIPLNKQLYTLSY+CVT
Sbjct: 317 STIIGVHYGHVLVHMKSHMDRLKQWVTMGVALLLLGIILHFSHAIPLNKQLYTLSYICVT 376

Query: 361 SGAAALVFSAIYALVD 376
           +GAA ++FS +Y LVD
Sbjct: 377 AGAAGIIFSMLYFLVD 392


>gi|357126662|ref|XP_003565006.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Brachypodium distachyon]
          Length = 485

 Score =  518 bits (1333), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 243/351 (69%), Positives = 296/351 (84%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           +++R+ASLD+FRGL VALMILVD AGG+WP I HAPW+GCNLADFVMPFFLFIVG+AI L
Sbjct: 46  RSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWDGCNLADFVMPFFLFIVGMAIPL 105

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           +LKRIPDR  AV++V+ RTLKLLFWGILLQGG+SHAPDEL YGVD++ IR CG+LQRIA 
Sbjct: 106 SLKRIPDRGRAVRRVVIRTLKLLFWGILLQGGYSHAPDELAYGVDMKHIRWCGILQRIAF 165

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YL+V+++EI TKD   +DQS   FSIFR+Y   W++A C+L++YL+L+YG YVPDW+F 
Sbjct: 166 AYLVVAVIEIATKDANIQDQSSSGFSIFRMYFSQWIVACCILLIYLSLVYGIYVPDWEFR 225

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           + N DS +YGKV  VTCG R KL+PPCNAVGYIDRKVLGINH+Y  PAWRR +ACT DSP
Sbjct: 226 VRNVDSPNYGKVLTVTCGTRGKLSPPCNAVGYIDRKVLGINHLYQKPAWRRHRACTDDSP 285

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
            EGP + DAP+WC +PFEPEGLLSS S++LSTIIGVH+GHV++H K H+ RLKQWVTMG 
Sbjct: 286 HEGPFKSDAPAWCASPFEPEGLLSSFSAVLSTIIGVHYGHVLVHMKSHMDRLKQWVTMGI 345

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
           ALL+ G+ LHF++AIPLNKQLYT SY+CVT+GAA +VFS +Y LVD    G
Sbjct: 346 ALLLLGIILHFSHAIPLNKQLYTFSYICVTAGAAGIVFSMLYFLVDMVSLG 396


>gi|212723192|ref|NP_001131974.1| uncharacterized protein LOC100193372 [Zea mays]
 gi|194693076|gb|ACF80622.1| unknown [Zea mays]
 gi|413951397|gb|AFW84046.1| hypothetical protein ZEAMMB73_047978 [Zea mays]
          Length = 484

 Score =  514 bits (1323), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 247/363 (68%), Positives = 301/363 (82%), Gaps = 6/363 (1%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D +D  EK+   ++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPF
Sbjct: 28  DEADANEKAPRPSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPF 87

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           FLFIVG+A+ LALKRIPDR  AV++V+ RTLKLLFWGILLQGG+SHAPDEL YGVD+R +
Sbjct: 88  FLFIVGMAVPLALKRIPDRGRAVRRVVVRTLKLLFWGILLQGGYSHAPDELAYGVDMRHV 147

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQD--KDQ---SVGRFS-IFRLYCWHWLMAACVLV 193
           R  G+LQRIAL+YL+V+++E+ TKD     +DQ   S GRFS +FR+Y   W++A C+LV
Sbjct: 148 RWGGILQRIALAYLVVAVLEMVTKDGAKVHQDQPPGSSGRFSRVFRMYLSQWIVACCILV 207

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           VYL+L YG YVPDW+F + N DS DYGKV  V CG R  L+PPCNAVGYIDR+VLGINHM
Sbjct: 208 VYLSLAYGVYVPDWEFRVRNADSPDYGKVLTVRCGTRGALDPPCNAVGYIDRRVLGINHM 267

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y  PAWRR +ACT DSP EGP R+DAP+WC APFEPEG+LSS+S++LST++GVH+GHV++
Sbjct: 268 YQKPAWRRHRACTDDSPHEGPFREDAPAWCVAPFEPEGILSSLSAVLSTVVGVHYGHVLV 327

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H K H  RL+QWVTMG ALL+ G+ LHF++AIPLNKQLYT SY+CVT+GAA +VFSA+Y 
Sbjct: 328 HMKSHTDRLRQWVTMGVALLVLGIILHFSHAIPLNKQLYTFSYICVTAGAAGVVFSALYF 387

Query: 374 LVD 376
           LVD
Sbjct: 388 LVD 390


>gi|296089693|emb|CBI39512.3| unnamed protein product [Vitis vinifera]
          Length = 481

 Score =  512 bits (1319), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 258/367 (70%), Positives = 293/367 (79%), Gaps = 20/367 (5%)

Query: 10  HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
           + H LIIS+     ++     KT+RLASLDIFRGL VALMILVD AGG+WP I HAPWNG
Sbjct: 41  NQHRLIISDSGFPPEERPQ--KTKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWNG 98

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           CNLADFVMPFFLFIVGVAIALALKRIPDR  A+KKV  RTLKLLFWG+LLQG F+  PD+
Sbjct: 99  CNLADFVMPFFLFIVGVAIALALKRIPDRLMAIKKVTLRTLKLLFWGLLLQGSFTQDPDK 158

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
           LTYGVD++ IR CG+LQ                   Q KD S G+FSIF+LYCWHWLM A
Sbjct: 159 LTYGVDMKKIRWCGILQ------------------AQAKDLSPGQFSIFKLYCWHWLMGA 200

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
           CVL+VY+A+ YGTYVPDW FT+ ++DSADYGKV  V CG R KL+PPCN VGYIDR++LG
Sbjct: 201 CVLIVYMAVSYGTYVPDWHFTVHDRDSADYGKVLTVACGARGKLDPPCNVVGYIDREILG 260

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
           +NHMY HPAW RSKAC + SP +GP RKDAPSWC+APFEPEG+LSS+S+ILSTIIGVHFG
Sbjct: 261 MNHMYQHPAWTRSKACNEYSPDKGPFRKDAPSWCYAPFEPEGILSSISAILSTIIGVHFG 320

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           HV++H KGH  RLK WV MGFALL+ G+TLHFT AIPLNKQLYT SYVCVTSGAAALVFS
Sbjct: 321 HVLMHLKGHSDRLKHWVVMGFALLVLGITLHFTGAIPLNKQLYTFSYVCVTSGAAALVFS 380

Query: 370 AIYALVD 376
             Y LVD
Sbjct: 381 FFYILVD 387


>gi|356572978|ref|XP_003554642.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 464

 Score =  512 bits (1318), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 268/377 (71%), Positives = 313/377 (83%), Gaps = 8/377 (2%)

Query: 1   MSEIKAETTHHHPLIISEP-DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW 59
           M+EIK E    H L +SE   +SD   K+  KT+R+ASLDIFRGL VALMILVD AGG W
Sbjct: 1   MAEIKGE----HSLNVSEELPLSD---KNLPKTKRVASLDIFRGLTVALMILVDDAGGQW 53

Query: 60  PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILL 119
           P I HAPWNGCNLADFVMPFFLFIVG+AI LALKRIP+R  AVKKVI RTLKLLFWG+LL
Sbjct: 54  PMIGHAPWNGCNLADFVMPFFLFIVGMAIPLALKRIPNRLLAVKKVIVRTLKLLFWGLLL 113

Query: 120 QGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR 179
           QGGFSHAPD LTYGVD++ IR CG+LQRIAL+YL+V+LVEIF++  Q +D      SIF+
Sbjct: 114 QGGFSHAPDNLTYGVDMKHIRWCGILQRIALAYLVVALVEIFSRSAQARDPEPTHLSIFK 173

Query: 180 LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNA 239
           LY WHWL+ AC+L VYLALLYG +VPDWQFT+ N DS   G    VTCGVR KL+PPCNA
Sbjct: 174 LYYWHWLVGACILAVYLALLYGIHVPDWQFTVHNPDSIYNGTTLTVTCGVRGKLDPPCNA 233

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
           VGYIDR+V+GINHMY  PAWRRS+ACT++SP+EGP +K+APSWC+APFEPEG+LSS+S+I
Sbjct: 234 VGYIDREVIGINHMYKRPAWRRSEACTENSPYEGPFKKNAPSWCYAPFEPEGILSSISAI 293

Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           LSTIIG+HFGHV+IH + H +RLK W+ +G ALL  GL LHFT+AIPLNKQLYTLSYVCV
Sbjct: 294 LSTIIGLHFGHVLIHLQDHPSRLKHWLLLGLALLTSGLILHFTHAIPLNKQLYTLSYVCV 353

Query: 360 TSGAAALVFSAIYALVD 376
           TSGAAAL+FSA Y +VD
Sbjct: 354 TSGAAALLFSAFYIMVD 370


>gi|356504028|ref|XP_003520801.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 465

 Score =  511 bits (1316), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 269/378 (71%), Positives = 312/378 (82%), Gaps = 9/378 (2%)

Query: 1   MSEIKAETTHHHPLIISE--PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD 58
           M+EIK E    H L +S+  P+VSD   K+  KT+R+ASLDIFRGL VALMILVD AG  
Sbjct: 1   MAEIKGE----HSLNVSQELPEVSD---KNLPKTKRVASLDIFRGLTVALMILVDDAGEQ 53

Query: 59  WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGIL 118
           WP I HAPWNGCNLADFVMPFFLFIVG+AI LALKRIP+R  AVKKVI RTLKLLFWG+L
Sbjct: 54  WPMIGHAPWNGCNLADFVMPFFLFIVGMAIPLALKRIPNRLLAVKKVIVRTLKLLFWGLL 113

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
           LQGGFSHAPD LTYGVD++ IR CG+LQRIAL+YL+V+LVEIF++  Q +D      SIF
Sbjct: 114 LQGGFSHAPDNLTYGVDMKHIRWCGILQRIALAYLVVALVEIFSRSTQARDPEPTHLSIF 173

Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
            LY WHWL+ AC+LVVYLALLYG +VPDW FT+ N DS   G    VTCGVR KL+PPCN
Sbjct: 174 NLYYWHWLVGACILVVYLALLYGIHVPDWGFTVHNPDSIYNGTTLTVTCGVRGKLDPPCN 233

Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
           AVGYIDR+VLGINHMY  PAWRRS+ACT++SP+EGP +K+APSWC+APFEPEG+LSS+S+
Sbjct: 234 AVGYIDREVLGINHMYKRPAWRRSEACTENSPYEGPFKKNAPSWCYAPFEPEGILSSISA 293

Query: 299 ILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
           ILSTIIG+HFGHV+IH + H +RLK W+ +G ALL  GL LHFT+AIPLNKQLYTLSYVC
Sbjct: 294 ILSTIIGLHFGHVLIHLQDHPSRLKHWLLLGLALLTSGLILHFTHAIPLNKQLYTLSYVC 353

Query: 359 VTSGAAALVFSAIYALVD 376
           VTSGAAAL+FSA Y  VD
Sbjct: 354 VTSGAAALLFSAFYITVD 371


>gi|413951398|gb|AFW84047.1| hypothetical protein ZEAMMB73_047978 [Zea mays]
          Length = 503

 Score =  502 bits (1292), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 247/382 (64%), Positives = 301/382 (78%), Gaps = 25/382 (6%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D +D  EK+   ++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPF
Sbjct: 28  DEADANEKAPRPSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPF 87

Query: 80  FLFIVGVAIALALK-------------------RIPDRADAVKKVIFRTLKLLFWGILLQ 120
           FLFIVG+A+ LALK                   RIPDR  AV++V+ RTLKLLFWGILLQ
Sbjct: 88  FLFIVGMAVPLALKVRRRRRSSRPSVVHAMHAHRIPDRGRAVRRVVVRTLKLLFWGILLQ 147

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD--KDQ---SVGRF 175
           GG+SHAPDEL YGVD+R +R  G+LQRIAL+YL+V+++E+ TKD     +DQ   S GRF
Sbjct: 148 GGYSHAPDELAYGVDMRHVRWGGILQRIALAYLVVAVLEMVTKDGAKVHQDQPPGSSGRF 207

Query: 176 S-IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
           S +FR+Y   W++A C+LVVYL+L YG YVPDW+F + N DS DYGKV  V CG R  L+
Sbjct: 208 SRVFRMYLSQWIVACCILVVYLSLAYGVYVPDWEFRVRNADSPDYGKVLTVRCGTRGALD 267

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
           PPCNAVGYIDR+VLGINHMY  PAWRR +ACT DSP EGP R+DAP+WC APFEPEG+LS
Sbjct: 268 PPCNAVGYIDRRVLGINHMYQKPAWRRHRACTDDSPHEGPFREDAPAWCVAPFEPEGILS 327

Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTL 354
           S+S++LST++GVH+GHV++H K H  RL+QWVTMG ALL+ G+ LHF++AIPLNKQLYT 
Sbjct: 328 SLSAVLSTVVGVHYGHVLVHMKSHTDRLRQWVTMGVALLVLGIILHFSHAIPLNKQLYTF 387

Query: 355 SYVCVTSGAAALVFSAIYALVD 376
           SY+CVT+GAA +VFSA+Y LVD
Sbjct: 388 SYICVTAGAAGVVFSALYFLVD 409


>gi|222619812|gb|EEE55944.1| hypothetical protein OsJ_04649 [Oryza sativa Japonica Group]
          Length = 846

 Score =  496 bits (1277), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 238/345 (68%), Positives = 292/345 (84%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           ++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPFFLFIVG+AI L+
Sbjct: 408 SRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAIPLS 467

Query: 92  LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALS 151
           LKRIPDR  AV++V+ RTLKLLFWGILLQGG+SHAPD+L+YGVD++ +R CG+LQRIAL+
Sbjct: 468 LKRIPDRGRAVRRVVLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHVRWCGILQRIALA 527

Query: 152 YLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI 211
           YL+V+++EI TK+ + +DQS   FSIFR+Y   W++A C+LV+YL+L+YG YVPDW F +
Sbjct: 528 YLVVAVLEIVTKNAKVQDQSSSGFSIFRMYFSQWIVACCILVIYLSLVYGIYVPDWDFRV 587

Query: 212 INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPF 271
            +  + ++GK+  VTCG R KL+PPCNAVGYIDRKVLGINHMYH PAWRR K CT DSP 
Sbjct: 588 SDVKNPNFGKILTVTCGTRGKLSPPCNAVGYIDRKVLGINHMYHRPAWRRHKDCTDDSPH 647

Query: 272 EGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA 331
           EGP + D+P+WC+APFEPEGLLSS+S++LSTIIGVH+GHV++H K H  RLKQW  MG  
Sbjct: 648 EGPFKTDSPAWCYAPFEPEGLLSSLSAVLSTIIGVHYGHVLVHMKSHTDRLKQWSIMGIT 707

Query: 332 LLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           LLI GLTLHF++AIPLNKQLYT SY+CVT+GAA +VF   Y LVD
Sbjct: 708 LLILGLTLHFSHAIPLNKQLYTFSYICVTAGAAGIVFCMFYFLVD 752


>gi|115442029|ref|NP_001045294.1| Os01g0931100 [Oryza sativa Japonica Group]
 gi|57899654|dbj|BAD87323.1| unknown protein [Oryza sativa Japonica Group]
 gi|57900117|dbj|BAD88179.1| unknown protein [Oryza sativa Japonica Group]
 gi|113534825|dbj|BAF07208.1| Os01g0931100 [Oryza sativa Japonica Group]
 gi|215697092|dbj|BAG91086.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 488

 Score =  489 bits (1260), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 238/345 (68%), Positives = 292/345 (84%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           ++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPFFLFIVG+AI L+
Sbjct: 50  SRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAIPLS 109

Query: 92  LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALS 151
           LKRIPDR  AV++V+ RTLKLLFWGILLQGG+SHAPD+L+YGVD++ +R CG+LQRIAL+
Sbjct: 110 LKRIPDRGRAVRRVVLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHVRWCGILQRIALA 169

Query: 152 YLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI 211
           YL+V+++EI TK+ + +DQS   FSIFR+Y   W++A C+LV+YL+L+YG YVPDW F +
Sbjct: 170 YLVVAVLEIVTKNAKVQDQSSSGFSIFRMYFSQWIVACCILVIYLSLVYGIYVPDWDFRV 229

Query: 212 INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPF 271
            +  + ++GK+  VTCG R KL+PPCNAVGYIDRKVLGINHMYH PAWRR K CT DSP 
Sbjct: 230 SDVKNPNFGKILTVTCGTRGKLSPPCNAVGYIDRKVLGINHMYHRPAWRRHKDCTDDSPH 289

Query: 272 EGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA 331
           EGP + D+P+WC+APFEPEGLLSS+S++LSTIIGVH+GHV++H K H  RLKQW  MG  
Sbjct: 290 EGPFKTDSPAWCYAPFEPEGLLSSLSAVLSTIIGVHYGHVLVHMKSHTDRLKQWSIMGIT 349

Query: 332 LLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           LLI GLTLHF++AIPLNKQLYT SY+CVT+GAA +VF   Y LVD
Sbjct: 350 LLILGLTLHFSHAIPLNKQLYTFSYICVTAGAAGIVFCMFYFLVD 394


>gi|224123608|ref|XP_002330163.1| predicted protein [Populus trichocarpa]
 gi|222871619|gb|EEF08750.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  486 bits (1251), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 248/376 (65%), Positives = 300/376 (79%), Gaps = 2/376 (0%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
           M++IKA  ++   L+I+  D +          +R+ASLDIFRGL VALMILVD AGG+WP
Sbjct: 1   MADIKAYISYAKRLLIA--DGTHFSAPKPDPERRVASLDIFRGLTVALMILVDDAGGEWP 58

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120
           ++ HAPW+G NLADFVMPFFLFIVG+AI L  K I  R  AVKK+I RTLKLLFWGI+LQ
Sbjct: 59  KMGHAPWHGSNLADFVMPFFLFIVGMAIPLTFKGITSRDHAVKKMIVRTLKLLFWGIMLQ 118

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
           GGFSHAPD+L+YGVD++ IR CG+LQRIA +YL+++L+EIFTK  Q KD   GR SIFRL
Sbjct: 119 GGFSHAPDKLSYGVDMKKIRWCGILQRIAFAYLVMALMEIFTKKDQTKDLPPGRLSIFRL 178

Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
           Y   WL+ AC+LVVYLA++YG YVP WQFT+ +++S+DYGKVF V C VR KL+P CNA+
Sbjct: 179 YGSQWLVGACILVVYLAVIYGMYVPHWQFTVNDEESSDYGKVFTVECAVRGKLDPACNAI 238

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
            YIDRK+LGINH+Y HPAW+RS+ACT+ S +E P +  AP+WC APFEP+G+LSS+SS+L
Sbjct: 239 AYIDRKILGINHLYQHPAWKRSEACTEASLYEAPFQTSAPTWCKAPFEPDGILSSISSVL 298

Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           STI G HFGHV +H KG  ARLK W  MG ALLI GL LHFT+A+PLNKQLYT SYVCVT
Sbjct: 299 STITGAHFGHVHVHLKGDTARLKHWTVMGLALLILGLVLHFTHAMPLNKQLYTFSYVCVT 358

Query: 361 SGAAALVFSAIYALVD 376
           SGAAALVFSAIY LVD
Sbjct: 359 SGAAALVFSAIYILVD 374


>gi|357511851|ref|XP_003626214.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
 gi|355501229|gb|AES82432.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
          Length = 483

 Score =  482 bits (1241), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 255/393 (64%), Positives = 306/393 (77%), Gaps = 21/393 (5%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVA------------- 47
           M EI  E + H   ++SE +    +E    K +R+ASLDIFRGL VA             
Sbjct: 1   MEEIIGEHSVH---VVSEVEPVSAKELPK-KVKRVASLDIFRGLTVADGDLTVFVAVKYR 56

Query: 48  ---LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKK 104
              LMILVD AGG+WP I HAPWNGCNLADFVMPFFLFIVG+AI L+LK+IP++  AVKK
Sbjct: 57  AKQLMILVDDAGGEWPAIGHAPWNGCNLADFVMPFFLFIVGMAIPLSLKKIPNKLLAVKK 116

Query: 105 VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD 164
           VI RTLKLLFWG+LLQGG+SHAPD L+YGVD++ IR CG+LQRIAL+YL+V+LVEI ++ 
Sbjct: 117 VIVRTLKLLFWGLLLQGGYSHAPDHLSYGVDMKHIRWCGILQRIALAYLVVALVEIISRS 176

Query: 165 VQDKDQ-SVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVF 223
            QD+D       SIF LY WHWL+AAC+LVVY+ LLYG +VPDWQFT+ N DS   G  F
Sbjct: 177 RQDRDDPEPTNLSIFTLYYWHWLVAACILVVYMPLLYGIHVPDWQFTVHNPDSIYNGTTF 236

Query: 224 NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC 283
            VTCGVR KL+PPCNAVGYIDR+VLGINH+Y  PA RRS+ACT   P+EGP +K AP+WC
Sbjct: 237 TVTCGVRGKLDPPCNAVGYIDREVLGINHVYKKPASRRSEACTVKPPYEGPFKKTAPAWC 296

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
           +APFEPEG+LSS+S+ILSTIIG+H+GHV+IH + HL+RLKQW+ +G ALL  G  LHF++
Sbjct: 297 YAPFEPEGILSSISAILSTIIGLHYGHVLIHLQDHLSRLKQWILLGLALLTLGFILHFSH 356

Query: 344 AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            IPLNKQLYTLSYVCVTSGAAALVFSA Y +VD
Sbjct: 357 VIPLNKQLYTLSYVCVTSGAAALVFSAFYVMVD 389


>gi|255556868|ref|XP_002519467.1| conserved hypothetical protein [Ricinus communis]
 gi|223541330|gb|EEF42881.1| conserved hypothetical protein [Ricinus communis]
          Length = 519

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 218/387 (56%), Positives = 270/387 (69%), Gaps = 18/387 (4%)

Query: 5   KAETTHHHPLIISEPDVSD-----QQEKSHL----------KTQRLASLDIFRGLAVALM 49
           K + TH    +I E +++      +QE   L          KT+R+A+LD FRGL V LM
Sbjct: 42  KLDKTHDGGGVIPEKELTSSTVLVEQEGEQLQQPEQLPVKQKTKRVATLDAFRGLTVVLM 101

Query: 50  ILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRT 109
           ILVD+AG  +  I H+PWNGC LADFVMPFFLFIVGVAIALALKRIP + DAVKK+  RT
Sbjct: 102 ILVDNAGESYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPRKRDAVKKISLRT 161

Query: 110 LKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD 169
           LKLLFWGILLQGG+SHAP +L+YGVD+++IR CG+LQRIAL Y+ V+L+E  T   +   
Sbjct: 162 LKLLFWGILLQGGYSHAPVDLSYGVDMKLIRWCGILQRIALVYMFVALIETLTIKERQTV 221

Query: 170 QSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGV 229
                FSIF  Y W W+      ++Y+   Y  YVPDW FT  + +       + V CG+
Sbjct: 222 LQPNHFSIFTAYRWQWIGGFIAFLIYMITTYALYVPDWSFTAYDDNRPTR---YTVKCGM 278

Query: 230 RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEP 289
           R  L P CNAVGY+DR+V GINH+Y +P W R KACT  SP  GPLR DAPSWC APFEP
Sbjct: 279 RGHLGPACNAVGYVDREVWGINHLYQYPVWSRLKACTFSSPATGPLRADAPSWCLAPFEP 338

Query: 290 EGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNK 349
           EGLLS++S+ILS  IG+H+GHV+IH KGH  RLKQWV+MG  L +  + LHFT+AIP+NK
Sbjct: 339 EGLLSTISAILSGTIGIHYGHVLIHFKGHSERLKQWVSMGLGLFLIAIILHFTDAIPINK 398

Query: 350 QLYTLSYVCVTSGAAALVFSAIYALVD 376
           QLY+ SYVC T+GAA +VFS  Y L+D
Sbjct: 399 QLYSFSYVCFTAGAAGIVFSGFYILID 425


>gi|356503734|ref|XP_003520659.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 508

 Score =  449 bits (1156), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 210/355 (59%), Positives = 260/355 (73%), Gaps = 2/355 (0%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
             +Q     KT+R+A+LD FRGL + LMILVD AG  +P I H+PWNGC LADFVMPFFL
Sbjct: 62  EQEQPPVKQKTKRVATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFL 121

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           FIVGVAIALALKRI     +VKK+I RTLKLLFWGI+LQGG+SHAPD+L YGV+++ IR 
Sbjct: 122 FIVGVAIALALKRISKIKHSVKKIILRTLKLLFWGIILQGGYSHAPDDLEYGVNMKFIRW 181

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
           CG+LQRIAL Y +V+L+E FT  ++    + G  SIF  Y W W       ++Y+   + 
Sbjct: 182 CGILQRIALVYCVVALIETFTTKLRPTTLASGHLSIFAAYKWQWFGGFVAFLIYMITTFS 241

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            YVPDW F  ++  + D  K + V CG+R  L P CNAVG++DR+V G+NH+Y  P WRR
Sbjct: 242 LYVPDWSF--VDHFNGDEPKRYTVICGMRGHLGPACNAVGHVDRQVWGVNHLYSQPVWRR 299

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
            KACT  SP  GP R DAPSWC APFEPEGLLSS+S+ILS  IG+H+GHV+IH KGH  R
Sbjct: 300 LKACTFSSPGSGPFRDDAPSWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSER 359

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           LKQWV+MGF LLI  + LHFT+A+P+NKQLY+ SYVC T+GAA +VFS  Y L+D
Sbjct: 360 LKQWVSMGFVLLIIAIILHFTDALPINKQLYSFSYVCFTAGAAGIVFSGFYILID 414


>gi|224069583|ref|XP_002326379.1| predicted protein [Populus trichocarpa]
 gi|222833572|gb|EEE72049.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 214/358 (59%), Positives = 264/358 (73%), Gaps = 7/358 (1%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
            D+Q     K++R+A+LD FRGL + LMILVD AGG +P I H+PWNGC LADFVMPFFL
Sbjct: 51  GDRQPVVKQKSKRVATLDAFRGLTIVLMILVDDAGGVYPRIDHSPWNGCTLADFVMPFFL 110

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           FIVGVAIALA KRIP R DAVKK+I RTLKLLFWG+LLQGG+SHAP +L YGVD+++IR 
Sbjct: 111 FIVGVAIALAFKRIPKRRDAVKKIILRTLKLLFWGVLLQGGYSHAPSDLAYGVDMKLIRW 170

Query: 142 CGVLQ-RIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
            G+LQ RIAL Y++V+L+E +  K+ Q  +     F+IF  Y W W+      V+Y+   
Sbjct: 171 FGILQQRIALVYMVVALIEALIPKNRQTIEPD--HFTIFTAYRWQWIAGFISFVIYMVTT 228

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
           +  YVPDW FT+   D     + + V CG+R  L P CNAVGY+DR+V GINH+Y +P W
Sbjct: 229 FALYVPDWSFTV---DEDHERRRYTVECGMRGHLGPACNAVGYVDREVWGINHLYQYPVW 285

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL 319
            R KACT  SP  GP RKDAPSWC APFEPEGLLSS+S+ILS  IG+H+GHV+IH KGH 
Sbjct: 286 SRLKACTLSSPGSGPFRKDAPSWCRAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHA 345

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
            RL+QWV+MG  LLI  + LHFT+AIP+NKQLY+ SYVC T+GAA +VFS  Y L+D 
Sbjct: 346 ERLRQWVSMGVILLIVAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVFSGFYVLIDV 403


>gi|359481929|ref|XP_002268831.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Vitis vinifera]
 gi|297739972|emb|CBI30154.3| unnamed protein product [Vitis vinifera]
          Length = 489

 Score =  444 bits (1141), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 208/355 (58%), Positives = 258/355 (72%), Gaps = 3/355 (0%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           ++Q     K++R+A+LD FRGL + LMILVD AGG +  I H+PWNGC LADFVMPFFLF
Sbjct: 45  EEQPLIKQKSKRVATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLF 104

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           IVGVA+ALALK+IP  + AVKK+  RTLKLLFWGILLQGG+SHAPD+L+YGVD++ IR  
Sbjct: 105 IVGVAVALALKKIPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWF 164

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           G+LQRIA+ Y +V+L+E  T   +      G FSI   Y W W+      ++Y+   Y  
Sbjct: 165 GILQRIAVVYFVVALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFVAFLIYMITTYAL 224

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           YVPDW F I   D     K + V CG+R  L P CNAVGY+DR+V GINH+Y  P W R 
Sbjct: 225 YVPDWSFVI---DQDHEAKRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYSQPVWTRL 281

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
           KACT  SP  GP R+DAPSWC+APFEPEGLLS++S+ILS  IG+H+GHV+IH KGH  RL
Sbjct: 282 KACTLSSPNSGPFREDAPSWCYAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHAERL 341

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           KQWV+MG  LLI  + LHFT+AIP+NKQLY+ SYVC T+GAA +V SA Y ++D 
Sbjct: 342 KQWVSMGIVLLIVAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVLSAFYLVIDV 396


>gi|356570776|ref|XP_003553560.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
           N-acetyltransferase-like [Glycine max]
          Length = 509

 Score =  440 bits (1132), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 205/355 (57%), Positives = 256/355 (72%), Gaps = 2/355 (0%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
             +Q     KT+R+A+LD FRGL + LMILVD AG  +P I H+PWNGC LADFVMPFFL
Sbjct: 63  EQEQPVVKQKTKRIATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFL 122

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           FIVG+AIALALKRI     AVKK+I RTLKLLFWGI+LQGG+SHAPD+L YGV+++ IR 
Sbjct: 123 FIVGIAIALALKRIAKIKHAVKKIILRTLKLLFWGIILQGGYSHAPDDLEYGVNMKFIRW 182

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
           CG+LQRIAL Y +V+L+E FT  ++    + G  SIF  Y W W       ++Y+   + 
Sbjct: 183 CGILQRIALVYCVVALIETFTTKLRPTTLASGHLSIFTAYKWQWFGGFVAFIIYMITTFT 242

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            YVP W F  ++  + D  K + V CG+R  L P CNAVG++DR+V G+NH+Y  P WRR
Sbjct: 243 LYVPHWSF--LDHFNGDEPKRYTVICGMRGHLGPACNAVGHVDRQVWGVNHLYSQPVWRR 300

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
            K     SP  GP R DAPSWC +PFEPEGLLSS+S+ILS  IG+H+GH++IH KGH  R
Sbjct: 301 LKMTIDYSPASGPFRDDAPSWCRSPFEPEGLLSSISAILSGTIGIHYGHILIHFKGHSER 360

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           LKQWV MGF LLI  + LHFT+A+P+NKQLY+ SYVC T+GAA +VFS +Y LVD
Sbjct: 361 LKQWVLMGFVLLIIAIILHFTDALPINKQLYSFSYVCFTAGAAGIVFSGLYILVD 415


>gi|356548323|ref|XP_003542552.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 419

 Score =  440 bits (1131), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 206/328 (62%), Positives = 245/328 (74%), Gaps = 3/328 (0%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
           M+LVD AGG +P I H+PWNGC LADFVMPFFLFIVGVAIALALKRIP    AVKK+I R
Sbjct: 1   MVLVDDAGGAYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVKYAVKKIILR 60

Query: 109 TLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
           TLKLLFWGILLQGG+SHAPD+L+YGVD+R IR CG+LQRIAL Y +V+L+E +T  ++  
Sbjct: 61  TLKLLFWGILLQGGYSHAPDDLSYGVDMRFIRWCGILQRIALVYCVVALIETYTTKLRPS 120

Query: 169 DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCG 228
               G  SIF  Y W WL      V+Y+  ++  YVPDW F   N D     K + V CG
Sbjct: 121 TLKPGHLSIFTAYRWQWLGGFVAFVIYMVTIFSLYVPDWSFVDYNSDKP---KRYTVECG 177

Query: 229 VRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFE 288
           +R  L P CNAVGY+DR+V G+NH+Y  P W R KACT  SP EGPLRK+AP+WC APFE
Sbjct: 178 MRGHLGPACNAVGYVDRQVWGVNHLYSQPVWTRLKACTLSSPAEGPLRKNAPAWCRAPFE 237

Query: 289 PEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLN 348
           PEG LSSV +ILS  IG+H+GHV+IH KGH  RLKQW++MGF LL  GL LHFT+AIP+N
Sbjct: 238 PEGFLSSVLAILSGTIGIHYGHVLIHFKGHFERLKQWLSMGFVLLTLGLILHFTDAIPIN 297

Query: 349 KQLYTLSYVCVTSGAAALVFSAIYALVD 376
           KQLY+ SYVC T+GAA +VFS  Y L+D
Sbjct: 298 KQLYSFSYVCFTAGAAGIVFSVFYLLID 325


>gi|147817637|emb|CAN64496.1| hypothetical protein VITISV_004036 [Vitis vinifera]
          Length = 511

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 209/377 (55%), Positives = 258/377 (68%), Gaps = 25/377 (6%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           ++Q     K++R+A+LD FRGL + LMILVD AGG +  I H+PWNGC LADFVMPFFLF
Sbjct: 45  EEQPLIKQKSKRVATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLF 104

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR-- 140
           IVGVA+ALALK+IP  + AVKK+  RTLKLLFWGILLQGG+SHAPD+L+YGVD++ IR  
Sbjct: 105 IVGVAVALALKKIPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWF 164

Query: 141 --------------------LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
                               L G LQRIA+ Y +V+L+E  T   +      G FSI   
Sbjct: 165 GILQVFPLPLFTGKSIPSSSLSGFLQRIAVVYFVVALIETLTTKRRPTVIDSGHFSILSA 224

Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
           Y W W+      ++Y+   Y  YVPDW F I   D     K + V CG+R  L P CNAV
Sbjct: 225 YKWQWIGGFVAFLIYMITTYALYVPDWSFVI---DQDHEAKRYTVKCGMRGHLGPACNAV 281

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
           GY+DR+V GINH+Y  P W R KACT  SP  GP R+DAPSWC+APFEPEGLLS++S+IL
Sbjct: 282 GYVDRQVWGINHLYSQPVWTRLKACTLSSPNSGPFREDAPSWCYAPFEPEGLLSTISAIL 341

Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           S  IG+H+GHV+IH KGH  RLKQWV+MG  LLI  + LHFT+AIP+NKQLY+ SYVC T
Sbjct: 342 SGTIGIHYGHVLIHFKGHAERLKQWVSMGIVLLIVAIILHFTDAIPINKQLYSFSYVCFT 401

Query: 361 SGAAALVFSAIYALVDC 377
           +GAA +V SA Y ++D 
Sbjct: 402 AGAAGIVXSAFYLVIDV 418


>gi|302754694|ref|XP_002960771.1| hypothetical protein SELMODRAFT_75452 [Selaginella moellendorffii]
 gi|300171710|gb|EFJ38310.1| hypothetical protein SELMODRAFT_75452 [Selaginella moellendorffii]
          Length = 493

 Score =  430 bits (1106), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 217/396 (54%), Positives = 280/396 (70%), Gaps = 18/396 (4%)

Query: 2   SEIKAETTHHHPLIISEPDVSDQQEKSHLKTQ----------RLASLDIFRGLAVALMIL 51
            +++   T  H +   E D   ++E    + Q          R+A+LD+FRGL VALM+L
Sbjct: 9   EKLEGHDTKEHKISFHEEDRHVKKESLLNEDQAVAVPLKKPVRIATLDVFRGLTVALMVL 68

Query: 52  VDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
           VD AGG+WP I+H+PWNGC LAD VMPFFLFIVGVAIALALKRIPD+  A +KV+ RTLK
Sbjct: 69  VDDAGGEWPRINHSPWNGCTLADLVMPFFLFIVGVAIALALKRIPDQVAATQKVVIRTLK 128

Query: 112 LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQS 171
           LLFWG+LLQGGFSHAPD+L+YGVD+R IR CG+LQRIA  YL+V+LVEI T   +  +  
Sbjct: 129 LLFWGLLLQGGFSHAPDDLSYGVDMRKIRWCGILQRIAFGYLIVALVEIATTKSRSLELP 188

Query: 172 VGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT------IINKDSADYGKVFNV 225
            G+F IF+LY WHW  A  V+++Y ++ YG YVPDW F       +++     +    NV
Sbjct: 189 KGQFGIFKLYKWHWACALAVVIIYHSVAYGLYVPDWHFIDSGHRFVVSLAKFVFSSQINV 248

Query: 226 TCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHA 285
            CGVR  + P CNAVG+IDR +LGINH+Y  P W R+++C  DSP EG    +AP+WC A
Sbjct: 249 QCGVRGDIGPACNAVGHIDRTILGINHLYQSPEWTRTQSCDLDSPAEGDPPANAPAWCKA 308

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PFEPEG+LSS+S+ILS IIG+H+GHV+IH KGH+ R+  W     ALL+    LHFT+AI
Sbjct: 309 PFEPEGILSSISAILSCIIGIHYGHVLIHFKGHMKRVLHWTIPAAALLVLATILHFTHAI 368

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
           PLNKQLY+ SYVC T+GAA ++FS +Y ++D  IFG
Sbjct: 369 PLNKQLYSFSYVCFTAGAAGMIFSLLYVVID--IFG 402


>gi|302804288|ref|XP_002983896.1| hypothetical protein SELMODRAFT_119497 [Selaginella moellendorffii]
 gi|300148248|gb|EFJ14908.1| hypothetical protein SELMODRAFT_119497 [Selaginella moellendorffii]
          Length = 493

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 212/357 (59%), Positives = 267/357 (74%), Gaps = 8/357 (2%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           K  R+A+LD+FRGL VALM+LVD AGG+WP I+H+PWNGC LAD VMPFFLFIVGVAIAL
Sbjct: 48  KPVRIATLDVFRGLTVALMVLVDDAGGEWPRINHSPWNGCTLADLVMPFFLFIVGVAIAL 107

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           ALKRIPD+  A +KV+ RTLKLLFWG+LLQGGFSHAPD+L+YGVD+R IR CG+LQRIA 
Sbjct: 108 ALKRIPDQVAATQKVVIRTLKLLFWGLLLQGGFSHAPDDLSYGVDMRKIRWCGILQRIAF 167

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
            YL+V+LVEI T   +  +   G F IF+LY WHW  A  V+++Y ++ YG YVPDW F 
Sbjct: 168 GYLIVALVEIATTKSRSLELPKGHFGIFKLYKWHWACALAVVIIYHSVAYGLYVPDWHFI 227

Query: 211 ------IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +++     +    NV CGVR  + P CNAVG+IDR +LGINH+Y  P W R+++
Sbjct: 228 DSGHRFVVSLAKFVFSSQINVQCGVRGDIGPACNAVGHIDRTILGINHLYQSPEWTRTQS 287

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
           C  DSP EG    +AP+WC APFEPEG+LSS+S+ILS IIG+H+GHV+IH KGH+ R+  
Sbjct: 288 CDLDSPAEGDPPANAPAWCKAPFEPEGILSSISAILSCIIGIHYGHVLIHFKGHMKRVLH 347

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
           W     ALL+    LHFT+AIPLNKQLY+ SYVC T+GAA ++FS +Y ++D  IFG
Sbjct: 348 WTIPAAALLVLATILHFTHAIPLNKQLYSFSYVCFTAGAAGMIFSLLYVVID--IFG 402


>gi|357152403|ref|XP_003576108.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Brachypodium distachyon]
          Length = 498

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 210/352 (59%), Positives = 260/352 (73%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
           +E    K+ R+A+LD FRGL + +MILVD AG  +  + H+PWNGC LADFVMPFFLFIV
Sbjct: 53  EEPQKKKSTRVAALDAFRGLTIVVMILVDDAGSSYERMDHSPWNGCTLADFVMPFFLFIV 112

Query: 85  GVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           GVAIA A+KR+P+   AVKKV  RTLK++FWG+LLQGG+SHAPD+L YGVD++MIR CG+
Sbjct: 113 GVAIAFAMKRVPNMGAAVKKVSVRTLKMIFWGLLLQGGYSHAPDDLAYGVDMKMIRWCGI 172

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIAL Y  V+L+E+FT  V+      G ++IF  Y W WL A  VLV+Y+   +  YV
Sbjct: 173 LQRIALVYFAVALIEVFTTKVRPTTVRSGPYAIFDAYRWQWLGAFIVLVIYMITTFSLYV 232

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
           PDW F   N    + GK F V CGVR  L+P CNAVG+IDR+V GINH+Y  P W R+K 
Sbjct: 233 PDWSFVYHNDGDINDGKRFTVQCGVRGHLDPACNAVGFIDRQVWGINHLYSQPVWIRTKD 292

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
           CT  SP  G LR DAP+WC  PFEPEGLLSS+SSI+S  IG+H+GHV+IH K H  RL  
Sbjct: 293 CTFSSPETGKLRDDAPAWCLGPFEPEGLLSSISSIISGTIGIHYGHVLIHFKTHKERLTH 352

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           W++MGFALL+ G+ LHFTNAIP+NKQLY+ SY+C T GAA +V SA YAL+D
Sbjct: 353 WLSMGFALLLLGILLHFTNAIPINKQLYSFSYICFTGGAAGIVLSAFYALID 404


>gi|242067981|ref|XP_002449267.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor]
 gi|241935110|gb|EES08255.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor]
          Length = 512

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 208/381 (54%), Positives = 260/381 (68%), Gaps = 20/381 (5%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           + +  V+  +E    K++R+A+LD FRGL + LMILVD AGG +  I H+PWNGC LADF
Sbjct: 38  VEKERVAVAEEVPKKKSRRVAALDAFRGLTIVLMILVDDAGGAYERIDHSPWNGCTLADF 97

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           VMPFFLFIVGVAIA ALKR+P+  +AVK++  RTLK+LFWG+LLQGG+SHAPD+L+YGVD
Sbjct: 98  VMPFFLFIVGVAIAFALKRVPNMGNAVKRITIRTLKMLFWGVLLQGGYSHAPDDLSYGVD 157

Query: 136 VRMIRLCGVLQ--------------------RIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
           ++ IR  G+LQ                    RIAL Y +V+L+E FT  V+      G +
Sbjct: 158 MKKIRWMGILQLYIYHGNNLDSFLFFTLGHQRIALVYFIVALIEAFTVKVRPTTVRSGPY 217

Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
           +IF  + W WL      V+Y+   +  YVPDW +   N    + GK F V CGVRA L  
Sbjct: 218 AIFNAHRWQWLGGFIAFVIYMVTTFSLYVPDWSYVYHNDGDVNDGKQFTVKCGVRASLEQ 277

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
            CNAVGY+DR+V GINH+Y  P W RSK CT  SP  GPLR DAP WC APFEPEGLLSS
Sbjct: 278 ACNAVGYVDRQVWGINHLYTQPVWIRSKDCTSSSPNMGPLRADAPEWCLAPFEPEGLLSS 337

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLS 355
           +SS+LS  IG+H+GHV+IH K H  RLK W+  GF+LL+  + LHFTNAIP+NKQLY+ S
Sbjct: 338 ISSVLSGTIGIHYGHVLIHFKTHKERLKHWLVTGFSLLVLAIILHFTNAIPINKQLYSFS 397

Query: 356 YVCVTSGAAALVFSAIYALVD 376
           YVC T GAA +V SA Y L+D
Sbjct: 398 YVCFTGGAAGIVLSAFYILID 418


>gi|449440411|ref|XP_004137978.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
 gi|449517341|ref|XP_004165704.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 488

 Score =  414 bits (1064), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 210/375 (56%), Positives = 261/375 (69%), Gaps = 16/375 (4%)

Query: 5   KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISH 64
           K E     P I+ E  +         KT+R+A+LD FRGL + LMILVD AGG +  I H
Sbjct: 33  KEEEKEVAPTIVEEAQLRQ-------KTKRVATLDAFRGLTIVLMILVDDAGGAYSRIDH 85

Query: 65  APWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFS 124
           +PWNGC LADFVMPFFLFIVGVAIALA KRI      V K+  RT+KL+FWG++LQGG+S
Sbjct: 86  SPWNGCTLADFVMPFFLFIVGVAIALAFKRIGSIKQGVMKISLRTIKLVFWGLILQGGYS 145

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV---GRFSIFRLY 181
           HAPD+L YGVD++ IR CG+LQRIAL Y +V+++E FT     K + V   G FSIF  Y
Sbjct: 146 HAPDDLEYGVDMKHIRWCGILQRIALVYFVVAMIEAFT--TIGKPRVVLDHGHFSIFTAY 203

Query: 182 CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
              W+      ++Y+   Y  YVP+W F+++  D   +   + V CGVR  L P CNAVG
Sbjct: 204 --RWIGGFAAFIIYIITTYALYVPNWSFSVLEDDQLLHH--YTVVCGVRGHLGPACNAVG 259

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
           ++DR+V GINH+Y +P W R K CT  +P EGPLR DA SWC APFEPEGLLSSVS+ILS
Sbjct: 260 HVDRQVWGINHLYSYPVWIRHKDCTFSAPDEGPLRDDAASWCLAPFEPEGLLSSVSAILS 319

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
             IG+H+GHV++H K H  RLKQWV+MGF   I G+ LHFTNAIP+NKQLY+LSYVC T+
Sbjct: 320 GTIGIHYGHVLLHFKTHSQRLKQWVSMGFGFFIIGIILHFTNAIPINKQLYSLSYVCFTA 379

Query: 362 GAAALVFSAIYALVD 376
           GAA +VFS  Y L+D
Sbjct: 380 GAAGIVFSGFYILID 394


>gi|242075654|ref|XP_002447763.1| hypothetical protein SORBIDRAFT_06g015200 [Sorghum bicolor]
 gi|241938946|gb|EES12091.1| hypothetical protein SORBIDRAFT_06g015200 [Sorghum bicolor]
          Length = 446

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 180/346 (52%), Positives = 240/346 (69%), Gaps = 6/346 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + QRL SLD+FRG+ V LMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+AL
Sbjct: 53  RPQRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALAL 112

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+PD+ DA KK + R LKL   G++LQGGF H    LT+GVD++ IRL G+LQRIA+
Sbjct: 113 AYKRVPDKLDATKKAVLRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAI 172

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YLL +L EI+ K  +D D     + + + Y +  L+ A V + Y+ LLYGTYVPDW++ 
Sbjct: 173 AYLLTALCEIWLKGDEDVDYG---YDLLKRYRYQLLVGAVVAITYMCLLYGTYVPDWEYQ 229

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
                S +  K F V CGVR   +P CNAVG IDRK+LGI H+Y  P + RSK C+ DSP
Sbjct: 230 TSGPGSIE--KSFFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSP 287

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ +GH+I+H + H  R+  W+   F
Sbjct: 288 QNGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHRERIMNWLIPSF 347

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ++L+    + F   + +NK LY+LSY   T+GAA L+FS IY LVD
Sbjct: 348 SMLVLAFAMDFFG-LHMNKPLYSLSYTLATAGAAGLLFSGIYTLVD 392


>gi|226509496|ref|NP_001144452.1| uncharacterized protein LOC100277415 [Zea mays]
 gi|195642330|gb|ACG40633.1| hypothetical protein [Zea mays]
          Length = 441

 Score =  370 bits (949), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 176/346 (50%), Positives = 235/346 (67%), Gaps = 4/346 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + QRLASLD+FRG+ V LMI+VD AGG  P ++H+PW+G  +ADF+MPFFLFIVGV++ L
Sbjct: 46  RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 105

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+PDR +A +K + R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA+
Sbjct: 106 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 165

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLYG YVPDW++ 
Sbjct: 166 AYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQ 222

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           I    S+   K F+V CGVR    P CNAVG +DR VLGI+H+Y  P + R+K C+ D P
Sbjct: 223 IAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYP 282

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ FGHVIIH + H  R+  W+   F
Sbjct: 283 ENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRITSWLVPSF 342

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ++L     + F   + +NK LYT+SY   T+GAA L+F+ IYALVD
Sbjct: 343 SMLALAFVMDFVG-MRMNKPLYTMSYTLATAGAAGLLFAGIYALVD 387


>gi|219885579|gb|ACL53164.1| unknown [Zea mays]
 gi|413937084|gb|AFW71635.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
          Length = 482

 Score =  369 bits (948), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 176/346 (50%), Positives = 235/346 (67%), Gaps = 4/346 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + QRLASLD+FRG+ V LMI+VD AGG  P ++H+PW+G  +ADF+MPFFLFIVGV++ L
Sbjct: 87  RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 146

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+PDR +A +K + R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA+
Sbjct: 147 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 206

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLYG YVPDW++ 
Sbjct: 207 AYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQ 263

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           I    S+   K F+V CGVR    P CNAVG +DR VLGI+H+Y  P + R+K C+ D P
Sbjct: 264 IAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYP 323

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ FGHVIIH + H  R+  W+   F
Sbjct: 324 ENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRIASWLVPSF 383

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ++L     + F   + +NK LYT+SY   T+GAA L+F+ IYALVD
Sbjct: 384 SMLALAFVMDFVG-MRMNKPLYTMSYTLATAGAAGLLFAGIYALVD 428


>gi|413937082|gb|AFW71633.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
          Length = 441

 Score =  369 bits (948), Expect = e-99,   Method: Compositional matrix adjust.
 Identities = 176/346 (50%), Positives = 235/346 (67%), Gaps = 4/346 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + QRLASLD+FRG+ V LMI+VD AGG  P ++H+PW+G  +ADF+MPFFLFIVGV++ L
Sbjct: 46  RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 105

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+PDR +A +K + R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA+
Sbjct: 106 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 165

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLYG YVPDW++ 
Sbjct: 166 AYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQ 222

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           I    S+   K F+V CGVR    P CNAVG +DR VLGI+H+Y  P + R+K C+ D P
Sbjct: 223 IAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYP 282

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ FGHVIIH + H  R+  W+   F
Sbjct: 283 ENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRIASWLVPSF 342

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ++L     + F   + +NK LYT+SY   T+GAA L+F+ IYALVD
Sbjct: 343 SMLALAFVMDFVG-MRMNKPLYTMSYTLATAGAAGLLFAGIYALVD 387


>gi|224033113|gb|ACN35632.1| unknown [Zea mays]
 gi|413918233|gb|AFW58165.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
          Length = 444

 Score =  365 bits (936), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 178/344 (51%), Positives = 238/344 (69%), Gaps = 6/344 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+ALA 
Sbjct: 53  QRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 112

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+PD+ DA +K + R LKL   G++LQGGF H    L++GVD++ IRL GVLQRIA++Y
Sbjct: 113 KRVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAY 172

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           LL +L EI+ +  +D D     + + + Y +   + A V + Y++LLYGTYVPDW++   
Sbjct: 173 LLTALCEIWIRGDEDVDYG---YDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTS 229

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
              S +  K   V CGVR   +P CNAVG IDRK+LGI H+Y  P + RSK C+ DSP  
Sbjct: 230 APGSTE--KHLFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQN 287

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+  W+   F++
Sbjct: 288 GPLPSDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSM 347

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L+    + F   + +NK LYTLSY   T+GAA L+FS IY LVD
Sbjct: 348 LVLAFAMDFFG-LHMNKPLYTLSYTLGTAGAAGLLFSGIYTLVD 390


>gi|226494648|ref|NP_001146383.1| uncharacterized protein LOC100279961 [Zea mays]
 gi|219886923|gb|ACL53836.1| unknown [Zea mays]
 gi|413918231|gb|AFW58163.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
          Length = 469

 Score =  365 bits (936), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 178/344 (51%), Positives = 238/344 (69%), Gaps = 6/344 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+ALA 
Sbjct: 78  QRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 137

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+PD+ DA +K + R LKL   G++LQGGF H    L++GVD++ IRL GVLQRIA++Y
Sbjct: 138 KRVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAY 197

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           LL +L EI+ +  +D D     + + + Y +   + A V + Y++LLYGTYVPDW++   
Sbjct: 198 LLTALCEIWIRGDEDVDYG---YDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTS 254

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
              S +  K   V CGVR   +P CNAVG IDRK+LGI H+Y  P + RSK C+ DSP  
Sbjct: 255 APGSTE--KHLFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQN 312

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+  W+   F++
Sbjct: 313 GPLPSDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSM 372

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L+    + F   + +NK LYTLSY   T+GAA L+FS IY LVD
Sbjct: 373 LVLAFAMDFFG-LHMNKPLYTLSYTLGTAGAAGLLFSGIYTLVD 415


>gi|219886509|gb|ACL53629.1| unknown [Zea mays]
 gi|414587417|tpg|DAA37988.1| TPA: hypothetical protein ZEAMMB73_167983 [Zea mays]
          Length = 438

 Score =  365 bits (936), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 178/344 (51%), Positives = 237/344 (68%), Gaps = 6/344 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+ALA 
Sbjct: 47  QRLVSLDVFRGITVLLMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 106

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+PD+ DA KK + R LKL   G++LQGGF H    LT+GVD++ IRL G+LQRIA++Y
Sbjct: 107 KRVPDKLDATKKAVLRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAY 166

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           LL +L EI+ K  +D D     + + + Y +   + A V + Y++LLYGTYV DW++   
Sbjct: 167 LLTALCEIWLKGDEDVDYG---YDLLKRYRYQLFVGAIVGITYMSLLYGTYVRDWEYQTS 223

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
              S +  K F V CGVR   +P CNAVG IDR++LGI H+Y  P + RSK C+ DSP  
Sbjct: 224 GPGSIE--KSFFVKCGVRGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQN 281

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+  W+   F++
Sbjct: 282 GPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERIMNWLIPSFSM 341

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L+    + F   + +NK LYTLSY   T+GAA L+F  IY LVD
Sbjct: 342 LVLAFAMDFL-GLRMNKPLYTLSYTLATAGAAGLLFCGIYTLVD 384


>gi|359473865|ref|XP_002275105.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Vitis vinifera]
 gi|296085565|emb|CBI29297.3| unnamed protein product [Vitis vinifera]
          Length = 444

 Score =  363 bits (932), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 177/344 (51%), Positives = 239/344 (69%), Gaps = 4/344 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA+MILVD AGG  P I+H+PWNG  LADFVMPFFLFIVGV++ALA 
Sbjct: 51  RRLVSLDVFRGLTVAIMILVDDAGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALAY 110

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +     A K  + R LKLL +G+ LQGG+ H  + LTYGVD+  IRL G+LQRIA++Y
Sbjct: 111 KNLSSGYLATKMAVVRALKLLVFGLFLQGGYFHGLNNLTYGVDIEQIRLAGILQRIAVAY 170

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
            L ++ EI+ K   D +   G  S+ + Y + W +   + V Y +LLYG YVPDW+++I 
Sbjct: 171 FLAAVCEIWLKG--DSNVKSGS-SLLKKYQFQWAVVLVLTVAYCSLLYGLYVPDWEYSIP 227

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
           ++ S+   K+F V CGVR+   P CNAVG IDR VLGI H+Y  P + R K C+ +SP  
Sbjct: 228 SETSSSALKIFKVKCGVRSDTGPACNAVGMIDRNVLGIQHLYKRPIYARMKQCSINSPDY 287

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPL  +AP+WC APF+PEGLLSSV +I++ ++G+H+GH+I+H K H  R+  W+     L
Sbjct: 288 GPLPPNAPTWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHKDRILHWIVPSSCL 347

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L+ G  L F   + +NK LYTLSY+CVT+GAA ++F+ IY +VD
Sbjct: 348 LVLGFALDFF-GMHVNKALYTLSYMCVTAGAAGILFAGIYLMVD 390


>gi|115485801|ref|NP_001068044.1| Os11g0543500 [Oryza sativa Japonica Group]
 gi|77551354|gb|ABA94151.1| expressed protein [Oryza sativa Japonica Group]
 gi|113645266|dbj|BAF28407.1| Os11g0543500 [Oryza sativa Japonica Group]
 gi|125577433|gb|EAZ18655.1| hypothetical protein OsJ_34172 [Oryza sativa Japonica Group]
 gi|215701389|dbj|BAG92813.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 448

 Score =  363 bits (931), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 179/354 (50%), Positives = 235/354 (66%), Gaps = 3/354 (0%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           D +  +    QRL SLD+FRG+ VALMILVD  GG  P ISH+PW+G  LADFV PFFLF
Sbjct: 44  DGEAAATTTRQRLVSLDVFRGITVALMILVDDVGGIVPAISHSPWDGVTLADFVFPFFLF 103

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           IVGV++A A K++PD+  A KK + R +KL   G++LQGGF H   ELTYGVD+R IRL 
Sbjct: 104 IVGVSLAFAYKKVPDKMLATKKAMLRAVKLFIVGLILQGGFFHGIHELTYGVDIRKIRLM 163

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIA++YL+V+L EI+ + V           +   Y     +   ++V YL +LYG 
Sbjct: 164 GVLQRIAIAYLVVALCEIWLRRVSSGGNIGSGSMLITRYHHQMFVGLVLVVTYLVILYGL 223

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           +VPDW++ + + DS    K F V CGV+    P CNAVG IDR VLGI H+Y HP + ++
Sbjct: 224 HVPDWEYEVTSPDSTV--KHFLVKCGVKGDTGPGCNAVGMIDRSVLGIQHLYAHPVYLKT 281

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
           + C+  SP  GPL  +APSWC APF+PEGLLSS+ +I++ +IG+  GHVI+H K H  R+
Sbjct: 282 EQCSMASPRNGPLPPNAPSWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKKHNERI 341

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           K+W  +   LL  G +LH    + +NK LY+LSY CVT+G A L F AIY LVD
Sbjct: 342 KRWSILSLCLLTLGFSLHLF-GLHMNKSLYSLSYTCVTTGTAGLFFVAIYLLVD 394


>gi|168035930|ref|XP_001770461.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678169|gb|EDQ64630.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 487

 Score =  362 bits (929), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 186/356 (52%), Positives = 248/356 (69%), Gaps = 12/356 (3%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           K+ RLASLD+FRGL++A+MILVD+AGG WP I+H+PW G  LADFVMPFFLFIVGVA+AL
Sbjct: 42  KSPRLASLDVFRGLSIAVMILVDNAGGVWPSINHSPWTGITLADFVMPFFLFIVGVALAL 101

Query: 91  ALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
             KRI  D+  A +K + RT KLL  G+++QGG+ H   + +YGVD+  IR CGVLQRIA
Sbjct: 102 TYKRITRDKKVASQKALGRTAKLLIVGLVIQGGYFHGLHDTSYGVDLERIRWCGVLQRIA 161

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           L+Y++V+L EI+    + +D S   F+IF+ Y +HW +AA ++  YLALLYG YVPDW F
Sbjct: 162 LAYMVVALCEIWAPR-RRQDVSNDNFAIFKTYHFHWAVAAAIVATYLALLYGVYVPDWDF 220

Query: 210 ---TIINKDSADY------GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
              T++N  +         G +  V CGVR  + P CNAVGY+DR +LG++H+Y  P +R
Sbjct: 221 IPPTVLNSTALHVSVVRVNGSMSEVHCGVRGNIGPACNAVGYLDRTILGVSHLYQRPVFR 280

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
           R+ AC+ +SP  GPL   AP WC APF+PEGLLSS+S++ S  +G+HFGHV++H K H+A
Sbjct: 281 RTPACSVNSPDYGPLPSGAPDWCKAPFDPEGLLSSLSAVGSCFLGLHFGHVLVHRKEHIA 340

Query: 321 RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           RL  W+ M   LLI    L     +P NK LY++SY+  T GAA +VF+  Y LVD
Sbjct: 341 RLWDWMIMSLVLLIV-GLLLHLLGVPFNKPLYSVSYMLFTGGAAGVVFAGFYLLVD 395


>gi|326505544|dbj|BAJ95443.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 429

 Score =  361 bits (927), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 175/344 (50%), Positives = 232/344 (67%), Gaps = 6/344 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMI+VD AG   P ++H+PW G  +ADFVMPFFLFIVGVA+ALA 
Sbjct: 38  QRLVSLDVFRGITVLLMIIVDDAGSFLPAMNHSPWEGVTIADFVMPFFLFIVGVALALAY 97

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+PD+ DA +K   R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA++Y
Sbjct: 98  KRVPDKLDATRKATLRALKLFCVGLVLQGGFFHGVRSLTFGVDIAQIRLMGILQRIAIAY 157

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+ +L +I+ K   D D  +    + + Y +  L    + + Y+ALLYGTYVPDW++ I 
Sbjct: 158 LVTALCQIWLKGDDDVDSGL---DLIKRYKYQLLAGLLITITYMALLYGTYVPDWEYRIS 214

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
                +  K F V CGVR    P CNAVG IDRK+LGI H+Y  P + RS+ C+ DSP  
Sbjct: 215 GPGFTE--KTFTVRCGVRGDSGPGCNAVGMIDRKILGIQHLYGRPVYARSQQCSIDSPQN 272

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ +GH+I+H + H  R+  W+   F +
Sbjct: 273 GPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHKERIMHWLVPSFGM 332

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L+    + F   + +NK LYTLSY   T+GAA L+F+ IY LVD
Sbjct: 333 LVLAFAMDFFG-MHMNKPLYTLSYTLCTAGAAGLLFAGIYTLVD 375


>gi|242065256|ref|XP_002453917.1| hypothetical protein SORBIDRAFT_04g021400 [Sorghum bicolor]
 gi|241933748|gb|EES06893.1| hypothetical protein SORBIDRAFT_04g021400 [Sorghum bicolor]
          Length = 439

 Score =  360 bits (924), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 172/346 (49%), Positives = 234/346 (67%), Gaps = 6/346 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           +  RL SLD+FRG+ V LMI+VD AGG  P ++H+PW+G  +ADFVMPFFLFIVGV++ L
Sbjct: 46  RQPRLVSLDVFRGITVLLMIIVDDAGGFLPSLNHSPWDGVTIADFVMPFFLFIVGVSLTL 105

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+PD+ +A KK + R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA+
Sbjct: 106 AYKRVPDKLEATKKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 165

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLYG YVPDW++ 
Sbjct: 166 AYLLAAICEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYTILLYGIYVPDWEYK 222

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           I    S +  K F+V CGVR    P CNAVG +DR +LGI+H+Y  P + R+K C+ + P
Sbjct: 223 ISGPGSTE--KSFSVKCGVRGDTGPACNAVGMVDRTILGIDHLYRRPVYARTKECSINYP 280

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  DAPSWC APF+PEGLLSSV +I++ +IG+ FGH+IIH + H  R+  W+   F
Sbjct: 281 ENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHRGRITNWLIPSF 340

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ++L     + F+  + +NK LYT+SY   T+GAA L+F+ IYALVD
Sbjct: 341 SMLALAFLMDFSG-MRMNKPLYTISYTLATAGAAGLLFAGIYALVD 385


>gi|218185886|gb|EEC68313.1| hypothetical protein OsI_36402 [Oryza sativa Indica Group]
          Length = 450

 Score =  358 bits (920), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 181/356 (50%), Positives = 237/356 (66%), Gaps = 3/356 (0%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           D +  +    QRL SLD+FRG+ VALMILVD  GG  P ISH+PW+G  LADFV PFFLF
Sbjct: 46  DGEAAATTTRQRLVSLDVFRGITVALMILVDDVGGIVPAISHSPWDGVTLADFVFPFFLF 105

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           IVGV++A A K++PD+  A KK + R +KL   G++LQGGF H   ELTYGVD+R IRL 
Sbjct: 106 IVGVSLAFAYKKVPDKMLATKKAMLRAVKLFIVGLILQGGFFHGIHELTYGVDIRKIRLM 165

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIA++YL+V+L EI+ + V           +   Y     +   ++V YL +LYG 
Sbjct: 166 GVLQRIAIAYLVVALCEIWLRRVSSGGDIGSGSMLITRYHHQMFVGLVLVVTYLVILYGL 225

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           +VPDW++ + + DS    K F V CGV+    P CNAVG IDR VLGI H+Y HP + ++
Sbjct: 226 HVPDWEYEVTSLDSTV--KHFLVKCGVKGDTGPGCNAVGMIDRSVLGIQHLYAHPVYLKT 283

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
           + C+ DSP  GPL  +APSWC APF+PEGLLSS+ +I++ +IG+  GHVI+H K H  R+
Sbjct: 284 EQCSMDSPRNGPLPPNAPSWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKKHNERI 343

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           K+W T+   LL  G +LH    + +NK LY+LSY CVT+G A L F AIY LVD  
Sbjct: 344 KRWSTLSLCLLTLGFSLHLF-GLHMNKSLYSLSYTCVTAGTAGLFFVAIYLLVDVK 398


>gi|255543288|ref|XP_002512707.1| conserved hypothetical protein [Ricinus communis]
 gi|223548668|gb|EEF50159.1| conserved hypothetical protein [Ricinus communis]
          Length = 426

 Score =  358 bits (919), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 176/342 (51%), Positives = 235/342 (68%), Gaps = 4/342 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
            RL SLD+FRGL VALMILVD+AGG  P I+H+PWNG  LAD VMPFFLFIVGV++ L  
Sbjct: 50  HRLLSLDVFRGLTVALMILVDYAGGILPAINHSPWNGLTLADLVMPFFLFIVGVSLGLTY 109

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K++P +A A +K I RTLKLL  G  LQGG+ H  ++LTYGV+V  +RL G+LQRIA++Y
Sbjct: 110 KKLPCKAVATRKAILRTLKLLTLGFFLQGGYLHGLNDLTYGVNVEKLRLMGILQRIAIAY 169

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+ +L EI+ K     D      S+ R Y + W MA  ++  YL+L+YG YVPDW++ I 
Sbjct: 170 LVGALCEIWLKGDDHVDSCS---SLLRKYRFQWAMALVLISTYLSLIYGLYVPDWEYQIP 226

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
            + S+   K+F V CGVR    P CNAVG IDR  LGI H+Y  P + R+K C+ +SP  
Sbjct: 227 AEASSSPAKIFLVKCGVRGNTGPACNAVGLIDRTTLGIQHLYGKPVYARTKLCSINSPDY 286

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPL  DAPSWC APF+PEG+LSSV ++++ +IG+H+GH+I+H K H  R+  W+     L
Sbjct: 287 GPLPADAPSWCQAPFDPEGILSSVMAVVTCLIGLHYGHIIVHFKDHRNRMLHWMIPSICL 346

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           +  GL L F   + +NK LY+ SY+ VT+GAA ++F+ IY L
Sbjct: 347 IGLGLALDFL-GMHVNKALYSFSYMSVTAGAAGILFTGIYKL 387


>gi|302796996|ref|XP_002980259.1| hypothetical protein SELMODRAFT_112263 [Selaginella moellendorffii]
 gi|300151875|gb|EFJ18519.1| hypothetical protein SELMODRAFT_112263 [Selaginella moellendorffii]
          Length = 401

 Score =  358 bits (918), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 169/329 (51%), Positives = 227/329 (68%), Gaps = 10/329 (3%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           +MILVD+AGG+WP I+H+PWNG  LAD VMPFFLFIVGVA+AL  K+IP + D+ +K I 
Sbjct: 1   MMILVDNAGGEWPAINHSPWNGVTLADLVMPFFLFIVGVALALVYKKIPSKLDSTRKAIL 60

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R+LKL F G+ LQGG+ H  ++L+YGVD+ +IR CG+LQRIA  YL+V+L E++   VQ 
Sbjct: 61  RSLKLFFLGVFLQGGYFHGENDLSYGVDLTLIRWCGILQRIAFVYLVVALCEVWLPRVQG 120

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
                  F   + Y +HW+     L VYL+LLYG  VPDWQF + N  +        VTC
Sbjct: 121 S-----YFGFMQNYLFHWIFVVVTLTVYLSLLYGLKVPDWQFELPNNRNI----TMTVTC 171

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           G R+ L+PPCNAVGY+DR++LG+NH+   P + R+++C+ +SP  GPL  DAP WCHAPF
Sbjct: 172 GTRSNLDPPCNAVGYVDRQILGVNHLDQRPVFIRTESCSINSPDYGPLPADAPVWCHAPF 231

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEG+LSSVS+I++  IG+H+GH I+  K H  R+  ++   F LL  G  LH    I +
Sbjct: 232 DPEGILSSVSAIVTCFIGLHYGHFIVQCKEHKQRIINFIVPAFILLALGYVLHLL-GIKM 290

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           NK LY+ SY+C T+GAA  VF  +Y LVD
Sbjct: 291 NKPLYSFSYMCFTAGAAGAVFCLLYILVD 319


>gi|302759310|ref|XP_002963078.1| hypothetical protein SELMODRAFT_78688 [Selaginella moellendorffii]
 gi|300169939|gb|EFJ36541.1| hypothetical protein SELMODRAFT_78688 [Selaginella moellendorffii]
          Length = 401

 Score =  350 bits (899), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 166/329 (50%), Positives = 225/329 (68%), Gaps = 10/329 (3%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           +MILVD+AGG+WP I+H+PWNG  LAD VMPFFLFIVGVA+AL  K+IP + D+ +K I 
Sbjct: 1   MMILVDNAGGEWPAINHSPWNGVTLADLVMPFFLFIVGVALALVYKKIPSKLDSTRKAIL 60

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R+LKL F G+ LQGG+ H  ++L+YGVD+ +IR CG+LQRIA  Y++V+L E++   VQ 
Sbjct: 61  RSLKLFFLGVFLQGGYFHGENDLSYGVDLTLIRWCGILQRIAFVYVIVALCEVWLPRVQG 120

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
                  F I + Y +HW+     L VYL+LLYG  VP WQF + N  +        VTC
Sbjct: 121 S-----YFGIMQNYLFHWIFVVVTLTVYLSLLYGLKVPHWQFELPNNRNI----TMTVTC 171

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           G R+ L+P CNAVGY+DR++LG+NH+   P + R+++C+ +SP  GPL  DAP WCHAPF
Sbjct: 172 GTRSNLDPACNAVGYVDRQILGVNHLDQQPVFIRTESCSINSPDYGPLPADAPVWCHAPF 231

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEG+LSSVS+I++  IG+H+GH I+  K H  R+  ++     LL  G  LH    I +
Sbjct: 232 DPEGILSSVSAIVTCFIGLHYGHFIVQCKEHKQRIINFIVPAVILLALGYVLHLL-GIKM 290

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           NK LY+ SY+C T+GAA  VF  +Y LVD
Sbjct: 291 NKPLYSFSYMCFTAGAAGAVFCLLYILVD 319


>gi|116309454|emb|CAH66526.1| H0502B11.6 [Oryza sativa Indica Group]
 gi|218194797|gb|EEC77224.1| hypothetical protein OsI_15768 [Oryza sativa Indica Group]
          Length = 448

 Score =  350 bits (898), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 178/344 (51%), Positives = 237/344 (68%), Gaps = 6/344 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMILVD AG   P I+H+PW+G  LADFVMPFFLFIVGVA+ALA 
Sbjct: 57  QRLVSLDVFRGITVLLMILVDDAGAFLPAINHSPWDGVTLADFVMPFFLFIVGVALALAY 116

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+P++ +A +K I R LKL   G++LQGGF H    LT+G+D+  IRL G+LQRIA++Y
Sbjct: 117 KRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGIDMEKIRLMGILQRIAIAY 176

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           ++ +L EI+ K   D D     F + +   +   +   V++ Y+  LYGTYVPDW++ I 
Sbjct: 177 IVTALCEIWLKGDDDVDSG---FDLLKRNRYQLFIGLIVMITYMGFLYGTYVPDWEYRIS 233

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
              S +  K F V C VR    P CNAVG IDRK+LGI H+Y  P + RSK C+ +SP  
Sbjct: 234 VPGSTE--KSFFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRPVYARSKQCSINSPQN 291

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPLR DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+ +W+   F++
Sbjct: 292 GPLRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSM 351

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           LI   +L F   + +NK LYT+SY   T+GAA L+F+ IYALVD
Sbjct: 352 LILAFSLDFF-GMHMNKPLYTVSYALATAGAAGLLFAGIYALVD 394


>gi|115458212|ref|NP_001052706.1| Os04g0404900 [Oryza sativa Japonica Group]
 gi|113564277|dbj|BAF14620.1| Os04g0404900 [Oryza sativa Japonica Group]
 gi|222628804|gb|EEE60936.1| hypothetical protein OsJ_14685 [Oryza sativa Japonica Group]
          Length = 447

 Score =  350 bits (898), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 178/344 (51%), Positives = 237/344 (68%), Gaps = 6/344 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMILVD AG   P I+H+PW+G  LADFVMPFFLFIVGVA+ALA 
Sbjct: 56  QRLVSLDVFRGITVLLMILVDDAGAFLPAINHSPWDGVTLADFVMPFFLFIVGVALALAY 115

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+P++ +A +K I R LKL   G++LQGGF H    LT+G+D+  IRL G+LQRIA++Y
Sbjct: 116 KRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGIDMEKIRLMGILQRIAIAY 175

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           ++ +L EI+ K   D D     F + +   +   +   V++ Y+  LYGTYVPDW++ I 
Sbjct: 176 IVTALCEIWLKGDDDVDSG---FDLLKRNRYQLFIGLIVMITYMGFLYGTYVPDWEYRIS 232

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
              S +  K F V C VR    P CNAVG IDRK+LGI H+Y  P + RSK C+ +SP  
Sbjct: 233 VPGSTE--KSFFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRPVYARSKQCSINSPQN 290

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPLR DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+ +W+   F++
Sbjct: 291 GPLRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSM 350

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           LI   +L F   + +NK LYT+SY   T+GAA L+F+ IYALVD
Sbjct: 351 LILAFSLDFF-GMHMNKPLYTVSYALATAGAAGLLFAGIYALVD 393


>gi|212723180|ref|NP_001132467.1| uncharacterized protein LOC100193923 [Zea mays]
 gi|194694464|gb|ACF81316.1| unknown [Zea mays]
 gi|414587418|tpg|DAA37989.1| TPA: hypothetical protein ZEAMMB73_167983 [Zea mays]
          Length = 391

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 169/330 (51%), Positives = 226/330 (68%), Gaps = 6/330 (1%)

Query: 47  ALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVI 106
           ALMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+ALA KR+PD+ DA KK +
Sbjct: 14  ALMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKLDATKKAV 73

Query: 107 FRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ 166
            R LKL   G++LQGGF H    LT+GVD++ IRL G+LQRIA++YLL +L EI+ K  +
Sbjct: 74  LRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLLTALCEIWLKGDE 133

Query: 167 DKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVT 226
           D D     + + + Y +   + A V + Y++LLYGTYV DW++      S +  K F V 
Sbjct: 134 DVDYG---YDLLKRYRYQLFVGAIVGITYMSLLYGTYVRDWEYQTSGPGSIE--KSFFVK 188

Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
           CGVR   +P CNAVG IDR++LGI H+Y  P + RSK C+ DSP  GPL  DAPSWC AP
Sbjct: 189 CGVRGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQNGPLPPDAPSWCQAP 248

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+  W+   F++L+    + F   + 
Sbjct: 249 FDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERIMNWLIPSFSMLVLAFAMDFL-GLR 307

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           +NK LYTLSY   T+GAA L+F  IY LVD
Sbjct: 308 MNKPLYTLSYTLATAGAAGLLFCGIYTLVD 337


>gi|195642128|gb|ACG40532.1| hypothetical protein [Zea mays]
          Length = 379

 Score =  344 bits (883), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 168/329 (51%), Positives = 226/329 (68%), Gaps = 6/329 (1%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           LMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+ALA KR+PD+ DA KK + 
Sbjct: 3   LMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKLDATKKAVL 62

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R LKL   G++LQGGF H    LT+GVD++ IRL G+LQRIA++YLL +L EI+ K  +D
Sbjct: 63  RALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLLTALCEIWLKGDED 122

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
            D     + + + Y +  L+ A V + Y++LLYGTYVPD ++      S +  K F V C
Sbjct: 123 VDYG---YDLLKRYRYQLLVGAVVAITYMSLLYGTYVPDCEYQTSGPGSIE--KSFFVKC 177

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           GVR   +P CNAVG IDR++LGI H+Y  P + RSK C+ DSP  GPL  DAPSWC APF
Sbjct: 178 GVRGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQNGPLPPDAPSWCQAPF 237

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLSSV +I++ +IG+ +GH+I+H + H  R+  W+   F++L+    + F   + +
Sbjct: 238 DPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHRERIMNWLIPSFSMLVLAFAMDFL-GLRM 296

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           NK LYTLSY   T+GAA L+F  IY LVD
Sbjct: 297 NKPLYTLSYTLATAGAAGLLFCGIYTLVD 325


>gi|224057870|ref|XP_002299365.1| predicted protein [Populus trichocarpa]
 gi|222846623|gb|EEE84170.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score =  343 bits (881), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 173/343 (50%), Positives = 227/343 (66%), Gaps = 4/343 (1%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL SLD+FRGL VALMILVD AGG  P I+H+PWNG  LAD VMPFFLFIVGV++ L  K
Sbjct: 1   RLVSLDVFRGLTVALMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFIVGVSLGLTYK 60

Query: 94  RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
           ++  +A A +K I RTLKLL  G+ LQGGF H  ++LTYGVD+  IR  G+LQRIA+ YL
Sbjct: 61  KLSCKAVATRKAILRTLKLLIIGLFLQGGFLHGLNDLTYGVDMTQIRWMGILQRIAIGYL 120

Query: 154 LVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
           + ++ EI+ K     +      S+ R Y + W      + +YL+LLYG +VPDW++ I  
Sbjct: 121 VGAMCEIWLKG---GNHVTSGLSMLRKYQFQWAAVLMFVTIYLSLLYGLHVPDWEYQIPV 177

Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
             SA   K+F V CGVR    P CNA G IDR +LGI H+Y  P + R+K C+ +SP  G
Sbjct: 178 AASASTPKIFPVKCGVRGHTGPACNAGGMIDRTILGIQHLYRKPIYARTKPCSINSPGYG 237

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
           PL  DAPSWC APF+PEGLLSSV +I++ ++G+H+GH+I+H K H  R   W+       
Sbjct: 238 PLPPDAPSWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRTLHWMVPS-TCF 296

Query: 334 IFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           +    +     + +NK LYT SY+CVT+GAA +VF+ IY LVD
Sbjct: 297 LVLGLVLDLLGMHVNKALYTFSYMCVTAGAAGIVFTGIYLLVD 339


>gi|357510831|ref|XP_003625704.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
 gi|355500719|gb|AES81922.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
          Length = 444

 Score =  340 bits (873), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 158/276 (57%), Positives = 200/276 (72%), Gaps = 4/276 (1%)

Query: 101 AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
            VKK+I RTLKLLFWGILLQGG+SHAPDEL YGV+++ IR CG+LQRIAL Y +V+L+E 
Sbjct: 79  TVKKIILRTLKLLFWGILLQGGYSHAPDELVYGVNMKFIRWCGILQRIALVYCIVALIET 138

Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG 220
           FT  ++    S GR +IF  Y   W       ++Y+   +  YVP+W F  ++  + D  
Sbjct: 139 FTTKLRPTTLSPGRIAIFTAY--KWFGGFMAFLIYMITTFALYVPNWSF--VDHVNNDEP 194

Query: 221 KVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAP 280
           K + V CG+R  L P CNAVGY+DR+  G+NH+Y  P WRR KACT  SP EGP R DAP
Sbjct: 195 KRYTVICGMRGHLGPACNAVGYVDRQTWGVNHLYSQPVWRRLKACTFSSPSEGPFRDDAP 254

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLH 340
           SWC APFEPEGLLSS+S+ILS  IG+H+GHV+IH K H  RLKQW +MGF LL+  + LH
Sbjct: 255 SWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKSHSERLKQWFSMGFVLLVVAIILH 314

Query: 341 FTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           FT+AIP+NKQLY+ SYVC T+GAA ++FS +Y L+D
Sbjct: 315 FTDAIPINKQLYSFSYVCFTAGAAGIIFSILYILID 350


>gi|255548527|ref|XP_002515320.1| conserved hypothetical protein [Ricinus communis]
 gi|223545800|gb|EEF47304.1| conserved hypothetical protein [Ricinus communis]
          Length = 460

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 170/367 (46%), Positives = 237/367 (64%), Gaps = 9/367 (2%)

Query: 19  PDVSDQQEKSHLK----TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
           P  S   E+  L      QRL SLD+FRGL +ALMILVD AGG +P I+H+PW G  LAD
Sbjct: 32  PSSSSSDEREALPPPTPNQRLMSLDVFRGLTIALMILVDDAGGAFPSINHSPWFGVTLAD 91

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
           FVMPFFLF VGV+I+L  K+I  ++ A KKV+ RT+KL   G+LLQGG+ H  + LTYG+
Sbjct: 92  FVMPFFLFGVGVSISLVFKKISSKSVATKKVMLRTIKLFLLGVLLQGGYFHGRNHLTYGI 151

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           DV  IR  GVLQRI++ YL  S+ EI+  +    D  +   +  + Y   W+++  +  +
Sbjct: 152 DVLKIRWLGVLQRISIGYLFASISEIWLVNHCIVDSPL---AFMKKYYAQWMVSLILCSL 208

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGK-VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           Y  LLY  +VP+W+F   + +   YG     V CGVR  L PPCNAVG IDR +LG +H+
Sbjct: 209 YTCLLYFLFVPNWEFEASSINLFGYGSGTQTVICGVRGSLEPPCNAVGLIDRFLLGEHHL 268

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y  P +RR+K C+ +SP  GPL  ++P WC APF+PEG+LSS+ + ++ ++G+ FGHV++
Sbjct: 269 YQRPVYRRTKQCSVNSPDYGPLPPNSPPWCLAPFDPEGILSSLMAAVTCLLGLQFGHVLV 328

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H K H+ R+  W+   F+LL+ G  L     IP +K LYTLSY C+T+GA+ L+ + I+ 
Sbjct: 329 HLKDHMQRILVWLISSFSLLVTGFVLKLI-GIPFSKPLYTLSYTCITTGASGLLLTIIFY 387

Query: 374 LVDCNIF 380
            VD   F
Sbjct: 388 AVDVKHF 394


>gi|147844298|emb|CAN82113.1| hypothetical protein VITISV_031338 [Vitis vinifera]
          Length = 401

 Score =  337 bits (863), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 172/353 (48%), Positives = 231/353 (65%), Gaps = 25/353 (7%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA+MILVD AGG  P I+H+PWNG  LADFVMPFFLFIVGV++ALA 
Sbjct: 51  RRLVSLDVFRGLTVAIMILVDDAGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALAY 110

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +     A K              +  GG+ H  + LTYGVD+  IRL G+LQRIA++Y
Sbjct: 111 KNLSSGYLATK--------------MASGGYFHGLNNLTYGVDIEQIRLAGILQRIAVAY 156

Query: 153 LLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI 211
            L ++ EI+ K D   K  S    S+ + Y + W +   + V Y +LLYG YVPDW+++I
Sbjct: 157 FLAAVCEIWLKGDXNVKSGS----SLLKKYQFQWAVVLVLTVAYCSLLYGLYVPDWEYSI 212

Query: 212 INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPF 271
            ++ S+   K+F V CGVR+   P CNAVG IDR VLGI H+Y  P + R K C+ +SP 
Sbjct: 213 PSETSSSALKIFKVKCGVRSDTGPACNAVGMIDRNVLGIQHLYKRPIYARMKQCSINSPD 272

Query: 272 EGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA 331
            GPL  +AP+WC APF+PEGLLSSV +I++ ++G+H+GH+I+H K H  R+  W+     
Sbjct: 273 YGPLPPNAPTWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHKDRILHWIVPSSC 332

Query: 332 LLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           LL+ G  L F   + +NK LYTLSY+CVT+GAA ++F+ IY +V     GP T
Sbjct: 333 LLVLGFALDFF-GMHVNKALYTLSYMCVTAGAAGILFAGIYLMV-----GPLT 379


>gi|186530230|ref|NP_199601.2| uncharacterized protein [Arabidopsis thaliana]
 gi|332008203|gb|AED95586.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 440

 Score =  333 bits (855), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 177/347 (51%), Positives = 234/347 (67%), Gaps = 7/347 (2%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 44  ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220

Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+ + 
Sbjct: 221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 280

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
           P  GPL  DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K H  RL QW+   
Sbjct: 281 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWILRS 340

Query: 330 FALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           F LL+          + LNK LYTLSY+CVTSGA+  + SAIY +VD
Sbjct: 341 FCLLML-GLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSAIYLMVD 386


>gi|449446789|ref|XP_004141153.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 494

 Score =  333 bits (853), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 176/366 (48%), Positives = 246/366 (67%), Gaps = 9/366 (2%)

Query: 16  ISEPDVSDQQE---KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNL 72
           I EP  S       +S  +  RL SLD+FRG+ VALMI+VD+AGG  P I+H+PW+G  L
Sbjct: 79  IDEPQFSSSVRPILRSSDQCHRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWDGLTL 138

Query: 73  ADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
           AD VMPFFLFIVGV++ALA K+IP R  A +K + RTLKLLF G+ LQGGF H  + LTY
Sbjct: 139 ADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGVNNLTY 198

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
           GVD++ IR  G+LQRIA++Y L +L EI+ K     D      ++ R Y    + A  + 
Sbjct: 199 GVDIQQIRWMGILQRIAIAYFLAALCEIWLK---GSDYVNSETALRRKYQLQLVAAVVLT 255

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYG--KVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           ++YLAL YG YVPDW++ + +  ++D    K+F+V CG R    P CNAVG IDRK+ GI
Sbjct: 256 MLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSVKCGTRGDTGPACNAVGMIDRKIFGI 315

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            H+Y  P + R++ C+ ++P  GPL  DAPSWC APF+PEGLLS+V ++++ ++G+H+GH
Sbjct: 316 QHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGH 375

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           +I+H K H  R+  W+     L++  + L F   + +NK LYT+SY+ VT+GAA L+F+ 
Sbjct: 376 IIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFL-GMHINKVLYTVSYMSVTAGAAGLLFTG 434

Query: 371 IYALVD 376
           IY +VD
Sbjct: 435 IYLMVD 440


>gi|242071239|ref|XP_002450896.1| hypothetical protein SORBIDRAFT_05g020800 [Sorghum bicolor]
 gi|241936739|gb|EES09884.1| hypothetical protein SORBIDRAFT_05g020800 [Sorghum bicolor]
          Length = 455

 Score =  332 bits (850), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 174/350 (49%), Positives = 231/350 (66%), Gaps = 4/350 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + QRLASLD+FRG+ V LMILVD  GG  P ISH+PW+G  LADFV PFFLFIVGV++A 
Sbjct: 60  RGQRLASLDVFRGITVVLMILVDDVGGLVPAISHSPWDGVTLADFVFPFFLFIVGVSLAF 119

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+P++  A KK + R  KL   G+LLQGG+ H   +L+YGVD+  IRL G+LQRIA+
Sbjct: 120 AYKRVPNKTLATKKALIRASKLFLLGLLLQGGYFHTIHDLSYGVDLHKIRLMGILQRIAI 179

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +Y  V+L EI+ +     D   G + + R Y     +   + V Y  LLYG YVPDW++ 
Sbjct: 180 AYFAVALCEIWLRG-GASDNGAGGYVLIRRYRHQLFVGLVLTVTYTVLLYGMYVPDWEYV 238

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           + + D+    K F V CGVR    P CNAVG IDR VLGI H+Y HP + ++  C+ +SP
Sbjct: 239 VTSPDTTL--KNFMVKCGVRGDTGPGCNAVGMIDRCVLGIQHLYAHPVYLKTAQCSINSP 296

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
             GPL  DAP+WC APF+PEGLLSS+ +I++ +IG+  GHVI+H K H  R+ +W     
Sbjct: 297 RNGPLPSDAPTWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKQHSKRIVRWSIPSL 356

Query: 331 ALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIF 380
            LLI G++L     + +NK LY+LSY CVT+G+A L F+ IY LVD   +
Sbjct: 357 ILLILGVSLDLF-GMHMNKSLYSLSYTCVTTGSAGLFFAGIYLLVDVYFY 405


>gi|218190872|gb|EEC73299.1| hypothetical protein OsI_07466 [Oryza sativa Indica Group]
          Length = 454

 Score =  331 bits (848), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 159/329 (48%), Positives = 221/329 (67%), Gaps = 6/329 (1%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           LMI+VD AG   P ++H+PW+G  +ADFVMPFFLF+VG+++ LA KR+PD+ +A KK + 
Sbjct: 78  LMIIVDDAGAFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVL 137

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA++YLL ++ EI+ K   D
Sbjct: 138 RALKLFCLGLVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICEIWLKGDDD 197

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
            D  +    + R Y +  ++A  +  +Y  +L G YVPDW++ I    S +  K F+V C
Sbjct: 198 VDCGL---DVIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQISGPGSTE--KSFSVRC 252

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           GVR    P CNAVG +DR +LGI+H+Y  P + R+K C+ + P  GPL  DAPSWC APF
Sbjct: 253 GVRGDTGPACNAVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPF 312

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLSSV +I++ +IG+ FGH+IIH + H  R+  W+   F++L    ++ F   I +
Sbjct: 313 DPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDFI-GIRM 371

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           NK LYT+SY   TSGAA L+F+ IY LVD
Sbjct: 372 NKPLYTISYALATSGAAGLLFAGIYTLVD 400


>gi|125582342|gb|EAZ23273.1| hypothetical protein OsJ_06967 [Oryza sativa Japonica Group]
          Length = 423

 Score =  331 bits (848), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 159/329 (48%), Positives = 221/329 (67%), Gaps = 6/329 (1%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           LMI+VD AG   P ++H+PW+G  +ADFVMPFFLF+VG+++ LA KR+PD+ +A KK + 
Sbjct: 47  LMIIVDDAGAFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVL 106

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA++YLL ++ EI+ K   D
Sbjct: 107 RALKLFCLGLVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICEIWLKGDDD 166

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
            D  +    + R Y +  ++A  +  +Y  +L G YVPDW++ I    S +  K F+V C
Sbjct: 167 VDCGL---DVIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQISGPGSTE--KSFSVRC 221

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           GVR    P CNAVG +DR +LGI+H+Y  P + R+K C+ + P  GPL  DAPSWC APF
Sbjct: 222 GVRGDTGPACNAVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPF 281

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLSSV +I++ +IG+ FGH+IIH + H  R+  W+   F++L    ++ F   I +
Sbjct: 282 DPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDFI-GIRM 340

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           NK LYT+SY   TSGAA L+F+ IY LVD
Sbjct: 341 NKPLYTISYALATSGAAGLLFAGIYTLVD 369


>gi|38346153|emb|CAE02025.2| OSJNBb0118P14.13 [Oryza sativa Japonica Group]
          Length = 415

 Score =  330 bits (846), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 174/368 (47%), Positives = 242/368 (65%), Gaps = 13/368 (3%)

Query: 15  IISEPDVSDQQEKSHL------KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWN 68
           +++ PD  +Q++   +      + Q L  + +F    + LMILVD AG   P I+H+PW+
Sbjct: 1   MVAWPDNRNQRQGIQVSVVDVAEGQWLTCVHLFMP-EMPLMILVDDAGAFLPAINHSPWD 59

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
           G  LADFVMPFFLFIVGVA+ALA KR+P++ +A +K I R LKL   G++LQGGF H   
Sbjct: 60  GVTLADFVMPFFLFIVGVALALAYKRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVR 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            LT+G+D+  IRL G+LQRIA++Y++ +L EI+ K   D D     F + +   +   + 
Sbjct: 120 SLTFGIDMEKIRLMGILQRIAIAYIVTALCEIWLKGDDDVDSG---FDLLKRNRYQLFIG 176

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
             V++ Y+  LYGTYVPDW++ I    S +  K F V C VR    P CNAVG IDRK+L
Sbjct: 177 LIVMITYMGFLYGTYVPDWEYRISVPGSTE--KSFFVKCSVRGDTGPGCNAVGMIDRKIL 234

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           GI H+Y  P + RSK C+ +SP  GPLR DAPSWC APF+PEGLLSSV +I++ +IG+ +
Sbjct: 235 GIQHLYCRPVYARSKQCSINSPQNGPLRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQY 294

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           GHVI+H + H  R+ +W+   F++LI   +L F   + +NK LYT+SY   T+GAA L+F
Sbjct: 295 GHVIVHFQKHKERIMKWLIPSFSMLILAFSLDFF-GMHMNKPLYTVSYALATAGAAGLLF 353

Query: 369 SAIYALVD 376
           + IYALVD
Sbjct: 354 AGIYALVD 361


>gi|388508176|gb|AFK42154.1| unknown [Lotus japonicus]
          Length = 467

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 177/353 (50%), Positives = 238/353 (67%), Gaps = 7/353 (1%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
             K   ++QRL S+D+FRGL VALMILVD AGG  P ++H+PW+G  +ADFVMP FLFIV
Sbjct: 67  NHKPQSQSQRLVSIDVFRGLTVALMILVDDAGGLLPALNHSPWDGLTIADFVMPLFLFIV 126

Query: 85  GVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           G+++AL  K++     A +K I R LKLL  G+ LQGG+ H  ++LT+GVD++ IRL G+
Sbjct: 127 GLSLALTYKKLSCPVIATRKAILRALKLLALGLFLQGGYFHRINDLTFGVDMKQIRLMGI 186

Query: 145 LQRIALSYLLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           LQRIA++YLL +L EI+ K D   K  S    S+ R Y + W +A  +   YL LLYG Y
Sbjct: 187 LQRIAIAYLLTALCEIWLKCDDIVKSGS----SLLRKYRYQWAVAFVLSGFYLCLLYGLY 242

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           VPDW++  I  DS+   K F+V CGV A   P CN VG IDRK+LGI H+Y  P + R  
Sbjct: 243 VPDWEYQ-IPTDSSSVPKTFSVKCGVWADTGPACNVVGMIDRKILGIQHLYRRPIYARMP 301

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
            C+ +SP  GPL  DAP+WC APF+PEGLLSSV +I++ +IG+H+GH+I+H K H  R+ 
Sbjct: 302 ECSINSPDYGPLPPDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIIVHYKDHRVRII 361

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            W+     L++FG  LH    + +NK LY+ SY CVT+GAA ++  AIY +VD
Sbjct: 362 HWMIPTSCLIVFGFALHLF-GMHVNKVLYSFSYTCVTAGAAGILLVAIYLMVD 413


>gi|115446433|ref|NP_001046996.1| Os02g0526000 [Oryza sativa Japonica Group]
 gi|49388281|dbj|BAD25399.1| unknown protein [Oryza sativa Japonica Group]
 gi|49388287|dbj|BAD25402.1| unknown protein [Oryza sativa Japonica Group]
 gi|113536527|dbj|BAF08910.1| Os02g0526000 [Oryza sativa Japonica Group]
          Length = 376

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 158/328 (48%), Positives = 220/328 (67%), Gaps = 6/328 (1%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
           MI+VD AG   P ++H+PW+G  +ADFVMPFFLF+VG+++ LA KR+PD+ +A KK + R
Sbjct: 1   MIIVDDAGAFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVLR 60

Query: 109 TLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
            LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA++YLL ++ EI+ K   D 
Sbjct: 61  ALKLFCLGLVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICEIWLKGDDDV 120

Query: 169 DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCG 228
           D  +    + R Y +  ++A  +  +Y  +L G YVPDW++ I    S +  K F+V CG
Sbjct: 121 DCGL---DVIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQISGPGSTE--KSFSVRCG 175

Query: 229 VRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFE 288
           VR    P CNAVG +DR +LGI+H+Y  P + R+K C+ + P  GPL  DAPSWC APF+
Sbjct: 176 VRGDTGPACNAVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPFD 235

Query: 289 PEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLN 348
           PEGLLSSV +I++ +IG+ FGH+IIH + H  R+  W+   F++L    ++ F   I +N
Sbjct: 236 PEGLLSSVMAIVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDFI-GIRMN 294

Query: 349 KQLYTLSYVCVTSGAAALVFSAIYALVD 376
           K LYT+SY   TSGAA L+F+ IY LVD
Sbjct: 295 KPLYTISYALATSGAAGLLFAGIYTLVD 322


>gi|297791891|ref|XP_002863830.1| hypothetical protein ARALYDRAFT_494835 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309665|gb|EFH40089.1| hypothetical protein ARALYDRAFT_494835 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 432

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 181/366 (49%), Positives = 237/366 (64%), Gaps = 14/366 (3%)

Query: 14  LIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
           L IS P +   +E       RL SLD+FRGL VALMILVD  G   P I+H+PW+G  LA
Sbjct: 24  LQISRPSLPPDKE-------RLVSLDVFRGLTVALMILVDDVGEILPSINHSPWDGVTLA 76

Query: 74  DFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           DFVMPFFLFIVGV++A A K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG
Sbjct: 77  DFVMPFFLFIVGVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYG 136

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           +DV  IR  G+LQRIA++YL+ +L EI+ K   +    +   S+ + Y +HW++A  +  
Sbjct: 137 IDVEKIRFMGILQRIAIAYLVAALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITT 193

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           +YL+LLYG YV DW++ I  +D       F    V CGVR    P CNAVG +DR  LGI
Sbjct: 194 IYLSLLYGLYVSDWEYQISTEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGI 253

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            H+Y  P + R+K C+  SP  GPL  DAPSWC APF+PEGLLSS+ +I++ ++G+H+GH
Sbjct: 254 QHLYRKPVYARTKQCSISSPNNGPLPPDAPSWCQAPFDPEGLLSSLMAIVTCLVGLHYGH 313

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           +IIH K H  RL QW+   F LL+          + LNK LYTLSY+CVTSGA+  + SA
Sbjct: 314 IIIHFKDHKKRLNQWILRSFCLLML-GLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSA 372

Query: 371 IYALVD 376
           IY +VD
Sbjct: 373 IYLMVD 378


>gi|224131042|ref|XP_002320987.1| predicted protein [Populus trichocarpa]
 gi|222861760|gb|EEE99302.1| predicted protein [Populus trichocarpa]
          Length = 481

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 180/390 (46%), Positives = 242/390 (62%), Gaps = 15/390 (3%)

Query: 1   MSEIKAETTHHHP--------LIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILV 52
           + E + E   H+P          I+    +     S   TQRL SLD+FRGL VALMILV
Sbjct: 9   LDERQREPLLHNPRSLSNEEEEEITNTPSTSSSNASPPPTQRLLSLDVFRGLTVALMILV 68

Query: 53  DHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKL 112
           D AGG +P I+H+PW G  LADFVMPFFLF+VGV+I+L  K++  +  A KKVI RT+KL
Sbjct: 69  DDAGGAFPCINHSPWFGVTLADFVMPFFLFVVGVSISLVFKKVSSKPMATKKVIQRTIKL 128

Query: 113 LFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV 172
              G+LLQGG+ H    LTYGVDV  IR  GVLQRI++ YL  ++ EI+  D    D  +
Sbjct: 129 FLLGLLLQGGYFHGRHNLTYGVDVGKIRWMGVLQRISIGYLFAAMSEIWLVDSITVDSPM 188

Query: 173 GRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI--INKDSADYGKVFNVTCGVR 230
              +  + Y   W++A      Y+ LLYG YVPDW+F +   N    ++G    V CGVR
Sbjct: 189 ---AFVKKYYIQWMVAFLFCTFYMCLLYGLYVPDWEFEVPSTNLFEHEFGTKI-VNCGVR 244

Query: 231 AKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPE 290
             L PPCNAVG IDR  LG +H+Y HP +RR+K C+ +SP  GPL  ++P WC APF+PE
Sbjct: 245 GSLEPPCNAVGLIDRFFLGEHHLYQHPVYRRTKHCSVNSPDYGPLPPNSPGWCLAPFDPE 304

Query: 291 GLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQ 350
           G+LSS+ + ++  +G+ FGH+++H KGH+ RL  W    F +LI G        +PL K 
Sbjct: 305 GILSSLMAAITCFLGLQFGHILVHFKGHMQRLCLWSVCSFIILITGYVFELL-GVPLCKP 363

Query: 351 LYTLSYVCVTSGAAALVFSAIYALVDCNIF 380
           LYTLSY+C+T+GA+ L  + I+ +VD   F
Sbjct: 364 LYTLSYMCITAGASGLALTIIFYIVDVKHF 393


>gi|326512130|dbj|BAJ96046.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score =  327 bits (838), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 162/367 (44%), Positives = 231/367 (62%), Gaps = 12/367 (3%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           + +P  +D +     K QR+ASLD+FRGL VA+MILVD AGG WP I+HAPW G  +ADF
Sbjct: 36  LPQPPGADAKPGQQ-KPQRVASLDVFRGLTVAMMILVDDAGGAWPGINHAPWLGVTVADF 94

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           VMP FLFI+GV+ AL  K+ P++    KK   R +KL   G++LQGG+ H   +LTYGVD
Sbjct: 95  VMPAFLFIIGVSAALVFKKTPNKIATSKKAACRAIKLFILGVILQGGYIHGRHKLTYGVD 154

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
           +  IR  GVLQRIA+ Y L ++ EI+  +    D  V   S  + Y   W+MA  +  +Y
Sbjct: 155 LDQIRWLGVLQRIAIGYFLAAISEIWLVNNTSVDSPV---SFVKKYFMEWIMAIIISALY 211

Query: 196 LALLYGTYVPDWQFTI------INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
           + L++G YVP+W+F +       +  S D G    + CG+   L PPCNAVG++DR +LG
Sbjct: 212 IGLVFGLYVPNWEFKVQTSSSTFSNPSNDVG-FKTIQCGLTGSLGPPCNAVGFVDRVLLG 270

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
            +H+Y +P ++R+K C+ +SP  GPL  +AP WC APF+PEGLLS++ + +S  +G+HFG
Sbjct: 271 ESHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAAVSCFVGLHFG 330

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           HV+IH K H  R+  W+     L + G  L     +P +K LYT+SY+ +T G +  V  
Sbjct: 331 HVLIHCKTHSQRMMSWLLASTVLTVSGFLLQLL-GMPFSKPLYTVSYMLLTGGVSGFVLL 389

Query: 370 AIYALVD 376
            +Y +VD
Sbjct: 390 LLYCIVD 396


>gi|356527477|ref|XP_003532336.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 463

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 184/380 (48%), Positives = 253/380 (66%), Gaps = 9/380 (2%)

Query: 1   MSEIKAETT----HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG 56
           +S   A+TT    H + +I  +  ++  Q +   K+ RL SLD+FRGL VALMILVD AG
Sbjct: 35  VSPTIAQTTPLHLHINNIIEEQHIIARHQPQPQPKSPRLVSLDVFRGLTVALMILVDDAG 94

Query: 57  GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWG 116
           G  P ++H+PWNG  LAD+VMPFFLFIVGV++AL  K++    DA +K   R LKLL  G
Sbjct: 95  GLIPALNHSPWNGLTLADYVMPFFLFIVGVSLALTYKKLSCGVDASRKASLRALKLLVLG 154

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
           + LQGG+ H  ++LTYGVD++ IR  G+LQRI ++YL+ +L EI+ K   D   + G  S
Sbjct: 155 LFLQGGYFHRVNDLTYGVDLKQIRWMGILQRIGVAYLVAALCEIWLKS--DDTVNSGP-S 211

Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
           + R Y + W +A  +  +YL LLYG YVPDW + I  + S++  K F+V CGVR    P 
Sbjct: 212 LLRKYRYQWAVALILSFLYLCLLYGLYVPDWVYQIQTEPSSE-PKTFSVKCGVRGNTGPA 270

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
           CNAVG IDR +LGI+H+Y  P + R   C+ +SP  GPL  DAP+WC APF+PEGLLSSV
Sbjct: 271 CNAVGMIDRTILGIHHLYQRPIYARMPECSINSPNYGPLPPDAPAWCQAPFDPEGLLSSV 330

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSY 356
            +I++ +IG+H+GH+I+H K H  R+  W+     L++FGL L     + +NK LY+LSY
Sbjct: 331 MAIVTCLIGLHYGHIIVHFKDHRVRIIYWMIPTSCLVVFGLALDLF-GMHINKVLYSLSY 389

Query: 357 VCVTSGAAALVFSAIYALVD 376
            CVT+GAA ++F  IY +VD
Sbjct: 390 TCVTAGAAGILFVGIYLMVD 409


>gi|224080634|ref|XP_002306188.1| predicted protein [Populus trichocarpa]
 gi|222849152|gb|EEE86699.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score =  324 bits (831), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 169/373 (45%), Positives = 238/373 (63%), Gaps = 20/373 (5%)

Query: 11  HHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGC 70
           H PL+    D+ +Q   S  KT R+ASLD+FRGL V LM+LVD+ G   P I+H+PWNG 
Sbjct: 6   HKPLL----DIEEQPRTSK-KTPRVASLDVFRGLCVFLMMLVDYGGAIVPIIAHSPWNGL 60

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           +LADFVMPFFLFI GV++AL  KR+P+R +A +K + R ++L   G++LQGG+ H  + L
Sbjct: 61  HLADFVMPFFLFIAGVSLALVYKRVPNRIEATRKAVLRAVELFLLGVILQGGYFHGINFL 120

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
           TYGVD++ IR  G+LQRI++ Y+  +L EI+      +D      S  + Y WHW  A  
Sbjct: 121 TYGVDMKRIRWLGILQRISIGYIFAALCEIWLSCRSRRD-----VSFLKSYYWHWGAAFS 175

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSA----DYGKVF---NVTCGVRAKLNPPCNAVGYI 243
           +  +YL LLYG YVPDWQF + N  S+    ++  V+    V C VR  L P CN+ G I
Sbjct: 176 LSAIYLGLLYGLYVPDWQFEMSNATSSVFPTNHSYVYMLTQVKCSVRGDLGPACNSAGMI 235

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           DR VLGI+H+Y  P +R  K C   +   G + + APSWCHAPF+PEG+LSS+++ ++ I
Sbjct: 236 DRYVLGIDHLYKKPVYRNLKECNMST--NGQVPESAPSWCHAPFDPEGVLSSITAAVACI 293

Query: 304 IGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGA 363
           IG+ +GH + H + H  R++ W+    +LL+ GL L      P+NK LYT  Y+ +T  +
Sbjct: 294 IGLQYGHSLAHLQDHKQRMQNWILFSLSLLLVGLLLAVVGD-PVNKSLYTFGYMLITCAS 352

Query: 364 AALVFSAIYALVD 376
           A + +SAIY LVD
Sbjct: 353 AGITYSAIYLLVD 365


>gi|357134575|ref|XP_003568892.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Brachypodium distachyon]
          Length = 495

 Score =  323 bits (829), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 163/377 (43%), Positives = 229/377 (60%), Gaps = 11/377 (2%)

Query: 6   AETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHA 65
           A   H +P   S         K   K  R+ASLD+FRGL VA+MILVD AGG WP I+HA
Sbjct: 22  ASEIHPYPESPSPRQPPGTDAKPERKPHRVASLDVFRGLTVAMMILVDDAGGAWPGINHA 81

Query: 66  PWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
           PW G  +ADFVMP FLFI+GV+ AL  KR  ++    KK  +R  KL   G++LQGG+ H
Sbjct: 82  PWLGVTVADFVMPAFLFIIGVSAALVFKRTQNKIATSKKAAYRAFKLFILGVILQGGYIH 141

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
               LTYGVD+  IR  GVLQRIA+ Y L ++ EI+  +    D  V   S  + Y   W
Sbjct: 142 GRHNLTYGVDLDHIRWLGVLQRIAIGYFLAAMSEIWLVNNISVDSPV---SFVKKYFMEW 198

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTI------INKDSADYGKVFNVTCGVRAKLNPPCNA 239
           +MA  +  +Y++L++G YVP+W+F +       +  S + G    V CG+R  L PPCNA
Sbjct: 199 VMAIMISALYISLIFGLYVPNWEFKVQTSNLTFSNGSNEIG-FKTVQCGLRGSLGPPCNA 257

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
           VG++DR +LG NH+Y +P ++R+K C+ +SP  G L  +AP WC APF+PEGLLS++ + 
Sbjct: 258 VGFVDRVLLGENHLYKNPVYKRTKECSVNSPDYGALPPNAPDWCLAPFDPEGLLSTLMAA 317

Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           +S  +G+HFGHV+IH + H  R+  W+     L   G  L  +  +P +K LYT+SY+ +
Sbjct: 318 VSCFVGLHFGHVLIHCQNHSQRMLSWLLASTVLTASGFLLQLS-GMPFSKPLYTVSYMLL 376

Query: 360 TSGAAALVFSAIYALVD 376
           T G +  +   +Y +VD
Sbjct: 377 TGGVSGFLLLLLYYIVD 393


>gi|356516509|ref|XP_003526936.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 416

 Score =  323 bits (828), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 165/365 (45%), Positives = 229/365 (62%), Gaps = 12/365 (3%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
           +P + +  E +  +  R+ASLD+FRGL+V LMI VD+A   +P I+HAPWNG +LADFVM
Sbjct: 5   QPLLLNDSEPTQFQNTRIASLDVFRGLSVFLMIFVDYAASIFPIIAHAPWNGIHLADFVM 64

Query: 78  PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           PFFLFI G+++AL  KR P R  A  K   R L L   GILLQGG+ H    LT+GVD++
Sbjct: 65  PFFLFIAGISLALVYKRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQ 124

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
            IR  G+LQRI++ Y++ +L EI+    + K+         + Y W W +A  +L +Y  
Sbjct: 125 RIRWLGILQRISIGYIVAALCEIWLPAPRWKE-----LGFVKSYYWQWFVAVILLALYSG 179

Query: 198 LLYGTYVPDWQFTIINKDSA----DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           LLYG YVPDWQF +    S+      G ++ V C VR  L P CN+ G IDR +LG++H+
Sbjct: 180 LLYGLYVPDWQFDVSASTSSLPPIGGGDIYMVNCSVRGDLGPACNSAGMIDRYILGLDHL 239

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y  P +R  K C   +  +G +   +PSWCHAPF+PEG+LSS+++ +S IIG+ +GHV+ 
Sbjct: 240 YRKPVYRNLKGCNMSA--KGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHVLA 297

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H + H  RL  W+    + L  GL L     IPLNK LYT+SY+ +TS A+ L F A+Y 
Sbjct: 298 HLQDHKGRLYNWMCFSLSFLALGLFLALI-GIPLNKSLYTVSYMLLTSAASGLTFIALYF 356

Query: 374 LVDCN 378
           LVD +
Sbjct: 357 LVDVH 361


>gi|356569086|ref|XP_003552737.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 461

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 184/367 (50%), Positives = 248/367 (67%), Gaps = 7/367 (1%)

Query: 10  HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
           H H +I  +  +S  Q +   K+ RL SLD+FRGL VALMILVD AGG  P ++H+PWNG
Sbjct: 48  HIHNIIEEQRIISRHQPQP--KSPRLVSLDVFRGLTVALMILVDDAGGLIPALNHSPWNG 105

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
             LAD+VMPFFLFIVGV++AL+ K++    DA +K   R LKLL  G+ LQGG+ H  ++
Sbjct: 106 LTLADYVMPFFLFIVGVSLALSYKKLSCGVDASRKASLRALKLLALGLFLQGGYFHRVND 165

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
           LT+GVD++ IR  G+LQRIA++YL+V+L EI+ K   D   + G  S+ R Y + W +A 
Sbjct: 166 LTFGVDIKQIRWMGILQRIAVAYLVVALCEIWLKS--DDTVNSGP-SLLRKYRYQWAVAL 222

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
            +  +YL LLYG YVPDW + I  + SA+  K F+V CGVR    P CN VG IDR +LG
Sbjct: 223 ILSFLYLCLLYGLYVPDWVYQIQTEPSAE-PKTFSVKCGVRGNTGPACNVVGMIDRMILG 281

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
           I H+Y  P + R   C+ +SP  GPL  DAP+WC APF+PEGLLSSV +I++ +IG+H+G
Sbjct: 282 IQHLYKRPIYARMPECSINSPNYGPLPPDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYG 341

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           H+I+H K H  R+  W+     LL+FGL L     + +NK LY+LSY CVT+GAA ++F 
Sbjct: 342 HIIVHFKDHRVRIIYWMIPTSCLLVFGLALDLF-GMHINKVLYSLSYTCVTAGAAGVLFV 400

Query: 370 AIYALVD 376
            IY +VD
Sbjct: 401 GIYLMVD 407


>gi|168007055|ref|XP_001756224.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692734|gb|EDQ79090.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 411

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 171/336 (50%), Positives = 223/336 (66%), Gaps = 18/336 (5%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRAD-AVKKVIF 107
           MILVD+AGG WP I+H+PW+G  LADFV+PFFLFIVGVA+AL  K+I +    A +K I 
Sbjct: 1   MILVDYAGGIWPAINHSPWDGVTLADFVLPFFLFIVGVALALTYKKIINEKQLASQKAIG 60

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT----- 162
           R+LKL+  G+ +QGG+ H     +YGVD+  IR CGVLQRIAL+Y++V+L EI+      
Sbjct: 61  RSLKLVIVGLFIQGGYFHGVHNTSYGVDLESIRWCGVLQRIALAYMVVALCEIWAPRGHY 120

Query: 163 KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV 222
             +    +S  RF  FR       +AA ++ +YL LLYG YVPDW+F      SA    V
Sbjct: 121 DSMNVYIKSTRRFGTFRA------VAAAIVAIYLVLLYGVYVPDWEFV-----SAADSTV 169

Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
           F V CGVR  + P CN VGY+DR +LG++H+Y    +RR+ AC+  SP  GPL   AP W
Sbjct: 170 FQVKCGVRGDVGPSCNVVGYLDRTLLGLSHLYQKAVYRRAPACSVLSPDYGPLPAGAPVW 229

Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFT 342
           C APF+PEGLLSS+S+I+S  +G+HFGHV++H K H ARLK WV M   LL+ G  LH  
Sbjct: 230 CKAPFDPEGLLSSMSAIVSCFLGLHFGHVLVHHKEHNARLKDWVLMSLTLLVTGALLHVL 289

Query: 343 NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
             +P NK LY++SY+  T GAA LVF+  Y LVD +
Sbjct: 290 -GMPWNKPLYSVSYMLFTGGAAGLVFAGYYFLVDVH 324


>gi|357149263|ref|XP_003575052.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Brachypodium distachyon]
          Length = 432

 Score =  322 bits (824), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 168/344 (48%), Positives = 229/344 (66%), Gaps = 6/344 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMI+VD AGG  P ++H+PW+G  + DFVMPFFLFIVGV++ LA 
Sbjct: 41  QRLVSLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTIGDFVMPFFLFIVGVSLTLAY 100

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+P+R +A KK + R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA++Y
Sbjct: 101 KRVPERLEATKKAVLRALKLFCLGLVLQGGFFHGVRSLTFGVDITEIRLMGILQRIAIAY 160

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+ ++ EI+ K   + D+ +    + R Y +   +   + V+Y  LLYG YVPDW++ I 
Sbjct: 161 LIAAICEIWLKGNDEVDRGL---DLLRRYRYQLFVGLLLSVMYTVLLYGIYVPDWEYQIT 217

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
              S +  K   V CGVR    P CNAVG +DR +LGI+H+Y  P + R+K C+ D P  
Sbjct: 218 GPGSTE--KSLLVKCGVRGDTGPGCNAVGMVDRTMLGIDHLYRRPVYARTKECSIDYPEN 275

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           GPL  DAPSWC APF+PEGLLSSV +I++ ++G+ FGHVIIH + H  R+  W+   F++
Sbjct: 276 GPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLMGLQFGHVIIHFEKHKERIINWLIPSFSM 335

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L     + F   + +NK LYT+SY   T+GAA   F+ IY LVD
Sbjct: 336 LALAFLMDFI-GMRMNKPLYTISYTFATAGAAGFFFAGIYTLVD 378


>gi|356534906|ref|XP_003535992.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 489

 Score =  317 bits (813), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 178/368 (48%), Positives = 240/368 (65%), Gaps = 10/368 (2%)

Query: 13  PLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNL 72
           PL  S P  +D    S L  QRL+SLD+FRGL VALMILVD+ G  +P ++H+PW G  L
Sbjct: 36  PLPQSNP--TDTSSLS-LPNQRLSSLDVFRGLTVALMILVDNVGRAFPSLNHSPWFGVTL 92

Query: 73  ADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
           ADFVMPFFLF+VGV+I L  K++  + +A KKVI RTLKL   G+LLQGG+ H   +LTY
Sbjct: 93  ADFVMPFFLFVVGVSIGLVFKKVSSKPNATKKVISRTLKLFLLGLLLQGGYFHGHGKLTY 152

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
           GVD+  IR  GVLQRI++ Y   S+ EI+  +      S   F   R Y   W+ +  + 
Sbjct: 153 GVDLSKIRWLGVLQRISIGYFFASISEIWLVNHNILVDSPAGF--VRKYSIQWMFSILLC 210

Query: 193 VVYLALLYGTYVPDWQFT----IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
            VYL LLYG YVP+W+F     + + DS+    + NV C VR  L PPCN VG+IDR +L
Sbjct: 211 SVYLCLLYGLYVPNWKFKHSNLLSSSDSSHLSIIQNVHCEVRGSLEPPCNVVGFIDRLIL 270

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G +HMY  P + R+K C+ +SP  GPL  D+P WC APF+PEG+LSS+ + ++  +G+ +
Sbjct: 271 GEDHMYQRPVYIRTKECSVNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFMGLQY 330

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           GH+I+H +GH  R+  W    F+LL+ G  L     +PL+K LYTLSY C+T+GA+ LV 
Sbjct: 331 GHIIVHLQGHKQRVLLWSVFSFSLLLIGYILEIL-GMPLSKALYTLSYTCITAGASGLVL 389

Query: 369 SAIYALVD 376
           +AIY +VD
Sbjct: 390 TAIYYIVD 397


>gi|224103167|ref|XP_002312951.1| predicted protein [Populus trichocarpa]
 gi|222849359|gb|EEE86906.1| predicted protein [Populus trichocarpa]
          Length = 419

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 161/370 (43%), Positives = 238/370 (64%), Gaps = 17/370 (4%)

Query: 11  HHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGC 70
           H PL+    D+ +Q   S  K  R ASLD+FRGL V LM+LVD+ G   P I+H+PWNG 
Sbjct: 6   HKPLL----DIEEQLHTSK-KPPRAASLDVFRGLCVFLMMLVDYGGAIIPIIAHSPWNGL 60

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           +LAD VMPFFLFI GV++AL  K++P+R +A  K + + +KL   G+++QGG+ H  + L
Sbjct: 61  HLADSVMPFFLFIAGVSLALVYKKVPNRIEATWKAVLKAIKLFLLGVVIQGGYFHGINSL 120

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
           TYGVD++ IR  G+LQ+I++ Y++ +L EI+      +       S  + Y WHW +A  
Sbjct: 121 TYGVDMKRIRWLGILQKISVGYIVAALCEIWLSCRTRRG-----VSFLKSYYWHWCVAFS 175

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSA----DYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
           +  +YL LLYG YVPDWQF + N  S+    ++  V+ V C +R  L P CN+ G IDR 
Sbjct: 176 LSAIYLGLLYGLYVPDWQFEMSNATSSVFPTNHSNVYMVKCSLRGDLGPACNSAGMIDRY 235

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           +LGI+H+Y  P +R  K C   +  +G +  ++ SWCHAPF+PEG+LSS+++ ++ IIG+
Sbjct: 236 ILGIDHLYKKPVYRNLKECNMST--DGQVPDNSASWCHAPFDPEGVLSSLTAAVTCIIGL 293

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            +GH++ H + H  R++ W    F+LL+ GL L      P+NK LYT SY+ +TS +A +
Sbjct: 294 QYGHLLAHLQDHKGRMENWTLFSFSLLVVGLLLVVIGD-PVNKSLYTFSYMLITSASAGI 352

Query: 367 VFSAIYALVD 376
            +SA+Y LVD
Sbjct: 353 TYSALYLLVD 362


>gi|212724122|ref|NP_001131867.1| uncharacterized protein LOC100193245 [Zea mays]
 gi|194692766|gb|ACF80467.1| unknown [Zea mays]
 gi|413948803|gb|AFW81452.1| hypothetical protein ZEAMMB73_255914 [Zea mays]
 gi|413948804|gb|AFW81453.1| hypothetical protein ZEAMMB73_255914 [Zea mays]
          Length = 492

 Score =  317 bits (812), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 160/372 (43%), Positives = 230/372 (61%), Gaps = 14/372 (3%)

Query: 10  HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
            H PL     D +  Q +   K +R+ASLD+FRG  VA+MILVD AGG WP I+HAPW G
Sbjct: 38  QHPPL-----DAAATQLEEQRKPERVASLDVFRGFTVAMMILVDDAGGAWPGINHAPWFG 92

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
             +ADFVMP FLFI+GV+ AL  K++ ++  A KK   R  KL   G++LQGG+ H   +
Sbjct: 93  VTVADFVMPAFLFIIGVSAALVFKKMANKTAATKKAAIRASKLFILGVILQGGYIHGRHK 152

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
           LTYGVD+  IR  GVLQRIA+ Y + ++ EI+  +    D  V      + Y   W MA 
Sbjct: 153 LTYGVDLDHIRWLGVLQRIAIGYFVAAMSEIWLVNNNLVDSPV---PFVKKYFIEWFMAI 209

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-----ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
            + V+Y+AL++G YV +W+F I   +S     ++  +   + CGVR  L PPCNAVG +D
Sbjct: 210 AITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNAVGLVD 269

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y +P ++R+K C+ +SP  GPL  +AP WC APF+PEGLLS++ ++++  +
Sbjct: 270 RVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAVVTCFV 329

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G+ FGHV+IH K H  R+  W+     L I    L     +P +K LYT++Y+ +T G +
Sbjct: 330 GLFFGHVLIHCKNHSQRMLIWLLASVVLTI-SAYLVLLLGMPFSKPLYTVNYMLLTGGVS 388

Query: 365 ALVFSAIYALVD 376
             +   +Y +VD
Sbjct: 389 GFLLLLLYYIVD 400


>gi|224072443|ref|XP_002303734.1| predicted protein [Populus trichocarpa]
 gi|222841166|gb|EEE78713.1| predicted protein [Populus trichocarpa]
          Length = 381

 Score =  317 bits (812), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 167/329 (50%), Positives = 223/329 (67%), Gaps = 4/329 (1%)

Query: 48  LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
           LMILVD AGG  P I+H+PWNG  LAD VMPFFLF+VGV++ L  K++P +A A +K I 
Sbjct: 3   LMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFMVGVSLGLTYKKLPSKAVATRKAIL 62

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R LKLL  G+ LQGGF H  ++LT+GVD+  IR  G+LQRIA+ YL+ ++ EI+ K   D
Sbjct: 63  RALKLLVIGLFLQGGFLHGLNDLTFGVDMVQIRWMGILQRIAIGYLIGAMCEIWLKG--D 120

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
              + G  S+ R Y   W     ++ +YL+LLYG YVPDW++ I    S+   K+F V C
Sbjct: 121 NHVASG-LSMLRKYQLQWGAVVVLVSLYLSLLYGLYVPDWEYEIPVAASSSSPKIFRVKC 179

Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
           GVR      CNAVG IDR VLGI H+Y  P + R+KAC+ +SP  GPL  DAPSWC APF
Sbjct: 180 GVRGTTGSACNAVGMIDRTVLGIQHLYRKPIYARTKACSINSPDYGPLPPDAPSWCQAPF 239

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLSSV +I++ ++G+H+GH+I+H K H  R+  W+      ++ GL L  +  + +
Sbjct: 240 DPEGLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRILHWMVPSTCFVVLGLVLDLS-GMHV 298

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           NK LYT SY+CVT+GAA +VF+ IY LVD
Sbjct: 299 NKALYTFSYMCVTAGAAGIVFTGIYMLVD 327


>gi|32487909|emb|CAE05368.1| OJ000315_02.13 [Oryza sativa Japonica Group]
          Length = 452

 Score =  317 bits (811), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 174/392 (44%), Positives = 242/392 (61%), Gaps = 37/392 (9%)

Query: 15  IISEPDVSDQQEKSHL------KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWN 68
           +++ PD  +Q++   +      + Q L  + +F    + LMILVD AG   P I+H+PW+
Sbjct: 1   MVAWPDNRNQRQGIQVSVVDVAEGQWLTCVHLFMP-EMPLMILVDDAGAFLPAINHSPWD 59

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
           G  LADFVMPFFLFIVGVA+ALA KR+P++ +A +K I R LKL   G++LQGGF H   
Sbjct: 60  GVTLADFVMPFFLFIVGVALALAYKRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVR 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            LT+G+D+  IRL G+LQRIA++Y++ +L EI+ K   D D     F + +   +   + 
Sbjct: 120 SLTFGIDMEKIRLMGILQRIAIAYIVTALCEIWLKGDDDVDSG---FDLLKRNRYQLFIG 176

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
             V++ Y+  LYGTYVPDW++ I    S +  K F V C VR    P CNAVG IDRK+L
Sbjct: 177 LIVMITYMGFLYGTYVPDWEYRISVPGSTE--KSFFVKCSVRGDTGPGCNAVGMIDRKIL 234

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL--------------- 293
           GI H+Y  P + RSK C+ +SP  GPLR DAPSWC APF+PEGLL               
Sbjct: 235 GIQHLYCRPVYARSKQCSINSPQNGPLRPDAPSWCQAPFDPEGLLRLQQYNISFANFAKF 294

Query: 294 ---------SSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA 344
                    SSV +I++ +IG+ +GHVI+H + H  R+ +W+   F++LI   +L F   
Sbjct: 295 SLFFLDSRISSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSMLILAFSLDFF-G 353

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           + +NK LYT+SY   T+GAA L+F+ IYALVD
Sbjct: 354 MHMNKPLYTVSYALATAGAAGLLFAGIYALVD 385


>gi|357467537|ref|XP_003604053.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
 gi|355493101|gb|AES74304.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
          Length = 421

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 161/356 (45%), Positives = 222/356 (62%), Gaps = 14/356 (3%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
            S  +  R+AS+D+FRGL+V LMI VD+ G  +P ISHAPWNG +LADFVMPFFLF+VG+
Sbjct: 12  NSETQFPRVASVDVFRGLSVFLMIFVDYGGSIFPIISHAPWNGLHLADFVMPFFLFLVGI 71

Query: 87  AIALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           ++AL  K    R    +  K + R+ +L   GILLQGG+ H     TYGVDV+ IR  GV
Sbjct: 72  SLALVYKNKRSRPTQSSTWKPLLRSFQLFILGILLQGGYFHGIHSFTYGVDVQTIRFFGV 131

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRI++ Y++ +L +I    +  K       S F+ Y  HW +AA +L ++  LLYG +V
Sbjct: 132 LQRISIGYIVAALCQICLPTLPSKHT-----SFFKTYYSHWFVAAILLAIHSGLLYGLHV 186

Query: 205 PDWQFTIINKDSA----DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
           PDWQF      S+      G V+ V C VR  L P CN+ G IDR +LG++H+Y  P +R
Sbjct: 187 PDWQFDASLSTSSLPPIQAGNVYTVNCSVRGDLGPACNSAGMIDRYILGLDHLYKKPVFR 246

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
             K C   S   G +   +PSWCHAPF+PEG+LSS+++ +S IIG+ +GH++ + + H  
Sbjct: 247 NLKECNMSS--TGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHILANLEDHKG 304

Query: 321 RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           RL QW+    + L  G  L     IPLNK LYT+SY+ ++S A+ L F A+Y LVD
Sbjct: 305 RLNQWLGFSVSFLALGWFLALI-GIPLNKSLYTVSYMLLSSAASGLTFMALYILVD 359


>gi|195652797|gb|ACG45866.1| hypothetical protein [Zea mays]
          Length = 492

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 159/372 (42%), Positives = 229/372 (61%), Gaps = 14/372 (3%)

Query: 10  HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
            H PL     D +  Q +   K +R+ASLD+FRG  VA+ ILVD AGG WP I+HAPW G
Sbjct: 38  QHPPL-----DAAATQLEEQRKPERVASLDVFRGFTVAMXILVDDAGGAWPGINHAPWFG 92

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
             +ADFVMP FLFI+GV+ AL  K++ ++  A KK   R  KL   G++LQGG+ H   +
Sbjct: 93  VTVADFVMPAFLFIIGVSAALVFKKMANKTAATKKAAIRASKLFILGVILQGGYIHGRHK 152

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
           LTYGVD+  IR  GVLQRIA+ Y + ++ EI+  +    D  V      + Y   W MA 
Sbjct: 153 LTYGVDLDHIRWLGVLQRIAIGYFVAAMSEIWLVNNNLVDSPV---PFVKKYFIEWFMAI 209

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-----ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
            + V+Y+AL++G YV +W+F I   +S     ++  +   + CGVR  L PPCNAVG +D
Sbjct: 210 AITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNAVGLVD 269

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y +P ++R+K C+ +SP  GPL  +AP WC APF+PEGLLS++ ++++  +
Sbjct: 270 RVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAVVTCFV 329

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G+ FGHV+IH K H  R+  W+     L I    L     +P +K LYT++Y+ +T G +
Sbjct: 330 GLFFGHVLIHCKNHSQRMLIWLLASVVLTI-SAYLVLLLGMPFSKPLYTVNYMLLTGGVS 388

Query: 365 ALVFSAIYALVD 376
             +   +Y +VD
Sbjct: 389 GFLLLLLYYIVD 400


>gi|334188248|ref|NP_001190487.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008209|gb|AED95592.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 435

 Score =  314 bits (804), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 179/383 (46%), Positives = 237/383 (61%), Gaps = 44/383 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 44  ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220

Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+ + 
Sbjct: 221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 280

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK------------- 316
           P  GPL  DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K             
Sbjct: 281 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKRNGSKGQVYNEPS 340

Query: 317 ------------------------GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
                                    H  RL QW+   F LL+ GL L+    + LNK LY
Sbjct: 341 ISIRPFFFILSETYLLLYVINFLQDHKKRLNQWILRSFCLLMLGLALNLF-GMHLNKPLY 399

Query: 353 TLSYVCVTSGAAALVFSAIYALV 375
           TLSY+CVTSGA+  + SAIY +V
Sbjct: 400 TLSYMCVTSGASGFLLSAIYLMV 422


>gi|357442361|ref|XP_003591458.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
 gi|355480506|gb|AES61709.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
          Length = 476

 Score =  313 bits (803), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 174/376 (46%), Positives = 237/376 (63%), Gaps = 15/376 (3%)

Query: 2   SEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
           S I   T H + L    P VS       +  QRL SLD+FRGL VALMILVD  G  +P 
Sbjct: 23  SSILTLTVHENEL----PPVS-------VPNQRLVSLDVFRGLTVALMILVDDVGRAFPS 71

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQG 121
           ++H+PW G  LADFVMPFFLF VGV+IAL  K++  + +A KK+I RT+KL   G+LLQG
Sbjct: 72  LNHSPWFGVTLADFVMPFFLFGVGVSIALVFKKVSSKQNATKKIISRTIKLFLLGLLLQG 131

Query: 122 GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
           G+ H    LTYG+D+  +R  GVLQRI++ Y L S+ EI+  +      S   F   R Y
Sbjct: 132 GYFHGRGNLTYGLDLTKLRWFGVLQRISIGYFLASMSEIWLVNGNILVDSPAAF--VRKY 189

Query: 182 CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKD-SADYGKVFNVTCGVRAKLNPPCNAV 240
              W+ +  +  VYL LLYG YVP+W+F   N         + NV C +R  L+PPCNAV
Sbjct: 190 SIQWIFSILLCSVYLCLLYGLYVPNWEFEHSNLLWPGRVSTIQNVHCDMRGSLDPPCNAV 249

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
           G+IDR +LG +HMY  P +RR+K C+ +SP  GPL  D+P WC APF+PEG+LSS+ + +
Sbjct: 250 GFIDRLILGEDHMYQRPVYRRTKECSVNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAI 309

Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           +  +G+ FGH+++  + H  R+  W    F+LL+ G  L     IPL+K LYTLS++ +T
Sbjct: 310 TCFVGLQFGHILVIFQAHKQRVLLWSVFSFSLLVVGYVLEIL-GIPLSKALYTLSFMFIT 368

Query: 361 SGAAALVFSAIYALVD 376
           +GA+ LV +AIY +VD
Sbjct: 369 AGASGLVLTAIYYIVD 384


>gi|449458622|ref|XP_004147046.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
 gi|449489633|ref|XP_004158370.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 418

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 163/366 (44%), Positives = 226/366 (61%), Gaps = 15/366 (4%)

Query: 17  SEPDVSDQQE--KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
           S P + +QQE   S  K  R+ SLD+FRGL+V +M+LVD+ G   P ISH+PW G +LAD
Sbjct: 4   SRPLLKNQQELPASSGKAPRVVSLDVFRGLSVFMMMLVDYGGSFLPIISHSPWIGLHLAD 63

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
           FVMP+FLFI GV++AL  K +  +  A +    R L L   G+ LQGG+ H    LTYGV
Sbjct: 64  FVMPWFLFIAGVSVALVYKEVESKVAAARNAACRGLYLFLLGVFLQGGYFHGITSLTYGV 123

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIF-TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           D+  IR  G+LQRI++ YL+ +L EI+ T+  +++ Q    FS      WHW +   +L 
Sbjct: 124 DLESIRWLGILQRISIGYLIAALCEIWLTRCTREEAQHTKSFS------WHWCIIFFLLS 177

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGK---VFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           +Y+ L YG YVPDW F I    S+       V+ V C +R  L P CN+ G IDR VLGI
Sbjct: 178 LYMGLSYGLYVPDWDFKISAPSSSLPLSGSYVYKVNCSLRGDLGPACNSAGMIDRYVLGI 237

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           +H+Y  P +R  K C   S   G   + +PSWC APFEPEGLLSS+++ ++ IIG+ +GH
Sbjct: 238 HHLYTKPVYRNLKECNISS--SGQFPETSPSWCRAPFEPEGLLSSLTATVACIIGLQYGH 295

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           ++   + H  R   W  + F +L FG+ L F   IP+NK LYT+SY+ +TS +A ++F A
Sbjct: 296 ILARAQDHKTRTNGWFLLSFKILAFGIFLVFI-GIPVNKSLYTVSYMLITSASAGIIFCA 354

Query: 371 IYALVD 376
           +Y LVD
Sbjct: 355 LYILVD 360


>gi|326510109|dbj|BAJ87271.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 304

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 146/296 (49%), Positives = 197/296 (66%), Gaps = 5/296 (1%)

Query: 46  VALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV 105
           + LMI+VD AG   P ++H+PW G  +ADFVMPFFLFIVGVA+ALA KR+PD+ DA +K 
Sbjct: 6   LQLMIIVDDAGSFLPAMNHSPWEGVTIADFVMPFFLFIVGVALALAYKRVPDKLDATRKA 65

Query: 106 IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDV 165
             R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA++YL+ +L +I+ K  
Sbjct: 66  TLRALKLFCVGLVLQGGFFHGVRSLTFGVDIAQIRLMGILQRIAIAYLVTALCQIWLKGD 125

Query: 166 QDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNV 225
            D D  +    + + Y +  L    + + Y+ALLYGTYVPDW++ I      +  K F V
Sbjct: 126 DDVDSGL---DLIKRYKYQLLAGLLITITYMALLYGTYVPDWEYRISGPGFTE--KTFTV 180

Query: 226 TCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHA 285
            CGVR    P CNAVG IDRK+LGI H+Y  P + RS+ C+ DSP  GPL  DAPSWC A
Sbjct: 181 RCGVRGDSGPGCNAVGMIDRKILGIQHLYGRPVYARSQQCSIDSPQNGPLPPDAPSWCQA 240

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
           PF+PEGLLSSV +I++ +IG+ +GH+I+H + H  R+  W+   F +L+    + F
Sbjct: 241 PFDPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHKERIMHWLVPSFGMLVLAFAMDF 296


>gi|449528551|ref|XP_004171267.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 380

 Score =  306 bits (784), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 160/330 (48%), Positives = 226/330 (68%), Gaps = 6/330 (1%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
           MI+VD+AGG  P I+H+PW+G  LAD VMPFFLFIVGV++ALA K+IP R  A +K + R
Sbjct: 1   MIVVDYAGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLR 60

Query: 109 TLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
           TLKLLF G+ LQGGF H  + LTYGVD++ IR  G+LQRIA++Y L +L EI+ K     
Sbjct: 61  TLKLLFLGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLK---GS 117

Query: 169 DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG--KVFNVT 226
           D      ++ R Y    + A  + ++YLAL YG YVPDW++ + +  ++D    K+F+V 
Sbjct: 118 DYVNSETALRRKYQLQLVAAVVLTMLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSVK 177

Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
           CG R    P CNAVG IDRK+ GI H+Y  P + R++ C+ ++P  GPL  DAPSWC AP
Sbjct: 178 CGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAP 237

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLLS+V ++++ ++G+H+GH+I+H K H  R+  W+     L++  + L F   + 
Sbjct: 238 FDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFL-GMH 296

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           +NK LYT+SY+ VT+GAA L+F+ IY +VD
Sbjct: 297 INKVLYTVSYMSVTAGAAGLLFTGIYLMVD 326


>gi|115462187|ref|NP_001054693.1| Os05g0155700 [Oryza sativa Japonica Group]
 gi|54291854|gb|AAV32222.1| unknown protein [Oryza sativa Japonica Group]
 gi|113578244|dbj|BAF16607.1| Os05g0155700 [Oryza sativa Japonica Group]
 gi|215694847|dbj|BAG90038.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218196128|gb|EEC78555.1| hypothetical protein OsI_18526 [Oryza sativa Indica Group]
 gi|222630256|gb|EEE62388.1| hypothetical protein OsJ_17178 [Oryza sativa Japonica Group]
          Length = 491

 Score =  304 bits (779), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 162/398 (40%), Positives = 239/398 (60%), Gaps = 24/398 (6%)

Query: 6   AETTHHHPLIISEPDVSD------------QQEKSHLKTQRLASLDIFRGLAVALMILVD 53
           A+  H  PL+ S  D  +                +  K +R+ASLD+FRGL VA+MILVD
Sbjct: 14  ADAGHRRPLLASADDDDEIRPYPASSPSPQHPAGAERKPRRVASLDVFRGLTVAMMILVD 73

Query: 54  HAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLL 113
            AGG WP ++H+PW G  +ADFVMP FLFI+GV+ AL  K+ P++  A KK   R +KL 
Sbjct: 74  DAGGAWPGMNHSPWLGVTVADFVMPAFLFIIGVSAALVFKKTPNKTVATKKAAIRAIKLF 133

Query: 114 FWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
             G++LQGG+ H    LTYG+D+  IR  GVLQRIA+ Y L ++ EI+  +    D ++ 
Sbjct: 134 ILGVILQGGYIHGRHNLTYGIDLDHIRWLGVLQRIAIGYFLAAISEIWLVNNISVDSAI- 192

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-------ADYGKVFNVT 226
             S  + Y   W++A  +  +Y+ LL G YV +W+F +   +S        +   +  + 
Sbjct: 193 --SFVKKYFMEWIVAVMISALYVGLLLGLYVSNWEFKVQTSNSILTIPTPGNEIGMKMIQ 250

Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
           CGVR  L PPCNAVG++DR +LG NH+Y +P ++R+K C+ +SP  GPL  +AP WC AP
Sbjct: 251 CGVRGSLGPPCNAVGFVDRVLLGENHLYKNPVYKRTKECSVNSPDYGPLPPNAPDWCLAP 310

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLLS++ + ++  +G+HFGHV++H K H  R+  W+     L + G  L     +P
Sbjct: 311 FDPEGLLSTLMAAVTCFVGLHFGHVLVHCKDHSPRMLLWLLASTVLTVSGFLLQLL-GMP 369

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDC-NIFGPW 383
            +K LYT+SY+ +T G +  +   +Y +VD  NI  P+
Sbjct: 370 FSKPLYTVSYMLLTGGVSGFLLLLLYYIVDVINIKKPF 407


>gi|255581844|ref|XP_002531722.1| conserved hypothetical protein [Ricinus communis]
 gi|223528625|gb|EEF30642.1| conserved hypothetical protein [Ricinus communis]
          Length = 397

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 154/332 (46%), Positives = 214/332 (64%), Gaps = 11/332 (3%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
           M+LVD+ G  +P I+H+PWNG +LADFVMPFFLFI GV++AL  K++  R DA  K + R
Sbjct: 1   MMLVDYGGSIFPIIAHSPWNGLHLADFVMPFFLFIAGVSLALVYKKVTKRIDATWKAMLR 60

Query: 109 TLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
            +KL F G+ LQGG+ H  + LTYGVD+  IR  G+LQRI++ Y++ +L EI+   +  +
Sbjct: 61  AVKLFFLGVFLQGGYFHGINSLTYGVDIERIRWFGILQRISIGYIVAALCEIW---LSRR 117

Query: 169 DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA----DYGKVFN 224
            QS      F+ Y WHW++A  +  VYL LLYG YVPDWQF + N  S+    +   V+ 
Sbjct: 118 TQSQREIGFFKNYYWHWVVAFSLSAVYLGLLYGLYVPDWQFEMSNAASSALPINGSNVYM 177

Query: 225 VTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH 284
           V C VR  L P CN+ G IDR VLG +H+Y  P  R  K C   +   G + + +PSWCH
Sbjct: 178 VKCSVRGDLGPACNSAGMIDRYVLGFDHLYTKPVHRNLKECNMTN---GQVSESSPSWCH 234

Query: 285 APFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA 344
           APF+PEGLLSS+++ ++ IIG+  GHV+ H + H  R++ W     +LL+ G  L F   
Sbjct: 235 APFDPEGLLSSLTAAITCIIGLQCGHVLAHIQEHKGRIESWSLFSASLLLLGSVLAFI-G 293

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           IP+NK LYT+SY+ +TS  + + F A+Y LVD
Sbjct: 294 IPVNKSLYTISYMLITSALSGITFCALYLLVD 325


>gi|395146531|gb|AFN53685.1| putative aquaporin PIP2-8 [Linum usitatissimum]
          Length = 694

 Score =  293 bits (750), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 157/379 (41%), Positives = 224/379 (59%), Gaps = 43/379 (11%)

Query: 3   EIKAETTHHHPLIISEPDVSDQQEKSHL-----KTQRLASLDIFRGLAVALMILVDHAGG 57
           E + + +   PL    PD S +++   L     K +RLASLD FRGL + LM+LVD+ G 
Sbjct: 13  EDEMQKSFRPPL--PPPDFSGREDGQLLMLYRKKNKRLASLDAFRGLCIFLMMLVDYGGH 70

Query: 58  DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGI 117
            +P I+H+ WNG +LADFVMPFFLFIVGV+IAL  K+ P+R +A +K + +++KL   GI
Sbjct: 71  VFPTIAHSAWNGIHLADFVMPFFLFIVGVSIALVYKKAPNRVEATRKALLKSVKLFLVGI 130

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSI 177
           LLQ                         QRI++ Y++ ++ EI+    +  D       I
Sbjct: 131 LLQE------------------------QRISIGYIVGAICEIWLSIRRKGD-----VGI 161

Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC 237
            + Y WHW+ A  ++ VY  L YG YVPDWQF++      D   VF V C V+  + P C
Sbjct: 162 IKSYYWHWIAALAIVAVYARLSYGLYVPDWQFSL----PGDQHHVFTVKCSVKGDVGPAC 217

Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
           N+ G IDR VLG++H+Y  P ++  K C   S  + P  +DAPSWCHAPF+PEGLLSS++
Sbjct: 218 NSAGMIDRYVLGLSHLYAKPVYKNLKVCNMSSNKQVP--EDAPSWCHAPFDPEGLLSSLT 275

Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYV 357
           + ++ IIG+ FGHV+ H + H  RL+ W       L+ GL L      P+NK LY++SY+
Sbjct: 276 AAVTCIIGLQFGHVLAHIQDHKGRLENWSGFSVFFLVLGLFL-VRLGFPINKPLYSISYM 334

Query: 358 CVTSGAAALVFSAIYALVD 376
            +TS +A + F+A+Y LVD
Sbjct: 335 LITSASAGITFAALYLLVD 353


>gi|186530235|ref|NP_001119392.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008204|gb|AED95587.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 359

 Score =  289 bits (740), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 148/291 (50%), Positives = 201/291 (69%), Gaps = 6/291 (2%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 44  ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220

Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+ + 
Sbjct: 221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 280

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
           P  GPL  DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K +++
Sbjct: 281 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKVNIS 331


>gi|62701854|gb|AAX92927.1| hypothetical protein LOC_Os11g14080 [Oryza sativa Japonica Group]
 gi|77549602|gb|ABA92399.1| D8Ertd354e protein, putative [Oryza sativa Japonica Group]
 gi|125576749|gb|EAZ17971.1| hypothetical protein OsJ_33516 [Oryza sativa Japonica Group]
          Length = 447

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 138/254 (54%), Positives = 173/254 (68%), Gaps = 9/254 (3%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
           +E+   K++R+A+LD FRGL + LMILVD AGG +  + H+PWNGC LADFVMPFFLFIV
Sbjct: 51  EEEPRKKSKRVAALDAFRGLTIVLMILVDDAGGAYERMDHSPWNGCTLADFVMPFFLFIV 110

Query: 85  GVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           GVAIA ALKR+P    AVKK+  RTLK+LFWG+LLQGG+SHAPD+L+YGVD++ IR CG+
Sbjct: 111 GVAIAFALKRVPKLGAAVKKITIRTLKMLFWGLLLQGGYSHAPDDLSYGVDMKKIRWCGI 170

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIAL Y +V+L+E FT  V+      G ++IF  Y W WL     L +Y+   +  YV
Sbjct: 171 LQRIALVYFVVALIEAFTTKVRPTTVRSGPYAIFHAYRWQWLGGFVALFIYMVTTFSLYV 230

Query: 205 PDWQFTIINKDSADYGKVF---------NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
           PDW +   N    + GK F         +V CGVR  L+P CNAVGY+DR V GINH+Y 
Sbjct: 231 PDWSYVYHNDGDVNDGKQFTVLLAVFPDHVQCGVRGHLDPACNAVGYVDRVVWGINHLYT 290

Query: 256 HPAWRRSKACTQDS 269
            P W RSK    DS
Sbjct: 291 QPVWIRSKFNIIDS 304


>gi|238481501|ref|NP_001154765.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008206|gb|AED95589.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 435

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 148/287 (51%), Positives = 198/287 (68%), Gaps = 6/287 (2%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 38  ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 97

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct: 98  KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 157

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++ I+
Sbjct: 158 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 214

Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+ + 
Sbjct: 215 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 274

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
           P  GPL  DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K
Sbjct: 275 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFK 321


>gi|413918232|gb|AFW58164.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
          Length = 423

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 137/283 (48%), Positives = 188/283 (66%), Gaps = 6/283 (2%)

Query: 94  RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
           R+PD+ DA +K + R LKL   G++LQGGF H    L++GVD++ IRL GVLQRIA++YL
Sbjct: 93  RVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAYL 152

Query: 154 LVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
           L +L EI+ +  +D D     + + + Y +   + A V + Y++LLYGTYVPDW++    
Sbjct: 153 LTALCEIWIRGDEDVDYG---YDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTSA 209

Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
             S +  K   V CGVR   +P CNAVG IDRK+LGI H+Y  P + RSK C+ DSP  G
Sbjct: 210 PGSTE--KHLFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQNG 267

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
           PL  DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H  R+  W+   F++L
Sbjct: 268 PLPSDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSML 327

Query: 334 IFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           +    + F   + +NK LYTLSY   T+GAA L+FS IY LVD
Sbjct: 328 VLAFAMDFF-GLHMNKPLYTLSYTLGTAGAAGLLFSGIYTLVD 369


>gi|326432441|gb|EGD78011.1| hypothetical protein PTSG_09649 [Salpingoeca sp. ATCC 50818]
          Length = 1087

 Score =  268 bits (684), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 150/389 (38%), Positives = 211/389 (54%), Gaps = 51/389 (13%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
           Q  +  L  +RL+SLD+FRG  VALM+ VD  G  +P I H+PWNG  LADFVMPFF FI
Sbjct: 627 QPAQRSLPKERLSSLDVFRGFTVALMVFVDETGAAFPPIDHSPWNGVRLADFVMPFFDFI 686

Query: 84  VGVAIALALKRI--------PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           VGV++AL+ K+         P    A++K   R LKL   G+L QGG     D + Y  D
Sbjct: 687 VGVSLALSFKKFDLEDATTTPRVWPALRKATIRFLKLFILGMLTQGGI----DIMNY--D 740

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFT------KDVQDKDQSVG----RFSIFRLYCWHW 185
           +  IR+ G+LQR+A+ Y  V+L+EIF       ++  + D   G       +   Y WHW
Sbjct: 741 LAHIRIMGILQRVAVCYYAVALMEIFLPRNKKYRNYNETDTVTGWAVDVLHMLWRYKWHW 800

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
             AAC+   +  ++YG  VPD                F   CG R  L P CNA  YIDR
Sbjct: 801 FTAACLFATHTGIMYGVNVPD---------------AFGEECG-RGVLTPACNAATYIDR 844

Query: 246 KVLGINHMY----------HHPAWRRSKACTQDSPFEGPLRKDAPSWC-HAPFEPEGLLS 294
            VL + HMY          +   ++R   C+  SP +    +DAP+WC H PF+PEGL+S
Sbjct: 845 NVLTVEHMYFPANGGDKSGNDVTFQRLPECSTCSPGKCVPPEDAPAWCLHGPFDPEGLVS 904

Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTL 354
           S+++I++T+IG+H+GHV+   +   AR+  W   G   L+ G  LHF+ A  +N  LY++
Sbjct: 905 SLNAIIATVIGIHYGHVLRRVQSPKARIVHWTAFGVVQLVIGFALHFSGAFVMNTDLYSI 964

Query: 355 SYVCVTSGAAALVFSAIYALVDCNIFGPW 383
           SY  VT+G   ++ +  Y +VD    G W
Sbjct: 965 SYTLVTAGTGGVLLALFYVIVDRLHVGEW 993


>gi|413920627|gb|AFW60559.1| hypothetical protein ZEAMMB73_831897 [Zea mays]
          Length = 343

 Score =  262 bits (669), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 131/231 (56%), Positives = 157/231 (67%)

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL Y  V+L+E  T  V+      G ++IF  Y W WL      VVY+   +  YVP
Sbjct: 19  QRIALVYFFVALIEALTVKVRPTTVRSGPYAIFDAYRWQWLGGLVAFVVYMVTTFSLYVP 78

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           DW F   N+   + GK F V CGVRA L   CNAVGY+DR+V GINH+Y  P W RSK C
Sbjct: 79  DWSFVYHNEGDVNDGKQFTVKCGVRASLEQACNAVGYVDRQVWGINHLYTQPVWIRSKDC 138

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
           T  SP  GPLR DAP+WC APFEPEGLLSS+SS+LS  IG+H+GHV+IH K H  RLK W
Sbjct: 139 TSSSPNMGPLRSDAPAWCLAPFEPEGLLSSISSVLSGTIGIHYGHVLIHFKTHKERLKHW 198

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           +  GF+LL+ G+ LHFTNAIP+NKQLY+ SYVC T GAA +V SA Y L+D
Sbjct: 199 LLTGFSLLVLGIILHFTNAIPINKQLYSFSYVCFTGGAAGIVLSAFYILID 249


>gi|413937083|gb|AFW71634.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
          Length = 317

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 118/233 (50%), Positives = 158/233 (67%), Gaps = 3/233 (1%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + QRLASLD+FRG+ V LMI+VD AGG  P ++H+PW+G  +ADF+MPFFLFIVGV++ L
Sbjct: 87  RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 146

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           A KR+PDR +A +K + R LKL   G++LQGGF H    LT+GVD+  IRL G+LQRIA+
Sbjct: 147 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 206

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
           +YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLYG YVPDW++ 
Sbjct: 207 AYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQ 263

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           I    S+   K F+V CGVR    P CNAVG +DR VLGI+H+Y  P + R+K
Sbjct: 264 IAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTK 316


>gi|238481505|ref|NP_001154767.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008208|gb|AED95591.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 340

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 135/290 (46%), Positives = 188/290 (64%), Gaps = 7/290 (2%)

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
           ++   +P +  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA
Sbjct: 1   MSFAVLPSQFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIA 60

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           ++YL+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++
Sbjct: 61  IAYLVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEY 117

Query: 210 TIINKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
            I+ +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+
Sbjct: 118 QILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCS 177

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
            + P  GPL  DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K H  RL QW+
Sbjct: 178 INYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWI 237

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
              F LL+          + LNK LYTLSY+CVTSGA+  + SAIY +VD
Sbjct: 238 LRSFCLLML-GLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSAIYLMVD 286


>gi|395146473|gb|AFN53630.1| putative aquaporin PIP2-8 [Linum usitatissimum]
          Length = 692

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 122/285 (42%), Positives = 178/285 (62%), Gaps = 14/285 (4%)

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K++P+R +A +K   +++KL   GILLQGGF H    LTYGVD+  IRL G+LQRI++ Y
Sbjct: 96  KKVPNRVEATRKAFLKSVKLFLVGILLQGGFFHGLHSLTYGVDIERIRLLGILQRISIGY 155

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           ++ ++ EI+   V+ K    G   I + Y  HW+ A  ++VVY  L YG YVPDWQF + 
Sbjct: 156 IVGAICEIWL-SVRRK----GDVGIIKSYYSHWVAALAIVVVYARLSYGLYVPDWQFAL- 209

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
                D   V+ V C V+  + P CN+ G +DR VLG++H+Y  P ++  K C   S  +
Sbjct: 210 ---PQDQHHVYTVKCSVKGDVGPACNSAGMMDRYVLGLSHLYAKPVYKNLKICNMSSNKQ 266

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA- 331
            P  +DAPSWCHAPF+PEGLLSS+++ ++ IIG+ FGHV+ H + H  RL+ W   GF+ 
Sbjct: 267 VP--EDAPSWCHAPFDPEGLLSSLTAAVTCIIGLQFGHVLAHVQDHKGRLENW--SGFSV 322

Query: 332 LLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             +           P+NK LY++SY+ +TS +A + F+A+Y LVD
Sbjct: 323 FFLLLGLFLVLLGFPINKPLYSISYMLITSASAGITFAALYLLVD 367


>gi|125533951|gb|EAY80499.1| hypothetical protein OsI_35679 [Oryza sativa Indica Group]
          Length = 444

 Score =  241 bits (615), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 125/257 (48%), Positives = 160/257 (62%), Gaps = 18/257 (7%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
           +E+   K++R+A+LD FRGL + LMILVD AGG +  + H+PWNGC LADFVMPFFLFIV
Sbjct: 51  EEEPRKKSKRVAALDAFRGLTIVLMILVDDAGGAYERMDHSPWNGCTLADFVMPFFLFIV 110

Query: 85  GVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           GVAIA ALKR+P    AVKK+  RTLK+LFWG+LLQGG+SHAPD+L+YGVD++ IR CG+
Sbjct: 111 GVAIAFALKRVPKLGAAVKKITIRTLKMLFWGLLLQGGYSHAPDDLSYGVDMKKIRWCGI 170

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKD---QSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
           LQ + + +          +   D+    +S    S  R      L     L +Y+   + 
Sbjct: 171 LQNLLVLFDNAEDSFGVLRGCSDRGIHHKSSAYDSAVR------LGGFVALFIYMVTTFS 224

Query: 202 TYVPDWQFTIINKDSADYGKVF---------NVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
            YVPDW +   N    + GK F         +V CGVR  L+P CNAVGY+DR V GINH
Sbjct: 225 LYVPDWSYIYHNDGDVNDGKQFTVLLAVFPDHVQCGVRGHLDPACNAVGYVDRVVWGINH 284

Query: 253 MYHHPAWRRSKACTQDS 269
           +Y  P W RSK    DS
Sbjct: 285 LYTQPVWIRSKFNIVDS 301


>gi|255642425|gb|ACU21476.1| unknown [Glycine max]
          Length = 326

 Score =  237 bits (604), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 124/293 (42%), Positives = 175/293 (59%), Gaps = 12/293 (4%)

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           +R P R  A  K   R L L   GILLQGG+ H    LT+GVD++ IR  G+LQRI++ Y
Sbjct: 41  QRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQRIRWLGILQRISIGY 100

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           ++ +L EI+    + K+         + Y W W +A  +L +Y  LLYG YVPDWQF + 
Sbjct: 101 IVAALCEIWLPAPRWKE-----LGFVKSYYWQWFVAVILLALYSGLLYGLYVPDWQFDVS 155

Query: 213 NKDSA----DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
              S+      G ++ V C VR  L P CN+ G IDR +LG++H+Y  P +R  K C   
Sbjct: 156 ASTSSLPPIGGGDIYMVNCSVRGDLGPACNSAGMIDRYILGLDHLYRKPVYRNLKGCNMS 215

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
           +  +G +   +PSWCHAPF+PEG+LSS+++ +S IIG+ +GHV+ H + H  RL  W+  
Sbjct: 216 A--KGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHVLAHLQDHKGRLYNWMCF 273

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
             + L  GL L     IPLNK LYT+SY+ +TS A+ L F A+Y LVD ++ G
Sbjct: 274 SLSFLALGLFLALI-GIPLNKSLYTVSYMLLTSAASGLTFIALYFLVDVHVTG 325


>gi|167522597|ref|XP_001745636.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775985|gb|EDQ89607.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1047

 Score =  234 bits (597), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 160/418 (38%), Positives = 219/418 (52%), Gaps = 69/418 (16%)

Query: 4   IKAETTHHHPLIISEPDVSDQQEKSHLK-----------TQRLASLDIFRGLAVALMILV 52
           ++   ++  PL+ +  D S+ Q KS++             +RL++LD++RGL +A+MILV
Sbjct: 566 VRPRDSNRTPLLPASTD-SNIQSKSNIDLATDPVAPKPPRERLSALDVYRGLTIAVMILV 624

Query: 53  DHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIP-------DRADAVKKV 105
           D  G  +P I HAPWNG +LAD V+P F FIVGV+IALA KR          R  A KK 
Sbjct: 625 DETGAAFPPIDHAPWNGLHLADTVVPSFDFIVGVSIALAFKRFDLEAGAQGQRWTAFKKA 684

Query: 106 IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDV 165
             R LK LF GI                 D+  IR+ G+LQR+A+ Y  V+L+EIF   +
Sbjct: 685 TDRFLK-LFGGITFM------------NYDLTNIRIFGILQRVAVCYFAVALMEIFLPRL 731

Query: 166 QD---------KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS 216
                       D       +F  Y WHW  AA +L V+ ++LYG  VPD          
Sbjct: 732 TGALPADNGTWADWMRRTQHLFWRYRWHWFSAALLLAVHTSILYGVDVPD---------- 781

Query: 217 ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH-----HPA-----WRRSKACT 266
                 F   CG R +L P CNA  YIDR +L + HMY       PA     ++R   C+
Sbjct: 782 -----AFGERCG-RGQLTPACNAATYIDRLILTVPHMYFPENGGDPAHADVTFKRLPECS 835

Query: 267 QDSPFEGPLRKDAPSWC-HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
             SP       DAP+WC H PF+PEGL+SS+++I++TIIGVH+GHV+   K  + R+ QW
Sbjct: 836 SCSPGLCVAPADAPAWCLHGPFDPEGLVSSLTAIVTTIIGVHYGHVLRQIKSPMERIFQW 895

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
            +     L+ GL LHF+  IPLN  LY++S+V VT G   L+    Y +VD      W
Sbjct: 896 SSFALLQLLLGLILHFS-GIPLNINLYSVSFVLVTGGMTGLLLVLCYLIVDYRPTARW 952


>gi|186530239|ref|NP_001119393.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008205|gb|AED95588.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 292

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 119/231 (51%), Positives = 157/231 (67%), Gaps = 3/231 (1%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 44  ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220

Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
            +D       F V CGVR    P CNAVG +DR  LGI H+Y  P + R+K
Sbjct: 221 KEDQGSTLTTFLVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 271


>gi|238481503|ref|NP_001154766.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008207|gb|AED95590.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 295

 Score =  221 bits (562), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 119/234 (50%), Positives = 157/234 (67%), Gaps = 6/234 (2%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 44  ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163

Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
           L+V+L EI+ K   +    +   S+ + Y +HW++A  +  +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220

Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
            +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K
Sbjct: 221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 274


>gi|413947252|gb|AFW79901.1| hypothetical protein ZEAMMB73_198786 [Zea mays]
          Length = 505

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 101/229 (44%), Positives = 139/229 (60%), Gaps = 5/229 (2%)

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RIA++YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLYG YVPD
Sbjct: 279 RIAIAYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGIYVPD 335

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
           W++ I    S+   K F V CGVR    P CNAVG +DR +LGI+H+Y  P + R+K C+
Sbjct: 336 WEYQIAGPGSSSTKKSFFVKCGVRGDTRPACNAVGMVDRTILGIDHLYRRPVYVRTKECS 395

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
            D    GPL  DAPSWC APF+PEGLLS V +I++ +IG+ F HVIIH + H  R+  W+
Sbjct: 396 IDYLENGPLPPDAPSWCQAPFDPEGLLSFVMAIVTCLIGLQFRHVIIHFEKHRGRIASWL 455

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
              F++L     + F   + +NK LYT+SY  + +GAA L+F  IY LV
Sbjct: 456 VPSFSMLALAFVMDFV-GMRMNKPLYTMSYT-LAAGAAGLLFPGIYVLV 502


>gi|255635187|gb|ACU17949.1| unknown [Glycine max]
          Length = 217

 Score =  187 bits (475), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 93/200 (46%), Positives = 127/200 (63%), Gaps = 5/200 (2%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
           +P + +  E +  +  R+ASLD+FRGL+V LMI VD+A   +P I+HAPWNG +LADFVM
Sbjct: 5   QPLLLNDSEPTQFQNTRIASLDVFRGLSVFLMIFVDYAASIFPIIAHAPWNGTHLADFVM 64

Query: 78  PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           PFFLFI G+++AL  KR P R  A  K   R L L   GILLQGG+ H    LT+GVD++
Sbjct: 65  PFFLFIAGISLALVYKRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQ 124

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
            IR  G+LQRI++ Y++ +L EI+    + K+         + Y W W +A  +L +Y  
Sbjct: 125 RIRWLGILQRISIGYIVAALCEIWLPAPRWKE-----LGFVKSYYWQWFVAVILLALYSG 179

Query: 198 LLYGTYVPDWQFTIINKDSA 217
           LLYG YVPDWQF +    S+
Sbjct: 180 LLYGLYVPDWQFDVSASTSS 199


>gi|242007028|ref|XP_002424344.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212507744|gb|EEB11606.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 497

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 185/367 (50%), Gaps = 54/367 (14%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
               S +K  R+ SLD FRGLA+ +MI V++ GGD+    H+PWNG  +ADFV P+F++I
Sbjct: 99  NDRYSRIKNSRIKSLDAFRGLAILIMIFVNYNGGDYSVFKHSPWNGITIADFVFPWFIWI 158

Query: 84  VGVAIALALKRIPDRADAVKKVIFRTLKLLFW----GILLQGGFSHAPDELTYGVDVRMI 139
           +G +  L++     RA + K++ FR LK  F+    GI+L  G   +            +
Sbjct: 159 MGASTVLSIDNNFRRAQSKKEIFFRILKRSFYLIALGIVLNSGHRDSKG---------FL 209

Query: 140 RLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLYCW-HWLMAACVLVVYLA 197
           R+CGVLQRI L+Y +++ +EIF  K + ++      FS   +  W  WL+   ++ +++ 
Sbjct: 210 RVCGVLQRIGLTYFIIASLEIFALKSLLNEHFGPWNFSRNIIKIWIQWLVPILLVAIHVI 269

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVF-NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
           + +  +VP                 F N T G          A GYIDR ++  NHMYH 
Sbjct: 270 ITFTLHVPGCPLGYTGPGGLSNHSAFRNCTGG----------AAGYIDRLIITDNHMYHR 319

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
            ++               L+   PS    PF+PEGLL +++S+    +GV    ++I+ +
Sbjct: 320 GSF---------------LKIFKPS---VPFDPEGLLGTLTSVFCAFLGVQSARILINHE 361

Query: 317 GHLARLKQWVTMGFALLIFGLTLHF-------TNAIPLNKQLYTLSYVCVTSGAAALVFS 369
              +++K W+   F  ++ GL   F       +  IP+NK L++LSYV  TS  A L+ +
Sbjct: 362 NSFSKIKSWI---FWAIVMGLISGFLCNWSQNSGIIPINKNLWSLSYVLATSSIAFLILT 418

Query: 370 AIYALVD 376
             Y L+D
Sbjct: 419 TFYTLID 425


>gi|356536971|ref|XP_003537005.1| PREDICTED: uncharacterized protein LOC100781855 [Glycine max]
          Length = 357

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 89/162 (54%), Positives = 109/162 (67%), Gaps = 2/162 (1%)

Query: 5   KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHA--GGDWPEI 62
           K E  H         D  +  + + +  +  +   I       LM+L D A  GG +P I
Sbjct: 20  KGELKHEIERTNGNGDSIEHDKDARITQEGESVQQIVEQEQPLLMVLEDDADAGGAYPRI 79

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGG 122
            H+PWNGC LADFVMPFFLF+VGVAIALALKRIP    AVK +I RTLKLLFWGILLQGG
Sbjct: 80  DHSPWNGCTLADFVMPFFLFVVGVAIALALKRIPKVKYAVKNIILRTLKLLFWGILLQGG 139

Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD 164
           +SHAPD+L+YGVD+R IR CG+LQRIAL Y  V+L+E +T +
Sbjct: 140 YSHAPDDLSYGVDMRFIRWCGILQRIALVYCAVALIETYTTN 181



 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 55/104 (52%), Positives = 71/104 (68%)

Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
           G L++ A  +C          + +S+ LS  IG+H+GHV+IH KGH  RLKQW+ MGF L
Sbjct: 160 GILQRIALVYCAVALIETYTTNCISASLSGTIGIHYGHVLIHFKGHSERLKQWLLMGFLL 219

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L  GL LHFT AIP+NKQLY+ SYVC T+GAA +VFS  Y L++
Sbjct: 220 LTLGLMLHFTEAIPINKQLYSFSYVCFTAGAAGIVFSVFYLLLN 263


>gi|384249073|gb|EIE22555.1| hypothetical protein COCSUDRAFT_42235 [Coccomyxa subellipsoidea
           C-169]
          Length = 395

 Score =  174 bits (441), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 114/343 (33%), Positives = 174/343 (50%), Gaps = 53/343 (15%)

Query: 46  VALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL--KRIPDRADAVK 103
           +ALM+ V+HAG + P ++HA W+G +LAD VMP FL +VGV++AL+L  +    R   ++
Sbjct: 1   MALMLFVNHAGHEVPWVAHAAWDGVHLADLVMPCFLLLVGVSVALSLGPRASGPRRPLLR 60

Query: 104 KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT- 162
           KV+ RT KL   G+L+QGG            D+  +R CGVLQRIAL + LVSLV ++  
Sbjct: 61  KVLARTGKLAGLGLLIQGGVGAGAFP---AWDLSRLRYCGVLQRIALCFALVSLVVLYLP 117

Query: 163 -------KDVQDK-DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINK 214
                  + + D+ D+S    + FR Y   W++   + V +          +W    +  
Sbjct: 118 QTPSPRLQSLLDRGDESASLMAPFRFYALWWILGTALFVAF----------NWMALFLRP 167

Query: 215 DSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGP 274
                       C  R  L   CN   Y+D ++LG +H+Y  P+ RR+     + P E  
Sbjct: 168 PG----------CLARPALTADCNVAAYVDARLLGRSHLYPWPSCRRA-----NPPCE-- 210

Query: 275 LRKDAPSWCHAPFEPEGLLSSVSSIL-STIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
                        +PEGL +++S  L ST +G+ FG V++  +GH ARL+ W      L 
Sbjct: 211 -----------YLDPEGLFATLSGALASTFLGLWFGAVLLTLRGHRARLRSWAYASVLLT 259

Query: 334 IFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             GL LH T A+P NK LY+ S V +T+G++      +Y   +
Sbjct: 260 ELGLALHVTGAVPFNKNLYSASSVLLTAGSSGAFLGLVYLFTE 302


>gi|391346547|ref|XP_003747534.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Metaseiulus occidentalis]
          Length = 564

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 115/385 (29%), Positives = 188/385 (48%), Gaps = 50/385 (12%)

Query: 6   AETTHHHPLIISEPDVSDQQEKSHLKTQ---RLASLDIFRGLAVALMILVDHAGGDWPEI 62
           AET+   PL  S      +  +  +      R+ SLD FRG  + LMI V++ GG     
Sbjct: 145 AETSSLGPLEASSSTAGSRPPEDGIGKAGKPRIKSLDAFRGFCLFLMIFVNYGGGGLWLF 204

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGIL 118
            H PW+G   AD + P+F++I+GV++A++L+ +  +   + ++ F    R++KL   G++
Sbjct: 205 EHIPWDGLTFADLLFPWFVWIMGVSMAISLRSMRRKCVPLSEIFFKILSRSVKLFLLGLI 264

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV--GRFS 176
           L        + +    D+  +R+ GVLQR A+SY +V+ + +F     D  ++    +  
Sbjct: 265 L--------NSMGKNNDISKLRIPGVLQRFAVSYFVVASMHMFFSRATDAAETAKWAKIR 316

Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNP 235
              LY   W+M   ++ +++ L +   VPD     +      + G  FN T G       
Sbjct: 317 DVALYWQEWVMMISLVAIHVLLTFLLDVPDCPKGYLGPGGLHENGTHFNCTGG------- 369

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
              A GYIDR VLG NHMY HP   +    +Q                  PF+PEG+L  
Sbjct: 370 ---AAGYIDRVVLGPNHMYGHPTTEKIYETSQ------------------PFDPEGVLGC 408

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTNA---IPLNKQL 351
           ++SI  T +G+  G +++       RL +W+  G  L L+ G+   F+     IP+NK L
Sbjct: 409 LTSIFLTFLGLQAGKILLTFNNPGRRLSRWICWGVLLGLLAGILCGFSKEDGWIPINKNL 468

Query: 352 YTLSYVCVTSGAAALVFSAIYALVD 376
           ++LSYV  T+G A L+ S  Y ++D
Sbjct: 469 WSLSYVLCTAGLAFLLLSVFYLIID 493


>gi|413953638|gb|AFW86287.1| hypothetical protein ZEAMMB73_717084 [Zea mays]
          Length = 357

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 103/290 (35%), Positives = 154/290 (53%), Gaps = 50/290 (17%)

Query: 97  DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVS 156
           ++  A KK   R  KL   G++LQGG+ H   +LTYGVD+  IR  GVLQRIA+ Y + +
Sbjct: 3   NKTAATKKAAIRASKLFILGVILQGGYIHGRHKLTYGVDLDHIRWLGVLQRIAIGYFVAA 62

Query: 157 LVEI-------------FTKDVQDKDQSVGRF-------------------SIFRLYCWH 184
           + EI             F K    +  ++G F                      + Y   
Sbjct: 63  MSEIWLVNNNLVDSPVPFVKKYFIEWIAIGYFVAAMSEIWLVNNNLVDSPVPFVKKYFIE 122

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDS-----ADYGKVFNVTCGVRAKLNPPCNA 239
           W MA  + V+Y+AL++G YV +W+F I   +S     ++  +   + CGVR  L PPCNA
Sbjct: 123 WFMAIAITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNA 182

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS----S 295
           VG +DR +LG NH+Y +P ++R+K C+ +SP  GPL  +AP WC APF+PEGLLS    +
Sbjct: 183 VGLVDRVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSKPLYT 242

Query: 296 VSSILST-------IIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT 338
           V+ +L T       ++ +++   +IH K     L QW+ M  AL+++ L 
Sbjct: 243 VNYMLLTGGVSGFLLLLLYYIVDVIHIKKPFV-LFQWMGMN-ALIVYVLA 290


>gi|413918234|gb|AFW58166.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
          Length = 202

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 80/137 (58%), Positives = 105/137 (76%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD+FRG+ V LMI+VD AG   P ++H+PW+G  +ADFVMPFFLFIVGVA+ALA 
Sbjct: 53  QRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 112

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           KR+PD+ DA +K + R LKL   G++LQGGF H    L++GVD++ IRL GVLQRIA++Y
Sbjct: 113 KRVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAY 172

Query: 153 LLVSLVEIFTKDVQDKD 169
           LL +L EI+ +  +D D
Sbjct: 173 LLTALCEIWIRGDEDVD 189


>gi|307178470|gb|EFN67159.1| Heparan-alpha-glucosaminide N-acetyltransferase [Camponotus
           floridanus]
          Length = 512

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 109/368 (29%), Positives = 175/368 (47%), Gaps = 41/368 (11%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
           +E + +    +   K +R+ ++D FRG+    MI V+   G +  + HA WNG  L D V
Sbjct: 105 TEEERTPNNNEKATKHRRVKAIDTFRGVCTLFMIFVNDGSGSYTTLGHATWNGMLLGDLV 164

Query: 77  MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
            P F++I+GV + +AL     R  +  ++ F   K  F   L+      A + L     +
Sbjct: 165 FPCFMWIMGVCVPIALSAQLKRGLSKLEISFSIFKRSFLLFLI----GIALNTLGTNAQL 220

Query: 137 RMIRLCGVLQRIALSYLLVSLVEI-FT----KDVQDKDQSVGRFSIFRLYCW--HWLMAA 189
             IR+ GVLQR  ++YL+VSL+ + FT    K  Q+  Q+     +  +     HW +  
Sbjct: 221 ENIRIFGVLQRFGITYLIVSLLYLCFTPQQPKVAQNLSQTWMTHKMQDILSLLPHWCIML 280

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
            +++V+ A+ +   +P      +      + GK FN T G          A GYIDR +L
Sbjct: 281 TLVMVHCAVTFCLPIPGCPTGYLGPGGRHEDGKYFNCTGG----------ATGYIDRILL 330

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
            ++H+Y  P        T DS +              PF+PEG+L  ++SI    +GVH 
Sbjct: 331 TLSHIYQWP--------TIDSIYGS-----------GPFDPEGILGCLTSIFQVFLGVHT 371

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G +++  KG   R+ +W+         G   HFTN IP+NK L++LS+V V++  A    
Sbjct: 372 GVILMMYKGWKERIIRWLVWAVFYGCLGCIFHFTNIIPINKNLWSLSFVLVSTCFALAFL 431

Query: 369 SAIYALVD 376
           S  Y L+D
Sbjct: 432 SGCYLLID 439


>gi|312381520|gb|EFR27253.1| hypothetical protein AND_06166 [Anopheles darlingi]
          Length = 782

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 113/363 (31%), Positives = 176/363 (48%), Gaps = 55/363 (15%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           +++  +RL SLD  RG+A+ LMI V+  GG +  I HA WNG ++AD V P+FLFI+GV 
Sbjct: 388 ANIARKRLQSLDTLRGIAIMLMIFVNSGGGHYWWIEHATWNGLHVADLVFPWFLFIMGVC 447

Query: 88  IALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           I ++L+    R   +   V  +  R++ L   G+ L             G ++  +R+ G
Sbjct: 448 IPISLRGQLARNVSKRQIVSSITTRSISLFLIGLCLNS---------MNGPNMANLRIFG 498

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRF-----SIFRLYCWHWLMAACVLVVYLAL 198
           VLQR  ++Y +VSLV +F    Q   Q   RF      I RL    W++   ++V+YLA+
Sbjct: 499 VLQRFGVAYFVVSLVHLFCHREQIASQH--RFVRANVDIIRL-VRQWIIVGLLVVIYLAV 555

Query: 199 LYGTYVPDWQFTIINKDSADYGKVF-NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +     P                V+ N T G+           GYIDR +LG++H+Y HP
Sbjct: 556 ILLIPAPGCPRGYFGPGGKHLFNVYPNCTGGI----------TGYIDRVLLGMSHLYQHP 605

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
             R          ++G            PF+PEG  + + +IL   +G+  G  I+   G
Sbjct: 606 TARYV--------YDG-----------QPFDPEGPFACLPTILQVFLGLQCGSTILSFTG 646

Query: 318 HLARLKQWVTMGFAL-LIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H  RL+++     AL L+ G+   F+     +P+NK L++LSYV  T+  A L+    Y 
Sbjct: 647 HRQRLQRFAVWSVALGLVAGVLCGFSKNDGWLPVNKNLWSLSYVLATASLAYLLLLICYY 706

Query: 374 LVD 376
            +D
Sbjct: 707 TID 709


>gi|242022263|ref|XP_002431560.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212516863|gb|EEB18822.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 607

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 107/368 (29%), Positives = 172/368 (46%), Gaps = 47/368 (12%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D    Q  S     R+ S+D FRGLAV LMI V+   G +  + HA WNG  +ADFV P+
Sbjct: 185 DDRTTQASSKPARHRIKSIDTFRGLAVVLMIFVNDGAGHYWFLEHATWNGILVADFVFPW 244

Query: 80  FLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FL+++G+ I ++    LKR   R   +  VI R + L   G+LL        + +  G D
Sbjct: 245 FLWVMGLCIPISIRTQLKRNVSRWKILGHVIKRGILLFGLGVLL--------NTVGIGSD 296

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEI-FT-KDVQDKDQSVGR-----FSIFRLYCWHWLMA 188
           +  IR+ GVLQR ++ YL+++++ + FT + + ++++  G      F    +    W++ 
Sbjct: 297 LETIRIPGVLQRFSIVYLIIAILGVCFTPRSISNENRFPGSSFRETFQDIIIIFPQWIVI 356

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
             ++  Y   ++ + VP      +       G  FN   G            GY+D+ +L
Sbjct: 357 LSIVAAYCYFVFFSPVPGCPSGYLGPGGIQDGGRFNECTG---------GMTGYVDKVLL 407

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G+ H+Y +P                     +  +   PF+PEGLL  + SI     GV  
Sbjct: 408 GVEHIYKNPT-------------------SSKVYKSGPFDPEGLLGVMPSIFQAFFGVQA 448

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  +++     A+L +W T G    I  L L     +P+NK L++LSYV  T+ +A L+ 
Sbjct: 449 GATLLYHPEWKAKLIRWFTWGILNGILALLLSLPGIVPINKNLWSLSYVFTTTSSAFLIL 508

Query: 369 SAIYALVD 376
             IY   D
Sbjct: 509 CVIYFFQD 516


>gi|301608954|ref|XP_002934053.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Xenopus (Silurana) tropicalis]
          Length = 633

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 116/378 (30%), Positives = 183/378 (48%), Gaps = 52/378 (13%)

Query: 17  SEPDVSDQQEKSHL---KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
           +  D+S Q+  S       QRL SLD FRGLA+ +M+ V++ GG +    H  WNG  +A
Sbjct: 217 NRADISSQETYSRAWNPSVQRLRSLDTFRGLALTIMVFVNYGGGGYWFFKHQSWNGLTVA 276

Query: 74  DFVMPFFLFIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGI-LLQGGFSHAPD 128
           D V P+F+FI+G +I L+L  +      R + + KV++R+++L   G+ ++   +   P 
Sbjct: 277 DLVFPWFVFIMGTSIYLSLNSMLSKGSSRWNLLGKVLWRSVQLFLIGLFVINVNYCRGP- 335

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDK-DQSVGRFSIFRLYC-W-H 184
                +    IR+ GVLQR++L+YL VS +E IF+K   D   QS   F +  +   W  
Sbjct: 336 -----LSFSEIRIMGVLQRLSLTYLAVSALELIFSKPTPDALTQSRTCFLLQDVLSHWPK 390

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYI 243
           W++   +  V+L L     VPD     +      D+GK  N T G          A GYI
Sbjct: 391 WIVILALEAVWLCLTLLLQVPDCPLGYLGPGGIGDFGKFPNCTGG----------AAGYI 440

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           DR +LG  H+Y HP        T +  ++  +          P++PEGLL +++ ++   
Sbjct: 441 DRMILGQGHIYQHP--------TSNVIYKSTM----------PYDPEGLLGTINCVVMAF 482

Query: 304 IGVHFGHVIIHTKGH----LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
            G+  G +++  K      L R   W + MG    +          IP+NK L+++SY+ 
Sbjct: 483 FGLQAGIILVLYKNQHKYVLVRFFSWAIIMGVLSAVLTKCSTNEGIIPVNKNLWSISYIT 542

Query: 359 VTSGAAALVFSAIYALVD 376
             S  A  +   IY LVD
Sbjct: 543 TLSCFAYFLLMLIYFLVD 560


>gi|332028000|gb|EGI68051.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acromyrmex
           echinatior]
          Length = 557

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 112/383 (29%), Positives = 172/383 (44%), Gaps = 45/383 (11%)

Query: 4   IKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS 63
           I A  +  H +      V D+     L  +R+ ++D FRG +   MI V+   G +  + 
Sbjct: 137 ISAGRSLWHMITKCVTGVKDKSNNKKLAKRRVKAIDTFRGASTLFMIFVNDGSGSYTVLE 196

Query: 64  HAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGF 123
           H  W+G  L D V P F++I+GV I +AL     R  +  ++ +  LK  F   L+    
Sbjct: 197 HTIWDGMLLGDIVFPCFMWIMGVCIPIALSAQLKRGVSKLQISYSILKRSFLLFLIGVSL 256

Query: 124 SHAPDELTYGVD--VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD-QSVGRFSIFRL 180
           +      T G D  V  IR+ GVLQR  ++YL+VSLV +     Q K  ++     I R 
Sbjct: 257 N------TLGTDSQVENIRIFGVLQRFGVTYLVVSLVYLCFPSQQSKILRNTSPTWIMRK 310

Query: 181 Y------CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKL 233
                    HW +    ++V+ AL +   VP      +      + GK FN T G     
Sbjct: 311 MQDILSLLPHWFVMLIFVIVHCALTFCLPVPGCPTGYLGPGGMHEDGKYFNCTGG----- 365

Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
                A GYID+ VL +NH+Y +P  +                         PF+PEG+L
Sbjct: 366 -----ATGYIDKTVLTLNHIYQYPTIKSVYG-------------------SGPFDPEGIL 401

Query: 294 SSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYT 353
             +++I    +GVH G +++  K    R+ +W+         G   HFTN IP+NK L++
Sbjct: 402 GCLTAIFQVFLGVHAGTILMLYKDWKDRVMRWLLWAVFYACLGCAFHFTNTIPVNKNLWS 461

Query: 354 LSYVCVTSGAAALVFSAIYALVD 376
           LS+V VT+  +    S  Y L+D
Sbjct: 462 LSFVFVTTSFSLAFLSGCYLLID 484


>gi|91079154|ref|XP_966977.1| PREDICTED: similar to CG6903 CG6903-PA [Tribolium castaneum]
          Length = 533

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 114/369 (30%), Positives = 180/369 (48%), Gaps = 56/369 (15%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           S+ +EK     +RL SLD FRG+++ +MI V++  G +P + HA WNG +LAD V P+F+
Sbjct: 167 SETKEKKPEGKKRLKSLDTFRGISIVIMIFVNYGSGGYPVLDHATWNGLHLADLVFPWFM 226

Query: 82  FIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           +I+G  + ++L    K+     D    V+ R++KL   G+ L             G  + 
Sbjct: 227 WIMGACMPISLTSSFKKQISNKDIFLNVLKRSIKLFCLGVFLNA-----------GPYLE 275

Query: 138 MIRLCGVLQRIALSYLLVSLVEIF--TKDVQDKDQSVGRFSIFRLYCWH-WLMAACVLVV 194
            +R+ GVLQR  + YL+V+ + +F   ++  +    +G+F    L  W  W++   +  V
Sbjct: 276 CMRIFGVLQRFGICYLVVTTICLFLMKREFSESKHKIGKFFTDILVLWKGWIVVLIIFFV 335

Query: 195 ---YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              +L LL     P  +  +      + GK FN T G          A GYID  +LG N
Sbjct: 336 HCMFLFLLADEGCP--RGYLGPGGLHENGKHFNCTGG----------ATGYIDAVILG-N 382

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H Y  P  +     TQ                   F+PEG+L  ++SI+   IGV  G  
Sbjct: 383 HRYQKPTSKEIYLGTQ------------------AFDPEGILGCLTSIVHVFIGVQAGIT 424

Query: 312 IIHTKGHLARLKQWVTMG-FALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALV 367
           ++  K H ARL +W++    A ++ G    F+     IP+NK L+++S+V VTS  A L+
Sbjct: 425 LLVYKEHSARLIRWLSWSVLAGIVGGALCGFSKEDGLIPVNKNLWSISFVLVTSCFAFLL 484

Query: 368 FSAIYALVD 376
            S  Y L+D
Sbjct: 485 LSICYVLID 493


>gi|270004236|gb|EFA00684.1| hypothetical protein TcasGA2_TC003561 [Tribolium castaneum]
          Length = 569

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 114/369 (30%), Positives = 180/369 (48%), Gaps = 56/369 (15%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           S+ +EK     +RL SLD FRG+++ +MI V++  G +P + HA WNG +LAD V P+F+
Sbjct: 167 SETKEKKPEGKKRLKSLDTFRGISIVIMIFVNYGSGGYPVLDHATWNGLHLADLVFPWFM 226

Query: 82  FIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           +I+G  + ++L    K+     D    V+ R++KL   G+ L             G  + 
Sbjct: 227 WIMGACMPISLTSSFKKQISNKDIFLNVLKRSIKLFCLGVFLNA-----------GPYLE 275

Query: 138 MIRLCGVLQRIALSYLLVSLVEIF--TKDVQDKDQSVGRFSIFRLYCWH-WLMAACVLVV 194
            +R+ GVLQR  + YL+V+ + +F   ++  +    +G+F    L  W  W++   +  V
Sbjct: 276 CMRIFGVLQRFGICYLVVTTICLFLMKREFSESKHKIGKFFTDILVLWKGWIVVLIIFFV 335

Query: 195 ---YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              +L LL     P  +  +      + GK FN T G          A GYID  +LG N
Sbjct: 336 HCMFLFLLADEGCP--RGYLGPGGLHENGKHFNCTGG----------ATGYIDAVILG-N 382

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H Y  P  +     TQ                   F+PEG+L  ++SI+   IGV  G  
Sbjct: 383 HRYQKPTSKEIYLGTQ------------------AFDPEGILGCLTSIVHVFIGVQAGIT 424

Query: 312 IIHTKGHLARLKQWVTMG-FALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALV 367
           ++  K H ARL +W++    A ++ G    F+     IP+NK L+++S+V VTS  A L+
Sbjct: 425 LLVYKEHSARLIRWLSWSVLAGIVGGALCGFSKEDGLIPVNKNLWSISFVLVTSCFAFLL 484

Query: 368 FSAIYALVD 376
            S  Y L+D
Sbjct: 485 LSICYVLID 493


>gi|348529394|ref|XP_003452198.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Oreochromis niloticus]
          Length = 600

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 115/381 (30%), Positives = 183/381 (48%), Gaps = 60/381 (15%)

Query: 19  PDVSDQQEKSHL-KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
           P V+D      L  ++RL SLD FRG+++ +M+ V++ GG +    H  WNG  +AD V 
Sbjct: 188 PPVTDNILPPPLTSSKRLRSLDTFRGISLVIMVFVNYGGGRYWFFRHESWNGLTVADLVF 247

Query: 78  PFFLFIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           P+F+FI+G +IAL++  +      R   ++K ++R+L+L   G+L+       P+     
Sbjct: 248 PWFVFIMGTSIALSINALLRAGATRCSLLRKAVWRSLQLFIIGVLVIN-----PNYCQGA 302

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTK----DVQDKDQSVGRFSIFRLYCWHWLMAA 189
           +    +R+ GVLQR+A SYL+V+ +++       DV   D        F LY   WL   
Sbjct: 303 LAWENLRIPGVLQRLAWSYLVVACLDLLVARGQLDVITVDAWWSPAIDFLLYWPAWLCVI 362

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
            + V++L L +   VPD     +      D G   N T G          A G+IDR +L
Sbjct: 363 LLEVLWLFLTFLLPVPDCPTGYLGPGGIGDMGLYVNCTGG----------AAGFIDRLLL 412

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  HMY +P+ R   A                     P++PEG+L S++SIL   +G+  
Sbjct: 413 GEKHMYQNPSSRVIYA------------------TRIPYDPEGVLGSINSILMAFLGLQA 454

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA-----------IPLNKQLYTLSYV 357
           G +I+H +     +          LI+GL+L   +A           IP+NK L++LSYV
Sbjct: 455 GKIILHYRERPKSIMS------RFLIWGLSLGVISAFLTKCSTDRGFIPVNKNLWSLSYV 508

Query: 358 CVTSGAAALVFSAIYALVDCN 378
              +  A ++   +Y +VD N
Sbjct: 509 TTLACFAFVLLMLVYYIVDVN 529


>gi|443685781|gb|ELT89271.1| hypothetical protein CAPTEDRAFT_227545 [Capitella teleta]
          Length = 605

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 120/391 (30%), Positives = 186/391 (47%), Gaps = 57/391 (14%)

Query: 13  PLIISEPDVSDQQEKSHLKT--QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGC 70
           P + +E    D +  +  KT  +RL SLD FRG+++ +MI V++ GG +    H+ WNG 
Sbjct: 188 PEMQTESATDDAETTAVNKTHKERLRSLDAFRGMSLTIMIFVNYGGGGYWFFDHSYWNGL 247

Query: 71  NLADFVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHA 126
            LAD V P+F +I+G A+AL+    ++R   +     K+I RT  L   GI+L  G    
Sbjct: 248 TLADLVFPWFTWIIGTALALSIQGQMRRGKTKFSIAAKIIRRTCVLFALGIVLGSGGGSE 307

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFRLYCWHW 185
           P      VDV+ +R+ GVLQR+A+SYL+V+L+  IF K   +KD    R  + R    HW
Sbjct: 308 P------VDVQTLRIPGVLQRLAISYLVVALLHLIFAK--ANKDHQPSRLDMVRDITDHW 359

Query: 186 LMAACVLVV---YLALLYGTYVPDWQFTIINKDSA-----DYGKVFNVTCGVRAKLNPPC 237
                VLV+   +L L +   + D + T            + GK  N T G         
Sbjct: 360 PQWGIVLVMVACHLGLTFLLPISDVEGTCPTGYLGPGGLHEGGKYENCTGG--------- 410

Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
            A   IDR      H+Y  P  +      +                  P +PEG+L +++
Sbjct: 411 -AAAVIDRWFFSRQHVYQTPTCKEVYKTVE------------------PHDPEGILGTLT 451

Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTNA---IPLNKQLYT 353
           SI    +G+  G ++   K    R+++W+  G  L LI GL   F      IP+NK L++
Sbjct: 452 SIFLCFLGLQAGVILTTFKQKSPRMRRWIVWGIILGLIAGLLCGFKQDGGWIPVNKNLWS 511

Query: 354 LSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           LS+V   +  A ++ +  Y L+D  + G W+
Sbjct: 512 LSFVLGLASMAFVLLAVFYLLID--VHGLWS 540


>gi|417411833|gb|JAA52338.1| Putative heparan-alpha-glucosaminide n-acetyltransferase, partial
           [Desmodus rotundus]
          Length = 595

 Score =  147 bits (372), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 113/366 (30%), Positives = 180/366 (49%), Gaps = 51/366 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL  +D FRGLA+ LM+ V++ GG +    HA WNG  +ADFV P+F+FI+G +I L++ 
Sbjct: 199 RLRCVDTFRGLALILMVFVNYGGGQYWYFKHASWNGLTVADFVFPWFVFIMGSSIFLSMS 258

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            +  R  +    + KV +R+  L+  G+++       P+     +    +RL GVLQR+ 
Sbjct: 259 SVLQRGCSKFRLLGKVAWRSFLLICIGVIVVN-----PNYCLGPLSWDKVRLPGVLQRLG 313

Query: 150 LSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E+ F K V ++    GR S  +  +  W  WL    +  ++LAL +   VP
Sbjct: 314 VTYFVVAVLELLFAKPVPERGAWEGRCSSLQDIMSSWPQWLFILMLESIWLALTFFLPVP 373

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 374 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGEDHIYQHPS------ 417

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
                                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 418 ------------STVLYHTRVAYDPEGILGTINSIVMAFLGVQAGKILLYYKEQTKDILI 465

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           R   W      L+  GLT    N   IP+NK L+++SYV   S  A  +   +Y +VD  
Sbjct: 466 RFTAWCCF-LGLISVGLTKISENEGFIPVNKNLWSISYVTTLSSFAFFILLILYPIVD-- 522

Query: 379 IFGPWT 384
           + G WT
Sbjct: 523 VKGVWT 528


>gi|449500329|ref|XP_004174928.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
           N-acetyltransferase-like [Taeniopygia guttata]
          Length = 789

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 105/340 (30%), Positives = 165/340 (48%), Gaps = 48/340 (14%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G +IALAL
Sbjct: 399 QRLRSLDTFRGLSLVIMVFVNYGGGKYWFFKHVSWNGLTVADLVFPWFVFIMGTSIALAL 458

Query: 93  KRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             +      +   ++K+I+R+  L+  GI++       P+     +    +R+ GVLQR+
Sbjct: 459 GSMLRWGSSKWKVLRKIIWRSFVLILLGIIVVN-----PNYCLGPLSWDNLRIPGVLQRL 513

Query: 149 ALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
             +YL+V+ +E +FT+     D+          Y   W+    +  ++L L +   VPD 
Sbjct: 514 GFTYLVVAALELLFTR----ADRRFPALQDILPYWPQWIFILVLETIWLCLTFLLPVPDC 569

Query: 208 QFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
               +      D+GK  N T G          A GYIDR +LG  HMY HP    S   T
Sbjct: 570 PRGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLILGEKHMYQHP----SSGVT 615

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----LARL 322
             S                P++PEG+L +++SI+   +G+  G + +  K H    ++R 
Sbjct: 616 YQSTM--------------PYDPEGILGTINSIVMAFLGLQAGKITLFYKDHPKQIMSRF 661

Query: 323 KQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
             W + MG    I          IP+NK L+++SYV   S
Sbjct: 662 IIWGIVMGVISAILTKCSKEEGFIPVNKNLWSISYVTTMS 701


>gi|443694948|gb|ELT95966.1| hypothetical protein CAPTEDRAFT_92095 [Capitella teleta]
          Length = 431

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 113/397 (28%), Positives = 185/397 (46%), Gaps = 66/397 (16%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
           +S  K ++T  +     E D         +K +RL SLD FRGL + LMI V++ GG + 
Sbjct: 7   ISSAKTDSTRRNS---EEKDEGKLITPKEVKKERLRSLDAFRGLNILLMIFVNYGGGGYW 63

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWG 116
             SHA WNG  + D + P+F+FI+G ++ L +    K+  D  + +  +I+R++KL   G
Sbjct: 64  YFSHAVWNGLYITDLIFPWFIFIMGTSLGLGISSLVKKEVDPVEGLWGIIWRSVKLFAVG 123

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
           I+     S+         D+  IR+ GVLQR+A+ Y + ++V    + +Q   +S G  S
Sbjct: 124 IMYNTKSSN---------DLENIRMTGVLQRLAMVYFITAIVHYAGESLQCCMRSRGTVS 174

Query: 177 IFRL-------YCWHWLMAACVLVVYLALLYGTYVPDWQFTII-----NKDSADYGKVFN 224
            +R        Y   W+    ++ +Y    Y   VP  +   +     ++D A  G    
Sbjct: 175 RWRHILSDLAPYFGEWITMLVIIGIYCYFTYWFAVPGCEAGYVGPGGLHRDGAHAG---- 230

Query: 225 VTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH 284
             C   A L        YID KV  + H+Y  P  R       DS               
Sbjct: 231 --CTGGAAL--------YIDLKVYTMRHIYQWPDIR--TIYQTDS--------------- 263

Query: 285 APFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW-----VTMGFALLIFGLTL 339
             F+PEGLL +++SI    +G+  G +++  KGH  RL +W     +T G   L+    L
Sbjct: 264 -AFDPEGLLGTLTSIFLCFLGLQAGKILVCHKGHRERLVRWLIWAIITGGIGTLLCKAQL 322

Query: 340 HFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
                +P+NK + ++S+V V++G   ++ S +Y L+D
Sbjct: 323 E-DGWVPINKNIMSISFVLVSAGTGFIMLSVMYILID 358


>gi|417411831|gb|JAA52337.1| Putative heparan-alpha-glucosaminide n-acetyltransferase, partial
           [Desmodus rotundus]
          Length = 595

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 179/366 (48%), Gaps = 51/366 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL  +D FRGLA+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 199 RLRCVDTFRGLALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMS 258

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            +  R  +    + KV +R+  L+  G+++       P+     +    +RL GVLQR+ 
Sbjct: 259 SVLQRGCSKFRLLGKVAWRSFLLICIGVIVVN-----PNYCLGPLSWDKVRLPGVLQRLG 313

Query: 150 LSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E+ F K V ++    GR S  +  +  W  WL    +  ++LAL +   VP
Sbjct: 314 VTYFVVAVLELLFAKPVPERGAWEGRCSSLQDIMSSWPQWLFILMLESIWLALTFFLPVP 373

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 374 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGEDHIYQHPS------ 417

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
                                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 418 ------------STVLYHTRVAYDPEGILGTINSIVMAFLGVQAGKILLYYKEQTKDILI 465

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           R   W      L+  GLT    N   IP+NK L+++SYV   S  A  +   +Y +VD  
Sbjct: 466 RFTAWCCF-LGLISVGLTKISENEGFIPVNKNLWSISYVTTLSSFAFFILLILYPIVD-- 522

Query: 379 IFGPWT 384
           + G WT
Sbjct: 523 VKGVWT 528


>gi|345781561|ref|XP_539948.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Canis
           lupus familiaris]
          Length = 638

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 116/378 (30%), Positives = 182/378 (48%), Gaps = 53/378 (14%)

Query: 24  QQEKSHLKTQ--RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           QQE  H  +   RL S+D FRGLA+ LM+ V++ GG +    H+ WNG  +AD V P+F+
Sbjct: 230 QQEAWHPPSALPRLRSIDTFRGLALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFV 289

Query: 82  FIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FI+G +I L++  +  R  +    + K+ +R+  L   G+++       P+     +   
Sbjct: 290 FIMGSSIFLSMTSMLQRGCSKFRLLGKIAWRSFLLFCIGVVIVN-----PNYCLGPLSWD 344

Query: 138 MIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLV 193
            +R+ GVLQR+ ++Y +V+++E IF K V +   S  R    R  +  W  WL    +  
Sbjct: 345 KVRIPGVLQRLGVTYFVVAVLELIFAKPVPESCASERRCFSLRDIILSWPQWLFILLLES 404

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           ++L L +   VP      +      D GK  N T G          A GYIDR +LG +H
Sbjct: 405 IWLGLTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDH 454

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP    S A    +                P++PEG+L ++SSI+   +G+  G ++
Sbjct: 455 IYQHP----SSAVLYHT--------------KVPYDPEGILGTISSIVMAFLGIQAGKIL 496

Query: 313 IH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAAL 366
           ++    TK  L R   W      L+   LT    N   IP+NK L+++SYV   S  A  
Sbjct: 497 LYYKDQTKDILIRFTAWCCF-LGLISVALTKISENEGFIPINKNLWSISYVTTLSSFAFF 555

Query: 367 VFSAIYALVDCNIFGPWT 384
           +   +Y +VD    G WT
Sbjct: 556 ILLILYPIVDVK--GLWT 571


>gi|390367684|ref|XP_789038.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Strongylocentrotus purpuratus]
          Length = 624

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 183/385 (47%), Gaps = 57/385 (14%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           ++E D +  Q  S  K +RL SLD FRG+++ +MI V++ GG +   +H+ WNG  +AD 
Sbjct: 215 VAEADSNSIQRPSRDKPKRLKSLDAFRGMSLVIMIFVNYGGGQYSFFNHSIWNGLTVADL 274

Query: 76  VMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           V P+F++I+GV+I ++    ++    R     K+I R + L   GI+L G          
Sbjct: 275 VFPWFIWIMGVSITMSFYALVRHGVSRRVIFTKIIRRFVILFGLGIILDG---------- 324

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL------YCWHW 185
            G+D    R+ GVLQRIA SYL+V+ V +F    +D++  + R  ++R       Y + W
Sbjct: 325 -GIDFSTFRVPGVLQRIAFSYLVVATVHLFAVKHKDEEYRI-RHVVYRELRDLLDYWYEW 382

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           ++    L +++ L +   VP      +       G+         + +N    A  YID+
Sbjct: 383 IIMISFLALHICLTFFLPVPGCPTGYLGPGGPLVGE-------NESLVNCTGGAANYIDK 435

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +L  NH Y     R+    T                   P +PEG+L +++SI  T +G
Sbjct: 436 VILTYNHTYPRGTPRKIYQTT------------------VPHDPEGILGTLTSIFMTFLG 477

Query: 306 VHFG---HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA---IPLNKQLYTLSYVCV 359
           +  G   H+  + +  + R   W  +    +I G    F+     IP+NK L+++S++  
Sbjct: 478 LQAGKIFHLFSYPRDRILRFLGWCVV--TGVIAGALCGFSKEDGIIPVNKNLWSVSFILA 535

Query: 360 TSGAAALVFSAIYALVDCNIFGPWT 384
           T+  A  + +  Y L+D  I+  WT
Sbjct: 536 TASMAFFLLAIFYYLIDVQIW--WT 558


>gi|443731781|gb|ELU16770.1| hypothetical protein CAPTEDRAFT_135912, partial [Capitella teleta]
          Length = 388

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 108/359 (30%), Positives = 169/359 (47%), Gaps = 44/359 (12%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL--- 90
           RL SLD FRG+++ +MI V++ GG +    H+ WNG  LAD V P+F+FI+G ++AL   
Sbjct: 1   RLKSLDTFRGISLVIMIFVNYRGGGYWFFRHSAWNGLTLADLVFPWFVFIMGTSMALSFR 60

Query: 91  -ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            AL+R   R   + KV+ R + L   G+++      A       VD+R +R+ GVLQR+A
Sbjct: 61  GALRRGIPRFKLILKVLKRAMILFALGVMISNSKGKA-------VDLRTLRVPGVLQRLA 113

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           L+YL++ ++E       D  Q              W      + V+  L +   VP    
Sbjct: 114 LTYLVLGIMEAALAKSHDPHQWWSSVRDVVGNLGQWAAVLMFVAVHCCLTFLLPVPGCPK 173

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
             +      +G  +    G          A  YIDR + G  HMY HP        T   
Sbjct: 174 GYLGPGGLQHGGAYENCTG---------GATAYIDRMIFGTEHMYGHP--------TCMI 216

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
           P++  +          P +PEG+L +++SI    +G+  G VI+  +G  +R+ +W+   
Sbjct: 217 PYQTTV----------PLDPEGVLGTLTSIFLCFLGLQAGKVILIFQGWKSRVSRWMCWS 266

Query: 330 FAL-LIFGLTLHFT---NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
               L+ G    F+     IP+NK L++LSYV   +  A L+ S  +  VD  IF  W+
Sbjct: 267 LVTGLVAGCLCKFSAEDGFIPINKNLWSLSYVMALASMAFLLLSVCFLAVD--IFRVWS 323


>gi|198434539|ref|XP_002120178.1| PREDICTED: similar to heparan-alpha-glucosaminide
           N-acetyltransferase [Ciona intestinalis]
          Length = 624

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 112/394 (28%), Positives = 182/394 (46%), Gaps = 55/394 (13%)

Query: 7   ETTHHHPLIISEPDVSDQQEKSHL---KTQRLASLDIFRGLAVALMILVDHAGGDWPEIS 63
           ET  H  L  +EP+   +   + L   K++R+ S+D FRGL + +M+ V+  GGD+    
Sbjct: 189 ETQIHEDLGNTEPNSVQEANPTPLVREKSERIKSIDTFRGLCLVVMVFVNFRGGDYWFFH 248

Query: 64  HAPWNGCNLADFVMPFFLFIVGVAIALAL-----KRIPDRADAVKKVIFRTLKLLFWGIL 118
           H+PW+G  +AD V P+F+FI+GV I L++     K +P+   A  K+I RT+ L   G+ 
Sbjct: 249 HSPWHGLTVADLVFPWFMFIMGVNITLSINSLITKNVPNSKIAY-KLIRRTVLLFGLGMF 307

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
           +          + +       R+ GVLQR A++Y L  +++        + ++  + +  
Sbjct: 308 V----------VNHSTSWAAFRVPGVLQRFAIAYFLPFVLQWAFHLTPIEIETRAKTNEG 357

Query: 179 RLYCWH-----------WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADY-GKVFNVT 226
            L  WH           WL+   +  ++L L +   +P      +     D  GK  N T
Sbjct: 358 ELKWWHWCKDVVPYWLQWLIVLAMEALWLFLTFLLPIPGCPTGYLGPGGLDNDGKYINET 417

Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
           C           A GYIDR + G  H+Y HP  +     T  S                P
Sbjct: 418 C--------VGGAAGYIDRVIFGEAHIYGHPTCKNVYYPTYTSD------------QRVP 457

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA-- 344
           ++PEGLL S++S +  I+G   G + ++ K  L R  +W+   F L +  + L   +A  
Sbjct: 458 YDPEGLLGSINSCIIVILGCQAGKIFLYYKHPLDRAMRWILWCFFLGVISIILCKASANG 517

Query: 345 --IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             IP+NK L+T ++V   +  A  +   IY LVD
Sbjct: 518 GWIPVNKNLWTTTFVTTLACMAFFIIPVIYYLVD 551


>gi|167538367|ref|XP_001750848.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770669|gb|EDQ84352.1| predicted protein [Monosiga brevicollis MX1]
          Length = 779

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 114/427 (26%), Positives = 187/427 (43%), Gaps = 86/427 (20%)

Query: 6   AETT----HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
            ETT     H  ++ S+  +   Q+     +QRL SLD FRG A+ +MI V+  GG +  
Sbjct: 309 GETTGLLLQHATVMPSDAGMHAIQDMKR-SSQRLRSLDSFRGFALTIMIFVNFNGGFYWF 367

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV---IFRTLKLLF-WGI 117
            +H+ WNG  +AD V P+F++I+G ++A+A   +  R      +   IFR + +LF +GI
Sbjct: 368 FNHSAWNGLTVADLVFPWFIWIMGTSMAIAFNSLLKRQTPTTTILYKIFRRMLILFAFGI 427

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR--- 174
            + G F           D+R  R+ GVLQR A+SYL+V+LV ++   ++    SV     
Sbjct: 428 FIIGNFH----------DLRNGRIPGVLQRFAVSYLVVALVMLYAPKMESWCASVSTSDS 477

Query: 175 ------------------------------FSIFRL-------YCWHWLMAACVLVVYLA 197
                                            F L       Y W W+    +++++  
Sbjct: 478 PTPALVRGIAKPGSGHQLDVAADIAEMKPWVRTFLLHTRDLTPYIWEWVAMFVIIIIHTC 537

Query: 198 LLYGTYVPDWQFTIINKDS--ADYGKVFNVTCGVRAKLNPPCN--AVGYIDRKVLGINHM 253
           + +   VP      I      A+YG+       V  +    C   A GYIDR+V G  H+
Sbjct: 538 ITFLLPVPGCPTGYIGPGGALAEYGQFAPPEGEVCGESTFCCEGGASGYIDRQVFGWRHI 597

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y  P                      P +   P++PEGLL S++SI+   +G+  G +I+
Sbjct: 598 YDQP-------------------TSQPIYETGPYDPEGLLGSLTSIVMCFLGLQSGKIIV 638

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTL----HFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           H K H  R + W+     L +    L         IP++K L++LS++ + +  A  + +
Sbjct: 639 HYKSHAQRSRHWLMWALVLGVIATGLCGASQNNGVIPVSKNLWSLSFIILLASFAFFLLT 698

Query: 370 AIYALVD 376
             Y ++D
Sbjct: 699 VFYWVID 705


>gi|307201549|gb|EFN81312.1| Heparan-alpha-glucosaminide N-acetyltransferase [Harpegnathos
           saltator]
          Length = 564

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 102/362 (28%), Positives = 168/362 (46%), Gaps = 41/362 (11%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           ++        +R+ ++D FRG +   MI V+   G +  + H  WNG    D V P F++
Sbjct: 163 EEATNKEPTKRRVKAIDAFRGASTLFMIFVNDGSGSYSVLGHTTWNGMLPGDLVFPCFMW 222

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           I+GV + +AL     R     ++ F  LK  F  +L   G S   + L     +  IR+ 
Sbjct: 223 IMGVCVPIALSAQLRRGIPKLEIAFTVLKRSF--LLFLIGVSL--NTLGTNAQLEKIRVF 278

Query: 143 GVLQRIALSYLLVSLVEIFTK---DVQDKDQSVGRFSI----FRLYCWHWLMAACVLVVY 195
           GVLQR  ++YL+VS++ +  +    +QD+D S  R +      ++   +W     +++V+
Sbjct: 279 GVLQRFGVTYLVVSVMYLCLEPSLQLQDQDSSRNRVTRVLRDMQVLLPYWSFMLILVMVH 338

Query: 196 LALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
             L +G  VP+     +      + G   N T G          A GYIDR VL INH++
Sbjct: 339 CGLTFGLAVPNCPTGYLGPGGTHEDGYYMNCTGG----------AAGYIDRVVLTINHIF 388

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
             P                     A  +   PF+PEG+L  +++     +GVH G +++ 
Sbjct: 389 AGPT-------------------IASVYGSGPFDPEGILGCLTATFQVYLGVHAGVILMM 429

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
            K    R+ +W++      + G  LHF N IP+NK L++LS+V V++  A    S  Y L
Sbjct: 430 YKNWKERVVRWLSWAVLYGVLGCILHFCNVIPVNKNLWSLSFVFVSTSFALAFLSGCYLL 489

Query: 375 VD 376
           +D
Sbjct: 490 ID 491


>gi|157112232|ref|XP_001657450.1| hypothetical protein AaeL_AAEL000933 [Aedes aegypti]
 gi|108883723|gb|EAT47948.1| AAEL000933-PA [Aedes aegypti]
          Length = 569

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 112/380 (29%), Positives = 176/380 (46%), Gaps = 51/380 (13%)

Query: 8   TTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW 67
           +    PL  + P  S    +   +  RL SLD FRG+A+ LMI V+  GG +  I HA W
Sbjct: 157 SAREAPLAAASPSSSGHPVEP--RKTRLQSLDTFRGIAIMLMIFVNSGGGHYWWIEHATW 214

Query: 68  NGCNLADFVMPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGF 123
           NG ++AD V P+FLFI+GV I ++L+    R   R   +  V  R+ KL   G+ L    
Sbjct: 215 NGLHVADLVFPWFLFIMGVCIPISLRSQVSRNIPRKTILANVAVRSFKLFCIGLCLNS-- 272

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQS-VGRFSIFRLYC 182
                    G  V  +RL GVLQR  ++Y +VS + ++      + Q  + R ++  L  
Sbjct: 273 -------INGPQVANLRLFGVLQRFGVAYFVVSAIHLYCYSESIEFQGRLARLNVDILRL 325

Query: 183 W-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVF-NVTCGVRAKLNPPCNAV 240
           W HW++   ++ +YL +++    P                ++ N T G+           
Sbjct: 326 WKHWIIMGAIVFIYLLIMFLVAAPGCPSGYFGPGGKHLMAMYPNCTGGI----------T 375

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
           GY+DR +LG NH+Y HP  R                 DA +     F+PEG    + +IL
Sbjct: 376 GYLDRIILGNNHLYQHPTARYV--------------YDAQA-----FDPEGPFGCLPTIL 416

Query: 301 STIIGVHFGHVII-HTK--GHLARLKQWVTMGFALLIFGLTLHFTNA-IPLNKQLYTLSY 356
              +G+  G +I+ HT+    + R+  W T+   L          +  IP+NK L++LSY
Sbjct: 417 QVFLGLQCGVLILTHTEVMARIRRMAAWGTVLGLLGGILCGFSKNDGWIPVNKNLWSLSY 476

Query: 357 VCVTSGAAALVFSAIYALVD 376
           V VT+  A ++    + L+D
Sbjct: 477 VLVTASLAFVLLLICFVLID 496


>gi|403303686|ref|XP_003942455.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Saimiri boliviensis boliviensis]
          Length = 631

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 178/366 (48%), Gaps = 51/366 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 235 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGTSIFLSMT 294

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 295 SIMQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 349

Query: 150 LSYLLVSLVE-IFTKDVQD---KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +    ++S             WL+   +  ++L L +   VP
Sbjct: 350 VTYFVVAVLELLFAKPVPEHCASERSCLSLQDITSSWPQWLLILALEGLWLGLTFLLPVP 409

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG NH+Y HP    S A
Sbjct: 410 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDNHLYQHP----SSA 455

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SIL   +GV  G ++++    TK  L 
Sbjct: 456 VLYHT--------------EVAYDPEGILGTINSILMAFLGVQAGKILLYYKARTKDILI 501

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD  
Sbjct: 502 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 560

Query: 379 IFGPWT 384
             G WT
Sbjct: 561 --GLWT 564


>gi|443685179|gb|ELT88879.1| hypothetical protein CAPTEDRAFT_26311, partial [Capitella teleta]
          Length = 361

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 111/364 (30%), Positives = 174/364 (47%), Gaps = 55/364 (15%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL--- 90
           RL SLD FRG+++ +MI V++ GG +    H+ WNG  LAD V P+F+FI+G ++AL   
Sbjct: 1   RLKSLDTFRGISLVIMIFVNYRGGGYWFFRHSAWNGLTLADLVFPWFVFIMGTSMALSFR 60

Query: 91  -ALKRIPDRADAVKKVIFRTLKLLFWGILL---QGGFSHAPDELTYGVDVRMIRLCGVLQ 146
            AL+R   R   + KV+ R + L   G+++   +G F           D+R +R+ GVLQ
Sbjct: 61  GALRRGIPRFKLILKVLKRAMILFALGVMISNSKGAF-----------DLRTLRVPGVLQ 109

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQ--SVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           R+AL+YL++ ++E       D  Q  S+ R  +  L    W      + V+  L +   V
Sbjct: 110 RLALTYLVLGIMEAALAKSHDPHQWWSLVRDVVGNLG--QWAAVLMFVAVHCCLTFLLPV 167

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
           P      +      +G  +    G          A  YIDR + G  HMY HP       
Sbjct: 168 PGCPKGYLGPGGLQHGGAYENCTG---------GATAYIDRMIFGTEHMYGHP------- 211

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
            T   P++             P +PEG+L +++SI    +G+  G VI+  +G  +R+ +
Sbjct: 212 -TCMIPYQ----------TTVPLDPEGVLGTLTSIFLCFLGLQAGKVILIFQGWKSRVSR 260

Query: 325 WVTMGFAL-LIFGLTLHFT---NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIF 380
           W+       L+ G    F+     IP+NK L++LSYV   +  A L+ S  +  VD  IF
Sbjct: 261 WMCWSLVTGLVAGCLCKFSAEDGFIPINKNLWSLSYVMALASMAFLLLSVCFLAVD--IF 318

Query: 381 GPWT 384
             W+
Sbjct: 319 RVWS 322


>gi|395842491|ref|XP_003794051.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Otolemur garnettii]
          Length = 677

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 117/381 (30%), Positives = 189/381 (49%), Gaps = 54/381 (14%)

Query: 22  SDQQEKS-HLKTQ--RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
           SD Q  + HL T   RL  +D FRG+++ LM+ V++ GG +    H+ WNG  +AD V P
Sbjct: 266 SDAQPATWHLSTHPPRLRCVDTFRGISLTLMVFVNYGGGKYWYFKHSSWNGLTVADLVFP 325

Query: 79  FFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
           +F+FI+G ++ L+    L+R   +   ++K+ +R+  L+  GI++       P+     +
Sbjct: 326 WFVFIMGSSVFLSMTSVLQRGCSKGRLLRKIAWRSFLLICIGIIIVN-----PNYCLGPL 380

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSV-GRFSIFRLY-CW-HWLMAAC 190
               +R+ GVLQR+ ++Y +V+++E +F K V +   S  G FS+  +   W  WL+   
Sbjct: 381 SWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPENCASQRGCFSLGDVTSSWPQWLLILT 440

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
           +  V+L L +   VP      +      D GK  N T G          A GYID  +LG
Sbjct: 441 LESVWLCLTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDHLLLG 490

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
            NH+YHHP    S A    +                 ++PEG+L +++SI+   +GV  G
Sbjct: 491 ENHLYHHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAG 532

Query: 310 HVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGA 363
            ++++    TK  L R   W  +   L+   LT    N   IP+NK L+++SYV   S  
Sbjct: 533 KILLYYKDQTKDILMRFAGWCCI-LGLISVALTKVSENEGFIPVNKNLWSISYVTTLSCF 591

Query: 364 AALVFSAIYALVDCNIFGPWT 384
           A  +   +Y +VD  + G WT
Sbjct: 592 AFFILLVLYPVVD--VKGLWT 610


>gi|322790964|gb|EFZ15612.1| hypothetical protein SINV_04659 [Solenopsis invicta]
          Length = 581

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 115/374 (30%), Positives = 193/374 (51%), Gaps = 50/374 (13%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           + E + S    ++   + R+ S+D FRG+A+ LMI VD+ GG +   +H+ WNG  +AD 
Sbjct: 154 LQEAETSTPIVRTSRSSTRIRSIDTFRGIALLLMIFVDNGGGKYVFFNHSAWNGLTVADL 213

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           V+P+F +I+G++I ++ +     +++  K+IFR L+     +LL    +    E      
Sbjct: 214 VLPWFAWIMGLSITISKRSELRVSNSRMKIIFRCLQRALVLVLLGLMLNSMSME-----S 268

Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWLMAAC 190
           ++ +R  GVLQ +A+SY + + +E IF K   QD     GRFSI R  L  W  WL+   
Sbjct: 269 LKHLRFPGVLQLLAVSYFVCATIETIFMKAHSQDDVLQFGRFSILRDILNNWAQWLIILA 328

Query: 191 VLVVYLALLYGTYVPDWQFTIINK--DSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
           ++V ++ + +   VP+     +    + + YGK  N T G          A GYIDR V 
Sbjct: 329 IMVTHILITFLLPVPNCPTGYLGPGGNYSRYGKFPNCTGG----------AAGYIDRLVF 378

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G +H+Y           TQ+ P  G +          P +PEG+++++S IL   +GVH 
Sbjct: 379 G-SHVYSK---------TQN-PVYGTI---------LPHDPEGIMNTMSIILVVYMGVHA 418

Query: 309 GHVII---HTKGHLARLKQWVTMGFALLIFGLTLHF---TNAIPLNKQLYTLSYVCVTSG 362
           G +++      G + R   W ++    LI GL  HF   +  IP++K++ +LS+V   S 
Sbjct: 419 GKILLLYYQCNGRVIRWLLWSSV--TGLIAGLLCHFDKESGVIPVSKKMMSLSFVLTVSC 476

Query: 363 AAALVFSAIYALVD 376
            A L+++ ++ LVD
Sbjct: 477 FAFLLYAILHFLVD 490


>gi|332241088|ref|XP_003269721.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Nomascus leucogenys]
          Length = 654

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 178/366 (48%), Gaps = 51/366 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 258 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMA 317

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 318 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 372

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 373 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 432

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 433 GCPIGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 478

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 479 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 524

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD  
Sbjct: 525 RFTAWCCI-LGLISVALTKVSENERFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 583

Query: 379 IFGPWT 384
             G WT
Sbjct: 584 --GLWT 587


>gi|194226375|ref|XP_001488696.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Equus caballus]
          Length = 663

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 111/367 (30%), Positives = 177/367 (48%), Gaps = 51/367 (13%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA- 91
           QRL  +D FRG+A+ +M+ V++ GG +    H+ WNG  +AD V P+F+FI+G +I L+ 
Sbjct: 266 QRLRCVDTFRGIALIIMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSM 325

Query: 92  ---LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
              L+R   +   + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+
Sbjct: 326 TSTLQRGCSKFRLLGKIAWRSFLLISLGIVVVN-----PNYCLGPLSWDKLRIPGVLQRL 380

Query: 149 ALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYV 204
            ++Y +V+++E +F K V     S  R S  R  L  W  WL    +  ++L L +   V
Sbjct: 381 GVTYFVVAVLELLFAKPVPGSGASERRCSSLRDILSSWPQWLFILLLESIWLGLTFFLPV 440

Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           P      +      D G+  N T G          A GYIDR +LG +H+Y HP    S 
Sbjct: 441 PGCPTGYLGPGGIGDLGRYPNCTGG----------AAGYIDRLLLGEDHLYQHP----SS 486

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHL 319
           A    +                 ++PEG+L +++SI+   +GV  G ++++    TK  L
Sbjct: 487 AVLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGRILLYYKDQTKAIL 532

Query: 320 ARLKQWVTMGFALLIFGLTLHFTN--AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
            R   W      L+   LT    N  +IP+NK L+++SYV   S  A  +   +Y  VD 
Sbjct: 533 LRFTAWSCF-LGLISVALTKVSENEGSIPINKNLWSISYVTTLSSFAFFILLVLYPTVDV 591

Query: 378 NIFGPWT 384
              G WT
Sbjct: 592 K--GLWT 596


>gi|297682811|ref|XP_002819101.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Pongo
           abelii]
          Length = 645

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 113/366 (30%), Positives = 179/366 (48%), Gaps = 51/366 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 249 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 308

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 309 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 363

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 364 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 423

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 424 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 469

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +V+SI+   +GV  G ++++    TK  L 
Sbjct: 470 VLYHT--------------EVAYDPEGILGTVNSIVMAFLGVQAGKILLYYKARTKDILI 515

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD  
Sbjct: 516 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVD-- 572

Query: 379 IFGPWT 384
           + G WT
Sbjct: 573 VKGLWT 578


>gi|115292433|ref|NP_084160.1| heparan-alpha-glucosaminide N-acetyltransferase [Mus musculus]
 gi|341940800|sp|Q3UDW8.2|HGNAT_MOUSE RecName: Full=Heparan-alpha-glucosaminide N-acetyltransferase;
           AltName: Full=Transmembrane protein 76
          Length = 656

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 180/385 (46%), Gaps = 54/385 (14%)

Query: 17  SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           ++P  +D Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD 
Sbjct: 242 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 301

Query: 76  VMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           V P+F+FI+G +I L++  I  R       + K+++R+  L+  G+++       P+   
Sbjct: 302 VFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCL 356

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
             +    +R+ GVLQR+ ++Y +V+++E F  K V   D      S F L      W  W
Sbjct: 357 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 414

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L    +  ++LAL +   VP      +      D GK  + T G          A GYID
Sbjct: 415 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 464

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y HP+                            ++PEG+L +++SI+   +
Sbjct: 465 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 506

Query: 305 GVHFGHVIIH----TKGHLARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           GV  G ++++    TK  L R   W   +G   ++          IP+NK L+++SYV  
Sbjct: 507 GVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINKNLWSISYVTT 566

Query: 360 TSGAAALVFSAIYALVDCNIFGPWT 384
            S  A  +   +Y +VD  + G WT
Sbjct: 567 LSCFAFFILLILYPVVD--VKGLWT 589


>gi|74208071|dbj|BAE29143.1| unnamed protein product [Mus musculus]
          Length = 656

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 180/385 (46%), Gaps = 54/385 (14%)

Query: 17  SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           ++P  +D Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD 
Sbjct: 242 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 301

Query: 76  VMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           V P+F+FI+G +I L++  I  R       + K+++R+  L+  G+++       P+   
Sbjct: 302 VFPWFVFIMGTSIFLSMTSILQRGCSKFKLLGKIVWRSFLLICIGVIIVN-----PNYCL 356

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
             +    +R+ GVLQR+ ++Y +V+++E F  K V   D      S F L      W  W
Sbjct: 357 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 414

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L    +  ++LAL +   VP      +      D GK  + T G          A GYID
Sbjct: 415 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 464

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y HP+                            ++PEG+L +++SI+   +
Sbjct: 465 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 506

Query: 305 GVHFGHVIIH----TKGHLARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           GV  G ++++    TK  L R   W   +G   ++          IP+NK L+++SYV  
Sbjct: 507 GVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINKNLWSISYVTT 566

Query: 360 TSGAAALVFSAIYALVDCNIFGPWT 384
            S  A  +   +Y +VD  + G WT
Sbjct: 567 LSCFAFFILLILYPVVD--VKGLWT 589


>gi|395501613|ref|XP_003755186.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Sarcophilus harrisii]
          Length = 425

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 109/376 (28%), Positives = 173/376 (46%), Gaps = 51/376 (13%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFR--GLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
           D S    K  +   R   L +    GL++ LM+ V++ GG +    HAPWNG  +AD VM
Sbjct: 13  DASVFNNKGKIINFRPWELSLVSKHGLSLTLMVFVNYGGGGYWFFEHAPWNGLTVADLVM 72

Query: 78  PFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGIL-LQGGFSHAPDELTY 132
           P+F+FI+G ++ L    +  R       ++KV +RT  L+  G+L L  G +  P   ++
Sbjct: 73  PWFVFILGTSVGLTFHNMQKRGVKNIQLLRKVAWRTGVLIIIGVLFLNYGPADGPLSWSW 132

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR-FSIFR---LYCWHWLMA 188
                  RL GVLQR+  +Y  V+L++I       +   V   ++ FR   LY   W + 
Sbjct: 133 A------RLPGVLQRLGFTYFAVALMQIAFGVTDTQIYQVNLWWAPFRDVILYWKEWFII 186

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
             + +++L L +   VP      +      D GK FN T G          A  YID+ +
Sbjct: 187 ISLEILWLCLTFLLPVPGCPRGYLGPGGIGDEGKYFNCTGG----------AAAYIDKWI 236

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
           LG NH+Y  P+ +     TQ                  PF+PEG+L +++SIL    G+ 
Sbjct: 237 LGENHLYQFPSCKELYKTTQ------------------PFDPEGILGTINSILMAFFGLQ 278

Query: 308 FGHVIIHTKGH----LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            G +I+  +      L R   W + +G    I  +       IP+NK L++LS+V  TS 
Sbjct: 279 AGKIILMYRNKPRSILRRFLIWSILLGIISAILTMGTQNEGFIPVNKNLWSLSFVTTTSC 338

Query: 363 AAALVFSAIYALVDCN 378
            + ++   +Y + D N
Sbjct: 339 FSFILLGLLYYITDVN 354


>gi|23272280|gb|AAH24084.1| Hgsnat protein [Mus musculus]
 gi|148700869|gb|EDL32816.1| DNA segment, Chr 8, ERATO Doi 354, expressed [Mus musculus]
          Length = 624

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 180/385 (46%), Gaps = 54/385 (14%)

Query: 17  SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           ++P  +D Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD 
Sbjct: 210 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 269

Query: 76  VMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           V P+F+FI+G +I L++  I  R       + K+++R+  L+  G+++       P+   
Sbjct: 270 VFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCL 324

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
             +    +R+ GVLQR+ ++Y +V+++E F  K V   D      S F L      W  W
Sbjct: 325 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 382

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L    +  ++LAL +   VP      +      D GK  + T G          A GYID
Sbjct: 383 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 432

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y HP+                            ++PEG+L +++SI+   +
Sbjct: 433 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 474

Query: 305 GVHFGHVIIH----TKGHLARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           GV  G ++++    TK  L R   W   +G   ++          IP+NK L+++SYV  
Sbjct: 475 GVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINKNLWSISYVTT 534

Query: 360 TSGAAALVFSAIYALVDCNIFGPWT 384
            S  A  +   +Y +VD  + G WT
Sbjct: 535 LSCFAFFILLILYPVVD--VKGLWT 557


>gi|158294726|ref|XP_315774.3| AGAP005761-PA [Anopheles gambiae str. PEST]
 gi|157015699|gb|EAA10745.3| AGAP005761-PA [Anopheles gambiae str. PEST]
          Length = 581

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 111/354 (31%), Positives = 164/354 (46%), Gaps = 57/354 (16%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD FRG+A+ LMI V+  GG +  I HA WNG ++AD V P+FLFI+GV + ++L
Sbjct: 202 KRLQSLDTFRGIAIMLMIFVNSGGGHYWWIEHATWNGLHVADLVFPWFLFIMGVCVPISL 261

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           +   +R         R++KL   G+ L             G  +  +R+ GVLQR  ++Y
Sbjct: 262 RGQLNRN--------RSVKLFIIGLCLNS---------MNGPSMANLRIFGVLQRFGIAY 304

Query: 153 LLVSLVEIFTKDVQDKDQSVGRF-----SIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           L+VS V +   + Q + QS  R       I RL    WL+   + V+YL +++    P  
Sbjct: 305 LVVSTVHLLCHEQQVQVQSQNRLLRASEDIVRLKK-QWLVIGLLTVLYLVVMFFVPAPGC 363

Query: 208 QFTIINKDSADYGKVF-NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                          F N T G+           GYIDR +LGI H+Y HP  R      
Sbjct: 364 PSAYFGPGGKHLYNAFPNCTGGI----------TGYIDRALLGIAHLYQHPTARYV---- 409

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
               ++G            PF+PEG    + +IL   +G+  G  I+    H  R+ ++ 
Sbjct: 410 ----YDG-----------MPFDPEGPFGCLPTILQVFLGLQCGCTILAYTEHRQRMVRFA 454

Query: 327 TMGFAL-LIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           +    L L  G    FT     IP+NK L++LSYV  T+  A  +    Y  +D
Sbjct: 455 SWSLVLGLAAGALCGFTKNDGWIPINKNLWSLSYVLATASLAHALLLLCYYAID 508


>gi|74198170|dbj|BAE35261.1| unnamed protein product [Mus musculus]
          Length = 624

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 109/377 (28%), Positives = 175/377 (46%), Gaps = 53/377 (14%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
           Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD V P+F+FI
Sbjct: 218 QPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFI 277

Query: 84  VGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           +G +I L++  I  R       + K+++R+  L+  G+++       P+     +    +
Sbjct: 278 MGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCLGPLSWDKV 332

Query: 140 RLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HWLMAACVLV 193
           R+ GVLQR+ ++Y +V+++E F  K V   D      S F L      W  WL    +  
Sbjct: 333 RIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQWLTILTLES 390

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           ++LAL +   VP      +      D GK  + T G          A GYIDR +LG NH
Sbjct: 391 IWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYIDRLLLGDNH 440

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP+                            ++PEG+L +++SI+   +GV  G ++
Sbjct: 441 LYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFLGVQAGKIL 482

Query: 313 IH----TKGHLARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
           ++    TK  L R   W   +G   ++          IP+NK L+++SYV   S  A  +
Sbjct: 483 VYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINKNLWSISYVTTLSCFAFFI 542

Query: 368 FSAIYALVDCNIFGPWT 384
              +Y +VD  + G WT
Sbjct: 543 LLILYPVVD--VKGLWT 557


>gi|124007195|sp|Q68CP4.2|HGNAT_HUMAN RecName: Full=Heparan-alpha-glucosaminide N-acetyltransferase;
           AltName: Full=Transmembrane protein 76
          Length = 663

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 179/366 (48%), Gaps = 51/366 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 326

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 327 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 381

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 382 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 441

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 442 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 487

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 488 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 533

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD  
Sbjct: 534 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVD-- 590

Query: 379 IFGPWT 384
           + G WT
Sbjct: 591 VKGLWT 596


>gi|26330552|dbj|BAC29006.1| unnamed protein product [Mus musculus]
 gi|74213594|dbj|BAE35603.1| unnamed protein product [Mus musculus]
 gi|74225342|dbj|BAE31601.1| unnamed protein product [Mus musculus]
          Length = 624

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 180/385 (46%), Gaps = 54/385 (14%)

Query: 17  SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           ++P  +D Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD 
Sbjct: 210 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 269

Query: 76  VMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           V P+F+FI+G +I L++  I  R       + K+++R+  L+  G+++       P+   
Sbjct: 270 VFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCL 324

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
             +    +R+ GVLQR+ ++Y +V+++E F  K V   D      S F L      W  W
Sbjct: 325 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 382

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L    +  ++LAL +   VP      +      D GK  + T G          A GYID
Sbjct: 383 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 432

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y HP+                            ++PEG+L +++SI+   +
Sbjct: 433 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 474

Query: 305 GVHFGHVIIH----TKGHLARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           GV  G ++++    TK  L R   W   +G   ++          IP+NK L+++SYV  
Sbjct: 475 GVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINKNLWSISYVTT 534

Query: 360 TSGAAALVFSAIYALVDCNIFGPWT 384
            S  A  +   +Y +VD  + G WT
Sbjct: 535 LSCFAFFILLILYPVVD--VKGLWT 557


>gi|350412149|ref|XP_003489557.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Bombus impatiens]
          Length = 571

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 110/364 (30%), Positives = 183/364 (50%), Gaps = 54/364 (14%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           + R+ S+D FRG+A+ LMI V++ GG +   +H+ W G ++AD ++P+F +I+G++I ++
Sbjct: 179 SSRIQSVDAFRGIAILLMIFVNNGGGKYVFFNHSAWFGLSVADLILPWFAWIMGMSITIS 238

Query: 92  ----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
               L+    R       + R+  L+  G++L    S + ++L         R  G+LQ 
Sbjct: 239 KRAELRLTTSRVKITLCCLRRSAILILLGLMLNSIDSKSLNDL---------RFPGILQL 289

Query: 148 IALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGT 202
           +A+SY + +++E IF K   QD     GRF+IFR  L  W  WL+ A ++  +  + +  
Sbjct: 290 LAVSYFVCAILETIFMKPHSQDILLQFGRFAIFRDILDSWPQWLIMAGIMTTHTLITFFL 349

Query: 203 YVPDWQFTIINKDSADY--GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
           ++P+            +  GK  N T G          A GYIDR + G NH Y      
Sbjct: 350 HMPNCPTGYFGPGGKYHYRGKYMNCTAG----------AAGYIDRLIFG-NHTYSK---- 394

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
                 +DS +   LR D          PEGL++++S+I    +GVH G +++      A
Sbjct: 395 -----IKDSIYGQILRYD----------PEGLMNTISAIFIVYLGVHAGKILLLYYQGNA 439

Query: 321 RLKQWVTMG-FALLIFGLTLHFTN---AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           RL +W     F  +I G+  +F N    IP++K++ TLSYV   S  A L+++ +Y L+D
Sbjct: 440 RLIRWFLWAIFTGIIAGILCNFENEGGVIPVSKRMMTLSYVLTCSSFAFLLYAILYFLID 499

Query: 377 CNIF 380
              F
Sbjct: 500 YKQF 503


>gi|150378452|ref|NP_689632.2| heparan-alpha-glucosaminide N-acetyltransferase precursor [Homo
           sapiens]
 gi|332826066|ref|XP_519741.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Pan
           troglodytes]
 gi|194385774|dbj|BAG65262.1| unnamed protein product [Homo sapiens]
 gi|410222096|gb|JAA08267.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
 gi|410256018|gb|JAA15976.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
 gi|410299048|gb|JAA28124.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
          Length = 635

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 178/366 (48%), Gaps = 51/366 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 239 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 298

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 299 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 353

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 354 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 413

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 414 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 459

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 460 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 505

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD  
Sbjct: 506 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 564

Query: 379 IFGPWT 384
             G WT
Sbjct: 565 --GLWT 568


>gi|410332579|gb|JAA35236.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
          Length = 635

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 178/366 (48%), Gaps = 51/366 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 239 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 298

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 299 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 353

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 354 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 413

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 414 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 459

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 460 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 505

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD  
Sbjct: 506 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 564

Query: 379 IFGPWT 384
             G WT
Sbjct: 565 --GLWT 568


>gi|432099917|gb|ELK28811.1| Heparan-alpha-glucosaminide N-acetyltransferase [Myotis davidii]
          Length = 586

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 111/365 (30%), Positives = 177/365 (48%), Gaps = 49/365 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL  +D FRGLA+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 190 RLRCVDTFRGLALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 249

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + KV +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 250 SILQRGCSKFRLLGKVAWRSFLLICIGIVIVN-----PNYCLGPLSWDKVRIPGVLQRLG 304

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S  R S  +     W  WL+   +  V+LAL +   VP
Sbjct: 305 VTYFVVAVLELLFAKPVPESCVSERRCSCLQDITSSWPQWLVILMLESVWLALTFFLPVP 364

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 365 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHIYQHPSSNVLYH 414

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
            T                    ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 415 TT------------------VAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILI 456

Query: 321 RLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNI 379
           R    W  +G   ++          IP+NK L+++SYV   S  A  +   +Y +VD  +
Sbjct: 457 RFTAWWCFLGLISVVLTKVSENEGFIPVNKNLWSISYVTTLSSFAFFLLLILYPIVD--V 514

Query: 380 FGPWT 384
            G WT
Sbjct: 515 KGLWT 519


>gi|397505549|ref|XP_003823319.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform
           1 [Pan paniscus]
          Length = 585

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 178/366 (48%), Gaps = 51/366 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 189 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 248

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 249 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 303

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 304 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 363

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 364 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 409

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 410 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 455

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD  
Sbjct: 456 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK 514

Query: 379 IFGPWT 384
             G WT
Sbjct: 515 --GLWT 518


>gi|345482764|ref|XP_001600799.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Nasonia vitripennis]
          Length = 569

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 100/350 (28%), Positives = 165/350 (47%), Gaps = 49/350 (14%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D  + +       +R+ SLD  RG+++ LMI V++    +  + HA WNG  + D V P 
Sbjct: 173 DEDELEVGKKTAKRRVRSLDTVRGMSILLMIFVNNGAAGYALLEHATWNGLLVGDLVFPC 232

Query: 80  FLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYG 133
           F++I+GV I L+    L R   R    + ++ R++ L   G+ L   GG +         
Sbjct: 233 FMWIMGVCIPLSISAQLSRGSSRLRLCRAIVKRSVYLFAIGLALNTLGGRNQ-------- 284

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
             +  IR+ GVLQR  L+YL+  +V        DK        +  L    W++A  +L 
Sbjct: 285 --LERIRIFGVLQRFGLAYLVAGIVYALAARPDDKQSKRMLGDVVALIP-QWIVALLILA 341

Query: 194 VYLALLYGTYVPDWQFTIINKDS--ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
            + A+++   VP      +      AD GK +N + G          A GY+D+ +LG++
Sbjct: 342 AHCAVVFLLPVPGCPRGYLGPGGRHAD-GKYWNCSGG----------ATGYVDKVLLGVD 390

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H+Y  P        T +S +              PF+PEG+L S++SI    +G+  G +
Sbjct: 391 HIYQLP--------TANSVYG-----------SGPFDPEGVLGSLTSIFQVFLGIQAGQI 431

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
           +       ARL +W+     L   G  LH+TN +P+NK L+++S+V VT+
Sbjct: 432 LRTYGSWKARLVRWLLWAVLLGAVGAALHYTNVVPVNKNLWSVSFVLVTT 481


>gi|51491261|emb|CAH18694.1| hypothetical protein [Homo sapiens]
          Length = 459

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 182/382 (47%), Gaps = 51/382 (13%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
           + DV     +      RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V 
Sbjct: 47  DGDVQPATWRLSALPPRLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVF 106

Query: 78  PFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           P+F+FI+G +I L++  I  R  +    + K+ +R+  L+  GI++       P+     
Sbjct: 107 PWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGP 161

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAA 189
           +    +R+ GVLQR+ ++Y +V+++E +F K V +   S       R     W  WL+  
Sbjct: 162 LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLIL 221

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
            +  ++L L +   VP      +      D+GK  N T G          A GYIDR +L
Sbjct: 222 VLEGLWLGLTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLL 271

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G +H+Y HP    S A    +                 ++PEG+L +++SI+   +GV  
Sbjct: 272 GDDHLYQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLGVQA 313

Query: 309 GHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSG 362
           G ++++    TK  L R   W  +   L+   LT    N   IP+NK L++LSYV   S 
Sbjct: 314 GKILLYYKARTKDILIRFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSS 372

Query: 363 AAALVFSAIYALVDCNIFGPWT 384
            A  +   +Y +VD    G WT
Sbjct: 373 FAFFILLVLYPVVDVK--GLWT 392


>gi|194762450|ref|XP_001963347.1| GF20351 [Drosophila ananassae]
 gi|190629006|gb|EDV44423.1| GF20351 [Drosophila ananassae]
          Length = 576

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 112/367 (30%), Positives = 175/367 (47%), Gaps = 52/367 (14%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYTWIDHAAWNGLHLADLVFPSFLWIMGVCIPL 241

Query: 91  ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           ++K    R   +     ++++R++KL   G+ L        +EL         R+ GVLQ
Sbjct: 242 SVKAQLSRGASKGRICLRILWRSIKLFAIGLCLNSMSGPGLEEL---------RIMGVLQ 292

Query: 147 RIALSYLLVSLVEIFT--KDVQDKDQSVGR-FSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           R  +++L+V ++      +D     +S  R      L+     +   ++  YL L +G  
Sbjct: 293 RFGVAFLVVGVLHTLCSRRDPISPQRSWQRAVHDICLFSGELAVLLALVATYLGLTFGLR 352

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGINHMYHHPAWRR 261
           VP        K     G  F+         NP C   A GY+D KVLG  H+Y HP    
Sbjct: 353 VPG-----CPKGYLGPGGKFDYAS------NPNCIGGAAGYVDLKVLGNAHIYQHP---- 397

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
           +     DS               A F+PEG+   + S++  ++G   G  ++      +R
Sbjct: 398 TAKYVYDS---------------AAFDPEGIFGCILSVVQVLLGAFAGVTLLVHPTWQSR 442

Query: 322 LKQWVTMGFAL-LIFGLTLHFT---NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           +++W+ +   L LI G    F+    AIP+NK L++LS+VCVT   A ++ S +Y  +D 
Sbjct: 443 IRRWLILAVVLGLIGGALCGFSREGGAIPVNKNLWSLSFVCVTVSLALIILSLLYYFIDV 502

Query: 378 NIFGPWT 384
                W+
Sbjct: 503 RQTWNWS 509


>gi|193664422|ref|XP_001945789.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           isoform 1 [Acyrthosiphon pisum]
          Length = 568

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 108/361 (29%), Positives = 174/361 (48%), Gaps = 52/361 (14%)

Query: 17  SEPDVSDQQEKSHLKTQ--RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
           +EP +   Q  + +K    R+ SLD FRG+++ LM+ V+  GG +    HAPWNG  LAD
Sbjct: 162 NEPVIIHPQIPTPVKNNSYRITSLDTFRGISIILMVFVNLGGGHYWFFEHAPWNGITLAD 221

Query: 75  FVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           F++P+F +++GV+IA++    L+    R     +VI R++ LL  G++L           
Sbjct: 222 FILPWFCWVMGVSIAISLRSQLRSSTKRKYVFGRVIRRSIALLIMGLVLN---------S 272

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR---LYCWHWLM 187
               ++R  R  GVLQR+AL Y + + +E      Q    +  R  + R        W +
Sbjct: 273 VNNNNLRTFRPLGVLQRLALIYFIAATLETIFMKPQPYFTNT-RLDVIRDIIESARQWFI 331

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
              ++ ++  + +   VP        K     G ++N +    +  N    A GYIDR V
Sbjct: 332 VIILVAIHTVITFFLPVPG-----CPKGYLGPGGLYNSS----SNTNCTGGAAGYIDRLV 382

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
            G NHMY              SP         P +   PF+PEG+LS++++ L   +GVH
Sbjct: 383 FGENHMY------------PGSP--------KPVYQSIPFDPEGILSTLTNTLLVYMGVH 422

Query: 308 FGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGA 363
            G +I+  +    R+K+W+     L LI G   +F+     IP+NK L++LSY  +T  +
Sbjct: 423 AGRIILCYQYTNERIKRWIAWTIVLGLIGGCLCNFSKEDGLIPINKNLFSLSYAFITGSS 482

Query: 364 A 364
           A
Sbjct: 483 A 483


>gi|328696746|ref|XP_003240114.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           isoform 2 [Acyrthosiphon pisum]
          Length = 591

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 108/361 (29%), Positives = 174/361 (48%), Gaps = 52/361 (14%)

Query: 17  SEPDVSDQQEKSHLKTQ--RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
           +EP +   Q  + +K    R+ SLD FRG+++ LM+ V+  GG +    HAPWNG  LAD
Sbjct: 185 NEPVIIHPQIPTPVKNNSYRITSLDTFRGISIILMVFVNLGGGHYWFFEHAPWNGITLAD 244

Query: 75  FVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           F++P+F +++GV+IA++    L+    R     +VI R++ LL  G++L           
Sbjct: 245 FILPWFCWVMGVSIAISLRSQLRSSTKRKYVFGRVIRRSIALLIMGLVLN---------S 295

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR---LYCWHWLM 187
               ++R  R  GVLQR+AL Y + + +E      Q    +  R  + R        W +
Sbjct: 296 VNNNNLRTFRPLGVLQRLALIYFIAATLETIFMKPQPYFTNT-RLDVIRDIIESARQWFI 354

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
              ++ ++  + +   VP        K     G ++N +    +  N    A GYIDR V
Sbjct: 355 VIILVAIHTVITFFLPVPG-----CPKGYLGPGGLYNSS----SNTNCTGGAAGYIDRLV 405

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
            G NHMY              SP         P +   PF+PEG+LS++++ L   +GVH
Sbjct: 406 FGENHMY------------PGSP--------KPVYQSIPFDPEGILSTLTNTLLVYMGVH 445

Query: 308 FGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGA 363
            G +I+  +    R+K+W+     L LI G   +F+     IP+NK L++LSY  +T  +
Sbjct: 446 AGRIILCYQYTNERIKRWIAWTIVLGLIGGCLCNFSKEDGLIPINKNLFSLSYAFITGSS 505

Query: 364 A 364
           A
Sbjct: 506 A 506


>gi|380789677|gb|AFE66714.1| heparan-alpha-glucosaminide N-acetyltransferase precursor [Macaca
           mulatta]
 gi|383410547|gb|AFH28487.1| heparan-alpha-glucosaminide N-acetyltransferase [Macaca mulatta]
 gi|384945386|gb|AFI36298.1| heparan-alpha-glucosaminide N-acetyltransferase [Macaca mulatta]
          Length = 635

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 107/365 (29%), Positives = 174/365 (47%), Gaps = 49/365 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 239 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 298

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 299 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 353

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 354 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP 413

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 414 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS------ 457

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
                                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 458 ------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILI 505

Query: 321 RLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNI 379
           R   W   +G   ++          IP+NK L++LSYV   S  A  +   +Y +VD   
Sbjct: 506 RFTAWCCILGLISVVLTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK- 564

Query: 380 FGPWT 384
            G WT
Sbjct: 565 -GLWT 568


>gi|291409013|ref|XP_002720836.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Oryctolagus cuniculus]
          Length = 613

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 111/378 (29%), Positives = 179/378 (47%), Gaps = 53/378 (14%)

Query: 24  QQEKSHLKT--QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           Q E  HL     RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+
Sbjct: 205 QPETWHLSAAKHRLRCVDTFRGIALVLMVFVNYGGGRYWYFRHSSWNGLTVADLVFPWFV 264

Query: 82  FIVGVAIAL----ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FI+G +I L    AL+R   +   + K+ +R+  L+  GI++       P+     +   
Sbjct: 265 FIMGSSIFLSMMSALQRGCSKLRLLGKIAWRSFLLIMIGIVIVN-----PNYCLGPLSWD 319

Query: 138 MIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLV 193
            +R+ GVLQR+ ++Y +V+++E +F K V +        +  R     W  WL+   +  
Sbjct: 320 KVRIPGVLQRLGVTYFVVAVLELLFAKPVPENWVLESSCTCLRDVTSSWPQWLLILLLES 379

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           ++L L +   VP      +      D+GK  N T G          A GYIDR +LG +H
Sbjct: 380 IWLGLSFFLPVPGCPTGYLGPGGIGDWGKYPNCTGG----------AAGYIDRVLLGDDH 429

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP+                            ++PEG+L +++SI++  +GV  G ++
Sbjct: 430 LYKHPS------------------STVLYHTEVAYDPEGILGTINSIVTAFLGVQAGKIL 471

Query: 313 I----HTKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAAL 366
           +     TK  L R   W  +   L+   LT    N   IP+NK L+++SYV   S  A  
Sbjct: 472 LFYKDQTKSILIRFTAWSCI-LGLISVALTKISENEGFIPINKNLWSISYVTTLSSFAFF 530

Query: 367 VFSAIYALVDCNIFGPWT 384
           +   +Y +VD    G WT
Sbjct: 531 ILLVLYPIVDVK--GLWT 546


>gi|355779672|gb|EHH64148.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial [Macaca
           fascicularis]
          Length = 596

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 107/365 (29%), Positives = 174/365 (47%), Gaps = 49/365 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 200 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 259

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 260 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 314

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 315 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP 374

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 375 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS------ 418

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
                                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 419 ------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILI 466

Query: 321 RLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNI 379
           R   W   +G   ++          IP+NK L++LSYV   S  A  +   +Y +VD   
Sbjct: 467 RFTAWCCILGLISVVLTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK- 525

Query: 380 FGPWT 384
            G WT
Sbjct: 526 -GLWT 529


>gi|301765942|ref|XP_002918389.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Ailuropoda melanoleuca]
          Length = 851

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 108/368 (29%), Positives = 181/368 (49%), Gaps = 51/368 (13%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           + RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+FI+G ++ L+
Sbjct: 453 SPRLRCVDTFRGIALILMVFVNYGGGRYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLS 512

Query: 92  LKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           +  +  R       + K+ +R+  L+  G+++       P+     +    +R+ GVLQR
Sbjct: 513 MTSVLQRGCSKFKLLGKIAWRSFLLICIGVVIVN-----PNYCLGPLSWDKVRIPGVLQR 567

Query: 148 IALSYLLVSLVE-IFTKDVQDKDQSV-GRFSIFR-LYCW-HWLMAACVLVVYLALLYGTY 203
           + ++Y +V+++E IF K V +   S  G FS+   ++ W  WL    +  ++L L +   
Sbjct: 568 LGVTYFVVAVLELIFAKPVPESCASERGCFSLRDIIFSWPQWLFILMLESIWLGLTFFLP 627

Query: 204 VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           VP      +      D+GK  N T G          A GYIDR +LG +H+Y HP    S
Sbjct: 628 VPGCPTGYLGPGGIGDWGKYPNCTGG----------AAGYIDRLLLGDDHIYQHP----S 673

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGH 318
            A    +                 ++PEG+L +++SI+   +G+  G ++++    TK  
Sbjct: 674 SAVLYHT--------------EVAYDPEGILGTINSIVMAFLGIQAGKILLYYKDQTKDI 719

Query: 319 LARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L R   W      L+   LT    N   IP+NK L+++SYV   S  A  +   +Y +VD
Sbjct: 720 LIRFTAWCCF-LGLISVALTKVSENEGFIPINKNLWSISYVTTLSSFAFFILLILYPIVD 778

Query: 377 CNIFGPWT 384
             + G WT
Sbjct: 779 --VKGLWT 784


>gi|195476975|ref|XP_002100049.1| GE16376 [Drosophila yakuba]
 gi|194187573|gb|EDX01157.1| GE16376 [Drosophila yakuba]
          Length = 576

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 112/376 (29%), Positives = 175/376 (46%), Gaps = 70/376 (18%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADVVFPSFLWIMGVCIPL 241

Query: 91  ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           ++K    R   +A    ++++R++KL   G+ L             G ++  +R  GVLQ
Sbjct: 242 SVKSQLSRGSSKARICLRILWRSIKLFVIGLCLNS---------MSGPNLEQLRFMGVLQ 292

Query: 147 RIALSYLLVSLV-------------EIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           R  ++YL+V ++              ++ + V D     G  ++        L+A  ++ 
Sbjct: 293 RFGVAYLVVGVLHTLCCRREPISPQRLWQRAVHDVCLFSGELAV--------LLA--LVA 342

Query: 194 VYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
            YL L YG  VP      +      DY    N   G          A GY+D +VLG  H
Sbjct: 343 TYLGLTYGLRVPGCPRGYLGPGGKHDYNAHPNCIGG----------AAGYVDLQVLGNAH 392

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP    +     DS                 F+PEG+   + S++  ++G   G  +
Sbjct: 393 IYQHP----TAKYVYDS---------------TAFDPEGIFGCILSVVQVLLGAFAGVTL 433

Query: 313 IHTKGHLARLKQWVTMGFAL-LIFGLTLHFT---NAIPLNKQLYTLSYVCVTSGAAALVF 368
           +      +R+++W  +   L LI G    F+    AIP+NK L++LS+VCVT   A L+ 
Sbjct: 434 LVHPNWQSRIRRWTFLAILLGLIGGALCGFSREGGAIPVNKNLWSLSFVCVTVSLALLIL 493

Query: 369 SAIYALVDCNIFGPWT 384
           S +Y  +D      W+
Sbjct: 494 SLLYYFIDVRETWSWS 509


>gi|432907420|ref|XP_004077635.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Oryzias latipes]
          Length = 482

 Score =  137 bits (346), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 105/359 (29%), Positives = 175/359 (48%), Gaps = 47/359 (13%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           +  RL SLD FRG A+ +M+ V++ GG +    HAPWNG  +AD VMP+F+FI+G ++ L
Sbjct: 85  RPARLLSLDTFRGFALTVMVFVNYGGGGYWFFEHAPWNGLTVADLVMPWFVFIMGTSVVL 144

Query: 91  AL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           A     +R   R   + K+ +RT+ L+  G       +++P +    +    +R+ GVLQ
Sbjct: 145 AFSSMQRRGVGRRQLLGKITWRTVVLMLLGFCF---LNYSPRDGP--LSWSWLRIPGVLQ 199

Query: 147 RIALSYLLVSLVEIF--TKDV-QDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           R+A +Y ++SL++ F   K + + ++          L+   WL+   +  ++L + +   
Sbjct: 200 RLAFTYFVLSLLQTFWGRKAIPESENHWWNPVQDVVLFWPQWLLIFLLETLWLCITFLMP 259

Query: 204 VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           VP+     +      D+G   N T G          A G IDR + G N MY +P  ++ 
Sbjct: 260 VPNCPTGYLGAGGIGDHGLYPNCTGG----------AAGSIDRWMFGDN-MYRYPTCKKL 308

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK---GH- 318
               Q                  P++PEG+L +++SI+   +G+  G +I+  K   GH 
Sbjct: 309 YRTEQ------------------PYDPEGVLGTINSIVMGFLGMQAGKIIVFYKRKSGHI 350

Query: 319 LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L R   W V +G +  I          IP+NK L++LSYV  T   + L+   +Y L+D
Sbjct: 351 LWRYLTWAVILGISAAILSKCTRDGGFIPVNKNLWSLSYVTCTGALSFLLLGGMYFLID 409


>gi|363733262|ref|XP_420455.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Gallus
           gallus]
          Length = 581

 Score =  137 bits (346), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 109/354 (30%), Positives = 169/354 (47%), Gaps = 39/354 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G +I+L+L
Sbjct: 184 QRLRSLDTFRGLSLIIMVFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSL 243

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
                   + +KV+++ L   F  ILL G     P+     +    +R+ GVLQR+ L+Y
Sbjct: 244 SSTLRWGSSKQKVLWKILWRSFLLILL-GVIVVNPNYCLGALSWENLRIPGVLQRLGLTY 302

Query: 153 LLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           L+V+ +E +FT+   D    + S         +   W+    + V++L L +   VP   
Sbjct: 303 LVVAALELLFTRTGADSGTLEMSCPALQDILPFWPQWIFILMLEVIWLCLTFLLPVPGCP 362

Query: 209 FTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
              +      D+G   N T G          A GYIDR VLG  H+Y HP+       T 
Sbjct: 363 RGYLGPGGIGDFGNYLNCTGG----------AAGYIDRLVLGEKHIYQHPSCNVLYQTT- 411

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----LARLK 323
                             P++PEG+L ++++IL   +G+  G +I+  K      ++R  
Sbjct: 412 -----------------VPYDPEGILGTINTILMAFLGLQAGKIILSYKDQHKQIMSRFF 454

Query: 324 QW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            W V MG    I          IP+NK L++ SYV   S  A ++   +Y LVD
Sbjct: 455 IWSVVMGIISAILTKCSKEEGFIPINKNLWSTSYVTTMSCFAFILLLLMYYLVD 508


>gi|281351504|gb|EFB27088.1| hypothetical protein PANDA_006846 [Ailuropoda melanoleuca]
          Length = 557

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 108/368 (29%), Positives = 180/368 (48%), Gaps = 51/368 (13%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           + RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+FI+G ++ L+
Sbjct: 159 SPRLRCVDTFRGIALILMVFVNYGGGRYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLS 218

Query: 92  LKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           +  +  R       + K+ +R+  L+  G+++       P+     +    +R+ GVLQR
Sbjct: 219 MTSVLQRGCSKFKLLGKIAWRSFLLICIGVVIVN-----PNYCLGPLSWDKVRIPGVLQR 273

Query: 148 IALSYLLVSLVE-IFTKDVQDKDQSV-GRFSIFR-LYCW-HWLMAACVLVVYLALLYGTY 203
           + ++Y +V+++E IF K V +   S  G FS+   ++ W  WL    +  ++L L +   
Sbjct: 274 LGVTYFVVAVLELIFAKPVPESCASERGCFSLRDIIFSWPQWLFILMLESIWLGLTFFLP 333

Query: 204 VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           VP      +      D+GK  N T G          A GYIDR +LG +H+Y HP    S
Sbjct: 334 VPGCPTGYLGPGGIGDWGKYPNCTGG----------AAGYIDRLLLGDDHIYQHP----S 379

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGH 318
            A    +                 ++PEG+L +++SI+   +G+  G ++++    TK  
Sbjct: 380 SAVLYHT--------------EVAYDPEGILGTINSIVMAFLGIQAGKILLYYKDQTKDI 425

Query: 319 LARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L R   W      L+   LT    N   IP+NK L+++SYV   S  A  +   +Y +VD
Sbjct: 426 LIRFTAWCCF-LGLISVALTKVSENEGFIPINKNLWSISYVTTLSSFAFFILLILYPIVD 484

Query: 377 CNIFGPWT 384
               G WT
Sbjct: 485 VK--GLWT 490


>gi|395507548|ref|XP_003758085.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Sarcophilus harrisii]
          Length = 634

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 103/352 (29%), Positives = 172/352 (48%), Gaps = 47/352 (13%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D+  +  +  L   RL SLD FRG+A+ +M+ V++ GG +    H  WNG  LAD V P+
Sbjct: 224 DLQVEAWRLTLPVYRLRSLDTFRGIALIIMVFVNYGGGKYWFFKHESWNGLTLADLVFPW 283

Query: 80  FLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           F+FI+G +IAL+L  +  R       + K+++R+L L   GI +       P+     + 
Sbjct: 284 FVFIMGSSIALSLSSMLRRGCSKWKLLGKILWRSLLLCVIGIFIVN-----PNYCLGPLS 338

Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYCWHWLMAACV 191
              +R+ GVLQR+ L+YL+V+++E +F K V +    ++S   F     Y   W+    +
Sbjct: 339 WDKLRIPGVLQRLGLTYLVVAVLELLFAKAVPENSAMERSCSSFQDIISYWPQWIFILML 398

Query: 192 LVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
              ++ + +   VP      +      D+GK  N T G          A GYIDR +LG 
Sbjct: 399 EAAWVCVTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGE 448

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           +H+Y HP+                   +       P++PEGLL +++SI+   +GV  G 
Sbjct: 449 DHIYQHPS------------------PNVLYHTKVPYDPEGLLGTINSIVMAFLGVQAGK 490

Query: 311 VIIHTKGHLARLKQWVTMGFALL--IFGLTLHFTNA---IPLNKQLYTLSYV 357
           +++  K    ++     +  A+L  I G+   F+     IP+NK L+++SYV
Sbjct: 491 ILLFYKDQPKQIMLRFLLWSAMLGIISGVLTKFSQNEGFIPVNKNLWSISYV 542


>gi|355697915|gb|EHH28463.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial [Macaca
           mulatta]
          Length = 596

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 106/365 (29%), Positives = 174/365 (47%), Gaps = 49/365 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 200 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 259

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            +  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 260 SVLQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 314

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 315 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP 374

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 375 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS------ 418

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
                                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 419 ------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILI 466

Query: 321 RLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNI 379
           R   W   +G   ++          IP+NK L++LSYV   S  A  +   +Y +VD   
Sbjct: 467 RFTAWCCILGLISVVLTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK- 525

Query: 380 FGPWT 384
            G WT
Sbjct: 526 -GLWT 529


>gi|91078976|ref|XP_974454.1| PREDICTED: similar to heparan-alpha-glucosaminide
           N-acetyltransferase [Tribolium castaneum]
 gi|270003686|gb|EFA00134.1| hypothetical protein TcasGA2_TC002950 [Tribolium castaneum]
          Length = 566

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 107/364 (29%), Positives = 179/364 (49%), Gaps = 41/364 (11%)

Query: 19  PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
           P V ++         R+ S+D+FRG  + +MI V++ GG +   SH+ WNG  +AD V P
Sbjct: 166 PLVIERTPSIRKHPHRIKSIDVFRGFCIMIMIFVNYGGGKYWFFSHSVWNGLTVADLVFP 225

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           +FL+++GV+ A++L+    RA   ++++   ++  F  ILL    +   +  T G     
Sbjct: 226 WFLWLMGVSFAVSLQAKLRRAVPRRQLVIGVMRRSFILILLGIIINSNQNLQTIG----S 281

Query: 139 IRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
           +R  GVLQRI + Y +V ++E IFTK  + + +SV       +    WL    ++V++  
Sbjct: 282 LRFPGVLQRIGVCYFIVGMLEIIFTK--RSEVESVSCIYDVAVAWPQWLCVTVLVVIHTC 339

Query: 198 LLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
           + +   VP      +      D G+ +N T GV           GYIDR+V G  HM+ +
Sbjct: 340 VTFLGDVPGCGRGYLGPGGLDDNGRFYNCTGGV----------AGYIDRQVFG-EHMHKN 388

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
           P  ++                         F+PEG+L +++S+L+   GV  G  +   +
Sbjct: 389 PVCKKLYE------------------IDVYFDPEGILGTLTSVLTVYFGVQAGRTLNTYQ 430

Query: 317 GHLARLKQWVTMG-FALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFSAIY 372
              A++ +WV  G  A L+ G    F      IPLNKQL++LS+  V SG A ++ + ++
Sbjct: 431 NVKAKVIRWVVWGSLAGLLGGALCEFKQNDGLIPLNKQLWSLSFALVLSGMAFIIQAFLF 490

Query: 373 ALVD 376
            LVD
Sbjct: 491 VLVD 494


>gi|410956346|ref|XP_003984803.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Felis
           catus]
          Length = 629

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 110/366 (30%), Positives = 174/366 (47%), Gaps = 51/366 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 233 RLRCVDTFRGIALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSMT 292

Query: 94  RIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R       + K+ +R+  L+  G+     F   P+     +    +R+ GVLQR+ 
Sbjct: 293 SILQRGCSKLKLMGKIGWRSFLLICIGM-----FIVNPNYCLGPLSWDKVRIPGVLQRLG 347

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E IF K V +   S       R  ++ W  WL    +  ++L L +   VP
Sbjct: 348 VTYFVVAMLELIFAKPVPESCASERSCFSLRDIIFSWPQWLFILMLESIWLGLTFFLPVP 407

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 408 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHIYQHP----SSA 453

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 454 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILI 499

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           R   W      L+   LT    N   IP+NK L+++SYV   S  A  +   +Y +VD  
Sbjct: 500 RFTAWCCF-LGLISIALTKISENEGFIPINKNLWSISYVTTLSSFAFFILLILYPIVDVK 558

Query: 379 IFGPWT 384
             G WT
Sbjct: 559 --GLWT 562


>gi|195447210|ref|XP_002071113.1| GK25317 [Drosophila willistoni]
 gi|194167198|gb|EDW82099.1| GK25317 [Drosophila willistoni]
          Length = 537

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 174/370 (47%), Gaps = 53/370 (14%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
            + D   K+  + +RL SLD FRGLA+ LMI V+  GG +  I H  WNG +LAD V P 
Sbjct: 176 SIGDAAAKA-TQRKRLRSLDTFRGLAIVLMIFVNSGGGGYDSIDHVAWNGLHLADLVFPC 234

Query: 80  FLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FL+I+GV I L++K    R   +     ++I+R+ KL   G+ L             G  
Sbjct: 235 FLWIMGVCIPLSIKSQLGRGTSKIQICGRIIWRSFKLFAIGVCLNS---------INGPK 285

Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCWHWLMAACVL 192
           +  +R+ GVLQR  +++L+V L+  + ++      Q   + SI+   ++   + +   ++
Sbjct: 286 LEQLRVMGVLQRFGVAFLVVGLLHTVCSRRDHISPQQAWQRSIYDICIFSGEFAVLLALI 345

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGI 250
             YL L YG  VP      +       GK  N         NP C   A GYID++VLG 
Sbjct: 346 ATYLGLTYGLKVPGCPRGYLGPG----GKSNNAA-------NPHCIGGAAGYIDQQVLGN 394

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            H+Y +P  +     T                    F+PEGL   + S++  ++G   G 
Sbjct: 395 AHIYQYPTAKYVYDAT-------------------AFDPEGLFGCLLSVVHVLLGAFAGV 435

Query: 311 VIIHTKGHLARLKQW-VTMGFALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAAL 366
            ++      +R+K+W +      L+ G    F+     IP+NK L++LS+V VT   A L
Sbjct: 436 TLLVHPTWQSRMKRWTILALLLGLMGGALCGFSKENGLIPVNKNLWSLSFVFVTVTVALL 495

Query: 367 VFSAIYALVD 376
           +   +Y ++D
Sbjct: 496 LLDFLYYIID 505


>gi|307178500|gb|EFN67189.1| Heparan-alpha-glucosaminide N-acetyltransferase [Camponotus
           floridanus]
          Length = 466

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 109/375 (29%), Positives = 190/375 (50%), Gaps = 53/375 (14%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           + E + S+    ++  + R+ S+D FRG+A+ LMI V++ GG++   +H+ WNG  +AD 
Sbjct: 59  LQEAETSNPIIGTNRSSTRIRSVDTFRGIAILLMIFVNNRGGEYVFFNHSAWNGLTVADL 118

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELT 131
           V+P+F +I+G++I ++ +     +++  K+I R L+    L+  G++L    S++   L 
Sbjct: 119 VLPWFAWIMGLSITISKRSELRVSNSRTKIILRCLQRAFILILLGLMLNSIRSNSLQNL- 177

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWL 186
                   R  GVLQ +A+SY + + +E IF +   QD     GRF+  R  L  W  WL
Sbjct: 178 --------RFPGVLQLLAVSYFVCATIETIFMRMHSQDDLLQFGRFTFLRDILNNWAQWL 229

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           +   ++V +  + +   VP+     +     + +G   N T G          A GYIDR
Sbjct: 230 IILAIVVTHTLITFLLPVPNCPTGYLGPGGYSHFGNFPNCTGG----------AAGYIDR 279

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            V G +HMY+             +P  G +          P +PEG++++VS IL   +G
Sbjct: 280 LVFG-SHMYNK----------TKNPVYGTI---------LPHDPEGIMNTVSIILVVYLG 319

Query: 306 VHFGHVIIHTKGHLARLKQWVT-MGFALLIFGLTLHFTN---AIPLNKQLYTLSYVCVTS 361
           VH G +++      AR+ +W+   G   +I GL  +F      IP++K++ +LS+V   S
Sbjct: 320 VHAGKILLLYYQCNARVVRWLLWSGVTGIIAGLLCNFDKEGGVIPVSKKMMSLSFVLTVS 379

Query: 362 GAAALVFSAIYALVD 376
             A L+F+ ++  VD
Sbjct: 380 CFAFLLFTILHFFVD 394


>gi|431902215|gb|ELK08716.1| Heparan-alpha-glucosaminide N-acetyltransferase [Pteropus alecto]
          Length = 585

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 108/365 (29%), Positives = 173/365 (47%), Gaps = 49/365 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL  +D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 189 RLRCVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 248

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + KV +R+  L+  GI     F   P+     +    +R+ GVLQR+ 
Sbjct: 249 SILQRGCSKFRLLGKVTWRSFLLICIGI-----FIVNPNYCLGPLSWDKLRIPGVLQRLG 303

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +      R S  +  +  W  WL    +  ++LAL +   VP
Sbjct: 304 VTYFVVAVLELLFAKPVPESCTVERRCSSLQDIISSWPQWLFILMLESIWLALTFFLPVP 363

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D GK  N T G          A GYIDR +LG +H+Y HP+      
Sbjct: 364 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS------ 407

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
                                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 408 ------------STVLYHTKVAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILI 455

Query: 321 RLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNI 379
           R   W   +G   + F         IP+NK L+++SYV   S  A  +   +Y +VD   
Sbjct: 456 RFTAWCCFLGLMSVAFTKISENEGFIPVNKNLWSISYVTTLSSFAFFILLILYPVVDVK- 514

Query: 380 FGPWT 384
            G W+
Sbjct: 515 -GLWS 518


>gi|426256612|ref|XP_004021932.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Ovis
           aries]
          Length = 641

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 183/385 (47%), Gaps = 52/385 (13%)

Query: 17  SEPDVSDQQEKSHLKTQ---RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
           S    SD Q ++  ++    RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +A
Sbjct: 225 SPSRASDPQPEAWRRSAAPLRLRCVDTFRGMALILMVFVNYGGGKYWYFKHSSWNGLTVA 284

Query: 74  DFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           D V P+F+FI+G +I L++  I  R  +    + K+++R+  L+  GI     F   P+ 
Sbjct: 285 DLVFPWFVFIMGASIFLSMASILQRGCSKLRLLGKIVWRSFLLICIGI-----FVVNPNY 339

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR--FSIFRLYC-W-HW 185
               +     R+ GVLQR+  +Y +V+++E+       +  ++ R  FS+  +   W  W
Sbjct: 340 CLGPLSWEKARIPGVLQRLGATYFVVAVLELLFAKPVPETCALERSCFSLLDITASWPQW 399

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           L    +  V+LAL +   VP      +       G  +    G          A GY+DR
Sbjct: 400 LFVLILEGVWLALTFFLPVPGCPTGYLGPGGIGDGGRYRNCTG---------GAAGYVDR 450

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +LG  H+Y HP    S A    +                 ++PEG+L +++SI+   +G
Sbjct: 451 LLLGDQHLYQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLG 492

Query: 306 VHFGHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCV 359
           V  G ++++    T+G L R   W  +   L+   LT    N   IP+NK L+++SYV  
Sbjct: 493 VQAGKILLYYKDQTRGILIRFAAWGCL-LGLVSVALTKASENEGFIPVNKNLWSVSYVTT 551

Query: 360 TSGAAALVFSAIYALVDCNIFGPWT 384
            S  A L+  A+Y +VD    G WT
Sbjct: 552 LSSLAFLILLALYPVVDVK--GLWT 574


>gi|351712254|gb|EHB15173.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial
           [Heterocephalus glaber]
          Length = 537

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 103/378 (27%), Positives = 177/378 (46%), Gaps = 52/378 (13%)

Query: 17  SEPDVSDQQEKSHLKT---QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
           ++P   D Q ++   +    RL  LD FRG+A+ LM+ V++ GG +    H+ WNG  +A
Sbjct: 180 ADPLTGDPQPEAQCASASGHRLRCLDTFRGIALVLMVFVNYGGGRYWYFKHSSWNGLTVA 239

Query: 74  DFVMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           D V P+F+FI+G ++ L++  +  R       + K+ +R+  L+  GI++       P+ 
Sbjct: 240 DLVFPWFVFIMGSSVFLSMTSVLQRGCSKFKLLGKIAWRSFLLICIGIVIVN-----PNY 294

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYCWHW 185
               +    +R+ GVLQR+ ++Y +V+++E +F K + +    ++S             W
Sbjct: 295 CLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPIPENCVLERSCPSLRDITSSWSQW 354

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L+   +  ++L L +   VP      +      D GK  N T G          A  YID
Sbjct: 355 LLILLLEGIWLGLTFLLPVPGCPTGYLGPGGIGDLGKYANCTGG----------AARYID 404

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
             +LG +H+Y HP+                           P++PEG+L +++SI+   +
Sbjct: 405 HLLLGSDHLYQHPS------------------STVLYHTEVPYDPEGILGTINSIVMAFL 446

Query: 305 GVHFGHVIIHTKGH----LARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVC 358
           GV  G +++  KG     L R   W  +   L+   LT    N   IP+NK L+++SYV 
Sbjct: 447 GVQAGKILLCYKGQTKDILIRFTAWCCV-LGLISVALTKMSENKGFIPINKNLWSISYVT 505

Query: 359 VTSGAAALVFSAIYALVD 376
             S  A  +  A+Y +VD
Sbjct: 506 TLSAFAFFILLALYLVVD 523


>gi|297491309|ref|XP_002698775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Bos
           taurus]
 gi|296472360|tpg|DAA14475.1| TPA: Heparan-alpha-glucosaminide N-acetyltransferase-like [Bos
           taurus]
          Length = 723

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 114/385 (29%), Positives = 183/385 (47%), Gaps = 52/385 (13%)

Query: 17  SEPDVSDQQEKSHLKTQ---RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
           S    SD Q ++  ++    RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +A
Sbjct: 307 SPSRASDPQPEAWRRSAAPLRLRCVDTFRGMALILMVFVNYGGGKYWYFKHSSWNGLTVA 366

Query: 74  DFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           D V P+F+FI+G +I L++  I  R  +    + K+ +R+  L+  GI     F   P  
Sbjct: 367 DLVFPWFVFIMGTSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGI-----FVVNPKY 421

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGR-FSIFRLYC-W-HW 185
               +     R+ GVLQR+  +Y +V+++E +F K V +   S    FS+  +   W  W
Sbjct: 422 CLGPLSWEKARIPGVLQRLGATYFVVAVLELLFAKPVPETCASERSCFSLLDITASWPQW 481

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           L    +  V+LAL +   VP      +       G  +    G          A GY+DR
Sbjct: 482 LFVLILEGVWLALTFFLPVPGCPTGYLGPGGIGDGGRYRNCTG---------GAAGYVDR 532

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +LG  H+Y HP    S A    +                 ++PEG+L +++SI+   +G
Sbjct: 533 LLLGDQHLYQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLG 574

Query: 306 VHFGHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCV 359
           V  G ++++    T+G L R   W  +   L+   LT    N   IP+NK L+++SYV  
Sbjct: 575 VQAGKILLYYKDQTRGILIRFAAWGCL-LGLVSVALTKASENEGFIPVNKNLWSISYVTT 633

Query: 360 TSGAAALVFSAIYALVDCNIFGPWT 384
            S  A L+  A+Y +VD  + G WT
Sbjct: 634 LSSLAFLILLALYPVVD--VKGLWT 656


>gi|432845830|ref|XP_004065874.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Oryzias latipes]
          Length = 622

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 176/358 (49%), Gaps = 47/358 (13%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD FRG+A+ +M+ V++ GG +    H  WNG  +AD V P+F+F++G +IAL++
Sbjct: 225 KRLRSLDTFRGIALVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPWFVFVMGTSIALSI 284

Query: 93  KRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             +      R   ++K+++R+++L   G+     F   P+    G+    +R+ GVLQR+
Sbjct: 285 NSLLRAGLTRGSLLRKIVWRSIQLFLIGV-----FIINPNYCQGGLSWENLRIPGVLQRL 339

Query: 149 ALSYLLVSLVEIFTK----DVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           A SYL+V+ +++       DV   D     F    LY   W++   + V++L+L +   V
Sbjct: 340 AFSYLVVASLDLMVARGHLDVLQTDAWWSPFLDVLLYWPAWVVVLLLEVLWLSLTFLLPV 399

Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           PD     +      D G   N T G          A G++DR +LG  H+Y  P+  R  
Sbjct: 400 PDCPTGYLGPGGIGDMGLYANCTGG----------AAGFLDRWLLGEKHIYQTPS-SRVL 448

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK----GHL 319
             TQ                  P++PEG+L S++S+L   +G+  G +I+H +      +
Sbjct: 449 YLTQ-----------------IPYDPEGVLGSINSVLMAFLGLQAGKIILHYRDLPTSIM 491

Query: 320 ARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           +R   W + MG    +          IP+NK L++LSYV   +  A ++   +Y  VD
Sbjct: 492 SRFLIWGLFMGVISAVLTRCSTDQGFIPVNKNLWSLSYVTTLACFAFVLLVLVYYTVD 549


>gi|354472121|ref|XP_003498289.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cricetulus griseus]
          Length = 782

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 111/378 (29%), Positives = 181/378 (47%), Gaps = 53/378 (14%)

Query: 24  QQEKSHLKT--QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           Q E  H      RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+
Sbjct: 374 QPETRHTSALPYRLRCVDTFRGIALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFV 433

Query: 82  FIVGVAIAL----ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FI+G ++ L    AL R   +   + K+ +R+  L+  GI++       P+     +   
Sbjct: 434 FIMGSSVFLSMTSALHRGCSKFRLLGKITWRSFLLICIGIIVVN-----PNYCLGPLSWD 488

Query: 138 MIRLCGVLQRIALSYLLVSLVE-IFTKDVQDK-DQSVGRFSIFRLYC-W-HWLMAACVLV 193
            +R+ GVLQR+ ++Y +V+++E IF+K V D+        S+  + C W  WL+   +  
Sbjct: 489 KVRIPGVLQRLGVTYFVVAVLELIFSKPVPDRCALERSYLSLRDITCSWPQWLVVLILES 548

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           ++LAL +   VP      +      D GK  + T G          A GYID  +LG NH
Sbjct: 549 IWLALTFFLPVPGCPTGYLGPGGIGDMGKYPHCTGG----------ASGYIDHLLLGDNH 598

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP+                            ++PEG+L +++SI+   +GV  G ++
Sbjct: 599 LYQHPS------------------STVLYHTQVAYDPEGILGTINSIVMAFLGVQAGKIL 640

Query: 313 IH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAAL 366
           ++    TK  L R   W  +   L+   LT    +   IP+NK L+++SYV   S  A  
Sbjct: 641 LYYKDQTKAILMRFTAWCCI-LGLISIALTKMSADEGFIPINKNLWSISYVSTLSCFAFF 699

Query: 367 VFSAIYALVDCNIFGPWT 384
           +   +Y +VD  + G WT
Sbjct: 700 ILLILYPVVD--VKGLWT 715


>gi|195041852|ref|XP_001991329.1| GH12115 [Drosophila grimshawi]
 gi|193901087|gb|EDV99953.1| GH12115 [Drosophila grimshawi]
          Length = 573

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 113/370 (30%), Positives = 174/370 (47%), Gaps = 53/370 (14%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
            + D   K+  + +RL SLD FRGL + LMI V+  GG +  I HA WNG +LAD V P 
Sbjct: 171 SIGDAAAKAT-QRKRLRSLDTFRGLCIVLMIFVNSGGGGYSWIEHAAWNGLHLADIVFPS 229

Query: 80  FLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FL+I+GV I L++K    R   +     ++++R  KL   G+ L             G  
Sbjct: 230 FLWIMGVCIPLSIKAQLARGTSKTRICLRIVWRACKLFAIGLCLNS---------VNGPQ 280

Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCWHWLMAACVL 192
           +  +RL GVLQR  ++YLLV+++  + ++  Q   Q   + +I+   L+   + +   ++
Sbjct: 281 LEQLRLMGVLQRFGIAYLLVAILHTVCSRRDQLSPQRAWQRAIYDICLFSGEFAVLLALI 340

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGI 250
             YL L +G  VP      +       GK  N         +P C   A GYID  VLG 
Sbjct: 341 ATYLGLTFGLRVPGCPVGYLGPG----GKHNNAA-------HPNCIGGAAGYIDLLVLGN 389

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            H+Y HP  +     T                    F+PEG+   + S++ T++G   G 
Sbjct: 390 AHIYQHPTAKYVYDAT-------------------AFDPEGIFGCLLSVVQTLLGAFAGV 430

Query: 311 VIIHTKGHLARLKQWV---TMGFALLIFGLTLHFTNA-IPLNKQLYTLSYVCVTSGAAAL 366
            ++       RLK+W+   T+   L          N  IP+NK L++LS+V VT+  A +
Sbjct: 431 TLLVHSTWQGRLKRWLLVATVLGLLGGALCGFSKENGIIPINKNLWSLSFVFVTAALALV 490

Query: 367 VFSAIYALVD 376
           + S IY +VD
Sbjct: 491 LLSLIYYVVD 500


>gi|24639786|ref|NP_572198.1| CG6903 [Drosophila melanogaster]
 gi|7290544|gb|AAF45996.1| CG6903 [Drosophila melanogaster]
 gi|21483396|gb|AAM52673.1| LD22376p [Drosophila melanogaster]
          Length = 576

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 106/380 (27%), Positives = 174/380 (45%), Gaps = 56/380 (14%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D   +      + +RL SLD FRG+++ LMI V+  GG +  I HA WNG +LAD V P 
Sbjct: 171 DSIGEAATKATQRKRLRSLDTFRGISIVLMIFVNSGGGGYAWIEHAAWNGLHLADVVFPS 230

Query: 80  FLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FL+I+GV I L++K    R   +A    ++++R++KL   G+ L             G +
Sbjct: 231 FLWIMGVCIPLSVKSQLSRGSSKARICLRILWRSIKLFVIGLCLNS---------MSGPN 281

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFT--KDVQDKDQSVGR-FSIFRLYCWHWLMAACVL 192
           +  +R+ GVLQR  ++YL+V+++      ++     +S  R      L+     +   ++
Sbjct: 282 LEQLRIMGVLQRFGVAYLVVAILHTLCCRREPISPQRSWQRAVHDVCLFSGELAVLLALV 341

Query: 193 VVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLG 249
             YL L +G  VP      +      DY              +P C   A GY D +VLG
Sbjct: 342 ATYLGLTFGLRVPGCPRGYLGPGGKHDYNA------------HPKCIGGAAGYADLQVLG 389

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             H+Y HP    +     DS                 F+PEG+   + S++  ++G   G
Sbjct: 390 NAHIYQHP----TAKYVYDS---------------TAFDPEGIFGCILSVVQVLLGAFAG 430

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIF-----GLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
             ++      +R+++W  +   L +      G +     AIP+NK L++LS+VCVT   A
Sbjct: 431 VTLLVHPNFQSRIRRWTLLAILLGLIGGALCGFSRE-GGAIPMNKNLWSLSFVCVTVSLA 489

Query: 365 ALVFSAIYALVDCNIFGPWT 384
            L+ S +Y  +D      W+
Sbjct: 490 LLILSLMYYFIDVRETWSWS 509


>gi|194679266|ref|XP_588978.4| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Bos
           taurus]
          Length = 734

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 114/385 (29%), Positives = 182/385 (47%), Gaps = 52/385 (13%)

Query: 17  SEPDVSDQQEKSHLKTQ---RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
           S    SD Q ++  ++    RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +A
Sbjct: 318 SPSRASDPQPEAWRRSAAPLRLRCVDTFRGMALILMVFVNYGGGKYWYFKHSSWNGLTVA 377

Query: 74  DFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           D V P+F+FI+G +I L++  I  R  +    + K+ +R+  L+  GI     F   P  
Sbjct: 378 DLVFPWFVFIMGTSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGI-----FVVNPKY 432

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGR-FSIFRLYC-W-HW 185
               +     R+ GVLQR+  +Y +V+++E +F K V +   S    FS+  +   W  W
Sbjct: 433 CLGPLSWEKARIPGVLQRLGATYFVVAVLELLFAKPVPETCASERSCFSLLDITASWPQW 492

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           L    +  V+LAL +   VP      +       G  +    G          A GY+DR
Sbjct: 493 LFVLILEGVWLALTFFLPVPGCPTGYLGPGGIGDGGRYRNCTG---------GAAGYVDR 543

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +LG  H+Y HP    S A    +                 ++PEG+L +++SI+   +G
Sbjct: 544 LLLGDQHLYQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLG 585

Query: 306 VHFGHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCV 359
           V  G ++++    T+G L R   W  +   L+   LT    N   IP+NK L+++SYV  
Sbjct: 586 VQAGKILLYYKDQTRGILIRFAAWGCL-LGLVSVALTKASENEGFIPVNKNLWSISYVTT 644

Query: 360 TSGAAALVFSAIYALVDCNIFGPWT 384
            S  A L+  A+Y +VD    G WT
Sbjct: 645 LSSLAFLILLALYPVVDVK--GLWT 667


>gi|126272886|ref|XP_001369919.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Monodelphis domestica]
          Length = 389

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 105/348 (30%), Positives = 165/348 (47%), Gaps = 49/348 (14%)

Query: 44  LAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL----KRIPDRA 99
           L++ LMI V++ GG +    HAPWNG  +AD VMP+F+FI+G ++ LA     ++   + 
Sbjct: 3   LSLTLMIFVNYGGGGYWFFEHAPWNGLTIADLVMPWFVFILGTSVGLAFHVMQRKGVKKF 62

Query: 100 DAVKKVIFRTLKLLFWGIL-LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
              +KV +RT  L+  G L L  G    P   ++       RL GVLQR+  +Y +V+L+
Sbjct: 63  KLFRKVAWRTGVLIAIGALFLNYGPVDGPLSWSWA------RLPGVLQRLGFTYFIVALM 116

Query: 159 EIFTKDVQDKDQSVG-RFSIFR---LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINK 214
           +I       +   VG  ++ FR   LY   W++   +  ++L L +   VP      +  
Sbjct: 117 QIAFGVADMQKYQVGVWWAPFRDIVLYWQEWIIIIGLECIWLCLTFLLPVPGCPRGYLGP 176

Query: 215 DS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
               D GK FN T G          A  YID+ +LG NH+Y  P+ +     TQ      
Sbjct: 177 GGIGDEGKYFNCTGG----------AAAYIDKWILGENHLYRFPSCKELYKTTQ------ 220

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----LARLKQW-VTM 328
                       PF+PEG+L +++SI+    G+  G +I+  +      L R   W V +
Sbjct: 221 ------------PFDPEGILGTINSIIMAFFGLQAGKIILMYRSKPRSILKRFLIWSVLL 268

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           G    I  L       IP+NK L++LS+V  TS  +  +   +Y ++D
Sbjct: 269 GIISAILTLGTQNEGFIPVNKNLWSLSFVTTTSCFSFFLLGLLYYIID 316


>gi|149057830|gb|EDM09073.1| rCG43316 [Rattus norvegicus]
          Length = 626

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 109/380 (28%), Positives = 182/380 (47%), Gaps = 51/380 (13%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D   +  ++     RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+
Sbjct: 216 DCQPETRRASALPHRLRCVDTFRGVALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPW 275

Query: 80  FLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           F+FI+G +I L+    L+R   +   + K+ +R+  L+  GI++       P+     + 
Sbjct: 276 FVFIMGSSIFLSMTSLLQRGCSKIKLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLS 330

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDK---DQSVGRFSIFRLYCWHWLMAACV 191
              +R+ GVLQR+ ++Y +V+++E+F  K V D    ++S             WL+   +
Sbjct: 331 WDKVRIPGVLQRLGVTYFVVAMLELFFWKPVPDSCTLERSCLSLRDITSSWPQWLIILIL 390

Query: 192 LVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
             ++LAL +   VP      +      D GK  + T G          A GYIDR +LG 
Sbjct: 391 ESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYIDRLLLGD 440

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           +H+Y HP    S A    +                 ++PEG+L +++SI+   +GV  G 
Sbjct: 441 SHLYQHP----SSAVLYHT--------------EVAYDPEGVLGTINSIVMAFLGVQAGK 482

Query: 311 VIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAA 364
           ++++    TK  L R   W  +   L+   LT    N   IP+NK L+++SYV   S  A
Sbjct: 483 ILLYYKDQTKAILIRFAAWCCI-LGLISIALTKMSANEGFIPINKNLWSISYVTTLSCFA 541

Query: 365 ALVFSAIYALVDCNIFGPWT 384
             +   +Y +VD  + G WT
Sbjct: 542 FSILLILYPVVD--VKGLWT 559


>gi|170027692|ref|XP_001841731.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167862301|gb|EDS25684.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 558

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 96/279 (34%), Positives = 138/279 (49%), Gaps = 30/279 (10%)

Query: 5   KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISH 64
           ++ T     L  + P +S Q      KT RL SLD FRG+A+ LMI V+  GGD+  I H
Sbjct: 263 RSRTPSEPQLSPNSPTISVQATGVPQKT-RLRSLDTFRGIAIMLMIFVNSGGGDYWWIEH 321

Query: 65  APWNGCNLADFVMPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQ 120
           A WNG ++AD V P+FLFI+GV I ++L+    R   R + +K V  R+LKL   G+ L 
Sbjct: 322 ATWNGLHVADLVFPWFLFIMGVCIPISLRSQLGRNVPRYEILKNVAVRSLKLFLIGLCLN 381

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQ---SVGRFSI 177
                       G  V  +RL GVLQR  ++Y +VS + ++     D+ Q   +     I
Sbjct: 382 S---------INGPTVADLRLFGVLQRFGVAYFVVSAIHLYCYQENDQLQHPLARSHADI 432

Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVF-NVTCGVRAKLNPP 236
            RL+  HW++   ++ VYL +++   VP+               ++ N T G+       
Sbjct: 433 LRLW-KHWVIVGTIVFVYLLVIFFVPVPNCPSGYFGPGGKHLMLLYPNCTGGI------- 484

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPL 275
               GYIDR+VLGI H+Y HP  R         P EGP 
Sbjct: 485 ---TGYIDRQVLGIRHLYQHPTARYMYDAMPFDP-EGPF 519


>gi|328790778|ref|XP_623715.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Apis mellifera]
          Length = 572

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 109/371 (29%), Positives = 183/371 (49%), Gaps = 68/371 (18%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           + R+ S+D FRG+A+ LMI V++ GG +   +H+ W G ++AD V+P+F +I+G+ I ++
Sbjct: 180 STRIHSVDTFRGIAILLMIFVNNGGGKYIFFNHSAWFGLSIADLVLPWFAWIMGLMITVS 239

Query: 92  ----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
               L+    R       + R+  L+F G++L    S +  +L         R  GVLQ 
Sbjct: 240 KRTELRLTTSRIKITLYCLRRSAILIFLGLMLNSKDSESLHDL---------RFPGVLQL 290

Query: 148 IALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLY-- 200
           + +SY + +++E IF K   QD     GRF++FR  L  W  WL+ A ++  +  + +  
Sbjct: 291 LGVSYFVCAILETIFMKPHSQDILHQFGRFAMFRDILESWPQWLIMAGIVTTHTLITFLL 350

Query: 201 -------GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
                  G + P  ++          GK  N T G          A GYIDR + G NH 
Sbjct: 351 PISNCPKGYFGPGGEYHF-------RGKYINCTAG----------AAGYIDRLIFG-NHT 392

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y+H         T++  +   LR D          PEGL++++S+I    +GVH G +++
Sbjct: 393 YNH---------TENFLYGQILRYD----------PEGLMNTISAIFIVYLGVHAGKILL 433

Query: 314 HTKGHLARLKQWVTMG-FALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFS 369
                 +R+ +W     F  +I G+  +F      IP++K++ TLSYV + S  A L+++
Sbjct: 434 LYYQCNSRVIRWFLWTVFTGIIAGILCNFETQGGIIPISKRMMTLSYVLICSSFAFLLYA 493

Query: 370 AIYALVDCNIF 380
            +Y L+D   F
Sbjct: 494 LLYVLIDYKQF 504


>gi|345481194|ref|XP_001603332.2| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
           N-acetyltransferase-like [Nasonia vitripennis]
          Length = 570

 Score =  134 bits (336), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 107/384 (27%), Positives = 182/384 (47%), Gaps = 52/384 (13%)

Query: 5   KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISH 64
           K   +H+    + E + + +         RL +LD FRG+AV LMI V++ GG++  ++H
Sbjct: 154 KHAESHNDIDRLQESESTPEMVAVSKTAMRLQALDAFRGIAVLLMIFVNNGGGEYVFLNH 213

Query: 65  APWNGCNLADFVMPFFLFIVGVAIALALK---RIP-DRADAVKKVIFRTLKLLFWGILLQ 120
           A WNG  +AD V+P+F + +G  I  +++   R+   R   +   + RT+ L+ +G+ + 
Sbjct: 214 AAWNGLTVADLVLPWFAWAMGFTIVNSVRVHLRVSVSRTRLIIMQLRRTVLLILFGLFIN 273

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR- 179
              +    EL         R  GVLQ +A++Y + S++E      Q   Q  GRF   + 
Sbjct: 274 SQHNSTLSEL---------RFPGVLQLLAVAYFICSVIETCLASPQRTFQ-FGRFVFLQD 323

Query: 180 -LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPP 236
            L  W  W++   +++V+  + +  +VP      +       YG   N T G        
Sbjct: 324 ILERWTQWMVVLVIILVHTCITFFLHVPGCPRGYLGPGGYHHYGLNVNCTGG-------- 375

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
             A GYIDR + G  HMY              +P  GP           P +PEGL++++
Sbjct: 376 --AAGYIDRLIFG-QHMYQKTM----------NPVYGPT---------LPHDPEGLMNTI 413

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTNA---IPLNKQLY 352
           S++L   +GV  G + +      +R+ +W     A  L+ G+  +F+     +P+NK + 
Sbjct: 414 SAVLIVFMGVQAGRIFVTYYQANSRIIRWFAWFVATGLLAGILCNFSQEKGWVPVNKNMM 473

Query: 353 TLSYVCVTSGAAALVFSAIYALVD 376
           +LS+V  TS  A L+FS +Y L+D
Sbjct: 474 SLSFVLCTSSFAFLLFSILYYLID 497


>gi|186683151|ref|YP_001866347.1| hypothetical protein Npun_R2871 [Nostoc punctiforme PCC 73102]
 gi|186465603|gb|ACC81404.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
          Length = 375

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 105/360 (29%), Positives = 173/360 (48%), Gaps = 77/360 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+ AG     +P ++HA W+GC   D V PFFLFIVGVA+ 
Sbjct: 1   MRLTSLDVFRGITIAAMILVNMAGVADNIYPPLAHADWHGCTPTDLVFPFFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +  +       + +R L+    L   G+LL G ++       +  D+  IR+ GVL
Sbjct: 61  FSLSKYTEDNKPTSAIYWRILRRAAILFALGLLLNGFWNQG----VWTFDLSSIRIMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+++YLL SL+ +   ++  K Q              W++AA +L+ Y  ++    VP
Sbjct: 117 QRISITYLLASLIVL---NLPRKGQ--------------WILAAVILIGYWLMMMYLPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   YIDR ++   H+Y    ++     
Sbjct: 160 DYGAGVLTREG---------------------NLGAYIDRMIIPKAHLYKGDGFK----- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                F G              +PEGL S++ +I+S + G +F    I ++   +R    
Sbjct: 194 -----FMG--------------DPEGLFSTIPAIVSVLAG-YFTGQWIRSQPVQSRTSIG 233

Query: 326 VTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           + + G   LI G    +T   P+NK+L+T SYV  +SG A ++ +A Y L++  +   W+
Sbjct: 234 LGLFGVGCLIIGWAWGWT--FPINKKLWTSSYVVFSSGWALILLAACYELIEVRLNRRWS 291


>gi|195340719|ref|XP_002036960.1| GM12376 [Drosophila sechellia]
 gi|194131076|gb|EDW53119.1| GM12376 [Drosophila sechellia]
          Length = 576

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 175/379 (46%), Gaps = 54/379 (14%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D   +      + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P 
Sbjct: 171 DSIGEAATKATQRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADIVFPS 230

Query: 80  FLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FL+I+GV I L++K    R   +A    +++ R++KL   G+ L             G +
Sbjct: 231 FLWIMGVCIPLSVKSQLSRGSSKARICLRILVRSIKLFVIGLCLNS---------MSGPN 281

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFT--KDVQDKDQSVGR-FSIFRLYCWHWLMAACVL 192
           +  +R+ GVLQR  ++YL+V ++      ++     +S  R      L+     +   ++
Sbjct: 282 LEQLRVMGVLQRFGVAYLVVGVLHTLCCRREPISPQRSWQRAVHDVCLFSGELAVLLALV 341

Query: 193 VVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLG 249
             YL L +G  VP      +      DY              +P C   A GY D +VLG
Sbjct: 342 ATYLGLTFGLRVPGCPRGYLGPGGKHDYNA------------HPKCIGGAAGYADLQVLG 389

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             H+Y HP  +     T                    F+PEG+   + S++  ++G   G
Sbjct: 390 NAHIYQHPTAKYVYDST-------------------AFDPEGIFGCILSVVQVLLGAFAG 430

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTL-HFTN---AIPLNKQLYTLSYVCVTSGAAA 365
             ++    + +R+++W  +   L + G  L  F++   AIP+NK L++LS+VCVT   A 
Sbjct: 431 VTLLVHPNYQSRIRRWTLLAILLGLIGGALCGFSSEGGAIPVNKNLWSLSFVCVTVSLAL 490

Query: 366 LVFSAIYALVDCNIFGPWT 384
           ++ S +Y  +D      W+
Sbjct: 491 VILSLMYYFIDVRETWSWS 509


>gi|326918494|ref|XP_003205523.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Meleagris gallopavo]
          Length = 532

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 172/362 (47%), Gaps = 51/362 (14%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G +I+L+L
Sbjct: 131 QRLRSLDTFRGLSLIIMVFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSL 190

Query: 93  KRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             +      +   + K+++R+  L   G+++       P+     +    +R+ GVLQR+
Sbjct: 191 SSMLRWGSSKQKVLGKILWRSFLLTLLGVIVVN-----PNYCLGALSWENLRIPGVLQRL 245

Query: 149 ALSYLLVSLVE-IFTKDVQ-------DKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
            L+YL+V+ +E +FT+ V         ++ S         +   W+    + V++L L +
Sbjct: 246 GLTYLVVAALELLFTRAVNISPSLHLMQEMSYPALQDVLPFWPQWIFILTLEVIWLCLTF 305

Query: 201 GTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
              VP      +      D+GK  N T G          A GYIDR VLG  H+Y HP+ 
Sbjct: 306 LLPVPGCPRGYLGPGGIGDFGKYANCTGG----------AAGYIDRLVLGEKHIYQHPSC 355

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH- 318
                 T                   P++PEG+L ++++IL   +G+  G +I+  K   
Sbjct: 356 NVLYQTT------------------VPYDPEGILGTINTILMAFLGLQAGKIILSYKDQH 397

Query: 319 ---LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
              ++R   W + MG    I          IP+NK L++ SYV   S  A ++   +Y L
Sbjct: 398 KQIMSRFLIWSLVMGIISAILTKCSKEEGFIPINKNLWSTSYVTTMSCFAFILLLLMYYL 457

Query: 375 VD 376
           VD
Sbjct: 458 VD 459


>gi|196012186|ref|XP_002115956.1| hypothetical protein TRIADDRAFT_59909 [Trichoplax adhaerens]
 gi|190581732|gb|EDV21808.1| hypothetical protein TRIADDRAFT_59909 [Trichoplax adhaerens]
          Length = 580

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 108/364 (29%), Positives = 169/364 (46%), Gaps = 75/364 (20%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
            +E  H   QR+ ++D FRGL + +MI V+  GG +      PWNG   AD ++P+F+FI
Sbjct: 238 SEESIHPLAQRIYAVDAFRGLCITIMIFVNSGGGGYWYFRSTPWNGLTFADLILPWFIFI 297

Query: 84  VGVAIALAL--------KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           VG+ IAL+          R+P  + AV KV+ R++ L   G+ L             GV+
Sbjct: 298 VGICIALSFYNHRYITASRLPP-SSAVLKVLSRSVILFLIGLFLND-----------GVN 345

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC-WH-WLMAACVLV 193
           +   R+ G LQ++A+SY++VSL  ++    +  D      +I  + C W  W+    +L 
Sbjct: 346 LSTWRIPGNLQKVAISYIVVSLSVLYL--AKPPDTITNLRAIREIVCIWKIWIGMIGLLS 403

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           +YL+L++   VP            +D    +N T G          A GYIDR + G NH
Sbjct: 404 IYLSLIFALPVPGCPTGYFGPGGLSDDANHYNCTGG----------ATGYIDRFIFG-NH 452

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +  +P+ +                       H PF+ EG LS+++SIL+  +G+      
Sbjct: 453 LDANPSCKVLYR------------------THMPFDSEGCLSTLTSILTCFMGLQ----- 489

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           + T   L   KQ   MGF              IPLN+ L++LSY+ +  G A  V   +Y
Sbjct: 490 VATGVALCGGKQ--NMGF--------------IPLNRNLWSLSYITLLGGLAYFVLMMLY 533

Query: 373 ALVD 376
            L+D
Sbjct: 534 LLID 537


>gi|307209305|gb|EFN86390.1| Heparan-alpha-glucosaminide N-acetyltransferase [Harpegnathos
           saltator]
          Length = 552

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 106/384 (27%), Positives = 191/384 (49%), Gaps = 45/384 (11%)

Query: 3   EIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI 62
           ++  ++ H     + E + S+   ++     R+ S+D FRG+++ LMI V++ GG +   
Sbjct: 132 KLSPDSVHDDLDRLQEAESSNPVIRTSRVNTRIRSVDTFRGISILLMIFVNNGGGKYVFF 191

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGG 122
           +H+ WNG  +AD V+P+F +I+G++I ++ +     +++  K+I R L+  F  ILL   
Sbjct: 192 NHSVWNGLTVADLVLPWFAWIMGLSITISKRSELRVSNSRGKIILRCLQRAFILILLGLM 251

Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR- 179
            +    +      ++ +R  G+LQ +A+SY + + +E IF +   QD     GRF++ R 
Sbjct: 252 LNSIHTK-----SLKDLRFPGILQLLAVSYFVCATIETIFMRAHSQDDLLQFGRFTVLRD 306

Query: 180 -LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSAD-YGKVFNVTCGVRAKLNPP 236
            L  W  W +   +   +  + +   V D     +       +GK  N T G        
Sbjct: 307 ILDSWAQWSIIVAIATTHTLITFLLPVLDCPKGYLGPGGYHLFGKNANCTGG-------- 358

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
             A GYIDR V G +HMY+             +P  G +          P++PEG+++++
Sbjct: 359 --AAGYIDRLVFG-SHMYNK----------THNPVYGTI---------LPYDPEGIMNTI 396

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVT-MGFALLIFGLTLHF---TNAIPLNKQLY 352
           S IL   +GVH G +++      AR+ +W+   G   LI G+  HF   +  IP+NK++ 
Sbjct: 397 SVILVVYMGVHAGKILLLYYQCNARIVRWLLWSGVTGLIAGILCHFDKESGVIPVNKRMM 456

Query: 353 TLSYVCVTSGAAALVFSAIYALVD 376
           +LS+V   S  A L+++ ++  VD
Sbjct: 457 SLSFVLTVSCFAFLLYAILHFFVD 480


>gi|194888520|ref|XP_001976930.1| GG18736 [Drosophila erecta]
 gi|190648579|gb|EDV45857.1| GG18736 [Drosophila erecta]
          Length = 576

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 106/380 (27%), Positives = 174/380 (45%), Gaps = 56/380 (14%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D   +      + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P 
Sbjct: 171 DSIGEAATKATQRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADVVFPS 230

Query: 80  FLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FL+I+GV I L++K    R   +A    ++++R++KL   G+ L             G +
Sbjct: 231 FLWIMGVCIPLSVKSQLSRGSSKARICLRILWRSIKLFVIGLCLNS---------MSGPN 281

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFT--KDVQDKDQSVGR-FSIFRLYCWHWLMAACVL 192
           +  +R  GVLQR  ++YL+V ++      ++     +S  R      L+     +   ++
Sbjct: 282 LEQLRFMGVLQRFGVAYLVVGVLHTLCCRREPISPQRSWQRAVHDVCLFSGELAVLLALV 341

Query: 193 VVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLG 249
             YL L +G  VP      +      DY              +P C   A GY D +VLG
Sbjct: 342 ATYLGLTFGLRVPGCPRGYLGPGGKHDYNA------------HPHCIGGAAGYADLQVLG 389

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             H+Y HP    +     DS                 F+PEG+   + S++  ++G   G
Sbjct: 390 NAHIYQHP----TAKYVYDS---------------TAFDPEGVFGCILSVVQALLGAFAG 430

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIF-----GLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
             ++      +R+++W+ +   L +      G +     AIP+NK L++LS+VCVT   A
Sbjct: 431 VTLLVHPNWQSRMRRWMLLAILLGLIGGALCGFSRE-GGAIPVNKNLWSLSFVCVTVSLA 489

Query: 365 ALVFSAIYALVDCNIFGPWT 384
            ++ S +Y ++D      W+
Sbjct: 490 LVILSLLYYIIDVRETWSWS 509


>gi|121489785|emb|CAK18864.1| hypothetical protein [Phillyrea latifolia]
          Length = 129

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 64/131 (48%), Positives = 84/131 (64%), Gaps = 3/131 (2%)

Query: 230 RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEP 289
              L P CN+ G IDR VLGI+H+Y  P +R  K C   S   G + + APSWCHAPF+P
Sbjct: 1   NGDLGPACNSAGMIDRNVLGIDHLYAKPVYRNLKECNISS--HGQVPETAPSWCHAPFDP 58

Query: 290 EGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNK 349
           EG+LSS+++ +S IIG+ +GH+++  + H  RL  W    FA L  GL L F   IPLNK
Sbjct: 59  EGILSSLTAAVSCIIGLQYGHILVRLQDHKERLCNWSIFSFAFLGLGLFLAFV-GIPLNK 117

Query: 350 QLYTLSYVCVT 360
            LYT+SY+ VT
Sbjct: 118 SLYTISYLLVT 128


>gi|380028317|ref|XP_003697852.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Apis florea]
          Length = 555

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 95/359 (26%), Positives = 164/359 (45%), Gaps = 47/359 (13%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           D+        +R+ ++D  RG +  LMI V+   G +  + HA WNG    D + P F++
Sbjct: 166 DETAMKQPSKRRVKAIDTVRGASTLLMIFVNDGSGGYRILGHATWNGLLPGDLLFPCFIW 225

Query: 83  IVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           I+GV I +A+    KR+  R   +  ++ R++ +   G+ L        + ++ G  +  
Sbjct: 226 IMGVCIPIAMASQMKRMLPRHVILYGIVKRSILMFLIGLSL--------NTVSTGPQLET 277

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           IR+ GVLQR  ++YL+V+L+       + K   V     F L    W +   ++ V+  +
Sbjct: 278 IRVFGVLQRFGITYLIVALIYFCLMARKPKKTQV--MQDFLLLLPQWCVMLVIVAVHCVI 335

Query: 199 LYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
            +   VP      +      D  K F+   G          A GYIDR +L   H++H  
Sbjct: 336 TFCLKVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDRMILKEPHLHHSA 385

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
              +S                       P++PEG+L ++++     +G+H G +++  K 
Sbjct: 386 TVYKS----------------------GPYDPEGILGTLTTTFQVFLGLHAGIIMMTYKD 423

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
              R+ +W+         G  LHF+N IP+NK+L++LS+V VT+  +    SA Y LVD
Sbjct: 424 WKERVIRWLAWAAFFSCIGCILHFSNIIPVNKKLWSLSFVFVTTSFSLAFLSACYLLVD 482


>gi|340727662|ref|XP_003402158.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Bombus terrestris]
          Length = 554

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 168/363 (46%), Gaps = 48/363 (13%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           +D         +R+ ++D  RG +  LMI V+   G +  + HA WNG    D + P F+
Sbjct: 159 ADDGAMKQPAKRRVKAIDTVRGASTLLMIFVNDGSGGYRTLGHATWNGLLPGDLLFPCFI 218

Query: 82  FIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           +I+GV I +A+    KR+  +   +  ++ R++ L   G+ L        + ++ G  + 
Sbjct: 219 WIMGVCIPIAMSSQMKRMTLKHQILYGIVKRSILLFLIGLSL--------NTVSTGGQLE 270

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQ-DKDQS--VGRFSIFRLYCWHWLMAACVLVV 194
            IR+ GVLQR  ++YL+V+L+       +  K QS  +     F L    W +   ++VV
Sbjct: 271 TIRIFGVLQRFGITYLVVALLYFLLMSRRPSKIQSPMLREVQDFLLLLPQWCVMLVIVVV 330

Query: 195 YLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           + A+ +   VP      +      D  K F+   G          A GYIDR +L   H+
Sbjct: 331 HCAITFCLNVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDRMILKEAHL 380

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           ++     +S                       P++PEG+L ++++     +G+H G +++
Sbjct: 381 HYSATVYKS----------------------GPYDPEGILGTLTTAFQVFLGLHAGIIMM 418

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             K    R+ +W+         G  LHFTN IP+NK+L++LS+V VT+  +    SA Y 
Sbjct: 419 TYKDWKERVIRWLAWAAFFGCVGCVLHFTNVIPVNKKLWSLSFVFVTTSFSLAFLSACYL 478

Query: 374 LVD 376
           LVD
Sbjct: 479 LVD 481


>gi|313242995|emb|CBY39713.1| unnamed protein product [Oikopleura dioica]
          Length = 597

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 108/385 (28%), Positives = 176/385 (45%), Gaps = 69/385 (17%)

Query: 20  DVSDQQEKSHLKTQ-----------RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWN 68
           D +D+QE   ++             R  SLD  RGL++ +MI V++ GG++  + H  WN
Sbjct: 182 DQTDKQEDQEVQEDEPAPPAPAKKKRYKSLDTLRGLSLIIMIFVNYGGGEYWFMEHVAWN 241

Query: 69  GCNLADFVMPFFLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFS 124
           G  +AD VMP+FLF+ GV+I +AL    KR   + +   +++ R++KL+  G++  GG  
Sbjct: 242 GLTVADLVMPWFLFMSGVSIRIALQSRIKRGISKTEISYEILVRSVKLIGLGMITIGG-- 299

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV--GRFSIFRLYC 182
                          R  GVLQRI  SY +V+++ +   +  DK+     G F       
Sbjct: 300 --------NESWEYFRFPGVLQRIGFSYFVVAIIHLLVIEHPDKEPETNWGLFKEMSFNF 351

Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVG 241
              L++  +L  ++ L Y   +P            ++ G+ ++   G          A G
Sbjct: 352 KEHLISWSILGAFICLTYLLPIPGCPTGYTGPGGLSENGEHYHCIGG----------AAG 401

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
           YIDRK+LG  H+Y+ P          D P               PF+PEGLL +++SI  
Sbjct: 402 YIDRKLLGEKHIYNWP------TAYHDEP------------NGVPFDPEGLLGTLTSIFM 443

Query: 302 TIIGVHFGH----------VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQL 351
             +G+  G           +I+H  G LA +     M  A + FG        IP+NK L
Sbjct: 444 VYLGLQAGKCFDIFKTPKSIILHLLG-LAAVYGISGMLLATIGFGKIAE--AIIPINKNL 500

Query: 352 YTLSYVCVTSGAAALVFSAIYALVD 376
           +++S+V V S  A  + + +Y ++D
Sbjct: 501 WSVSFVFVLSSMAFTLLAFLYFIID 525


>gi|281209034|gb|EFA83209.1| hypothetical protein PPL_03999 [Polysphondylium pallidum PN500]
          Length = 1154

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 104/384 (27%), Positives = 169/384 (44%), Gaps = 74/384 (19%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
           E +  K  R+ SLD+FRGL++ +MI V++ GG +   +H+ WNG  +AD V P+F+FI+G
Sbjct: 217 ESNDPKKDRMKSLDVFRGLSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWFVFIMG 276

Query: 86  VAIALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
            A+ ++   +  R    K    K++ R++ L   G+ L             G D++  R+
Sbjct: 277 CAMPMSFNALESRGVPKKTIVIKLVRRSITLFALGMFLNN-----------GNDLQHWRI 325

Query: 142 CGVLQRIALSYLLVSLVEIFTK---------------------DVQDKDQS--VGRFSIF 178
            GVLQR  +SYL+  L+ +F                        +QD+ +S     F+  
Sbjct: 326 LGVLQRFGISYLVTGLIMMFVPVWRYRQLDDLSEEQQPLYGGGSIQDRIRSRYPRMFADI 385

Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPC 237
             Y   W++A  +L V+  + +   VP      I        G+  N T G         
Sbjct: 386 LPYWIQWVVALMLLSVWFLVTFLLPVPGCPTGYIGPGGIGSQGQYANCTGG--------- 436

Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
            A  Y+D K+ G NH+Y  P  +                     +    ++PEG L  ++
Sbjct: 437 -AARYVDLKIFGENHIYQTPTCQT-------------------IYNTGSYDPEGTLGYIT 476

Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQW-----VTMGFALLIFGLTLHFTNAIPLNKQLY 352
           SI    +GV  G  I+  K    RL +W     V  G A  + G++ +    +P+NK L+
Sbjct: 477 SIFMCFLGVQCGRTILAFKKASCRLIRWSIWGVVLCGIAAGLCGMSQN-NGWLPINKNLW 535

Query: 353 TLSYVCVTSGAAALVFSAIYALVD 376
           T S+V + SG    V S +Y  +D
Sbjct: 536 TPSFVLLLSGFGFFVLSFMYIFID 559


>gi|350423601|ref|XP_003493532.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Bombus impatiens]
          Length = 565

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 99/367 (26%), Positives = 169/367 (46%), Gaps = 49/367 (13%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
           +  V D   K   K +R+ ++D  RG +  LMI V+   G +  + HA WNG    D + 
Sbjct: 167 KSQVDDGAMKQPAK-RRVKAIDTVRGASTLLMIFVNDGSGGYRTLGHATWNGLLPGDLLF 225

Query: 78  PFFLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           P F++I+GV I +A+    KR+  +   +  ++ R++ L   G+ L        + ++ G
Sbjct: 226 PCFIWIMGVCIPIAMSSQMKRMTPKRQILYGIVKRSILLFLIGLSL--------NTVSTG 277

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD-QS--VGRFSIFRLYCWHWLMAAC 190
             +  IR+ GVLQR  ++Y +V+L+       + +  QS  +     F L    W +   
Sbjct: 278 GQLETIRIFGVLQRFGITYFVVALLYFLLMSRRPRKIQSPMLREVQDFLLLLPQWCVMLV 337

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
           ++VV+  + +   VP      +      D  K F+   G          A GYIDR +L 
Sbjct: 338 IVVVHCVITFCLNVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDRVILK 387

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             H++H     +S                       P++PEG+L ++++     +G+H G
Sbjct: 388 EAHLHHSATVYKS----------------------GPYDPEGILGTLTAAFQVFLGLHAG 425

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
            +++  K    R+ +W+         G  LHFTN IP+NK+L++LS+V VT+  +    S
Sbjct: 426 IIMMTYKDWKERVIRWLAWAAFFGCVGCVLHFTNVIPVNKKLWSLSFVFVTTSFSLAFLS 485

Query: 370 AIYALVD 376
           A Y LVD
Sbjct: 486 ACYLLVD 492


>gi|260816362|ref|XP_002602940.1| hypothetical protein BRAFLDRAFT_251788 [Branchiostoma floridae]
 gi|229288254|gb|EEN58952.1| hypothetical protein BRAFLDRAFT_251788 [Branchiostoma floridae]
          Length = 512

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 112/369 (30%), Positives = 173/369 (46%), Gaps = 50/369 (13%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           S  Q  S    +RL SLD FRGL++A+M+ V++ GG +    HA WNG  +AD V P+F+
Sbjct: 109 SSTQPASQ-GIRRLRSLDTFRGLSLAVMVFVNYGGGGYWFFKHARWNGLTVADLVFPWFV 167

Query: 82  FIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FI+G +IAL+ +R+      R   + KVI RT+ L   G+ +           T      
Sbjct: 168 FIMGTSIALSFRRLLKKGVSRLSLLWKVIQRTVILFLLGLFIINTKKGHNSWST------ 221

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKD--VQDKDQSVGRFSIFR--LYCW-HWLMAACVL 192
            +R+ GVLQR+AL+Y +V+L+E +     +        R +  R  +  W  WL    V+
Sbjct: 222 -LRIPGVLQRLALTYFIVALMESWKPRGYLSLYLLQTSRIAPIRDIVNSWGQWLFMIVVV 280

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
            ++L L++   VP+     +          +N T G          A GYIDR V   +H
Sbjct: 281 TLHLVLMFWLQVPNCPIGYLGPGGLSDIAHYNCTGG----------AAGYIDRAVFTDDH 330

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP        T  + +E             PFEPEGLL +++S L   +G+   ++ 
Sbjct: 331 IYQHP--------TPITVYE----------TEVPFEPEGLLGTLTSALLCFLGLQVKNMY 372

Query: 313 IH-TKGHLARLKQWVTMGFALLIFGLTL----HFTNAIPLNKQLYTLSYVCVTSGAAALV 367
           ++    +     +   + F L   G  L         IPLNK L++LS+V V S  A ++
Sbjct: 373 MYIIMNYCVFTNRSPAICFCLGAIGGLLCNGRQNEGWIPLNKNLWSLSFVLVLSCFAFVL 432

Query: 368 FSAIYALVD 376
            S  Y +VD
Sbjct: 433 LSVCYIIVD 441


>gi|17229379|ref|NP_485927.1| hypothetical protein all1887 [Nostoc sp. PCC 7120]
 gi|17130977|dbj|BAB73586.1| all1887 [Nostoc sp. PCC 7120]
          Length = 375

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 109/362 (30%), Positives = 164/362 (45%), Gaps = 83/362 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+ AG     +P ++HA W+GC   D V PFFLFIVGVA++
Sbjct: 1   MRLTSLDVFRGITIAGMILVNMAGVADDVYPPLAHAEWHGCTPTDLVFPFFLFIVGVAMS 60

Query: 90  LALKRIPDRADAVKKV---IFRTLKLLF-WGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +          V   IFR   +LF  G+LL G ++       +  D+  IR+ GVL
Sbjct: 61  FSLSKYTQENKPTSVVYWRIFRRAAILFVLGLLLNGFWNKG----IWTFDLSNIRIMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+LSYL  SL  +   ++  K Q              W++A  +LV Y   +    VP
Sbjct: 117 QRISLSYLFASLTVL---NLPRKGQ--------------WILAGVLLVGYWLTMMYVPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   YIDR ++  +H+Y    ++     
Sbjct: 160 DYGAGVLTREG---------------------NFGAYIDRLIIPKSHLYAGDGFKNLG-- 196

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI----IHTKGHLAR 321
                                 +PEGL S++ +I+S + G   G  I    + T+  L  
Sbjct: 197 ----------------------DPEGLFSTIPAIVSVLAGYFTGEWIRKQPVQTRTSLGL 234

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
                  G   LI G    +    P+NK+L+T SYV  TSG A L+ +A Y L++  +  
Sbjct: 235 ----ALFGIGCLIVGWGWGWV--FPINKKLWTSSYVVFTSGWALLLLAACYELIEVRLIK 288

Query: 382 PW 383
            W
Sbjct: 289 RW 290


>gi|328780782|ref|XP_396570.4| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Apis mellifera]
          Length = 569

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 94/365 (25%), Positives = 166/365 (45%), Gaps = 48/365 (13%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
            + D         +R+ ++D  RG +  LMI V+   G +  + HA WNG    D + P 
Sbjct: 172 QLDDTTAMKQPSKRRVKAIDTVRGASTLLMIFVNDGSGGYRILGHATWNGLLPGDLLFPC 231

Query: 80  FLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           F++I+GV I +A+    KR+  +      ++ R++ +   G+ L        + ++ G  
Sbjct: 232 FIWIMGVCIPIAMAGQMKRMLPKHMIFYGIVKRSILMFLIGLSL--------NTVSTGPQ 283

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEI---FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
           +  IR+ GVLQR  ++Y +V+L+ +     K  + +   +     F L    W +   ++
Sbjct: 284 LETIRIFGVLQRFGITYFIVALIYLCLMTRKPKKTQSPMLKEVQDFLLLLPQWCVMLVIV 343

Query: 193 VVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
            V+  + +   VP      +      D  K F+   G          A GYIDR +L  +
Sbjct: 344 AVHCFITFCLKVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDRMILKES 393

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H++H     +S                       P++PEG+L ++++     +G+H G +
Sbjct: 394 HLHHSATVYKS----------------------GPYDPEGILGTLTTTFQVFLGLHAGII 431

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++  K    R+ +W+T        G  LHFTN IP+NK+L++LS+V VT+  +    SA 
Sbjct: 432 MMTYKDWKERVIRWLTWAAFFSCIGCILHFTNIIPVNKKLWSLSFVFVTTSFSLAFLSAC 491

Query: 372 YALVD 376
           Y LVD
Sbjct: 492 YLLVD 496


>gi|332027964|gb|EGI68015.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acromyrmex
           echinatior]
          Length = 569

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 103/371 (27%), Positives = 185/371 (49%), Gaps = 45/371 (12%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           + E + ++   +++  + R+ S+D FRG+++ LMI V++ GG +   +H+ WNG  +AD 
Sbjct: 162 LQEAETANIMIRTNRSSIRIRSVDTFRGISILLMIFVNNGGGQYMFFNHSAWNGLTVADL 221

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           V+P+F +I+G++I ++ +     +++  K+I R L+       +            Y   
Sbjct: 222 VLPWFAWIMGLSITISKRSELRVSNSRGKIIVRCLQR-----TIILILLGLMLNSIYAKS 276

Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTKD-VQDKDQSVGRFSIFR--LYCW-HWLMAAC 190
           +  +R  GVLQ +A+SY + + +E IF K   QD     GRF++ R  L  W  WL+   
Sbjct: 277 LDDLRFPGVLQLLAVSYFICATIETIFMKTHPQDDVLQFGRFTVLRDILNNWAQWLIILA 336

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
           ++  ++ + +   VP+     +       +G+  N T G          A GYIDR V G
Sbjct: 337 IMTTHILITFLLPVPNCPTGYLGPGGYHHFGEFANCTGG----------AAGYIDRLVFG 386

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
            +HMY           TQ+ P  G +          P +PEG+++++S IL   +GVH G
Sbjct: 387 -SHMYSK---------TQN-PVYGTI---------LPHDPEGIMNTISIILVVYLGVHAG 426

Query: 310 HVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTN---AIPLNKQLYTLSYVCVTSGAAA 365
            +++      A++ +W+   F   LI G+   F      IP++K++ +LS+V   S  A 
Sbjct: 427 KILLLYYQCNAKVIRWLLWSFVTGLIAGILCDFDKEFGVIPVSKKMMSLSFVLTVSCFAF 486

Query: 366 LVFSAIYALVD 376
           L+++ +Y  VD
Sbjct: 487 LLYAILYVFVD 497


>gi|75909960|ref|YP_324256.1| hypothetical protein Ava_3756 [Anabaena variabilis ATCC 29413]
 gi|75703685|gb|ABA23361.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
          Length = 375

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 107/359 (29%), Positives = 161/359 (44%), Gaps = 75/359 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+ AG     +P ++HA W+GC   D V PFFLFIVGVA++
Sbjct: 1   MRLTSLDVFRGITIAGMILVNMAGVADDVYPPLAHAEWHGCTPTDLVFPFFLFIVGVAMS 60

Query: 90  LALKRIPDR---ADAVKKVIFRTLKLLF-WGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +         AV   IFR   +LF  G+LL G ++       +  D+  IR+ GVL
Sbjct: 61  FSLSKYTQENKPTSAVYWRIFRRAAILFVLGLLLNGFWNKG----IWTFDLSNIRIMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+LSYL  SL  +   ++  K Q              W++A  +LV Y   +    VP
Sbjct: 117 QRISLSYLFASLAVL---NLPRKGQ--------------WILAGVLLVGYWLTMMYVPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   Y+DR ++   H+Y    ++     
Sbjct: 160 DYGAGVLTREG---------------------NFGAYVDRLIIPQAHLYAGDGFKNLG-- 196

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                 +PEGL S++ +I+S + G   G  I             
Sbjct: 197 ----------------------DPEGLFSTIPAIVSVLAGYFTGEWIRKQSVQTRTSVGL 234

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
              G   LI G    +    P+NK+L+T SYV  TSG A L+ +A Y L++  +   W+
Sbjct: 235 ALFGIGCLIVGWGWGWV--FPINKKLWTSSYVVFTSGWALLLLAACYELIEVRLIKRWS 291


>gi|195399031|ref|XP_002058124.1| GJ15666 [Drosophila virilis]
 gi|194150548|gb|EDW66232.1| GJ15666 [Drosophila virilis]
          Length = 572

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 112/353 (31%), Positives = 171/353 (48%), Gaps = 53/353 (15%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFF 80
           + D  EK+  + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P F
Sbjct: 171 IGDAAEKA-TQRKRLRSLDTFRGLSIVLMIFVNSGGGGYSWIEHAAWNGLHLADLVFPSF 229

Query: 81  LFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           L+I+GV I L++K    R   ++    ++++R+ KL   G+ L           T G  +
Sbjct: 230 LWIMGVCIPLSIKSQLGRGISKSRICGRIVWRSCKLFAIGLCLNS---------TNGPQL 280

Query: 137 RMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCWHWLMAACVLV 193
             +RL GVLQR  +++L+V L+  + ++  Q   Q   + +I+   L+     +   ++ 
Sbjct: 281 EQLRLMGVLQRFGIAFLVVGLLHTVCSRRDQLSPQRAWQRAIYDICLFSGELAVLLALIA 340

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGIN 251
            YL L +G  VP      +       GK  N         NP C   A GYIDR+VLG  
Sbjct: 341 AYLGLTFGLPVPGCPRGYLGPG----GKHNNAA-------NPNCIGGAAGYIDRQVLGNA 389

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H+Y HP  +     T                    F+PEG+   + S++ T++G   G  
Sbjct: 390 HIYQHPTAKYVYDAT-------------------AFDPEGIFGCLLSVVQTLLGAFAGVT 430

Query: 312 IIHTKGHLARLKQW-VTMGFALLIFGLTLHFT---NAIPLNKQLYTLSYVCVT 360
           ++      ARLK+W +      L+ G    FT     IP+NK L++LS+V VT
Sbjct: 431 LLVHATWQARLKRWLLGATLLGLLGGALCGFTREQGVIPVNKNLWSLSFVFVT 483


>gi|298491757|ref|YP_003721934.1| hypothetical protein Aazo_3034 ['Nostoc azollae' 0708]
 gi|298233675|gb|ADI64811.1| Protein of unknown function DUF2261, transmembrane ['Nostoc
           azollae' 0708]
          Length = 376

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 102/355 (28%), Positives = 164/355 (46%), Gaps = 66/355 (18%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+  G     +  + HA WNGC   D V PFFLFIVGVA+ 
Sbjct: 1   MRLTSLDVFRGITIAGMILVNMVGVADHKYSLLDHAEWNGCTPTDLVFPFFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            +L +        K V  R L+      LL    +   ++  +  D+  IR  GVLQRI+
Sbjct: 61  FSLSKYTADNKPTKAVYLRILRRAAILFLLGLLLNGFWNKGVWTFDLSSIRFMGVLQRIS 120

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           LSYL  SL+ +    V  K+Q              W++A  +L+ Y   +    VPD+  
Sbjct: 121 LSYLFASLIVL---KVPGKNQ--------------WVLAGVLLIGYWLTMMYVPVPDYGA 163

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            ++ ++                      N  G+IDR ++   H+Y               
Sbjct: 164 GVLTREG---------------------NFGGFIDRLIIPKAHLY--------------- 187

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
                 + D  ++     +PEGL S++ +I+S ++G +F  + I  + HL        + 
Sbjct: 188 ------KGDGFNYLG---DPEGLYSTIPAIVSVLVG-YFAGIRIKERKHLNSQTSMDFVL 237

Query: 330 FALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           F L    +++ +  A P+NK+L+T SYV  T+G A ++ +A Y L++  +   W+
Sbjct: 238 FGLCCLVVSIIWDVAFPINKKLWTSSYVVFTTGWALMLLAACYELIEVRLIKRWS 292


>gi|126304129|ref|XP_001381943.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Monodelphis domestica]
          Length = 638

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 99/341 (29%), Positives = 164/341 (48%), Gaps = 51/341 (14%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
            RL SLD FRG+++ +MI V++ GG +    H  WNG  +AD V P+F+FI+G +IAL+L
Sbjct: 241 HRLRSLDTFRGISLIIMIFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGSSIALSL 300

Query: 93  KRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             +  R       + K+++R+  L   G+L+       P+     +    +R+ GVLQR+
Sbjct: 301 SSMLRRGCSKWKLLGKILWRSFLLCVIGVLIMN-----PNYCLGPLSWDKLRIPGVLQRL 355

Query: 149 ALSYLLVSLVEI-FTKDVQDK---DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
            L+YL+V+++E+ F K V +    +     F     Y   W+    +  V++ + +   V
Sbjct: 356 GLTYLVVAVLELLFAKAVPENSTMESLCASFQDIISYWPQWIFILMLEAVWVCVTFLLPV 415

Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           P      +      D+GK  N T G          A GYIDR +LG +H+Y HP+     
Sbjct: 416 PGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGEDHIYQHPS----- 460

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----L 319
                         +        ++PEGLL +++SI+   +GV  G +++  K      +
Sbjct: 461 -------------PNVLYHTKVAYDPEGLLGTINSIVMAFLGVQAGKILLFYKDQHKQIM 507

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNA---IPLNKQLYTLSYV 357
            R   W  M    +I G+   F+     IP+NK L+++SYV
Sbjct: 508 LRFLLWSAM--LAIISGVLTKFSQNEGFIPVNKNLWSISYV 546


>gi|383859754|ref|XP_003705357.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Megachile rotundata]
          Length = 572

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 113/386 (29%), Positives = 192/386 (49%), Gaps = 64/386 (16%)

Query: 17  SEPDVSDQQEKSHLKTQ----------RLASLDIFRGLAVALMILVDHAGGDWPEISHAP 66
           + PD  D+ +++   T           R+ S+D FRG+A+ LMI V++ GG +   +H+ 
Sbjct: 155 NAPDDLDRLQETESTTHPVIRTTRASTRIRSVDTFRGIAILLMIFVNNGGGKYVFFNHSA 214

Query: 67  WNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGG 122
           W G  +AD V+P+F +I+G+ I ++    L+    R   +   I R+L L+  G++L   
Sbjct: 215 WYGLTVADLVLPWFAWIMGLTITISKRAELRVTVSRVKIMLHCIRRSLVLILLGLMLNSI 274

Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKD-VQDKDQSVGRFSIFR- 179
            +++  +L         R  GVLQ + +SY + S++E IF K   QD     GRF+ FR 
Sbjct: 275 KNNSFSDL---------RFPGVLQLLGVSYFVCSMLETIFMKPHSQDTLLQFGRFASFRD 325

Query: 180 -LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DY-GKVFNVTCGVRAKLNP 235
            L  W  WL+ A ++  +  + +   VP+            +Y GK  N T G       
Sbjct: 326 ILDSWPQWLVMAVIMTTHTLITFLLPVPNCPKGYFGPGGQYEYRGKYMNCTAG------- 378

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
              A GYIDR + G NHMY  P         ++S +   LR          ++PEGL+++
Sbjct: 379 ---AAGYIDRLIFG-NHMYPKP---------KESIYGDILR----------YDPEGLMNT 415

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG-FALLIFGLTLHFTN---AIPLNKQL 351
           +S+I    +GVH G +++    + +R+ +W+       +I G   HF+     IP++K++
Sbjct: 416 ISAIFIVYLGVHAGKILLLYYQYNSRVIRWILWAVLTGIIAGNLCHFSTQGGVIPVSKRM 475

Query: 352 YTLSYVCVTSGAAALVFSAIYALVDC 377
            TLS+    S  A L+++ +Y+L+DC
Sbjct: 476 MTLSFDLTCSSFAFLLYAILYSLIDC 501


>gi|428306334|ref|YP_007143159.1| heparan-alpha-glucosaminide N-acetyltransferase [Crinalium
           epipsammum PCC 9333]
 gi|428247869|gb|AFZ13649.1| Heparan-alpha-glucosaminide N-acetyltransferase [Crinalium
           epipsammum PCC 9333]
          Length = 371

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 104/354 (29%), Positives = 165/354 (46%), Gaps = 81/354 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG-GD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+A+A MILV+ AG  D  +P ++HA WNG   AD V PFFLFI+GVA+A
Sbjct: 1   MRLTSLDVFRGMAIAGMILVNKAGVADQVYPALAHADWNGWTFADLVFPFFLFIIGVAMA 60

Query: 90  LALKRIPDRADA-VKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
            +  +  +  +   K++  R L+    L   G+LL G +++         D   IR+ GV
Sbjct: 61  FSFAKYTEGDNKPTKQLYLRILRRSAILFILGLLLNGFWNY---------DFSTIRVMGV 111

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRI+++YLL SL  +    +  K Q              W +AA +L+ Y  ++    V
Sbjct: 112 LQRISVAYLLASLAVL---TLPKKGQ--------------WALAAVLLIGYWLIMSFVPV 154

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
           P +   ++ ++                      N   YIDR ++G  H+Y    +     
Sbjct: 155 PGYGAGVLTREG---------------------NFGAYIDRLIIGAAHLYKGDNY----- 188

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                              ++  +PEGL SS+ +++S +IG   G  +            
Sbjct: 189 -------------------NSLGDPEGLFSSLPAVVSVLIGYFTGEWLRKQPERSRTSIN 229

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
            +  G + L+ G   +F    P+NK+L+T SYV  T+G A ++ +A Y L+D  
Sbjct: 230 MLIAGLSCLVVGEVWNFW--FPINKKLWTSSYVLFTAGIALILLAACYELIDVR 281


>gi|326427923|gb|EGD73493.1| heparan-alpha-glucosaminide N-acetyltransferase [Salpingoeca sp.
           ATCC 50818]
          Length = 788

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 112/417 (26%), Positives = 190/417 (45%), Gaps = 75/417 (17%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
           M  +  +     PL+ S   + + +        RL SLD FRG+A+ +MI V++ GGD+ 
Sbjct: 333 MLLLNTQKYTRDPLLSSTHAIGNPKRSK----TRLQSLDSFRGMALTIMIFVNYGGGDYN 388

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWG 116
              H+ WNG  +AD V P+F++I+G ++A+    +  R   ++ +++    RTL L   G
Sbjct: 389 FFDHSVWNGLTVADLVFPWFIWIMGTSMAITFNSLFKRHTPLRTILYKVARRTLLLFGIG 448

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF--------------- 161
           ++           +    D+R  R+ GVLQR A++YL+V+LV IF               
Sbjct: 449 VIF----------INVVHDLRFARVPGVLQRFAIAYLVVALVIIFVPKAVSLLRNVDEVT 498

Query: 162 ------TKDVQD--KDQSVGRFSIFR------LYCWHWLMAACVLVVYLALLYGTYVPDW 207
                 T  V++   D   G   + R       Y   W+    ++V++  + +   VP  
Sbjct: 499 PLIRRLTPTVRNPASDLDPGGCGMLRHLPDVAPYVGEWIAIIVLVVIHTCITFLLPVPGC 558

Query: 208 QFTIINKDSA--DYGKVF--NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
               I    A  ++G+    N +C V         A G++DR +L   H+Y  P      
Sbjct: 559 PTGYIGPGGALAEFGQFAPANGSC-VNGTFCCEGGAAGHVDRWLLSWKHIYGSPT----- 612

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
             +Q++   G             ++PEG+L S++SIL   +G+  G +I+H K   AR  
Sbjct: 613 --SQETYQTGA------------YDPEGILGSLTSILICYLGLQSGKIIVHYKAARARSV 658

Query: 324 QWVTMGF--ALLIFGLTLHFTN--AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           +W+  G     +  GL     N   IP++K L++LS+V + SG   +  +A Y L+D
Sbjct: 659 RWLAWGVLCCAIATGLCGGSKNDGVIPVSKNLWSLSFVLLMSGFGFISLTAFYWLID 715


>gi|383849627|ref|XP_003700446.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Megachile rotundata]
          Length = 552

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 101/372 (27%), Positives = 167/372 (44%), Gaps = 69/372 (18%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           D   +  +K +R+ ++D  RG +  LMI V+   G +  + HA WNG    D + P F++
Sbjct: 159 DDTARQPVK-RRVKAIDTVRGASTLLMIFVNDGSGGYKTLGHATWNGLLPGDLLFPCFIW 217

Query: 83  IVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           I+GV I +AL    KR+  +   +  ++ R++ L   G+ L        + +  G  +  
Sbjct: 218 IMGVCIPIALGSQLKRMVPKHVILYGILKRSVLLFLIGVSL--------NTVGTGPQLES 269

Query: 139 IRLCGVLQRIALSYLLVSLVEIFT-------------KDVQDKDQSVGRFSIFRLYCWHW 185
           IR+ GVLQR  ++Y +V+++ +F              +DVQD          F L    W
Sbjct: 270 IRIFGVLQRFGVTYFIVAVIYLFLISKRPTKVQSPMLRDVQD----------FLLLLPQW 319

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYID 244
            +   ++  +  + +   VP      +      D  K F+   G          A GYID
Sbjct: 320 TVMLAIVAAHCIITFCLPVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYID 369

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           + VL   H++H     +S                      APF+PEG+L  ++S     +
Sbjct: 370 KVVLKEQHLHHSMTVYKS----------------------APFDPEGILGCLTSTFHVFL 407

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G+H G +++  K    R+ +W+         G  LHFTN IP+NK+L++LS+V VT+  +
Sbjct: 408 GLHAGIIMMTYKDWKERVIRWLAWAAFFSCIGCALHFTNVIPVNKKLWSLSFVFVTTSFS 467

Query: 365 ALVFSAIYALVD 376
               SA Y LVD
Sbjct: 468 LAFLSACYLLVD 479


>gi|449283383|gb|EMC90042.1| Heparan-alpha-glucosaminide N-acetyltransferase [Columba livia]
          Length = 560

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 102/361 (28%), Positives = 172/361 (47%), Gaps = 49/361 (13%)

Query: 17  SEPDVSDQQEK--SHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
           ++P  SD   +  S    QRL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD
Sbjct: 145 ADPISSDPAPQLWSSAPRQRLRSLDTFRGLSLIIMVFVNYGGGKYWFFKHESWNGLTVAD 204

Query: 75  FVMPFFLFIVGV----AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
            V P+F+FI+G     +++  L++   +   + K+++R+  L+  G+++       P+  
Sbjct: 205 LVFPWFVFIMGTSISLSLSSMLRQGSSKWKVLGKILWRSFLLILLGVIVVN-----PNYC 259

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWL 186
              +    +R+ GVLQR+ L+YL+V+ +E +FT+   +           R  L  W  W+
Sbjct: 260 LGPLSWENLRIPGVLQRLGLAYLVVAALELLFTRAGAESGTLETPCPALRDILPYWPQWV 319

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
               + V++L L +   VP      +      D+G   N T G          A GYIDR
Sbjct: 320 FVLMLEVLWLCLTFLLPVPGCPRGYLGPGGIGDFGNYANCTGG----------AAGYIDR 369

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +LG  H+Y HP+   S    Q +                P++PEG+L ++++I    +G
Sbjct: 370 LLLGDKHIYQHPS---SNVIYQTT---------------MPYDPEGILGTINTIFMAFLG 411

Query: 306 VHFGHVIIHTKGH----LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           +  G +I+  K      ++R   W + MG    I          IP+NK L+++SYV  T
Sbjct: 412 LQAGKIILFYKDQHKRIMSRFFIWTIVMGIISAILTKCSKEEGFIPINKNLWSVSYVTTT 471

Query: 361 S 361
           S
Sbjct: 472 S 472


>gi|348577435|ref|XP_003474490.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cavia porcellus]
          Length = 638

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 182/370 (49%), Gaps = 55/370 (14%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA-- 91
           RL  LD FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+FI+G ++ L+  
Sbjct: 238 RLRCLDTFRGIALILMVFVNYGGGRYWYFRHSSWNGLTVADLVFPWFVFIMGSSVFLSVT 297

Query: 92  --LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
             L+R   +   + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 298 SVLQRGCSKLKLLGKIAWRSFLLICIGIVIVN-----PNYCLGPLSWDKVRIPGVLQRLG 352

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR-LYC-W-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +FTK V +   S  RF   R + C W  WL+   +  ++L L +   VP
Sbjct: 353 VTYFVVAVLELLFTKPVHENCVSDRRFPFLRDITCSWPQWLLILLLESLWLGLTFLLPVP 412

Query: 206 DWQFT-----IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
              +      +      D GK  N T G          A GYID  +LG +H+Y HP   
Sbjct: 413 GCPYVSEPGYLGPGGIGDLGKYVNCTGG----------AAGYIDHLLLGSDHLYQHP--- 459

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TK 316
            S A    +                 ++PEG+L +++SI+   +GV  G ++++    TK
Sbjct: 460 -SSAVLYHT--------------KVAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTK 504

Query: 317 GHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
             L R   W  +   ++   LT    N   IP+NK L+++SYV   S  +  +   +Y +
Sbjct: 505 DILIRFTAWCCV-LGVISAALTKMSENEGFIPVNKNLWSISYVTTLSTFSFCILLVLYFI 563

Query: 375 VDCNIFGPWT 384
           +D  + G WT
Sbjct: 564 ID--VRGLWT 571


>gi|320168011|gb|EFW44910.1| hgsnat protein [Capsaspora owczarzaki ATCC 30864]
          Length = 800

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 167/352 (47%), Gaps = 52/352 (14%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           R+ SLD FRG+A+++M+ V++ GG +    H+ WNG  +AD V P+F+F++GV+++L+ +
Sbjct: 419 RVRSLDTFRGIALSIMLFVNYGGGGYWFFDHSTWNGLTVADLVFPWFIFMMGVSMSLSFE 478

Query: 94  RI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
           ++      R     KVI R++ L   G+ L          +   +     R+ GVLQR A
Sbjct: 479 KLRRRGAPRGALFLKVIRRSMTLFALGLFL----------VCRQIIFATWRMPGVLQRFA 528

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           +SYL V+ + +F           G F     +   W++    + ++  + +   VP    
Sbjct: 529 VSYLFVAAIVMFVPIFATLP---GPFRDLTSHWLQWVVIGIFITIHTCITFLYDVPGCGT 585

Query: 210 TIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
             I      D+G+  N T G          A GYID +V G  H+Y  P  +        
Sbjct: 586 GYIGPGGIGDFGQYMNCTGG----------AAGYIDSQVFG-RHIYQAPTAQ-------- 626

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
                        +    ++PEGLL  ++S++ T +G   G +++    H ARL++W   
Sbjct: 627 -----------AYYLTGAYDPEGLLGCLTSVVITFLGYQAGRILVTFSTHSARLRRWAAW 675

Query: 329 GFALLIFGLTLHFTNA----IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           G  L + GL L         IP+NK L++LS+V + +G   L+ ++ Y +VD
Sbjct: 676 GVGLGLLGLILCKGTQNDGWIPINKNLWSLSFVLIMAGLGYLMLASCYIIVD 727


>gi|426359530|ref|XP_004047024.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Gorilla
           gorilla gorilla]
          Length = 635

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 108/366 (29%), Positives = 175/366 (47%), Gaps = 51/366 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG      ++  F+FI+G +I L++ 
Sbjct: 239 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGAEGCIEMIEMFVFIMGSSIFLSMT 298

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 299 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 353

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 354 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 413

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 414 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 459

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
               +                 ++PEG+L +++SI+   +GV  G ++++    TK  L 
Sbjct: 460 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 505

Query: 321 RLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y +VD  
Sbjct: 506 RFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVD-- 562

Query: 379 IFGPWT 384
           + G WT
Sbjct: 563 VKGLWT 568


>gi|434403337|ref|YP_007146222.1| hypothetical protein Cylst_1247 [Cylindrospermum stagnale PCC 7417]
 gi|428257592|gb|AFZ23542.1| hypothetical protein Cylst_1247 [Cylindrospermum stagnale PCC 7417]
          Length = 375

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 163/363 (44%), Gaps = 83/363 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG-GD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+ AG  D  +P + HA WNGC   D V PFFLFIVGVA+ 
Sbjct: 1   MRLTSLDVFRGMTIAAMILVNMAGVADEIYPLLDHAKWNGCTPTDLVFPFFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +        K V  R L+    L   G+LL G ++       +  D+  IR  GVL
Sbjct: 61  FSLSKYTAANKPTKAVYLRILRRAAILFALGLLLNGFWNKG----VWTFDLSNIRFMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+L+YLL SL  +    +  K Q              W++A  +LV Y   +    VP
Sbjct: 117 QRISLTYLLASLAVL---QLPRKGQ--------------WILAVVLLVGYWLTMMYVPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   +IDR ++   H+Y    +      
Sbjct: 160 DYGAGVLTREG---------------------NFGAFIDRLIIPKAHLYKGDGFNLLG-- 196

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI----IHTKGHLAR 321
                                 +PEGL S++ ++++ + G   G  I    + ++  L  
Sbjct: 197 ----------------------DPEGLFSTIPAVVNVLAGYFAGEWIRSQPVKSRTSLGL 234

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
               V +G A L+ G    +    P+NK+++T SYV  TSG A L+ +A Y L++  +  
Sbjct: 235 ----VLVGVACLVIGWGWGWI--FPINKKIWTSSYVVFTSGWALLLLAACYELIEVRLIK 288

Query: 382 PWT 384
            W+
Sbjct: 289 RWS 291


>gi|66826507|ref|XP_646608.1| hypothetical protein DDB_G0270192 [Dictyostelium discoideum AX4]
 gi|60474509|gb|EAL72446.1| hypothetical protein DDB_G0270192 [Dictyostelium discoideum AX4]
          Length = 426

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 104/352 (29%), Positives = 156/352 (44%), Gaps = 79/352 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R+ SLD  RGL +  MILVD+  G+   WP ++   WNG + AD + P F+FI G +IA
Sbjct: 44  RRMGSLDAVRGLTIFGMILVDNQAGNDVIWP-LNETEWNGLSTADLIFPSFIFISGFSIA 102

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
           LALK   +       +I RTL L F    +Q   +   D   +       R+ GVLQRIA
Sbjct: 103 LALKNSKNTTSTWYGIIRRTLLLFF----IQCFLNLMGDHFNFTT----FRIMGVLQRIA 154

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           + Y    L             S   F IF       L    V V Y++++Y   VP    
Sbjct: 155 ICYFFSCL-------------SFLCFPIFL----QRLFLLSVTVTYISIMYALNVPK--- 194

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
                            CG RA L   CNA  YID KV G+N M             ++S
Sbjct: 195 -----------------CG-RANLTQNCNAGAYIDSKVFGLNIM-------------KES 223

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-----HTKGHLARLKQ 324
              GP   D          PEGL+S++SS ++  +G+ FG +       H  G+   + +
Sbjct: 224 NLNGPYYND----------PEGLISTMSSFITAWMGLEFGRIFTRFYKKHDFGNTDIIVR 273

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           W+ +    ++  ++L  T  +P NK++++ S+   T GA+  +    + L+D
Sbjct: 274 WILLVILFMVPAISLGAT-VMPFNKKIWSFSFALFTVGASGSLILIAFILID 324


>gi|392968994|ref|ZP_10334410.1| Protein of unknown function DUF2261, transmembrane [Fibrisoma limi
           BUZ 3]
 gi|387843356|emb|CCH56464.1| Protein of unknown function DUF2261, transmembrane [Fibrisoma limi
           BUZ 3]
          Length = 390

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 111/362 (30%), Positives = 161/362 (44%), Gaps = 82/362 (22%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVM 77
           S +  +S    QRL SLD FRGL VA MILV++AG DW      + HA WNG    D + 
Sbjct: 9   SVRSSESLTNPQRLLSLDAFRGLTVAGMILVNNAG-DWQYVYAPLKHAAWNGWTPTDLIF 67

Query: 78  PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           PFFLFIVGV+I  AL    +  + + K++ R+  L   G+ L    +  P       D+ 
Sbjct: 68  PFFLFIVGVSITFALAGGQEHTNVIGKILKRSFTLFMLGLFL----AFFPK-----FDIT 118

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
            +R+ GVLQRIAL YL  SL+ + T   Q                  WL+AA ++  +L 
Sbjct: 119 TVRIPGVLQRIALVYLACSLIYLRTTTRQQT----------------WLLAALLVGYWLV 162

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +           T++      Y           A L P  N   ++DR VL  +H+Y   
Sbjct: 163 M-----------TVVPVPGVGY-----------ANLEPTTNLAAWLDRTVLTTDHLY--- 197

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
             R +K                       ++PEGLLS++ +I + + GV  G  +     
Sbjct: 198 --RSTKV----------------------WDPEGLLSTIPAIGTGLAGVLVGTWLRRRDV 233

Query: 318 HLARLKQWV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             A    W+ T+G    + GL   +    P+NK L+T SYV + +G A L  +  Y L+D
Sbjct: 234 ADADKIAWLFTVGCLTTLGGLI--WDAWFPINKALWTSSYVLLAAGLAMLGLALCYWLID 291

Query: 377 CN 378
             
Sbjct: 292 AQ 293


>gi|423226736|ref|ZP_17213201.1| hypothetical protein HMPREF1062_05387 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392627009|gb|EIY21050.1| hypothetical protein HMPREF1062_05387 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 368

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 111/373 (29%), Positives = 168/373 (45%), Gaps = 104/373 (27%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
           S P +S   +K     +RL SLD+ RG+ V  MILV+++GG   +  + H+ WNG  L D
Sbjct: 4   SHPPISTSPQK-----KRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCD 58

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLFWG-------ILLQGGFSH 125
            V PFFLFI+GV+  +AL +   +A    V+KV+ RTL +L  G        +  G F  
Sbjct: 59  LVFPFFLFIMGVSTYIALSKFHFQASGSVVRKVLKRTLVILCIGWAIHWFHFICDGDF-- 116

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
                        +RL GVL RIAL Y +VS V ++        + +G           W
Sbjct: 117 --------FPFAHLRLTGVLPRIALCYCVVSFVALYV-----NHKYIG-----------W 152

Query: 186 LMAACVLVVYLALLY--GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
           ++  C++  Y  LL     Y PD        D+                     N +  I
Sbjct: 153 II-GCLIAGYAVLLCIGNGYAPD--------DT---------------------NLLAII 182

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           DR VLG +H+YH                             +P +PEGL S++S+I  T+
Sbjct: 183 DRNVLGADHLYH----------------------------KSPIDPEGLTSTLSAIAHTL 214

Query: 304 IGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGA 363
           IG   G +I+  +    +  +    GF L+  G  L  T A+PLNK++++ ++V VT G 
Sbjct: 215 IGFCCGKIILAKEALEQKTLKLFVAGFILMACGFVL--TEALPLNKRIWSPTFVLVTCGL 272

Query: 364 AALVFSAIYALVD 376
           AA++ S +   +D
Sbjct: 273 AAMLQSVLIYFID 285


>gi|427728534|ref|YP_007074771.1| hypothetical protein Nos7524_1293 [Nostoc sp. PCC 7524]
 gi|427364453|gb|AFY47174.1| hypothetical protein Nos7524_1293 [Nostoc sp. PCC 7524]
          Length = 380

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 106/360 (29%), Positives = 161/360 (44%), Gaps = 77/360 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+ AG     +  ++HA W+GC   D V PFFLFIVGVA+ 
Sbjct: 15  MRLTSLDVFRGITIAAMILVNMAGVADDVYLPLTHADWHGCTPTDLVFPFFLFIVGVAMT 74

Query: 90  LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +          + +R L+    L   G+ L G ++       +  D   IR+ GVL
Sbjct: 75  FSLSKYTQDNKPTSAIYWRILRRAAILFILGLFLNGFWNQG----VWTFDFTSIRMMGVL 130

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY-LALLYGTYV 204
           QRI+LSYLL SL+ +    +  K Q              WL+A  +L+ Y LA++Y   V
Sbjct: 131 QRISLSYLLASLIVL---KLPRKGQ--------------WLLAGVLLIGYWLAMMY-IPV 172

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
           PD+   ++ ++                      N   Y+DR ++   H+Y    +     
Sbjct: 173 PDYGAGVLTREG---------------------NFGAYVDRLIIPKAHLYKGDGFN---- 207

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                 F G              +PEGL S++ +I+S + G   G  I            
Sbjct: 208 ------FMG--------------DPEGLFSTIPAIVSVLAGYFTGEWIRKQPVQTRTSVG 247

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
               G   L+ G    +    P+NK+L+T SYV  TSG A L+ +A Y L++  +   W+
Sbjct: 248 LALFGIGCLMIGWAWGWV--FPINKKLWTSSYVVFTSGWALLLLAACYELIEVRLMRRWS 305


>gi|340371415|ref|XP_003384241.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Amphimedon queenslandica]
          Length = 743

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 105/367 (28%), Positives = 178/367 (48%), Gaps = 50/367 (13%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D+ ++   S  K +RL SLD FRG+++ +MI V++ GG +   +H+ WNG  +AD V P+
Sbjct: 345 DLLNEDPLSTRKKERLRSLDTFRGMSLIIMIFVNYGGGGYWFFNHSIWNGITVADLVFPW 404

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           F++I+GV+I  + K    + D+ K  +++ ++     ++L          L  G  +   
Sbjct: 405 FVWIMGVSIVYSFK--GRKKDSFKLRLYQVVRR---SVIL----LGLGLFLNNGYRLSHW 455

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYL 196
           R+ GVLQR A++Y +V++ E+    V +K +        R   + W  WL+   +  ++L
Sbjct: 456 RIPGVLQRFAIAYFVVAMTELLAPMVYNKYKLKWDVISVRDLTHNWVQWLVIVFLESLWL 515

Query: 197 ALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
            + +    P      +     AD GK  N T G+           GYID  +L  NH+Y 
Sbjct: 516 IITFSLKAPGCPRGYLGPGGRADGGKYSNCTGGI----------AGYIDSWILTDNHIYG 565

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
           HP     KA                      ++PEG+L S++SI+    GV  G ++IH 
Sbjct: 566 HPT---CKAIYHT----------------GSYDPEGILGSINSIVMCFFGVQAGRILIHH 606

Query: 316 KGHLARLKQWVTMGFALLIFGLTLHFTNA------IPLNKQLYTLSYVCVTSGAAALVFS 369
           K   +R+ ++V  G  LL+ GL      A      IPLNK L++LS++ V +G   ++ +
Sbjct: 607 KQFGSRIVRFVVWG--LLMGGLGTILCEATLNKGVIPLNKNLWSLSFILVIAGLGYILLA 664

Query: 370 AIYALVD 376
             Y ++D
Sbjct: 665 LFYFIID 671


>gi|282898832|ref|ZP_06306819.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
 gi|281196359|gb|EFA71269.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
          Length = 375

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 100/359 (27%), Positives = 161/359 (44%), Gaps = 75/359 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRGL +A+MI+ + AG     +P +SHAPWNGC   D V PFFLFIVGVA++
Sbjct: 1   MRLISLDVFRGLTIAMMIIANMAGVVPDVYPFLSHAPWNGCTPTDLVFPFFLFIVGVAMS 60

Query: 90  LALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +    +     V F    R + L   G+LL G ++          D++ +R+ GVL
Sbjct: 61  FSLSKYSLESKLDNLVYFNLCRRAVILFTLGLLLNGFWNQGVGSF----DLQSLRVMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L+YL  SL+ +    + +K Q              W +A  +L+ Y   +    VP
Sbjct: 117 QRIGLAYLFASLIVL---KLPEKTQ--------------WALAGILLIFYWLTMMYIPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   +IDR ++   H+Y    +      
Sbjct: 160 DYGAGMLTREG---------------------NFGAFIDRLIIAKPHLYAGDGFN----- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                F G              +PEGL S++ +I++ + G   G  I  +  +       
Sbjct: 194 -----FRG--------------DPEGLFSTIPAIVNVLFGYFAGQWIRKSTINSHTSMDL 234

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           V  G   L+ G+   +    P+NK+L+T SYV  ++G   +  +A Y L++      W+
Sbjct: 235 VLWGLCSLVVGMI--WDGLFPINKKLWTSSYVLFSTGWGLVFLAACYDLIEVRKIKRWS 291


>gi|321474731|gb|EFX85695.1| hypothetical protein DAPPUDRAFT_309035 [Daphnia pulex]
          Length = 588

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 105/378 (27%), Positives = 169/378 (44%), Gaps = 62/378 (16%)

Query: 19  PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
           P V+D+      K+ RL SLD FRG+ + LMI V+   G +    HA WNG  LAD + P
Sbjct: 180 PAVADEITPKK-KSSRLKSLDTFRGITIVLMIFVNDGAGQYFIFQHATWNGLQLADVIFP 238

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF-----RTLKLLFWGILLQGGFSHAPDELTYG 133
           +F++I+GV + ++L+    R ++ K  IF     R+  L F GI+         + L   
Sbjct: 239 WFMWIMGVCMPISLRSSLRRKES-KLTIFAGILRRSCLLFFLGIM--------NNSLGGP 289

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIF--TKDVQDKDQSVGRFSIFR--LYCW-HWLM- 187
           VD+  +R+ GVLQR A++YL V    +     D+     S     +F+  +  W  W++ 
Sbjct: 290 VDLGRLRVPGVLQRFAITYLAVGTAGLLLTPADLSAPHPSSKARKLFQDIVVLWPQWILF 349

Query: 188 -----AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
                A C +  +L +  G  V       ++ D+A  G                  A GY
Sbjct: 350 LLLVAAHCFITFFLPVEEGCPVGYLGPAGLHLDNAYPGHCIG-------------GAAGY 396

Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
           IDR +L + H+++ P                        +   P++PEG+L S+      
Sbjct: 397 IDRLMLSVQHIFNKPT-------------------TIGVYGSGPYDPEGILGSMLCTFQV 437

Query: 303 IIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA----IPLNKQLYTLSYVC 358
            +G   G  ++   G  +RL +W+       + G  L   +     IP+NK L++LS+V 
Sbjct: 438 FLGAQAGMTLLIFSGWKSRLIRWLAWSVLTGLIGALLCLASQNDGWIPVNKNLWSLSFVL 497

Query: 359 VTSGAAALVFSAIYALVD 376
           VT+G A  +  A Y L+D
Sbjct: 498 VTTGLAFFLLGACYWLID 515


>gi|414077874|ref|YP_006997192.1| hypothetical protein ANA_C12665 [Anabaena sp. 90]
 gi|413971290|gb|AFW95379.1| hypothetical protein ANA_C12665 [Anabaena sp. 90]
          Length = 376

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 100/359 (27%), Positives = 164/359 (45%), Gaps = 74/359 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL+SLD+FRG+ +A MIL + AG     +  +SHA W+GC   D + P FLFIVGVA+ 
Sbjct: 1   MRLSSLDVFRGITIAAMILANMAGVADDVYRPLSHAQWHGCTPTDLIFPCFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +   +    K V  R L+    L   G++L G ++       +  D+  IRL GVL
Sbjct: 61  FSLAKYTAQNKPTKAVYLRILRRTAILFILGLVLNGFWNQG----VWTFDLSSIRLMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL+YL  SL+ +    +  K Q              WL+A  +L+ Y   +    VP
Sbjct: 117 QRIALTYLFASLIVL---KLPRKSQ--------------WLVAGGLLIAYWLTMMYIPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   +IDR ++   H+Y    +      
Sbjct: 160 DYGAGVLTREG---------------------NFGAFIDRLIIPKAHLYKGDGFN----- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                F G              +PEGL S++ +I+S + G   G  I   K   ++    
Sbjct: 194 -----FLG--------------DPEGLFSTIPAIVSVLAGYFTGQWIKDKKQATSQTSMD 234

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           + + F L    + + +  + P+NK+++T SYV  T+G A ++ +A Y L++  +   W+
Sbjct: 235 LVL-FGLCCLVIAIIWDVSFPINKKIWTSSYVLFTTGWALMLLAACYELIEVRLIKRWS 292


>gi|224537467|ref|ZP_03678006.1| hypothetical protein BACCELL_02346 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520905|gb|EEF90010.1| hypothetical protein BACCELL_02346 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 368

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 108/373 (28%), Positives = 168/373 (45%), Gaps = 104/373 (27%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
           S P +S   +K     +RL SLD+ RG+ V  MILV+++GG   +  + H+ WNG  L D
Sbjct: 4   SHPPISTSPQK-----KRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCD 58

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLFWG-------ILLQGGFSH 125
            V PFFLFI+G++  +AL +   +A    V+K++ RTL +L  G        +  G F  
Sbjct: 59  LVFPFFLFIMGISTYIALGKFHFQASGSVVRKILKRTLVILCIGWAIHWFHFICDGDF-- 116

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
                        +RL GVL RIAL Y +VS V ++        + +G           W
Sbjct: 117 --------FPFAHLRLTGVLPRIALCYCVVSFVALYV-----NHKYIG-----------W 152

Query: 186 LMAACVLVVYLALLY--GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
           ++  C++  Y  LL     Y PD        D+                     N +  I
Sbjct: 153 II-GCLIAGYAVLLCIGNGYAPD--------DT---------------------NLLAII 182

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           DR +LG +H+YH                             +P +PEGL S++S+I  T+
Sbjct: 183 DRNILGADHLYH----------------------------KSPIDPEGLTSTLSAIAHTL 214

Query: 304 IGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGA 363
           IG   G +I+  +    +  +    GF L+  G  L  T A+PLNK++++ ++V VT G 
Sbjct: 215 IGFCCGKIILAKEALEQKTLKLFVAGFILMACGFVL--TEALPLNKRIWSPTFVLVTCGL 272

Query: 364 AALVFSAIYALVD 376
           AA++ S +   +D
Sbjct: 273 AAMLQSVLIYFID 285


>gi|427709244|ref|YP_007051621.1| hypothetical protein Nos7107_3914 [Nostoc sp. PCC 7107]
 gi|427361749|gb|AFY44471.1| hypothetical protein Nos7107_3914 [Nostoc sp. PCC 7107]
          Length = 375

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 104/359 (28%), Positives = 158/359 (44%), Gaps = 75/359 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+ AG     +P ++HA W+GC   D V PFFLFIVGVA+ 
Sbjct: 1   MRLTSLDVFRGITIAAMILVNMAGVADDVYPLLAHADWHGCTPTDLVFPFFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKV---IFRTLKLLF-WGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +          V   IFR   +LF  G+LL   ++          D   IR+ GVL
Sbjct: 61  FSLSKYTADNKPTSTVYLRIFRRAAILFALGLLLNVFWNKGVGTF----DFSSIRIMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+LSYLL SL  +   ++  K Q              W++AA +L+ Y   +    VP
Sbjct: 117 QRISLSYLLASLAVL---NLPRKGQ--------------WILAAVLLIGYWLTMMYVPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           ++   ++ ++                      N   Y DR ++   H+Y    ++     
Sbjct: 160 EYGAGVLTREG---------------------NFGAYFDRLIIPQTHLYAGDGFKSMG-- 196

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                 +PEGL S++ +++S + G   G  I             
Sbjct: 197 ----------------------DPEGLFSTIPAVVSVLAGYFTGQWIRKQPVQTRTSVGL 234

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           +  G   LI G    +    P+NK+L+T SYV  TSG A L+ +A Y L++  +   W+
Sbjct: 235 ILFGMGCLIIGWAWGWV--FPINKKLWTSSYVIFTSGWALLLLAACYELIEVRLIKRWS 291


>gi|427716050|ref|YP_007064044.1| hypothetical protein Cal7507_0722 [Calothrix sp. PCC 7507]
 gi|427348486|gb|AFY31210.1| hypothetical protein Cal7507_0722 [Calothrix sp. PCC 7507]
          Length = 375

 Score =  124 bits (311), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 103/356 (28%), Positives = 163/356 (45%), Gaps = 69/356 (19%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+  G     +P + HA WNGC   D V PFFLFIVGVA+ 
Sbjct: 1   MRLTSLDVFRGITIAAMILVNMVGVADDKYPLLDHAEWNGCTPTDLVFPFFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            +L +  +       V +R L+     + L    +   ++  +  D+  IRL GVLQRI+
Sbjct: 61  FSLSKYTEGNKPNSSVYWRILRRAAILLALGLLLNGFWNKGVWTFDLSSIRLMGVLQRIS 120

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           LSYL+ S+  +   ++  K Q              W++AA +L+ Y   +    VP    
Sbjct: 121 LSYLVASVTVL---NLPRKGQ--------------WILAAVLLIGYWLTMMYLPVPGHGA 163

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
            ++ ++                      N   YIDR ++   H+Y               
Sbjct: 164 GVLTREG---------------------NLGAYIDRLIIPKAHLYKG----------DKF 192

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM- 328
            F G              +PEGL S++ +I+S + G +F  + I ++   +R    + + 
Sbjct: 193 NFMG--------------DPEGLFSTIPAIVSVLAG-YFAGLWIRSQPVRSRTSIGLALF 237

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           G   LI G    +T   P+NK+L+T SYV  +SG A L+ +A Y L++  +   W+
Sbjct: 238 GIGCLIIGWAWGWT--FPINKKLWTSSYVVFSSGWALLLLAACYELIEVRLIRRWS 291


>gi|340727561|ref|XP_003402110.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Bombus terrestris]
          Length = 571

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 107/364 (29%), Positives = 181/364 (49%), Gaps = 54/364 (14%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           + R+ S+D FRG+ + LMI V++ GG +   +H+ W G ++AD ++P+F +I+G++I ++
Sbjct: 179 SSRIQSVDAFRGITILLMIFVNNGGGKYVFFNHSAWFGLSVADLILPWFAWIMGMSITIS 238

Query: 92  ----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
               L+    R       + R+  L+  G++L    S + ++L         R  GVLQ 
Sbjct: 239 KRAELRLTTSRVKITLCCLRRSAILILLGLMLNSIDSKSLNDL---------RFPGVLQL 289

Query: 148 IALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGT 202
           +++SY + +++E IF K   QD     GRF+IFR  L  W  WL+ A ++  +  + +  
Sbjct: 290 LSVSYFVCAILETIFMKPHSQDILLQFGRFAIFRDILDSWPQWLIMAGIMTTHTLITFFL 349

Query: 203 YVPDWQFTIINKDSADY--GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
           ++P+            +  GK  N T G          A GYIDR + G NH Y      
Sbjct: 350 HIPNCPTGYFGPGGKYHYRGKYMNCTAG----------AAGYIDRLIFG-NHTYSR---- 394

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
                  +S +   LR D          PEGL++++S+I    +GVH G +++      A
Sbjct: 395 -----ITNSIYGQILRYD----------PEGLMNTISAIFIVYLGVHAGKILLLYYQGNA 439

Query: 321 RLKQWVTMG-FALLIFGLTLHFTN---AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           RL +W     F  +I G+  +F +    IP++K++ TLSYV   S  A L+++ +Y L+D
Sbjct: 440 RLIRWFLWAVFTGIIAGILCNFESEGGVIPVSKRMMTLSYVLTCSSFAFLLYALLYFLID 499

Query: 377 CNIF 380
              F
Sbjct: 500 YKQF 503


>gi|270160204|ref|ZP_06188860.1| membrane protein, putative [Legionella longbeachae D-4968]
 gi|289165026|ref|YP_003455164.1| hypothetical protein LLO_1691 [Legionella longbeachae NSW150]
 gi|269988543|gb|EEZ94798.1| membrane protein, putative [Legionella longbeachae D-4968]
 gi|288858199|emb|CBJ12067.1| putative membrane protein [Legionella longbeachae NSW150]
          Length = 372

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 104/361 (28%), Positives = 165/361 (45%), Gaps = 84/361 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           L+ +R+ SLD+FRGL +ALM+LV+  G    +P + H+ WNGC LAD V P FLFIVG+ 
Sbjct: 6   LQNERILSLDVFRGLTMALMVLVNSLGTRISYPILLHSEWNGCTLADLVFPSFLFIVGMT 65

Query: 88  IALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LKR      + +    +  RT+ L   GI L        +     VD+  IR+ G+
Sbjct: 66  TVISLKRHIKEESKTEIYYSIFKRTIILFLLGIFL--------NVFPKNVDISSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIAL YL+ + + + T           R  IF               ++L +L G   
Sbjct: 118 LQRIALCYLICAFIYLHTTI---------RAQIF---------------IFLGILLGY-- 151

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
             W F             F++      +L    N VGYID+ +    H+           
Sbjct: 152 --WYFL----------ACFHLPVSGMNQLTITRNWVGYIDQLLFSPKHLLFR-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 F+PEGLLS++ SI +T+ G+  G++++     + + K+
Sbjct: 192 ---------------------NFDPEGLLSTIPSIATTLSGLIAGNLLL---AQIQKQKK 227

Query: 325 WVTMGFALLIFGLTLHFTN-AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
            + M  + L+F L     N + P+NK L+T S+V   SG + ++FS  Y ++D   +  W
Sbjct: 228 CILMVASGLVFLLLAWLWNYSFPINKNLWTSSFVLWCSGFSLIIFSFCYFIIDIKGYNKW 287

Query: 384 T 384
           +
Sbjct: 288 S 288


>gi|427385206|ref|ZP_18881711.1| hypothetical protein HMPREF9447_02744 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727374|gb|EKU90234.1| hypothetical protein HMPREF9447_02744 [Bacteroides oleiciplenus YIT
           12058]
          Length = 368

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 106/372 (28%), Positives = 163/372 (43%), Gaps = 102/372 (27%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
           S+P   + Q K     +RL SLD+ RG+ V  MILV+++GG   +  + H+ WNG  L D
Sbjct: 4   SQPSTFNSQPK-----KRLLSLDVLRGITVVGMILVNNSGGKLSYESLQHSAWNGLTLCD 58

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLFWG-------ILLQGGFSH 125
            V PFFLFI+G++ ++AL +   +A    V+K++ RTL +L  G        +  G FS 
Sbjct: 59  LVFPFFLFIMGISTSIALSKFHFQASGSVVRKILKRTLIILCIGWVIHWFDFICDGDFS- 117

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
                        +RL GVL RIAL Y + S V ++        + +G           W
Sbjct: 118 ---------PFAHLRLTGVLPRIALCYCVASFVALYV-----NHKYIG-----------W 152

Query: 186 LMAACVLVVYLALLYGT-YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L+   +      L  G  Y PD                               N +  ID
Sbjct: 153 LIGILLAGYTFLLCIGNGYAPD-----------------------------STNLLAIID 183

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R VLG +H+YH                             +P +PEGL S+ S+I  T+I
Sbjct: 184 RNVLGADHLYH----------------------------KSPIDPEGLTSTFSAIAHTLI 215

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G   G +I+  K    +  +   +GF L+  G  L    A+PLNK++++ ++V VT G A
Sbjct: 216 GFCCGKLILAKKNLEQKTLKLFVVGFILMACGFCL--MEALPLNKRIWSPTFVLVTCGLA 273

Query: 365 ALVFSAIYALVD 376
           A++ + +   +D
Sbjct: 274 AMLQALLIYFID 285


>gi|195133238|ref|XP_002011046.1| GI16326 [Drosophila mojavensis]
 gi|193907021|gb|EDW05888.1| GI16326 [Drosophila mojavensis]
          Length = 570

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 112/370 (30%), Positives = 175/370 (47%), Gaps = 53/370 (14%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
            + D   K+  + QR+ SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P 
Sbjct: 168 SIGDAAAKA-TQRQRMRSLDTFRGLSIVLMIFVNSGGGGYSWIEHAAWNGLHLADIVFPT 226

Query: 80  FLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FL+I+GV I L++K    R   +     ++++R  KL   G+ L           T G  
Sbjct: 227 FLWIMGVCIPLSIKAQLGRGISKPRICLRIVWRACKLFAIGLCLNS---------TNGPQ 277

Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCWHWLMAACVL 192
           +  +RL GVLQR  ++YL+  ++  I ++      Q   + +I+   L+     +   ++
Sbjct: 278 LEQLRLMGVLQRFGIAYLVAGVLHTICSRRDYLSPQRAWQRAIYDICLFSGELAVLLALI 337

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGI 250
             YL L +G  VP      +       GK  N         +P C   A GY+DR +LG 
Sbjct: 338 AAYLGLTFGLRVPGCPRGYLGPG----GKHNNAA-------DPNCIGGAAGYVDRLILGN 386

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            H+Y HP                    DA +     F+PEG+   + SI+  ++G   G 
Sbjct: 387 AHIYQHPT--------------AKFVYDASA-----FDPEGVFGCLLSIVQAMLGCFAGV 427

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTL-HFTNA---IPLNKQLYTLSYVCVTSGAAAL 366
            ++      ARL++W+     L + G  L  F+     IP+NK L++LS+V VT   A +
Sbjct: 428 TLLVHVTWQARLRRWLLGATLLGVLGGALCGFSKEHGIIPVNKNLWSLSFVFVTVALALV 487

Query: 367 VFSAIYALVD 376
           + S +Y +VD
Sbjct: 488 LLSVLYYVVD 497


>gi|328870644|gb|EGG19017.1| hypothetical protein DFA_02260 [Dictyostelium fasciculatum]
          Length = 759

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 110/399 (27%), Positives = 163/399 (40%), Gaps = 102/399 (25%)

Query: 1   MSEIKAETTHHHPLI-----ISEPDVSDQQEKSHLKT----------QRLASLDIFRGLA 45
           +  I    T   PL+     +S P  +D    +  KT          +R+ SLD  RGL 
Sbjct: 3   IENISHNHTEKSPLLNEQQHVSLPINNDDSTATITKTPSATPTTTQRKRVLSLDTVRGLT 62

Query: 46  VALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAV 102
           +  MILVD+ GG    WP +    WNG + AD + P FLFI G ++ALALK   +     
Sbjct: 63  IFGMILVDNQGGPQVIWPLL-ETEWNGLSTADLIFPSFLFICGFSVALALKSAKNDIKTW 121

Query: 103 KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT 162
             +I RTL L F    L        + + +       R+ GVLQRI++ Y          
Sbjct: 122 YNIIRRTLLLFFIQAFL--------NLMAHKFVFDSFRVMGVLQRISICYF--------- 164

Query: 163 KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV 222
                       F +  L      + AC   +YL+++YG  VP                 
Sbjct: 165 -------ACCCSFLLLPLVGQRIFLVACA-AIYLSVMYGLDVPG---------------- 200

Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
               CG R  L P CNA  YID  VLG N ++                            
Sbjct: 201 ----CG-RGVLTPSCNAGSYIDNSVLGANMIH---------------------------- 227

Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVII-----HTKGHLARLKQWVTMGFALLIFGL 337
              P +PEGLLS+ S+ ++T +G+  G +       H   HL  L +W+ +     + G+
Sbjct: 228 ---PNDPEGLLSTFSAFITTWMGLELGRIFTRFYRKHDYAHLNILIRWIGIAVVFGVTGI 284

Query: 338 TLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            L  T  +P+NK +++ S+  +T    +L+ S  Y L+D
Sbjct: 285 ALGVTK-MPVNKLIWSFSFALITVACGSLLISVAYYLLD 322


>gi|440684188|ref|YP_007158983.1| hypothetical protein Anacy_4727 [Anabaena cylindrica PCC 7122]
 gi|428681307|gb|AFZ60073.1| hypothetical protein Anacy_4727 [Anabaena cylindrica PCC 7122]
          Length = 376

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 103/360 (28%), Positives = 160/360 (44%), Gaps = 76/360 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+ +A MILV+  G     +  + HA WNGC   D V PFFLFIVGVA+ 
Sbjct: 1   MRLTSLDVFRGITIAGMILVNMVGVADNKYSLLDHAEWNGCTPTDLVFPFFLFIVGVAMT 60

Query: 90  LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +        K V  R L+    L   G+LL G ++       +  D+  IRL GVL
Sbjct: 61  FSLSKYTADNKPTKAVYLRILRRAAILFILGLLLNGFWNKG----VWTFDLSSIRLMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+LSYL  SL+ +    +  K Q +              +A  +L+ Y   +    VP
Sbjct: 117 QRISLSYLFASLIVL---KLPRKSQLI--------------LAGVLLIGYWLTMMYIPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           ++   ++ ++                      N   +IDR ++    +Y    +      
Sbjct: 160 EYGAGVLTREG---------------------NFGAFIDRLIIPKAQLYKGDGFN----- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ- 324
                F G              +PEGL S++ +I+S + G   G  I   K   ++    
Sbjct: 194 -----FMG--------------DPEGLYSTIPAIVSVLAGYFTGEWIKDKKQANSQTSMD 234

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
            V  G   L+ G+   +  A P+NK+L+T SYV  T+G A ++ +A Y L++  +   W+
Sbjct: 235 LVLFGLCCLVIGII--WDVAFPINKKLWTSSYVVFTTGWALMLLAACYELIEVRVIKRWS 292


>gi|189465173|ref|ZP_03013958.1| hypothetical protein BACINT_01518 [Bacteroides intestinalis DSM
           17393]
 gi|189437447|gb|EDV06432.1| hypothetical protein BACINT_01518 [Bacteroides intestinalis DSM
           17393]
          Length = 369

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 170/370 (45%), Gaps = 100/370 (27%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADF 75
           +P +S   +K     +RL SLD+ RG+ V  MILV+++GG   +  + H+ WNG  L D 
Sbjct: 6   QPPISGYPQK-----KRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCDL 60

Query: 76  VMPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLFWG-------ILLQGGFSHA 126
           V PFFLFI+G++  +AL +   +A    ++K++ RTL +L  G        + +G F   
Sbjct: 61  VFPFFLFIMGISTYIALNKFHFQASGPVIRKILKRTLVILCIGWAIHWFHFICEGDF--- 117

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
                    +  +RL GVL RIAL Y  VS V ++      K + +G           W+
Sbjct: 118 -------FPLAHLRLTGVLPRIALCYCAVSFVALYV-----KPKYIG-----------WM 154

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
           +    L++  A+L G         I N  + D                   N +  IDR 
Sbjct: 155 IG--FLIIGYAVLLG---------IGNGYTLD-----------------STNILAIIDRN 186

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           VLG +H+YH                             +P +PEGL S++++I  T+IG 
Sbjct: 187 VLGADHLYH----------------------------KSPIDPEGLTSTLAAIAHTLIGF 218

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
             G +I+  +    +  +    GF L+  G  L  T A+PLNK++++ ++V VT G AA+
Sbjct: 219 CCGRIILAKEALEQKTLKLFVAGFILMACGFVL--TEALPLNKRIWSPTFVLVTCGLAAM 276

Query: 367 VFSAIYALVD 376
           + + +   +D
Sbjct: 277 LQAVLIYFID 286


>gi|119512372|ref|ZP_01631456.1| hypothetical protein N9414_19342 [Nodularia spumigena CCY9414]
 gi|119462961|gb|EAW43914.1| hypothetical protein N9414_19342 [Nodularia spumigena CCY9414]
          Length = 369

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 169/359 (47%), Gaps = 81/359 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG--GD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL+SLD+FRG+ +A MILV+ AG  G+ +P ++HA W+GC   D V PFFLFIVGVA++
Sbjct: 1   MRLSSLDVFRGITIAAMILVNMAGVAGEVYPPLAHADWHGCTPTDLVFPFFLFIVGVAMS 60

Query: 90  LALKRIPDRADAVKKVIFRTLKLLF-WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
            +L +  ++       IFR   +LF  G+LL G ++       +  D+  IR+ GVLQRI
Sbjct: 61  FSLSKYTEKG---YSRIFRRAAILFALGLLLNGFWNQG----IWTFDLSKIRIMGVLQRI 113

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           +L+YLL SL  +   ++  K Q              W++A  +L+ Y   +    VP++ 
Sbjct: 114 SLAYLLASLAVL---NLPRKGQ--------------WILAGVLLIGYWLTMMYVPVPEYG 156

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
             ++ ++                      N   YIDR ++   H+Y    ++        
Sbjct: 157 AGVLTREG---------------------NFGAYIDRLIIPQVHLYAGDGYQNLG----- 190

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
                              +PEGL S++ ++++ + G   G  I +       +K   ++
Sbjct: 191 -------------------DPEGLFSTIPAVVNVLAGYFTGQWIRNQP-----VKTRTSI 226

Query: 329 GFALLIFG---LTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           G  L   G   +   +    P+NK+L+T SYV  ++G A L+ +A Y L++      W+
Sbjct: 227 GLGLFGIGCLIIGWAWGWIFPINKKLWTSSYVVFSNGWALLLLAACYELIEVRQIRRWS 285


>gi|10177926|dbj|BAB11337.1| unnamed protein product [Arabidopsis thaliana]
          Length = 384

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 69/118 (58%), Positives = 86/118 (72%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct: 144 ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 203

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
           K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQ + +
Sbjct: 204 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQNLKV 261



 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 50/98 (51%), Positives = 69/98 (70%)

Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
             V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+ + P  GPL  DAPSW
Sbjct: 259 LKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPSW 318

Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
           C APF+PEGLLSS+ + ++ ++G+H+GH+IIH K +++
Sbjct: 319 CQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKVNIS 356


>gi|405978397|gb|EKC42791.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial
           [Crassostrea gigas]
          Length = 549

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/267 (32%), Positives = 132/267 (49%), Gaps = 32/267 (11%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
            S+I  ++   H     + + S QQ  +H K +RL SLD FRGL++ +M+ V++ GG + 
Sbjct: 184 FSDISEDSGTAH-----DRNNSPQQYSTHNKRERLKSLDTFRGLSLMIMVFVNYGGGGYW 238

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWG 116
              H PWNG  +AD V P+F+FI+G A+  +    +KR   R   + KV+ R + L F G
Sbjct: 239 FFDHPPWNGITVADLVFPWFIFIMGTAMNYSFRGMMKRGTPRYRMLYKVLRRAILLFFIG 298

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR-F 175
           I+L   +          V+++ IR+ GVLQR +L+YL++ L E+        ++  GR +
Sbjct: 299 IVLNTNWGP--------VNLKTIRIPGVLQRFSLTYLVLGLFEVCFSRYDTPEKYQGRCW 350

Query: 176 SIFR---LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRA 231
           S  R   L+   W +A  +L  Y+ L +   +       I      D  K +N T G   
Sbjct: 351 SSLRDILLFLPQWFLALGILAAYVCLTFLLPIGPCPTGYIGPGGLHDSSKYYNCTAG--- 407

Query: 232 KLNPPCNAVGYIDRKVLGINHMYHHPA 258
                  A  YID  VLG NH+Y  P 
Sbjct: 408 -------AAAYIDIMVLGKNHIYGKPT 427


>gi|297172331|gb|ADI23307.1| uncharacterized conserved protein [uncultured nuHF2 cluster
           bacterium HF0770_19K18]
          Length = 373

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/353 (28%), Positives = 157/353 (44%), Gaps = 82/353 (23%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           K+ RL SLD FRGL +A MI+V+  G +W  +     HA W+GC   D V PFFLFIVGV
Sbjct: 6   KSDRLLSLDAFRGLTIAFMIIVNTPG-NWSYVYGPLRHAEWHGCTPTDLVFPFFLFIVGV 64

Query: 87  AI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           A+  + A       +D +KK+ +RT+ +  +G+LL     +A   +    D   +R+ GV
Sbjct: 65  AMRFSFAQHNYQPSSDLLKKIFWRTVTIFSFGLLL-----NAYPFIRQNWDWSSLRIMGV 119

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRI L+Y L +++ ++  + +                  W+    +L+ Y         
Sbjct: 120 LQRIGLAYGLAAILSLYLSEKK-----------------LWISCGIILIGY--------- 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
             W   ++   S  +G                 N    ID  +LG NH+     WR +  
Sbjct: 154 --WLILLLFGGSDPFGL--------------SSNIARTIDIAILGENHL-----WRGTG- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                PF+PEGLLS++ +I++ +IG   G +I      ++ ++ 
Sbjct: 192 --------------------IPFDPEGLLSTIPAIVTVLIGFSIGQLIQENSNRISLVQT 231

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
            +  G  +   G    F    P+NKQL+T +YV  T G A+   +A   L+D 
Sbjct: 232 ILIRGAGIAAVGWLWGFI--FPINKQLWTSTYVLYTGGLASFFLAAFIWLIDI 282


>gi|332709783|ref|ZP_08429740.1| hypothetical protein LYNGBM3L_44860 [Moorea producens 3L]
 gi|332351381|gb|EGJ30964.1| hypothetical protein LYNGBM3L_44860 [Moorea producens 3L]
          Length = 366

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 108/357 (30%), Positives = 157/357 (43%), Gaps = 82/357 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+A+A MILV++ G     +P + HA W+G    D V P FLFI GVA+A
Sbjct: 1   MRLTSLDVFRGIAMASMILVNNPGSWSYVYPPLLHAKWHGFTPTDLVFPAFLFIAGVAMA 60

Query: 90  LALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
            +L +  +   +V +  +R  +   +LF   LL  GF       TY  D   IR+ GVLQ
Sbjct: 61  FSLVKYTNNNQSVSQGYWRIGRRCAILFALGLLLNGFP------TYNWDT--IRIMGVLQ 112

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RI+L+Y L ++  +   +++ +                W++   VL+ Y A +    VP 
Sbjct: 113 RISLAYFLSAVAVL---NLRRRGL--------------WVLTGIVLLGYWAAMSLVPVP- 154

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                      DYG            L P  N   YIDR VLG NH+Y            
Sbjct: 155 -----------DYGA---------GNLTPEGNFAAYIDRMVLGTNHLYK----------- 183

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                             A F+PEGL S+  ++++ + G   G  + H           V
Sbjct: 184 -----------------QAQFDPEGLFSTFPAVVTVLAGYFVGDWLRHQPIPSRTSLGLV 226

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
             G   L  G    F    P+NKQL+T SYV  ++G + L+ +A Y L++   F  W
Sbjct: 227 LFGVGCLGLGWVWDFW--FPINKQLWTSSYVVFSAGWSMLLLAACYELIEVRGFRRW 281


>gi|390344818|ref|XP_795043.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Strongylocentrotus purpuratus]
          Length = 680

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 114/403 (28%), Positives = 185/403 (45%), Gaps = 72/403 (17%)

Query: 5   KAETTHHHPLIISEPDVSDQ---------QEKSHLKTQRLASLDIFRGLAVALMILVDHA 55
           K+   +H+  I+S     DQ         + KS L   RL S+D FRGLA+  ++L    
Sbjct: 255 KSLPINHNGSILSNGSQDDQTPLTFPASDKPKSSL---RLRSVDTFRGLAITHLVLGASG 311

Query: 56  GGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLK 111
            G +   +HA W G  +ADF+ P+F+FI+G +I L+   +  +  +     KK++FR++ 
Sbjct: 312 DGHFWYSNHARWYGITVADFMFPWFVFIMGTSIHLSFNILLSKGLSYCAIFKKIVFRSIS 371

Query: 112 LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ-DKDQ 170
           L   G+ +Q   SH         D+R +R+ GVLQR  ++Y +V+   + ++ +Q  + +
Sbjct: 372 LFIMGVCIQ---SHN--------DLRNLRIPGVLQRFGITYFIVASSYLLSRRLQARRAE 420

Query: 171 SVGR-FSIFRLYC--WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS---ADYGKVFN 224
             G+ + +FR         +AAC LVV+L L +   VP               + G++ N
Sbjct: 421 KTGKCYMMFRDITDYLELPLAACCLVVHLCLTFLLPVPGCPLGYQGPGGPLVGENGELTN 480

Query: 225 VTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH 284
            T G          A GYIDR      H+            T D  +   +R D      
Sbjct: 481 CTGG----------ASGYIDRTFFTEAHLI--------LVNTCDDVYRTIVRSD------ 516

Query: 285 APFEPEGLLSSVSSILSTIIGVHFG---HVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
               PEG+L + +SI   + G+  G   H+    +G L RL  W   G AL+     L  
Sbjct: 517 ----PEGILGTFTSIALCVFGLQSGKILHLFTTVRGRLVRLLLW---GLALISCSAVLCK 569

Query: 342 TNA----IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIF 380
            +     IPLNK L+++S++ +T G A +V +  + L+D   F
Sbjct: 570 CSMADGWIPLNKNLWSVSFIALTGGTAFIVQALFHVLIDVTHF 612


>gi|282896863|ref|ZP_06304869.1| conserved hypothetical protein [Raphidiopsis brookii D9]
 gi|281198272|gb|EFA73162.1| conserved hypothetical protein [Raphidiopsis brookii D9]
          Length = 375

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 99/359 (27%), Positives = 162/359 (45%), Gaps = 75/359 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRGL +A+MI+ + AG     +P +SHA WNGC   D V PFFLFIVGVA++
Sbjct: 1   MRLISLDVFRGLTIAMMIIANMAGVAPDVYPFLSHALWNGCTPTDLVYPFFLFIVGVAMS 60

Query: 90  LALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            +L +    +   K V F    R + L   G+LL G ++          D++ +R+ GVL
Sbjct: 61  FSLSKYSLESKLDKFVYFNLCRRAVILFTLGLLLNGFWNQGVGSF----DLQSLRVMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+L+YL+ SL+ +      +K Q              W +A  +L+ Y   +    VP
Sbjct: 117 QRISLAYLVASLIVL---KFPEKTQ--------------WALAGILLIFYWLTMMYIPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D+   ++ ++                      N   +IDR ++   H+Y    +      
Sbjct: 160 DYGAGMLTREG---------------------NFGAFIDRLIIAKPHLYAGDGFN----- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                F G              +PEGL S++ +I++ + G   G  +  +  +       
Sbjct: 194 -----FRG--------------DPEGLFSTIPAIVNVLFGYFAGQWMRKSTINSHTSMDL 234

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           V  G   L+ G+   +    P+NK+L+T SYV  ++G   +  +A Y L++      W+
Sbjct: 235 VLWGLCSLVVGMI--WDGLFPINKKLWTSSYVLFSTGWGLVFLAACYDLIEVRKIKRWS 291


>gi|224496100|ref|NP_001139059.1| uncharacterized protein LOC565246 precursor [Danio rerio]
          Length = 582

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 102/389 (26%), Positives = 173/389 (44%), Gaps = 61/389 (15%)

Query: 2   SEIKAETTHH---HPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD 58
             +K    HH   + + + EP+   Q ++S  K+ RL SLD FRG ++ +M+ V++ GG 
Sbjct: 168 QRLKNRMCHHGSQNSMEMEEPNTEQQIDESKPKSSRLKSLDTFRGFSLTVMVFVNYGGGG 227

Query: 59  WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLF 114
           +    HAPWNG  +AD VMP+F+FI+G ++ LA   +  +       ++KV +RT+ L+ 
Sbjct: 228 YWFFQHAPWNGLTVADLVMPWFVFIIGTSVMLAFTSMHRKGVSLLQLLRKVTWRTVVLML 287

Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVG 173
            G      +S     L    D R     G+       +L  S  +  +   +QD      
Sbjct: 288 IGFCFM-NYSPRDGILVLAADTRSSPASGL-------HLFRSGTDHNWWNPIQD------ 333

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAK 232
                 LY   WL    +  ++L L +   VP+     +      D G   N T G    
Sbjct: 334 ----VILYWPEWLFIVLLETLWLCLTFLLPVPNCPTGYLGAGGVGDAGLYPNCTGG---- 385

Query: 233 LNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGL 292
                 A  +ID+   G N M+ +P  +     T+                  PF+PEG+
Sbjct: 386 ------AAAHIDKWFFGDN-MFWYPTCKVLYRTTE------------------PFDPEGV 420

Query: 293 LSSVSSILSTIIGVHFGHVIIH----TKGHLARLKQW-VTMGFALLIFGLTLHFTNAIPL 347
           L +++SI+   +G+  G +++      KG LAR   W + +G +  I          IP+
Sbjct: 421 LGTINSIVMGFLGMQAGKILLFFRQMNKGILARFLVWALILGISAAILSKCTRDEGFIPV 480

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           NK L++LS+V   +  + L+ + +Y ++D
Sbjct: 481 NKNLWSLSFVTCMACMSFLLLAVMYFIID 509


>gi|397505551|ref|XP_003823320.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform
           2 [Pan paniscus]
          Length = 622

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 112/403 (27%), Positives = 179/403 (44%), Gaps = 88/403 (21%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct: 189 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 248

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 249 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 303

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 304 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 363

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 364 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 409

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF---------------- 308
               +                 ++PEG+L +++SI+   +GV                  
Sbjct: 410 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQCCPDWVTKQACLTEPLS 455

Query: 309 ---------------------GHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTN 343
                                G ++++    TK  L R   W  +   L+   LT    N
Sbjct: 456 PLWRILFGPCLEVRATEPAQAGKILLYYKARTKDILIRFTAWCCI-LGLISVALTKVSEN 514

Query: 344 A--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
              IP+NK L++LSYV   S  A  +   +Y +VD  + G WT
Sbjct: 515 EGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVD--VKGLWT 555


>gi|374580713|ref|ZP_09653807.1| hypothetical protein DesyoDRAFT_2145 [Desulfosporosinus youngiae
           DSM 17734]
 gi|374416795|gb|EHQ89230.1| hypothetical protein DesyoDRAFT_2145 [Desulfosporosinus youngiae
           DSM 17734]
          Length = 375

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 104/360 (28%), Positives = 165/360 (45%), Gaps = 86/360 (23%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWP-EISHAPWNGCNLADFVMPFFL 81
            EK  L   R   +DIFRGL ++LM++  + G   + P ++ HA WNG  + DFV PFF+
Sbjct: 1   MEKGKL---RFDCIDIFRGLTISLMLICSNPGNITNIPAQLRHADWNGATIGDFVFPFFI 57

Query: 82  FIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           F +G+ + +A+ R  ++  +  ++I     R++ +   G++L G     P       D+ 
Sbjct: 58  FSMGIVVPIAINRRLEKGISQMRIIINVLNRSIVMFLLGLILNG----FPT-----FDLA 108

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQ-SVGRFSIFRLYCWHWLMAACVLVVYL 196
           +IR+ GVLQRIA+ Y   +L+ +  K +  KD   +G  +         L+A  +L +Y 
Sbjct: 109 IIRVPGVLQRIAIVYFCSALIYLLFKSIVKKDLVQIGILT---------LIAVLLLAIYY 159

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
            LL G  VP  +                   G++  L      V YID K L   H+Y  
Sbjct: 160 WLLKGLQVPGIE-------------------GLKGGL------VSYIDLKYLK-GHLY-- 191

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
                                  P+     F+PEG+LS++ ++ S IIGV  G + +   
Sbjct: 192 ----------------------TPT-----FDPEGILSTIPALSSGIIGVVVGMIFLRRD 224

Query: 317 GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
               ++  +V  G  L+IF     F    P NKQL++ S+V +TSG   LV +  Y L D
Sbjct: 225 SRFVKMTIFVCSGILLIIFA--EWFNAYFPYNKQLWSSSFVLLTSGFGILVLTIFYLLTD 282


>gi|119583586|gb|EAW63182.1| hCG1993224, isoform CRA_a [Homo sapiens]
          Length = 382

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 103/351 (29%), Positives = 166/351 (47%), Gaps = 51/351 (14%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKK 104
           M+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++  I  R  +    + K
Sbjct: 1   MVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGK 60

Query: 105 VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTK 163
           + +R+  L+  GI++       P+     +    +R+ GVLQR+ ++Y +V+++E +F K
Sbjct: 61  IAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAK 115

Query: 164 DVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADY 219
            V +   S       R     W  WL+   +  ++L L +   VP      +      D+
Sbjct: 116 PVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPGCPTGYLGPGGIGDF 175

Query: 220 GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDA 279
           GK  N T G          A GYIDR +LG +H+Y HP    S A    +          
Sbjct: 176 GKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSAVLYHT---------- 211

Query: 280 PSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLARLKQWVTMGFALLIF 335
                  ++PEG+L +++SI+   +GV  G ++++    TK  L R   W  +   L+  
Sbjct: 212 ----EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLISV 266

Query: 336 GLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
            LT    N   IP+NK L++LSYV   S  A  +   +Y +VD    G WT
Sbjct: 267 ALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVK--GLWT 315


>gi|344281343|ref|XP_003412439.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Loxodonta africana]
          Length = 782

 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 107/390 (27%), Positives = 178/390 (45%), Gaps = 61/390 (15%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           D+  +  +      RL  +D FRGLA+ +M+ V++ GG +    HA WNG  +AD V P 
Sbjct: 362 DIQLEAWRPSAPPSRLRCVDTFRGLALIIMVFVNYGGGKYWYFKHASWNGLTVADLVFPC 421

Query: 80  FLFI--------------VGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
           FL I              + +++   L+R   +   + K+ +R+  L+  G+++      
Sbjct: 422 FLEILFGEDLLCTRDPLEIFLSMTSILQRGCSKFKLLGKIAWRSFLLICIGVVIVN---- 477

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGR-FSIFRLYC- 182
            P+     +    +R+ GVLQR+ ++Y +V+++E +F K V +   S    FS+  L   
Sbjct: 478 -PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPENCASERSCFSLRDLTAS 536

Query: 183 W-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAV 240
           W  WL    +  ++L L +   VP      +      D+GK  N T G          A 
Sbjct: 537 WPQWLFILTLESIWLTLTFFLPVPGCPTGYLGPGGIGDWGKYPNCTGG----------AA 586

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
           GY+DR +LG  H+Y HP    S A    +                 ++PEG+L +++SI+
Sbjct: 587 GYMDRVLLGDEHLYQHP----SSAVLYHT--------------EMAYDPEGILGTINSIV 628

Query: 301 STIIGVHFGHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTL 354
              +GV  G ++++    TK  + R   W  +   L    LT    N   IP+NK L+++
Sbjct: 629 MAFLGVQAGKILLYYKDQTKDIVIRFTAWCCI-LGLTSAALTKVSENEGFIPVNKNLWSI 687

Query: 355 SYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           SYV   S  A  +   +Y +VD  + G WT
Sbjct: 688 SYVTTLSSFAFFILLVLYPIVD--VKGLWT 715


>gi|348507459|ref|XP_003441273.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Oreochromis niloticus]
          Length = 460

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 108/377 (28%), Positives = 167/377 (44%), Gaps = 69/377 (18%)

Query: 20  DVSDQQEKSH----LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           + S   E +H     K  RL SLD FRG A+ +M+ V++ GG +    HAPWNG  +AD 
Sbjct: 60  EESHASETAHGTVKAKPTRLLSLDTFRGFALTVMVFVNYGGGGYWFFQHAPWNGLTVADL 119

Query: 76  VMPFFLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE-- 129
           VMP+F+F++G ++ LA     +R   R   ++K+ +RT+ LL  G       +++P +  
Sbjct: 120 VMPWFVFVIGTSVVLAFSSMQRRGVSRLQLLRKITWRTVVLLLLGFCF---LNYSPRDGP 176

Query: 130 ---LTYGVDVRMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFRLYCWHW 185
              L    D R     G+       +LL S+    +   VQD            LY   W
Sbjct: 177 CSVLVLAEDPRSAAASGL-------HLLCSVSPYNWWNPVQD----------ILLYWPQW 219

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L+   +  ++L L +   VP+     +      D G   N T G          A GYID
Sbjct: 220 LIIILLETLWLCLTFLMPVPNCPTGYLGAGGIGDNGLYPNCTGG----------AAGYID 269

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R + G N MY +P  +     TQ                  PF+PEG+L +++SI+   +
Sbjct: 270 RWMFGDN-MYRYPTCKEMYRTTQ------------------PFDPEGVLGTINSIVIGFL 310

Query: 305 GVHFGHVIIHTKGH----LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           G+  G ++I  K      L R   W V +G +  I          IP+NK L++LSYV  
Sbjct: 311 GMQAGKILIFYKKMNIHILCRFLVWTVILGISAAILSKCTRDGGFIPVNKNLWSLSYVMC 370

Query: 360 TSGAAALVFSAIYALVD 376
               + L+   ++ +VD
Sbjct: 371 MGCLSFLLLGGMFFVVD 387


>gi|354568330|ref|ZP_08987495.1| hypothetical protein FJSC11DRAFT_3703 [Fischerella sp. JSC-11]
 gi|353540693|gb|EHC10166.1| hypothetical protein FJSC11DRAFT_3703 [Fischerella sp. JSC-11]
          Length = 384

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 114/368 (30%), Positives = 164/368 (44%), Gaps = 86/368 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG----GDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            RL SLD+FRG+ +A MILV+ A       +P + HA W+GC   D V PFFLFIVGVA+
Sbjct: 1   MRLTSLDVFRGITIAGMILVNTASIAEPNVYPPLLHAEWHGCTPTDLVFPFFLFIVGVAM 60

Query: 89  ALALKRIPDRADAVKK-----------VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           + +  +  D     +K           +I R   L   G+LL G ++       +  D  
Sbjct: 61  SFSFSKYTDSKLHGEKEKVFVSLPYWRIIRRAAILFVLGLLLNGFWNQG----VWTFDFN 116

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY-L 196
            IR+ GVLQRI+L+YLL SLV     ++  K Q              W++A  +L+ Y L
Sbjct: 117 SIRVMGVLQRISLTYLLASLVVF---NIPRKGQ--------------WILAGVLLIGYWL 159

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
           A++   YVP           + YG       GV   L    N   YIDR ++   H+Y  
Sbjct: 160 AMM---YVP----------VSGYG------AGV---LTRDGNLGAYIDRLIIPKAHLYKG 197

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
                      +  F G              +PEGL S++ +I+S + G   G  I   K
Sbjct: 198 ----------DNYNFMG--------------DPEGLFSTIPAIVSVLAGYFAGQWIRSQK 233

Query: 317 GHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
              ++     V  G + L+ G       A P+NK+L+T SYV  T+G A L+ +  Y L+
Sbjct: 234 QINSKTSMDLVLFGLSCLVIGGIWDL--AFPINKKLWTSSYVVFTTGWALLLLAFCYELI 291

Query: 376 DCNIFGPW 383
           D      W
Sbjct: 292 DVRRVRRW 299


>gi|226225918|ref|YP_002760024.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27]
 gi|226089109|dbj|BAH37554.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27]
          Length = 401

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 111/374 (29%), Positives = 165/374 (44%), Gaps = 81/374 (21%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIV 84
              K +RL SLD+FRG+ VA M+LV++ G     +P + HAPW+G    D + PFFLFIV
Sbjct: 6   GSFKAERLLSLDVFRGMTVAGMLLVNNPGTWSAIYPPLQHAPWHGWTPTDLIFPFFLFIV 65

Query: 85  GVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQG--GFSHAPDELTYGVDVRM 138
           G+   L+L+    R D  + ++ + LK    +  +G+LL G   F+  P         R+
Sbjct: 66  GITTELSLRARRARGDDEQAILRQILKRGALIFLFGLLLAGFPFFTWPPGLPGASFGERV 125

Query: 139 I------RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
           I      R+ GVLQRI ++YL  +L+   T     + Q V              + A +L
Sbjct: 126 IDRFEHWRIMGVLQRIGVAYLCGALL---TWRTTVRQQGV--------------ILAALL 168

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV-GYIDRKVLGIN 251
             Y AL+    VPD                       R  L+ P   +  ++DR VLG+N
Sbjct: 169 FGYWALMTLVPVPD------------------TGVAGRFVLDKPDQLLSAWLDRTVLGVN 210

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H++       S A T D                    PEGLLS++ +I + I G   G  
Sbjct: 211 HLW-------SGAKTWD--------------------PEGLLSTIPAIGTMICGTFAGRW 243

Query: 312 IIHTKGHL-ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           I   +  L  RL     +G   ++ GL  H+    P+NK ++T SYV  T+G  A+  + 
Sbjct: 244 IARQELTLHERLVALFAVGALAMMVGLMWHWV--FPINKSIWTSSYVVFTAGTGAVTLAT 301

Query: 371 IYALVDCNIFGPWT 384
              L+D      WT
Sbjct: 302 CMWLIDGIGLRRWT 315


>gi|118378164|ref|XP_001022258.1| hypothetical protein TTHERM_00500990 [Tetrahymena thermophila]
 gi|89304025|gb|EAS02013.1| hypothetical protein TTHERM_00500990 [Tetrahymena thermophila
           SB210]
          Length = 827

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 91/359 (25%), Positives = 153/359 (42%), Gaps = 82/359 (22%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMP 78
            D Q+ +    QRL  LDI+RGL +  MILVD+ G     WP +    WNG + AD V P
Sbjct: 450 KDIQQPAAAPKQRLECLDIYRGLTMVGMILVDNMGNSSVIWP-LDETEWNGLSTADCVFP 508

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
            FLFI G+AI LA+K   ++     +++ R +KL   G+ L    ++   +         
Sbjct: 509 SFLFISGMAITLAIKHNGNKKQQFFRILERFVKLFVIGVALNAACANYKQQF-------- 560

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
            R+ GVLQRIA+ Y + S   +F ++                +   +++    L++Y+  
Sbjct: 561 -RIMGVLQRIAICYFVTSTSYLFLQN----------------FAVQFVLNGVFLLIYIYF 603

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
           +Y   VPD                    CG    + P CN   Y+D ++  +N+M     
Sbjct: 604 MYFFDVPD-------------------GCGAN-NVTPTCNFGRYLDMQIFTLNYM----- 638

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
                                      P +PEGL +++ ++++T IG+ +G  +   K  
Sbjct: 639 -------------------------MKPSDPEGLFTTLGALVTTFIGLCYGLALQEFKSQ 673

Query: 319 LARLK-QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             RL   W  M   L+  G    F    P+NK++++ S+V +    +    +  + +VD
Sbjct: 674 KKRLSCIWFVMSLVLVFIGGICCFLT--PINKKVWSPSFVFIVGSMSGAFLNLCFIVVD 730


>gi|327275365|ref|XP_003222444.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Anolis carolinensis]
          Length = 632

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 97/343 (28%), Positives = 159/343 (46%), Gaps = 47/343 (13%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGV----AI 88
            RL SLD FRGLA+ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G     ++
Sbjct: 235 HRLRSLDTFRGLALIIMVFVNYGGGKYWFFKHQSWNGLTVADLVFPWFVFIMGTSISLSL 294

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           +  L+R   +   + K+++R+L L   G+++       P+     +    +R+ GVLQR+
Sbjct: 295 SSMLRRGCSKWKLLGKILWRSLLLFLIGVIIVN-----PNYCLGPLSWENLRIPGVLQRL 349

Query: 149 ALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYV 204
           + +Y +V+++E +F K V D           R  L  W  WL    +  V+L L +   V
Sbjct: 350 SCTYFVVAVLELLFAKPVPDNSTLEIPCPALRDILPYWPQWLFMMALETVWLCLTFLLNV 409

Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           P      +      D+G   N T G          A  YID  +LG  H+Y HP+   S 
Sbjct: 410 PGCPNGYLGPGGIGDFGNYPNCTGG----------AAAYIDHVLLGEKHIYQHPS---SN 456

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----L 319
              Q +                 F+PEG+L +++S++   +G+  G +++  K      +
Sbjct: 457 VLYQTT---------------VAFDPEGILGTINSVIMAFLGLQAGKILLFYKDQHRQIM 501

Query: 320 ARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
            R   W V MG    +          IP+NK L+++SYV   S
Sbjct: 502 LRFCIWSVVMGVISAVLTECSKDEGFIPVNKNLWSISYVTTLS 544


>gi|328869407|gb|EGG17785.1| transmembrane protein [Dictyostelium fasciculatum]
          Length = 651

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 97/405 (23%), Positives = 171/405 (42%), Gaps = 97/405 (23%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
           E +  K  RL SLD+FRGL++ +MI V++ GG +   +H+ WNG  +AD V P+F+FI+G
Sbjct: 218 ESNQPKKDRLKSLDVFRGLSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWFIFIMG 277

Query: 86  VAI-----ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           +A+     A+ ++ +P R   + K++ R++ L   G+ +             G ++   R
Sbjct: 278 IAMPLSFNAMEIRGVPKRTIFI-KLVRRSVILFSLGLFINN-----------GNNLGHWR 325

Query: 141 LCGVLQRIALSYLLVSLVEIF--------------------------------------- 161
           + GVLQR  +SY +   + +F                                       
Sbjct: 326 ILGVLQRFGVSYFVTGCIMMFVPLYRPNGGGGGNSHHQYNRFDGTGNDREREPSESDPLF 385

Query: 162 -TKDVQDKDQ--SVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-A 217
            +  +Q+K +  S    + F  +   WL A  +L V+  + +   VP      +      
Sbjct: 386 QSSSIQEKFKAHSASMLADFIPFWLQWLFALLILAVWFLVTFLLPVPGCPTGYLGPGGLG 445

Query: 218 DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRK 277
           D G+  N T G          A   +D  +   NH++  P  +                 
Sbjct: 446 DQGQHVNCTGG----------AAKIVDLHIFSNNHIFQTPTCQ----------------- 478

Query: 278 DAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGL 337
             P +    ++PEG L  ++S+    +GVH G  I+  K + +RL +W  +  ++L+ G+
Sbjct: 479 --PIYNTGAYDPEGTLGYLTSVFMCFLGVHAGRTIMTYKSNRSRLIRWTIL--SILLCGI 534

Query: 338 TLHFTNA------IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
                        IP+NK L+T S++ + SG    V +  Y +VD
Sbjct: 535 AAGLCGVSQNGGWIPINKNLWTPSFIFLLSGFGFFVLAIFYVVVD 579


>gi|427710153|ref|YP_007052530.1| heparan-alpha-glucosaminide N-acetyltransferase [Nostoc sp. PCC
           7107]
 gi|427362658|gb|AFY45380.1| Heparan-alpha-glucosaminide N-acetyltransferase [Nostoc sp. PCC
           7107]
          Length = 387

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 111/370 (30%), Positives = 166/370 (44%), Gaps = 79/370 (21%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPF 79
           +  E S   + RL SLD+FRG+A+A MILV++ G     +P + HA W+GC   D V PF
Sbjct: 7   NVMENSSTPSTRLVSLDVFRGIAIASMILVNNPGSWDYIYPPLDHAEWHGCTPTDLVFPF 66

Query: 80  FLFIVGVAIALAL-KRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV-- 134
           FLFIVGVA+  +  K  P+      V + I R   +LF   L    F+   D L  G+  
Sbjct: 67  FLFIVGVAMPFSFAKYTPENRPTATVYQRILRRGLILFALGLFLALFTLTLDWLIKGITP 126

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           +   +R+ GVLQRI+L+Y++ +L  +          ++ R  +       W++AA +L+ 
Sbjct: 127 NFSTLRIMGVLQRISLAYVIAALAVL----------NLSRRGL-------WILAAVILIG 169

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y   +    VP             +G            L P  N  GYIDR +LG  H+Y
Sbjct: 170 YWLAMQFIPVP------------GFGA---------GNLTPEGNLGGYIDRIILG-KHIY 207

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                                           F+PEGL S++ ++++  +G +F    + 
Sbjct: 208 R----------------------------SGSFDPEGLFSTLPAVVTVFLG-YFTGDWLR 238

Query: 315 TKGHLARLK-QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           T+   +R     V  G   LI G    F    P+NKQL+T SYV  T+G A L  +  Y 
Sbjct: 239 TQPIKSRTSINLVIFGLITLIIGQLWGFI--FPINKQLWTSSYVLFTAGWALLTLALCYE 296

Query: 374 LVDCNIFGPW 383
           L++   F  W
Sbjct: 297 LIEVRHFRRW 306


>gi|260061394|ref|YP_003194474.1| hypothetical protein RB2501_07335 [Robiginitalea biformata
           HTCC2501]
 gi|88785526|gb|EAR16695.1| hypothetical protein RB2501_07335 [Robiginitalea biformata
           HTCC2501]
          Length = 382

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 110/372 (29%), Positives = 168/372 (45%), Gaps = 97/372 (26%)

Query: 22  SDQQEKSHLKTQ------RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCN 71
           +DQ+E+++ + Q      R+ S+DIFRGL +ALMILV+   G W  +     HA W+G  
Sbjct: 6   NDQRERTNPEKQTKAMKERIVSVDIFRGLTIALMILVN-TPGTWEAVYAPFRHAEWHGYT 64

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
             D V PFFLFIVG +I  A +     A   +K+I RTLKL+  GI L G F+  P    
Sbjct: 65  PTDLVFPFFLFIVGTSIVFAYRNKQPDAATHRKIIVRTLKLILLGIFL-GAFTVEPP--- 120

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
           +      IR  GVLQRI + +   +L+ + T                    +  L+A  V
Sbjct: 121 FFEPFSEIRFPGVLQRIGVVFFAAALLFLHTN-------------------YKTLLAITV 161

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
           +++    ++  ++P      +  +     +V             P N   YID  V G +
Sbjct: 162 VILLGYWVWMAFIP------LGGEPPSLERV-------------PNNWANYIDVHVFG-S 201

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H Y                     + D        ++PEGLLS++ +I S ++G+  G V
Sbjct: 202 HTY---------------------KPD--------YDPEGLLSTLPAIASALLGIFTGRV 232

Query: 312 IIHTKGHLARLKQWVTMGFALL-----IFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
           ++  +   A   QW+ +  A L     I+GL        P+NK L+T S+V VT+G A L
Sbjct: 233 LVSDR---ANKTQWMLLAGAALLAAGSIWGLVF------PVNKALWTSSFVLVTAGWANL 283

Query: 367 VFSAIYALVDCN 378
           + + IY L D  
Sbjct: 284 LLALIYYLTDVK 295


>gi|156401294|ref|XP_001639226.1| predicted protein [Nematostella vectensis]
 gi|156226353|gb|EDO47163.1| predicted protein [Nematostella vectensis]
          Length = 387

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 87/353 (24%), Positives = 154/353 (43%), Gaps = 49/353 (13%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL SLD FRG+++ +MI V+  GG +   +H+ WNG  +AD V P+F++I+GV++ L+ +
Sbjct: 1   RLKSLDTFRGISLTVMIFVNFGGGGYYFFAHSIWNGLTVADLVFPWFMWIMGVSMVLSFR 60

Query: 94  RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
            +  +  +  ++I +  K     +L   G   + +   Y       R+ GVLQR A  Y 
Sbjct: 61  VLRRKQISTYRIIIKITKRTL--LLFALGLFTSNNLTNY-------RIPGVLQRFAACYF 111

Query: 154 LVSLVEIFTKDVQDKDQSVGRF-----SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           +V+++++      +  Q  G +      +  L+   WL+    L++Y+ + Y T +    
Sbjct: 112 VVAVIQVLAGPSVEDSQPRGSWWDGIRDVVSLWA-QWLLMFAFLIIYVVVTYATELHGCP 170

Query: 209 FTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                    +D    FN T G+ +                       H  +W   K   Q
Sbjct: 171 RGYTGPGGISDNSSAFNCTGGMAS-----------------------HVDSWLLGKHVYQ 207

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
              F+   R           +PEG++ +++SI    +GV  GH +     H  RL +W  
Sbjct: 208 RGTFKDMYRTTVAH------DPEGVMGTLTSIFIVFLGVQAGHTLFTFSHHRQRLVRWFV 261

Query: 328 MGFALLIFGLTL----HFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
               L +  + L         IP+NK L+++S+V  T   A L+ S  Y  ++
Sbjct: 262 WAVLLGVIAIGLSGGTQNDGVIPINKNLWSISFVLATGSMAFLLLSFCYVTIE 314


>gi|410900570|ref|XP_003963769.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Takifugu rubripes]
          Length = 581

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 105/365 (28%), Positives = 166/365 (45%), Gaps = 71/365 (19%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + +RL SLD FRG A+ +M+ V++ GG +    HAPWNG  +AD VMP+F+FI+G ++ L
Sbjct: 196 RPKRLLSLDTFRGFALTVMVFVNYGGGGYWFFQHAPWNGLTVADLVMPWFVFIIGTSVVL 255

Query: 91  ALKRIP----DRADAVKKVIFRTLKLLFWGILLQGGF---SHAPDE-----LTYGVDVRM 138
           A + +      R   ++K+ +RT      G+LL  GF   +++P +     L    D   
Sbjct: 256 AFRSMQRRRVRRLQLLRKITWRT------GVLLMLGFCFLNYSPRDGPCSVLVLAQDSWS 309

Query: 139 IRLCGVLQRIALSYLLVSLV-EIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
               G+       +LL S+    +   VQD            +Y   WL+   +  ++L 
Sbjct: 310 PAASGL-------HLLHSITPHRWWSSVQD----------VVVYWPQWLIIILLETLWLC 352

Query: 198 LLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
           + +   VPD     +      D+G   N T G          A GYIDR + G N MY +
Sbjct: 353 VTFLMPVPDCPTGYLGAGGIGDHGLYPNCTGG----------AAGYIDRWMFGDN-MYRY 401

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
           P  +     TQ                  PF+PEG+L +V+SI+   +G+  G +++  +
Sbjct: 402 PTCKEMYQTTQ------------------PFDPEGVLGTVNSIVMGFLGMQAGKILLFYR 443

Query: 317 GH----LARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           G     L R   W V +G +  I          IP+NK L++LSYV      + L+ + I
Sbjct: 444 GADVHILCRFLLWAVILGISAAILTKCTRDEGFIPVNKNLWSLSYVTCMGCFSFLLLAGI 503

Query: 372 YALVD 376
           Y + D
Sbjct: 504 YLVTD 508


>gi|385809567|ref|YP_005845963.1| heparan-alpha-glucosaminide N-acetyltransferase [Ignavibacterium
           album JCM 16511]
 gi|383801615|gb|AFH48695.1| Heparan-alpha-glucosaminide N-acetyltransferase [Ignavibacterium
           album JCM 16511]
          Length = 378

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 103/362 (28%), Positives = 164/362 (45%), Gaps = 81/362 (22%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           T+RL SLD+FRG+ +  MILV++ G     +P++ HA W+GC   D + PFFLFIVGVA+
Sbjct: 4   TERLVSLDVFRGITIMGMILVNNPGTWSAVYPQLLHAEWHGCTFTDLIFPFFLFIVGVAV 63

Query: 89  ALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYG--VDVRMIRLC 142
           + +L +   +  ++K     +I RT+ L   GI+L G     P  L +G       +R+ 
Sbjct: 64  SYSLTKRKAQGGSMKSLYLNIIRRTVILFLLGIILNG----FPFGLLFGHQFSWETLRIP 119

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIA+ Y + + + + T     K Q             +W  AA +L++Y A++  +
Sbjct: 120 GVLQRIAIVYFVAAFLFLTT---STKFQ-------------YWFTAA-ILILYAAVM--S 160

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           ++P                   V     A   P  N   +ID+ +LG +HM     W  +
Sbjct: 161 FIP-------------------VPGIGYANFEPGKNLSAWIDQMILG-SHM-----WSGT 195

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
           K                       ++PEG+LS++ +I S ++G+  G+ +        ++
Sbjct: 196 KL----------------------WDPEGILSTIPAIGSAMLGIFTGNWLRSENDQKEKV 233

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGP 382
                    L++ G    F    PLNK L+T SYV  T+G A       Y  +D      
Sbjct: 234 VYLFIWANVLMLAGWIWSF--WFPLNKNLWTSSYVLWTAGLALHFLGFCYWFIDVKKITW 291

Query: 383 WT 384
           WT
Sbjct: 292 WT 293


>gi|387793162|ref|YP_006258227.1| hypothetical protein Solca_4061 [Solitalea canadensis DSM 3403]
 gi|379655995|gb|AFD09051.1| hypothetical protein Solca_4061 [Solitalea canadensis DSM 3403]
          Length = 393

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 106/375 (28%), Positives = 168/375 (44%), Gaps = 93/375 (24%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            Q  +     RL SLD+FRG  VA MILV++ G  W  I     HA WNGC   D + PF
Sbjct: 1   MQASASEPKPRLLSLDVFRGATVAAMILVNNPG-SWSNIYAPLEHAKWNGCTPTDLIFPF 59

Query: 80  FLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILL---------QGGFSHA 126
           FLFIVG++IA AL     R  + + A+K +  R+LKL   G++L         + G    
Sbjct: 60  FLFIVGISIAYALSGKKSRPEEHSAAIKSITIRSLKLFGLGLILALFPIVYFDKFGEVDV 119

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
            D++   +    +R+ GVLQRI + + +  ++ I     + K +++          W   
Sbjct: 120 WDQIV--MRFSGVRIMGVLQRIGIVFFIAGIIFI-----KAKPKTI---------AWT-- 161

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGV-RAKLNPPCNAVGYIDR 245
            A  +LV+Y  L+  T+VP                      GV  A L P  N   +IDR
Sbjct: 162 -AGSLLVIYYLLM--TFVP--------------------VPGVGYANLEPETNLGAWIDR 198

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +L  +H+     W++SK                 +W     +PEGLL ++ ++ + ++G
Sbjct: 199 LILTTDHL-----WKQSK-----------------TW-----DPEGLLGTIPAVATGLLG 231

Query: 306 VHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT--LHFTNAIPLNKQLYTLSYVCVTSGA 363
              G  +     +  +    +T  F    FG+   L +    P+NK L+T S+V  T+G 
Sbjct: 232 TLCGTWMKKPMDNALK----ITWLFVFAAFGIAAGLVWDLFFPINKSLWTSSFVLYTAGL 287

Query: 364 AALVFSAIYALVDCN 378
            +  F+ +Y L+D  
Sbjct: 288 GSAFFAVLYWLIDVQ 302


>gi|334121382|ref|ZP_08495452.1| hypothetical protein MicvaDRAFT_2176 [Microcoleus vaginatus FGP-2]
 gi|333455096|gb|EGK83757.1| hypothetical protein MicvaDRAFT_2176 [Microcoleus vaginatus FGP-2]
          Length = 406

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 107/387 (27%), Positives = 164/387 (42%), Gaps = 115/387 (29%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            R  SLD+FRG+A+A MILV++ G     +P + HA W+GC   D + PFFLFIVG A++
Sbjct: 1   MRFKSLDVFRGIAIASMILVNNPGSWEQVYPPLDHAEWHGCTPTDLIFPFFLFIVGCAMS 60

Query: 90  LALKR-----------------IPDRADAVKKVIFRTLKLL-----FWGI---------- 117
            +L +                   +++++ K  +  +L LL     +W I          
Sbjct: 61  FSLSKYTQNYPQTGIETSKITQTKEKSESAKNPLPSSLFLLPYSNIYWRIARRAAILFIL 120

Query: 118 -LLQGGFSHAPDELTYGVDVR---MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
            LL    S A D L     V     IR+ GVLQRI L+Y + ++  I   ++  ++Q   
Sbjct: 121 GLLLNTSSIALDVLLNSAPVENFGKIRIMGVLQRIGLAYFISAIAII---NLSPRNQK-- 175

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
                       L+A  VL+ Y A L                      VF V      +L
Sbjct: 176 ------------LLAVAVLLGYWAAL---------------------TVFAVGGYTAGEL 202

Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
            P  N  GY+DR +LG  H+Y                               PF+PEGLL
Sbjct: 203 TPEGNLGGYVDRLILGSQHLYK----------------------------GGPFDPEGLL 234

Query: 294 SSVSSILSTIIGVHFGHVI----IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNK 349
           S++ ++++ +IG   G  +    I T+  +         G + ++ G    F    P+NK
Sbjct: 235 STLPAVVTVLIGYFTGEWLRVQPIKTRTSI----NLAICGLSCVVIGRLWGFL--FPINK 288

Query: 350 QLYTLSYVCVTSGAAALVFSAIYALVD 376
           QL+T SYV  T+G A L+ +  Y  ++
Sbjct: 289 QLWTSSYVVFTAGWALLLLATCYETIE 315


>gi|428308802|ref|YP_007119779.1| hypothetical protein Mic7113_0454 [Microcoleus sp. PCC 7113]
 gi|428250414|gb|AFZ16373.1| hypothetical protein Mic7113_0454 [Microcoleus sp. PCC 7113]
          Length = 381

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 105/355 (29%), Positives = 156/355 (43%), Gaps = 82/355 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAG--GD---WPEISHAPWNGCNLADFVMPFFLFIVGV 86
           ++RL SLD+FRG+ +A MILV+  G  GD   +P + HA WNG    D V PFFLFIVG 
Sbjct: 9   SKRLTSLDVFRGITIAGMILVNMIGVAGDKNVYPPLLHADWNGFTPTDLVFPFFLFIVGA 68

Query: 87  AIALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           A+A +  +             ++I R+L L   GILL G + +         +   IR+ 
Sbjct: 69  AMAFSFSKYKHGNKPTPTVYWRIIRRSLILFALGILLNGFWEY---------NWSSIRIM 119

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRI+L+YL+ SL+ +   +V  K Q              W +AA +L+ Y   +   
Sbjct: 120 GVLQRISLTYLIASLIVL---NVPRKGQ--------------WAIAAFLLIGYWFAMSLI 162

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            VP            DYG            L    N   Y DR ++   H+Y        
Sbjct: 163 PVP------------DYG---------MGNLTREGNFGAYFDRLIIPTAHLY-------- 193

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
               +   F G              +PEGL S++ +++S + G   G  +          
Sbjct: 194 ----KGDDFNG------------MGDPEGLFSTLPAVVSVLFGYLTGDWLRQQPIKSTTS 237

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
              V +G + L+ G    F    P+NK+L+T SYV  T+G A L+ +A Y  ++ 
Sbjct: 238 MDLVLLGLSCLVIGQVWDF--WFPINKKLWTSSYVLFTTGWALLLLAACYESIEV 290


>gi|284041413|ref|YP_003391343.1| heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
           DSM 74]
 gi|283820706|gb|ADB42544.1| Heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
           DSM 74]
          Length = 364

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 102/356 (28%), Positives = 158/356 (44%), Gaps = 90/356 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           +  RL SLD  RG  +A MI+ +  G +   +  + H+ WNG +  D + PFFLFIVGV+
Sbjct: 3   QPHRLISLDAMRGFTIAAMIVANFPGSEEFVYFTLRHSRWNGLSFTDLIAPFFLFIVGVS 62

Query: 88  IALALKRIPDRADA------VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           I LA  R   RAD       ++K++ R+LK+   G+ L    +  PD      D   +R 
Sbjct: 63  IVLAYAR--KRADGSPKGPLIQKIVLRSLKIFAVGMFL----NLLPD-----FDFATLRW 111

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            G L RIA+ +L+ +L+ + T   Q                  W+    +L  +LA+   
Sbjct: 112 TGTLHRIAIVFLVCALLFLTTSWRQQA----------------WIATLTLLAYWLAM--- 152

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
           T +P          + D G+V          L P  N   ++DR+ L             
Sbjct: 153 TQIP----------TPDVGRVV---------LEPGQNLAAWLDRRYL------------- 180

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                       P R    +W     +PEG+LS+  SI++ I+G+  G +++      A+
Sbjct: 181 ------------PGRMWQGTW-----DPEGILSTFPSIVTGILGMLAGRLMVSPASQTAK 223

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           +   +T G      G     T   P+N+ L+T S+V VT+G AALV  A+Y LVD 
Sbjct: 224 VSYLMTAGVFTAALGYFWGLT--FPVNENLWTSSFVLVTAGFAALVLGAVYFLVDV 277


>gi|270339962|ref|ZP_06203500.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
 gi|270333113|gb|EFA43899.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
          Length = 389

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 99/351 (28%), Positives = 159/351 (45%), Gaps = 78/351 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           QRL SLD+ RGL V LMI V++  G   +  + H+ WNG  L D V PFFLF+VGV+  L
Sbjct: 18  QRLISLDVLRGLTVMLMIFVNNGAGTQIFSPLRHSRWNGMTLCDLVFPFFLFMVGVSTYL 77

Query: 91  ALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           +L++      A  ++K+  RT  L   G+ +   F  A +     +D+  +R+ GV+QRI
Sbjct: 78  SLRKSNFAWSAKTLRKIARRTALLFLIGLTIN-WFDMACNG--SPLDLAHLRIMGVMQRI 134

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           AL Y   + V I +  V             RL+   WL+A  ++   L L+ G       
Sbjct: 135 ALCYGATAFVAILSSKVPQ-----------RLHLIPWLIAVLLIAYSLLLIIG------- 176

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
                    DY              +   N +  +D  +LG +H+YH             
Sbjct: 177 ------GGYDY--------------SSATNLLAIVDTHILGYDHLYH------------- 203

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA---RLKQW 325
                           +P +PEGLLS++ +I  T+IG     + I  +G  +    ++ +
Sbjct: 204 ---------------RSPVDPEGLLSTLPAIAHTLIGFWVARLTIGKQGSHSTQNTVRMF 248

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           +  G  L++ G+ L  T  +PLNK++++ SYV  T G A+     +  L++
Sbjct: 249 LLAGAMLVVSGIIL--TLLLPLNKRIWSPSYVLTTCGLASFTQGLLVYLIE 297


>gi|374263976|ref|ZP_09622521.1| hypothetical protein LDG_8987 [Legionella drancourtii LLAP12]
 gi|363535543|gb|EHL28992.1| hypothetical protein LDG_8987 [Legionella drancourtii LLAP12]
          Length = 372

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 94/355 (26%), Positives = 159/355 (44%), Gaps = 75/355 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           ++R+ SLD+FRGL +ALM+LV+  G    +P + HA WNGC LAD V P FLFIVGV   
Sbjct: 5   SKRILSLDVFRGLTMALMVLVNSQGSRSIYPILDHAAWNGCTLADLVFPAFLFIVGVTTV 64

Query: 90  LALKRIPDRADAVKKVIFRT-LKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           ++L R     +A +  I+++ LK      L     +  P    + +    +R+ G+LQRI
Sbjct: 65  VSLNRQVTTNEAARLDIYKSILKRSILLFLFGLFLNAFPFH--FDLSFANLRIYGILQRI 122

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           A+ Y + +L+ +   +   K Q +  + I   Y W+W+               T +P   
Sbjct: 123 AICYFICALIYL---NTTVKTQIILFWGILLGY-WYWI---------------TQIPVPG 163

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
           F+                     +L+   N V Y+D+ +                     
Sbjct: 164 FS-------------------GGQLSLANNWVAYVDKMIF-------------------- 184

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
                     +P   H  F+PEGL+S++S++ +T+ G+  GH ++       +      +
Sbjct: 185 ----------SPVHLHKNFDPEGLISTISAVATTLAGLITGHFLLMQLSKKKKCLLMFLV 234

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
           G A L+ G    +    P+NK L+T S+V  T G + +VF+  + ++D   +  W
Sbjct: 235 GMAFLVLGWAWGYY--FPINKNLWTSSFVLWTGGVSLIVFALCFYIIDVLGYSKW 287


>gi|395803976|ref|ZP_10483217.1| hypothetical protein FF52_18915 [Flavobacterium sp. F52]
 gi|395433620|gb|EJF99572.1| hypothetical protein FF52_18915 [Flavobacterium sp. F52]
          Length = 423

 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 109/395 (27%), Positives = 166/395 (42%), Gaps = 120/395 (30%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL SLD+FRGL + LM +V++ G DW    P + HA WNGC   D V PFF+FI+GVA+
Sbjct: 4   ERLISLDVFRGLTILLMTIVNNPG-DWGNVYPPLLHAHWNGCTPTDLVFPFFIFIMGVAV 62

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGIL------------------------------ 118
            LA+           K++ R+L++   GI                               
Sbjct: 63  PLAMPEKKYDETTFNKILIRSLRMFCLGIFFNFFGKIQLFGLDGIPLLLVRLIITFAVGY 122

Query: 119 -LQGGFSHAPDE------------LTYG--VDVRMIRLCGVLQRIALSYLLVSLVEIFTK 163
            L G FS+                L YG   +   +RL GVLQRIA+ Y +VSL+ + T 
Sbjct: 123 ALMGNFSNKLKNIFAFSILAIYIILAYGGFENYADVRLPGVLQRIAIVYFVVSLLYLKT- 181

Query: 164 DVQDKDQSVGRFSIFRLYCWHWLMAACVLVV-YLALLYGTYVPDWQFTIINKDSADYGKV 222
               K Q               L    VL+  Y A++  T VP                 
Sbjct: 182 --SRKTQ---------------LFTGIVLLFGYWAIM--TLVP----------------- 205

Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
             V     A L    N   ++D  +L   HMYH                      +  +W
Sbjct: 206 --VPGIGEANLERGTNLAAWVDSVLLK-GHMYH----------------------ETNTW 240

Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFT 342
                +PEG+LS++ SI++ IIG+  G +++     + + ++   +G +L+ FGL     
Sbjct: 241 -----DPEGILSTIPSIVNGIIGLFIGQILLLNITKIQKAQRMGMIGTSLIFFGLIWDL- 294

Query: 343 NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
              P+NK ++T SYV  T+G A +  + +Y ++D 
Sbjct: 295 -VFPINKSIWTSSYVLYTTGLATVCLTVLYYIIDI 328


>gi|428319268|ref|YP_007117150.1| Heparan-alpha-glucosaminide N-acetyltransferase [Oscillatoria
           nigro-viridis PCC 7112]
 gi|428242948|gb|AFZ08734.1| Heparan-alpha-glucosaminide N-acetyltransferase [Oscillatoria
           nigro-viridis PCC 7112]
          Length = 406

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 107/373 (28%), Positives = 155/373 (41%), Gaps = 115/373 (30%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+A+A MILV++ G     +P + HA WNGC   D V PFFLFIVG A++
Sbjct: 1   MRLKSLDVFRGIAIASMILVNNPGSWEQVYPPLDHAEWNGCTPTDLVFPFFLFIVGCAMS 60

Query: 90  LALKR----IPDRADAVKKVIFRTLKL------------------LFWGI---------- 117
            +L +     P       K+I +  KL                  ++W I          
Sbjct: 61  FSLSKYIQNYPKTGIETSKIIQKNEKLESDKNPFPSSFFLLPASNIYWRIARRAAILFIL 120

Query: 118 -LLQGGFSHAPDELTYGVDVR---MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
            LL    S A D L     V     IR+ GVLQRI L+Y + ++  +   ++  ++Q   
Sbjct: 121 GLLLNTSSIALDVLLNSAPVENFGKIRIMGVLQRIGLAYFIGAIAIL---NLSPRNQK-- 175

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
                       L+AA VL+ Y   L                      VF V      +L
Sbjct: 176 ------------LLAAAVLLGYWGAL---------------------TVFAVGGYTAGEL 202

Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
            P  N  GY+DR +LG  H+Y                               PF+PEGLL
Sbjct: 203 TPEGNLGGYVDRLILGSQHLYK----------------------------GGPFDPEGLL 234

Query: 294 SSVSSILSTIIGVHFGHVI----IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNK 349
           S++ ++++ +IG   G  +    I T+  +         G + ++ G    F    P+NK
Sbjct: 235 STLPAVVTVLIGYFTGEWLRVQPIKTRTSI----NLAICGLSCVVIGHLWGFL--FPINK 288

Query: 350 QLYTLSYVCVTSG 362
           QL+T SYV  T+G
Sbjct: 289 QLWTSSYVVFTAG 301


>gi|305665862|ref|YP_003862149.1| hypothetical protein FB2170_06240 [Maribacter sp. HTCC2170]
 gi|88710633|gb|EAR02865.1| hypothetical protein FB2170_06240 [Maribacter sp. HTCC2170]
          Length = 362

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 103/351 (29%), Positives = 155/351 (44%), Gaps = 85/351 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
            R+ S+DIFRGL + LMILV+  G  W  +     HA W+G    D V PFFLFIVG +I
Sbjct: 3   NRVISVDIFRGLTIVLMILVNTPG-TWSSVYTPFLHAEWHGYTPTDLVFPFFLFIVGTSI 61

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           + A ++        KK+  R+LKL+  G+ L G F+ +     +  D   IR  GVLQRI
Sbjct: 62  SFAYQKKKASTQTYKKIAVRSLKLIGLGLFL-GAFTLS---FPFIKDFADIRFPGVLQRI 117

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC-VLVVYLALLYGTYVPDW 207
            + +L  ++                   +F  + W  L+  C VL+V   LL G YVP  
Sbjct: 118 GVVFLFTAV-------------------LFVNFNWKTLLGICIVLLVGYWLLMG-YVP-- 155

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLN-PPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                               G+ +  +  P N   Y+D K+ G  H Y            
Sbjct: 156 ------------------VEGIESTFDRAPNNLANYLDVKIFG-THNY------------ 184

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                    + D        ++PEG LS++ SI S + GV  G ++   K +    K  V
Sbjct: 185 ---------KPD--------YDPEGFLSTLPSIASALTGVFTGLILTSKKDN----KTMV 223

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
            +G  +++  L   +    P+NK L++ S+V VTSG A +  + +Y + D 
Sbjct: 224 LVGLGVVMLALGYLWHTVFPINKALWSSSFVLVTSGWANIFLALVYYISDV 274


>gi|260788586|ref|XP_002589330.1| hypothetical protein BRAFLDRAFT_217958 [Branchiostoma floridae]
 gi|229274507|gb|EEN45341.1| hypothetical protein BRAFLDRAFT_217958 [Branchiostoma floridae]
          Length = 382

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 166/352 (47%), Gaps = 51/352 (14%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +RL SLD FRG+ + +M  V++ GG +  + H+ WNG  +AD V P+F++I+G + AL+ 
Sbjct: 1   RRLKSLDTFRGMCLCIMAFVNYGGGGYWFLDHSVWNGITVADLVFPWFMWIMGTSTALSF 60

Query: 93  KRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           + +  +A        K++ RT+ L   G+ +     +APD      D   IR+ GVLQR 
Sbjct: 61  RGLQRKATPKLTIFGKIVRRTITLFLLGLFI----VNAPD------DWATIRIPGVLQRF 110

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           A+SY  VS + +   + +     V        Y   WL   C+L V+  L +   VP   
Sbjct: 111 AVSYFAVSTMMLLHMETEWYRDLVP-------YWKQWLFVLCLLAVHTCLTFLMPVPGCP 163

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
              +           N T G          A GYID  +L  +H+Y             +
Sbjct: 164 TGYLGAGGLSDLDHTNCTGG----------AAGYIDNWLLTQDHIY-----------GDE 202

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
           +P    L +   +     ++PEG+L S++SI  T +G+  G +++  + H +RL +W+  
Sbjct: 203 TPKVRILYQILVN-----YDPEGVLGSLTSIFMTFLGLQAGKILLSYEDHGSRLVRWLLW 257

Query: 329 GFALLIFGLTL----HFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           G  L +  + L         +P+NK L+++++V   +  A ++ S  Y LVD
Sbjct: 258 GIGLGLLAILLCEGRQNGGWVPINKNLWSVTFVLSLASMAFILLSVYYFLVD 309


>gi|281209662|gb|EFA83830.1| hypothetical protein PPL_02898 [Polysphondylium pallidum PN500]
          Length = 409

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 102/365 (27%), Positives = 148/365 (40%), Gaps = 87/365 (23%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFV 76
           DV           +R+ SLD  RGL +  MILVD+ GG    WP +    WNG + AD +
Sbjct: 24  DVDKDTTSKPPPKKRMLSLDTARGLTIFGMILVDNQGGPEVIWP-LKETDWNGISTADLI 82

Query: 77  MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
            P FLFI G +I+LALK   +       +I RT+ LLF       G     + + +    
Sbjct: 83  FPSFLFICGFSISLALKNAKNDRPTWINIIRRTI-LLF-------GIQLFLNLMAHKFVF 134

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV-LVVY 195
              R+ GVLQRI+L Y                       S   L  W   +A  +   +Y
Sbjct: 135 STFRVMGVLQRISLCYCFSCC------------------SFMLLPKWAQRVALVISATIY 176

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
           L L+Y   VP                     CG R  +   CNA GYID  +L  N ++ 
Sbjct: 177 LCLMYAYPVPG--------------------CG-RGNITRSCNAAGYIDNLILRKNMIH- 214

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-- 313
                                         P +PEG +S+ S+ ++T +GV  G ++   
Sbjct: 215 ------------------------------PTDPEGFISTFSAFITTWMGVELGRILTTH 244

Query: 314 --HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
                G    L +W+++G    + GL L  TN I  NK +++ S+  +T    AL  SA+
Sbjct: 245 ARSADGWKDILIRWLSIGMVCAMIGLFLDATNVIQFNKIIWSFSFAMLTVACGALFLSAL 304

Query: 372 YALVD 376
           Y  +D
Sbjct: 305 YYSMD 309


>gi|321463338|gb|EFX74354.1| hypothetical protein DAPPUDRAFT_129175 [Daphnia pulex]
          Length = 409

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 85/345 (24%), Positives = 160/345 (46%), Gaps = 46/345 (13%)

Query: 42  RGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA 101
           R L +  MI+V++ GG +    H+PWNG  +AD + P F++I+G +  L+L     RA +
Sbjct: 29  RRLTIVFMIIVNYGGGGYWFFEHSPWNGITIADVIFPCFVWILGASCVLSLNSQLRRALS 88

Query: 102 VKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
            +++++    R++ +L  G++L    ++         +++  R+ GVLQR++  YL+V+L
Sbjct: 89  KQRILYSTVRRSVAMLVIGLVLNSLSNN---------NIKTFRIPGVLQRMSFVYLIVAL 139

Query: 158 VEIFTKDVQDKDQSVGRFSIFRLYC-W-HWLMAACVLVVYLALLYGTYVPDWQFTIINKD 215
           +E+   D +D  +      I  + C W  W++    +   L + +   VPD         
Sbjct: 140 IELTGFDPEDNQRYAWFAPIRDIVCSWRQWIIVTVFVSTQLLITFLLPVPDCPLGYTGAG 199

Query: 216 SADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPL 275
             +   ++    G  A+L         +D  + G +H+Y  P        T  + ++  L
Sbjct: 200 GLEKNGLYRNCTGGAARL---------VDVSLFGNDHIYQRP--------TPRAIYDATL 242

Query: 276 RKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LI 334
                      F+PEG L  ++ +L   +G     V++    +  R+ +W+       L 
Sbjct: 243 ----------AFDPEGALGGLTCVLCAYLGAEAAKVLLVFPANKQRIVRWMLWALVTGLS 292

Query: 335 FGLTLHFT---NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            G+   F      IP+NK L++LSYV VTS  A ++ + +Y  +D
Sbjct: 293 GGILCDFKIDDGPIPINKNLWSLSYVLVTSSIAFILLTILYVFID 337


>gi|375149929|ref|YP_005012370.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361063975|gb|AEW02967.1| Protein of unknown function DUF2261, transmembrane [Niastella
           koreensis GR20-10]
          Length = 392

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 107/364 (29%), Positives = 151/364 (41%), Gaps = 94/364 (25%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +R  SLD+FRG  VA MILV++ G  W  +     HAPW+GC   D V PFFLF VG A+
Sbjct: 3   KRFYSLDVFRGATVAFMILVNNPG-SWSNLYAPLEHAPWHGCTPTDLVFPFFLFAVGNAL 61

Query: 89  ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGG--FSHAPDELTY------GVDVRM 138
           A  + R+ +      +KKVI R+  +   G  L          D LT+      G +  +
Sbjct: 62  AFVMPRLQEAGTTAFLKKVITRSFLIFLIGFFLNWSPFIRWDNDHLTFKAWEYAGANGNL 121

Query: 139 --IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             IR+ GVLQRIAL Y   SL+  F K                      +  A V    L
Sbjct: 122 IGIRILGVLQRIALCYFFASLIIYFFK----------------------IRGAFVSAFVL 159

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----IDRKVLGINH 252
            L Y      W   +   ++AD                 P +  GY    +D+ +LG +H
Sbjct: 160 LLGY------WVLCMFFGNAAD-----------------PYSLNGYFGLGVDKAILGESH 196

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           MYH                EG             F+PEG+ S++++I+  I G   G  I
Sbjct: 197 MYHG---------------EG-----------VAFDPEGITSTLTAIVQVIFGYFVGFYI 230

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
                +   L      G  L+  G    +    P+NK+++T SYV  T+G A LV S   
Sbjct: 231 QQKGKNFEMLSHLFVAGCILIFTGYA--WDMMFPINKKIWTSSYVLYTTGLAILVLSLCI 288

Query: 373 ALVD 376
            L++
Sbjct: 289 FLIE 292


>gi|393783262|ref|ZP_10371437.1| hypothetical protein HMPREF1071_02305 [Bacteroides salyersiae
           CL02T12C01]
 gi|392669541|gb|EIY63029.1| hypothetical protein HMPREF1071_02305 [Bacteroides salyersiae
           CL02T12C01]
          Length = 365

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 99/355 (27%), Positives = 162/355 (45%), Gaps = 78/355 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           QRL SLD+ RG+ VA MILV++AG     +  + HA W+G   AD V P F+F++G++  
Sbjct: 9   QRLLSLDVLRGITVAGMILVNNAGACGYGYAPLRHAKWDGFTPADLVFPMFMFLMGISTY 68

Query: 90  LALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           ++L++   +    + K+I R   L+  GI ++    H+ +   +  D   +R+ GV+QR+
Sbjct: 69  ISLRKYNFQWQLTIGKIIKRAFLLILIGIAMK-WLIHSFETGIWN-DWEHMRILGVMQRL 126

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
            + Y + +++ +F            RF                L + L LL G +     
Sbjct: 127 GICYGITAVMALFIPH--------KRF----------------LPIALLLLIGYF----- 157

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
             I+      + K              P N +  +D  VLG +HMY            Q 
Sbjct: 158 --ILQLAGNGFEK-------------SPDNIMAIVDSTVLGTSHMY-----------LQG 191

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
             F                EPEG+LS++ ++   +IG   GH++I+ K +  R++Q   M
Sbjct: 192 RQF---------------VEPEGILSTIPAVAQVMIGFVCGHMLINRKDNQERMQQLFFM 236

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
           G  LL  G  L +  A PLNK+L++ S+V VT G AAL  + +  ++D      W
Sbjct: 237 GTLLLFAGFLLSY--ACPLNKRLWSPSFVLVTCGIAALALAVLIEIIDVRKKKEW 289


>gi|345856403|ref|ZP_08808889.1| putative membrane protein [Desulfosporosinus sp. OT]
 gi|344330527|gb|EGW41819.1| putative membrane protein [Desulfosporosinus sp. OT]
          Length = 381

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 104/367 (28%), Positives = 167/367 (45%), Gaps = 100/367 (27%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFL 81
           +EK   K  RL  +D+FRG+AVA+M++V + G    ++P++ HA WNG  +AD   PFF+
Sbjct: 8   EEKG--KFGRLNCIDVFRGIAVAIMLIVTNPGNPLRNYPQLRHAAWNGYTVADLAFPFFM 65

Query: 82  FIVGVAIALAL-KRIPDRADAV---KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
            I+G+ I  A+ KRI +    +     ++ R++ L   GILL G   +         D+ 
Sbjct: 66  LIMGMVIPYAVDKRIKEGKSNLSIFNHILIRSIGLFCIGILLNGFPVY---------DLS 116

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD--QSVGRFSIFRLYCWHWLMAACVLVVY 195
           +IR+ GVLQRIA++YL   ++E+  K    K   Q +   S          +A  ++ VY
Sbjct: 117 IIRIPGVLQRIAIAYLCTGIIELIVKATVKKSYLQIIVESS----------LALSIISVY 166

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
             LL     PD++                             N V  ID   L   H+Y 
Sbjct: 167 SVLLIKYSFPDYK-----------------------------NLVQTIDLYFLK-GHLY- 195

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
                                   P W     +PEG+L++ SSI + I G   G+ I+  
Sbjct: 196 -----------------------TPDW-----DPEGILTTFSSIATAIFGSIAGN-ILFN 226

Query: 316 KGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVTSGAAALVFSA 370
           + + AR K+++T    + I+G+      +I     P NK L++ SYV +T+G A L  S 
Sbjct: 227 RDNKAR-KKFIT----IFIYGVVTIIVASIIQRWFPYNKNLWSSSYVLITAGIAYLTISM 281

Query: 371 IYALVDC 377
           ++ ++D 
Sbjct: 282 LFLVIDV 288


>gi|296222155|ref|XP_002757067.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase, partial
           [Callithrix jacchus]
          Length = 516

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 85/283 (30%), Positives = 139/283 (49%), Gaps = 42/283 (14%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G ++ L++ 
Sbjct: 267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGTSVFLSMT 326

Query: 94  RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GVLQR+ 
Sbjct: 327 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 381

Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
           ++Y +V+++E +F K V +   S       R     W  WL+   +  ++L L +   VP
Sbjct: 382 VTYFVVAVLELLFAKPVPEHCTSERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP 441

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +      D+GK  N T G          A GYIDR +LG +H+Y HP    S A
Sbjct: 442 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 487

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
               +                 ++PEG+L +++SIL   +GV 
Sbjct: 488 VLYHT--------------EVAYDPEGILGTINSILMAFLGVQ 516


>gi|256425421|ref|YP_003126074.1| hypothetical protein Cpin_6469 [Chitinophaga pinensis DSM 2588]
 gi|256040329|gb|ACU63873.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
          Length = 358

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 108/352 (30%), Positives = 154/352 (43%), Gaps = 80/352 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           QRL SLD FRGL VA MILV++ G     +P + H+ WNGC   D V PFFLF+VGV++ 
Sbjct: 3   QRLLSLDFFRGLTVAAMILVNNPGSWSYVYPPLEHSKWNGCTPTDLVFPFFLFMVGVSVT 62

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            AL     +AD           +    IL   G +     L    D   +R+ GVLQRI+
Sbjct: 63  FALSS--RKADVSGHTSLIIHIIRRAAILFAIGLAF---RLIPSFDFHNLRILGVLQRIS 117

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV--LVVYLALLYGTYVPDW 207
           + +L++SL+ +       K  +  R          WL   C+  LV+Y  L+    VP  
Sbjct: 118 IVFLVISLLYL-------KTGTKPRI---------WL---CISFLVIYWLLMTVVPVP-- 156

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                      YG          A L    N   +IDR VLG  H++             
Sbjct: 157 ----------GYGP---------ANLEAETNLAAWIDRTVLGEQHLW------------- 184

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV- 326
                    K A +W     +PEGLLS++ +I + ++G+  G  +       A    W+ 
Sbjct: 185 ---------KQARTW-----DPEGLLSTLPAISTGLLGIMTGDWLRRKDVADADKVSWLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
             GF  +I GL   +    P+NK L+T S+V  T G AA+  +  Y L+D  
Sbjct: 231 AAGFLSVIAGLI--WDGFFPINKSLWTSSFVLYTGGLAAMGLALSYWLIDVQ 280


>gi|333031144|ref|ZP_08459205.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
 gi|332741741|gb|EGJ72223.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
          Length = 363

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 91/357 (25%), Positives = 162/357 (45%), Gaps = 80/357 (22%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           ++RL SLD+ RG+ VA MILV++ G    ++  + HA W+G N AD V P F+F++G++ 
Sbjct: 8   SKRLLSLDVLRGITVAGMILVNNTGSCGYNYTALRHASWDGLNFADLVFPMFMFMMGIST 67

Query: 89  ALALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L++   ++  A  K+  RT  L+  G+ ++   +     +  G+++  +RL GV+QR
Sbjct: 68  YISLRKYENNKKTAFYKIFKRTSLLIIIGLFMECIITW----IEVGLNLSTLRLMGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           + L Y                    G  ++  LY  H      +L + L++L G ++   
Sbjct: 124 LGLCY--------------------GITALLSLYVPH----KYLLKIALSVLLGYFIIQI 159

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
             +  +K +                     N +G +DR VLG+NH+Y            Q
Sbjct: 160 VGSGFDKSAE--------------------NVIGVVDRSVLGVNHIY-----------LQ 188

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
              F                +PEG+LS++ +I   +IG   G  I+  + H  ++     
Sbjct: 189 GKQF---------------VDPEGVLSTLPAIAQVMIGFFCGRKILEKREHKQQMLILYR 233

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           +G   L  G    F+   P+NK+L++ ++V VTSG A +  S +   +D      W+
Sbjct: 234 LGSLFLFVGFV--FSYVCPINKRLWSPTFVLVTSGVACMALSLLIDTLDIKQKKHWS 288


>gi|300865789|ref|ZP_07110543.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300336202|emb|CBN55698.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 376

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 160/358 (44%), Gaps = 85/358 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+A+A MILV++ G     +P + HA WNGC   D + PFFLF VG A++
Sbjct: 1   MRLTSLDVFRGIAIASMILVNNPGSWDYVYPPLDHAEWNGCTPTDLIFPFFLFAVGAAMS 60

Query: 90  LALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGVDVR---MIRLCG 143
            +L +  +    +  V +R L+   LLF   LL   FS   D L  G  +     IR+ G
Sbjct: 61  FSLSKYTEENPPISTVYWRILRRATLLFLLGLLLNSFSIFLDVLLNGSPIENFGKIRILG 120

Query: 144 VLQRIALSYLL--VSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
           VLQRI+L+Y L  ++++ + +++++                   ++AA +L+ Y   L  
Sbjct: 121 VLQRISLAYFLAAIAILNLSSRNLR-------------------ILAATLLLGYWGALTL 161

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
             VP             YG            L P  N   YIDR +LG  H+Y       
Sbjct: 162 IPVP------------GYGANL---------LTPEGNLGAYIDRLILGTQHLYR------ 194

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                                    ++PE LL ++ +I +T++  +F    +  +    R
Sbjct: 195 ----------------------QGQYDPESLLGTLPAI-ATVLAGYFTTQWLRVQPIKTR 231

Query: 322 LKQW--VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
              W  V  G A L  G    F    P+NKQL+T SYV +T+G A L+ +  Y  ++ 
Sbjct: 232 -TTWNLVIFGLASLTIGQLWGF--WFPINKQLWTSSYVLLTAGWAILLLAICYETIEV 286


>gi|149280688|ref|ZP_01886799.1| hypothetical protein PBAL39_24475 [Pedobacter sp. BAL39]
 gi|149228553|gb|EDM33961.1| hypothetical protein PBAL39_24475 [Pedobacter sp. BAL39]
          Length = 385

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 106/357 (29%), Positives = 153/357 (42%), Gaps = 74/357 (20%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIA 89
           RL SLD FRG  VA MILV++ G DW  I     HA W+GC   D V PFFLFIVGV+IA
Sbjct: 13  RLLSLDFFRGATVAAMILVNNPG-DWGHIYAPLEHADWHGCTPTDLVFPFFLFIVGVSIA 71

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV-RMIRLCGVLQRI 148
            A+        +  K I + LK       L    S  P+  +  V+  + +R+ GVLQRI
Sbjct: 72  YAMGSKKTDPSSHGKTILKALKRTLILFGLGLFLSLFPNVFSNPVEAFQQVRIPGVLQRI 131

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           A+ + + S+  IF K  +         +IFR           +L  Y A++    VP   
Sbjct: 132 AVVFFICSI--IFLKSSER--------TIFR-------TMVIILAAYWAIMTFIPVPGTG 174

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
           F  + K++                     N   +IDR V    H+     W+ SK     
Sbjct: 175 FPNLEKET---------------------NLGAWIDRGVFTEAHL-----WKSSKT---- 204

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV-T 327
                             ++PEGLLS++ +I + + G+  G  +       A    W+ +
Sbjct: 205 ------------------WDPEGLLSTLPAIATGLFGILVGSYLKRKDIEPATKIAWLFS 246

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
            G A    GL        P+NKQL+T S+V  T G A  + S  Y ++D   +  +T
Sbjct: 247 TGAAATALGLLWDL--QFPINKQLWTSSFVLYTGGLATTILSLSYWIIDVQQYNRFT 301


>gi|344203119|ref|YP_004788262.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
 gi|343955041|gb|AEM70840.1| hypothetical protein Murru_1800 [Muricauda ruestringensis DSM
           13258]
          Length = 375

 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 156/367 (42%), Gaps = 82/367 (22%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS---HAPWNGCNLADFVMPF 79
           +Q   S LK + L SLD+FRGL VALMI+V+  G      S   HA WNG  L D V P 
Sbjct: 6   NQPNMSKLKNRYL-SLDVFRGLDVALMIIVNSPGNGSTTFSPLLHADWNGFTLTDLVFPT 64

Query: 80  FLFIVGVAIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FLF+VG +++ ++K+          KKV+ RT  +   G L+         +L       
Sbjct: 65  FLFVVGNSMSFSMKKYESMGKPAFFKKVLKRTAIIFLLGFLMYWYPFFDDGQLK---PFS 121

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
             R+ GVLQRIAL Y+  S++  F K               +   W   ++A  LV Y  
Sbjct: 122 ETRVFGVLQRIALCYMFASIILHFVKT--------------KTAIW---LSALFLVGYHL 164

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +L G                               L    NAV  +D  ++G NHMYH  
Sbjct: 165 ILIG----------------------------FGDLTLTGNAVLKLDEWLIGANHMYHG- 195

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
                         EG             F+PEGLLS++ +I++ IIG   G  I +   
Sbjct: 196 --------------EG-----------IAFDPEGLLSTLPAIVNVIIGYLAGRFIQNNGQ 230

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           +   + + +  GFAL+  GL       +P+NK+L+T S+V +T G      + +  ++D 
Sbjct: 231 NFETVAKLMMFGFALVFAGLAWDLV--LPINKKLWTSSFVLLTCGIDLFAIAILIYILDM 288

Query: 378 NIFGPWT 384
                W+
Sbjct: 289 KKAKSWS 295


>gi|428300562|ref|YP_007138868.1| hypothetical protein Cal6303_3980 [Calothrix sp. PCC 6303]
 gi|428237106|gb|AFZ02896.1| hypothetical protein Cal6303_3980 [Calothrix sp. PCC 6303]
          Length = 402

 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 164/385 (42%), Gaps = 102/385 (26%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG----GDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            RL SLD+FRG  +A MILV+         +P + HA W+GC LAD V PFFLFIVGVA+
Sbjct: 1   MRLTSLDVFRGATIAGMILVNMVSLAEPNVYPALLHADWHGCTLADLVFPFFLFIVGVAM 60

Query: 89  ALALKRIPD-------RADAVK--------------------KVIFRTLKLLFWGILLQG 121
           + +  +  D         DA+                     K IFR   +LF   L   
Sbjct: 61  SFSFAKYTDVIPKVEKEKDAIGALQQFLAKESSAAGGAKPPYKKIFRRGAILFALGLFLN 120

Query: 122 GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
            F ++ + L Y  D   +R+ GVLQRIAL+YL  SL+ +    +  K Q           
Sbjct: 121 LFWNSKN-LPY-FDFSTLRIMGVLQRIALTYLFASLIVL---KLPKKAQ----------- 164

Query: 182 CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
              W++A  +LV Y  L+    +P++    I   + ++                      
Sbjct: 165 ---WIVAGVLLVGYWLLMMYVPIPEYGAGEIGTRTGNFA--------------------A 201

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
           YIDR ++   H+Y    +                            +PEGL S++ +I+S
Sbjct: 202 YIDRFIIPKAHLYKGDGFNNFG------------------------DPEGLFSTIPAIVS 237

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFG---LTLHFTNAIPLNKQLYTLSYVC 358
            ++G +F    I ++     +K   ++G AL   G   +   F    P+NK+L+T SYV 
Sbjct: 238 -VLGGYFSGQWIRSQ----PVKTRTSLGLALAGIGCLIIAWLFALDFPINKKLWTSSYVI 292

Query: 359 VTSGAAALVFSAIYALVDCNIFGPW 383
            T+G A L+ +  Y L++      W
Sbjct: 293 FTTGWALLLLAGCYELIEVQKIRSW 317


>gi|365877201|ref|ZP_09416706.1| hypothetical protein EAAG1_13068 [Elizabethkingia anophelis Ag1]
 gi|442587874|ref|ZP_21006688.1| hypothetical protein D505_08600 [Elizabethkingia anophelis R26]
 gi|365755061|gb|EHM96995.1| hypothetical protein EAAG1_13068 [Elizabethkingia anophelis Ag1]
 gi|442562373|gb|ELR79594.1| hypothetical protein D505_08600 [Elizabethkingia anophelis R26]
          Length = 400

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 103/373 (27%), Positives = 160/373 (42%), Gaps = 99/373 (26%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +K+ R  SLD+FRG  VALMILV++ G     +P + HA W+GC   D V PFFLF VG 
Sbjct: 1   MKSARYYSLDVFRGATVALMILVNNPGTWSAIYPPLEHAKWHGCTPTDLVFPFFLFAVGN 60

Query: 87  AIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQG--GFSHAPDELTY----GVDVRM 138
           A+   + +      +V  KKVI RTL +   G+ L     F    D L++      D   
Sbjct: 61  AMTFVIPKFQQHNSSVFWKKVIKRTLLIFGIGLFLNWCPFFQWDHDSLSFISWESSDENG 120

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GVLQRIA++Y   S++  + K+              ++  W   ++  +LV+Y   
Sbjct: 121 VRIMGVLQRIAIAYFFASVIAYYFKE--------------KMVLW---ISGALLVIY--- 160

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
                   W  T+    +  Y         +   +  P      ID  +LGI H Y    
Sbjct: 161 --------WLLTLFLGGTDPY--------SLEGFIGVP------IDHSILGIAHEYKG-- 196

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
                        EG            PF+PEGL S++ +I   + G   G+  I  KG+
Sbjct: 197 -------------EG-----------VPFDPEGLFSTIPAISQVLFGYLIGNY-IQKKGN 231

Query: 319 LARLKQWV----------TMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVTSGA 363
           +    QW           +M   L I G+   F + +     P NK++++ SY  +T+G 
Sbjct: 232 I----QWFGKSLKENSIYSMLSGLFILGIIALFISYVWQLDFPYNKKIWSSSYTLLTTGL 287

Query: 364 AALVFSAIYALVD 376
           A +V   +   ++
Sbjct: 288 AIMVLGVLIWFIE 300


>gi|255534024|ref|YP_003094396.1| hypothetical protein Phep_4143 [Pedobacter heparinus DSM 2366]
 gi|255347008|gb|ACU06334.1| conserved hypothetical protein [Pedobacter heparinus DSM 2366]
          Length = 384

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 107/361 (29%), Positives = 159/361 (44%), Gaps = 76/361 (21%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           K  RL SLD FRG  VA MILV++ G DW  I     HA WNGC   D + PFFLFIVGV
Sbjct: 9   KPVRLLSLDFFRGATVAAMILVNNPG-DWGHIYAPLEHAEWNGCTPTDLIFPFFLFIVGV 67

Query: 87  AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV-RMIRLCGVL 145
           +IA A+        +  K I + LK       L    S  P   T  ++  + +R+ GVL
Sbjct: 68  SIAYAMGGKKADPSSHGKTIVKALKRASILFGLGLFLSLFPKVFTAPLEAFQQVRIPGVL 127

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIA+ +L+ ++  IF K+ +         +IF++          +L VY AL+  T++P
Sbjct: 128 QRIAVVFLISAI--IFLKNTEK--------NIFKILL-------AILAVYWALM--TFIP 168

Query: 206 DWQFTIINKDSADYGKVFNVTCGV-RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                                 GV  A L    N   ++DR +L   H++          
Sbjct: 169 --------------------VPGVGYANLEKETNLGAWLDRSILTEAHLW---------- 198

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                       K A +W     +PEG+LS++ +I + + G+  G  +       A    
Sbjct: 199 ------------KSAKTW-----DPEGILSTLPAIATGLFGILVGVYLKRKDVDAATKIS 241

Query: 325 WV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
           W+   G A +  GL        P+NK L+T S+V  T G A ++ S  Y ++D   +  +
Sbjct: 242 WLFCTGCAAVALGLLWDLQ--FPINKSLWTSSFVLYTGGLATMILSLCYWIIDVQQYNRF 299

Query: 384 T 384
           T
Sbjct: 300 T 300


>gi|255038072|ref|YP_003088693.1| hypothetical protein Dfer_4326 [Dyadobacter fermentans DSM 18053]
 gi|254950828|gb|ACT95528.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 368

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 162/361 (44%), Gaps = 88/361 (24%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFL 81
            E   + + RL SLD  RG  +A MI+V+  G +   +P + H+ WNG    D + P FL
Sbjct: 1   MENPSVPSSRLLSLDAMRGFTIAAMIMVNFPGHEDYVFPTLRHSKWNGLTFTDLIAPTFL 60

Query: 82  FIVGVAIALAL--KRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FIVGV+I LA   KR+ +  ++   +K++ R+LK+   G+ L    +  PD      +  
Sbjct: 61  FIVGVSITLAYSKKRLSNAPKSGLYRKIVIRSLKIFAVGMFL----NMLPD-----FNFS 111

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
            +R  G L RIA+ +L+ +++ + T   Q    +VG                 +LV+Y  
Sbjct: 112 DLRYTGTLHRIAIVFLVCAILFLNTSWKQQLGIAVG-----------------ILVLYWL 154

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
            L G   P              GKV          L P  N   ++D++ L         
Sbjct: 155 ALTGIPTP------------GIGKVM---------LEPGVNLAAWVDQQYL--------- 184

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
                           P +    +W     +PEG+LS+  +I +TI G+  G +++    
Sbjct: 185 ----------------PGKMWQGNW-----DPEGILSTFPAIATTITGILAGRLMLLPFS 223

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
              +    +T GFA    G   +F N I P+N+ L+T S+V VTSG A+++F A+Y L+D
Sbjct: 224 PNEKSNFLLTAGFATAALG---YFWNLIFPVNENLWTSSFVLVTSGFASMLFGALYFLID 280

Query: 377 C 377
            
Sbjct: 281 I 281


>gi|345517324|ref|ZP_08796801.1| hypothetical protein BSFG_00542 [Bacteroides sp. 4_3_47FAA]
 gi|345457717|gb|EET14395.2| hypothetical protein BSFG_00542 [Bacteroides sp. 4_3_47FAA]
          Length = 365

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 156/367 (42%), Gaps = 90/367 (24%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADF 75
           E D    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D 
Sbjct: 5   ELDTETAQQALPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSAWNGLTPCDL 63

Query: 76  VMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PDE 129
           V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D 
Sbjct: 64  VFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGDF 120

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
           L +      +R+ GVLQRIAL Y ++S   +F                            
Sbjct: 121 LPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------- 151

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
                        ++P   F ++   +        + C          N +  IDR++ G
Sbjct: 152 ------------KFIPTLTFILLVSYTV-------ILCMGNGYTCDESNILSIIDRQLFG 192

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             H+Y                              +P +PEG +S++S+I  T IG   G
Sbjct: 193 EAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSCG 224

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
             II +     ++ +    GF L+  G  L   +A+PLNK++++ ++V VT GAA++  +
Sbjct: 225 KWIIQSHQTENKVLRLFLTGFILMSIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSLA 282

Query: 370 AIYALVD 376
            +   +D
Sbjct: 283 TLMYYID 289


>gi|347738959|ref|ZP_08870332.1| hypothetical protein AZA_89781 [Azospirillum amazonense Y2]
 gi|346917867|gb|EGY00078.1| hypothetical protein AZA_89781 [Azospirillum amazonense Y2]
          Length = 400

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 106/339 (31%), Positives = 163/339 (48%), Gaps = 73/339 (21%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGV 86
            + RL SLD+ RGLAVA MILV   G DW +    + HAPW+G  LAD V P FLF VG+
Sbjct: 19  TSPRLPSLDVLRGLAVAGMILVVSPG-DWSKAYTPLKHAPWDGWTLADMVFPTFLFSVGL 77

Query: 87  AIALALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           AIAL+  +I  +R  A  K+  R L L+  G++L        + L Y  D+  +RL G+L
Sbjct: 78  AIALSFTKIAQNRRAAGVKIARRALALIVLGLVL--------NALPY-FDLAHLRLPGIL 128

Query: 146 QRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           QRIAL Y+L +L+ + T +   D   +V + ++        +  A VL+ Y A+L    V
Sbjct: 129 QRIALCYVLATLLCLVTARTGADGSPTVNQRALL-------IAMAVVLLGYCAVLAWVPV 181

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAK-LNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           P                      G+ A  L+P  +   +IDR V  + H++         
Sbjct: 182 P----------------------GIGAGHLDPGGSLAAWIDRGVFTVPHLW--------- 210

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                     P   DA       ++PEGLLS++ + ++ ++GV  G  +  ++  L  L 
Sbjct: 211 ----------PYGTDAAG--AVVYDPEGLLSTLPATVNVLVGVLAGTALKASRSRLNLLV 258

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
             V     L++ GL L     + +NK+++T S+   +SG
Sbjct: 259 AAVM----LMMAGLALD--PVLVINKRIWTSSFALFSSG 291


>gi|54297581|ref|YP_123950.1| hypothetical protein lpp1632 [Legionella pneumophila str. Paris]
 gi|53751366|emb|CAH12784.1| hypothetical protein lpp1632 [Legionella pneumophila str. Paris]
          Length = 372

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 97/352 (27%), Positives = 151/352 (42%), Gaps = 82/352 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI+V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIIVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R         +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+     +H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG +S+ +SI +T+ GV  G ++I+      +   
Sbjct: 192 ---------------------TYDPEGFVSTFTSIATTLSGVLAGSLLINPCNQFKKFYL 230

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
              +G   L+ G    +  + P+NK L+T SYV  TSG A LVF+  Y L+D
Sbjct: 231 LAGVGMLFLLLGWL--WNMSFPINKNLWTSSYVLWTSGLALLVFAFCYLLID 280


>gi|294777712|ref|ZP_06743163.1| putative membrane protein [Bacteroides vulgatus PC510]
 gi|319640295|ref|ZP_07995020.1| hypothetical protein HMPREF9011_00617 [Bacteroides sp. 3_1_40A]
 gi|294448780|gb|EFG17329.1| putative membrane protein [Bacteroides vulgatus PC510]
 gi|317388070|gb|EFV68924.1| hypothetical protein HMPREF9011_00617 [Bacteroides sp. 3_1_40A]
          Length = 372

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 156/367 (42%), Gaps = 90/367 (24%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADF 75
           E D    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D 
Sbjct: 5   ELDTETAQQALPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSAWNGLTPCDL 63

Query: 76  VMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PDE 129
           V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D 
Sbjct: 64  VFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGDF 120

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
           L +      +R+ GVLQRIAL Y ++S   +F                            
Sbjct: 121 LPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------- 151

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
                        ++P   F ++   +        + C          N +  IDR++ G
Sbjct: 152 ------------KFIPTLTFILLVSYTV-------ILCMGNGYTCDESNILSIIDRQLFG 192

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             H+Y                              +P +PEG +S++S+I  T IG   G
Sbjct: 193 EAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSCG 224

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
             II +     ++ +    GF L+  G  L   +A+PLNK++++ ++V VT GAA++  +
Sbjct: 225 KWIIQSHQTENKVLRLFLTGFILMSIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSLA 282

Query: 370 AIYALVD 376
            +   +D
Sbjct: 283 TLMYYID 289


>gi|449664780|ref|XP_002169793.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Hydra magnipapillata]
          Length = 369

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 100/340 (29%), Positives = 162/340 (47%), Gaps = 54/340 (15%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL----KRIPDRADAVKK 104
           MI V++ GG +   SH+ WNG  +AD + P+F+FI+G +I +++    K++  R   V K
Sbjct: 1   MIFVNYGGGGYYFFSHSSWNGLTVADLLFPWFIFIMGSSIYISMHSLRKKLSKRKMTV-K 59

Query: 105 VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD 164
           +I+R+ KL                 L  G D+   RL GVLQR A+SY +V+LV ++   
Sbjct: 60  IIYRSFKL-----------LLLGLFLNNGFDLANWRLPGVLQRFAISYFVVALVFLWFDS 108

Query: 165 VQDKDQSVGRFSIFR--LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGK 221
             ++ ++    ++FR   + +  ++   +L +YL ++Y   VP             D+G 
Sbjct: 109 PNEESETNSWKNMFRDVWFPFQHIVMLLLLTIYLLIIYLLNVPGCPKGYFGPGGDGDHGA 168

Query: 222 VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
               T G          A GY+DR V G+NH+Y +P  +    C                
Sbjct: 169 YEKCTGG----------ASGYVDRTVFGLNHIYKNPTCKSLYNCFT-------------- 204

Query: 282 WCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK---GHLARLKQWVTM--GFALLIFG 336
                ++PEGLL ++ SIL T +G+     ++  K   GH+ R   W  +    A+ + G
Sbjct: 205 -----YDPEGLLGTIPSILLTYLGLQAARTLLFYKSKNGHIIRWFIWSVLLGALAVGLCG 259

Query: 337 LTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            TL+   AIP+NK L++LSY+      A L+    Y LVD
Sbjct: 260 GTLN-DGAIPINKNLWSLSYIFAMGSTAYLLLLICYVLVD 298


>gi|125981811|ref|XP_001354909.1| GA19944 [Drosophila pseudoobscura pseudoobscura]
 gi|54643221|gb|EAL31965.1| GA19944 [Drosophila pseudoobscura pseudoobscura]
          Length = 574

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/341 (28%), Positives = 160/341 (46%), Gaps = 48/341 (14%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P FL+I+GV I L
Sbjct: 181 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADLVFPSFLWIMGVCIPL 240

Query: 91  ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           ++K    R   +A    ++++R++KL   G+ L             G  +  +RL GVLQ
Sbjct: 241 SVKAQLSRGNSKARICLRILWRSIKLFAIGLCLNS---------MSGPSLEQLRLMGVLQ 291

Query: 147 RIALSYLLVSLVEIF-TKDVQDKDQSVGRFSIFR--LYCWHWLMAACVLVVYLALLYGTY 203
           R  +++L+V ++    ++  Q + Q     +I+   L+     +   ++  YL L +G  
Sbjct: 292 RFGIAFLVVGILHTLCSRREQVQPQRAWHRAIYDVCLFSGELAVLLALIAAYLGLTFGLP 351

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           VP      +                + A  N    A GY+D +VLG  H+Y HP    + 
Sbjct: 352 VPGCPRGYLGPGGKH---------DLAAHPNCIGGAAGYVDLQVLGNAHIYQHP----TA 398

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG-HVIIHTKGHLARL 322
               DS               + F+PEG+   + S++  ++G   G  +++HT       
Sbjct: 399 KYVYDS---------------SAFDPEGVFGCLLSVVQVLLGAFAGLTLLVHTTWQTRLR 443

Query: 323 KQWVTMGFALLIFGLTLHFT---NAIPLNKQLYTLSYVCVT 360
           +  +      L+ G    F+     IP+NK L++LS+V VT
Sbjct: 444 RWLLLSLLLGLVGGALCGFSREGGVIPVNKNLWSLSFVFVT 484


>gi|410664067|ref|YP_006916438.1| hypothetical protein M5M_07585 [Simiduia agarivorans SA1 = DSM
           21679]
 gi|409026424|gb|AFU98708.1| hypothetical protein M5M_07585 [Simiduia agarivorans SA1 = DSM
           21679]
          Length = 390

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 168/356 (47%), Gaps = 77/356 (21%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIA 89
           RL S+D+ RGLA+A M+LV++ G  W  +    +HA W+G    D + P FL++VG++I 
Sbjct: 15  RLMSVDVLRGLAIAAMVLVNNPG-SWSHVYAPLAHAEWHGWTPTDVIFPLFLYVVGLSIV 73

Query: 90  LALK----RIPDRAD---AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           LA K     +P R+    A K         LF+    +  FS   D+L   +DVR++   
Sbjct: 74  LAQKGETFALPGRSTWLRAAKLFGLGLFLALFYFPFAKPEFSWWRDQL---LDVRIL--- 127

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIAL YL    +    +  Q                  WL+    L V++ L Y  
Sbjct: 128 GVLQRIALVYLACCYLAWLCQKRQ------------------WLLWLATL-VFMWLAYAL 168

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
                  +I   D  D G+++      R +L    +   ++D+ +LG  H+Y+  A    
Sbjct: 169 -----MLSIPYAD--DTGEIY------RGQLVFGNHFSAWLDQLLLGREHLYYQTA---- 211

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                  PF               F+PEGLL+++ +I S ++GV  G + +   GH +RL
Sbjct: 212 ------QPFA--------------FDPEGLLTTLPAISSGLLGVLAG-LQLKAAGHSSRL 250

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           + W   G  +L+ G  LH     P+NK L+T S+V VT+G + L+ +++Y L D  
Sbjct: 251 EIWFAGGVLMLVAGQLLH--PVCPINKALWTPSFVLVTAGVSQLLLASLYWLCDVR 304


>gi|284040246|ref|YP_003390176.1| hypothetical protein Slin_5410 [Spirosoma linguale DSM 74]
 gi|283819539|gb|ADB41377.1| Protein of unknown function DUF2261, transmembrane [Spirosoma
           linguale DSM 74]
          Length = 385

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 110/369 (29%), Positives = 164/369 (44%), Gaps = 84/369 (22%)

Query: 16  ISEP--DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNG 69
           ++EP    SD   K  + T RL SLD FRGL VA MILV++ G DW  I     HAPW+G
Sbjct: 3   VNEPIQKASDYGLKP-VGTSRLLSLDFFRGLTVAAMILVNNPG-DWGHIYAPLEHAPWHG 60

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
               D + PFFLFIVGV+I  AL+    +   V K++ R++ L     LL    +  P  
Sbjct: 61  WTPTDLIFPFFLFIVGVSITFALEGGKSKKGVVGKIVKRSVTL----FLLGLFLNFFPK- 115

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
                D+ ++R+ GVLQRIA+ YL+ SL+ + T   Q            +LY     +  
Sbjct: 116 ----FDITLVRIPGVLQRIAVVYLVCSLIFLKTNSRQ------------QLY-----ILV 154

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
            VL+ Y  L+          T++      Y           A L P  N   + D  +L 
Sbjct: 155 IVLIGYWLLM----------TVVPVPGVGY-----------ANLEPATNLAAWFDYTILT 193

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             H+Y                      K A +W     +PEG+LS++ ++ + +IG+  G
Sbjct: 194 PAHVY----------------------KPAKTW-----DPEGVLSTLPAVGTGLIGMLVG 226

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
             +  ++  +A    W+     L   G  + +    P+NK L+T SYV +  G A L  +
Sbjct: 227 TWLRSSR-PVADKVAWLFATGCLATLG-GVMWDGFFPINKALWTSSYVLLAGGLAMLGLA 284

Query: 370 AIYALVDCN 378
             Y L+D  
Sbjct: 285 LCYWLIDVQ 293


>gi|345513910|ref|ZP_08793425.1| hypothetical protein BSEG_01940 [Bacteroides dorei 5_1_36/D4]
 gi|229435722|gb|EEO45799.1| hypothetical protein BSEG_01940 [Bacteroides dorei 5_1_36/D4]
          Length = 372

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 97/368 (26%), Positives = 157/368 (42%), Gaps = 90/368 (24%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
            E +    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D
Sbjct: 4   EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62

Query: 75  FVMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
            V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D
Sbjct: 63  LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            L +      +R+ GVLQRIAL Y ++S   +F                           
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
                         ++P   F ++   +        + C          N +  IDR++ 
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  H+Y                              +P +PEG +S++S+I  T IG  +
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSY 223

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  II +     ++ +    GF L+  G  L   +A+PLNK++++ ++V VT GAA++  
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSL 281

Query: 369 SAIYALVD 376
           + +   +D
Sbjct: 282 ATLMYYID 289


>gi|237711644|ref|ZP_04542125.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
 gi|423230937|ref|ZP_17217341.1| hypothetical protein HMPREF1063_03161 [Bacteroides dorei
           CL02T00C15]
 gi|423244648|ref|ZP_17225723.1| hypothetical protein HMPREF1064_01929 [Bacteroides dorei
           CL02T12C06]
 gi|229454339|gb|EEO60060.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
 gi|392630057|gb|EIY24059.1| hypothetical protein HMPREF1063_03161 [Bacteroides dorei
           CL02T00C15]
 gi|392641497|gb|EIY35273.1| hypothetical protein HMPREF1064_01929 [Bacteroides dorei
           CL02T12C06]
          Length = 372

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 97/368 (26%), Positives = 157/368 (42%), Gaps = 90/368 (24%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
            E +    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D
Sbjct: 4   EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62

Query: 75  FVMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
            V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D
Sbjct: 63  LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            L +      +R+ GVLQRIAL Y ++S   +F                           
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
                         ++P   F ++   +        + C          N +  IDR++ 
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  H+Y                              +P +PEG +S++S+I  T IG  +
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSY 223

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  II +     ++ +    GF L+  G  L   +A+PLNK++++ ++V VT GAA++  
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSL 281

Query: 369 SAIYALVD 376
           + +   +D
Sbjct: 282 ATLMYYID 289


>gi|255013110|ref|ZP_05285236.1| putative transmembrane protein [Bacteroides sp. 2_1_7]
 gi|256838332|ref|ZP_05543842.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|410102572|ref|ZP_11297498.1| hypothetical protein HMPREF0999_01270 [Parabacteroides sp. D25]
 gi|423333958|ref|ZP_17311739.1| hypothetical protein HMPREF1075_03390 [Parabacteroides distasonis
           CL03T12C09]
 gi|256739251|gb|EEU52575.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|409226793|gb|EKN19699.1| hypothetical protein HMPREF1075_03390 [Parabacteroides distasonis
           CL03T12C09]
 gi|409238644|gb|EKN31435.1| hypothetical protein HMPREF0999_01270 [Parabacteroides sp. D25]
          Length = 368

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 100/361 (27%), Positives = 167/361 (46%), Gaps = 78/361 (21%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ RL SLD+ RG+ +A MILV++ G     +  + HA WNG    D V PFF+FI+GV+
Sbjct: 4   QSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIMGVS 63

Query: 88  IALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHA-PDELTYGVDVRMIRLCGV 144
           +  +L++   +   ++V KV+ RT+ +   G+ L   F H   +  T   D + +R+ GV
Sbjct: 64  MFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN-LFGHVCYNGFT---DFQNLRILGV 119

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           +QR+AL+Y   SL+ +               +I   Y     +AA +L+ Y ALL  T+ 
Sbjct: 120 MQRLALAYGFGSLIGL---------------AINHKYILQ--VAAGILIFYWALLGFTHS 162

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
            +     +++DS                       +  +DR + G +HMYH         
Sbjct: 163 ME-----MSEDS----------------------IIAIVDRTLFGTSHMYH--------- 186

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH-TKGHLARLK 323
                        D        F+PEGLLS + SI   ++G + G VI    K +   ++
Sbjct: 187 ------------DDMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVIQDCKKNNELIIR 234

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
                G  +L  G  L +    P+NK++++ ++V VT G A+L  + +  ++D N    W
Sbjct: 235 NIFIFGTIILFAGFLLSY--GCPINKKIWSSTFVLVTCGFASLFLALLIWIIDINGKKKW 292

Query: 384 T 384
           T
Sbjct: 293 T 293


>gi|150009610|ref|YP_001304353.1| transmembrane protein [Parabacteroides distasonis ATCC 8503]
 gi|262383102|ref|ZP_06076239.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|298374002|ref|ZP_06983960.1| membrane protein [Bacteroides sp. 3_1_19]
 gi|149938034|gb|ABR44731.1| putative transmembrane protein [Parabacteroides distasonis ATCC
           8503]
 gi|262295980|gb|EEY83911.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|298268370|gb|EFI10025.1| membrane protein [Bacteroides sp. 3_1_19]
          Length = 368

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 100/361 (27%), Positives = 167/361 (46%), Gaps = 78/361 (21%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ RL SLD+ RG+ +A MILV++ G     +  + HA WNG    D V PFF+FI+GV+
Sbjct: 4   QSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIMGVS 63

Query: 88  IALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHA-PDELTYGVDVRMIRLCGV 144
           +  +L++   +   ++V KV+ RT+ +   G+ L   F H   +  T   D + +R+ GV
Sbjct: 64  MFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN-LFGHVCYNGFT---DFQNLRILGV 119

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           +QR+AL+Y   SL+ +               +I   Y     +AA +L+ Y ALL  T+ 
Sbjct: 120 MQRLALAYGFGSLIGL---------------AINHKYILQ--VAAGILIFYWALLGFTHS 162

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
            +     +++DS                       +  +DR + G +HMYH         
Sbjct: 163 ME-----MSEDS----------------------IIAIVDRTLFGTSHMYH--------- 186

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH-TKGHLARLK 323
                        D        F+PEGLLS + SI   ++G + G VI    K +   ++
Sbjct: 187 ------------DDMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVIQDCKKNNELIIR 234

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
                G  +L  G  L +    P+NK++++ ++V VT G A+L  + +  ++D N    W
Sbjct: 235 NIFIFGTIILFAGFLLSY--GCPINKKIWSSTFVLVTCGFASLFLALLIWIIDINGKKKW 292

Query: 384 T 384
           T
Sbjct: 293 T 293


>gi|332664355|ref|YP_004447143.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332333169|gb|AEE50270.1| Protein of unknown function DUF2261, transmembrane
           [Haliscomenobacter hydrossis DSM 1100]
          Length = 438

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 107/397 (26%), Positives = 166/397 (41%), Gaps = 93/397 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVA 87
           + RL SLD+ RGL +A MILV++ G DW  +     HA W+GC   D+V PFFLF+VGVA
Sbjct: 4   SNRLLSLDVMRGLTIAGMILVNNPG-DWGNVYGPLLHADWHGCTPTDWVFPFFLFMVGVA 62

Query: 88  IALALKRIPDRADAV----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           I LAL +  D  + +    +K+I R+L ++  G+ L    +H     T       +    
Sbjct: 63  IPLALGKRKDEGEDLRKIYRKIISRSLIIIGLGLFLT---AHPTFYFTDKTSPWYV---- 115

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH----------WLMAACVLV 193
               + L  +  ++V +FT++V ++ Q       F+   W           W   AC++V
Sbjct: 116 ----VHLIIMATAMVAVFTREVLNQKQ-------FQTETWQQRRKWVSYLAWSAIACMVV 164

Query: 194 ----------------------VYLA--LLYGTYVPDWQFTIINKDSADYGKVFN---VT 226
                                 VYLA   L+    P  Q          Y  +     V 
Sbjct: 165 LGIFYYDFSHMRFPGVLQRIGLVYLACGFLFLKASPRMQLLTGVGLLLLYWGLMTLVPVP 224

Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
            G+   L    N   ++DR +   NH+     W   K                       
Sbjct: 225 GGIAPNLEAETNLGAWLDRAIFSTNHL-----WAAVKT---------------------- 257

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           ++PEGLLS++ +I + I G+  G  +   K    ++   + +G   L+F L L +    P
Sbjct: 258 WDPEGLLSTIPAIGTGIAGMLAGEWVRSEKSDYEKVSGLLAVG--ALLFALGLIWNEFFP 315

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
           LNK+++T SYV   +G + L    IY LVD   +  W
Sbjct: 316 LNKKIWTSSYVVYMAGISLLFLGTIYWLVDIKGWKGW 352


>gi|262406057|ref|ZP_06082607.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294648122|ref|ZP_06725665.1| conserved domain protein [Bacteroides ovatus SD CC 2a]
 gi|294806856|ref|ZP_06765681.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b]
 gi|345510562|ref|ZP_08790129.1| hypothetical protein BSAG_00775 [Bacteroides sp. D1]
 gi|229443274|gb|EEO49065.1| hypothetical protein BSAG_00775 [Bacteroides sp. D1]
 gi|262356932|gb|EEZ06022.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292636506|gb|EFF54981.1| conserved domain protein [Bacteroides ovatus SD CC 2a]
 gi|294445885|gb|EFG14527.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 361

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 98/350 (28%), Positives = 162/350 (46%), Gaps = 80/350 (22%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  A D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +     +                    +A  +LVVY    L+G     
Sbjct: 122 LGICYGITALLAVTIPHKKFMP-----------------LAIILLVVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + S D                   N VG +D  +LG NHMY            
Sbjct: 163 -------EKSVD-------------------NIVGIVDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            +G  LL  G  L +  A PLNK+L++ S+V +T G A L  + +  ++D
Sbjct: 231 LIGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCGIATLSLALLLYIID 278


>gi|225012704|ref|ZP_03703139.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
 gi|225003237|gb|EEG41212.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
          Length = 366

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 103/356 (28%), Positives = 149/356 (41%), Gaps = 87/356 (24%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGV 86
           K QRL +LDI RGL + LMI+V+  G  W E+     HA WNG    D+V P FLFIVGV
Sbjct: 4   KNQRLLALDILRGLTIILMIVVNDPG-SWSEVYAPFLHAEWNGLTPTDYVFPTFLFIVGV 62

Query: 87  AIALALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           +I L+L +  +    R+   KKV++R LK+   GI L          L    +   IR  
Sbjct: 63  SIVLSLSKQLEAGKTRSQIAKKVLWRALKIYLVGIFLW---------LWPSFNFEGIRWV 113

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVL RIAL +L   L+ ++T         +G      +   +W+M A V           
Sbjct: 114 GVLPRIALVFLACGLIFLYTTKKTQWYLGIG------ILLGYWIMMAYV----------- 156

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
                                            P   +G+ D  V         P   ++
Sbjct: 157 ---------------------------------PVPGIGFPDLSV---------P--EKN 172

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
            A   DS F  P R    +W     +PEG LS++ +I+S +IG+  G+V++  +    RL
Sbjct: 173 WAHYLDS-FLIPGRLWKYTW-----DPEGFLSTLPAIVSGLIGMWAGYVLMKKEELKTRL 226

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
            Q   +GF LL  G  + +    P NK L++ S+     G   L  +A     D  
Sbjct: 227 NQLFFIGFILLFLGDAMQWV--FPFNKNLWSSSFTLFMGGIGMLSLAAFSYYFDVR 280


>gi|299140549|ref|ZP_07033687.1| membrane protein [Prevotella oris C735]
 gi|298577515|gb|EFI49383.1| membrane protein [Prevotella oris C735]
          Length = 359

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 96/352 (27%), Positives = 158/352 (44%), Gaps = 82/352 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ +RL SLD+ RG  V LMILV++  G   +  + H+ WNG    D V PFFLFI+G++
Sbjct: 1   MEKKRLLSLDVLRGATVCLMILVNNGAGKHIYATLQHSKWNGMTPCDLVFPFFLFIMGIS 60

Query: 88  IALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             L+L++      R  A  K++ RT+ L F G+ +   F  A       +D+  +R+  V
Sbjct: 61  TYLSLEKTNFTWSRQVAF-KIVKRTVLLFFIGLFIN-WFDMAISG--NALDLSHLRIWAV 116

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           +QRIA+ Y  V                    SIF L C H      ++++  A       
Sbjct: 117 MQRIAICYFAV--------------------SIFALCCNHRHTIPAIVILLAA------- 149

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
             +   +I  +   Y              +   N +  ID ++ GI H+YH         
Sbjct: 150 --YSLLLIWGNGYAY--------------DSQQNILAQIDIRLFGIEHLYH--------- 184

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                              ++P +PEG  SS+S+I  T+IG + G  +   K    ++ +
Sbjct: 185 -------------------NSPVDPEGTGSSLSAIAHTLIGFYCGKRMSDAKSTEEKVLR 225

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ++  G  L+I G  + F   +PLNK++++ SYVC+T G AA+    +   +D
Sbjct: 226 FLITGGFLVIIGYIVSF--GLPLNKRIWSPSYVCMTCGLAAVTQGLLMYCID 275


>gi|330792857|ref|XP_003284503.1| hypothetical protein DICPUDRAFT_18260 [Dictyostelium purpureum]
 gi|325085533|gb|EGC38938.1| hypothetical protein DICPUDRAFT_18260 [Dictyostelium purpureum]
          Length = 373

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 95/352 (26%), Positives = 149/352 (42%), Gaps = 87/352 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R++SLD+ RG+ +  MILVD+ GG    WP +    WNG + AD + P FLFI G ++A
Sbjct: 2   KRMSSLDVARGITIFGMILVDNQGGPDVIWP-LKETEWNGLSTADLIFPSFLFICGFSVA 60

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
           LALK   +       +I RT  L F    L        + + +       R+ GVLQRIA
Sbjct: 61  LALKTAKNTRSTWYNIIRRTFLLFFIQCFL--------NLMAHHFVFSSFRVMGVLQRIA 112

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           L Y L                S   F  F ++     +     V Y++++Y   VP    
Sbjct: 113 LCYFL----------------SCVSFLCFPVFLQRLFLLGTT-VTYISVMYALPVPG--- 152

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
                            CG +  L P CNA  Y+D KV G N ++               
Sbjct: 153 -----------------CG-KGVLTPTCNAGAYLDFKVFGPNMIH--------------- 179

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-----HTKGHLARLKQ 324
                           P +PEGLLS++S+ ++T +G+ FG V       +   ++  + +
Sbjct: 180 ----------------PNDPEGLLSTLSAFITTWMGLEFGRVFTTYYRKYDYSNVDLIVR 223

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           W+ M    ++  + L  T  +P NK +++ S+   T G    + +  Y L+D
Sbjct: 224 WIVMIALFIVPAIGLGAT-VMPFNKLIWSFSFALFTVGTGGCLITVAYILID 274


>gi|113475212|ref|YP_721273.1| hypothetical protein Tery_1515 [Trichodesmium erythraeum IMS101]
 gi|110166260|gb|ABG50800.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 366

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 99/356 (27%), Positives = 154/356 (43%), Gaps = 82/356 (23%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD+FRG+ +A MI+V++ G     +P + HA W+GC   D + PFFLFI+GVA+  
Sbjct: 2   RLKSLDVFRGITIASMIIVNNPGSWNHVYPPLLHAKWHGCTPTDLIFPFFLFIMGVAMTF 61

Query: 91  ALKRIPDRADAVKKV---IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           +L +  D+   +  +   IFR   ++F   LL  GF +         ++  IR+ GVLQR
Sbjct: 62  SLSKYTDKNQPIPHIYQRIFRRCLIIFLFGLLLNGFPN--------YNLATIRVMGVLQR 113

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           I+L YLL ++             ++   S  +LY     +A  +L+ Y   +    VP  
Sbjct: 114 ISLVYLLAAI-------------AILNLSRKQLYG----LATTLLIGYWIAMQLIPVP-- 154

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                      YG            L+P  N   YIDR +L   H+              
Sbjct: 155 ----------GYG---------LGNLSPEGNFAAYIDRLILTQQHL-------------- 181

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                         W    ++PEGL S++ +I++ +IG   G  + H   +       V 
Sbjct: 182 --------------WAGKQYDPEGLFSTLPAIVTVLIGYLTGEWLKHQSTNSRTTLNMVI 227

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
            G + L+ G         P+NK L+T SYV  T+G A L  +  Y  ++   +  W
Sbjct: 228 SGLSCLVVGYLWGL--FFPINKALWTSSYVIFTAGWALLFLAICYQAMEVRGWHKW 281


>gi|158337501|ref|YP_001518676.1| hypothetical protein AM1_4380 [Acaryochloris marina MBIC11017]
 gi|158307742|gb|ABW29359.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 383

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 96/360 (26%), Positives = 168/360 (46%), Gaps = 86/360 (23%)

Query: 38  LDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKR 94
           LD+FRG+A+A M+LV+ +G     +P++ HA W+G  LAD V PFFLF++G ++A ++ R
Sbjct: 13  LDVFRGIAIAGMLLVNKSGLVKDAYPQLQHADWHGWTLADLVFPFFLFVLGASMAFSMAR 72

Query: 95  -----IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
                   +     K++ R++ L   G+ L G +S+         ++  +R+ G+LQRI+
Sbjct: 73  HTASLTQPKRRVYLKILRRSVVLFGLGLFLNGFWSY---------NLSTLRVMGILQRIS 123

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY-LALLYGTYVPDWQ 208
           L+YL+ +LV +    +  K Q              W M   +LV Y LAL   +++P  +
Sbjct: 124 LTYLVSALVIL---KLPRKSQ--------------WGMTGLLLVGYWLAL---SFIPVPE 163

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
           F   N                   L    N   Y+DR ++G +H+Y    +         
Sbjct: 164 FGAGN-------------------LTRTGNFGAYVDRLIIGSSHLYVGDQF--------- 195

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK----Q 324
                          ++  +PEGL S++ +I + ++G +F    I  +G   ++K     
Sbjct: 196 ---------------NSMGDPEGLFSTLPAIATVLLG-YFAGDWIRKRGSGLKIKTSRQS 239

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
                + L+  GL L ++   P+NK+L+T SYV  T G A ++ +  Y L++      W+
Sbjct: 240 LALASYGLISTGLGLLWSIWFPINKKLWTSSYVLFTVGIALILLAVCYELIEVRRIRLWS 299


>gi|410926267|ref|XP_003976600.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Takifugu rubripes]
          Length = 497

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 74/236 (31%), Positives = 123/236 (52%), Gaps = 22/236 (9%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
            ++RL SLD FRG+++ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G +IAL
Sbjct: 278 SSKRLQSLDTFRGISLVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPWFMFIMGTSIAL 337

Query: 91  A----LKRIPDRADAVKKVIFRTLKLLFWGI-LLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           +    L+    R   + KV +R+L+L   G+ ++   +   P  L++G     +R+ GVL
Sbjct: 338 SVHALLRTGSTRLSLLGKVAWRSLQLFLIGLFIINPNYCQGP--LSWGT----LRIPGVL 391

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QR+AL+YL+V+ +++       +  +    +   LY   W+    +  V+L L +   VP
Sbjct: 392 QRLALAYLVVACLDLLVARAHLEIYTTVSSTDVLLYWPAWVCVLLLESVWLFLTFLLPVP 451

Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
           D     +      D G   N T G          A G+IDR +LG  H+Y +P+ R
Sbjct: 452 DCPTGYLGPGGIGDMGLFPNCTGG----------AAGFIDRWLLGEKHIYQNPSSR 497


>gi|431796483|ref|YP_007223387.1| hypothetical protein Echvi_1106 [Echinicola vietnamensis DSM 17526]
 gi|430787248|gb|AGA77377.1| hypothetical protein Echvi_1106 [Echinicola vietnamensis DSM 17526]
          Length = 369

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 104/363 (28%), Positives = 163/363 (44%), Gaps = 86/363 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
           +  QR+ +LD+FRG+ +  MILV++ G  W  +     HA W+GC   D + PFFLFIVG
Sbjct: 1   MPKQRILALDVFRGITIFAMILVNNPG-SWSHVYAPLLHAKWHGCTPTDLIFPFFLFIVG 59

Query: 86  VAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
           VAI L+    LK+   +   ++K + R LKL+                + Y  D+  +R 
Sbjct: 60  VAIELSLGGQLKKGTPKGFLLRKSLIRALKLIG--------LGLLLTAIPY-FDLAHLRF 110

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQRI L Y + +++ ++             FS   L   +WL   C+          
Sbjct: 111 PGVLQRIGLVYFISTVMYLYW------SPKALVFSSGILLIGYWL---CM---------- 151

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
           T++P                   V     A L P  N   +ID++VL   HM     W +
Sbjct: 152 TFIP-------------------VPGIGPANLEPGTNLAAWIDQQVL-TGHM-----WSQ 186

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
           +K                       ++PEGL S++ +I++ ++GV  G ++     H AR
Sbjct: 187 TKT----------------------WDPEGLFSTLPAIVTCLLGVACGKILTGNSSHKAR 224

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
           L +W   G  L+  GL   ++   PLNK L+T S+V  T+G A L  +A Y ++D   + 
Sbjct: 225 LTKWGIAGVTLVFGGLA--WSLFFPLNKALWTSSFVLYTAGWAFLGLAACYWILDVKGWK 282

Query: 382 PWT 384
            W+
Sbjct: 283 KWS 285


>gi|383115204|ref|ZP_09935962.1| hypothetical protein BSGG_2914 [Bacteroides sp. D2]
 gi|313695379|gb|EFS32214.1| hypothetical protein BSGG_2914 [Bacteroides sp. D2]
          Length = 361

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 100/336 (29%), Positives = 159/336 (47%), Gaps = 80/336 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  A D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +LVVY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLVVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + SAD                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNERRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            +G  LL  G  L +  A PLNK+L++ S+V +T G
Sbjct: 231 LIGTTLLFVGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|301307595|ref|ZP_07213552.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|423337400|ref|ZP_17315144.1| hypothetical protein HMPREF1059_01069 [Parabacteroides distasonis
           CL09T03C24]
 gi|300834269|gb|EFK64882.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|409237229|gb|EKN30029.1| hypothetical protein HMPREF1059_01069 [Parabacteroides distasonis
           CL09T03C24]
          Length = 368

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 99/361 (27%), Positives = 167/361 (46%), Gaps = 78/361 (21%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ RL SLD+ RG+ +A MILV++ G     +  + HA WNG    D V PFF+FI+GV+
Sbjct: 4   QSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIMGVS 63

Query: 88  IALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHA-PDELTYGVDVRMIRLCGV 144
           +  +L++   +   ++V KV+ RT+ +   G+ L   F H   +  T   D + +R+ GV
Sbjct: 64  MFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN-LFGHVCYNGFT---DFQNLRILGV 119

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           +QR+AL+Y   SL+ +               +I   Y     +AA +L+ Y ALL  T+ 
Sbjct: 120 MQRLALAYGFGSLIGL---------------AINHKYILQ--VAAGILIFYWALLGFTHS 162

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
            +     +++DS                       +  +D+ + G +HMYH         
Sbjct: 163 ME-----MSEDS----------------------IIAIVDKALFGTSHMYH--------- 186

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH-TKGHLARLK 323
                        D        F+PEGLLS + SI   ++G + G VI    K +   ++
Sbjct: 187 ------------DDMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVIQDCKKNNELIIR 234

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
                G  +L  G  L +    P+NK++++ ++V VT G A+L  + +  ++D N    W
Sbjct: 235 NIFIFGTIILFAGFLLSY--GCPINKKIWSSTFVLVTCGFASLFLALLIWIIDINGKKKW 292

Query: 384 T 384
           T
Sbjct: 293 T 293


>gi|294674520|ref|YP_003575136.1| hypothetical protein PRU_1851 [Prevotella ruminicola 23]
 gi|294472648|gb|ADE82037.1| putative membrane protein [Prevotella ruminicola 23]
          Length = 357

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 96/360 (26%), Positives = 162/360 (45%), Gaps = 83/360 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPE-ISHAPWNGCNLADFVMPFFLFIVGVAI 88
           ++++RL SLDI RG+ VA MILV++  G+  E + H+ WNG    D V PFFLFI+G++ 
Sbjct: 1   MESKRLLSLDILRGITVAGMILVNNGWGESFEMLRHSKWNGMTPCDLVFPFFLFIMGISC 60

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHA--PDELTYGVDVRMIRLCGV 144
            L+L +         +++++ RT+ L   G+ +   F HA   D L +G     +R+  V
Sbjct: 61  YLSLVKSEFKPTPQVIRRIVKRTVLLFAIGLFIN-WFDHAIEGDLLCFG----HLRIWAV 115

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           +QRIAL Y +VSL                    F L+C H                    
Sbjct: 116 MQRIALCYGIVSL--------------------FALFCNH-------------------- 135

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
             +  ++I    A Y  +  +  G     N   N +   D K+ G +H+YH         
Sbjct: 136 -KYTLSVIGGLLAIYTAILILGNGYAEDAN--VNVLAQADLKLFGYDHIYH--------- 183

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                               +P +PEGL+ ++SS+   ++G + G +I   +    ++  
Sbjct: 184 -------------------KSPVDPEGLMGTISSVAHVLLGFYCGMLIRKRETVEQKVIA 224

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
              +G   +I G  L +   +PLNK++++ SYV +T G A+L+ + +  ++D      WT
Sbjct: 225 LFVVGAVCVIGGYLLSY--GLPLNKRIWSPSYVLMTCGLASLMQALLMYVIDIQKKSGWT 282


>gi|441501363|ref|ZP_20983482.1| N-acetylglucosamine related transporter, NagX [Fulvivirga
           imtechensis AK7]
 gi|441434899|gb|ELR68324.1| N-acetylglucosamine related transporter, NagX [Fulvivirga
           imtechensis AK7]
          Length = 368

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 103/355 (29%), Positives = 161/355 (45%), Gaps = 82/355 (23%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +RL SLD+FRG+ +A MI+V++ G     +P + HA W+GC L D V PFFLFIVGVA
Sbjct: 3   KNKRLLSLDVFRGITIAAMIVVNNPGSWAAVYPPLLHAGWHGCTLTDLVFPFFLFIVGVA 62

Query: 88  IALALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           + L+L R  +     K++IF  LK   +LF    L G F +A        D+  +R+ GV
Sbjct: 63  VCLSLSRAVEDKGRHKQIIFTVLKRSVILF----LIGLFLNAFPYF----DLYHLRIPGV 114

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ + + +   ++ K        +G               + +L+VY  L      
Sbjct: 115 LQRIAVVFFICAF--LYLKTGWKVQVYIG---------------SAILMVYWLL------ 151

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
               F II    A  G            L    N   ++D ++L   HM     W  +K 
Sbjct: 152 ----FLIIPIPGAATG-----------SLESGANLAAWVDSQLLT-GHM-----WEVTKT 190

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG+LS++ +I++ IIGV  G  ++  +    ++  
Sbjct: 191 ----------------------WDPEGVLSTLPAIVTGIIGVLVGQWLMADRTEKEKVIY 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNI 379
                  L++ G  L +    P+NK L+T SYV  T+G A    + +Y L+D  +
Sbjct: 229 LFVAANLLIVAG--LFWDLFFPINKSLWTSSYVLYTAGIAIHFLAFLYWLLDIKM 281


>gi|423241434|ref|ZP_17222547.1| hypothetical protein HMPREF1065_03170 [Bacteroides dorei
           CL03T12C01]
 gi|392641327|gb|EIY35104.1| hypothetical protein HMPREF1065_03170 [Bacteroides dorei
           CL03T12C01]
          Length = 372

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 97/368 (26%), Positives = 156/368 (42%), Gaps = 90/368 (24%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
            E +    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D
Sbjct: 4   EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62

Query: 75  FVMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
            V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D
Sbjct: 63  LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            L +      +R+ GVLQRIAL Y ++S   +F                           
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
                         ++P   F ++   +        + C          N +  IDR++ 
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  H+Y                              +P +PEG +S++S+I  T IG   
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSC 223

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  II +     ++ +    GF L+  G  L   +A+PLNK++++ ++V VT GAA++  
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSL 281

Query: 369 SAIYALVD 376
           + +   +D
Sbjct: 282 ATLMYYID 289


>gi|256420508|ref|YP_003121161.1| hypothetical protein Cpin_1463 [Chitinophaga pinensis DSM 2588]
 gi|256035416|gb|ACU58960.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
          Length = 374

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 105/359 (29%), Positives = 167/359 (46%), Gaps = 78/359 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           QR   LD+FRGL V  MI+V+  G D  +  ++HA WNGC   D V P FLF VG A++ 
Sbjct: 7   QRFLPLDVFRGLTVCFMIIVNTPGWDTSYYILNHAQWNGCTPTDMVFPSFLFAVGNAMSF 66

Query: 91  ALKRIP--DRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQ 146
           ++++    +    + K+  RTL +   G L+       H    L + + +   R+ GVLQ
Sbjct: 67  SMRKFQQLENTAVLSKIFRRTLLIFLLGFLMYWLPFVRHTESGLEF-IPLSDTRILGVLQ 125

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RIAL Y   SL+  +        ++V            W ++A +L+ Y A++Y      
Sbjct: 126 RIALCYCFASLLIHYLPK-----KAV------------WAVSAVLLLGYWAVMYAF---- 164

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                   D AD             + +   NA  + D+ ++G +H+YH           
Sbjct: 165 -------GDPAD-------------RYSLTGNAALHFDKLIMGDSHLYHG---------- 194

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR-LKQW 325
                EG             F+PEGLLS++ +I++ I G + G + I  +G   + L++ 
Sbjct: 195 -----EG-----------IAFDPEGLLSTLPAIVNVIAGYYTG-LFIQQEGKTGKGLRKL 237

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           + MG ALLI  + L ++ A P+NK+L+T SYV  T G   L+ S +  ++D      WT
Sbjct: 238 LQMG-ALLIL-VALVWSMAFPINKKLWTSSYVLYTVGIDLLILSLLIFVIDFKKQEGWT 294


>gi|116789271|gb|ABK25182.1| unknown [Picea sitchensis]
          Length = 124

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 48/93 (51%), Positives = 67/93 (72%)

Query: 3   EIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI 62
           + +  TT    +I+ +   +   + ++    R+A+LD+FRGL +A+MILVD AGG WP+I
Sbjct: 9   QKQPSTTESSNVIVIQDGQTIPAKPTNETKTRVATLDVFRGLTIAVMILVDDAGGKWPQI 68

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRI 95
           +H+PWNGC LADFVMPFFLFIVGVA+AL  K +
Sbjct: 69  NHSPWNGCTLADFVMPFFLFIVGVAVALTFKVV 101


>gi|374309722|ref|YP_005056152.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358751732|gb|AEU35122.1| hypothetical protein AciX8_0773 [Granulicella mallensis MP5ACTX8]
          Length = 377

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 93/345 (26%), Positives = 157/345 (45%), Gaps = 69/345 (20%)

Query: 44  LAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIP 96
           + +A MILV   G     +P++ HA WNG    D + P FL I+GVA+  +    ++R  
Sbjct: 1   MIIAGMILVTDPGTYSAVYPQLMHAQWNGATATDMIFPSFLVIIGVAMTFSFASRIERGA 60

Query: 97  DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVS 156
           DR   +  V+ R++ L+F G+L+ G     P+      ++  IR+ G+LQRIAL Y   S
Sbjct: 61  DRRQILWHVLTRSVLLIFLGLLVNG----FPEY-----NLHTIRIPGILQRIALCYFAGS 111

Query: 157 LVEIFT---KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
           L+ +     KD   + Q + R ++         + A +LV+Y  LL G  VP +      
Sbjct: 112 LLYLAVSGKKDANTESQRLRRGTVIG------AVLAGLLVLYWVLLKGYPVPGFG----- 160

Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
                             +L+   N   Y DRK+ G+ H++ +            +P  G
Sbjct: 161 ----------------SGRLDSLGNVAAYFDRKIFGVQHLWAYGL----------TPGYG 194

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
                        F+PEGLLS++ ++ + + GV  G  +   +    +       G AL+
Sbjct: 195 -----------VTFDPEGLLSTLPALATLLFGVLAGEWLRTNQARGRKALVLAVAGVALV 243

Query: 334 IFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           + GL L  +  +PLNK++ T ++   + G A L+F+  Y ++D  
Sbjct: 244 LVGLAL--SPLLPLNKKILTSTFAIFSGGVALLLFAGFYFVLDVK 286


>gi|281423205|ref|ZP_06254118.1| putative membrane protein [Prevotella oris F0302]
 gi|281402541|gb|EFB33372.1| putative membrane protein [Prevotella oris F0302]
          Length = 359

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 98/354 (27%), Positives = 158/354 (44%), Gaps = 86/354 (24%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ +RL SLD+ RG  V LMILV++  G   +  + H+ WNG    D V PFFLFI+G++
Sbjct: 1   MEKKRLLSLDVLRGATVCLMILVNNGAGKHIYATLQHSKWNGMTPCDLVFPFFLFIMGIS 60

Query: 88  IALALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYG--VDVRMIRLC 142
             L+LK+        ++V F+ +K   LLF    L G F +  D    G  +D   +R+ 
Sbjct: 61  TYLSLKK--TNFTWSRQVAFKIVKRTVLLF----LIGLFINWFDMAISGNALDFSHLRIW 114

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
            V+QRIA+ Y  V                    SIF L C H      ++++  A     
Sbjct: 115 AVMQRIAICYFAV--------------------SIFALCCNHRHTIPAIVILLAA----- 149

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
               +   +I  +   Y              +   N +  ID ++ GI H+YH       
Sbjct: 150 ----YNLLLIWGNGYAY--------------DSQQNILAQIDIRLFGIEHLYH------- 184

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                                ++P +PEG  SS+S+I  T+IG + G  +   K    ++
Sbjct: 185 ---------------------NSPVDPEGTGSSLSAIAHTLIGFYCGKRMSDAKSTEEKV 223

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            +++  G  L+I G  + F   +PLNK++++ SYVC+T G AA+    +   +D
Sbjct: 224 LRFLITGGFLVIIGYIVSF--GLPLNKRIWSPSYVCMTCGLAAVTQGLLMYCID 275


>gi|338210835|ref|YP_004654884.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336304650|gb|AEI47752.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 363

 Score =  107 bits (267), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 103/350 (29%), Positives = 155/350 (44%), Gaps = 84/350 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
            RL SLD+FRG+ VA MILV++ G DW  +     HA WNGC   D + PFFLFIVGV++
Sbjct: 3   NRLLSLDVFRGMTVAAMILVNNPG-DWDHVYAPLLHAHWNGCTPTDLIFPFFLFIVGVSV 61

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           A A+ + P    ++ K+I R+  L      L    +  P       D   +R+ GVLQRI
Sbjct: 62  AFAMGKNP---PSLLKIIKRSAIL----FGLGLFLNLYPK-----FDFANVRIPGVLQRI 109

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           AL YL+ SL  IF K  + K Q +                  +L+ Y  L+         
Sbjct: 110 ALVYLVCSL--IFIKTTR-KTQVI--------------TTVLLLIAYWLLM--------- 143

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
            T++      Y           A L P  N   ++DR +L   H+     W+ +K     
Sbjct: 144 -TLVPVPGVGY-----------ANLEPTTNLGAWVDRGLLTTAHL-----WKSAKV---- 182

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
                             ++PEG+ S++ +I + ++GV  G  +   K    ++      
Sbjct: 183 ------------------WDPEGMFSTIPAIGTGLLGVLTGQWLRSEKPVAEKMAWLFLS 224

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           G AL++ GL   +    P+NK L+T S+V    G A +  +A Y L+D  
Sbjct: 225 GNALILGGLI--WNEFFPINKSLWTSSFVLYAGGWAMVGLAACYWLIDVQ 272


>gi|399028182|ref|ZP_10729485.1| hypothetical protein PMI10_01306 [Flavobacterium sp. CF136]
 gi|398074259|gb|EJL65410.1| hypothetical protein PMI10_01306 [Flavobacterium sp. CF136]
          Length = 423

 Score =  107 bits (267), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 170/370 (45%), Gaps = 70/370 (18%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL SLD+FRGL + LM +V++ G DW    P + HA WNGC   D V PFF+FI+GVA+
Sbjct: 4   ERLISLDVFRGLTILLMTIVNNPG-DWGHVFPPLLHAKWNGCTPTDLVFPFFIFIMGVAV 62

Query: 89  ALALKRIPDRA---DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            LA+   PD+        K++ R+L++L  GI     F+       +G++   + +  ++
Sbjct: 63  PLAM---PDKIYDDTTFNKILVRSLRMLCLGIF----FNFFEKIQLFGLEGIPLLIGRLI 115

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW---------------HWLMAAC 190
             IA+ Y+L+        +   K +++  FSI  +Y +                 L    
Sbjct: 116 ITIAVGYVLMG-------NFSSKLKNIFAFSILIIYLFLAYSEIEAYQDVRLPGVLQRIA 168

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNV--TCGV-RAKLNPPCNAVGYIDRKV 247
           V+   ++LLY       Q          Y  V N+    G+  A L    N   ++D  +
Sbjct: 169 VVYFVVSLLYLKTSQKTQIITGVFLLLGYWAVMNLIPVPGIGEANLEKGTNLASWLDSIL 228

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
           L   HMYH                      +  +W     +PEG+LS++ SI++ IIG+ 
Sbjct: 229 LK-GHMYH----------------------ETKTW-----DPEGILSTIPSIVNGIIGLL 260

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
            G ++      +   K+   +G AL+I GL  +     P+NK ++T SYV  T+G AA  
Sbjct: 261 IGQLLFLKIAKIEIAKKIALIGIALIITGLLWNI--VFPINKSIWTSSYVLYTTGLAATT 318

Query: 368 FSAIYALVDC 377
            S +Y ++D 
Sbjct: 319 LSVLYFIIDI 328


>gi|375146803|ref|YP_005009244.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361060849|gb|AEV99840.1| Protein of unknown function DUF2261, transmembrane [Niastella
           koreensis GR20-10]
          Length = 376

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 160/371 (43%), Gaps = 95/371 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHA---GGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           + +R  +LD+FRG+ +  MI+V+ +      +  + HA W+G    D V P FLF VG A
Sbjct: 3   QQKRFLALDVFRGMTICFMIIVNTSPDGSHTYAPLLHAQWHGFTPTDLVFPSFLFAVGNA 62

Query: 88  IALALKRIPDRADA--VKKVIFRTLKLLFWGILL-----------QGGFSHAPDELTYGV 134
           ++  + R  + +    + K++ RTL +   G L+            G ++  P E T   
Sbjct: 63  MSFVMPRWENASTGFVLGKILKRTLLIFILGYLMYWFPFVRMDKVTGVYAFYPFEKT--- 119

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
                R+ GVLQRIAL+Y   SL+  F K              FR      ++ A +L++
Sbjct: 120 -----RVFGVLQRIALAYCFASLMLYFLK--------------FRATI---IITAAILLL 157

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y  +LY              DSAD              L+   NAV  +D  +LG +H+Y
Sbjct: 158 YWPVLY-----------FFGDSAD-------------PLSLAGNAVLKLDLWLLGPDHLY 193

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
           H                EG            PF+PEG LS+  +I + + G   G  +  
Sbjct: 194 HG---------------EG-----------VPFDPEGFLSTFPAIANVVGGYWVGRFLQQ 227

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTN-AIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             G    L + +  GFALL+     HF N + P+NK+L+T S+V  T G   L+ +AI  
Sbjct: 228 KGGTYEALTKLMLAGFALLVLA---HFWNYSFPINKKLWTSSFVLHTVGIDCLIIAAIVY 284

Query: 374 LVDCNIFGPWT 384
           + D      WT
Sbjct: 285 IADIQQKTSWT 295


>gi|299144716|ref|ZP_07037784.1| putative membrane protein [Bacteroides sp. 3_1_23]
 gi|298515207|gb|EFI39088.1| putative membrane protein [Bacteroides sp. 3_1_23]
          Length = 361

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 99/336 (29%), Positives = 159/336 (47%), Gaps = 80/336 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  A D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +L+VY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLIVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + SAD                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            +G  LL  G  L +  A PLNK+L++ S+V +T G
Sbjct: 231 LIGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|373460170|ref|ZP_09551926.1| hypothetical protein HMPREF9944_00190 [Prevotella maculosa OT 289]
 gi|371956555|gb|EHO74341.1| hypothetical protein HMPREF9944_00190 [Prevotella maculosa OT 289]
          Length = 359

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 96/355 (27%), Positives = 158/355 (44%), Gaps = 88/355 (24%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ +RL SLD+ RG+ V LMILV++  G+  +  + H+ WNG    D V PFFLFI+G++
Sbjct: 1   MEKKRLLSLDVLRGMTVCLMILVNNGAGEHIYSTLQHSKWNGMTPCDLVFPFFLFIMGIS 60

Query: 88  IALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG--VDVRMIRLC 142
             L+LK+     +R  A  K+  RT+ L   G+     F +  D L  G  +D   +R+ 
Sbjct: 61  TFLSLKQTNFAWNRQTAC-KIAKRTVLLFAIGL-----FINWFDLLLQGRALDFEHLRIW 114

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GV+QRIA+ Y     V +F   +  K                 L+A  ++   + L+ G 
Sbjct: 115 GVMQRIAICY---GAVSVFALSINHKRTLP-------------LIATLLIAYAMFLMLGN 158

Query: 203 -YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            Y  D Q                             N +  ID  + G  H+YH      
Sbjct: 159 GYAYDSQ----------------------------QNLIAQIDIHLFGQAHLYH------ 184

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                                  +P +PEGL SS+ +I  T+IG + G ++   +    +
Sbjct: 185 ----------------------KSPVDPEGLASSLPAIAHTLIGFYCGRLMAMARTTEEK 222

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           + +++ +G  L++ G    F   +PLNK++++ SYVC+T G AA     +  ++D
Sbjct: 223 VLKFMLVGGVLVLIGYLASF--GLPLNKRIWSPSYVCITCGLAATCLGLLMYVID 275


>gi|311747386|ref|ZP_07721171.1| membrane protein [Algoriphagus sp. PR1]
 gi|126579104|gb|EAZ83268.1| membrane protein [Algoriphagus sp. PR1]
          Length = 381

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 99/367 (26%), Positives = 166/367 (45%), Gaps = 80/367 (21%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
           K+ L   R  +LD+ RGL +A MI+V+ AG DW  +    +HA W+G    D V P FLF
Sbjct: 6   KTDLLKNRYLALDVLRGLTIAFMIVVNSAG-DWSNLYAPLAHAKWHGFTPTDLVFPTFLF 64

Query: 83  IVGVAIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRM 138
           +VG A++ ++K++ +   +   KKV  RTL +   G LL    F    +   +  +++  
Sbjct: 65  VVGNAMSFSMKKLQEMPTSAFFKKVGKRTLLIFLIGWLLNAFPFYDISETGNFSLINITE 124

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +RL GVLQRIAL Y   +++ ++   V             RL    W+ +   L+ Y  +
Sbjct: 125 VRLFGVLQRIALCYFFAAII-LYIGGV-------------RL---GWIFSGIALLTYWGI 167

Query: 199 LYGTYVPDWQFTIINKDSAD-YGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +Y           +  DS+D YG                 NA   +D  ++G++ MY   
Sbjct: 168 MY-----------VFGDSSDPYGLT--------------GNAAIKLDLSLIGVDRMYGG- 201

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
                         EG            PF+PEGLLS++ SI++ I G   G ++     
Sbjct: 202 --------------EG-----------IPFDPEGLLSTLPSIVNVIAGYIIGKMVQKYGN 236

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
            L  +K+ +     L++  L   +    P+NK+++T SYV +T G   ++ + +  +++ 
Sbjct: 237 TLESIKKLLIGAVVLIV--LAYIWDIVFPINKKIWTSSYVLLTVGIDMVLLALLVYIIEL 294

Query: 378 NIFGPWT 384
                WT
Sbjct: 295 QKVKNWT 301


>gi|423215264|ref|ZP_17201791.1| hypothetical protein HMPREF1074_03323 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392691832|gb|EIY85072.1| hypothetical protein HMPREF1074_03323 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 371

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 102/355 (28%), Positives = 157/355 (44%), Gaps = 80/355 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLD+ RG+ +  MILV++ G  W  +     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDVLRGITIVGMILVNNPG-TWESVYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRL 141
           V+++ AL R         + K++ RT+ L   G+ L   FS        GV+     IR+
Sbjct: 60  VSMSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLS-WFSLV----CAGVEQPFSQIRI 114

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQR+AL+Y   SL+ +  +   +               W   ++A +L+ Y+ LL  
Sbjct: 115 LGVLQRLALAYFFGSLLIMSVRRPAN-------------LAW---ISAIILIGYIVLL-- 156

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                             G  F ++           N +   DR + G  H+Y       
Sbjct: 157 ----------------ALGNGFELS---------EQNIIAVTDRTLFGETHLY------- 184

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                         R+  P      F+PEGLLS++  I   IIG   G+++        R
Sbjct: 185 --------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHR 230

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L Q   +G ALL  G  L +    PLNK++++ ++V VT G A+L    +  L+D
Sbjct: 231 LLQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLFLVFLTWLID 283


>gi|295132874|ref|YP_003583550.1| hypothetical protein ZPR_1009 [Zunongwangia profunda SM-A87]
 gi|294980889|gb|ADF51354.1| membrane protein [Zunongwangia profunda SM-A87]
          Length = 376

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 105/359 (29%), Positives = 157/359 (43%), Gaps = 100/359 (27%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           ++R  SLD+FRGL +ALMILV+  G     +P + HA W G  LAD V P FLF VG A+
Sbjct: 6   SERFLSLDVFRGLTIALMILVNTPGTGADLYPYLVHAQWFGFTLADLVFPSFLFAVGNAM 65

Query: 89  ALALKRIPDRA--DAVKKVIFRTLKLLFWGILL---------QGGFSHAPDELTYGVDVR 137
           + ++++  + A  D  KKV+ RT  +   G L+         +G    +P   T      
Sbjct: 66  SFSMRKFQEAAPADFWKKVLKRTAIIFLLGFLMYWFPFFRMNEGHLELSPFSET------ 119

Query: 138 MIRLCGVLQRIALSYLL-VSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             R+ GVLQRIAL Y     LV  F+       +++G             + A +L+ Y 
Sbjct: 120 --RIMGVLQRIALCYFFGAVLVRYFSV------KTIG------------FICAAILLAYW 159

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
            +LYG   P                          +L    NA    D  +LG  H+Y  
Sbjct: 160 GILYGFGEPG------------------------HELEMATNAAAKFDYAILGEGHIY-- 193

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
                              +KDA      PF+PEG+LS++ SI++ + G +   V I  K
Sbjct: 194 -------------------KKDA-----IPFDPEGILSTLPSIVNVLAG-YLAGVFIRRK 228

Query: 317 GH-LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG-----AAALVFS 369
           G     + + +  GF  L+F L   +    PL+K+L+T  +  +T G      AAL+F+
Sbjct: 229 GKSYETIAKLMLAGF--LVFALAEWWALIFPLSKKLWTSPFAMLTIGLNLSMLAALIFA 285


>gi|265753064|ref|ZP_06088633.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|263236250|gb|EEZ21745.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
          Length = 372

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 155/368 (42%), Gaps = 90/368 (24%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
            E +    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D
Sbjct: 4   EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62

Query: 75  FVMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
            V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D
Sbjct: 63  LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            L +      +R+ GVLQRIAL Y ++S   +F                           
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
                         ++P   F ++   +        + C          N +  IDR++ 
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  H+Y                              +P +PEG +S++S+I  T IG   
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSC 223

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  II +     ++ +    GF L+  G  L   + +PLNK++++ ++V VT GAA++  
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL--ADVLPLNKRIWSPTFVLVTCGAASMSL 281

Query: 369 SAIYALVD 376
           + +   +D
Sbjct: 282 ATLMYYID 289


>gi|160884063|ref|ZP_02065066.1| hypothetical protein BACOVA_02039 [Bacteroides ovatus ATCC 8483]
 gi|423291476|ref|ZP_17270324.1| hypothetical protein HMPREF1069_05367 [Bacteroides ovatus
           CL02T12C04]
 gi|156110405|gb|EDO12150.1| hypothetical protein BACOVA_02039 [Bacteroides ovatus ATCC 8483]
 gi|392663476|gb|EIY57026.1| hypothetical protein HMPREF1069_05367 [Bacteroides ovatus
           CL02T12C04]
          Length = 361

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 99/336 (29%), Positives = 158/336 (47%), Gaps = 80/336 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  A D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +L VY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLAVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + SAD                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            +G  LL  G  L +  A PLNK+L++ S+V +T G
Sbjct: 231 LIGTTLLFVGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|196002389|ref|XP_002111062.1| hypothetical protein TRIADDRAFT_54611 [Trichoplax adhaerens]
 gi|190587013|gb|EDV27066.1| hypothetical protein TRIADDRAFT_54611 [Trichoplax adhaerens]
          Length = 431

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 145/317 (45%), Gaps = 66/317 (20%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +R+ SLD++RGL   +M   D  GG +    H+ WNG  + D V P F+FI G +++++L
Sbjct: 161 RRIRSLDLYRGLCAIVMAFGDSGGGQYRFFKHSIWNGLTIVDVVFPGFIFISGFSLSISL 220

Query: 93  KRIPDRADAVKKV-----IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            +   +    K +     I R+  L F G+L+ G     P +++        RL GVLQR
Sbjct: 221 VKRLYKMQTPKLILIVTTIRRSFYLFFLGLLING-----PCQISNW------RLLGVLQR 269

Query: 148 IALSYLLVSLVEI--------FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           I++++L+VS + +        FTKD   +++ + +        W  ++    L  Y+ L 
Sbjct: 270 ISVTFLVVSCLAVWLYPTIKSFTKDQVLQEKVLRKM-------WPIMVLIVGLHTYVTLT 322

Query: 200 YGTYVPDWQFTIINK-DSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
               VPD           +D GK +N T G+           G+IDR V G NH+Y  P 
Sbjct: 323 AA--VPDCPVGYSGPGGKSDDGKYYNCTGGI----------AGFIDRFVFGSNHLYSRPT 370

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT--K 316
            +     +Q                  PF+PEG+L +++SI    +G+  G  I+H    
Sbjct: 371 CKLLYQSSQ------------------PFDPEGVLGTLTSIFLCFLGLQMG--ILHNIFS 410

Query: 317 GHLARLKQWVTMGFALL 333
            +L  ++ W+  G  L+
Sbjct: 411 NNLRIMRTWILFGLLLV 427


>gi|293371912|ref|ZP_06618316.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|292633158|gb|EFF51735.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
          Length = 361

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 98/336 (29%), Positives = 159/336 (47%), Gaps = 80/336 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  A D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +L+VY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLIVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + SAD                   N VG +D  +LG NHMY            
Sbjct: 163 -------EKSAD-------------------NIVGIVDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            +G  LL  G  L +  A PLNK+L++ S+V +T G
Sbjct: 231 LIGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|298480127|ref|ZP_06998326.1| membrane protein [Bacteroides sp. D22]
 gi|336404355|ref|ZP_08585053.1| hypothetical protein HMPREF0127_02366 [Bacteroides sp. 1_1_30]
 gi|295085510|emb|CBK67033.1| Uncharacterized conserved protein [Bacteroides xylanisolvens XB1A]
 gi|298273936|gb|EFI15498.1| membrane protein [Bacteroides sp. D22]
 gi|335943683|gb|EGN05522.1| hypothetical protein HMPREF0127_02366 [Bacteroides sp. 1_1_30]
          Length = 361

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 98/336 (29%), Positives = 159/336 (47%), Gaps = 80/336 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  + D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYDFQCRPAITKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +L+VY    L+G     
Sbjct: 122 LGICYGITALLAV---AIPHK-----RFMP---------LAIILLIVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + SAD                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGANHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            +G  LL  G  L +  A PLNK+L++ S+V +T G
Sbjct: 231 LIGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|423214205|ref|ZP_17200733.1| hypothetical protein HMPREF1074_02265 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693150|gb|EIY86385.1| hypothetical protein HMPREF1074_02265 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 361

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 98/336 (29%), Positives = 159/336 (47%), Gaps = 80/336 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  + D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +L VY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLAVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + SAD                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II+ K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIINIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            +G  LL  G  L +  A PLNK+L++ S+V +T G
Sbjct: 231 LIGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|423293378|ref|ZP_17271505.1| hypothetical protein HMPREF1070_00170 [Bacteroides ovatus
           CL03T12C18]
 gi|392678321|gb|EIY71729.1| hypothetical protein HMPREF1070_00170 [Bacteroides ovatus
           CL03T12C18]
          Length = 361

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 98/336 (29%), Positives = 158/336 (47%), Gaps = 80/336 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  + D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +LVVY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLVVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + S D                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSVD-------------------NIVGMIDSAILGANHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            +G  LL  G  L +  A PLNK+L++ S+V +T G
Sbjct: 231 LIGTTLLFVGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|223935576|ref|ZP_03627492.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223895584|gb|EEF62029.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 410

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 103/362 (28%), Positives = 160/362 (44%), Gaps = 88/362 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           + RL SLD+FRG  +A M+LV++ G  W  I     HA WNG    D + PFFL+IVGVA
Sbjct: 22  SARLMSLDVFRGATIASMMLVNNPG-SWDSIYRQLDHAEWNGWTFTDLIFPFFLWIVGVA 80

Query: 88  IALALKRIPD----RADAVKKVIFRT-------LKLLFWGILLQGGFSHAPDELTYGVDV 136
           I L+ ++  D    R +    V+ R        L L F+  L+ G +        +  ++
Sbjct: 81  IPLSTQKRLDGGASRTNLWLHVVRRAAIIFGLGLFLAFFSFLINGSYGRLGGFGPWFNEI 140

Query: 137 -RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
              IR+ GVLQRIA+ YL+ S + + TK        +G  + +      W++  CV V  
Sbjct: 141 CGTIRIPGVLQRIAVCYLIASTIYLTTKLRGQIAWLIGLLAAY------WVLMKCVPV-- 192

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
                                  +G       GV   L P  N   Y+D  VLG  H +H
Sbjct: 193 ---------------------PGHG------AGV---LTPEGNFSAYVDGNVLG-RHTWH 221

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
                                        AP++PEG++S++ +I + + G+  G +++  
Sbjct: 222 ----------------------------GAPWDPEGVISTIPAIATCLFGILTGQLLL-I 252

Query: 316 KGHLARLKQWVTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           K  + +   WV + G  L++ G  ++    +P+NK L+T SY    +G A  VF+  Y L
Sbjct: 253 KRSVEQKTTWVFVSGILLILAGAVMNIW--LPINKNLWTSSYSVFMAGMAMNVFAVFYWL 310

Query: 375 VD 376
           VD
Sbjct: 311 VD 312


>gi|298479647|ref|ZP_06997847.1| membrane protein [Bacteroides sp. D22]
 gi|336403243|ref|ZP_08583960.1| hypothetical protein HMPREF0127_01273 [Bacteroides sp. 1_1_30]
 gi|295084924|emb|CBK66447.1| Uncharacterized conserved protein [Bacteroides xylanisolvens XB1A]
 gi|298274037|gb|EFI15598.1| membrane protein [Bacteroides sp. D22]
 gi|335946636|gb|EGN08437.1| hypothetical protein HMPREF0127_01273 [Bacteroides sp. 1_1_30]
          Length = 371

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 101/355 (28%), Positives = 156/355 (43%), Gaps = 80/355 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLD+ RG+ +  MILV++ G  W  +     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDVLRGITIVGMILVNNPG-TWESVYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRL 141
           V+++ AL R         + K++ RT+ L   G+ L   FS        GV+     IR+
Sbjct: 60  VSMSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLS-WFSLV----CAGVEQPFSQIRI 114

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQR+AL+Y   SL+ +  +   +               W   ++A +L+ Y+ LL  
Sbjct: 115 LGVLQRLALAYFFGSLLIMSVRRPAN-------------LAW---ISAIILIGYIVLL-- 156

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                             G  F ++           N +   DR + G  H+Y       
Sbjct: 157 ----------------ALGNGFELS---------EQNIIAVTDRTLFGETHLY------- 184

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                         R+  P      F+PEGLLS++  I   IIG   G+++        R
Sbjct: 185 --------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHR 230

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L Q   +G  LL  G  L +    PLNK++++ ++V VT G A+L    +  L+D
Sbjct: 231 LLQISILGIVLLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLFLVLLTWLID 283


>gi|237719042|ref|ZP_04549523.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229451820|gb|EEO57611.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
          Length = 361

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 98/336 (29%), Positives = 158/336 (47%), Gaps = 80/336 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  + D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +LVVY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLVVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + S D                   N VG ID  +LG NHMY            
Sbjct: 163 -------EKSVD-------------------NIVGMIDSAILGANHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            +G  LL  G  L +  A PLNK+L++ S+V +T G
Sbjct: 231 LIGTTLLFVGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|262407085|ref|ZP_06083634.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294648023|ref|ZP_06725570.1| putative membrane protein [Bacteroides ovatus SD CC 2a]
 gi|294809833|ref|ZP_06768513.1| putative membrane protein [Bacteroides xylanisolvens SD CC 1b]
 gi|345512215|ref|ZP_08791750.1| hypothetical protein BSAG_03359 [Bacteroides sp. D1]
 gi|229445856|gb|EEO51647.1| hypothetical protein BSAG_03359 [Bacteroides sp. D1]
 gi|262355788|gb|EEZ04879.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292636642|gb|EFF55113.1| putative membrane protein [Bacteroides ovatus SD CC 2a]
 gi|294442971|gb|EFG11758.1| putative membrane protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 371

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 101/355 (28%), Positives = 156/355 (43%), Gaps = 80/355 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLD+ RG+ +  MILV++ G  W  +     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDVLRGITIVGMILVNNPG-TWESVYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRL 141
           V+++ AL R         + K++ RT+ L   G+ L   FS        GV+     IR+
Sbjct: 60  VSMSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLS-WFSLV----CAGVEQPFSQIRI 114

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQR+AL+Y   SL+ +  +   +               W   ++A +L+ Y+ LL  
Sbjct: 115 LGVLQRLALAYFFGSLLIMSVRRPAN-------------LAW---ISAIILIGYIVLL-- 156

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                             G  F ++           N +   DR + G  H+Y       
Sbjct: 157 ----------------ALGNGFELS---------EQNIIAVTDRTLFGETHLY------- 184

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                         R+  P      F+PEGLLS++  I   IIG   G+++        R
Sbjct: 185 --------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHR 230

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L Q   +G  LL  G  L +    PLNK++++ ++V VT G A+L    +  L+D
Sbjct: 231 LLQISILGIVLLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLFLVFLTWLID 283


>gi|189502592|ref|YP_001958309.1| hypothetical protein Aasi_1258 [Candidatus Amoebophilus asiaticus
           5a2]
 gi|189498033|gb|ACE06580.1| hypothetical protein Aasi_1258 [Candidatus Amoebophilus asiaticus
           5a2]
          Length = 380

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 104/362 (28%), Positives = 160/362 (44%), Gaps = 85/362 (23%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFI 83
            HLK QRL SLD FRGL VA MIL ++ G  W  I     HA W+GC   D + PFFLFI
Sbjct: 2   QHLK-QRLVSLDFFRGLTVAGMILANNPG-SWGHIYAPLKHAEWHGCTPTDLIFPFFLFI 59

Query: 84  VGVAIALAL---KRIPD-RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV-RM 138
           VGV+IA A+   K +P+  +  + K + R L L   GI L    +  P   T  ++  + 
Sbjct: 60  VGVSIAFAIGSKKELPETHSQLILKSVRRMLTLFCLGIFL----ALYPKIFTSPIEAFKT 115

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GVLQR A+ Y + ++  IF K                           +L + L L
Sbjct: 116 VRIPGVLQRTAIVYFISTI--IFLK----------------------FTPRTILKIMLGL 151

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
           L   ++     T +      Y           A L    N   +IDR +L   H++    
Sbjct: 152 LVSYWI---LMTFVPVPGIGY-----------ANLEQETNLAAWIDRNLLTEPHLW---- 193

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
                             K   +W     +PEG+L ++S+I + + G+  G +++  K  
Sbjct: 194 ------------------KAVRTW-----DPEGILGTISAIATGLSGILAG-ILLQRKDQ 229

Query: 319 --LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
               ++ +  + G   +I GL  +F    P+NK L+T S+V  TSG A ++ +  Y ++D
Sbjct: 230 SDTEKIARLFSSGTLAVITGLIWNFI--FPINKSLWTSSFVLYTSGLAYIILALCYWIID 287

Query: 377 CN 378
             
Sbjct: 288 VK 289


>gi|156401292|ref|XP_001639225.1| predicted protein [Nematostella vectensis]
 gi|156226352|gb|EDO47162.1| predicted protein [Nematostella vectensis]
          Length = 376

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 91/347 (26%), Positives = 157/347 (45%), Gaps = 63/347 (18%)

Query: 49  MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
           MI V+  GG +    HA WNG  +AD V P+F++I+GV+I L+ K +  R     K+  +
Sbjct: 1   MIFVNFGGGGYYFFGHAAWNGLLVADLVFPWFIWIMGVSITLSFKSLKRRKVKKWKICLK 60

Query: 109 TLK--LLFWGI-LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDV 165
            ++  L+ +G+ L    F+          D+   R+ GVLQR A  Y++++L+++F    
Sbjct: 61  VIRRSLILFGLGLFTSNFN----------DLETYRIPGVLQRFAACYIVIALMQLFLGPS 110

Query: 166 QDKDQSV---------GRFSIFRLYCWHWLMAACVLVVYLALLYGTYV---PDWQFTIIN 213
           +++ Q +            SI++     WL    +L +Y+ + Y   +   P   +T   
Sbjct: 111 EEQTQVLYPKWWDPIRDVVSIWK----QWLAMLLLLAIYVTVTYAVKLDGCPR-GYTGPG 165

Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
                Y + FN T GV            YIDRK  G  H+Y  P        T    ++ 
Sbjct: 166 GIGRGYPEAFNCTGGV----------ANYIDRKFFG-KHIYQWP--------TVKQLYKT 206

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
            L          P EPEG L +++SI    +GV  G ++   +    R+ +W+  G  L 
Sbjct: 207 KL----------PHEPEGFLGTLTSIFLVFLGVQAGRILHTYRKSTERITRWLAWGVFLG 256

Query: 334 IFGLTL----HFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           + G+ L         +P+NK L+++S++ VT  ++  + +  Y   D
Sbjct: 257 LIGVGLCKASENEGVVPINKNLWSVSFILVTGSSSFFLLTFCYIFTD 303


>gi|307610361|emb|CBW99930.1| hypothetical protein LPW_16871 [Legionella pneumophila 130b]
          Length = 372

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 151/352 (42%), Gaps = 82/352 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDRADAV---KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R +       +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKEKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+     +H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG LS+ +SI +T+ GV  G ++I+      +   
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF-- 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ++  G  LL   L   +  + P+NK L+T SYV  TSG A L F+  Y L+D
Sbjct: 229 YLLAGVGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLAFAFCYLLID 280


>gi|146302702|ref|YP_001197293.1| hypothetical protein Fjoh_4975 [Flavobacterium johnsoniae UW101]
 gi|146157120|gb|ABQ07974.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
          Length = 423

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 109/397 (27%), Positives = 165/397 (41%), Gaps = 126/397 (31%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL SLD+FRGL + LM +V++ G DW    P + HA W+GC   D V PFF+FI+GVA+
Sbjct: 4   ERLISLDVFRGLTILLMTIVNNPG-DWGNVYPPLLHAEWHGCTPTDLVFPFFIFIMGVAV 62

Query: 89  ALALKRIPDR---ADAVKKVIFRTLKLLFWGIL--------------------------- 118
            LA+   PD+   +    K++ R+L++L  GI                            
Sbjct: 63  PLAM---PDKFYDSTTFNKILVRSLRMLCLGIFFNFFGKIQLFGLEGIPLLIGRLAITIA 119

Query: 119 ----LQGGFSHAPDE------------LTY-GVDVRM-IRLCGVLQRIALSYLLVSLVEI 160
               L G FS                 L Y G++    +RL GVLQRIA+ Y +VSL  +
Sbjct: 120 VGYALMGSFSSKVKNILAFSILFIYLFLAYSGIEAYHDVRLPGVLQRIAIVYFVVSL--L 177

Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG 220
           + K  Q      G                 +L+ Y A++    VP               
Sbjct: 178 YLKTSQRTQIITG---------------IVLLLGYWAIMTLIPVP--------------- 207

Query: 221 KVFNVTCGV-RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDA 279
                  G+  A L    N   ++D  +L   HMY                      +  
Sbjct: 208 -------GIGEANLEKGTNLASWLDSVLLK-GHMY----------------------RGT 237

Query: 280 PSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
            +W     +PEG+LS++ SI++ IIG+  G V+      + + ++    G  L+ FGL  
Sbjct: 238 ITW-----DPEGILSTLPSIVNGIIGLLIGQVLQRDTTKILKAQKMGIAGTILIFFGLMW 292

Query: 340 HFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
                 P+NK L+T SYV  T+G A +  + +Y  +D
Sbjct: 293 DL--VFPINKSLWTSSYVLYTTGLATVFLTILYYTID 327


>gi|212695334|ref|ZP_03303462.1| hypothetical protein BACDOR_04881, partial [Bacteroides dorei DSM
           17855]
 gi|212662113|gb|EEB22687.1| hypothetical protein BACDOR_04881, partial [Bacteroides dorei DSM
           17855]
          Length = 284

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 95/363 (26%), Positives = 153/363 (42%), Gaps = 90/363 (24%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
            E +    Q+   +K +RL SLD  RG+ VA MILV++AGG   +  + H+ WNG    D
Sbjct: 4   EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62

Query: 75  FVMPFFLFIVGVAIALALKRIPDRA--DAVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
            V PFFLFI+G++  ++L +         V K++ RT  +L   W I   G F H    D
Sbjct: 63  LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
            L +      +R+ GVLQRIAL Y ++S   +F                           
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
                         ++P   F ++   +        + C          N +  IDR++ 
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  H+Y                              +P +PEG +S++S+I  T IG   
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSC 223

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G  II +     ++ +    GF L+  G  L   + +PLNK++++ ++V VT GAA++  
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL--ADVLPLNKRIWSPTFVLVTCGAASMSL 281

Query: 369 SAI 371
           + +
Sbjct: 282 ATL 284


>gi|242062186|ref|XP_002452382.1| hypothetical protein SORBIDRAFT_04g024716 [Sorghum bicolor]
 gi|241932213|gb|EES05358.1| hypothetical protein SORBIDRAFT_04g024716 [Sorghum bicolor]
          Length = 108

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 47/97 (48%), Positives = 69/97 (71%), Gaps = 1/97 (1%)

Query: 279 APSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT 338
           APSWC APF+PEGLLSSV +I++ +IG+ FGH+IIH + H  R+  W+ + F++L     
Sbjct: 13  APSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHRGRITNWLILSFSMLALAFL 72

Query: 339 LHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
           + F+  + ++K LYT+SY   T+G A L+F+ IYALV
Sbjct: 73  MDFSG-MRMSKPLYTMSYTLATAGTAGLLFAGIYALV 108


>gi|372268395|ref|ZP_09504443.1| hypothetical protein AlS89_10850 [Alteromonas sp. S89]
          Length = 395

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 96/352 (27%), Positives = 163/352 (46%), Gaps = 69/352 (19%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL S+D+ RG+A+A M+LV++ G     +  ++HA W+G    D + P FLF+VGV++ L
Sbjct: 16  RLMSVDLLRGIAIAAMVLVNNPGSWSFVYAPMAHAQWHGWTPTDVIFPLFLFVVGVSMVL 75

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM--IRLCGVLQRI 148
           +  +  D          R LKL   G+ L   F +  D     ++ R+  IR  GVLQRI
Sbjct: 76  STGKRGDFPPVGWAQWSRALKLFALGLFLAIFFYNFRDASYNWIEDRLEGIRWMGVLQRI 135

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           AL Y+L                          Y   WL A  +LV   A +  + VP W 
Sbjct: 136 ALVYILCC------------------------YLVRWLPAKGLLV---AAILCSVVP-WT 167

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
             ++    +  G+VF      + +L    +   ++D+ +LG  H+Y+  A          
Sbjct: 168 LMLVVPYQSASGEVF------QGQLAFGNHFAAWLDQWLLGSAHVYYRDA---------- 211

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV----HFGHVIIHTKGHLARLKQ 324
            PF               F+PEG+L++ S+  + ++GV     +     + +  L   + 
Sbjct: 212 QPFA--------------FDPEGVLTTFSAASTCLLGVLAALAWKSADSNGEAQLRLCRN 257

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           W+  G  +++ G  +H  + +P+NK L++ S+V VT+G + L+ + +Y LVD
Sbjct: 258 WLVAGTLMVLVGQLMH--SIVPINKALWSPSFVLVTAGVSLLLMAGLYYLVD 307


>gi|386819709|ref|ZP_10106925.1| hypothetical protein JoomaDRAFT_1633 [Joostella marina DSM 19592]
 gi|386424815|gb|EIJ38645.1| hypothetical protein JoomaDRAFT_1633 [Joostella marina DSM 19592]
          Length = 366

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 156/358 (43%), Gaps = 83/358 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
           +R  SLD+FRGL + LMI+V+  G  W  I     HAPWNG  L D V P FLF+VG A+
Sbjct: 6   ERYLSLDVFRGLTLFLMIIVNTPG-SWSFIYKPLHHAPWNGFTLTDLVFPTFLFVVGNAM 64

Query: 89  ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           + +LK+  +  +   +KKV  RT  +   G LL   F    D       +   R+ GVLQ
Sbjct: 65  SFSLKKFEEIGNTAFLKKVFKRTFLIFLIGFLLY-WFPFFKDGAL--KPISETRIFGVLQ 121

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RIAL Y   +L+  +      K +    FS+  L  +H ++          LL+G     
Sbjct: 122 RIALCYCFAALILHYW-----KPKGALIFSVIALVGYHIIL----------LLFG----- 161

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                                     L    NA    D  ++G +HMY    +       
Sbjct: 162 -------------------------DLTMQGNAAIKADLWLIGSSHMYKGEGF------- 189

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                              PF+PEG+LS++ +I++ I G +F  V +  KG        +
Sbjct: 190 -------------------PFDPEGVLSTLPAIVNVIAG-YFAGVFLQQKGKTYEAIAKL 229

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           TM   +LIF L L +   +P+NK+L+T SYV +T G   +V S +  +++      WT
Sbjct: 230 TMVGGVLIF-LALGWDLLLPINKKLWTSSYVLLTVGIDVMVLSILVFVIEILKKRSWT 286


>gi|374312698|ref|YP_005059128.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358754708|gb|AEU38098.1| protein of unknown function DUF1624 [Granulicella mallensis
           MP5ACTX8]
          Length = 383

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 163/363 (44%), Gaps = 85/363 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           + RL S+D+ RGL VA MILV++ G +   +  ++H+ WNG    D V P FLFI+G+++
Sbjct: 15  SSRLLSIDVLRGLTVAFMILVNNNGNNDLAYRALNHSLWNGFTPTDLVFPTFLFIMGISM 74

Query: 89  ALALKRIPDRADAVKKVIF------RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
            L+      RA A    +       R   L+F+G+++ G      D L         R+ 
Sbjct: 75  VLSFS--AHRAKATSNTVMLTSIGRRFALLIFFGLVVNGFPYFHLDTL---------RIY 123

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIA+ YLL +L+++ T  +  +   V  F  F     +W++   V V       G 
Sbjct: 124 GVLQRIAVCYLLAALLQLVTDRIAPR---VVLF--FAAVIGYWVLLRFVPVP------GH 172

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            +P   F +++ D                      N V ++DR      H++ H  + ++
Sbjct: 173 GIPGRDFPLLDHD---------------------INLVAWLDR------HIFPHRLFEKT 205

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
           +                        +PEGLLS + +  STI+G+  G  I   +    +L
Sbjct: 206 R------------------------DPEGLLSDIPAFASTILGLLAGAWIKQARPAGQKL 241

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC-NIFG 381
                 G AL + GL   ++ + P+NK+L+T SYV    G + L+ +  Y +++   + G
Sbjct: 242 MGLFGAGIALAVAGLL--WSQSFPINKKLWTSSYVLYAGGLSILLLAVAYYVIEIRQLRG 299

Query: 382 PWT 384
            WT
Sbjct: 300 RWT 302


>gi|163786877|ref|ZP_02181325.1| hypothetical protein FBALC1_16867 [Flavobacteriales bacterium
           ALC-1]
 gi|159878737|gb|EDP72793.1| hypothetical protein FBALC1_16867 [Flavobacteriales bacterium
           ALC-1]
          Length = 361

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 101/349 (28%), Positives = 147/349 (42%), Gaps = 82/349 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           + R+ S+DI RGL +  MILV+  G     +  + HA W+G    D + PFFLFIVG++I
Sbjct: 2   SARIESVDILRGLTILAMILVNTPGTWGHVYTPLRHAEWHGLTPTDLIFPFFLFIVGISI 61

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             A K  P+     KK+I R+LKL+  G+ L     H P    +  D    R+ GVLQRI
Sbjct: 62  YFAYKNKPNTKLTYKKIIIRSLKLIGLGLFLNLFLPHFP----FFNDFETHRIPGVLQRI 117

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC-WHWLMAACVLVVYLALLYGTYVPDW 207
            L +L                      SI  L C W  L A  + ++    L   ++P  
Sbjct: 118 GLVFLFS--------------------SILYLNCSWKSLTAIGITIILGYWLCLGFIPFQ 157

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
             ++   D A                  P N   YID  +LG  HM     W+       
Sbjct: 158 DGSLPTFDRA------------------PNNWANYIDLNILG-EHM-----WKTD----- 188

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                              ++PEGL+S++ +I + I G+  G ++      L  +K    
Sbjct: 189 -------------------YDPEGLISTIPAIATCISGILIGKLL----DGLTHIKYLFI 225

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
                LI G  L+     P NK +++ S+V VTSG A L+ + IY L D
Sbjct: 226 TALGFLISGYILNI--WFPTNKAIWSSSFVLVTSGWATLILAVIYYLKD 272


>gi|345320430|ref|XP_001516736.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase, partial
           [Ornithorhynchus anatinus]
          Length = 448

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 80/284 (28%), Positives = 133/284 (46%), Gaps = 42/284 (14%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           QRL SLD FRG ++ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G +I+L+L
Sbjct: 198 QRLRSLDTFRGFSLIIMVFVNYGGGKYWFFKHEGWNGLTVADLVFPWFVFIMGSSISLSL 257

Query: 93  ----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
               +R   +   + K+++R+  L   G+L+       P+     +    +R+ GVLQR+
Sbjct: 258 SSMLRRGYSKWRLLWKILWRSFLLFLIGVLIVN-----PNYCLGPLSWDKLRIPGVLQRL 312

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL----YCWHWLMAACVLVVYLALLYGTYV 204
             +YL+V+ +E+       +  S+ R   F      Y   W+    +   +L L +   V
Sbjct: 313 GFTYLVVATLELLFAKAVPESNSLERTCSFLQEIISYWPQWIFILMLETAWLCLTFLLPV 372

Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           P      +      D+GK  N T G          A GYID  +LG NH+Y HP+     
Sbjct: 373 PGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDHLLLGENHIYQHPS----- 417

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
                         +        ++PEG+L +++SI+   +GV 
Sbjct: 418 -------------PNVLYHTKVAYDPEGILGTINSIVMAFLGVQ 448


>gi|381188372|ref|ZP_09895934.1| N-acetylglucosamine related transporter, NagX [Flavobacterium
           frigoris PS1]
 gi|379650160|gb|EIA08733.1| N-acetylglucosamine related transporter, NagX [Flavobacterium
           frigoris PS1]
          Length = 430

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 112/410 (27%), Positives = 169/410 (41%), Gaps = 130/410 (31%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +  QR+ SLD+ RG+ + +M+LV++ G  W  +     HA WNGC   D V PFF+F++G
Sbjct: 1   MTKQRIISLDVLRGITIMMMVLVNNPG-SWDNVFAPLEHANWNGCTPTDLVFPFFIFVLG 59

Query: 86  VAIALALKRIPDRADAVKKVIFRTLKLLFWGI---------------------------- 117
            AI LA+        +  K++ R+L+++  G+                            
Sbjct: 60  AAIPLAILTKELNQQSFLKILTRSLRIISLGLFLGFYGKIEIFNLVGYPLLISKLIITGI 119

Query: 118 ---LLQGGFSH-----------------APDELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
              +L G F                   A   +T   +VR   L GVLQRI + Y   SL
Sbjct: 120 VAYILLGNFKQKIKFSLVLTLFFVFVFLAFSGITAYNEVR---LPGVLQRIGIVYFFTSL 176

Query: 158 VEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA 217
           V + T   + K Q +              +   +LV Y A +    VP            
Sbjct: 177 VYLKT---EIKGQII--------------IIGLLLVGYWATMTLIPVP------------ 207

Query: 218 DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRK 277
           D+G          A LN   N  G+ID  +L  NH+     W  SK              
Sbjct: 208 DFGP---------ANLNKGTNLAGWIDNLLLK-NHL-----WSFSKT------------- 239

Query: 278 DAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM---GFALLI 334
                    ++PEG+LS++ +I S IIG+  G ++      LA+ ++ + M   G AL+I
Sbjct: 240 ---------WDPEGILSTIPAIASGIIGLLVGQLL---NSSLAKKEKGLKMFGAGLALVI 287

Query: 335 FGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
            GL   +    PLNK L+T S+V  T+G A L  +A Y  +D   +  WT
Sbjct: 288 SGLI--WNEFFPLNKSLWTSSFVLYTAGFATLFLAAFYYAIDIKGYKNWT 335


>gi|325299496|ref|YP_004259413.1| hypothetical protein [Bacteroides salanitronis DSM 18170]
 gi|324319049|gb|ADY36940.1| hypothetical protein Bacsa_2393 [Bacteroides salanitronis DSM
           18170]
          Length = 356

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 159/361 (44%), Gaps = 92/361 (25%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWP--EISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           QRL SLD+ RG+ V  MI+V++AGG++    + H+ WNG    D V PFFLFI+G++  +
Sbjct: 3   QRLLSLDVLRGITVFGMIVVNNAGGEYSYDSLRHSVWNGLTPCDLVFPFFLFIMGISTYI 62

Query: 91  ALKRIPDRAD--AVKKVIFRTLK--LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           AL++   +     ++K++ RTL   L+ WGI     F+   D L +      IR+ GVL 
Sbjct: 63  ALRKFQFQPSPAVLRKIVRRTLLIFLIGWGIY-WFEFACEGDFLPFA----HIRILGVLP 117

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RIAL Y    +V +    V+ K  +             W+    +L   L L +G     
Sbjct: 118 RIALCY---GIVSLLALYVRPKGLA-------------WIAGILLLGYALLLQWG----- 156

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                 N  + D                   N +   D KVLG  H+YH           
Sbjct: 157 ------NGYAMD-----------------STNILAIWDTKVLGYEHLYHK---------- 183

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                             +P +PEGL S++++I  TIIG   G +I+ T+    ++ +  
Sbjct: 184 ------------------SPVDPEGLTSTLAAIAHTIIGFLCGRLIMETQDLGQKIVKLF 225

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD-------CNI 379
             GF L   G  L     +PLNK++++ SY   T G AA++ + +   +D       C I
Sbjct: 226 VAGFLLAAAGFLL--VEWLPLNKRIWSPSYALATCGMAAMLQATLLYFIDAQGKKRWCRI 283

Query: 380 F 380
           F
Sbjct: 284 F 284


>gi|148359197|ref|YP_001250404.1| hypothetical protein LPC_1091 [Legionella pneumophila str. Corby]
 gi|296107241|ref|YP_003618941.1| hypothetical protein lpa_02399 [Legionella pneumophila 2300/99
           Alcoy]
 gi|148280970|gb|ABQ55058.1| conserved hypothetical protein [Legionella pneumophila str. Corby]
 gi|295649142|gb|ADG24989.1| hypothetical protein lpa_02399 [Legionella pneumophila 2300/99
           Alcoy]
          Length = 372

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 151/352 (42%), Gaps = 82/352 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI+V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIIVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R         +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+     +H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG LS+ +SI +T+ GV  G ++I+      +   
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF-- 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ++  G  LL   L   +  + P+NK L+T SYV  TSG A L F+  Y L+D
Sbjct: 229 YLLAGVGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLAFAFCYLLID 280


>gi|410613391|ref|ZP_11324450.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
           psychrophila 170]
 gi|410167053|dbj|GAC38339.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
           psychrophila 170]
          Length = 400

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 102/373 (27%), Positives = 166/373 (44%), Gaps = 74/373 (19%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
           Q    H+   RL +LD+FRG+ +  MILV++ G  W  I    +HA W+G  L D + PF
Sbjct: 10  QSIYQHVPANRLLALDVFRGMTITAMILVNNPG-SWQYIYSPLAHAKWHGWTLTDLIFPF 68

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRT-------LKLLFWGILLQGGFSHAPDELTY 132
           F+FIVGV+I+L+ +R  ++      +I          L L  +  L    FS A D   +
Sbjct: 69  FIFIVGVSISLSGQRQKEQGLGHGHIIHHALLRMFKLLLLGCFLALFYYNFS-AADYDWF 127

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
              +  +R  GVLQRIAL Y+   L+ +F   +Q             + C    M A ++
Sbjct: 128 TQRLMQMRFMGVLQRIALVYMACVLLWLFLSRLQ------------LVIC----MLAILV 171

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
             +LA+ +  Y  D               + N   G+    N   N   ++D  +    H
Sbjct: 172 AYWLAMAFIPYHDD---------------LGNQYVGLLEYAN---NLSAWLDNYLFAKTH 213

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y+  A           PF               F+PEG+LS++ +I S + GV  G  +
Sbjct: 214 LYYSSA----------QPFA--------------FDPEGVLSTLPAIASGLSGVLAGQWL 249

Query: 313 IHTKGHLARLKQWVTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
             +   +    +W+ + G   L+ G    + + +P+NK L+T SYV +TSG AAL+ + +
Sbjct: 250 SFSHHSMRHKAKWLAICGVVALLLGQM--WAHWLPINKALWTSSYVLLTSGYAALILAGL 307

Query: 372 YALVDCNIFGPWT 384
             ++D   +  W 
Sbjct: 308 MYVLDIKQWRLWA 320


>gi|223940501|ref|ZP_03632350.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223890825|gb|EEF57337.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 410

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 106/391 (27%), Positives = 168/391 (42%), Gaps = 91/391 (23%)

Query: 15  IISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMI----LVDHAGGDWP---------E 61
           I+S P  S  Q  S   T+RL SLD  RG  +  ++    LV      WP         +
Sbjct: 4   ILSPPLQSKPQVTSPSTTKRLLSLDALRGFDMFWIVGGEELVHALYNAWPNGPLGIINSQ 63

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGIL 118
           + H  W G    D + P F+FIVGV++  +L +   +  +A A+K+V FR+L L  +G+L
Sbjct: 64  MDHKVWQGVAFYDLIFPLFVFIVGVSLVFSLTKAIEVNGKAAALKRVFFRSLLLYVFGLL 123

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
           + GG S   D          IR  GVLQRIA+ Y   SLV                F  F
Sbjct: 124 IYGGISKGIDG---------IRWMGVLQRIAICYFSTSLV----------------FCFF 158

Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV-------------FNV 225
           +L     + AA +L+ Y AL+  T+VP   F  +   SA   ++              + 
Sbjct: 159 KLRG-MIVAAAALLLTYWALM--TFVP---FPDVRPASASPQEITKHNGFTNVAQLNLSS 212

Query: 226 TCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHA 285
           T  +  +  P  N   Y+D+K L        P ++                     W   
Sbjct: 213 TTMLHGQFIPGVNLANYVDQKYL--------PGYK---------------------W-DG 242

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
            ++PEGLLS++ +I++ ++GV F  +++       + K  +  G  +    L   +    
Sbjct: 243 TYDPEGLLSTLPAIVTCLLGV-FAGLLLRNPNVPDQKKVLLLAGAGIAGVALGFLWGLEF 301

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           P+ K+L+T SYV V  G A +  +A Y +++
Sbjct: 302 PVIKKLWTSSYVLVAGGYACIFLAAFYQVIE 332


>gi|397667386|ref|YP_006508923.1| putative Heparan-alpha-glucosaminide N-acetyltransferase
           [Legionella pneumophila subsp. pneumophila]
 gi|395130797|emb|CCD09044.1| putative Heparan-alpha-glucosaminide N-acetyltransferase
           [Legionella pneumophila subsp. pneumophila]
          Length = 372

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 150/352 (42%), Gaps = 82/352 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R         +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+     +H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG LS+ +SI +T+ GV  G ++I+      +   
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF-- 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ++  G  LL   L   +  + P+NK L+T SYV  TSG A L F+  Y L+D
Sbjct: 229 YLLAGVGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLAFAFCYLLID 280


>gi|397664114|ref|YP_006505652.1| Putative heparan-alpha-glucosaminide N-acetyltransferase
           [Legionella pneumophila subsp. pneumophila]
 gi|395127525|emb|CCD05722.1| Putative heparan-alpha-glucosaminide N-acetyltransferase
           [Legionella pneumophila subsp. pneumophila]
          Length = 372

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 150/352 (42%), Gaps = 82/352 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R         +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+     +H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG LS+ +SI +T+ GV  G ++I+      +   
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF-- 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ++  G  LL   L   +  + P+NK L+T SYV  TSG A L F+  Y L+D
Sbjct: 229 YLLAGGGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLAFAFCYLLID 280


>gi|54294550|ref|YP_126965.1| hypothetical protein lpl1626 [Legionella pneumophila str. Lens]
 gi|53754382|emb|CAH15866.1| hypothetical protein lpl1626 [Legionella pneumophila str. Lens]
          Length = 372

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 150/352 (42%), Gaps = 82/352 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDRADAV---KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R +       +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKEKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+      H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSAPHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG LS+ +SI +T+ GV  G ++I+      +   
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF-- 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ++  G  LL   L   +  + P+NK L+T SYV  TSG A L F+  Y L+D
Sbjct: 229 YLLAGVGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLAFAFCYLLID 280


>gi|336412607|ref|ZP_08592960.1| hypothetical protein HMPREF1017_00068 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942653|gb|EGN04495.1| hypothetical protein HMPREF1017_00068 [Bacteroides ovatus
           3_8_47FAA]
          Length = 361

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 96/336 (28%), Positives = 158/336 (47%), Gaps = 80/336 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLD+ RG+ VA MILV++ G    ++   +HA W+G + AD V P F+F++G++ 
Sbjct: 4   NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L +   +   A+ K+I R+L L+F G++++  F  + D   Y  D+  +RL GV+QR
Sbjct: 64  YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
           + + Y + +L+ +    +  K     RF           +A  +L+VY    L+G     
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLIVYFIFQLFGNGF-- 162

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                  + S D                   N VG +D  +LG NHMY            
Sbjct: 163 -------EKSVD-------------------NIVGIVDSAILGSNHMY-----------L 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
           Q   F                +PEG+LS++ ++   +IG   G +II  K +  R+    
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            +G  LL  G  L +  A PLNK+L++ S+V +T G
Sbjct: 231 LIGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|300770061|ref|ZP_07079940.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300762537|gb|EFK59354.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 404

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/379 (25%), Positives = 159/379 (41%), Gaps = 108/379 (28%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           QR  SLD+FRG  VALMI+V++ G  W      + HA W+GC   D V PFFLF VG A+
Sbjct: 3   QRYYSLDVFRGATVALMIMVNNPG-SWGHMFAPLKHAEWHGCTPTDLVFPFFLFAVGNAM 61

Query: 89  ALALKRIPDRADAV--KKVIFRTLKLLFWGILLQGG--FSHAPDEL-----TYGVD-VRM 138
           +  + R+ +   AV  +KV+ RT+ +   G+ +        A D L     +Y  D +R 
Sbjct: 62  SFVIPRLQEAGPAVFWQKVLKRTVLIFLIGLFINWWPFVQWAQDTLVFKQWSYADDSMRG 121

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GVLQRIAL+Y   S++  + ++              +   W   ++  +LVVY A+
Sbjct: 122 VRILGVLQRIALAYCFASIIAYYFRE--------------KAIIW---ISTFILVVYWAV 164

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI----DRKVLGINHMY 254
                                       C        P +  G+     D ++LG+ H+Y
Sbjct: 165 ----------------------------CAFLGTPGDPYSLQGWFGTAYDIQILGVAHVY 196

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                            EG            PF+PEGL+S++ +I+  ++G +     I 
Sbjct: 197 KG---------------EG-----------VPFDPEGLMSTLPAIVQVVLG-YLAGTYIK 229

Query: 315 TKG---------------HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
            +G               H   L      GF L++  L   ++   P+NK+++T SYV  
Sbjct: 230 KQGQVDWLWKKVPASQEPHFKLLSGLFVTGFILVV--LAWVWSLGFPINKKIWTSSYVLY 287

Query: 360 TSGAAALVFSAIYALVDCN 378
           T+G   +    +   ++  
Sbjct: 288 TTGLGVMTIGGMIWFIEVQ 306


>gi|227538516|ref|ZP_03968565.1| transmembrane protein [Sphingobacterium spiritivorum ATCC 33300]
 gi|227241435|gb|EEI91450.1| transmembrane protein [Sphingobacterium spiritivorum ATCC 33300]
          Length = 404

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/379 (25%), Positives = 159/379 (41%), Gaps = 108/379 (28%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           QR  SLD+FRG  VALMI+V++ G  W      + HA W+GC   D V PFFLF VG A+
Sbjct: 3   QRYYSLDVFRGATVALMIMVNNPG-SWGHMFAPLKHAEWHGCTPTDLVFPFFLFAVGNAM 61

Query: 89  ALALKRIPDRADAV--KKVIFRTLKLLFWGILLQGG--FSHAPDEL-----TYGVD-VRM 138
           +  + R+ +   AV  +KV+ RT+ +   G+ +        A D L     +Y  D +R 
Sbjct: 62  SFVIPRLQEAGPAVFWQKVLKRTVLIFLIGLFINWWPFVQWAQDTLVFKQWSYADDPMRG 121

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GVLQRIAL+Y   S++  + ++              +   W   ++  +LVVY A+
Sbjct: 122 VRILGVLQRIALAYCFASIIAYYFRE--------------KAIIW---ISTFILVVYWAV 164

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI----DRKVLGINHMY 254
                                       C        P +  G+     D ++LG+ H+Y
Sbjct: 165 ----------------------------CAFLGTPGDPYSLQGWFGTAYDIQILGVAHVY 196

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                            EG            PF+PEGL+S++ +I+  ++G +     I 
Sbjct: 197 KG---------------EG-----------VPFDPEGLMSTLPAIVQVVLG-YLAGTYIK 229

Query: 315 TKG---------------HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
            +G               H   L      GF L++  L   ++   P+NK+++T SYV  
Sbjct: 230 KQGEVDWLWKKVPASQEPHFKLLSGLFVTGFILVV--LAWVWSLGFPINKKIWTSSYVLY 287

Query: 360 TSGAAALVFSAIYALVDCN 378
           T+G   +    +   ++  
Sbjct: 288 TTGLGIMTIGGMIWFIEVQ 306


>gi|52841889|ref|YP_095688.1| hypothetical protein lpg1661 [Legionella pneumophila subsp.
           pneumophila str. Philadelphia 1]
 gi|378777523|ref|YP_005185961.1| hypothetical protein lp12_1599 [Legionella pneumophila subsp.
           pneumophila ATCC 43290]
 gi|52629000|gb|AAU27741.1| hypothetical protein lpg1661 [Legionella pneumophila subsp.
           pneumophila str. Philadelphia 1]
 gi|364508338|gb|AEW51862.1| hypothetical protein lp12_1599 [Legionella pneumophila subsp.
           pneumophila ATCC 43290]
          Length = 372

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 149/352 (42%), Gaps = 82/352 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           LK QRL SLD+FRG+ + LMI V+  A  D +P   H  WNGC LAD V PFFLFIVG+ 
Sbjct: 6   LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65

Query: 88  IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
             ++LK   +R         +I R++                 +   + ++   IR+ G+
Sbjct: 66  SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ YL+ + + +   +   K Q                       ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
              Q  +       YG           +L    + V Y D+     +H+Y          
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 ++PEG LS+ +SI +T+ GV  G ++I       +   
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLISPCNQFKKF-- 228

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           ++  G  LL   L   +  + P+NK L+T SYV  TSG A L F+  Y L+D
Sbjct: 229 YLLAGIGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLAFAFCYLLID 280


>gi|423289836|ref|ZP_17268686.1| hypothetical protein HMPREF1069_03729 [Bacteroides ovatus
           CL02T12C04]
 gi|392666578|gb|EIY60091.1| hypothetical protein HMPREF1069_03729 [Bacteroides ovatus
           CL02T12C04]
          Length = 371

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 103/355 (29%), Positives = 155/355 (43%), Gaps = 80/355 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLDI RG+ +  MILV++ G  W  I     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRL 141
           V+++ AL R         + K++ RT+ L   G+ L   FS        GV+     IR+
Sbjct: 60  VSMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLS-WFSLVCT----GVEQPFSHIRI 114

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQR+AL+Y   SL+ +  +   +               W   ++  +L  Y  LL  
Sbjct: 115 LGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILAGYSILL-- 156

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                             G  F ++           N +   DR + G  H+Y       
Sbjct: 157 ----------------ALGHGFELS---------EQNIIAVTDRTLFGEAHLY------- 184

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                         R+  P      F+PEGLLS++  I   IIG   G+++        R
Sbjct: 185 --------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHR 230

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L Q   +G ALL  G  L +    PLNK++++ ++V VT G A+L+   +  L+D
Sbjct: 231 LLQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLLLVFLTWLID 283


>gi|146302547|ref|YP_001197138.1| hypothetical protein Fjoh_4820 [Flavobacterium johnsoniae UW101]
 gi|146156965|gb|ABQ07819.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
          Length = 423

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 103/402 (25%), Positives = 163/402 (40%), Gaps = 116/402 (28%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +  +RL SLD+FRG  + LM +V++ G     +P + HA W+GC   D V PFF+FI+G 
Sbjct: 1   MTKERLTSLDVFRGFTILLMTIVNNPGSWSSIYPPLEHAEWHGCTPTDLVFPFFVFIMGT 60

Query: 87  AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-------------------------- 120
           AI  A+           K++ R+L++   G+ L                           
Sbjct: 61  AIPFAMPVKHFDGAVFNKILVRSLRIFCLGLFLSVFSRIHLFGLEGIPLLGVRLVIAFAV 120

Query: 121 -----GGFSHAPDE-LTYGVDVRMI-------------RLCGVLQRIALSYLLVSLVEIF 161
                G FS      L  G+ + ++             R+ GVLQRIA+ Y   S++ + 
Sbjct: 121 AYALLGNFSMKVKTILAVGIFIILLSLAFSGLEHFEDTRIPGVLQRIAIVYFFASILYLK 180

Query: 162 TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGK 221
           T     K Q              W++A+ +++ YL +    +VP   F   N D      
Sbjct: 181 T---NLKTQ-------------LWVVASILVIYYLLM---AFVPVPGFGPANFDKG---- 217

Query: 222 VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
                           N   ++D  VL   H+     W  SK                 +
Sbjct: 218 ---------------TNLAAWLDNLVLN-GHL-----WSVSK-----------------T 239

Query: 282 WCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
           W     +PEG+LS++ +I + I+G++ G ++         LK+    G  LLI GL  + 
Sbjct: 240 W-----DPEGILSTLPAIGTGILGMYIGQLLNLQTNRTEILKKTAVTGVILLIGGLLWNI 294

Query: 342 TNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
               P+NK L+T SYV  T+G A L  S +Y ++D   +  W
Sbjct: 295 I--FPINKSLWTSSYVLYTAGIATLCLSLLYYIIDIQGYKKW 334


>gi|326800650|ref|YP_004318469.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326551414|gb|ADZ79799.1| hypothetical protein Sph21_3257 [Sphingobacterium sp. 21]
          Length = 396

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 108/354 (30%), Positives = 155/354 (43%), Gaps = 92/354 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            QR  SLD+FRG  VALMILV++ G     +  + HAPW+GC   D V PFFLF VG A+
Sbjct: 2   NQRYYSLDVFRGATVALMILVNNPGSWSYAFSPLKHAPWHGCTPTDLVFPFFLFAVGNAM 61

Query: 89  ALALKRIPDRADAV--KKVIFRTLKLLF------WGILLQGGFSHAPDELTYGVDV---- 136
           +  + R+  +A  V  KKV+ RT+ +        W   +Q  +S+      Y ++     
Sbjct: 62  SFVIPRLRTQAGKVFWKKVLKRTILIFLIGLLLNWYPFVQ--WSNDTLLFKYWINPIKSD 119

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             IR+ GVLQRIAL Y   S++  F      K ++V   SIF L   +WL+  C+L+   
Sbjct: 120 SGIRILGVLQRIALCYCFASILVYFF-----KTKTVVLISIFILLS-YWLI--CILL--- 168

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
                             DS  Y        G +            ID  +L I HMY  
Sbjct: 169 -----------------GDSDPYS--LQGWFGTK------------IDVSILQIAHMYKG 197

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI---- 312
                          EG            PFEPEG+ S+ ++++  +IG   G  I    
Sbjct: 198 ---------------EG-----------VPFEPEGIASTFTAVIQVVIGFLVGQYIQTGT 231

Query: 313 --IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
             I     + R    + +  ALL  G  + +  A P+NK+++T SYV  +SG A
Sbjct: 232 KAIEGPMLIYRTVSTLMVTAALLTLG-GMTWGLAFPINKKIWTSSYVLYSSGLA 284


>gi|336416828|ref|ZP_08597160.1| hypothetical protein HMPREF1017_04268 [Bacteroides ovatus
           3_8_47FAA]
 gi|335937266|gb|EGM99170.1| hypothetical protein HMPREF1017_04268 [Bacteroides ovatus
           3_8_47FAA]
          Length = 371

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 103/355 (29%), Positives = 155/355 (43%), Gaps = 80/355 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLDI RG+ +  MILV++ G  W  I     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRL 141
           V+++ AL R         + K++ RT+ L   G+ L   FS        GV+     IR+
Sbjct: 60  VSMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLS-WFSLVCT----GVEQPFSHIRI 114

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQR+AL+Y   SL+ +  +   +               W   ++  +L  Y  LL  
Sbjct: 115 LGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILAGYSILL-- 156

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                             G  F ++           N +   DR + G  H+Y       
Sbjct: 157 ----------------ALGHGFELS---------EQNIIAVTDRTLFGEAHLY------- 184

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                         R+  P      F+PEGLLS++  I   IIG   G+++        R
Sbjct: 185 --------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHR 230

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L Q   +G ALL  G  L +    PLNK++++ ++V VT G A+L+   +  L+D
Sbjct: 231 LLQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLLLVFLTWLID 283


>gi|160883406|ref|ZP_02064409.1| hypothetical protein BACOVA_01375 [Bacteroides ovatus ATCC 8483]
 gi|156111126|gb|EDO12871.1| hypothetical protein BACOVA_01375 [Bacteroides ovatus ATCC 8483]
          Length = 371

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 103/355 (29%), Positives = 155/355 (43%), Gaps = 80/355 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLDI RG+ +  MILV++ G  W  I     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRL 141
           V+++ AL R         + K++ RT+ L   G+ L   FS        GV+     IR+
Sbjct: 60  VSMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLS-WFSLVCT----GVEQPFSHIRI 114

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQR+AL+Y   SL+ +  +   +               W   ++  +L  Y  LL  
Sbjct: 115 LGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILAGYSILL-- 156

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                             G  F ++           N +   DR + G  H+Y       
Sbjct: 157 ----------------ALGHGFELS---------EQNIIAVTDRTLFGEAHLY------- 184

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                         R+  P      F+PEGLLS++  I   IIG   G+++        R
Sbjct: 185 --------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHR 230

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L Q   +G ALL  G  L +    PLNK++++ ++V VT G A+L+   +  L+D
Sbjct: 231 LLQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLLLVFLTWLID 283


>gi|383111974|ref|ZP_09932776.1| hypothetical protein BSGG_3641 [Bacteroides sp. D2]
 gi|423296747|ref|ZP_17274817.1| hypothetical protein HMPREF1070_03482 [Bacteroides ovatus
           CL03T12C18]
 gi|313696106|gb|EFS32941.1| hypothetical protein BSGG_3641 [Bacteroides sp. D2]
 gi|392669124|gb|EIY62615.1| hypothetical protein HMPREF1070_03482 [Bacteroides ovatus
           CL03T12C18]
          Length = 371

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 103/355 (29%), Positives = 155/355 (43%), Gaps = 80/355 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLDI RG+ +  MILV++ G  W  I     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRL 141
           V+++ AL R         + K++ RT+ L   G+ L   FS        GV+     IR+
Sbjct: 60  VSMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLS-WFSLVCT----GVEQPFSHIRI 114

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQR+AL+Y   SL+ +  +   +               W   ++  +L  Y  LL  
Sbjct: 115 LGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILAGYSILL-- 156

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                             G  F ++           N +   DR + G  H+Y       
Sbjct: 157 ----------------ALGHGFELS---------EQNIIAVTDRTLFGEAHLY------- 184

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                         R+  P      F+PEGLLS++  I   IIG   G+++        R
Sbjct: 185 --------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHR 230

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           L Q   +G ALL  G  L +    PLNK++++ ++V VT G A+L+   +  L+D
Sbjct: 231 LLQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLLLVFLTWLID 283


>gi|433652541|ref|YP_007296395.1| hypothetical protein Prede_1583 [Prevotella dentalis DSM 3688]
 gi|433303074|gb|AGB28889.1| hypothetical protein Prede_1583 [Prevotella dentalis DSM 3688]
          Length = 394

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 91/345 (26%), Positives = 151/345 (43%), Gaps = 81/345 (23%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVG 85
           + + +QRL SLD+ RGL V LMI V++  G+  + ++ H+ WNG  L D V PFFLFI+G
Sbjct: 7   TRMSSQRLISLDVLRGLTVMLMIFVNNGAGEQIFAQLQHSRWNGMTLCDLVFPFFLFIMG 66

Query: 86  VAIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           V+  L+L++      A   +K+  RTL L   G+ +   F  A     +  D+  +R+ G
Sbjct: 67  VSTYLSLRKTQFVWSARLGRKIARRTLLLFVIGLAIN-WFDMACSGRPF--DLSHLRIMG 123

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           V+QRIAL Y   +L+ +                     C  WL                 
Sbjct: 124 VMQRIALCYGATALIAV--------------------GCQRWLH---------------- 147

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
             D++       +        +  G     +   N +  +D+ VLG  H+YH        
Sbjct: 148 --DFRAMPAIIAALLGAYGALLLMGQGYAYDAAINLLSRVDQAVLGHAHLYH-------- 197

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII----HTKGHL 319
                                +P +PEGL+S+++++  T+ G +  H  +       G  
Sbjct: 198 --------------------KSPVDPEGLVSTLAAVAHTLAGFYVAHWALGPGRDGAGPA 237

Query: 320 AR--LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
           AR  + ++   G  L + GL L  +  +PLNK++++ SYVC++ G
Sbjct: 238 ARRSMMRFAAAGTVLAVVGLAL--SPLLPLNKRVWSPSYVCLSCG 280


>gi|119491291|ref|ZP_01623345.1| hypothetical protein L8106_21879 [Lyngbya sp. PCC 8106]
 gi|119453455|gb|EAW34617.1| hypothetical protein L8106_21879 [Lyngbya sp. PCC 8106]
          Length = 371

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 99/353 (28%), Positives = 157/353 (44%), Gaps = 83/353 (23%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD+FRG+A+A MILV++ G     +P + HA W+G    D V P FLFIVGVA+  
Sbjct: 2   RLTSLDVFRGIAIASMILVNNPGSWNHVYPLLKHAEWHGYTPTDLVFPSFLFIVGVAMTF 61

Query: 91  AL-KRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ K +P+  +  +    K+  R L+      LL    +  P+      D+  IR+ GVL
Sbjct: 62  SMSKYLPENRNLEENISPKIYLRILRRCLILFLLGLLLNGYPN-----YDLANIRIMGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI+L+Y L ++  +     Q    S+G                 +L+ Y  ++    VP
Sbjct: 117 QRISLAYGLSAITILHLSRKQIWGLSIG-----------------LLIGYAVVMQLIPVP 159

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           +                     GV   L P  N   Y+DR +LG +H+            
Sbjct: 160 N--------------------SGV-VNLTPEGNFAAYLDRLILGEHHLL----------- 187

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                ++PEGLLS++ ++++ +IG   G+ +     +       
Sbjct: 188 -----------------GGGKYDPEGLLSTLPAVVTVLIGYLTGNWLKKQPINSQTSLNL 230

Query: 326 VTMGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           V +G   +I G   H    I P+NK L+T SYV VT+G A ++ +A Y L++ 
Sbjct: 231 VIIGLCNIIVG---HLWGLIFPINKSLWTSSYVLVTAGWALVLLAACYELIEV 280


>gi|410631381|ref|ZP_11342056.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
           BSs20135]
 gi|410148827|dbj|GAC18923.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
           BSs20135]
          Length = 330

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 103/371 (27%), Positives = 165/371 (44%), Gaps = 82/371 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
            RL +LD+FRG+ +  MILV++ G  W  I     HA W+G  L D + PFF+FIVGV+I
Sbjct: 21  NRLLALDVFRGITITAMILVNNPG-SWQHIYGPMRHAQWHGWTLTDLIFPFFIFIVGVSI 79

Query: 89  ALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM--IRLC 142
            L+ +R+      R+  +K+ + RT KL+  G  L   +     E    V+ R+  IR  
Sbjct: 80  QLSGQRMLASGTSRSSIIKQALLRTFKLVLLGWFLALFYYDFGAEHYNWVEQRLLNIRFM 139

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIA+ Y +  L+ +F                        L+ + V ++ L  L   
Sbjct: 140 GVLQRIAVVYFICVLMWLFLSK-------------------RGLLVSFVGILLLYWLALA 180

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            +P             Y    N   G+    N   N   ++D  +    H+Y+  A    
Sbjct: 181 AIP------------YYDNAGNSYSGLLEFGN---NLSAWLDSWLFAAPHLYYSSA---- 221

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                 +PF               F+PEG+LS++ ++ S I G+  G+++  +  +  R 
Sbjct: 222 ------TPFA--------------FDPEGILSTLPAVASGISGMLVGYLLTQSSLN-TRN 260

Query: 323 KQWV--TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD---- 376
           K  V   +G   ++ G   H     P+NK L+T SYV +TS  A LV +++  ++D    
Sbjct: 261 KTIVLLVLGSIGVLLGELWH--GYFPINKALWTSSYVLLTSAYACLVLASLIFILDSKKN 318

Query: 377 ----CNIFGPW 383
               C++ G W
Sbjct: 319 HLLECSLCGFW 329


>gi|402878146|ref|XP_003902762.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Papio
           anubis]
          Length = 708

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 90/330 (27%), Positives = 151/330 (45%), Gaps = 49/330 (14%)

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFS 124
           G  +AD V P+F+FI+G +I L++  I  R  +    + K+ +R+  L+  GI++     
Sbjct: 347 GLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGIIIVN--- 403

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LY 181
             P+     +    +R+ GVLQR+ ++Y +V+++E+ F K V +   S       R    
Sbjct: 404 --PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITS 461

Query: 182 CW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNA 239
            W  WL+   +  ++L L +   VP      +      D+GK  N T G          A
Sbjct: 462 SWPQWLLILALEGLWLGLTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------A 511

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
            GYIDR +LG +H+Y HP+                            ++PEG+L +++SI
Sbjct: 512 AGYIDRLLLGDDHLYQHPS------------------STVLYHTEVAYDPEGILGTINSI 553

Query: 300 LSTIIGVHFGHVIIH----TKGHLARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTL 354
           +   +GV  G ++++    TK  L R   W   +G   ++          IP+NK L++L
Sbjct: 554 VMAFLGVQAGKILLYYKAQTKDILIRFTAWCCILGLISVVLTKVSENEGFIPVNKNLWSL 613

Query: 355 SYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           SYV   S  A  +   +Y +VD  + G WT
Sbjct: 614 SYVTTLSSFAFFILLVLYPVVD--VKGLWT 641


>gi|410096828|ref|ZP_11291813.1| hypothetical protein HMPREF1076_00991 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225445|gb|EKN18364.1| hypothetical protein HMPREF1076_00991 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 367

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 93/367 (25%), Positives = 158/367 (43%), Gaps = 90/367 (24%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           ++ RL SLD+ RG+ +A MI+V++ G     +  + HA WNG    D V PFF+FI+GV+
Sbjct: 3   QSGRLLSLDVMRGITIAGMIMVNNPGSWGYVYAPLRHASWNGLTPTDLVFPFFMFIMGVS 62

Query: 88  IALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQ-------GGFSHAPDELTYGVDVRM 138
           +  +L++   +   ++V KV+ RT+ +   G  L         GFS+  +          
Sbjct: 63  MFFSLRKYDFKLSRESVTKVLKRTVLIFLVGFALNLFGHLCYNGFSNFEN---------- 112

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GV+QR+AL+Y + SL+ +  K                        AA +L+ Y  L
Sbjct: 113 LRILGVMQRLALAYGIGSLIGLSVKHKYILQT-----------------AAGILLFYWIL 155

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
           L  T                            ++     N +  +DR + G  HMYH   
Sbjct: 156 LAAT---------------------------GSQTLSENNIIAIVDRALFGNTHMYH--- 185

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH-TKG 317
                    D   +G             F+PEGLLS + SI   ++G + G +I+   K 
Sbjct: 186 ---------DYLADGT---------RIAFDPEGLLSCLGSIGHVLLGFYVGKMILDCKKN 227

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           +   ++     G  +L  G  L +    P+NK+L++ ++V  T G  +L  + +  ++D 
Sbjct: 228 NELIIRNLFIFGTIILFLGFLLSY--GCPINKKLWSSTFVLTTCGFGSLFLALLIWIIDI 285

Query: 378 NIFGPWT 384
           N    W+
Sbjct: 286 NGKKKWS 292


>gi|340347656|ref|ZP_08670761.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Prevotella
           dentalis DSM 3688]
 gi|339608850|gb|EGQ13733.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Prevotella
           dentalis DSM 3688]
          Length = 386

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 91/343 (26%), Positives = 150/343 (43%), Gaps = 81/343 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           + +QRL SLD+ RGL V LMI V++  G+  + ++ H+ WNG  L D V PFFLFI+GV+
Sbjct: 1   MSSQRLISLDVLRGLTVMLMIFVNNGAGEQIFAQLQHSRWNGMTLCDLVFPFFLFIMGVS 60

Query: 88  IALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
             L+L++      A   +K+  RTL L   G+ +   F  A     +  D+  +R+ GV+
Sbjct: 61  TYLSLRKTQFVWSARLGRKIARRTLLLFVIGLAIN-WFDMACSGRPF--DLSHLRIMGVM 117

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL Y   +L+ +                     C  WL                   
Sbjct: 118 QRIALCYGATALIAVG--------------------CQRWLH------------------ 139

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
           D++       +        +  G     +   N +  +D+ VLG  H+YH          
Sbjct: 140 DFRAMPAIIAALLGAYGALLLMGQGYAYDAAINLLSRVDQAVLGHAHLYH---------- 189

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII----HTKGHLAR 321
                              +P +PEGL+S+++++  T+ G +  H  +       G  AR
Sbjct: 190 ------------------KSPVDPEGLVSTLAAVAHTLAGFYVAHWALGPGRDGAGPAAR 231

Query: 322 --LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
             + ++   G  L + GL L  +  +PLNK++++ SYVC++ G
Sbjct: 232 RSMMRFAAAGTVLAVVGLAL--SPLLPLNKRVWSPSYVCLSCG 272


>gi|374385780|ref|ZP_09643283.1| hypothetical protein HMPREF9449_01669 [Odoribacter laneus YIT
           12061]
 gi|373225482|gb|EHP47816.1| hypothetical protein HMPREF9449_01669 [Odoribacter laneus YIT
           12061]
          Length = 382

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 106/371 (28%), Positives = 165/371 (44%), Gaps = 91/371 (24%)

Query: 30  LKTQ-RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIV 84
           +KT+ RL +LD+FRG+ +A MILV+  G  W  +     HA WNG    D V PFF+FI+
Sbjct: 1   MKTENRLLALDVFRGITIAGMILVNDPG-SWSAVYAPLCHASWNGLTPTDLVFPFFMFIM 59

Query: 85  GVAIALALKRIPD--RADAVKKVIFRTLKLLF-------WGILLQGGF-SHAPDELTYG- 133
           G+++  +L+R        AV K IFR   L+F       W  L  G F S    E T+  
Sbjct: 60  GISMYFSLRRYNSLFSRGAVAK-IFRRAVLIFLIGLGINWFALWFGTFMSMGNGEFTFWE 118

Query: 134 ------VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
                   V  IR+ GVLQR+AL+YL  +++ +  +          R+ +F         
Sbjct: 119 RFTQNIFPVADIRILGVLQRLALAYLGGAILCLGIRP---------RYQLFT-------- 161

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
           A  +LV Y  +L                    G+ F         +    N +  +DR V
Sbjct: 162 AVMILVGYFVIL------------------VVGEGF---------IRSEHNILSVVDRAV 194

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
           LG+ H+Y                        A S     F+PEGLLS++      + GV 
Sbjct: 195 LGVRHLYG---------------------GGASSGAGMAFDPEGLLSTLPCFAHVLFGVC 233

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
            G ++   K +  R++Q    G  LL  G    ++ A P+NK++++ +YV ++ GAA+L+
Sbjct: 234 MGRMLGEVKENEIRIRQLFIFGTILLFAGYL--WSYACPVNKRIWSPTYVLISCGAASLL 291

Query: 368 FSAIYALVDCN 378
           F+ +   +D  
Sbjct: 292 FALLIYWIDVK 302


>gi|398341237|ref|ZP_10525940.1| hypothetical protein LkirsB1_18883 [Leptospira kirschneri serovar
           Bim str. 1051]
          Length = 383

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 100/349 (28%), Positives = 154/349 (44%), Gaps = 82/349 (23%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            + KS     R+ SLD+FRG+ VA MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MENKSTQNKNRVLSLDLFRGMTVAGMILVNNPG-SWSFIYSPLKHAKWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FLF VG++I L++  K    ++     +  R++ L+  G+ L      +  EL       
Sbjct: 60  FLFAVGISIQLSVYSKNKIHKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL------- 112

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
             R+ GVLQRI   Y +V+ + +                  R+    W+    +L+V+  
Sbjct: 113 --RIPGVLQRIGFVYWIVASLHLILPK--------------RMILISWI---PILLVHTW 153

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +L     P         +S  Y             L P  +   +IDR V G NH+    
Sbjct: 154 VLIQIPAPG--------ESIVY-------------LEPGKDIGAWIDRNVFGENHL---- 188

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
            W+ SK                       ++PEGL S +SSI ++++GV  G  I+ +K 
Sbjct: 189 -WKFSKT----------------------WDPEGLFSGISSIATSLLGVFCGS-ILSSKT 224

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
           +  + +     GF +L+  + L +   +P+NK L+T SYV  T+G A L
Sbjct: 225 NEIKKQILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFL 273


>gi|393785792|ref|ZP_10373938.1| hypothetical protein HMPREF1068_00218 [Bacteroides nordii
           CL02T12C05]
 gi|392661411|gb|EIY54997.1| hypothetical protein HMPREF1068_00218 [Bacteroides nordii
           CL02T12C05]
          Length = 361

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 94/335 (28%), Positives = 149/335 (44%), Gaps = 78/335 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
             RL SLD+ RG  VA MILV++AG     +  + HA W+G   AD V P F+F++G++ 
Sbjct: 4   NNRLLSLDVLRGFTVAGMILVNNAGACGYGYAPLRHAKWDGFTPADLVFPMFMFLMGIST 63

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            ++L++   +   A+ K+I R L L+  GI ++   + +  E     D   +RL GV+QR
Sbjct: 64  YISLRKYDFQWRLAIGKIIKRALLLILIGIAMKWIINSS--ETGIWTDWEHMRLLGVMQR 121

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           + + Y   +++ +F            RF                  V L LL G +    
Sbjct: 122 LGICYGATAIMALFIPH--------KRF----------------FPVALLLLAGYF---- 153

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
              I+      + K              P N +  +D  VLG NHMY            Q
Sbjct: 154 ---ILQLIGNGFEK-------------SPDNIIAIVDSTVLGTNHMY-----------LQ 186

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
              F                EPEG+LS++ +I   +IG   G +II+ K +  R+++   
Sbjct: 187 GRQFV---------------EPEGILSTIPAIAQVMIGFVCGRMIINQKDNKERMQKLFF 231

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
           +G  +L  G    F+ A PLNK+L++ S+V +T G
Sbjct: 232 LGTLMLFAGFL--FSYACPLNKRLWSPSFVLLTCG 264


>gi|116621919|ref|YP_824075.1| hypothetical protein Acid_2804 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116225081|gb|ABJ83790.1| conserved hypothetical protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 367

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 103/351 (29%), Positives = 159/351 (45%), Gaps = 86/351 (24%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD FRG  +ALM+LV++AG     + ++ H+PW+G  + D V P FL+IVGVAI L
Sbjct: 12  RLVSLDAFRGATIALMVLVNNAGSGLDSYRQLEHSPWHGWTITDTVFPSFLWIVGVAITL 71

Query: 91  AL-KRIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           +L KR+ +   R+  + +++ R   L  +G+ +   F H         D+   R+ GVLQ
Sbjct: 72  SLGKRVAEGVPRSHLLPQILRRAAILFVFGLFVY-AFPH--------FDLGTQRILGVLQ 122

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RIA+ YL  S++ +++  V+ +        I  L   +W+M   + V             
Sbjct: 123 RIAICYLAASVIFLYS-GVRGQI-----LWILGLLAAYWMMMTLIPV------------- 163

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                       YG           +L+   N   YID   LG  H YH           
Sbjct: 164 ----------PGYGP---------GRLDVEGNFAHYIDHLALG-RHNYH----------- 192

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                         +W     +PEGL+S++ +I + + GV  GH I+  +  LA    W+
Sbjct: 193 -----------STRTW-----DPEGLVSTLPAIATALFGVLAGH-ILRCRRTLAERTSWM 235

Query: 327 -TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            T G  LL  GL    T  +P+NK+L+T S+    +G    VF+    L+D
Sbjct: 236 FTAGSLLLAAGLIC--TAWLPINKKLWTDSFCLFMAGLDFTVFAFFAWLID 284


>gi|290999917|ref|XP_002682526.1| hypothetical protein NAEGRDRAFT_78070 [Naegleria gruberi]
 gi|284096153|gb|EFC49782.1| hypothetical protein NAEGRDRAFT_78070 [Naegleria gruberi]
          Length = 425

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 101/408 (24%), Positives = 164/408 (40%), Gaps = 128/408 (31%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           + +R+ +LDI RG+ + LMI+V++     +  + HA W G    D V PFFLF++G A A
Sbjct: 62  RKERMVALDIMRGMTIMLMIIVNNQPARAFIPLDHAEWFGFTPTDCVFPFFLFVMGYAAA 121

Query: 90  LALKR----------------------------IPDRADAVKKVIFRTLKLLF------- 114
           +   R                              D  D  +K    ++K ++       
Sbjct: 122 IVYSREWPSDVYLYPPSHVKLSIQSYFRELCGKKQDLMDENEKKEEESIKFMYLIPMRKS 181

Query: 115 -------WGILLQG-------GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
                  W  L +        GFS +   L +  +   +R+ GV QRIA+ Y +VSL+ +
Sbjct: 182 LYEFVSKWVKLFRRPILMFLIGFSFSV--LAHLFNFTHVRVMGVFQRIAICYFIVSLILV 239

Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV---VYLALLYGTYVPDWQFTIINKDSA 217
                                 W ++    V++   +Y+ + +G YVP        +   
Sbjct: 240 MVP-------------------WTFVQILIVVLFQAIYITVTFGLYVP-------MEGEG 273

Query: 218 DYGKVFNVTCGVRAKL-NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLR 276
           D        CG R +L  P C A GYIDR +L  +H+Y            QDS       
Sbjct: 274 D-------GCGTRGELYEPRCTAEGYIDRLILSRDHIY-----------LQDS------- 308

Query: 277 KDAPSWCHAPFEPEGLLSSVSSILSTIIGV-HFGHVIIHTKGHLARLKQWVTMGFALLIF 335
                     ++PEG LSS+S++ +  +G+  F       K    RL  W  MG  +++ 
Sbjct: 309 ----------YDPEGFLSSLSAVTNAFVGILAFKVARAAGKDAHKRLNYWFIMGSLMILA 358

Query: 336 GLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
            L + +   +P+ K+L+T S+  +TSG          AL++C I   W
Sbjct: 359 ALAIDYA-GLPIGKKLWTTSFALITSGI---------ALLECTISLDW 396


>gi|332983392|ref|YP_004464833.1| heparan-alpha-glucosaminide N-acetyltransferase [Mahella
           australiensis 50-1 BON]
 gi|332701070|gb|AEE98011.1| Heparan-alpha-glucosaminide N-acetyltransferase [Mahella
           australiensis 50-1 BON]
          Length = 368

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 92/352 (26%), Positives = 157/352 (44%), Gaps = 84/352 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R+ S+D  RG+ +  MI +++ G      P + HAPWNG  LAD   P F+F++G+ I 
Sbjct: 4   KRIQSIDALRGICITAMIFMNNPGNSKYTSPLLLHAPWNGITLADLFFPCFIFVMGMVIP 63

Query: 90  LALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           ++  +   +     ++I   LK   +LF    L G F +A        D++ +R+ GVLQ
Sbjct: 64  VSFGKRMAKGQTKGQLIAHLLKRSAMLF----LIGLFLNAFPCF----DMQHVRILGVLQ 115

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RIAL Y    L+ +F+  +                   ++++A +L+ Y  LL    VP 
Sbjct: 116 RIALVYFFSGLIFLFSSTMS-----------------MFIISAAILIGYYLLLRFVPVPG 158

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
           +   +  +                       N + YID K+L   H+Y            
Sbjct: 159 YGAGVFERTG---------------------NLIQYIDLKLLK-GHLY------------ 184

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                        P W     +PEGLLS++ +I S+++G+  G +++  K +  +L  ++
Sbjct: 185 ------------TPDW-----DPEGLLSTLPAIASSLLGILTGCLLVSDKKNTNKL--YI 225

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
            +  + L F  ++      PLNK L++ S+V  T+G A L+ S  Y L D N
Sbjct: 226 MLVCSALAFISSIITQKWFPLNKNLWSSSFVLFTTGFALLLLSVCYWLADIN 277


>gi|440791267|gb|ELR12512.1| transmembrane protein [Acanthamoeba castellanii str. Neff]
          Length = 825

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 95/405 (23%), Positives = 159/405 (39%), Gaps = 94/405 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           + R+ SLD  RGLA+A+MI V++ GG +   +H+ WNG  +AD V P+F++I+G ++A++
Sbjct: 396 SSRVNSLDAVRGLAIAIMIFVNYGGGGYWFFNHSAWNGITVADLVFPWFIWIMGTSMAIS 455

Query: 92  LKRIPDRADA-------------VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
              +  +                +  V+ R         ++       P  LTY +  R 
Sbjct: 456 FTSLEKKLLGLFQNNGYEWETWRIPGVLMRFAVAYLVVGVVVLFVPRWPWRLTYRIYRRF 515

Query: 139 IR-------LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR----------------- 174
                      G    I       S    F +D    D    R                 
Sbjct: 516 TSGGHHRHAADGAASPILERKRFASEAINFDEDTDKSDDGFSRNVFAAEDDESCENMMKD 575

Query: 175 ---FSIFR--LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCG 228
              +++F   L  W HWL+A  +L VY  + +   VP                     CG
Sbjct: 576 KWVYTLFGDILPFWPHWLVAFSLLFVYFMITFFLDVPG--------------------CG 615

Query: 229 VRAKLNPPCN-----AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC 283
            R  L P  +     A GYID+K+   +H+Y+ P  +                   P + 
Sbjct: 616 -RGYLGPDISTATGGAAGYIDKKIFTEDHIYNQPTCQ-------------------PLYL 655

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW----VTMGFALLIFGLTL 339
              ++PEG L +++SI    +G+  G  ++  K H  R+ +W    + +GF  L      
Sbjct: 656 TGSYDPEGTLGNLTSIFMVFLGLQSGRTLMAWKDHKHRVVRWYIWSIVLGFIALGLCEAK 715

Query: 340 HFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
                 PLNK L++ S++  T+  A  + +  Y L+D  +F  W+
Sbjct: 716 QNGGFFPLNKNLWSPSFILATAAMAFFLLATFYLLID--VFPIWS 758


>gi|333030942|ref|ZP_08459003.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
 gi|332741539|gb|EGJ72021.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
          Length = 385

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 157/376 (41%), Gaps = 94/376 (25%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           + ++RL SLDI RG  +  MILV++ G  W  I     HA W+G    D + PFF+F++G
Sbjct: 1   MSSKRLLSLDILRGGTIIGMILVNNPG-SWEYIYSPLRHAEWHGLTPTDLIFPFFIFVMG 59

Query: 86  VAIALALKRIPDRADAV----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM--- 138
           ++++L+  +  +         +KVI R+ KL      L G F      L  G++ R+   
Sbjct: 60  ISMSLSFSKFKNEEYNKTLFWEKVIKRSAKL-----FLLGLFLSWFSLLLEGINNRLEYE 114

Query: 139 -----------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
                      IR+ GV+QR+ALSYL+ S   +F   +      V             + 
Sbjct: 115 SISEILFPFGQIRILGVMQRLALSYLVGS---VFVMLIPKAKHLV-------------IT 158

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
           +  +L+ Y  LL                    G  F+ +           N +  +D  +
Sbjct: 159 SVILLIAYFILL------------------SLGNGFSFSSD---------NIIAIVDNSL 191

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
            G NH+Y    W                    P      F+PEGLLS++  I+  I+G  
Sbjct: 192 FGENHVYLE--W-------------------LPDGERLRFDPEGLLSTIPCIVQVIMGYL 230

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
            G VI   K  L ++     +G  LL  GL L +    PLNK++++ ++  VTSG A L 
Sbjct: 231 CGEVIRKKKDLLNKMMDLAIIGIVLLFIGLLLSY--GCPLNKKIWSPTFELVTSGFAVLA 288

Query: 368 FSAIYALVDCNIFGPW 383
            + +  ++D      W
Sbjct: 289 LTLLIWIIDYKGLKKW 304


>gi|421109691|ref|ZP_15570204.1| putative membrane protein [Leptospira kirschneri str. H2]
 gi|410005185|gb|EKO58983.1| putative membrane protein [Leptospira kirschneri str. H2]
          Length = 383

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 98/359 (27%), Positives = 154/359 (42%), Gaps = 102/359 (28%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            + KS     R+ SLD+FRG+ VA MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MENKSTQNKNRVLSLDLFRGMTVAGMILVNNPG-SWSFIYSPLKHAKWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FLF VG++I L++  K    ++     +  R++ L+  G+ L      +  EL       
Sbjct: 60  FLFAVGISIQLSVYSKNKIYKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL------- 112

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
             R+ GVLQRI   Y                                W++A+  L++   
Sbjct: 113 --RIPGVLQRIGFVY--------------------------------WIVASLHLILPKR 138

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKV 247
           ++  +++P                +  V   V  ++ PP  ++ Y          IDR V
Sbjct: 139 MILISWIP----------------ILLVHTWVLIQIPPPGESIVYLEPGKDIGAWIDRNV 182

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
            G NH+     W+ SK                       ++PEGL S +SSI ++++GV 
Sbjct: 183 FGENHL-----WKFSKT----------------------WDPEGLFSGISSIATSLLGVF 215

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            G  I+ +K +  + +     GF +L+  + L +   +P+NK L+T SYV  T+G A L
Sbjct: 216 CGS-ILSSKTNEIKKQILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFL 273


>gi|404487027|ref|ZP_11022214.1| hypothetical protein HMPREF9448_02670 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335523|gb|EJZ61992.1| hypothetical protein HMPREF9448_02670 [Barnesiella intestinihominis
           YIT 11860]
          Length = 364

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 94/350 (26%), Positives = 158/350 (45%), Gaps = 80/350 (22%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           ++RL SLD+ RG+ V  MILV++AG     +  + HA W+G   AD V P F+FI+GV+I
Sbjct: 7   SKRLVSLDVLRGITVCGMILVNNAGACGYAYAPLKHAKWDGFTPADLVFPAFMFIMGVSI 66

Query: 89  ALALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSH-APDELTYGVDVRMIRLCGVLQ 146
            L+L +   D   ++ +++ RT+ +   G+ L+   +  A  E      +  +R+ GVLQ
Sbjct: 67  YLSLNKSNFDWRISIARILRRTVLIFMSGVALKWILAFIATGEYN---TLENLRIMGVLQ 123

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           R+ + Y +V+L+ +  +                    H L    + V    LL G Y+  
Sbjct: 124 RLGICYGIVALLAVTVR--------------------HRLFPTIIAV----LLVGYYLLQ 159

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                       +G  F    G         N V  +D  VLG +HMY   A        
Sbjct: 160 -----------LFGNGFEKCAG---------NIVSMVDYAVLGKSHMYLGGA-------- 191

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                                +PEG+LS++ +I   +IG   G VI+  K   +++ +  
Sbjct: 192 ------------------QFVDPEGVLSTIPAIAQVMIGFLCGKVIVGEKEIRSQIVKLA 233

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             G ++ + G    +  A PLNK+L++ S+V VT G  +L+F+ +  ++D
Sbjct: 234 VWGTSMFVIGYLWSY--AAPLNKRLWSPSFVLVTCGITSLIFATLIYIID 281


>gi|418694540|ref|ZP_13255577.1| putative membrane protein [Leptospira kirschneri str. H1]
 gi|409957715|gb|EKO16619.1| putative membrane protein [Leptospira kirschneri str. H1]
          Length = 383

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 98/359 (27%), Positives = 154/359 (42%), Gaps = 102/359 (28%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            + KS     R+ SLD+FRG+ VA MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MENKSTQNKNRVLSLDLFRGMTVAGMILVNNPG-SWSFIYSPLKHAKWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FLF VG++I L++  K    ++     +  R++ L+  G+ L      +  EL       
Sbjct: 60  FLFAVGISIQLSVYSKNKIYKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL------- 112

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
             R+ GVLQRI   Y                                W++A+  L++   
Sbjct: 113 --RIPGVLQRIGFVY--------------------------------WIVASLHLILPKR 138

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKV 247
           ++  +++P                +  V   V  ++ PP  ++ Y          IDR V
Sbjct: 139 MILISWIP----------------ILLVHTWVLIQIPPPGESIVYLEPGKDIGAWIDRNV 182

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
            G NH+     W+ SK                       ++PEGL S +SSI ++++GV 
Sbjct: 183 FGENHL-----WKFSKT----------------------WDPEGLFSGISSIATSLLGVF 215

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            G  I+ +K +  + +     GF +L+  + L +   +P+NK L+T SYV  T+G A L
Sbjct: 216 CGS-ILSSKTNEIKKQILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFL 273


>gi|444731031|gb|ELW71398.1| Heparan-alpha-glucosaminide N-acetyltransferase [Tupaia chinensis]
          Length = 732

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/382 (25%), Positives = 163/382 (42%), Gaps = 71/382 (18%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
            RL  +D FRG+A+ LM+ V++ GG +    HA WN  +     +P  L  +GV   +  
Sbjct: 265 HRLRCVDTFRGIALILMVFVNYGGGKYWYFKHASWN-VSWDKVRIPGVLQRLGVTFFVV- 322

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGIL-------------------LQGGFSHAPDELTYG 133
                   AV +++F     +++G+                     +      P  L   
Sbjct: 323 --------AVLELLFAKPVCIYYGVFNFSVNDIYAAAGMFKIQIARENCVEEFPVNLYRD 374

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMAA 189
           +    +R+ GVLQR+ +++ +V+++E+ F K V +   S       R     W  WL+  
Sbjct: 375 LSWDKVRIPGVLQRLGVTFFVVAVLELLFAKPVPENCASERSCLSLRDVTSSWPQWLVIL 434

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
            +  ++L L +   VP      +      D+GK  N T G          A GYID  +L
Sbjct: 435 MLESIWLGLTFFLPVPGCPKGYLGPGGIGDFGKYPNCTGG----------AAGYIDHLLL 484

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G +H+Y HP+                            ++PEG+L +++SI+   +GV  
Sbjct: 485 GADHLYKHPS------------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQA 526

Query: 309 GHVIIH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSG 362
           G ++++    TK  L R   W  +   L+  GLT    N   IP+NK L+++SYV   S 
Sbjct: 527 GKILLYYKDRTKDILIRFTAWCCI-LGLISIGLTKISENEGFIPVNKNLWSISYVTTLSS 585

Query: 363 AAALVFSAIYALVDCNIFGPWT 384
            A  +   +Y +VD  + G WT
Sbjct: 586 FAFFILLILYPVVD--VRGLWT 605


>gi|333378336|ref|ZP_08470067.1| hypothetical protein HMPREF9456_01662 [Dysgonomonas mossii DSM
           22836]
 gi|332883312|gb|EGK03595.1| hypothetical protein HMPREF9456_01662 [Dysgonomonas mossii DSM
           22836]
          Length = 387

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 89/373 (23%), Positives = 158/373 (42%), Gaps = 95/373 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           + RL SLD+ RG+ +A MILV++ G  W  I     HA WNG    D + PFF+FI+G++
Sbjct: 6   SSRLLSLDVLRGITIAGMILVNNPG-SWGHIYAPLRHAEWNGLTPTDLIFPFFMFIMGIS 64

Query: 88  IALALKRIPDR--ADAVKKVIFRTLKLLFWGILL-----QGGFSHAPDELTYGVDVRM-- 138
             ++L++         ++K++ RT  +   G+ L       G  HA      G   R+  
Sbjct: 65  TFISLRKFNFEFSVPTLRKILKRTFVIFLIGLGLSWLGVSFGTYHALAADNLGFLERLGR 124

Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  +R+ GV+QR+AL+Y + SL+ IF K                     +++   +
Sbjct: 125 SVTNFEHLRILGVMQRLALTYGITSLIAIFIKHKYIP----------------YIIVVGL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
           +  +L LL+G           N  + +   +  VT                 D+ +LG+N
Sbjct: 169 VGYFLLLLFG-----------NGFATEGYNILAVT-----------------DQSILGLN 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                                 +PEG+LS++ ++   +IG + G +
Sbjct: 201 HMY----------------------------TEFGLDPEGILSTIPAVCHVLIGFYCGKI 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++ TK +  R+     +G  L   G  L +    P+NK++++ ++   T G  A   S +
Sbjct: 233 LMETKDNQQRMLHLFIIGAILTFSGFLLSY--GCPINKKIWSPTFELTTCGLGATFLSLL 290

Query: 372 YALVDCNIFGPWT 384
             ++D   +  W+
Sbjct: 291 IWIIDVKGYKKWS 303


>gi|359460787|ref|ZP_09249350.1| hypothetical protein ACCM5_18818 [Acaryochloris sp. CCMEE 5410]
          Length = 383

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 95/360 (26%), Positives = 158/360 (43%), Gaps = 86/360 (23%)

Query: 38  LDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKR 94
           LD+FRG+A+A M+LV+ +G     +P++ HA W+G  LAD V PFFL ++G ++A ++ R
Sbjct: 13  LDVFRGIAIAGMLLVNKSGLVKEAYPQLLHADWHGWTLADLVFPFFLVVLGASMAFSMAR 72

Query: 95  ------IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
                  P RA  + K++ R+  L   G+ L G +S          +   +R+ G+LQRI
Sbjct: 73  HTASLTQPKRAVYL-KILRRSAVLFGLGLFLNGFWSF---------NFSTLRVMGILQRI 122

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           +L+YL  + V +    +  K Q              W +   +LV Y   L    VP   
Sbjct: 123 SLTYLASAFVIL---KLPRKSQ--------------WGLTGLLLVGYWLALSFIPVP--- 162

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
                    ++G            L    N   YIDR ++G +H+Y    +         
Sbjct: 163 ---------EFGP---------GNLTRTGNFGAYIDRLIIGSSHLYVGDQFNSMG----- 199

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK----Q 324
                              +PEGL S++ +I + ++G +F    I  +G   ++K     
Sbjct: 200 -------------------DPEGLFSTLPAIATVLLG-YFAGDWIRKRGSGLKIKTSRQS 239

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
                + L   GL L ++   P+NK+L+T SYV  T G A ++ +  Y L++      W+
Sbjct: 240 LALASYGLFSTGLGLLWSIWFPINKKLWTSSYVLFTVGIALILLAVCYELIEVRRIRLWS 299


>gi|410631394|ref|ZP_11342069.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
           BSs20135]
 gi|410148840|dbj|GAC18936.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
           BSs20135]
          Length = 366

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 97/347 (27%), Positives = 148/347 (42%), Gaps = 87/347 (25%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAIA 89
           R+ ++D+ RGLA+ALM+LV++ G  W  +     HA W+G    D V PFFLF++G ++A
Sbjct: 8   RIEAIDVLRGLALALMLLVNNPG-SWSAVYAPFLHADWHGLTPTDLVFPFFLFVMGASMA 66

Query: 90  LALK-RIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGVDVRMIRLCGVLQR 147
            +L+ +I         +  R+  L+F G LLQ   F  APD           R+ GVLQR
Sbjct: 67  CSLRGQIQASGLPWLSIFKRSFLLVFIGFLLQIIPFDQAPDTW---------RIMGVLQR 117

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           I L +LLV+ +    K+                  W  L A   L+VY  LL        
Sbjct: 118 IGLCFLLVASMLAIIKE-----------------RWLLLSAVVTLIVYWLLL-------- 152

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
               ++   A Y              +   N+V + D  +LG  HM+             
Sbjct: 153 ----LSAGQAPY--------------SLENNSVRHFDMAILGSAHMWQGKG--------- 185

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                             PF+PEGLLS++ + ++ + G      ++  K    ++ Q + 
Sbjct: 186 -----------------LPFDPEGLLSTIGAAMTVLSGYLICVNVLQQKNQKQQILQLMI 228

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           +G  LL  G      +  P+NK L+T SYV V+S  A L  + I  L
Sbjct: 229 VGAILLALGFVWSVWH--PINKSLWTSSYVLVSSAFACLSLAVIILL 273


>gi|409990365|ref|ZP_11273749.1| hypothetical protein APPUASWS_05524 [Arthrospira platensis str.
           Paraca]
 gi|291567406|dbj|BAI89678.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409938771|gb|EKN80051.1| hypothetical protein APPUASWS_05524 [Arthrospira platensis str.
           Paraca]
          Length = 378

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 150/361 (41%), Gaps = 87/361 (24%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD+FRG+A+A MILV++ G     +P + HA W+GC   D V P FL IVGVAIA 
Sbjct: 9   RLISLDVFRGIAIAAMILVNNPGSWGYMYPVLQHAQWHGCTPTDVVFPSFLLIVGVAIAF 68

Query: 91  ALKRIPDR----ADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           +L +         D V   ++    R   LLF   L+  GF +         D+  IR+ 
Sbjct: 69  SLSKFSPEHRLGGDGVPPSVYSRIGRRCLLLFLLGLILNGFPN--------YDLANIRIM 120

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIA++Y L ++  +     Q                  WL++   L+ Y   +   
Sbjct: 121 GVLQRIAIAYGLSAIAILNLSRRQ-----------------LWLISIFTLIGYWLAMTMI 163

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            VP +                         L+P  N   +ID+ +LG +H+         
Sbjct: 164 PVPGYS---------------------PGNLSPEGNLGAFIDQTILGSHHL--------- 193

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                              W   P++PEGL S+  + ++ IIG   G  +     +   +
Sbjct: 194 -------------------WRGGPYDPEGLFSTAPATVTVIIGYLTGEWLKSQPRNSLTV 234

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGP 382
              V    + L+ G         P+NK L+T SYV VT+G   L+ +  Y +++   +  
Sbjct: 235 INLVMFALSSLVVGYLWGIW--FPINKALWTSSYVLVTAGWGLLLLAFCYGVIEVKNWRR 292

Query: 383 W 383
           W
Sbjct: 293 W 293


>gi|212693969|ref|ZP_03302097.1| hypothetical protein BACDOR_03493 [Bacteroides dorei DSM 17855]
 gi|265751179|ref|ZP_06087242.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|423228550|ref|ZP_17214956.1| hypothetical protein HMPREF1063_00776 [Bacteroides dorei
           CL02T00C15]
 gi|423243815|ref|ZP_17224891.1| hypothetical protein HMPREF1064_01097 [Bacteroides dorei
           CL02T12C06]
 gi|212663501|gb|EEB24075.1| hypothetical protein BACDOR_03493 [Bacteroides dorei DSM 17855]
 gi|263238075|gb|EEZ23525.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|392635957|gb|EIY29845.1| hypothetical protein HMPREF1063_00776 [Bacteroides dorei
           CL02T00C15]
 gi|392644181|gb|EIY37924.1| hypothetical protein HMPREF1064_01097 [Bacteroides dorei
           CL02T12C06]
          Length = 363

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 92/358 (25%), Positives = 157/358 (43%), Gaps = 84/358 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + HA +NG    D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+LL      A    T+ ++    R  GV+QR
Sbjct: 67  ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVP 205
           +A+ Y + SLV I  K                    H    A +LV   A  LL  T   
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAAYFLLLAT--- 160

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                         G  FN             N V   D   LG +HMYH          
Sbjct: 161 --------------GDGFN---------QSETNVVARFDAWALGTSHMYH---------- 187

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                 EG +           F+PEGLLS+V ++   ++G + G +++  K +  ++++ 
Sbjct: 188 ------EGGM----------AFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRL 231

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
             +G  L   G  L +    P+NK++++ ++V +T G A+   + +  ++D   +  W
Sbjct: 232 FLIGTILTFAGFLLSY--GCPINKKVWSPTFVIITCGLASSFLALLIWIIDMKGYQNW 287


>gi|345516841|ref|ZP_08796327.1| hypothetical protein BSEG_03945 [Bacteroides dorei 5_1_36/D4]
 gi|229437727|gb|EEO47804.1| hypothetical protein BSEG_03945 [Bacteroides dorei 5_1_36/D4]
          Length = 363

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 92/358 (25%), Positives = 157/358 (43%), Gaps = 84/358 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + HA +NG    D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+LL      A    T+ ++    R  GV+QR
Sbjct: 67  ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVP 205
           +A+ Y + SLV I  K                    H    A +LV   A  LL  T   
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAAYFLLLAT--- 160

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                         G  FN             N V   D   LG +HMYH          
Sbjct: 161 --------------GDGFN---------QSETNVVARFDAWALGTSHMYH---------- 187

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                 EG +           F+PEGLLS+V ++   ++G + G +++  K +  ++++ 
Sbjct: 188 ------EGGM----------AFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRL 231

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
             +G  L   G  L +    P+NK++++ ++V +T G A+   + +  ++D   +  W
Sbjct: 232 FLIGTILTFAGFLLSY--GCPINKKVWSPTFVIITCGLASSFLALLIWIIDMKGYQNW 287


>gi|418709516|ref|ZP_13270303.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|421125723|ref|ZP_15585968.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|421135286|ref|ZP_15595410.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|410020544|gb|EKO87345.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|410436829|gb|EKP85940.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|410770179|gb|EKR45405.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|456966468|gb|EMG08068.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. LT2186]
          Length = 381

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 95/356 (26%), Positives = 149/356 (41%), Gaps = 102/356 (28%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
           ++ L   R+ SLD+FRG+ VA MILV++ G  W  I     HA WNGC   D V PFFLF
Sbjct: 2   ENKLNQNRILSLDLFRGMTVAGMILVNNPG-SWSFIYSPLKHARWNGCTPTDLVFPFFLF 60

Query: 83  IVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
            VG++I  ++  K     +     +  R++ L+  G+ L      +  EL         R
Sbjct: 61  AVGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------R 111

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
           + GVLQRI   Y                                W++A+  L++    + 
Sbjct: 112 IPGVLQRIGFVY--------------------------------WVVASLYLILPKRAIL 139

Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGI 250
            +++P                +  V   +  +L PP  ++ Y          IDR V G 
Sbjct: 140 ISWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGE 183

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           NH+     W+ SK                       ++PEG  S +SSI ++++GV  G 
Sbjct: 184 NHL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGS 216

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            I+ +K +  + +     GF +L   + L +   +P+NK L+T SYV  T+G A L
Sbjct: 217 -ILSSKTNETKKQILSIFGFGILFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFL 271


>gi|237710367|ref|ZP_04540848.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
 gi|229455829|gb|EEO61550.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
          Length = 366

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 154/358 (43%), Gaps = 84/358 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + HA +NG    D V PFF+FI+G++  
Sbjct: 10  KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHASFNGLTPTDLVFPFFMFIMGISTY 69

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+LL      A    T+ ++    R  GV+QR
Sbjct: 70  ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 126

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVP 205
           +A+ Y + SLV I  K                    H    A +LV   A  LL  T   
Sbjct: 127 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAAYFLLLAT--- 163

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                         G  FN             N V   D   LG +HMYH          
Sbjct: 164 --------------GDGFN---------QSETNVVARFDAWALGTSHMYHESG------- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                F+PEGLLS+V ++   ++G + G +++  K +  ++++ 
Sbjct: 194 -------------------MAFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRL 234

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
             +G  L   G  L +    P+NK++++ ++V +T G A+   + +  ++D   +  W
Sbjct: 235 FLIGTILTFAGFLLSY--GCPINKKVWSPTFVIITCGLASSFLALLIWIIDMKGYQNW 290


>gi|395804141|ref|ZP_10483382.1| hypothetical protein FF52_19760 [Flavobacterium sp. F52]
 gi|395433785|gb|EJF99737.1| hypothetical protein FF52_19760 [Flavobacterium sp. F52]
          Length = 423

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 102/403 (25%), Positives = 160/403 (39%), Gaps = 116/403 (28%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +  +RL SLD+FRG  + LM +V++ G     +P + HA W+GC   D V PFF+FI+G 
Sbjct: 1   MTKERLTSLDVFRGFTIFLMTIVNNPGSWSSIYPPLEHAEWHGCTPTDLVFPFFVFIMGT 60

Query: 87  AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-------------------------- 120
           AI  A+           K++ R+L++   G+ L                           
Sbjct: 61  AIPFAMPVKHFDGSVFNKILVRSLRIFCLGLFLSVFSRIHLFGLEGIPLLGLRLIVAFAV 120

Query: 121 -----GGFSHAPDE-LTYGVDVRMI-------------RLCGVLQRIALSYLLVSLVEIF 161
                G FS      L  GV + +I             R+ GVLQRIA+ Y   S++ + 
Sbjct: 121 AYALLGNFSMKVKTILAVGVFLILISLSFSGLEHFEDTRIPGVLQRIAIVYFFTSILYLK 180

Query: 162 TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGK 221
           T     K Q +    +   Y   WL+ A             +VP   F   N D      
Sbjct: 181 T---NLKTQLLVLAGLLVGY---WLLMA-------------FVPVPGFGPANFDKG---- 217

Query: 222 VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
                           N   +ID                       D+   G L   + +
Sbjct: 218 ---------------TNLAAWID-----------------------DTLLNGHLWASSKT 239

Query: 282 WCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
           W     +PEG+LS++ +I + I+G++ G ++  +   +  +K+    G AL+I GL  + 
Sbjct: 240 W-----DPEGILSTLPAIGTGILGMYIGQLLNLSVDKMEIVKKTAIAGTALVIGGLIWNI 294

Query: 342 TNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
               P+NK L+T SYV  T+G A L  + +Y ++D      WT
Sbjct: 295 --FFPINKSLWTSSYVLYTAGIATLCLTLLYYIIDIKGHKKWT 335


>gi|330805524|ref|XP_003290731.1| hypothetical protein DICPUDRAFT_49381 [Dictyostelium purpureum]
 gi|325079117|gb|EGC32733.1| hypothetical protein DICPUDRAFT_49381 [Dictyostelium purpureum]
          Length = 644

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 86/145 (59%), Gaps = 15/145 (10%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFF 80
           +S  + +++ K  RL SLD+FRG ++ +MI V++ GG +   +H+ WNG  +AD V P+F
Sbjct: 194 ISSVERENNKKKDRLKSLDVFRGFSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWF 253

Query: 81  LFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           +FI+G+A+ L+   +  R       V+K++ R+  L   G+ +             GV++
Sbjct: 254 VFIMGIAMPLSFNAMERRGTTKLVIVQKLVRRSAILFALGLFINN-----------GVNL 302

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIF 161
           +  R+ GVLQR A+SYL+V L+ +F
Sbjct: 303 QHWRILGVLQRFAISYLIVGLIMLF 327



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 73/146 (50%), Gaps = 13/146 (8%)

Query: 241 GYIDRKVLGINHMYHHPAWRRS-----KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
           GY+    +G N +Y +     +     K  T +  F+ P  +D   +  + ++PEG +  
Sbjct: 430 GYLGPGGIGDNSLYPNCTGGAARLIDMKIFTNNHIFQSPTCQDI--YKTSSYDPEGTVGY 487

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQW-----VTMGFALLIFGLTLHFTNAIPLNKQ 350
           ++SI    IGV  G +I+  K + +RL +W     V  G A  + GL+ +    IP+NK 
Sbjct: 488 LTSIFICFIGVQAGRIILIYKSNRSRLIRWMVWSAVCCGIAAGLCGLSQN-DGVIPINKN 546

Query: 351 LYTLSYVCVTSGAAALVFSAIYALVD 376
           L++ S+V + +G    V + ++ ++D
Sbjct: 547 LWSPSFVFLMAGFGFFVLTIMFIVID 572


>gi|417760159|ref|ZP_12408187.1| putative membrane protein [Leptospira interrogans str. 2002000624]
 gi|417775681|ref|ZP_12423532.1| putative membrane protein [Leptospira interrogans str. 2002000621]
 gi|418673844|ref|ZP_13235155.1| putative membrane protein [Leptospira interrogans str. 2002000623]
 gi|409944118|gb|EKN89707.1| putative membrane protein [Leptospira interrogans str. 2002000624]
 gi|410574555|gb|EKQ37586.1| putative membrane protein [Leptospira interrogans str. 2002000621]
 gi|410579122|gb|EKQ46972.1| putative membrane protein [Leptospira interrogans str. 2002000623]
          Length = 381

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 95/356 (26%), Positives = 148/356 (41%), Gaps = 102/356 (28%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
           ++ L   R+ SLD+FRG+ VA MILV++ G  W  I     HA WNGC   D V PFFLF
Sbjct: 2   ENKLNQNRILSLDLFRGMTVAGMILVNNPG-SWSFIYSPLKHARWNGCTPTDLVFPFFLF 60

Query: 83  IVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
            VG++I  ++  K     +     +  R++ L+  G+ L      +  EL         R
Sbjct: 61  AVGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------R 111

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
           + GVLQRI   Y                                W++A+  L++    + 
Sbjct: 112 IPGVLQRIGFVY--------------------------------WVVASLYLILPKRAIL 139

Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGI 250
            +++P                +  V   +  +L PP  ++ Y          IDR V G 
Sbjct: 140 ISWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGE 183

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           NH+     W+ SK                       ++PEG  S +SSI ++++GV  G 
Sbjct: 184 NHL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGS 216

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            I+ +K +  + +     GF  L   + L +   +P+NK L+T SYV  T+G A L
Sbjct: 217 -ILSSKTNETKKQILSIFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFL 271


>gi|383934719|ref|ZP_09988159.1| heparan-alpha-glucosaminide N-acetyltransferase [Rheinheimera
           nanhaiensis E407-8]
 gi|383704254|dbj|GAB58250.1| heparan-alpha-glucosaminide N-acetyltransferase [Rheinheimera
           nanhaiensis E407-8]
          Length = 397

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 96/364 (26%), Positives = 164/364 (45%), Gaps = 67/364 (18%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           + L T R+ +LD+ RGL +  MILV++ G     +  + HA W+G  + D + PFF+ IV
Sbjct: 14  AKLNTNRMLALDVLRGLTITAMILVNNPGSWNYVYSPLLHAQWHGWTITDLIFPFFIVIV 73

Query: 85  GVAIALALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD-ELTYGVD-VRMIR 140
           G+++ L+L++  + ++   +++ + R+ KL   G+LL   + +  D E  Y  D +  +R
Sbjct: 74  GMSLQLSLRQHSLNNKGPLIRQALLRSGKLFGLGLLLALFYYNFRDPEFNYVEDRLLTVR 133

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
             GVLQRI L YL   L+ ++                             ++ VYLA L+
Sbjct: 134 WLGVLQRIGLVYLATVLIVLYFGQRGRLL-----------------WLLGLMAVYLAGLW 176

Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
                D Q           G  F      R  L    + V ++D+ VLG NH+Y+  A  
Sbjct: 177 WLPYQDAQ-----------GHEF------RGLLLFGNSFVAWLDQLVLGANHVYYRSA-- 217

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
                   +PF               F+PEGL S++ +I S + GV     +   +    
Sbjct: 218 --------TPFA--------------FDPEGLWSTLPAIASCLTGVLMAQWLQSERSLAQ 255

Query: 321 RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIF 380
           +++  +  G   +      HF+  +P+NK L+T S+V ++SG  A+  +A   L D   +
Sbjct: 256 KIRGLLLCGVVAVWLAELWHFS--LPVNKSLWTPSFVLLSSGYCAIALAACLWLCDVKGW 313

Query: 381 GPWT 384
             W+
Sbjct: 314 RRWS 317


>gi|24213473|ref|NP_710954.1| hypothetical protein LA_0773 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45658672|ref|YP_002758.1| hypothetical protein LIC12842 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|386073105|ref|YP_005987422.1| hypothetical protein LIF_A0631 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|417764272|ref|ZP_12412242.1| putative membrane protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
 gi|417786789|ref|ZP_12434477.1| putative membrane protein [Leptospira interrogans str. C10069]
 gi|418669621|ref|ZP_13231000.1| putative membrane protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|418701883|ref|ZP_13262801.1| putative membrane protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|418702896|ref|ZP_13263788.1| putative membrane protein [Leptospira interrogans serovar
           Hebdomadis str. R499]
 gi|418717763|ref|ZP_13277304.1| putative membrane protein [Leptospira interrogans str. UI 08452]
 gi|418729566|ref|ZP_13288113.1| putative membrane protein [Leptospira interrogans str. UI 12758]
 gi|421083731|ref|ZP_15544602.1| putative membrane protein [Leptospira santarosai str. HAI1594]
 gi|421102101|ref|ZP_15562711.1| putative membrane protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|421121310|ref|ZP_15581607.1| putative membrane protein [Leptospira interrogans str. Brem 329]
 gi|24194245|gb|AAN47972.1|AE011263_12 conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45601916|gb|AAS71395.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|353456894|gb|AER01439.1| conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|400353508|gb|EJP05677.1| putative membrane protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
 gi|409950064|gb|EKO04595.1| putative membrane protein [Leptospira interrogans str. C10069]
 gi|410345744|gb|EKO96814.1| putative membrane protein [Leptospira interrogans str. Brem 329]
 gi|410368246|gb|EKP23624.1| putative membrane protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410433648|gb|EKP77988.1| putative membrane protein [Leptospira santarosai str. HAI1594]
 gi|410754552|gb|EKR16202.1| putative membrane protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|410759015|gb|EKR25234.1| putative membrane protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|410767440|gb|EKR38115.1| putative membrane protein [Leptospira interrogans serovar
           Hebdomadis str. R499]
 gi|410775744|gb|EKR55735.1| putative membrane protein [Leptospira interrogans str. UI 12758]
 gi|410786933|gb|EKR80669.1| putative membrane protein [Leptospira interrogans str. UI 08452]
 gi|456824782|gb|EMF73208.1| putative membrane protein [Leptospira interrogans serovar Canicola
           str. LT1962]
          Length = 381

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 95/356 (26%), Positives = 148/356 (41%), Gaps = 102/356 (28%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
           ++ L   R+ SLD+FRG+ VA MILV++ G  W  I     HA WNGC   D V PFFLF
Sbjct: 2   ENKLNQNRILSLDLFRGMTVAGMILVNNPG-SWSFIYSPLKHARWNGCTPTDLVFPFFLF 60

Query: 83  IVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
            VG++I  ++  K     +     +  R++ L+  G+ L      +  EL         R
Sbjct: 61  AVGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------R 111

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
           + GVLQRI   Y                                W++A+  L++    + 
Sbjct: 112 IPGVLQRIGFVY--------------------------------WVVASLYLILPKRAIL 139

Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGI 250
            +++P                +  V   +  +L PP  ++ Y          IDR V G 
Sbjct: 140 ISWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGE 183

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           NH+     W+ SK                       ++PEG  S +SSI ++++GV  G 
Sbjct: 184 NHL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGS 216

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            I+ +K +  + +     GF  L   + L +   +P+NK L+T SYV  T+G A L
Sbjct: 217 -ILSSKTNETKKQILSIFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFL 271


>gi|418690664|ref|ZP_13251772.1| putative membrane protein [Leptospira interrogans str. FPW2026]
 gi|418722429|ref|ZP_13281595.1| putative membrane protein [Leptospira interrogans str. UI 12621]
 gi|400360164|gb|EJP16144.1| putative membrane protein [Leptospira interrogans str. FPW2026]
 gi|409963797|gb|EKO27519.1| putative membrane protein [Leptospira interrogans str. UI 12621]
 gi|455790461|gb|EMF42326.1| putative membrane protein [Leptospira interrogans serovar Lora str.
           TE 1992]
          Length = 381

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 95/356 (26%), Positives = 148/356 (41%), Gaps = 102/356 (28%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
           ++ L   R+ SLD+FRG+ VA MILV++ G  W  I     HA WNGC   D V PFFLF
Sbjct: 2   ENKLNQNRILSLDLFRGMTVAGMILVNNPG-SWSFIYSPLKHARWNGCTPTDLVFPFFLF 60

Query: 83  IVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
            VG++I  ++  K     +     +  R++ L+  G+ L      +  EL         R
Sbjct: 61  AVGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------R 111

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
           + GVLQRI   Y                                W++A+  L++    + 
Sbjct: 112 IPGVLQRIGFVY--------------------------------WVVASLYLILPKRAIL 139

Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGI 250
            +++P                +  V   +  +L PP  ++ Y          IDR V G 
Sbjct: 140 ISWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGE 183

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           NH+     W+ SK                       ++PEG  S +SSI ++++GV  G 
Sbjct: 184 NHL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGS 216

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            I+ +K +  + +     GF  L   + L +   +P+NK L+T SYV  T+G A L
Sbjct: 217 -ILSSKTNETKKQILSIFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFL 271


>gi|320105553|ref|YP_004181143.1| hypothetical protein AciPR4_0312 [Terriglobus saanensis SP1PR4]
 gi|319924074|gb|ADV81149.1| hypothetical protein AciPR4_0312 [Terriglobus saanensis SP1PR4]
          Length = 412

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 94/367 (25%), Positives = 157/367 (42%), Gaps = 76/367 (20%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPF 79
           +D   ++  K  RL SLD+ RG+ +  MILV++  G+  +  + HA WNG    D V P 
Sbjct: 18  TDSAARTTHKPARLLSLDVLRGVTIGFMILVNNQTGEGAFFPLQHAKWNGFTPTDLVFPT 77

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGG-FSHAPDELTYGV 134
           FL +VG++  L+ +    R  A   +   TL+    L  +G+++    F H         
Sbjct: 78  FLLLVGLSTVLSTEARLARGVAKSTIFLHTLQRSAVLFLFGLIVNNAPFFH--------- 128

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
            ++ +R+ GVL RIA+ Y +V  + +  +D++            R +      AAC LV 
Sbjct: 129 -LQTLRVYGVLPRIAVCYFIVGSLYLLVRDLKQ-----------RAFILAAAAAAC-LVG 175

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y AL+    +P                 F          +P  N V YIDR +   +H+Y
Sbjct: 176 YWALMRFIPIPG----------------FGTPTHEIPINDPDGNLVAYIDRHIFSASHLY 219

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                      T+D                    PEGLLS++ ++ + + G+  G  +  
Sbjct: 220 EK---------TRD--------------------PEGLLSTIPAVATALFGILAGIWLRT 250

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           ++  + + K     G + LI G   H   A P+NK+L+T S+     G + L+ +    L
Sbjct: 251 SRSTMQKAKGIEYAGISFLILGGAWHL--AFPINKKLWTSSFSLWAGGWSLLLLALFIYL 308

Query: 375 VDCNIFG 381
           +D + FG
Sbjct: 309 IDIHRFG 315


>gi|417770421|ref|ZP_12418329.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Pomona]
 gi|418680131|ref|ZP_13241383.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Kennewicki LC82-25]
 gi|421117858|ref|ZP_15578212.1| putative membrane protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|400328139|gb|EJO80376.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Kennewicki LC82-25]
 gi|409947562|gb|EKN97558.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Pomona]
 gi|410010535|gb|EKO68672.1| putative membrane protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|455668600|gb|EMF33807.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Fox 32256]
          Length = 381

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 95/356 (26%), Positives = 148/356 (41%), Gaps = 102/356 (28%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
           ++ L   R+ SLD+FRG+ VA MILV++ G  W  I     HA WNGC   D V PFFLF
Sbjct: 2   ENKLNQNRILSLDLFRGMTVAGMILVNNPG-SWSFIYSPLKHARWNGCTPTDLVFPFFLF 60

Query: 83  IVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
            VG++I  ++  K     +     +  R++ L+  G+ L      +  EL         R
Sbjct: 61  AVGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------R 111

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
           + GVLQRI   Y                                W++A+  L++    + 
Sbjct: 112 IPGVLQRIGFVY--------------------------------WVVASLYLILPKRAIL 139

Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGI 250
            +++P                +  V   +  +L PP  ++ Y          IDR V G 
Sbjct: 140 ISWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGE 183

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           NH+     W+ SK                       ++PEG  S +SSI ++++GV  G 
Sbjct: 184 NHL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGS 216

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            I+ +K +  + +     GF  L   + L +   +P+NK L+T SYV  T+G A L
Sbjct: 217 -ILSSKTNETKKQILSIFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFL 271


>gi|297300348|ref|XP_001115683.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Macaca mulatta]
          Length = 547

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 83/294 (28%), Positives = 138/294 (46%), Gaps = 49/294 (16%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG-----CNLADFVMPFFLFIVGVAI 88
           RL S+D FRG+A+ LM+ V++ GG +    HA WNG     C +  F M  F+FI+G +I
Sbjct: 267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGTLPMQCGICIFAM-MFVFIMGSSI 325

Query: 89  ALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
            L++  I  R  +    + K+ +R+  L+  GI++       P+     +    +R+ GV
Sbjct: 326 FLSMTSILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGV 380

Query: 145 LQRIALSYLLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
           LQR+ ++Y +V+++E +F K V +    ++S             WL+   +  ++L L +
Sbjct: 381 LQRLGVTYFVVAVLELLFAKPVPEHCALERSCLSLRDITSSWPQWLLILALEGLWLGLTF 440

Query: 201 GTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
              VP      +      D+GK  N T G          A GYIDR +LG +H+Y HP+ 
Sbjct: 441 LLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS- 489

Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH-FGHVI 312
                                      ++PEG+L +++SI+   +GV  F H I
Sbjct: 490 -----------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQVFVHFI 526


>gi|410941669|ref|ZP_11373463.1| putative membrane protein [Leptospira noguchii str. 2006001870]
 gi|410783218|gb|EKR72215.1| putative membrane protein [Leptospira noguchii str. 2006001870]
          Length = 381

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 156/363 (42%), Gaps = 112/363 (30%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            ++KS+    R+ SLD+FRG+ VA MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKKSN--QNRILSLDLFRGMTVAGMILVNNPG-SWSFIYTPLKHAKWNGCTPTDLVFPF 57

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ--GGFSHAPDELTYG 133
           FLF+VG +I  +L         + K+ F    R++ L+  G+ L   G +S +       
Sbjct: 58  FLFVVGTSIPFSLYS--KNKIYISKIWFGICIRSITLILIGLFLNFFGEWSFSK------ 109

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
                +R+ G+LQRI   Y                                W++A+  L+
Sbjct: 110 -----LRIPGILQRIGFVY--------------------------------WVVASLYLM 132

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------I 243
           +   ++  +++P                +  V   V  ++ PP  ++ Y          I
Sbjct: 133 LPKRIILISWIP----------------ILIVHTWVLIQIPPPGESIVYLEPGKDIGAWI 176

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           DR V G NH+     W+ SK                       ++PEG  S +SSI +T+
Sbjct: 177 DRNVFGENHL-----WKFSKT----------------------WDPEGFFSGISSIATTL 209

Query: 304 IGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGA 363
           +GV  G  I+ +K +  + +     GF +L+  + L +   +P+NK L+T SYV  T+G 
Sbjct: 210 LGVFCGS-ILSSKTNETKKQILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGL 268

Query: 364 AAL 366
           A L
Sbjct: 269 AFL 271


>gi|237720190|ref|ZP_04550671.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|293371122|ref|ZP_06617659.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|229450742|gb|EEO56533.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|292633780|gb|EFF52332.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
          Length = 371

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 155/370 (41%), Gaps = 96/370 (25%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +K++RL SLDI RG+ +  MILV++ G  W  I     HA WNG    D V PFF+FI+G
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59

Query: 86  VAIALALKRIPDRADA--VKKVIFRTLKLLF------WGILLQGG----FSHAPDELTYG 133
           V+++ AL R         + K++ RTL L        W  L+  G    FSH        
Sbjct: 60  VSMSFALSRFDHHFSRGFIIKLVRRTLILFLLGLFLSWFSLVCTGVEQPFSH-------- 111

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
                IR+ GVLQR+AL+Y   SL+ +  +   +               W   ++  +L 
Sbjct: 112 -----IRILGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILA 150

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
            Y  LL                    G  F ++           N +   DR + G  H+
Sbjct: 151 GYSTLL------------------ALGHGFELS---------EQNIIAVTDRTLFGEAHL 183

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y                     R+  P      F+PEGLLS++  I   IIG   G+++ 
Sbjct: 184 Y---------------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILR 222

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
                  RL Q   +G ALL  G  L +    PLNK++++ ++V VT G A+L+   +  
Sbjct: 223 EKTEIHHRLLQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFASLLLVFLTW 280

Query: 374 LVDCNIFGPW 383
           L+D      W
Sbjct: 281 LIDIRKKQKW 290


>gi|423239671|ref|ZP_17220787.1| hypothetical protein HMPREF1065_01410 [Bacteroides dorei
           CL03T12C01]
 gi|392645711|gb|EIY39434.1| hypothetical protein HMPREF1065_01410 [Bacteroides dorei
           CL03T12C01]
          Length = 363

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 91/358 (25%), Positives = 156/358 (43%), Gaps = 84/358 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + H  +NG    D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHVAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+LL      A    T+ ++    R  GV+QR
Sbjct: 67  ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVP 205
           +A+ Y + SLV I  K                    H    A +LV   A  LL  T   
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAAYFLLLAT--- 160

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                         G  FN             N V   D   LG +HMYH          
Sbjct: 161 --------------GDGFN---------QSETNVVARFDAWALGTSHMYH---------- 187

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                 EG +           F+PEGLLS+V ++   ++G + G +++  K +  ++++ 
Sbjct: 188 ------EGGM----------AFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRL 231

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
             +G  L   G  L +    P+NK++++ ++V +T G A+   + +  ++D   +  W
Sbjct: 232 FLIGTILTFAGFLLSY--GCPINKKVWSPTFVIITCGLASSFLALLIWIIDMKGYQNW 287


>gi|346224087|ref|ZP_08845229.1| hypothetical protein AtheD1_02868 [Anaerophaga thermohalophila DSM
           12881]
          Length = 369

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 157/361 (43%), Gaps = 87/361 (24%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K++R  +LD+ RG+ +ALMI V+  G     +  + H+ W+GC   D V PFFLF+VGV+
Sbjct: 3   KSERYLALDVLRGMTIALMITVNTPGSWQYVYAPLRHSSWHGCTPTDLVFPFFLFVVGVS 62

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           +  +  +  +  +  +  ++  RTL +   G+ L    +  P  +T   D   +R+ GVL
Sbjct: 63  MFFSFAKYGNTLNKASFNRLGRRTLLIFAIGLFL----NSFPQWMT---DYSSLRIMGVL 115

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL+Y   SL+ +  K               R Y  +  +   +L+VY          
Sbjct: 116 QRIALAYGFASLIVLSMK---------------RKYIPY--LGLAILLVY---------- 148

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
            W           Y    NVT                 D  +LG  H+Y           
Sbjct: 149 -WGILAWFGGDDPYSLEGNVTIP--------------FDAAILGEQHLY----------T 183

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT-KGHL-ARLK 323
               PF+                PEGLLS++ ++++ ++G   G  I +T K  L ARL 
Sbjct: 184 GFGIPFD----------------PEGLLSTIPAVVTVLLGYLTGVFISNTEKAKLPARLA 227

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
            +   G  + I G    F    P+NK L+T SYV  T+G AALVF+ +  ++D   +  W
Sbjct: 228 LY---GVLVTIIGRLWGF--VFPINKPLWTSSYVLYTAGLAALVFALLIFIIDVKGYKKW 282

Query: 384 T 384
           T
Sbjct: 283 T 283


>gi|434387287|ref|YP_007097898.1| hypothetical protein Cha6605_3369 [Chamaesiphon minutus PCC 6605]
 gi|428018277|gb|AFY94371.1| hypothetical protein Cha6605_3369 [Chamaesiphon minutus PCC 6605]
          Length = 377

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 92/362 (25%), Positives = 161/362 (44%), Gaps = 77/362 (21%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG-----GDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            RL SLD+FRGL +A MILV+ A        +  + HAPW+G  +AD V PFFL+I+GV+
Sbjct: 1   MRLTSLDVFRGLTMATMILVNMASLPNDDRKYAWLDHAPWHGYTIADLVFPFFLYIIGVS 60

Query: 88  IALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRL 141
           +A +L +     +P       +++ R+  L   G++L    +++   E  +  ++  +R+
Sbjct: 61  MAFSLAKYTSGDVPLSKQVYWQILRRSAILFGLGLILNNLVWNYNLTEPKFFANLDKLRI 120

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQRI +++   S+  +                     C  W++   +LV Y  +L  
Sbjct: 121 MGVLQRIGIAFFFASIAVLALAQ----------------RC-LWILTGGILVGYWLIL-- 161

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            ++P         D++D         GV ++     N   Y+DR ++   H++     + 
Sbjct: 162 AFIPA-------LDNSD---------GVFSQFG---NFGAYVDRLIITPAHLH-----KG 197

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
           SK  +                     +PEGL S++ +I S + G   G  +         
Sbjct: 198 SKNLS---------------------DPEGLFSTLPAISSILFGYLTGTWLKRQPVATRT 236

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
               +  G A ++ GL   + +  P+NK+L+T S+V  T+G   +  +A Y LVD   + 
Sbjct: 237 SASLLMYGLAAVVIGLV--WNSFFPINKKLWTSSFVLFTTGWGLISLAACYELVDVRKYR 294

Query: 382 PW 383
            W
Sbjct: 295 QW 296


>gi|373953861|ref|ZP_09613821.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
 gi|373890461|gb|EHQ26358.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
          Length = 379

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 98/351 (27%), Positives = 148/351 (42%), Gaps = 78/351 (22%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           QRL SLD+FRGL VA MILV++ G DW  I     H+ WNGC   D + PFFLFIVGV+I
Sbjct: 14  QRLLSLDVFRGLTVACMILVNNPG-DWAHIYSPLEHSAWNGCTPTDLIFPFFLFIVGVSI 72

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             ++           K++   LK       L    S  P       +   +R+ GVLQRI
Sbjct: 73  VYSMGTKKTDPAQHGKLVLTILKRSLILFCLALFLSLYPK-----FNFHTLRIPGVLQRI 127

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           A+ + +  +  IF K  + K Q +    +F L+          L+VY  L+         
Sbjct: 128 AVVFGICGI--IFLKT-ERKTQLI----LFWLF----------LIVYYLLM--------- 161

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
            T++      Y           A L P  N   +IDR V+G  H++              
Sbjct: 162 -TLVPVPGVGY-----------ANLQPETNLGAWIDRTVIGNVHLW-------------- 195

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV-T 327
                   K++ +W     +PEG+L ++ +  + + G+  G  +       +    W+  
Sbjct: 196 --------KESVTW-----DPEGILGTMPATSTGLFGILVGTWLKRKDVDESTKVAWLFC 242

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
            G   +I GL        P+NK L+T S+V    G A L  +  Y ++D  
Sbjct: 243 TGIGAVILGLLWDL--FFPINKALWTSSFVLYAGGLATLGLTLFYWIIDVQ 291


>gi|390956359|ref|YP_006420116.1| hypothetical protein Terro_0436 [Terriglobus roseus DSM 18391]
 gi|390411277|gb|AFL86781.1| hypothetical protein Terro_0436 [Terriglobus roseus DSM 18391]
          Length = 404

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 93/350 (26%), Positives = 147/350 (42%), Gaps = 84/350 (24%)

Query: 18  EPDVSDQQE----KSHLKTQRLASLDIFRGLAVALMILVDH--AGGDWPEISHAPWNGCN 71
           +P+++   +    ++  KT+RL S+D+ RGL +A MILV++    G + E+ HA WNG  
Sbjct: 10  DPEIASTVDSDVVRTAPKTERLLSVDVLRGLTIAFMILVNNQPGPGAFFELQHAQWNGFT 69

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQG-GFSHA 126
           L D V P FLF+VG+++ L+      +  + K +   TL+    L  +GI++    F H 
Sbjct: 70  LTDLVFPTFLFLVGLSLVLSTAARLAKGASRKTLFLHTLRRSAVLALFGIVVNTFPFQH- 128

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
                    +  IR  GVLQR AL YL+VS + +  K  +DK                  
Sbjct: 129 ---------LDRIRFYGVLQRTALCYLVVSGLCLLRKGWKDKAA---------------- 163

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
           +A   LVVY  L+    VP                 F          +P  N   ++DR 
Sbjct: 164 IAVACLVVYWVLMRFVPVPG----------------FGTPTHEIPINDPNGNLTAWLDRL 207

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           +    H+Y                                 +PEGLLS++ +I + + GV
Sbjct: 208 IFAPQHLYQQVR-----------------------------DPEGLLSTLPAISTALYGV 238

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSY 356
             G  +  T+   A+       G A+ + G    +    PLNK+L+T S+
Sbjct: 239 LAGTWLRTTRSTTAKAVGLALGGVAMTVAGWLWSY--GFPLNKKLWTSSF 286


>gi|333378010|ref|ZP_08469743.1| hypothetical protein HMPREF9456_01338 [Dysgonomonas mossii DSM
           22836]
 gi|332884030|gb|EGK04310.1| hypothetical protein HMPREF9456_01338 [Dysgonomonas mossii DSM
           22836]
          Length = 389

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 87/372 (23%), Positives = 164/372 (44%), Gaps = 91/372 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           + RL SLDI RG+ +A MI+V++ G     +  + HA W+G    D V PFF+FI+G++ 
Sbjct: 6   SSRLLSLDILRGITIAGMIMVNNPGSWSYVYAPLGHAAWHGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQ------GGFSHAPDE--------LTY 132
            ++L++     +   + K++ RT+ +   G+ L         F+    E        +T 
Sbjct: 66  YISLRKFNFEFNKPTLFKILKRTVVIFLIGLGLGWLSLSFRTFNSLSGEDIGFFERFITA 125

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
             +   IR+ GV+QR+AL+Y   +L+ IF K                     +++   ++
Sbjct: 126 ITNFEHIRILGVMQRLALTYGATALIAIFVKHKYIP----------------YIIVVTLI 169

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
             +L LL+G             +  D+ +                N +  +DR +LG +H
Sbjct: 170 GYFLLLLFG-------------NGFDFSE---------------DNIISVLDRAILGADH 201

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           MY            +DS                  +PEGLLS++ +I   +IG   G ++
Sbjct: 202 MY------------KDSGLA--------------IDPEGLLSTIPAICHVLIGFCCGEIL 235

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           + TK +  R+++   +G  +   G  L +    P+NK++++ ++V  T G A+ + + + 
Sbjct: 236 LTTKDNNERIQRLFIIGAIMTFLGFLLSY--GCPINKKIWSPTFVLATCGLASTMLALLI 293

Query: 373 ALVDCNIFGPWT 384
            ++D      W+
Sbjct: 294 WIIDIKGHKKWS 305


>gi|418746616|ref|ZP_13302939.1| PF07786 family protein [Leptospira santarosai str. CBC379]
 gi|410792596|gb|EKR90528.1| PF07786 family protein [Leptospira santarosai str. CBC379]
          Length = 375

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 96/352 (27%), Positives = 154/352 (43%), Gaps = 93/352 (26%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            +++S     R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKQSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVD 135
           FLF VG +I ++L  K   +R+D    +  R+  L+  G+ L   G +S A         
Sbjct: 60  FLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-------- 111

Query: 136 VRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
              +R+ GVLQRI   Y +V SL  +F        + +  FS+             +L++
Sbjct: 112 ---LRIPGVLQRIGFVYWVVASLCLVF------PGKKILVFSV------------AILLI 150

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           +  +L    +P  +  +  +   D G                     +IDR + G  H+ 
Sbjct: 151 HTWILTQIALPG-ESVVSLEQGKDIG--------------------AWIDRTIFGEKHL- 188

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
               WR SK                       ++PEG LS V+S+++T+ GV  G ++  
Sbjct: 189 ----WRFSKT----------------------WDPEGFLSGVASVVTTLFGVLCGFIL-- 220

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
                 R ++   +G  +L   + L +  ++P+NK L+T SY   T+G + L
Sbjct: 221 ----FLRERRNKILGLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLSFL 268


>gi|66808259|ref|XP_637852.1| transmembrane protein [Dictyostelium discoideum AX4]
 gi|60466271|gb|EAL64333.1| transmembrane protein [Dictyostelium discoideum AX4]
          Length = 675

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 54/138 (39%), Positives = 86/138 (62%), Gaps = 11/138 (7%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
           E+ + K  RL SLD+FRG ++ +MI V++ GG +   +H+ WNG  +AD V P+F+FI+G
Sbjct: 198 ERENRKKDRLRSLDVFRGFSITIMIFVNYGGGGYWFFNHSLWNGLTVADLVFPWFVFIMG 257

Query: 86  VAIALALKRIPDRADAVKKVIFRTLKLLFWGILL--QGGFSHAPDELTYGVDVRMIRLCG 143
           +A+ L+   +  R    K++IF+  KLL   I+L   G F      +  GVD++  R+ G
Sbjct: 258 IAMPLSFHAMEKRGTP-KRIIFQ--KLLRRSIILFALGLF------INNGVDLQQWRILG 308

Query: 144 VLQRIALSYLLVSLVEIF 161
           VLQR ++SYL+V  + +F
Sbjct: 309 VLQRFSISYLVVGSIMLF 326



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 53/224 (23%), Positives = 93/224 (41%), Gaps = 36/224 (16%)

Query: 159 EIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-A 217
            I+   +Q +      FS    Y   W+ A  +   +  L++   VP      +     A
Sbjct: 410 NIYMSMIQHETVISKYFSDIAPYWIQWVFALIIFSGWFLLMFLVPVPGCPTGYLGAGGLA 469

Query: 218 DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRK 277
           D G+  + T G          A   ID K+    H++ +P       C +          
Sbjct: 470 DQGRYQHCTGG----------AARLIDLKIFTEAHIFQNPT------CLE--------VY 505

Query: 278 DAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW-----VTMGFAL 332
             PS     ++PEG +  ++SI    IGV  G +I+  K + +RL +W     V  G A 
Sbjct: 506 KTPS-----YDPEGTVGYLTSIFLCFIGVQAGRIILTYKSNRSRLIRWMVWSVVLCGIAA 560

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            + GLT +    +P+NK L++ S++ + +G    V + ++ L+D
Sbjct: 561 GLCGLTQN-QGWLPVNKNLWSPSFILLMAGFGFFVLTVMFILID 603


>gi|410638830|ref|ZP_11349383.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
           lipolytica E3]
 gi|410141358|dbj|GAC16588.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
           lipolytica E3]
          Length = 365

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 94/354 (26%), Positives = 152/354 (42%), Gaps = 88/354 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            R  SLDIFRG+ +A M+LV++ G     +P + HA W+G    D + PFFLFIVG A+ 
Sbjct: 2   NRQISLDIFRGITLAAMLLVNNPGSWSFVYPPLLHAKWHGLTPTDLIFPFFLFIVGAAMF 61

Query: 90  LALKRIPDRADAV-----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
            ++ R   +A+       +K+  RT+ L   G LL        +   +  D +  R+ GV
Sbjct: 62  HSMGRYLPKANQALQVPWQKIAKRTIVLFAIGFLL--------NIFPFTGDPQNWRIMGV 113

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIA+ Y + +++       Q                   L+AAC+      LL G ++
Sbjct: 114 LQRIAICYGIAAILICVLHQKQ-------------------LIAACI-----TLLIGYWL 149

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 ++N     YG                 N V  ID +VLG  H+Y          
Sbjct: 150 ------MLNLVENPYGL--------------ETNLVRLIDIEVLGSAHLYQG-------- 181

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 F+PEGLLS + ++++ + G     ++ + K    R+K 
Sbjct: 182 ------------------FGVAFDPEGLLSCIPAVVTVLAGFFTSKMLANAKTEQQRMK- 222

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
              + ++L+ F + + +  + P+NK L+T +YV  T+G A L  + I  L D  
Sbjct: 223 -TLLLWSLVTFVVCIAWQFSFPVNKSLWTSTYVLATNGFAWLALAVIIYLHDVK 275


>gi|71278983|ref|YP_267171.1| hypothetical protein CPS_0413 [Colwellia psychrerythraea 34H]
 gi|71144723|gb|AAZ25196.1| putative membrane protein [Colwellia psychrerythraea 34H]
          Length = 358

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 92/354 (25%), Positives = 153/354 (43%), Gaps = 94/354 (26%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAIA 89
           R  +LD FRG+ +ALMILV+  G  W  +     HA W+G    D V PFFLFI+G A+ 
Sbjct: 3   RYLALDAFRGITIALMILVNTPG-TWSHVYAPLLHAEWDGATPTDLVFPFFLFIIGSAMF 61

Query: 90  LALKRIPDRA--DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            + K+    A  +  +K+I R   + F G +L        + + + V+    R+ G+LQR
Sbjct: 62  FSFKKSNFSASPEQFRKIIKRGFIMFFIGFML--------NVIPFTVNAEDWRIMGILQR 113

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           I ++Y + + + +          ++ R  +F       + +A +L+ Y ALL        
Sbjct: 114 IGIAYTVAACLVL----------TLNRTGVF-------IASAVILLAYWALL-------- 148

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                            ++ G    L    N +  +D  V G NHMY             
Sbjct: 149 -----------------LSMG-EGALTIEGNIIRQLDLAVFGANHMY------------- 177

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                  +R  A       FEPEGLLS++ +I++ ++G      +   +   + + +   
Sbjct: 178 ------TMRGVA-------FEPEGLLSTIPAIVNMLLGFELTRYLTSIEDKRSSVIKLTL 224

Query: 328 MGFALLIFGLTLHFTN----AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           +G      GL + F       +P+NK L+T SYV  ++G A L+ +A   L+D 
Sbjct: 225 IG------GLAVGFGALWGLVLPINKSLWTPSYVIYSTGFACLLLAAFIWLIDI 272


>gi|294463099|gb|ADE77087.1| unknown [Picea sitchensis]
          Length = 218

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 46/90 (51%), Positives = 64/90 (71%), Gaps = 1/90 (1%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLLSS+ ++++  IG+HFGH+++H KGH  R+ Q +     L+ FG+ L     +P
Sbjct: 44  FDPEGLLSSIMAVVTCFIGLHFGHILVHFKGHSERVLQCIIPSLGLIFFGIALGIF-GMP 102

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           LNK LYT +Y+CVTSGAA L+F  IY LVD
Sbjct: 103 LNKPLYTFNYMCVTSGAAGLLFVGIYLLVD 132


>gi|404485250|ref|ZP_11020448.1| hypothetical protein HMPREF9448_00860 [Barnesiella intestinihominis
           YIT 11860]
 gi|404338685|gb|EJZ65130.1| hypothetical protein HMPREF9448_00860 [Barnesiella intestinihominis
           YIT 11860]
          Length = 390

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 160/372 (43%), Gaps = 91/372 (24%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL SLDI RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLSLDILRGITIAGMIMVNNPGSWGYIYAPLGHAEWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKRIPDR-ADAVK-KVIFRT-------LKLLFWGILLQGGFSHAPDELTYGV----- 134
            ++L++   R + AV  K+I RT       L + ++G+ ++       + L++       
Sbjct: 66  YMSLRKFDFRLSGAVAWKIIRRTIVIFAIGLAIAWFGLTMRTYHQLGEESLSFFERLGRS 125

Query: 135 --DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
             +   IR+ GV+ R+A+ Y + + + +  K                 Y  H  + +  L
Sbjct: 126 MWNFDHIRILGVMPRLAICYGVAAFIALIVK---------------HKYIPH--IVSVTL 168

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           + Y  +L                    GK F  +           N +  +DR +LG NH
Sbjct: 169 IAYFVILIT------------------GKGFEFS---------EDNIISVVDRAILGSNH 201

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           MYH                                +PEGLLS++ SI   ++G+  G +I
Sbjct: 202 MYHDNG--------------------------LALDPEGLLSTIPSICHVLVGIFCGGLI 235

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           + TK +  R++     G  L   GL L +    P++K++++ ++V  T G AA   + + 
Sbjct: 236 MRTKDNAVRMQNLFIAGTILTFAGLLLEY--GCPISKKIWSPTFVLTTCGLAASSLALLI 293

Query: 373 ALVDCNIFGPWT 384
            ++D   +  W+
Sbjct: 294 WIIDIKGYKKWS 305


>gi|421110364|ref|ZP_15570862.1| PF07786 family protein [Leptospira santarosai str. JET]
 gi|410804289|gb|EKS10409.1| PF07786 family protein [Leptospira santarosai str. JET]
          Length = 375

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 97/352 (27%), Positives = 151/352 (42%), Gaps = 93/352 (26%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            +++S     R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKQSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVD 135
           FLF VG +I ++L  K   +R+D    +  R+  L+  G+ L   G +S A         
Sbjct: 60  FLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-------- 111

Query: 136 VRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
              +R+ GVLQRI   Y +V SL  +F        + +  FS+  L    W++    L  
Sbjct: 112 ---LRIPGVLQRIGFVYWVVASLCLVF------PGKKILVFSVPILLIHTWILTQIAL-- 160

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
                     P  +  +  +   D G                     +IDR + G  H+ 
Sbjct: 161 ----------PG-ESVVSLEQGKDIG--------------------AWIDRTIFGEKHL- 188

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
               WR SK                       ++PEG LS V+S+++T+ GV  G ++  
Sbjct: 189 ----WRFSKT----------------------WDPEGFLSGVASVVTTLFGVLCGFIL-- 220

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
                 R ++   +G  +L   + L +  ++P+NK L+T SY   T+G + L
Sbjct: 221 ----FLRERRNKILGLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLSFL 268


>gi|209523049|ref|ZP_03271606.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|376001698|ref|ZP_09779557.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
           8005]
 gi|423062475|ref|ZP_17051265.1| hypothetical protein SPLC1_S033650 [Arthrospira platensis C1]
 gi|209496636|gb|EDZ96934.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|375329927|emb|CCE15310.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
           8005]
 gi|406716383|gb|EKD11534.1| hypothetical protein SPLC1_S033650 [Arthrospira platensis C1]
          Length = 378

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 102/362 (28%), Positives = 154/362 (42%), Gaps = 87/362 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG+A+A MILV++ G     +P + HA W+GC   D V P FL I+GVAIA
Sbjct: 8   MRLISLDVFRGIAIAAMILVNNPGSWGYMYPVLQHAEWDGCTPTDVVFPSFLLIMGVAIA 67

Query: 90  LALK------RIPDRA--DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
            +L       R+P      +V   I R   LLF   L   GF H         D+  IR+
Sbjct: 68  FSLSKFAREHRLPGEKVPPSVYSRIGRRCLLLFLLGLFLNGFPH--------YDLANIRI 119

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQRIA++Y L ++  +     Q                  WL++   L+ Y   +  
Sbjct: 120 MGVLQRIAIAYGLTAIAILNLSRRQ-----------------LWLISILTLIGYWVAM-- 160

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                   TII   S  YG            L+P  N   +ID+ +LG +H+        
Sbjct: 161 --------TIIPVPS--YGP---------GNLSPEGNLGAFIDQTILGSHHL-------- 193

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                               W   P++PEGL S+  + ++ I+G   G  +     +   
Sbjct: 194 --------------------WRGGPYDPEGLFSTAPATVTVILGYLTGEWLKSQPRNSFT 233

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
           +   V    + L+ G    +    P+NK L+T S+V VT+G   L+ +  Y +++   + 
Sbjct: 234 VINLVMFALSSLVVGYL--WGVWFPINKALWTSSFVLVTAGWGLLLLAFCYGVIEVKNWR 291

Query: 382 PW 383
            W
Sbjct: 292 RW 293


>gi|301625227|ref|XP_002941812.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Xenopus (Silurana) tropicalis]
          Length = 370

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 56/155 (36%), Positives = 88/155 (56%), Gaps = 13/155 (8%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
           SE +  +Q +    +++RL SLD FRG ++ +M+ V++ GG +    HAPWNG  +AD V
Sbjct: 190 SEDNCGEQSKVP--ESRRLYSLDTFRGFSLTIMVFVNYGGGGYWFFEHAPWNGLTVADLV 247

Query: 77  MPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGI-LLQGGFSHAPDELT 131
           MP+F+FI+G ++ALA    LKR   R   + K+ +RT  L   G+  L  G +  P    
Sbjct: 248 MPWFVFIIGTSVALAFNAMLKRGLSRCQLLYKLTWRTCILFAIGVFFLNYGPADGP---- 303

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ 166
             +  R  R+ GVLQR+  +Y +++L+      VQ
Sbjct: 304 --LSWRWARIPGVLQRLGFTYFVIALLHTCFHKVQ 336


>gi|352080530|ref|ZP_08951469.1| putative transmembrane protein [Rhodanobacter sp. 2APBS1]
 gi|351683811|gb|EHA66887.1| putative transmembrane protein [Rhodanobacter sp. 2APBS1]
          Length = 353

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 101/356 (28%), Positives = 147/356 (41%), Gaps = 100/356 (28%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RLASLD  RG  VA M+LV+  G DW  +     HA W+GC   D V PFFLF+VGV++
Sbjct: 2   KRLASLDALRGCTVAAMLLVNDPG-DWGHVYWPLEHAAWHGCTPTDLVFPFFLFVVGVSV 60

Query: 89  ALA-LKRIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           ALA L R+   A      +   +R L++L  G+ +            + +    +R  GV
Sbjct: 61  ALAILPRLEQGAAPSALTRAATWRALRILALGVAIN-------LLAAWLLPQAHLRFPGV 113

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL-YGTY 203
           LQRIAL +  V+L  I TK                     W   A +L+ Y  LL  G  
Sbjct: 114 LQRIALCFAGVALFAIHTKPRT-----------------QWWAIAALLIGYWGLLRLGGS 156

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           +  W       DSA +G+                    Y+     G  H           
Sbjct: 157 LEPWTNLASRVDSAVFGRFV------------------YLIDPASGRGH----------- 187

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                                   +PEGLLS++ S+  T++G+  G  +        R +
Sbjct: 188 ------------------------DPEGLLSTLPSLAGTLLGLRMGCWL--------RRE 215

Query: 324 QWVTM---GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           Q+ T+   G A L+ G    ++  +P NK L+T S+V  T+G A L   A + L+D
Sbjct: 216 QFRTLLLAGIACLLLGAL--WSPWLPFNKNLWTPSFVLWTTGWATLALLAFHVLID 269


>gi|343082900|ref|YP_004772195.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342351434|gb|AEL23964.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 365

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 99/352 (28%), Positives = 153/352 (43%), Gaps = 86/352 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD  RG  +A MILV++ G     +P + HA W+G  + DF+ PFF+F+VGV++A
Sbjct: 6   TRLISLDALRGFTIAAMILVNYPGSWSHVYPPLLHAEWHGMTMTDFIFPFFIFMVGVSVA 65

Query: 90  LALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            A  +  D    +A   KK+I R +KL   GI L    +  PD      D   +R+ GVL
Sbjct: 66  FAYSKRLDEGVPKAGMYKKIIIRAIKLFVLGIFL----NLIPD-----FDFSHMRIAGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI++ +L  S + +   +V  K Q+               +   +LV Y   +  T +P
Sbjct: 117 QRISIVFLASSFLFL---NVGWKKQAY--------------LGGGILVFYWLCM--TLIP 157

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                     + D G V          L P  N   +IDR  L                 
Sbjct: 158 ----------TPDEGVVL---------LEPGRNLAAWIDRMFL----------------- 181

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                   P +    +W     +PEG  +++ +I++ I+G+  G +++        LK  
Sbjct: 182 --------PGKMWEGTW-----DPEGFFTTLPAIVTGILGLLAGRILLSNFNE--NLKSN 226

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
             M   L++  + L +    P+NK L+T S+  +T GA  L   A Y LVD 
Sbjct: 227 YLMSIGLVLVLIGLLWAQVFPINKHLWTSSFTLITGGAGFLGLGASYFLVDI 278


>gi|150004749|ref|YP_001299493.1| transmembrane protein [Bacteroides vulgatus ATCC 8482]
 gi|294775179|ref|ZP_06740705.1| putative membrane protein [Bacteroides vulgatus PC510]
 gi|149933173|gb|ABR39871.1| putative transmembrane protein [Bacteroides vulgatus ATCC 8482]
 gi|294450991|gb|EFG19465.1| putative membrane protein [Bacteroides vulgatus PC510]
          Length = 363

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 89/359 (24%), Positives = 157/359 (43%), Gaps = 86/359 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + HA +NG    D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGYVYTPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+ L      A    T+ ++   +R  GV+QR
Sbjct: 67  ISLRKYNFTYSHATLRKIVKRTVVIFCIGLFLN---LLAKSVFTHHLNFEELRYLGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV---VYLALLYGTYV 204
           +A+ Y + SLV I  K                    H    A +LV   VY  LL     
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAVYFLLL----- 158

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                          G  FN++           N V   D   LG +HMYH         
Sbjct: 159 -------------AMGDGFNLSA---------TNIVARFDVWALGTSHMYH--------- 187

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                  +G +           F+PEGLLS++ ++   ++G + G ++   K +  ++++
Sbjct: 188 -------DGGM----------AFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQR 230

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
              +G  L   G  L +    P+NK++++ ++V  T G A+   + +  ++D   +  W
Sbjct: 231 LFLVGTILTFAGFLLSY--GCPINKKVWSPTFVITTCGLASSFLALLIWIIDIKGYQGW 287


>gi|319641808|ref|ZP_07996487.1| transmembrane protein [Bacteroides sp. 3_1_40A]
 gi|345518546|ref|ZP_08797993.1| hypothetical protein BSFG_02387 [Bacteroides sp. 4_3_47FAA]
 gi|254835931|gb|EET16240.1| hypothetical protein BSFG_02387 [Bacteroides sp. 4_3_47FAA]
 gi|317386564|gb|EFV67464.1| transmembrane protein [Bacteroides sp. 3_1_40A]
          Length = 363

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 89/359 (24%), Positives = 157/359 (43%), Gaps = 86/359 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + HA +NG    D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGHVYTPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+ L      A    T+ ++   +R  GV+QR
Sbjct: 67  ISLRKYNFTYSHATLRKIVKRTVVIFCIGLFLN---LLAKSVFTHHLNFEELRYLGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV---VYLALLYGTYV 204
           +A+ Y + SLV I  K                    H    A +LV   VY  LL     
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAVYFLLL----- 158

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                          G  FN++           N V   D   LG +HMYH         
Sbjct: 159 -------------AMGDGFNLSA---------TNIVARFDVWALGTSHMYH--------- 187

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                  +G +           F+PEGLLS++ ++   ++G + G ++   K +  ++++
Sbjct: 188 -------DGGM----------AFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQR 230

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
              +G  L   G  L +    P+NK++++ ++V  T G A+   + +  ++D   +  W
Sbjct: 231 LFLVGTILTFAGFLLSY--GCPINKKVWSPTFVITTCGLASSFLALLIWIIDIKGYQGW 287


>gi|338212268|ref|YP_004656323.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336306089|gb|AEI49191.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 365

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 98/355 (27%), Positives = 152/355 (42%), Gaps = 90/355 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           +T RL SLD  RG  +A M++V+  G +   +  + H  WNG +  D V P FLF+VGV+
Sbjct: 4   QTNRLVSLDALRGFTIAAMLMVNFPGSEEYVFFTLRHTKWNGLSFTDLVAPIFLFVVGVS 63

Query: 88  IALAL-KRIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           I  A  KR  D     +  +K+I R+LK+   G+ L          L    D   IR  G
Sbjct: 64  IVFAYSKRKWDGRPTGELYRKIIIRSLKIFAVGMFLN---------LMPTFDFSDIRWTG 114

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
            L RIA  +L  +++ +   +   K Q+             W+ A  ++  +LAL   T 
Sbjct: 115 TLHRIAFVFLGCAVLYL---NTNWKQQA-------------WVGAVILVAYWLAL---TL 155

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           +P          +   GKV          L P  N V + D + L               
Sbjct: 156 IP----------TPGIGKV---------MLEPGVNLVAWFDTQFL--------------- 181

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                     P +    +W     +PE +LS+  SI+S I G+  G ++  T     ++ 
Sbjct: 182 ----------PGKMWQGTW-----DPESILSTFPSIVSGITGMLAGQLLQSTFTPNEKVN 226

Query: 324 QWVTMGFALLIFGLTLHFTNAI--PLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             +T G    +F   L +   +  P+N+ L+T S+V VTSG A L+  A+Y +VD
Sbjct: 227 YLMTAG----VFSAALGYFWGLGFPVNENLWTSSFVLVTSGFACLLLGALYFMVD 277


>gi|340618131|ref|YP_004736584.1| hypothetical protein zobellia_2146 [Zobellia galactanivorans]
 gi|339732928|emb|CAZ96303.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
          Length = 367

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 98/349 (28%), Positives = 142/349 (40%), Gaps = 86/349 (24%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +R  +LD+FRGL + LMI+V+  G DW      + HA W+G    D V P FLF VG A 
Sbjct: 2   KRFKALDVFRGLTICLMIIVNTPG-DWDMTFSPLLHAKWHGFTPTDLVFPSFLFAVGNAF 60

Query: 89  ALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGV 144
           A    +  D+  +D  KK+  RTL +   G  +     FS         V     R+ GV
Sbjct: 61  AFVKTKWADKPLSDIFKKIAKRTLIIFLLGYTMYWIPFFSWTETGDLAAVPFSETRILGV 120

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL--YGT 202
           LQRIAL Y + +++  F  + Q    S                 A +L+ Y  LL  +G 
Sbjct: 121 LQRIALCYFIGAIMIYFLTNRQLIIAS-----------------AVILLGYWGLLSAFGD 163

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
           Y  +                               N V  IDR +LG +H+Y        
Sbjct: 164 YTLE------------------------------GNFVRTIDRMLLGDSHLYMGNG---- 189

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                                  PF+PEGLLS++ SI + + G   G  II       +L
Sbjct: 190 ----------------------IPFDPEGLLSTLPSICNVLGGYLVGKYIIDKGIDYEKL 227

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
            + + +G  LL+       T   P+NK+L+T S+V +T G   +V S +
Sbjct: 228 AKMLLVGAGLLVVAYLWDLT--FPVNKKLWTSSFVVLTVGLDIVVLSVL 274


>gi|319952891|ref|YP_004164158.1| heparan-alpha-glucosaminide n-acetyltransferase [Cellulophaga
           algicola DSM 14237]
 gi|319421551|gb|ADV48660.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
           algicola DSM 14237]
          Length = 363

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 100/349 (28%), Positives = 159/349 (45%), Gaps = 81/349 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
           +R+ ++DIFRG+ + LMILV++ G  W  +     HA W+G    D V PFFLFIVG +I
Sbjct: 3   ERIVAVDIFRGMTIVLMILVNNPG-TWAAVYAPFLHADWHGYTPTDLVFPFFLFIVGTSI 61

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             A       AD  KK++ R+LKL+  G+LL       P    +  D   IR  GVLQRI
Sbjct: 62  VFAYSTKKPTADTYKKIVSRSLKLIGLGLLLGAFTLVFP----FVKDFSEIRFPGVLQRI 117

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
            + + + S+                   +F  + W  L+   V ++    L   ++P   
Sbjct: 118 GVVFFITSI-------------------LFLNFNWKQLIGVTVFILIGYWLAMGFIP--- 155

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
              +N  ++ + +              P N   Y+D  +LG +HM+              
Sbjct: 156 ---VNGIASTFDR-------------APNNLANYVDLNILG-SHMW-------------- 184

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
                   KD        ++PEGLLS++ +I S ++GV  G +++  + +  +    + +
Sbjct: 185 --------KD-------DYDPEGLLSTIPAIASCLLGVFTGKILLSKQQN--KTTIVLGL 227

Query: 329 GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           G  LLI G   HF    P+NK L++ S+V VT+G A L+ + IY   D 
Sbjct: 228 GLILLIVGHAWHFI--FPINKALWSSSFVLVTAGWANLILALIYYSTDV 274


>gi|294895713|ref|XP_002775269.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239881343|gb|EER07085.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 323

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 90/339 (26%), Positives = 127/339 (37%), Gaps = 99/339 (29%)

Query: 41  FRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRAD 100
            RG+ +++M++VD  G   P I HAPWNG +LAD VMP F+FI                 
Sbjct: 1   MRGVVMSIMLIVDVCGKAVPSIGHAPWNGLHLADIVMPGFIFI----------------- 43

Query: 101 AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
                                      D LT G+D+   R  G+LQRIA+ Y    L+  
Sbjct: 44  ---------------------------DTLTVGLDLYTFRAPGILQRIAVCYAAAVLLAK 76

Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG 220
              D+   D   G               + VLVV L  +    V +W   ++        
Sbjct: 77  LVSDLSPNDTVKGALK----------NNSRVLVVGLLCI----VINWAIMLLGPQPKGCP 122

Query: 221 KVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAP 280
                    R  L P CN    IDR V G  HMY +P W                     
Sbjct: 123 ---------RGSLTPQCNVASNIDRMVFGPEHMY-NPLW--------------------- 151

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLH 340
                  +PEGLLS++ S+ +  +G+  G  I     H   L + V  G  L + G+ L 
Sbjct: 152 -------DPEGLLSTLPSLATVALGLACGKFIQSRPSH-TELLRLVGCGLLLDLCGMGLG 203

Query: 341 FTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNI 379
               IP++K L+T SY  +T G        + + V  N+
Sbjct: 204 IV--IPVSKVLWTPSYCLLTGGICVAFLGIVSSRVGGNV 240


>gi|325286182|ref|YP_004261972.1| heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
           lytica DSM 7489]
 gi|324321636|gb|ADY29101.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
           lytica DSM 7489]
          Length = 361

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 97/355 (27%), Positives = 155/355 (43%), Gaps = 91/355 (25%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
            R+ S+DIFRG+ + LMILV++ G  W  +     HA W+G    D V PFFLFIVG++I
Sbjct: 3   NRVVSVDIFRGITIVLMILVNNPG-TWSSVYAPFLHADWHGYTPTDLVFPFFLFIVGISI 61

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             A           +K++ R+LKL+  G+ L G F+ +     +  D   IR  GVLQRI
Sbjct: 62  VYAYHTKEVTGKTYRKIVIRSLKLIGLGLFL-GAFTLS---FPFFKDFNDIRFPGVLQRI 117

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
            L +   ++                   +F    W  L+A C  ++ +  L+  +VP   
Sbjct: 118 GLVFFFTAI-------------------LFIKLNWKALVAVCAAILIMYWLWMGFVP--- 155

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
              IN  +  + +              P N   +ID KVLG +HM     W+        
Sbjct: 156 ---INGTAPTFDR-------------APNNWANFIDLKVLG-SHM-----WKTD------ 187

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-----IHTKGHLARLK 323
                             ++PEG+LS++ +I ++++GV  G ++       T+  L    
Sbjct: 188 ------------------YDPEGVLSTLPAIATSLLGVFVGLLLKSAYKKKTQILLLLGV 229

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
             +T G    +F          P+NK L++ S+V VT+G A ++ + IY   D  
Sbjct: 230 SLLTAGHIWDLF---------FPINKALWSSSFVLVTAGWATIILAVIYYFSDVK 275


>gi|323447301|gb|EGB03229.1| hypothetical protein AURANDRAFT_68196 [Aureococcus anophagefferens]
          Length = 399

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 57/137 (41%), Positives = 79/137 (57%), Gaps = 12/137 (8%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
            + +S  +  R+ SLD+ RG AV LMI VD AG  +  + H+PW+G  +AD VMPFF+F+
Sbjct: 4   NEPESARRPPRVRSLDVVRGFAVLLMIFVDDAGSAYAVLDHSPWDGLTIADVVMPFFIFM 63

Query: 84  VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY--GVDVRMI 139
           VGV+ ALAL  KR       +  V+ R   L  W + +       PD  TY  G D+  +
Sbjct: 64  VGVSAALALGGKRT------LAPVLRRGATL--WVVGVAVQGGGLPDPTTYAWGYDLGTV 115

Query: 140 RLCGVLQRIALSYLLVS 156
           R CG+LQRIA  Y++ S
Sbjct: 116 RWCGILQRIAACYVVAS 132



 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 34/119 (28%), Positives = 49/119 (41%), Gaps = 25/119 (21%)

Query: 282 WCHAP--FEPEGLLSSVSSILSTIIGVHFGHVIIHTKG-HLARLKQWVTMGFALLIFG-- 336
           WC AP   +PEG LS V ++ + +IG  F   +   +     R       G+A L+ G  
Sbjct: 167 WCAAPGVADPEGFLSGVLAVATAVIGALFARCLERCRAPRPGRDSDDRGGGYARLVDGDG 226

Query: 337 -----LTLHFTNAI---------------PLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
                L LH+  A                P+NKQL+T SY   T+       +A  AL+
Sbjct: 227 DRRGALMLHWALASLALAAAALVAVAAGSPVNKQLWTPSYCLATAALCGFALTAAVALL 285


>gi|422005552|ref|ZP_16352731.1| hypothetical protein LSS_18678 [Leptospira santarosai serovar
           Shermani str. LT 821]
 gi|417255773|gb|EKT85231.1| hypothetical protein LSS_18678 [Leptospira santarosai serovar
           Shermani str. LT 821]
          Length = 375

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 99/352 (28%), Positives = 153/352 (43%), Gaps = 93/352 (26%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            +++S     R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKQSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVD 135
           FLF VG +I ++L  K   +R+D    +  R+  L+  G+ L   G +S A         
Sbjct: 60  FLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-------- 111

Query: 136 VRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
              +R+ GVLQRI   Y +V SL  +F      K   V    I  ++ W        ++ 
Sbjct: 112 ---LRIPGVLQRIGFVYWVVASLCLVF----PGKKILVFLVPILLIHTW--------ILT 156

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
            +AL   + V           S + GK                +   +IDR + G  H+ 
Sbjct: 157 QIALPGESVV-----------SLEQGK----------------DIGAWIDRTIFGEKHL- 188

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
               WR SK                       ++PEG LS V+S+++T+ GV  G ++  
Sbjct: 189 ----WRFSKT----------------------WDPEGFLSGVASVVTTLFGVLCGFIL-- 220

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
                 R ++   +G  +L   + L +  ++P+NK L+T SY   T+G + L
Sbjct: 221 ----FLRERRNKILGLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLSFL 268


>gi|372223192|ref|ZP_09501613.1| hypothetical protein MzeaS_12804 [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 364

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 102/357 (28%), Positives = 159/357 (44%), Gaps = 85/357 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVA 87
            +R+ ++DIFRG+ ++LM+LV+  G  W  +     HA W+G    D V PFFLFIVG +
Sbjct: 4   NKRIVAVDIFRGMTISLMVLVNTPG-TWSSVYSPFLHAQWHGYTPTDLVFPFFLFIVGTS 62

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           I  A K         KK+  R LKL+  G+ L G F+ +     +  D   IR  GVLQR
Sbjct: 63  IVFAYKNKKPSLKTYKKIGVRALKLIILGLFL-GAFTLS---FPFFKDFENIRFPGVLQR 118

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           I + + + S                   ++F    W  L     + V L LL+G +    
Sbjct: 119 IGVVFFITS-------------------ALFLNLNWKKL-----IFVVLVLLFGYWA--- 151

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
             T+I  D          +   RA    P N   ++D ++ G +H Y             
Sbjct: 152 LMTLIPVDGQP-------STLTRA----PNNLANWLDVQLFG-SHNY------------- 186

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                   + D        ++PEG LS++ +I S ++G+  G +++  +    +      
Sbjct: 187 --------KPD--------YDPEGFLSTLPAIASALLGIFTGEILVGKQNK--KTPLLFG 228

Query: 328 MGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN--IFG 381
           +G  LLI G   H  + + P+NK L+T S+V VT+G A +V + I+ L D    IFG
Sbjct: 229 LGGCLLIIG---HLWDIVFPINKALWTSSFVLVTAGWANVVLAIIHYLSDDRKLIFG 282


>gi|405957484|gb|EKC23691.1| Heparan-alpha-glucosaminide N-acetyltransferase [Crassostrea gigas]
          Length = 1901

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 95/375 (25%), Positives = 160/375 (42%), Gaps = 81/375 (21%)

Query: 22   SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
            +D+Q+ S  K +RL SLD FRG                                    F+
Sbjct: 1531 ADEQD-SRPKKERLKSLDTFRG------------------------------------FI 1553

Query: 82   FIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
            +I+G A+A +    L+R+  +     K+  R+  L   G+L+  G    P  LT+     
Sbjct: 1554 WIMGTAMAYSFTGMLRRVTPKHKIFWKIFKRSCILFVLGLLVNTGGCD-PTRLTH----- 1607

Query: 138  MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV--GRFSIFRLYCWHWLMAACVLVVY 195
             +R+ GVLQR A +YL+V+ + +F     D       G       +   W++    + V+
Sbjct: 1608 -LRIPGVLQRFAGTYLVVASIHMFFAKTVDVSMYTYWGFIRDIVDFWLEWILHIVFVTVH 1666

Query: 196  LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN--AVGYIDRKVLGINHM 253
            + + +   VP        K     G +        A +   C   A GYIDR+V G +H+
Sbjct: 1667 IIITFTLDVPG-----CGKGYIGPGGLHEAVNSTEASVYQNCTGGAAGYIDRQVFGDDHI 1721

Query: 254  YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
            Y  P  +             P+ K        P++PEGLL +++S+    +G+  G +++
Sbjct: 1722 YQSPTCK-------------PIYKTT-----VPYDPEGLLGTLNSVFMCYLGLQAGKILM 1763

Query: 314  HTKGHLARLKQWVTMG-FALLIFGLTLHFTN---AIPLNKQLYTLSYVCVTSGAAALVFS 369
              K   AR+K+++  G F  LI G    F      IPLNK L++LS+V   +G A ++ +
Sbjct: 1764 TFKEPSARVKRFLIWGLFLCLIAGALCGFKKDGGTIPLNKNLWSLSFVLCMAGFAFVLLA 1823

Query: 370  AIYALVDCNIFGPWT 384
              Y  +D  ++  W+
Sbjct: 1824 FCYVTID--VYKVWS 1836


>gi|408382946|ref|ZP_11180487.1| heparan-alpha-glucosaminide N-acetyltransferase [Methanobacterium
           formicicum DSM 3637]
 gi|407814484|gb|EKF85111.1| heparan-alpha-glucosaminide N-acetyltransferase [Methanobacterium
           formicicum DSM 3637]
          Length = 382

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 99/366 (27%), Positives = 154/366 (42%), Gaps = 85/366 (23%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEI-SHAPWNGCNL 72
           ++    +   + + +K +R+ SLD+FRGLAVA MI V+        P I  HA WNG   
Sbjct: 5   VNTNSTNSTVKTNAVKKRRVISLDVFRGLAVAAMIFVNAMAFSEFTPGIFEHATWNGLTF 64

Query: 73  ADFVMPFFLFIVGVAIA--LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           AD V P FLFIVGV++A   A +    + D     +FR   L   G+ L           
Sbjct: 65  ADLVFPSFLFIVGVSMAYSFAARSKNSKRDLWGHFLFRVGALFTIGVALN---------- 114

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
            +  D  M+R+ GVLQ IAL+ L  S +  F                     W  L+A  
Sbjct: 115 WFTSDFSMVRIPGVLQLIALASLFASPMARFKPR------------------WILLVAGV 156

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           +L+++  +L G   P                   +  G   K N   N   +ID +VL +
Sbjct: 157 LLLIHGFILLGVGAP------------------GIPAGTLEKGN---NIDDWIDIQVLTV 195

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           NH             T D                A  +PEG+LS +++    ++G+  G 
Sbjct: 196 NH-------------TID----------------ANGDPEGILSIITATALVLLGLCVGR 226

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
            +   K +L  +   +  G   L+ GL L  +  +P+NKQL+T S++ V +G   L  + 
Sbjct: 227 TLQLRKHNLKTIGILLAGGAISLLLGLAL--SQILPINKQLWTSSFILVCAGIGTLFLTI 284

Query: 371 IYALVD 376
           ++  +D
Sbjct: 285 LFWYLD 290


>gi|423312333|ref|ZP_17290270.1| hypothetical protein HMPREF1058_00882 [Bacteroides vulgatus
           CL09T03C04]
 gi|392688817|gb|EIY82101.1| hypothetical protein HMPREF1058_00882 [Bacteroides vulgatus
           CL09T03C04]
          Length = 363

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 89/359 (24%), Positives = 157/359 (43%), Gaps = 86/359 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +RL +LDI RG+ +A MILV++ G     +  + HA +NG    D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGYVYTPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 90  LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           ++L++         ++K++ RT+ +   G+ L      A    T+ ++   +R  GV+QR
Sbjct: 67  ISLRKYNFTYSHAILRKIVKRTVVIFCIGLFLN---LLAKSVFTHHLNFEELRYLGVMQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV---VYLALLYGTYV 204
           +A+ Y + SLV I  K                    H    A +LV   VY  LL     
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAVYFLLL----- 158

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                          G  FN++           N V   D   LG +HMYH         
Sbjct: 159 -------------AMGDGFNLSV---------TNIVARFDVWALGTSHMYH--------- 187

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                  +G +           F+PEGLLS++ ++   ++G + G ++   K +  ++++
Sbjct: 188 -------DGGM----------AFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQR 230

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
              +G  L   G  L +    P+NK++++ ++V  T G A+   + +  ++D   +  W
Sbjct: 231 LFLVGTILTFAGFLLSY--GCPINKKVWSPTFVITTCGLASSFLALLIWIIDIKGYQGW 287


>gi|375149723|ref|YP_005012164.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361063769|gb|AEW02761.1| Protein of unknown function DUF2261, transmembrane [Niastella
           koreensis GR20-10]
          Length = 368

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 104/367 (28%), Positives = 155/367 (42%), Gaps = 86/367 (23%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +  +R  SLD+FRGL + LMI+V+  G     +  + HA WNG  LAD V P FLF VG 
Sbjct: 2   IPNKRFISLDVFRGLIICLMIIVNTPGSHDTSFALLQHANWNGFTLADLVFPSFLFAVGN 61

Query: 87  AIALALK--RIPDRADAVKKVIFRT------LKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           A+  +++  +   +   + K+  RT        LL+W    +   ++    + +      
Sbjct: 62  ALFFSMQKWKTMTQGQVLAKIGKRTLLLFLLGYLLYWFPFFE---TNTQGHIVFK-SFAG 117

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
            R+ GVLQRIAL Y + SL+  + K                      +++A +LV Y  L
Sbjct: 118 TRIMGVLQRIALCYGIASLLIYYLKPKGAL-----------------IVSAIILVAYPGL 160

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
           L+      W          D G           KLN   NAV   D  +LG +HM H   
Sbjct: 161 LF------WL--------GDPGN----------KLNMVGNAVTKFDLWLLGPDHMNHGEV 196

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
                                      PFEPEG+LS++ +I + + G   G   I T G 
Sbjct: 197 --------------------------VPFEPEGILSTLPAITNVVAGYLVGWY-IQTAGK 229

Query: 319 LAR-LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
             R L + +  G  L   GL  ++   IP+NK L+T S+V  ++G   L+ +AI  + D 
Sbjct: 230 TKRMLLRLIATGAGLTFLGLCWNY--VIPINKNLWTSSFVVHSTGLDCLLLAAIIYIADF 287

Query: 378 NIFGPWT 384
                WT
Sbjct: 288 LNITRWT 294


>gi|421093382|ref|ZP_15554106.1| putative membrane protein [Leptospira borgpetersenii str.
           200801926]
 gi|410363365|gb|EKP14394.1| putative membrane protein [Leptospira borgpetersenii str.
           200801926]
          Length = 383

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 95/349 (27%), Positives = 145/349 (41%), Gaps = 82/349 (23%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            ++KS     R+ SLD+FRG+ V  MILV++ G  W  +     HA WNGC   D V PF
Sbjct: 1   MEKKSTQNKDRILSLDLFRGMTVVGMILVNNPG-SWSYVYSPLKHAEWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FLF VG +I ++L  K   +R      +  R + L     +L G F +   E T+     
Sbjct: 60  FLFAVGASIPISLYSKNGINRIRVWIGICIRGISL-----ILLGLFLNFFGEWTF----S 110

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
            +R+ GVLQRI   Y +V+ +                F IF            VLV  + 
Sbjct: 111 ELRIPGVLQRIGFVYWVVATL----------------FLIFP--------GKKVLVFLIP 146

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +L    V  W  T I                    L    +   +IDR++ G  H+    
Sbjct: 147 IL---LVHTWILTHIAPPGES-----------MVSLEQGKDIGAWIDRRIFGEKHL---- 188

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
            W+ SK                       ++PEG LS ++SI +++ GV  G ++   +G
Sbjct: 189 -WKFSKT----------------------WDPEGFLSGIASIATSLFGVICGFILFRREG 225

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
              + +     G   L   + L +  ++P+NK L+T SY   T+G + L
Sbjct: 226 R-GKNRVLSIFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGLSFL 273


>gi|153808903|ref|ZP_01961571.1| hypothetical protein BACCAC_03204 [Bacteroides caccae ATCC 43185]
 gi|423220258|ref|ZP_17206753.1| hypothetical protein HMPREF1061_03526 [Bacteroides caccae
           CL03T12C61]
 gi|149128236|gb|EDM19455.1| hypothetical protein BACCAC_03204 [Bacteroides caccae ATCC 43185]
 gi|392623335|gb|EIY17438.1| hypothetical protein HMPREF1061_03526 [Bacteroides caccae
           CL03T12C61]
          Length = 371

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 89/362 (24%), Positives = 155/362 (42%), Gaps = 87/362 (24%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
             +RL +LD+ RG+ +A MILV++ G  W  +     HA WNG    D + PFF+FI+G+
Sbjct: 5   SNKRLLALDVMRGITIAGMILVNNPG-SWGYVYFPLKHAQWNGLTPTDLIFPFFMFIMGI 63

Query: 87  AIALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLC 142
           +  ++L++        A  K+I RT+ +   GI +       +  D L +      IR+ 
Sbjct: 64  STYISLRKYNFTFSTPAALKIIKRTIVIFLIGIAINWFALLCYYHDPLPFA----QIRVL 119

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GV+QR+AL Y   +L+ +  K                     +L+ A ++  ++ L+ G 
Sbjct: 120 GVMQRLALCYGASALIALLIKHKYIP----------------YLIVALLVGYFILLITGN 163

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
                            G  +N T           N +  +DR +LG  HMY        
Sbjct: 164 -----------------GFAYNET-----------NILSIVDRSILGDAHMY-------- 187

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
               QD+                  +PEGLLS++ SI   +IG   G +++  K    +L
Sbjct: 188 ----QDN----------------HIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIREKL 227

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGP 382
           ++   +G  L   G  L +    P NK++++ ++V VT G  +   + +  ++D   +  
Sbjct: 228 ERLFLIGTILTFAGFLLSY--GCPFNKKIWSPTFVLVTCGLGSSFLALLVWIIDIKGYKK 285

Query: 383 WT 384
           W+
Sbjct: 286 WS 287


>gi|408673387|ref|YP_006873135.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
 gi|387855011|gb|AFK03108.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
          Length = 423

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 108/401 (26%), Positives = 163/401 (40%), Gaps = 118/401 (29%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           QRL S+D+FRG+ + LM +V++ G DW  I     HA W+GC   D V PFFLFIVG++ 
Sbjct: 3   QRLTSIDVFRGMTIMLMTIVNNPG-DWSHIYAPLEHAEWHGCTPTDLVFPFFLFIVGIST 61

Query: 89  ALA----------LKRIPDRA------------------DAVKKVIFRTLKLLFWGI--- 117
            L+           +RI  RA                    ++ V    ++L+  GI   
Sbjct: 62  VLSSPVKRFDSNTFERIITRALRIFLLGLFLNFFSKIHLGTLEGVPLMLIRLVLTGIATV 121

Query: 118 LLQGGFSHAPD-ELTYGVDVRMIRLC-------------GVLQRIALSYLLVSLVEIFTK 163
           LL G F          G+ V MI LC             GVLQRIA+ YL+VS++   T 
Sbjct: 122 LLLGDFDKKKQFYAAVGLFVFMISLCFSGIEDFASVRIPGVLQRIAMVYLIVSVLYATTN 181

Query: 164 DVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVF 223
                        +  L C         L+ Y AL+    VP       N  +A++ K  
Sbjct: 182 TTTQ--------IVVGLVC---------LLGYWALMALVPVP-------NVGTANFEK-- 215

Query: 224 NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC 283
                         N   +ID  +L   H+     W  SK                    
Sbjct: 216 ------------GTNLAAWIDNLLLP-GHL-----WSVSKT------------------- 238

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
              ++PEG+LS++ +I + + GV  G ++ +      +    ++ G   ++ G    + +
Sbjct: 239 ---WDPEGILSTIPAIGTALAGVFTGKLLTNDFPKNKKAIYLLSAGVIGVMIGFL--WND 293

Query: 344 AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
             P+NK L+T SYV   +G A LV   +Y ++D   F  WT
Sbjct: 294 YFPINKALWTSSYVLYVAGWALLVLGVLYFIIDVLGFEKWT 334


>gi|428210738|ref|YP_007083882.1| hypothetical protein Oscil6304_0209 [Oscillatoria acuminata PCC
           6304]
 gi|427999119|gb|AFY79962.1| hypothetical protein Oscil6304_0209 [Oscillatoria acuminata PCC
           6304]
          Length = 398

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 159/371 (42%), Gaps = 76/371 (20%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPF 79
           +   ++ L + RL SLD+FRG+A+A MILV++ G     +P + HAPW+G    D + P 
Sbjct: 9   NPSVQNLLNSMRLTSLDVFRGMAIASMILVNNPGSWQQVYPPLLHAPWHGFTPTDLIFPA 68

Query: 80  FLFIVGVAIALALKRIPDRADA--VKKVIFRTLK---LLFWGILLQGGFSHAPDELTYG- 133
           FLFI GVA+A +  +  +  ++     V F+ L+   +LF   L   G +     L  G 
Sbjct: 69  FLFISGVAMAFSFAKYTNSPNSPPAASVYFKILRRALILFGLGLFLNGSTLVLKTLLQGQ 128

Query: 134 -VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
            +D   +R+ GVLQRI+L+YL  +     T  +    + +G        C          
Sbjct: 129 PLDFGTLRIMGVLQRISLAYLFGA-----TAILNLSRRRLG------FLC---------- 167

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
              LA+L G +    Q  +      D       T             V Y+DR +L   H
Sbjct: 168 ---LAILLGYWFALTQIPVPGYGPGDLSAKGAGTL------------VAYLDRLILTPPH 212

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +              D  FE                PEGLLS++ S+++ ++G   G  +
Sbjct: 213 ILG------------DGSFE----------------PEGLLSTLPSVVTLLLGFFIGDWL 244

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
              K  +         G A++       +    P+NKQL+T SYV +++G + L+ +A Y
Sbjct: 245 --QKQPVTSRTSLQMAGVAVVTIVTGSLWGLVFPINKQLWTSSYVVLSAGWSLLLLAACY 302

Query: 373 ALVDCNIFGPW 383
            LV+   +  W
Sbjct: 303 ELVEVRQWRSW 313


>gi|393725858|ref|ZP_10345785.1| hypothetical protein SPAM2_19574 [Sphingomonas sp. PAMC 26605]
          Length = 400

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 97/348 (27%), Positives = 158/348 (45%), Gaps = 73/348 (20%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGV 86
           +  RL +LD+ RGLAVA MILV   G DW     ++ HA W+G  LAD V P FLF VG+
Sbjct: 4   RLPRLEALDVLRGLAVAGMILVVSPG-DWSMAYAQLQHAAWHGATLADMVFPTFLFSVGM 62

Query: 87  AIALALKRIPDRADAVKKVIF------RTLKLLFWGILLQGGF-----SHAPDELTYGVD 135
           A+ L+  R+   AD  ++ +F      R++ L+  G++++  +     + AP     G+ 
Sbjct: 63  ALGLSFPRL--MADTAQRRLFWMRLIRRSITLVVLGLVVEATYVWTISAGAPYPGHGGLS 120

Query: 136 VRMIRLCGVLQRIALSYLL-VSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
              +R+ G+LQRI L YLL  +L+ + ++ + D   ++    +  L+C      A +L+ 
Sbjct: 121 --YVRIPGILQRIGLCYLLGGALIVVTSRTIADGRIAIAPQRV--LFC-----IAAILIG 171

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y ALL    VP +   ++  D                      +   ++DR +  + H+ 
Sbjct: 172 YWALLRFVPVPGFGVGLLTPDG---------------------SLPAFVDRTLFTVPHL- 209

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
               W    A  Q     GP          A ++PEGLLS++ +  + + G         
Sbjct: 210 ----WPLGSATGQ-----GP----------ATYDPEGLLSTLPATANLLFGA-LAAWAWR 249

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
                A L   +  G  L+I GL L       +NK+L+T S+   +SG
Sbjct: 250 QNSDRATLHVAIA-GTMLIIAGLALD--PVFEINKRLWTSSFALFSSG 294


>gi|421090259|ref|ZP_15551055.1| putative membrane protein [Leptospira kirschneri str. 200802841]
 gi|421129053|ref|ZP_15589263.1| putative membrane protein [Leptospira kirschneri str. 2008720114]
 gi|410000994|gb|EKO51618.1| putative membrane protein [Leptospira kirschneri str. 200802841]
 gi|410359757|gb|EKP06816.1| putative membrane protein [Leptospira kirschneri str. 2008720114]
          Length = 369

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 95/335 (28%), Positives = 148/335 (44%), Gaps = 82/335 (24%)

Query: 38  LDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALAL- 92
           +D+FRG+ VA MILV++ G  W  I     HA WNGC   D V PFFLF VG++I L++ 
Sbjct: 1   MDLFRGMTVAGMILVNNPG-SWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGISIQLSVY 59

Query: 93  -KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALS 151
            K    ++     +  R++ L+  G+ L      +  EL         R+ GVLQRI   
Sbjct: 60  SKNKIHKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL---------RIPGVLQRIGFV 110

Query: 152 YLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI 211
           Y +V+ + +                  R+    W+    +L+V+  +L     P      
Sbjct: 111 YWIVASLHLILPK--------------RMILISWI---PILLVHTWVLIQIPAPG----- 148

Query: 212 INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPF 271
              +S  Y             L P  +   +IDR V G NH+     W+ SK        
Sbjct: 149 ---ESIVY-------------LEPGKDIGAWIDRNVFGENHL-----WKFSKT------- 180

Query: 272 EGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA 331
                          ++PEGL S +SSI ++++GV  G  I+ +K +  + +     GF 
Sbjct: 181 ---------------WDPEGLFSGISSIATSLLGVFCGS-ILSSKTNEIKKQILSIFGFG 224

Query: 332 LLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
           +L+  + L +   +P+NK L+T SYV  T+G A L
Sbjct: 225 ILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFL 259


>gi|294627662|ref|ZP_06706244.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292598014|gb|EFF42169.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 388

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 94/335 (28%), Positives = 141/335 (42%), Gaps = 75/335 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +L  G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNTPHLQFLGRVSKRAALILLCGVLMYWFPFFHLQPDGGWSFTTVDQLRLTGVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+  +          +                  +L+ Y ALLY    P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCL-----------------ALLLGYWALLYAFGQP 180

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                    A+L+   NA   +D  + G +H+Y           
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                     RKD        F+PEGLL ++S+ ++ + G   G  +      +A  +  
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFLQRQGKTVASTRSL 249

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           +  G  L++  L L +  A PL+K+L++ S+V  T
Sbjct: 250 LLAGAGLVV--LALLWAPAWPLSKKLWSGSFVACT 282


>gi|299140911|ref|ZP_07034049.1| membrane protein [Prevotella oris C735]
 gi|298577877|gb|EFI49745.1| membrane protein [Prevotella oris C735]
          Length = 370

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 102/360 (28%), Positives = 164/360 (45%), Gaps = 77/360 (21%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +  QRL SLD+ RGLA+A MILV++ G  W  I     H+ WNG    D V PFF+F +G
Sbjct: 1   MTQQRLISLDMLRGLAMAGMILVNNPG-SWSHIYVPLEHSVWNGLTPTDLVFPFFVFAMG 59

Query: 86  VAIALALKRIPD-RADAVKKVIFRTLKLLFWG-ILLQGGFSHAPDELTYGVDVRMIRLCG 143
           +A+  + K +   RA  ++KV+ R++ L   G +L   G      EL +      +R+ G
Sbjct: 60  MAMGFSTKNLTALRASYLRKVMKRSVLLFVIGLLLTLLGRWLNTGELCFS----QLRVMG 115

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           VLQR++LSYL+V+L+    K V         F +  L   +W++          LL G  
Sbjct: 116 VLQRLSLSYLVVALIVRRVKGVPTMT-----FVVVALLSGYWVL----------LLLG-- 158

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
                          +G  F+             N V  +DR +LG +H+Y         
Sbjct: 159 ---------------HGFDFSAN-----------NIVAVVDRWLLGESHLY--------- 183

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
              +  P   P+           F+PEGLLS++  +   ++G     ++  ++    R+ 
Sbjct: 184 --IERLPDGTPI----------AFDPEGLLSTIPCVAQVLLGYICSRLLCTSQELPQRIL 231

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
           +   +G  LL+ GL L +    PLNK++++ ++V VTSG A L ++ +    D      W
Sbjct: 232 RLAVIGALLLLSGLLLSYM--CPLNKKIWSSTFVMVTSGYALLAWTVMAWFADLKKQSRW 289


>gi|398343267|ref|ZP_10527970.1| hypothetical protein LinasL1_09415 [Leptospira inadai serovar Lyme
           str. 10]
          Length = 399

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 94/350 (26%), Positives = 148/350 (42%), Gaps = 88/350 (25%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFV 76
           V  +       ++RL S+D  RG  VA MILV++ G  W  I     HA W GC   D V
Sbjct: 21  VKQELLNDSFASKRLLSIDALRGFTVAGMILVNNPG-SWSAIYSPLRHAKWFGCTPTDLV 79

Query: 77  MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
            PFFLF VGV+I  +            K++ R   L+F G+ L      + D L      
Sbjct: 80  FPFFLFSVGVSIPFS---TIGNGGTFFKILKRASILIFIGLFLHWFGEWSMDRL------ 130

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
              R+ GVLQRI L Y + ++                R + FR            +++ +
Sbjct: 131 ---RIPGVLQRIGLVYFISAIAY--------------RSASFR----------TRIMICI 163

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
           ++L+G +       I+ + +   G            L+P  +   ++DR V G NH+   
Sbjct: 164 SILFGYW-------ILLEFAPPPG-------AGSPSLSPGKDWGAWLDRIVFGENHL--- 206

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH-- 314
             W+ SK                       ++PEGLL S+S++ +T +G+ FG V+    
Sbjct: 207 --WKSSKT----------------------WDPEGLLGSLSAVATTFLGIFFGEVLKKDS 242

Query: 315 -TKGHLARLK-QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            TKG++ +    ++     L++ G   H     P+NK L+T S+V  T G
Sbjct: 243 DTKGNIQKTAFTFILAAIVLMLVGWIWH--QFFPMNKSLWTSSFVLWTGG 290


>gi|160885454|ref|ZP_02066457.1| hypothetical protein BACOVA_03454 [Bacteroides ovatus ATCC 8483]
 gi|423290374|ref|ZP_17269223.1| hypothetical protein HMPREF1069_04266 [Bacteroides ovatus
           CL02T12C04]
 gi|156109076|gb|EDO10821.1| hypothetical protein BACOVA_03454 [Bacteroides ovatus ATCC 8483]
 gi|392665761|gb|EIY59284.1| hypothetical protein HMPREF1069_04266 [Bacteroides ovatus
           CL02T12C04]
          Length = 371

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 146/339 (43%), Gaps = 89/339 (26%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             +RL +LD+ RG+ +A MILV++ G     +  + HA WNG    D V PFF+FI+G++
Sbjct: 5   SNKRLLALDVMRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQG----GFSHAPDELTYGVDVRMIRL 141
             ++LK+        A  K+I RT+ +   GI L       ++H P      +    IR+
Sbjct: 65  TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALNWFALLCYTHNP------LPFEQIRI 118

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GV+QR+AL Y   +L+ +  K                    H  +   ++V    LL G
Sbjct: 119 LGVMQRLALCYGASALIALLLK--------------------HKYIPYLIVV----LLVG 154

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            ++      +I  +   Y +                N +  +DR +LG  HMY       
Sbjct: 155 YFI-----ILITGNGFAYNET---------------NILSIVDRSILGDAHMY------- 187

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                QD+                  +PEGLLS++ SI   +IG   G +++  K    +
Sbjct: 188 -----QDN----------------HIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEK 226

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           L++   +G  L   G    F+   P NK++++ ++V VT
Sbjct: 227 LERLFLIGTILTFAGFL--FSYGCPFNKKIWSPTFVLVT 263


>gi|410721825|ref|ZP_11361152.1| hypothetical protein B655_1618 [Methanobacterium sp. Maddingley
           MBC34]
 gi|410598366|gb|EKQ52947.1| hypothetical protein B655_1618 [Methanobacterium sp. Maddingley
           MBC34]
          Length = 372

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 105/356 (29%), Positives = 155/356 (43%), Gaps = 91/356 (25%)

Query: 33  QRLASLDIFRGLAVALMILVDHAG--GDWPEI-SHAPWNGCNLADFVMPFFLFIVGVAIA 89
            RL SLD+FRG  +A MI+V+  G   D P +  HA W G NLAD V PFFLFIVGV++ 
Sbjct: 6   DRLVSLDVFRGFTIAGMIMVNILGLYPDTPSLLQHASWIGLNLADLVFPFFLFIVGVSMN 65

Query: 90  LALKRIPDRADAVK--KVIFRTLKLLFWGILLQGGFSHAPDELTYGV-DVRMIRLCGVLQ 146
            +      +    K  K +FR   L   G+ L  G         YGV D   IR+ G+LQ
Sbjct: 66  FSFASRSKQPSWKKWGKFLFRVAALYLIGVALVFGL------FFYGVPDFSTIRIPGILQ 119

Query: 147 RIALSYLLVS-LVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
            IALS L  + L  + T+                   W  L A+ +L V  A+L     P
Sbjct: 120 LIALSSLFAAPLARLRTR-------------------WIILAASVILFVQAAILLWVSAP 160

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRA-KLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                                 GV A  L    N  G+ID +V    H+           
Sbjct: 161 ----------------------GVPAGSLEMSNNIAGWIDSQVFTPAHL----------- 187

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-IHT--KGHLAR 321
                     L K+     H  F+PEG+++ ++     +IG+  G  + +H   KG    
Sbjct: 188 ----------LDKE-----HV-FDPEGMMAVINGTAMVLIGLACGRTLRLHRNWKG---- 227

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
             Q++ +G  L+   + L  +  +P+ KQL+T S++ V +G AA++ + +Y L+D 
Sbjct: 228 -VQYLIIG-GLIALTIGLIISPVMPIIKQLWTSSFILVNAGLAAIILALLYGLMDI 281


>gi|317478517|ref|ZP_07937676.1| hypothetical protein HMPREF1007_00792 [Bacteroides sp. 4_1_36]
 gi|316905331|gb|EFV27126.1| hypothetical protein HMPREF1007_00792 [Bacteroides sp. 4_1_36]
          Length = 394

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 161/378 (42%), Gaps = 99/378 (26%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           ++R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 7   SKRILALDILRGVTIAGMIMVNNPG-SWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 65

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
             ++LK+        A  K++ RT+ +   G+ + G FS        APD+L++G  +  
Sbjct: 66  TYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKLWA 124

Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   S++ +  K                     +L+A  +
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMKHKHIP----------------YLIAGLL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              ++ L+ G                  G  +N T           N +  +DR +L   
Sbjct: 169 TGYFILLMCGN-----------------GFAYNET-----------NILSVVDRAILTPA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                                 +PEGLLS++ +I   ++G   G +
Sbjct: 201 HMYKDNG----------------------------IDPEGLLSTIPAIAHVLLGFCVGRL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVTSGAAAL 366
           ++       R     +    LL+ G+ L F+  +     P+NK++++ +YV VT G A+ 
Sbjct: 233 MLDGNKSEDRASFLNSQLITLLLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASS 292

Query: 367 VFSAIYALVDCNIFGPWT 384
             + +  ++D   +  W+
Sbjct: 293 FLALLIWIIDVKGYKKWS 310


>gi|336415339|ref|ZP_08595679.1| hypothetical protein HMPREF1017_02787 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940935|gb|EGN02797.1| hypothetical protein HMPREF1017_02787 [Bacteroides ovatus
           3_8_47FAA]
          Length = 371

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 146/339 (43%), Gaps = 89/339 (26%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             +RL +LD+ RG+ +A MILV++ G     +  + HA WNG    D V PFF+FI+G++
Sbjct: 5   SNKRLLALDVIRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQG----GFSHAPDELTYGVDVRMIRL 141
             ++LK+        A  K+I RT+ +   GI L       ++H P      +    IR+
Sbjct: 65  TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALNWFALLCYTHNP------LPFEQIRI 118

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GV+QR+AL Y   +L+ +  K                    H  +   ++V    LL G
Sbjct: 119 LGVMQRLALCYGASALIALLLK--------------------HKYIPYLIVV----LLVG 154

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            ++      +I  +   Y +                N +  +DR +LG  HMY       
Sbjct: 155 YFI-----ILITGNGFAYNET---------------NILSIVDRSILGDAHMY------- 187

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                QD+                  +PEGLLS++ SI   +IG   G +++  K    +
Sbjct: 188 -----QDN----------------HIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEK 226

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           L++   +G  L   G    F+   P NK++++ ++V VT
Sbjct: 227 LERLFLIGTILTFAGFL--FSYGCPFNKKIWSPTFVLVT 263


>gi|418737426|ref|ZP_13293823.1| putative membrane protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
 gi|410746620|gb|EKQ99526.1| putative membrane protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
          Length = 383

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 95/349 (27%), Positives = 144/349 (41%), Gaps = 82/349 (23%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            ++KS     R+ SLD+FRG+ V  MILV++ G  W  +     HA WNGC   D V PF
Sbjct: 1   MEKKSTQNKDRILSLDLFRGMTVVGMILVNNPG-SWSYVYSPLKHAEWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FLF VG +I ++L  K   +R      +  R + L     +L G F +   E T+     
Sbjct: 60  FLFAVGTSIPISLYSKNGINRIRVWIGICIRGISL-----ILLGLFLNFFGEWTF----S 110

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
            +R+ GVLQRI   Y +V+ +                F IF            VLV  + 
Sbjct: 111 ELRIPGVLQRIGFVYWVVATL----------------FLIFP--------GKKVLVFLIP 146

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +L    V  W  T I                    L    +   +IDR + G  H+    
Sbjct: 147 IL---LVHTWILTHIAPPGES-----------MVSLEQGKDIGAWIDRTIFGEKHL---- 188

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
            W+ SK                       ++PEG LS ++SI +++ GV  G ++   +G
Sbjct: 189 -WKFSKT----------------------WDPEGFLSGIASIATSLFGVICGFILFRREG 225

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
              + +     G   L   + L +  ++P+NK L+T SY   T+G + L
Sbjct: 226 R-GKNRVLSIFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGLSFL 273


>gi|384417772|ref|YP_005627132.1| membrane protein [Xanthomonas oryzae pv. oryzicola BLS256]
 gi|353460685|gb|AEQ94964.1| membrane protein, putative [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 388

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 96/352 (27%), Positives = 146/352 (41%), Gaps = 77/352 (21%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   ++  G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNTPHLQFLGRVSKRAALIVLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+  +          V                  +L+ Y ALLY     
Sbjct: 138 QRIGLCYLAAALLVRYLPPRSIAPACV-----------------ALLLGYWALLY----- 175

Query: 206 DWQFTIINKDSADYGKVFNV-TCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                +  +  A+  K  N  TC               +D  + G  H+Y          
Sbjct: 176 -----VFGQPGAELSKTGNAGTC---------------LDLWLYGREHLY---------- 205

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                      RKD        F+PEGLL ++S+ ++ + G   G  +       A  + 
Sbjct: 206 -----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFLQRHGKTTASTRS 248

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            +  G  +++  L L +  A PL+K+L++ S+V  T G   L    +  L++
Sbjct: 249 LLLAGVGMVL--LALLWAPAWPLSKKLWSGSFVACTVGLDLLALGVLVYLLE 298


>gi|389799428|ref|ZP_10202419.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter sp. 116-2]
 gi|388442725|gb|EIL98904.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter sp. 116-2]
          Length = 353

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 147/356 (41%), Gaps = 100/356 (28%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RLASLD  RG  VA M+LV+  G DW  +     HA W+GC   D V  FFLF+VGV++
Sbjct: 2   KRLASLDALRGCTVAAMLLVNDPG-DWGHVYWPLEHAAWHGCTPTDLVFSFFLFVVGVSV 60

Query: 89  ALA-LKRIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           ALA L R+   A      +   +R L++L  G+ +            + +    +R  GV
Sbjct: 61  ALAILPRLEQGAAPSALTRAATWRALRILALGVAIN-------LLAAWLLPQAHLRFPGV 113

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL-YGTY 203
           LQRIAL +  V+L  I TK                     W   A +L+ Y  LL  G  
Sbjct: 114 LQRIALCFAGVALFAIHTKPRT-----------------QWWAIAALLIGYWGLLRLGGS 156

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           +  W       DSA +G+                    Y+     G  H           
Sbjct: 157 LEPWTNLASRVDSAVFGRFV------------------YLIDPASGRGH----------- 187

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                                   +PEGLLS++ S+  T++G+  G  +        R +
Sbjct: 188 ------------------------DPEGLLSTLPSLAGTLLGLRMGCWL--------RRE 215

Query: 324 QWVTM---GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           Q+ T+   G A L+ G    ++  +P NK L+T S+V  T+G A L   A++ L+D
Sbjct: 216 QFRTLLLAGIACLLLGAL--WSPWLPFNKNLWTPSFVLWTTGWATLALLALHVLID 269


>gi|418719584|ref|ZP_13278783.1| putative membrane protein [Leptospira borgpetersenii str. UI 09149]
 gi|410743627|gb|EKQ92369.1| putative membrane protein [Leptospira borgpetersenii str. UI 09149]
          Length = 383

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 95/349 (27%), Positives = 144/349 (41%), Gaps = 82/349 (23%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            ++KS     R+ SLD+FRG+ V  MILV++ G  W  +     HA WNGC   D V PF
Sbjct: 1   MEKKSTQNKDRILSLDLFRGMTVVGMILVNNPG-SWSYVYSPLKHAEWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FLF VG +I ++L  K   +R      +  R + L     +L G F +   E T+     
Sbjct: 60  FLFAVGASIPISLYSKNGINRIRVWIGICIRGISL-----ILLGLFLNFFGEWTF----S 110

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
            +R+ GVLQRI   Y +V+ +                F IF            VLV  + 
Sbjct: 111 ELRIPGVLQRIGFVYWVVATL----------------FLIFP--------GKKVLVFLIP 146

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +L    V  W  T I                    L    +   +IDR + G  H+    
Sbjct: 147 IL---LVHTWILTHIAPPGES-----------MVSLEQGKDIGAWIDRTIFGEKHL---- 188

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
            W+ SK                       ++PEG LS ++SI +++ GV  G ++   +G
Sbjct: 189 -WKFSKT----------------------WDPEGFLSGIASIATSLFGVICGFILFRREG 225

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
              + +     G   L   + L +  ++P+NK L+T SY   T+G + L
Sbjct: 226 R-GKNRVLSIFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGLSFL 273


>gi|295690502|ref|YP_003594195.1| hypothetical protein Cseg_3137 [Caulobacter segnis ATCC 21756]
 gi|295432405|gb|ADG11577.1| Protein of unknown function DUF2261, transmembrane [Caulobacter
           segnis ATCC 21756]
          Length = 372

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 93/349 (26%), Positives = 146/349 (41%), Gaps = 74/349 (21%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
             R  SLD+FRGL V LMI+V+ AG     + ++ HAPW G   AD V P FLF VG ++
Sbjct: 5   AARFLSLDVFRGLTVCLMIVVNTAGPGAKAYTQLVHAPWFGFTAADAVFPSFLFAVGCSM 64

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGVDVRMIRLCGVLQR 147
           A A  R     + + KV+ R   +   G L+    F    D     +     R+ GVLQR
Sbjct: 65  AFAFSRPIPTNEFLAKVLRRAALIFLLGFLMYWFPFVKKIDGHWALIPFADTRVMGVLQR 124

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           IAL Y+L +                        +   WL    ++ +   LL G     W
Sbjct: 125 IALCYMLAA------------------------FAVRWLSPRLIVALSAVLLLGY----W 156

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
              +   D A             A L+   NA  ++D  ++G NH+Y             
Sbjct: 157 AILMTLGDPA-------------APLSKLGNAGTHLDLFLIGQNHLY------------- 190

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                   RKD        F+PEGLL ++ S ++ + G      +    G  + + +   
Sbjct: 191 --------RKD------GGFDPEGLLGTLPSTVNVLAGYLAARFLKENPGSQSAMARMAI 236

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            G  L++ GL   ++   P+ K+L+T S+V +T G   ++ +A+  L++
Sbjct: 237 AGVVLILAGLA--WSPLFPIAKKLWTGSFVLLTVGIDLVLLAALTKLLE 283


>gi|294667090|ref|ZP_06732315.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292603100|gb|EFF46526.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 388

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 94/335 (28%), Positives = 141/335 (42%), Gaps = 75/335 (22%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +L  G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNTPHLQFLGRVSKRAALILLCGVLMYWFPFFHLQPDGGWSFTTVDQLRLTGVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+  +          +                  +L+ Y ALLY    P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCL-----------------ALLLGYWALLYAFGQP 180

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                    A+L+   NA   +D  + G +H+Y           
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                     RKD        F+PEGLL ++S+ ++ + G   G  +      +A  +  
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFLQRHGKTVASTRSL 249

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           +  G  L++  L L +  A PL+K+L++ S+V  T
Sbjct: 250 LLAGAGLVV--LALLWAPAWPLSKKLWSGSFVACT 282


>gi|375110537|ref|ZP_09756759.1| hypothetical protein AJE_11264 [Alishewanella jeotgali KCTC 22429]
 gi|374569481|gb|EHR40642.1| hypothetical protein AJE_11264 [Alishewanella jeotgali KCTC 22429]
          Length = 394

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 101/369 (27%), Positives = 171/369 (46%), Gaps = 69/369 (18%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
           QQ  +     R+ +LD  RGLA+  MILV++ G     +P + HA W+G    D + P F
Sbjct: 7   QQILAKQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFPAF 66

Query: 81  LFIVGVAI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           L +VG+AI  +LA +++  +A+ +++   R LKL   G+ L   + +  D     +  ++
Sbjct: 67  LVMVGMAIPYSLAGRQLLPKAEQIRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQKL 126

Query: 139 --IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             +R  GVLQRI + Y    L+ +++        + GR          WL   C+L   L
Sbjct: 127 LTVRWSGVLQRIGIVYFCTLLIVLYSG-------TRGRIL--------WLSGLCLLYFLL 171

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
                 +VP        +D  +YG  F    G+    N   N   ++D ++LG NH+Y  
Sbjct: 172 M----QFVP-------YRD--NYGHTF---VGLWEHGN---NLAAWLDHQLLGPNHVYFR 212

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
            A          +PF               F+PEG+LS++ +I S + GV     ++ +K
Sbjct: 213 SA----------TPFA--------------FDPEGILSTLPAIASCLSGVLMAQ-LLQSK 247

Query: 317 GHLA-RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
             LA +L+  +  G A +      H   A+P+NK L+T ++V ++SG  A++ +    L 
Sbjct: 248 AELAFKLRVLLLSGLAAVWIAELAH--PALPINKMLWTPTFVLLSSGFTAVILALFLWLT 305

Query: 376 DCNIFGPWT 384
           +   +  WT
Sbjct: 306 EMKRYRLWT 314


>gi|345880604|ref|ZP_08832150.1| hypothetical protein HMPREF9431_00814 [Prevotella oulorum F0390]
 gi|343922516|gb|EGV33216.1| hypothetical protein HMPREF9431_00814 [Prevotella oulorum F0390]
          Length = 383

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 97/364 (26%), Positives = 154/364 (42%), Gaps = 92/364 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           KT R+ ++DI RG+ +A MILV++ GG   +  + HA W G    D V PFF+FI+G+  
Sbjct: 5   KTSRIEAVDILRGITIAGMILVNNPGGQPVYTPLEHAEWFGLTPTDLVFPFFMFIMGITT 64

Query: 89  ALALKRIPDRAD--AVKKVIFRTLKLLFWGILL-------QGGFSHAPDELTYGVDVRM- 138
            L+L++          KK+I R + L   GI +       +G F+     L +   V   
Sbjct: 65  YLSLRKYDFEWSWPCAKKIIKRGMLLYVIGIAISWLMMFCRGLFNEDYAALPFFSHVFAA 124

Query: 139 ------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
                 IRL GV  R+A  Y+  S+V +  K          RF         WL+AA V 
Sbjct: 125 ANVFDHIRLVGVFPRLAFCYVFASVVALSVKH---------RFI-------PWLIAA-VF 167

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           + Y A+L           + N  + D   + NV                 ID  +LG  H
Sbjct: 168 IGYFAVL----------CLGNGFAHDASNICNV-----------------IDEAILGRQH 200

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y                     + D P       +PEGLLSS+ ++   +IG   G V+
Sbjct: 201 LY---------------------KWDIP-------DPEGLLSSLPALGHVLIGFCVGRVV 232

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +       ++++    G  L I G  L +    P++K+L+T ++  VT G A+   + + 
Sbjct: 233 MSATSLNDKIEKLFIYGAVLTILGFLLSY--GCPISKKLWTPTFALVTCGLASTTLALLS 290

Query: 373 ALVD 376
            ++D
Sbjct: 291 WVID 294


>gi|315500593|ref|YP_004089395.1| hypothetical protein Astex_3616 [Asticcacaulis excentricus CB 48]
 gi|315418605|gb|ADU15244.1| protein of unknown function DUF1624 [Asticcacaulis excentricus CB
           48]
          Length = 372

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 99/352 (28%), Positives = 156/352 (44%), Gaps = 74/352 (21%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDH--AGGD-WPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +   R  +LD+FRGL V +MI+V+   AG + + ++ HA W G  L D V P FLF +G 
Sbjct: 1   MSAARYTALDVFRGLTVCVMIVVNTSPAGAEPFAQLQHAQWFGFTLTDLVFPSFLFAIGN 60

Query: 87  AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYGV-DVRMIRLCGV 144
           ++  A ++     + + KV+ R+  +   G L+    F H   +  +   D+   R+ GV
Sbjct: 61  SMVFAFRKPLPHKEFLLKVLRRSALIFLLGYLMYWFPFVHQTTDGAWAFNDIGHTRIMGV 120

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIAL YL  SL          +  SV    I         ++A +L  Y  LLY  + 
Sbjct: 121 LQRIALCYLFASLAA--------RYLSVRGLVI---------LSALLLFGYWGLLY-AFT 162

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
           P          +AD      +T  + AK          ID+ VLG++HMY   A      
Sbjct: 163 P----------AAD---ALTMTGNLGAK----------IDQFVLGLDHMYRGGA------ 193

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                                 +EPEGLLS++ +I++ + G   G +I+ ++    R   
Sbjct: 194 --------------------KGYEPEGLLSTLPAIVNVLAGYLCGRLILDSEDR--RRTV 231

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            +  G  L +    L ++   P +K+L+T S+  +T G   L+ S I A VD
Sbjct: 232 MILSGAGLALVAAALVWSFGFPFSKRLWTSSFAVLTIGLDCLILSGIIAYVD 283


>gi|436833933|ref|YP_007319149.1| Protein of unknown function DUF2261,transmembrane [Fibrella
           aestuarina BUZ 2]
 gi|384065346|emb|CCG98556.1| Protein of unknown function DUF2261,transmembrane [Fibrella
           aestuarina BUZ 2]
          Length = 361

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 98/343 (28%), Positives = 140/343 (40%), Gaps = 89/343 (25%)

Query: 49  MILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGVAIALALKR-----IPDRA 99
           MILV++AG DW      + HAPWNG    D + PFFLFIVGV+I  AL +     + D  
Sbjct: 1   MILVNNAG-DWAHSYAPLKHAPWNGWTPTDLIFPFFLFIVGVSITFALSKRQTSLLEDEK 59

Query: 100 DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE 159
               K+I R + L   G  L          L    D   +R+ GVLQRI + Y + +LV 
Sbjct: 60  TQRLKIIRRGVTLFALGFFLN---------LFPRFDFANVRIMGVLQRIGIVYTVCALVF 110

Query: 160 IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADY 219
           + T   Q  +                 +   +L+ Y  L+    VP   +          
Sbjct: 111 LRTSPRQQVN-----------------LILLILIGYFLLMTMVPVPGIGY---------- 143

Query: 220 GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDA 279
                      A L P  N   +IDR +L   H Y     R SK                
Sbjct: 144 -----------ANLEPETNLAAWIDRTILTPAHCY-----RSSKV--------------- 172

Query: 280 PSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQ---WVTMGFALLIF 335
                  ++PEGLLS+V +I + ++G+  G  +  T+ G   R  Q   ++ +   L+ F
Sbjct: 173 -------WDPEGLLSTVPAIATGLLGLLAGRWLRSTRYGTTVRESQKALFLFLAGLLMAF 225

Query: 336 GLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
             TL F    P+NK L+T SYV +  G A    +  Y L+D  
Sbjct: 226 VGTL-FDTVFPINKALWTSSYVLLAGGLAMCGLAIFYYLIDVR 267


>gi|445498183|ref|ZP_21465038.1| putative membrane protein DUF1624 [Janthinobacterium sp. HH01]
 gi|444788178|gb|ELX09726.1| putative membrane protein DUF1624 [Janthinobacterium sp. HH01]
          Length = 370

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 96/351 (27%), Positives = 146/351 (41%), Gaps = 81/351 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           QR  ++D+ RGL VALMI+V+  G     +    HA W+G  L D V P F+F+VG A++
Sbjct: 6   QRSQAIDVLRGLTVALMIMVNMPGTPATTYAPFLHAEWHGLTLTDLVFPTFMFVVGTALS 65

Query: 90  LALKRIPDRADA--VKKVIFRTLKLLFWGILLQGG--FSHAPDELTYGVDVRMIRLCGVL 145
             L++     +A  +KK+  RT  +   G L+     FS     LT  + +   R+ GVL
Sbjct: 66  FTLEKYEGMGEAAVLKKIFTRTALIFLCGFLMYWYPFFSTDGGSLTV-LPLSGTRIFGVL 124

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL Y   SL+  + +   +K   V               A   L+ Y  ++YG    
Sbjct: 125 QRIALGYCAGSLILHYWR---EKGALV--------------FAVLALLGYWTVMYG---- 163

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                       DY                  NA   +D  VLG  HMYH          
Sbjct: 164 ----------FGDY--------------TLAGNAQRKLDLLVLGEAHMYHG--------- 190

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                 EG             F+PEG+LS++ SI++ + G   G ++         + + 
Sbjct: 191 ------EG-----------IAFDPEGILSTLPSIVNVLAGYFAGRLVRRLGASYETVAKL 233

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
              G  L +  L L +++  PLNK+L+T SY  +T        S +  ++D
Sbjct: 234 AMSGAVLTV--LALCWSSVFPLNKKLWTSSYTLITIAIDLFTLSLLLYVID 282


>gi|270295536|ref|ZP_06201737.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270274783|gb|EFA20644.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 394

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 91/378 (24%), Positives = 160/378 (42%), Gaps = 99/378 (26%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           ++R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 7   SKRILALDILRGVTIAGMIMVNNPG-SWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 65

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
             ++LK+        A  K++ RT+ +   G+ + G FS        APD+L++G  +  
Sbjct: 66  TYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKLWA 124

Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   S++ +  K                     +L+A  +
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMKHKHIP----------------YLIAGLL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              ++ L+ G                  G  +N T           N +  +DR +L   
Sbjct: 169 TGYFILLMCGN-----------------GFAYNET-----------NILSVVDRAILTPA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                                 +PEGLLS++ +I   ++G   G +
Sbjct: 201 HMYKDNG----------------------------IDPEGLLSTIPAIAHVLLGFCVGRL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVTSGAAAL 366
           ++       R     +    L + G+ L F+  +     P+NK++++ +YV VT G A+ 
Sbjct: 233 MLDGNKSEDRASFLNSQLITLFLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASS 292

Query: 367 VFSAIYALVDCNIFGPWT 384
             + +  ++D   +  W+
Sbjct: 293 FLALLIWIIDVKGYKKWS 310


>gi|398348299|ref|ZP_10533002.1| hypothetical protein Lbro5_13954 [Leptospira broomii str. 5399]
          Length = 399

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 93/338 (27%), Positives = 144/338 (42%), Gaps = 88/338 (26%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG----DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG  VA MILV++ G      WP + HA W GC   D V PFFLF VGV+ 
Sbjct: 33  KRLLSIDALRGFTVAGMILVNNPGSWSAIYWP-LKHAKWFGCTPTDLVFPFFLFSVGVS- 90

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
                 IP  +       F+ LK     ++L G F H   E +    +  +R+ GVLQRI
Sbjct: 91  ------IPFSSIGNGGTFFKILKRAS-ILILIGLFLHWFGEWS----IDQLRIPGVLQRI 139

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
            L Y + ++                    +R   +H  +  C+ +++   +   +VP   
Sbjct: 140 GLVYFISAIA-------------------YRSSNFHARILICLSILFGYWILLEFVPP-- 178

Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
                 DS                L+P  +   ++DR V G NH+     W+ SK     
Sbjct: 179 ---PGSDSVS--------------LSPGKDWGAWLDRIVFGENHL-----WKSSKT---- 212

Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH---TKGHLARLK-Q 324
                             ++PEGLLSS+S++ +T +G  FG V+     TK ++ +    
Sbjct: 213 ------------------WDPEGLLSSLSAVATTFLGFFFGEVLKKDSDTKKNIQKTAFN 254

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
           ++     +++ G   H     P+NK L+T S+V  T G
Sbjct: 255 FILAAIVIMVAGWIWH--QFFPMNKSLWTSSFVLWTGG 290


>gi|160887858|ref|ZP_02068861.1| hypothetical protein BACUNI_00261 [Bacteroides uniformis ATCC 8492]
 gi|156862688|gb|EDO56119.1| hypothetical protein BACUNI_00261 [Bacteroides uniformis ATCC 8492]
          Length = 394

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 162/378 (42%), Gaps = 99/378 (26%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           ++R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 7   SKRILALDILRGVTIAGMIMVNNPG-SWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 65

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
             ++LK+        A  K++ RT+ +   G+ + G FS        APD+L++G  +  
Sbjct: 66  TYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKLWA 124

Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   S++ +  K                     +L+A  +
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMKHKHIP----------------YLIAGLL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              ++ L+ G                  G  +N T           N +  +DR +L   
Sbjct: 169 TGYFILLMCGN-----------------GFAYNET-----------NILSVVDRAILTPA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                      KD         +PEGLLS++ +I   ++G   G +
Sbjct: 201 HMY----------------------KD------NGIDPEGLLSTIPAIAHVLLGFCVGRL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVTSGAAAL 366
           ++       R     +    L + G+ L F+  +     P+NK++++ +YV VT G A+ 
Sbjct: 233 MLDGNKSEDRASFLNSQLITLFLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASS 292

Query: 367 VFSAIYALVDCNIFGPWT 384
             + +  ++D   +  W+
Sbjct: 293 FLALLIWIIDVKGYKKWS 310


>gi|427781073|gb|JAA55988.1| Putative heparan-alpha-glucosaminide n-acetyltransferase
           [Rhipicephalus pulchellus]
          Length = 337

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 71/297 (23%), Positives = 133/297 (44%), Gaps = 35/297 (11%)

Query: 84  VGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           +GV++A+ ++ +  ++    ++  + +K      L+  G     + L+  VD+  +R+ G
Sbjct: 1   MGVSLAMTIRSLLRKSVTRGRIFLQIVK----RSLILFGLGIMTNTLSGDVDLNTLRIPG 56

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           VLQR+A SYL+ + V +      +             Y   WL+A  +L ++LAL +   
Sbjct: 57  VLQRLAFSYLVAATVHLLFAKPHEGQLVWAPVRDVLAYWPEWLLAIPMLALHLALTFFLP 116

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           VP+     +          F    G          A G+IDR++ G +H+Y  P  R   
Sbjct: 117 VPNCPQGYLGPGGLHLNSSFENCTG---------GAAGFIDRRIFGNSHIYQTPDMRH-- 165

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
               D+              H P++PEG L  ++SI    +G+  G +++      AR+ 
Sbjct: 166 --VYDT--------------HLPYDPEGTLGCLTSIFLVFLGLQAGKILLTFPEWKARVI 209

Query: 324 QWVTMGFAL-LIFGLTLHFTNA---IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           +W   G    +I G+  +F+     IP+NK L+++S++  T+  A  +   +Y L+D
Sbjct: 210 RWCIWGLLCGIIAGVLCNFSKEEGWIPINKNLWSVSFILSTASTAFFLLVVLYYLID 266


>gi|404404862|ref|ZP_10996446.1| transmembrane protein [Alistipes sp. JC136]
          Length = 382

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 90/360 (25%), Positives = 156/360 (43%), Gaps = 58/360 (16%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHA---GGDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +K++RL SLD+ RG+ +A MILV++    G  +  + HA W+G    D + PFF+FI+GV
Sbjct: 1   MKSERLLSLDVMRGMTIAAMILVNNPAVWGKAYAPLQHAFWHGMTPTDLIYPFFVFIMGV 60

Query: 87  AIALALKRIPDRA--DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           +   +L +  + A  +A  +++ R+  +   G+LL         E++Y            
Sbjct: 61  SAFFSLSKRYEGAGREAFSRILRRSAVIFGVGLLL--------QEISY------------ 100

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
                  Y   + +   T       ++V  F  FR+      +A   L    ALL     
Sbjct: 101 -----FGYGTANFLSGQTSADATWFETVFPFRTFRIMGVLQGLALVYLFGSAALL----C 151

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
             ++  I+         +  +  G    L+   N +  +DR VLG +H+Y          
Sbjct: 152 LRFRHLIVAAGGLLILYLVLLQTGNGYSLSAD-NIIAVVDRAVLGESHLY---------- 200

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
                      R+  P      FEPEGLLS++  I   ++G   G +++  +    R  +
Sbjct: 201 -----------REWLPDGSRLAFEPEGLLSTLPRIAQFLLGCAAGRILLANEDAPMRFGR 249

Query: 325 WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
               G AL   GL L + +  PLNK++++ S+   TSG A+L+   +  ++D +    WT
Sbjct: 250 LFAFGTALFFTGLLLQYGD--PLNKKIWSSSFALATSGFASLLLGLLCWVIDLHKQVRWT 307


>gi|383110831|ref|ZP_09931649.1| hypothetical protein BSGG_1941 [Bacteroides sp. D2]
 gi|313694406|gb|EFS31241.1| hypothetical protein BSGG_1941 [Bacteroides sp. D2]
          Length = 371

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 87/339 (25%), Positives = 145/339 (42%), Gaps = 89/339 (26%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             +RL +LD+ RG+ +A MILV++ G     +  + HA WNG    D V PFF+FI+G++
Sbjct: 5   SNKRLLALDVMRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQG----GFSHAPDELTYGVDVRMIRL 141
             ++LK+        A  K+I RT+ +   GI L       ++H P      +    IR+
Sbjct: 65  TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALNWFALLCYTHNP------LPFEQIRI 118

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GV+QR+AL Y   +L+ +  K                    H  +   ++V    LL G
Sbjct: 119 LGVMQRLALCYGASALIALLLK--------------------HKYIPYLIVV----LLVG 154

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
            ++      +I  +   Y +                N +  +DR +LG  HMY       
Sbjct: 155 YFI-----ILITGNGFAYNET---------------NILSIVDRSILGDAHMY------- 187

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                QD+                  +PEGLLS++ SI   +IG   G +++  K    +
Sbjct: 188 -----QDN----------------HIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEK 226

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           L++   +G  L   G    F+   P NK++++ ++V  T
Sbjct: 227 LERLFLIGTILTFAGFL--FSYGCPFNKKIWSPTFVLAT 263


>gi|409199197|ref|ZP_11227860.1| hypothetical protein MsalJ2_19286 [Marinilabilia salmonicolor JCM
           21150]
          Length = 369

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 159/363 (43%), Gaps = 91/363 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           ++QR  +LD+ RG+ +ALMI V+  G  W  I     HA W+GC   D V PFFLF+ GV
Sbjct: 3   QSQRYLALDVLRGMTIALMITVNTPG-SWQYIYAPLRHASWHGCTPTDLVFPFFLFVAGV 61

Query: 87  AIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQG--GFSHAPDELTYGVDVRMIRLC 142
           ++  +  +      ++++K++  RTL +   G+ L     +SH         D   +R+ 
Sbjct: 62  SMFFSFGKYGGALNSESLKRLGRRTLLIFVIGLFLNSFPQWSH---------DFSTLRIM 112

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIAL+Y + SL+ +               S  R Y     +   +L++Y  +L   
Sbjct: 113 GVLQRIALAYGIGSLIVL---------------SAPRKYI--PFIGGGILLIYWGIL--- 152

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
               W                    G     +   NAV   D+ +LG  H+Y        
Sbjct: 153 ---AW-------------------FGGAEPYSLEGNAVIPFDKAILGEQHLY-------- 182

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                             +    PF+PEGLLS+V +I++ ++G   G +I +T+    R 
Sbjct: 183 ------------------TGFGIPFDPEGLLSTVPAIVTVLLGYLTGVIIKNTE----RA 220

Query: 323 KQWVTMGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
           K    +    LI  +   F   I P+NK L+T SYV  T+G AAL  + +  ++D   + 
Sbjct: 221 KVPGRLALYGLITTVIGRFWGVIFPINKPLWTSSYVLYTAGLAALFLALLVFIIDIKGYK 280

Query: 382 PWT 384
            WT
Sbjct: 281 KWT 283


>gi|333029673|ref|ZP_08457734.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
 gi|332740270|gb|EGJ70752.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
          Length = 387

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 87/374 (23%), Positives = 158/374 (42%), Gaps = 95/374 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           + QRL +LDI RG+ +A MILV++ G  W  I     HA WNG    D V PFF+FI+G+
Sbjct: 5   ENQRLLALDILRGITIAGMILVNNPG-SWGSIYAPLGHAEWNGLTPTDLVFPFFMFIMGI 63

Query: 87  AIALALK--RIPDRADAVKKVIFRT-------LKLLFWGILLQGGFSHAP------DELT 131
           +   +L+  +     +A  K++ RT       L + ++ + L+   S A       + L+
Sbjct: 64  STYFSLRKYKFEFSKEAALKILKRTIIIFAIGLGIAWFSLFLRTWNSLASADISIFERLS 123

Query: 132 YGVDV-RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
             + V   +R+ GV+ R+AL+Y   +++ +  K        VG                 
Sbjct: 124 QSIFVFENLRILGVMPRLALTYCATAIIALTIKHKYIPTLIVG----------------- 166

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           +L+VY             F +   +  +Y +                N +  +D+ +LG 
Sbjct: 167 ILIVY------------TFILFLGNGFEYNE---------------TNILSIVDKAILGE 199

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           NHMY                                 +PEGL+S++ +I   ++G   G 
Sbjct: 200 NHMYKDNG----------------------------IDPEGLVSTIPAIAHVLLGFFVGK 231

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           +    K   ++++    MG  L   GL L +    P+NK++++ ++V  T G A+ + + 
Sbjct: 232 IFTEKKDIHSKVEFLFIMGSILTFVGLLLSY--GCPINKKIWSPTFVLTTCGLASTLLAL 289

Query: 371 IYALVDCNIFGPWT 384
           +  ++D   +  W+
Sbjct: 290 LIWIIDIKGYKRWS 303


>gi|418676277|ref|ZP_13237561.1| putative membrane protein [Leptospira kirschneri serovar
           Grippotyphosa str. RM52]
 gi|418684894|ref|ZP_13246077.1| putative membrane protein [Leptospira kirschneri serovar
           Grippotyphosa str. Moskva]
 gi|418743212|ref|ZP_13299580.1| putative membrane protein [Leptospira kirschneri serovar Valbuzzi
           str. 200702274]
 gi|400323423|gb|EJO71273.1| putative membrane protein [Leptospira kirschneri serovar
           Grippotyphosa str. RM52]
 gi|410740642|gb|EKQ85357.1| putative membrane protein [Leptospira kirschneri serovar
           Grippotyphosa str. Moskva]
 gi|410749503|gb|EKR06488.1| putative membrane protein [Leptospira kirschneri serovar Valbuzzi
           str. 200702274]
          Length = 369

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 94/335 (28%), Positives = 147/335 (43%), Gaps = 82/335 (24%)

Query: 38  LDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALAL- 92
           +D+FRG+ VA MILV++ G  W  I     HA WNGC   D V PFFLF VG++I L++ 
Sbjct: 1   MDLFRGMTVAGMILVNNPG-SWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGISIQLSVY 59

Query: 93  -KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALS 151
            K    ++     +  R++ L+  G+ L      +  EL         R+ GVLQRI   
Sbjct: 60  SKNKIHKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL---------RIPGVLQRIGFV 110

Query: 152 YLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI 211
           Y +V+ + +                  R+    W+    +L+V+  +L     P      
Sbjct: 111 YWIVASLHLILPK--------------RMILISWI---PILLVHTWVLIQIPAPG----- 148

Query: 212 INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPF 271
              +S  Y             L P  +   +IDR V G N +     W+ SK        
Sbjct: 149 ---ESIVY-------------LEPGKDIGAWIDRNVFGENRL-----WKFSKT------- 180

Query: 272 EGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA 331
                          ++PEGL S +SSI ++++GV  G  I+ +K +  + +     GF 
Sbjct: 181 ---------------WDPEGLFSGISSIATSLLGVFCGS-ILSSKTNEIKKQILSIFGFG 224

Query: 332 LLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
           +L+  + L +   +P+NK L+T SYV  T+G A L
Sbjct: 225 ILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFL 259


>gi|431798742|ref|YP_007225646.1| hypothetical protein Echvi_3416 [Echinicola vietnamensis DSM 17526]
 gi|430789507|gb|AGA79636.1| hypothetical protein Echvi_3416 [Echinicola vietnamensis DSM 17526]
          Length = 363

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 57/134 (42%), Positives = 76/134 (56%), Gaps = 16/134 (11%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGV 86
           K +RL SLD+ RG+ +A MILV+  G  W    P + HA WNG    DF+ PFFLFIVGV
Sbjct: 4   KNKRLISLDVLRGMTIAAMILVNFPG-SWEHVFPPLHHAQWNGITPTDFIFPFFLFIVGV 62

Query: 87  AIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           +I +A   K   D+    KK+ FR  K+   G+LL       P+      D   IR+ GV
Sbjct: 63  SIVMAYAGKMEMDKTIVYKKLFFRGAKIFALGVLL----GMIPE-----FDFSAIRVAGV 113

Query: 145 LQRIALSYLLVSLV 158
           LQRIAL ++  +L+
Sbjct: 114 LQRIALVFVACTLM 127



 Score = 46.6 bits (109), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 48/92 (52%), Gaps = 2/92 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           ++PEG+ S++ +I + I+G+  G ++      + +    + +G  L ++GL   +    P
Sbjct: 188 WDPEGVFSTLPAIATGILGMLAGQLLKSQLKEVEKANNLMVIGLVLTLWGL--FWAWFFP 245

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           +NK L+T S+V VT G A     A Y  +D  
Sbjct: 246 INKNLWTSSFVLVTGGTAFSFLGAFYYWIDVK 277


>gi|329851960|ref|ZP_08266641.1| hypothetical protein ABI_47300 [Asticcacaulis biprosthecum C19]
 gi|328839809|gb|EGF89382.1| hypothetical protein ABI_47300 [Asticcacaulis biprosthecum C19]
          Length = 369

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 98/366 (26%), Positives = 151/366 (41%), Gaps = 84/366 (22%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +  QR  SLD+FRGL VA MI+V+ +G     + ++SHA W G  LAD V P FLF VG 
Sbjct: 1   MAGQRFTSLDVFRGLTVAFMIVVNTSGPGAAPFAQLSHATWFGLTLADLVFPAFLFAVGN 60

Query: 87  AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGVDVRMIRLCGVL 145
           A++    +       + KV+ R   L   G L+    F HA  +      V   R+ GVL
Sbjct: 61  AMSFGDPKSGPTGRYLGKVVKRAAILFLLGYLMYWFPFVHATADGWALNPVEHTRIPGVL 120

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL +L  ++   +     D  + +G  ++  L  W  LM               + P
Sbjct: 121 QRIALCFLAAAIAVRWL----DVPKLIGLSAVLLLGYWGALM--------------VFGP 162

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
             +                       +L P  N    IDR V GINHMY      + K  
Sbjct: 163 PGE-----------------------QLTPLGNIGALIDRAVFGINHMYA-----KGKG- 193

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                ++PEGL S++ +I++ + G   G   I ++  L  +   
Sbjct: 194 ---------------------YDPEGLFSTLPAIVNVLAGYLAGRY-IRSQPDLRTVVIR 231

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD--------- 376
           + +  A LI    L ++   PL+K+L+T S+  +  G    + + + A V+         
Sbjct: 232 LAVA-AGLIVAAALAWSLTFPLSKRLWTSSFALINIGIDLGLLAGLIAYVELARQKFGVP 290

Query: 377 -CNIFG 381
            C +FG
Sbjct: 291 FCEVFG 296


>gi|374374997|ref|ZP_09632655.1| hypothetical protein NiasoDRAFT_0408 [Niabella soli DSM 19437]
 gi|373231837|gb|EHP51632.1| hypothetical protein NiasoDRAFT_0408 [Niabella soli DSM 19437]
          Length = 395

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 101/376 (26%), Positives = 145/376 (38%), Gaps = 108/376 (28%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIA 89
           R  SLD+FRG  V LMILV++ G  W  I     HAPW+G    D V PFFLF VG A++
Sbjct: 4   RYRSLDVFRGATVCLMILVNNPG-SWAHIYAPLDHAPWHGLTPTDLVFPFFLFAVGNAMS 62

Query: 90  LALKRIPDR--ADAVKKVIFRTLKLLFWGILL--------QGGFSHAPDELTYGVDVRMI 139
             + R+ +   A+  KK+  RTL +   GI L         G    A   +T       I
Sbjct: 63  FVIPRLQEAGPAEFWKKITKRTLIIFGIGIFLNWSPFVRWNGDTLQAVTWVTDPAKNIGI 122

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R+ GVLQRIA  Y   S++  + K                         A  L + L L 
Sbjct: 123 RIFGVLQRIAFCYFFASIIVYYLKP----------------------KTAYFLSLVLLLA 160

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----IDRKVLGINHMYH 255
           Y      W   I+  + AD                 P +  G+    ID+ +L I HMY 
Sbjct: 161 Y------WGLCILG-NPAD-----------------PYSLKGWFGTNIDKAILHIPHMYK 196

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
                           EG            PF+PEG  SS+ +I+  + G   G  I ++
Sbjct: 197 G---------------EG-----------VPFDPEGFASSLGAIVQIVFGYFVGMYIKNS 230

Query: 316 KGHLAR---------------LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
              + +               L      G  LL+ G    +    P+NK+++T SY   T
Sbjct: 231 SAQIPKDLTDKQDPRNPMFKMLTVLFVAGVGLLVTGFC--WDMVFPINKKIWTSSYTVYT 288

Query: 361 SGAAALVFSAIYALVD 376
           +G A +    +   ++
Sbjct: 289 TGLAIITLCVMIFFIE 304


>gi|418752318|ref|ZP_13308585.1| putative membrane protein [Leptospira santarosai str. MOR084]
 gi|409967313|gb|EKO35143.1| putative membrane protein [Leptospira santarosai str. MOR084]
          Length = 363

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 95/339 (28%), Positives = 146/339 (43%), Gaps = 93/339 (27%)

Query: 37  SLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PFFLF VG +I ++L
Sbjct: 2   SLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGTSIPISL 60

Query: 93  --KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             K   +R+D    +  R+  L+  G+ L   G +S A            +R+ GVLQRI
Sbjct: 61  YSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-----------LRIPGVLQRI 109

Query: 149 ALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
              Y +V SL  +F+       + +  FS+  L    W++    L            P  
Sbjct: 110 GFVYWVVASLCLVFS------GKKILVFSVPILLIHTWILTQIAL------------PG- 150

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
           +  +  +   D G                     +IDR + G  H+     WR SK    
Sbjct: 151 ESVVSLEQGKDIG--------------------AWIDRTIFGEKHL-----WRFSKT--- 182

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                              ++PEG LS V+S+++T+ GV  G ++        R ++   
Sbjct: 183 -------------------WDPEGFLSGVASVVTTLFGVLCGFIL------FLRERRNKI 217

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
           +G  +L   + L +  ++P+NK L+T SY   T+G + L
Sbjct: 218 LGLGILFSFVGLLWDLSLPMNKSLWTGSYSVYTAGLSFL 256


>gi|397171248|ref|ZP_10494657.1| hypothetical protein AEST_24230 [Alishewanella aestuarii B11]
 gi|396087147|gb|EJI84748.1| hypothetical protein AEST_24230 [Alishewanella aestuarii B11]
          Length = 394

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 100/369 (27%), Positives = 171/369 (46%), Gaps = 69/369 (18%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
           QQ  +     R+ +LD  RGLA+  MILV++ G     +P + HA W+G    D + P F
Sbjct: 7   QQILTQQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFPAF 66

Query: 81  LFIVGVAI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           L +VG+AI  +LA +++  +A+ +++   R LKL   G+ L   + +  D     +  ++
Sbjct: 67  LVMVGMAIPYSLAGRQLLPKAELIRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQKL 126

Query: 139 --IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             +R  GVLQRI + Y    ++ +++        + GR          WL   C+L   L
Sbjct: 127 LTVRWSGVLQRIGIVYFCTLVIVLYSG-------TRGRIL--------WLSGLCLLYFLL 171

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
                 +VP        +D  +YG  F    G+    N   N   ++D ++LG NH+Y  
Sbjct: 172 M----QFVP-------YRD--NYGHTF---VGLWEHGN---NLAAWLDHQLLGPNHVYFR 212

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
            A          +PF               F+PEG+LS++ +I S + GV     ++ ++
Sbjct: 213 SA----------TPFA--------------FDPEGILSTLPAIASCLSGVLMAQ-LLQSQ 247

Query: 317 GHLA-RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
             LA +L+  +  G A +      H   A+P+NK L+T ++V ++SG  AL+ +    L 
Sbjct: 248 AELAFKLRVLLLSGLAAVWIAELAH--PALPINKMLWTPTFVLLSSGFTALMLALFLWLT 305

Query: 376 DCNIFGPWT 384
           +   +  WT
Sbjct: 306 EMKRYRLWT 314


>gi|195565141|ref|XP_002106164.1| GD16714 [Drosophila simulans]
 gi|194203536|gb|EDX17112.1| GD16714 [Drosophila simulans]
          Length = 318

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/127 (39%), Positives = 74/127 (58%), Gaps = 13/127 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADIVFPSFLWIMGVCIPL 241

Query: 91  ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           ++K    R   +A    +++ R++KL   G+ L             G ++  +R+ GVLQ
Sbjct: 242 SVKSQLSRGSSKARICLRILVRSIKLFVIGLCLNS---------MSGPNLEQLRVMGVLQ 292

Query: 147 RIALSYL 153
           R  ++YL
Sbjct: 293 RFGVAYL 299


>gi|16124796|ref|NP_419360.1| hypothetical protein CC_0541 [Caulobacter crescentus CB15]
 gi|221233512|ref|YP_002515948.1| hypothetical protein CCNA_00575 [Caulobacter crescentus NA1000]
 gi|13421730|gb|AAK22528.1| hypothetical protein CC_0541 [Caulobacter crescentus CB15]
 gi|220962684|gb|ACL94040.1| hypothetical protein CCNA_00575 [Caulobacter crescentus NA1000]
          Length = 372

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 95/357 (26%), Positives = 144/357 (40%), Gaps = 74/357 (20%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
             R  SLD+FRGL V LMI+V+ AG     + ++ HAPW G   AD V P FLF VG ++
Sbjct: 5   AARFLSLDVFRGLTVFLMIVVNTAGPGAKAYSQLVHAPWFGFTAADAVFPSFLFAVGCSM 64

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGVDVRMIRLCGVLQR 147
           A A  +     D   KV+ R   +   G L+    F    D     +     R+ GVLQR
Sbjct: 65  AFAFSKPIPLNDFTVKVLRRAALIFLLGFLMYWFPFVRKVDGDWALIPFSDTRVMGVLQR 124

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           IAL YLL +                        +   WL    ++ +   LL G     W
Sbjct: 125 IALCYLLAA------------------------FAVRWLSPRLIVALCAVLLLGY----W 156

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
              +   D A             A L+   NA   +D  ++G NH+Y             
Sbjct: 157 AILMAFGDPA-------------APLSKLGNAGTRLDLLLIGQNHLY------------- 190

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                   RKD        F+PEGLL ++ S ++ + G      +    G    + +   
Sbjct: 191 --------RKD------GGFDPEGLLGTLPSTVNVLAGYLAARFLKENPGSSQAMGRMAI 236

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
            G  L++ GL   ++   P+ K+L+T S+V +T G   ++ + +  L++     P T
Sbjct: 237 AGLVLILAGLV--WSPLFPIAKKLWTSSFVLLTVGIDLILLAGLAKLLEGKASNPGT 291


>gi|322436289|ref|YP_004218501.1| hypothetical protein AciX9_2697 [Granulicella tundricola MP5ACTX9]
 gi|321164016|gb|ADW69721.1| hypothetical protein AciX9_2697 [Granulicella tundricola MP5ACTX9]
          Length = 389

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 156/372 (41%), Gaps = 92/372 (24%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNL 72
           ++E  + D Q  S     RL S+D+ RGL +  MILV+ AG +   +  + HA WNG   
Sbjct: 1   MTEQALGDIQRPS-----RLLSIDLLRGLTIGFMILVNDAGSERDAYAPLQHAWWNGFTP 55

Query: 73  ADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
            D V P FLF+VG+   L+L    DR   V ++       LFW +L +    +    L  
Sbjct: 56  TDLVFPTFLFLVGITTVLSLGSRMDR--NVPRMT------LFWSVLRRAVLIYVVGILAS 107

Query: 133 G---VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
                 +  +R  GVL RIAL YL+V  + + +K  +DK                 ++AA
Sbjct: 108 TFPFTHLAGMRFVGVLPRIALCYLIVGSLLLISKSWKDKVV---------------ILAA 152

Query: 190 CVLVVYLALLY-----GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           C L+ Y ALL      G  VP     ++++D                      N   ++D
Sbjct: 153 C-LIGYWALLRFVPVPGYGVPTHDVPLLDRDG---------------------NLAAWLD 190

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +    H+Y           T+D                    PEGLLS++ ++ + ++
Sbjct: 191 RWMFAPQHLYER---------TRD--------------------PEGLLSTIPAVGTALL 221

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G+     +     H  R K     G A +   L L +   +P+NK+++T SYV    G +
Sbjct: 222 GLL--TGLFLRSQHALRTKIMGIAGAATVSILLGLLWNITLPINKKMWTSSYVLFAGGLS 279

Query: 365 ALVFSAIYALVD 376
            L+ +A   L+D
Sbjct: 280 MLLLAACMTLID 291


>gi|217974365|ref|YP_002359116.1| hypothetical protein Sbal223_3208 [Shewanella baltica OS223]
 gi|217499500|gb|ACK47693.1| conserved hypothetical protein [Shewanella baltica OS223]
          Length = 384

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 103/381 (27%), Positives = 155/381 (40%), Gaps = 104/381 (27%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWN 68
           ++  E   +   RL SLD  RG  +            L+IL   AG  W   ++ H+ W+
Sbjct: 6   TNAIEPVKVNKPRLVSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWH 65

Query: 69  GCNLADFVMPFFLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGF 123
           G    D + P F+F+ GVA+ L+ KR     I +R    +  I R   LL  GIL   G+
Sbjct: 66  GFRFYDLIFPLFIFLSGVALGLSPKRLDKLPISERLPVYRHGIKRLFLLLLLGILYNHGW 125

Query: 124 -SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
            + AP       D   IR   VL RIA ++   +L+                        
Sbjct: 126 GTGAP------ADPEKIRYASVLGRIAFAWFFAALL-----------------------V 156

Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNA 239
           WH  +   ++V  L +L G                 YG +        G    L+P  + 
Sbjct: 157 WHTSLRTQIIVA-LGILLG-----------------YGAIQLWLPFPGGQAGVLSPTESI 198

Query: 240 VGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
             Y+D  +L G+++    P                              +PEGLLS++ +
Sbjct: 199 NAYVDSILLPGVSYQGRTP------------------------------DPEGLLSTIPA 228

Query: 299 ILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSY 356
           I++ + GV  GH I+  H KG  A++      G  LL FG  L     IP+NK+L+T S+
Sbjct: 229 IVNALTGVFVGHFIVKSHPKGEWAKVGLLAAAGGILLAFGWLLDLV--IPVNKELWTSSF 286

Query: 357 VCVTSGAAALVFSAIYALVDC 377
           V VTSG + ++ +  YALVD 
Sbjct: 287 VLVTSGWSMILLAVFYALVDV 307


>gi|90022681|ref|YP_528508.1| hypothetical protein Sde_3039 [Saccharophagus degradans 2-40]
 gi|89952281|gb|ABD82296.1| conserved hypothetical protein [Saccharophagus degradans 2-40]
          Length = 363

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 93/342 (27%), Positives = 143/342 (41%), Gaps = 91/342 (26%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVG 85
           + TQR  +LD+ RG  +A+MILV+   GDW      + HA W+G  + DFV PFFLFI+G
Sbjct: 1   MATQRYLALDVMRGATLAMMILVN-TPGDWGFVYAPLLHADWHGVTITDFVFPFFLFIIG 59

Query: 86  VAIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLC 142
            A+    +     A A+  KK+I RT  L   G+LL    F+ A  EL         R+ 
Sbjct: 60  SALFFTSRSSGQLAPAIKAKKIIKRTALLFTIGLLLHAFPFTTALSEL---------RIL 110

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQRIAL+Y + + + ++    Q                   LMAA           G 
Sbjct: 111 GVLQRIALAYGIAAFI-VWLPTTQR------------------LMAAL----------GI 141

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            V  W   I+   S                     N V +ID  +LG  H++        
Sbjct: 142 LVAYWLVFILTDSSYHLAD----------------NIVRHIDITILGAEHLWQGKG---- 181

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG---HL 319
                                   F+PEGLLS++ + ++ + G     +++       + 
Sbjct: 182 ----------------------LAFDPEGLLSTLPAAVNILAGFEATRLLVSQPAGEPNN 219

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
           A  +Q+    +A+    + L +   +P+NK L+T S+V +TS
Sbjct: 220 ATSRQFKLALYAMCSITIALIWHRWMPINKSLWTSSFVLLTS 261


>gi|393763917|ref|ZP_10352530.1| hypothetical protein AGRI_13026 [Alishewanella agri BL06]
 gi|392605231|gb|EIW88129.1| hypothetical protein AGRI_13026 [Alishewanella agri BL06]
          Length = 394

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 102/369 (27%), Positives = 168/369 (45%), Gaps = 69/369 (18%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
           QQ  +     R+ +LD  RGLA+  MILV++ G     +P + HA W+G    D + P F
Sbjct: 7   QQILAKQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFPAF 66

Query: 81  LFIVGVAI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           L +VG+AI  +LA +++  +A+ +++   R LKL   G+ L   + +  D     +  ++
Sbjct: 67  LVMVGMAIPYSLAGRQMLPKAELLRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQKL 126

Query: 139 --IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             +R  GVLQRI + Y    L+ +++        + GR          WL   C+L   L
Sbjct: 127 LTVRWSGVLQRIGIVYFCTLLIVLYSG-------TRGRVL--------WLSGLCLLYFLL 171

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
                 +VP        +D  +YG  F    G+    N   N   ++D  VLG NH++  
Sbjct: 172 M----QFVP-------YRD--NYGHTF---VGLWEHGN---NLAAWLDHHVLGPNHVFFR 212

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
            A          +PF               F+PEG+LS++ +I S + GV     ++ +K
Sbjct: 213 SA----------TPFA--------------FDPEGILSTLPAIASCLSGVLMAQ-LLQSK 247

Query: 317 GHLA-RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
             LA +L+     G A +      H   A+P+NK L+T ++V ++SG  AL+      L 
Sbjct: 248 AELAFKLRVLFLAGLAAIWVAELAH--PALPINKMLWTPTFVLLSSGFTALILGLFLWLT 305

Query: 376 DCNIFGPWT 384
           +   +  WT
Sbjct: 306 EMKRYRLWT 314


>gi|359728547|ref|ZP_09267243.1| hypothetical protein Lwei2_17159 [Leptospira weilii str.
           2006001855]
          Length = 383

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 101/348 (29%), Positives = 148/348 (42%), Gaps = 84/348 (24%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            ++KS     R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKKSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAKWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FLF VG +I ++L  K   DR+     +  R++ L+  G+LL      +  EL       
Sbjct: 60  FLFAVGGSIPISLYSKNGIDRSRIWVGICKRSVNLILLGLLLNFFGEWSFAEL------- 112

Query: 138 MIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             R+ GVLQRI   Y +V SL  IF        + V  F I  L    W++         
Sbjct: 113 --RIPGVLQRIGFVYWVVASLYLIF------PGKKVLIFLIPVLLVHTWILTHI------ 158

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
                   P  +  +  +   D G                     +IDR ++G  H+   
Sbjct: 159 -------APPGEAMVSLEQGKDIG--------------------AWIDRVIIGEKHL--- 188

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
             W+ SK                       ++PEGLLS V+SI +++ GV  G ++   +
Sbjct: 189 --WKFSKT----------------------WDPEGLLSGVASIATSLFGVLCGFILFLRE 224

Query: 317 GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G   R +   T G   L   + L +  ++P+NK L+T SY   T+G A
Sbjct: 225 GG-GRSRVLSTFGLGFLFTFVGLLWDQSLPMNKSLWTGSYAAYTAGLA 271


>gi|404486905|ref|ZP_11022093.1| hypothetical protein HMPREF9448_02547 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335959|gb|EJZ62425.1| hypothetical protein HMPREF9448_02547 [Barnesiella intestinihominis
           YIT 11860]
          Length = 373

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 91/372 (24%), Positives = 148/372 (39%), Gaps = 95/372 (25%)

Query: 31  KTQRLASLDIFRG-----------LAVALMILVDHAGGD--WPEISHAPWNGCNLADFVM 77
           K  RL SLD  RG           L V L  L   A GD     + H PW+G    D + 
Sbjct: 5   KNTRLLSLDTLRGFDMLFIMGFAPLVVTLNALHPTAVGDVIAGHMRHVPWDGFTQHDMIF 64

Query: 78  PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFS-HAPDELTY 132
           P FLFI G++   +L +      + K +  R  +    L+  G L  G    + PD    
Sbjct: 65  PLFLFIAGISFPFSLAKQRGSGSSDKHIYLRVFRRGVTLVLLGFLYNGFLQLNFPD---- 120

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
                 +RL  VL RI L+++  + + +  K      +SV    I  +   +WL+ A   
Sbjct: 121 ------VRLASVLGRIGLAWMFGAFIYMSLK------KSVQYGLIVFILVGYWLLLA--- 165

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
                     +VP         D+A             + L+   N VGYIDR  L    
Sbjct: 166 ----------FVPA-------PDAAG-----------ASPLSIEGNLVGYIDRHCLPGKL 197

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y +                              F+PEGLLS++ +I++ ++G++ G ++
Sbjct: 198 IYGN------------------------------FDPEGLLSTLPAIVTALLGIYAGEIV 227

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
             T+      K  +  G  +++  + L +    P+NK L++ S+ C   G + L+F+  Y
Sbjct: 228 RSTRLGSGERKSLLLSGIGVVLVAIGLVWNTVFPINKMLWSSSFTCFVGGLSFLLFALFY 287

Query: 373 ALVDCNIFGPWT 384
            +VD   +  WT
Sbjct: 288 YIVDVKGWKSWT 299


>gi|410448043|ref|ZP_11302130.1| putative membrane protein [Leptospira sp. Fiocruz LV3954]
 gi|410018124|gb|EKO80169.1| putative membrane protein [Leptospira sp. Fiocruz LV3954]
 gi|456875246|gb|EMF90470.1| putative membrane protein [Leptospira santarosai str. ST188]
          Length = 363

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 97/339 (28%), Positives = 147/339 (43%), Gaps = 93/339 (27%)

Query: 37  SLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PFFLF VG +I ++L
Sbjct: 2   SLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGTSIPISL 60

Query: 93  --KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             K   +R+D    +  R+  L+  G+ L   G +S A            +R+ GVLQRI
Sbjct: 61  YSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-----------LRIPGVLQRI 109

Query: 149 ALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
              Y +V SL  +F      K   V    I  ++ W        ++  +AL   + V   
Sbjct: 110 GFVYWVVASLCLVF----PGKKILVFLVPILLIHTW--------ILTQIALPGESVV--- 154

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                   S + GK                +   +IDR + G  H+     WR SK    
Sbjct: 155 --------SLEQGK----------------DIGAWIDRTIFGEKHL-----WRFSKT--- 182

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                              ++PEG LS V+S+++T+ GV  G ++        R ++   
Sbjct: 183 -------------------WDPEGFLSGVASVVTTLFGVLCGFIL------FLRERRNKI 217

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
           +G  +L   + L +  ++P+NK L+T SY   T+G + L
Sbjct: 218 LGLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLSFL 256


>gi|423304873|ref|ZP_17282872.1| hypothetical protein HMPREF1072_01812 [Bacteroides uniformis
           CL03T00C23]
 gi|423310012|ref|ZP_17287996.1| hypothetical protein HMPREF1073_02746 [Bacteroides uniformis
           CL03T12C37]
 gi|392682836|gb|EIY76175.1| hypothetical protein HMPREF1072_01812 [Bacteroides uniformis
           CL03T00C23]
 gi|392683302|gb|EIY76639.1| hypothetical protein HMPREF1073_02746 [Bacteroides uniformis
           CL03T12C37]
          Length = 394

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 91/378 (24%), Positives = 162/378 (42%), Gaps = 99/378 (26%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           ++R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 7   SKRILALDILRGVTIAGMIMVNNPG-SWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 65

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
             ++LK+        A  K++ RT+ +   G+ + G FS        APD+L++G  +  
Sbjct: 66  TYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKLWA 124

Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   S++ +  K                     +L+A  +
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMKHKHIP----------------YLIAGLL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              ++ L+ G                  G  +N T           N +  +D  +L   
Sbjct: 169 TGYFILLMCGN-----------------GFAYNET-----------NILSVVDHAILTPA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY            +D+  +                PEGLLS++ +I   ++G   G +
Sbjct: 201 HMY------------KDNGID----------------PEGLLSTIPAIAHVLLGFCVGRL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVTSGAAAL 366
           ++       R     +    L + G+ L F+  +     P+NK++++ +YV VT G A+ 
Sbjct: 233 MLDGNKSEDRASFLNSQLITLFLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASS 292

Query: 367 VFSAIYALVDCNIFGPWT 384
             + +  ++D   +  W+
Sbjct: 293 FLALLIWIIDVKGYKKWS 310


>gi|392390355|ref|YP_006426958.1| hypothetical protein Ornrh_0972 [Ornithobacterium rhinotracheale
           DSM 15997]
 gi|390521433|gb|AFL97164.1| hypothetical protein Ornrh_0972 [Ornithobacterium rhinotracheale
           DSM 15997]
          Length = 390

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 92/367 (25%), Positives = 149/367 (40%), Gaps = 96/367 (26%)

Query: 34  RLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           R  SLD+FRG  VALMILV++ G     +  ++HA W GC   D V PFFLF VG A+A 
Sbjct: 4   RYYSLDVFRGATVALMILVNNPGSWSAMFKPLTHAEWAGCTPTDLVFPFFLFAVGNAMAF 63

Query: 91  ALKRIPDRADAV--KKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLCGVLQR 147
            + R+      V  +KV+ RT  +   G+LL    F    D +        +R+ GVLQR
Sbjct: 64  VIPRMQKAGSQVFWRKVLKRTFLIFIIGLLLNWFPFVQWKDGILTFKHWENVRILGVLQR 123

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           IA +Y   +++  + K+ +                   +++  +L+VY  L       D 
Sbjct: 124 IAFAYFFAAIIAYYFKEKKVL-----------------IISFLLLIVYWLLALLLGGAD- 165

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----IDRKVLGINHMYHHPAWRRSK 263
                                       P +  G+    +D  +LG +HMYH        
Sbjct: 166 ----------------------------PYSMQGFWGTRVDLAILGESHMYHG------- 190

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                   EG            PF+PEG + ++SS    ++G   G +I+  +G +  L 
Sbjct: 191 --------EG-----------VPFDPEGFVGAISSTAQVLLGYLAGKIIM-AQGEVNWLF 230

Query: 324 QWVTMGFALLIFGLTLHFTNA-------------IPLNKQLYTLSYVCVTSGAAALVFSA 370
                   L    L++ F +A              P+ K++++ +YV  T+G A +  S 
Sbjct: 231 VRAPKTSELHYKVLSMLFVSAGILLVVAYVWQLDFPIIKKIWSSTYVLYTTGLAIITISI 290

Query: 371 IYALVDC 377
           +   ++ 
Sbjct: 291 MIWFIEV 297


>gi|311274235|ref|XP_003134250.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Sus scrofa]
          Length = 297

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 82/258 (31%), Positives = 119/258 (46%), Gaps = 46/258 (17%)

Query: 140 RLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFRLY----CW-HWLMAACVLV 193
           R+ GVLQR+ ++Y +V+++E+ F K V +   S    S F L      W  WL    +  
Sbjct: 6   RIPGVLQRLGVTYFVVAVLELLFAKPVPESCAS--ERSCFSLLDVTSSWPQWLFVLVLEG 63

Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           V+LAL +   VP      +      D GK  N T G          A GYIDR +LG +H
Sbjct: 64  VWLALTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDH 113

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           +Y HP    S A    +                 ++PEG+L +++SIL   +GV  G ++
Sbjct: 114 LYQHP----SPAVLYHT--------------KVAYDPEGILGTINSILMAYLGVQAGKIL 155

Query: 313 IH----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAAL 366
           ++    TKG L R   W      L+   LT    N   IP+NK L++ SYV   S +A L
Sbjct: 156 LYYKDRTKGILIRFAVWGCF-LGLISVALTKASENEGFIPVNKNLWSTSYVTTLSSSAFL 214

Query: 367 VFSAIYALVDCNIFGPWT 384
           +   +Y +VD    G WT
Sbjct: 215 ILLVLYPIVDVK--GLWT 230


>gi|388258355|ref|ZP_10135531.1| hypothetical protein O59_002752 [Cellvibrio sp. BR]
 gi|387937867|gb|EIK44422.1| hypothetical protein O59_002752 [Cellvibrio sp. BR]
          Length = 362

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 93/337 (27%), Positives = 144/337 (42%), Gaps = 85/337 (25%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           QR  +LD+ RGL +ALMILV+  G     +  + HA W+G    DFV PFF+FIVG ++ 
Sbjct: 4   QRFQALDVMRGLTLALMILVNTPGSWSFVYGPLLHADWHGATATDFVFPFFMFIVGSSMY 63

Query: 90  LALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
            A++ +   A A   +K++ R + L   G+LL    S  P    +  ++   R+ GVLQR
Sbjct: 64  FAMRGLRQLAPAAQAQKILRRVVLLFVIGVLL----SAYP----FTNNIENWRVMGVLQR 115

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           IA++Y   + + ++           GR           +M+A +L+ Y  LL        
Sbjct: 116 IAIAYGFAAFIILYFG-------FTGRV----------VMSAILLLGYWGLL-------- 150

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                N  +  Y              +   N V   D  VLG NH++             
Sbjct: 151 -----NIAADPY--------------SLEHNLVRQFDLAVLGANHLWQGKG--------- 182

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                              F+PEG+LS+V SI++ IIG     V++ ++     L Q   
Sbjct: 183 -----------------LAFDPEGILSTVPSIVNVIIGFEATRVLLASEDKAKALSQLFV 225

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
               L+   L  +     P+NK L+T S+V +T G A
Sbjct: 226 AALLLIGLALVWNLL--FPINKSLWTSSFVVLTCGVA 260


>gi|114048505|ref|YP_739055.1| hypothetical protein Shewmr7_3014 [Shewanella sp. MR-7]
 gi|113889947|gb|ABI43998.1| conserved hypothetical protein [Shewanella sp. MR-7]
          Length = 395

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 102/393 (25%), Positives = 156/393 (39%), Gaps = 98/393 (24%)

Query: 7   ETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHA 55
            TT    +  +  +  +   K      RL SLD  RG  +           AL+IL   A
Sbjct: 2   STTAPESVANTGVNAQNAAAKKRQSKPRLMSLDALRGFDMFWILGGEALFGALLILTGWA 61

Query: 56  GGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIP-----DRADAVKKVIFR 108
           G  W   ++ H+ WNG    D + P F+F+ GVA+ L+ KR+      +R    +  I R
Sbjct: 62  GWQWGDTQMHHSEWNGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPMHERMPVYRHGIKR 121

Query: 109 TLKLLFWGILLQGGF-SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
              LL  GIL   G+ + AP      VD   +R   VL RIA ++   +L+         
Sbjct: 122 LFLLLLLGILYNHGWGTGAP------VDPEKVRYASVLGRIAFAWFFAALL--------- 166

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
                          WH  +   VLV    L+    V  W                    
Sbjct: 167 --------------VWHTSLRTQVLVALGILVAYGAVQLW---------------LPFPG 197

Query: 228 GVRAKLNPPCNAVGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
           G   +L+P  +   Y+D  +L G+++    P                             
Sbjct: 198 GQAGELSPTESINAYVDSLLLPGVSYQGRTP----------------------------- 228

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNA 344
            +PEG+LS++ ++++ + GV  GH I+  H KG  A++      G   L  G  L     
Sbjct: 229 -DPEGVLSTLPAVVNALAGVFVGHFIVKSHPKGEWAKVGLLSVAGGVCLALGWLLD--GV 285

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           IP+NK+L+T S+V VTSG + L+ +  YA+VD 
Sbjct: 286 IPVNKELWTSSFVLVTSGWSMLLLALFYAIVDV 318


>gi|417780880|ref|ZP_12428636.1| PF07786 family protein [Leptospira weilii str. 2006001853]
 gi|410778851|gb|EKR63473.1| PF07786 family protein [Leptospira weilii str. 2006001853]
          Length = 383

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 100/348 (28%), Positives = 148/348 (42%), Gaps = 84/348 (24%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            ++KS     R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKKSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAKWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           FLF VG +I ++L  K   DR+     +  R++ L+  G+LL      +  EL       
Sbjct: 60  FLFAVGGSIPISLYSKNGIDRSRIWVGICKRSVNLILLGLLLNFFGEWSFAEL------- 112

Query: 138 MIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             R+ GVLQRI   Y +V SL  IF        + V  F I  L    W++         
Sbjct: 113 --RIPGVLQRIGFVYWVVASLYLIF------PGKKVLIFLIPVLLVHTWILTHI------ 158

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
                   P  +  +  +   D G                     +IDR ++G  H+   
Sbjct: 159 -------APPGEAMVSLEQGKDIG--------------------AWIDRVIIGEKHL--- 188

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
             W+ SK                       ++PEGLLS V+SI +++ GV  G ++   +
Sbjct: 189 --WKFSKT----------------------WDPEGLLSGVASIATSLFGVLCGFILFLRE 224

Query: 317 GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G   R +   T G   L   + L +  ++P+NK L+T SY   T+G +
Sbjct: 225 GG-GRSRVLSTFGLGFLFTFVGLLWDQSLPMNKSLWTGSYAAYTAGLS 271


>gi|225010297|ref|ZP_03700769.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C]
 gi|225005776|gb|EEG43726.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C]
          Length = 363

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 97/353 (27%), Positives = 152/353 (43%), Gaps = 81/353 (22%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVA 87
            +R+ S+DIFRG  + LMILV+  G  W  +     HA W+G  L D V PFF+FIVGV+
Sbjct: 4   NKRVPSVDIFRGATLLLMILVNTPG-TWSAVYAPFLHASWHGYTLTDLVFPFFIFIVGVS 62

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           I+L+ K          K+  R+LKL+  G+ L G F+ +     +   V  IR  GVLQR
Sbjct: 63  ISLSYKDKKLNGPVFFKLTKRSLKLIGLGLFL-GAFTIS---FPFIKAVENIRFPGVLQR 118

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           I L +   S++ ++      K  +     I  L  W W+                     
Sbjct: 119 IGLVFFFASIIYLW----GSKRSTALIIGIILLAYWLWM--------------------- 153

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
            F  ++  +  Y +  N             N   +ID KVLG +HM     W+       
Sbjct: 154 GFLPLDGVAPTYERAAN-------------NWANFIDFKVLG-SHM-----WKPD----- 189

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                              ++PEG+LS++ +I + ++G   G V+          K  + 
Sbjct: 190 -------------------YDPEGILSTLPAIATALLGTLAGDVLRSNTYQ----KVSLL 226

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIF 380
           +   L++ GL      + P+NK L++ S+V VT+G A LV   ++ L +   F
Sbjct: 227 LISGLVLLGLGHLLDLSFPINKALWSSSFVMVTAGWANLVLGILFYLREVKKF 279


>gi|29349027|ref|NP_812530.1| hypothetical protein BT_3619 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|298386734|ref|ZP_06996289.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|383124379|ref|ZP_09945043.1| hypothetical protein BSIG_3594 [Bacteroides sp. 1_1_6]
 gi|29340934|gb|AAO78724.1| putative transmembrane protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251839125|gb|EES67209.1| hypothetical protein BSIG_3594 [Bacteroides sp. 1_1_6]
 gi|298260408|gb|EFI03277.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 372

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 88/363 (24%), Positives = 155/363 (42%), Gaps = 89/363 (24%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             +RL +LD+ RG+ +A MILV+  G     +  + HA W G    D V PFF+FI+G++
Sbjct: 6   SNKRLLALDVMRGITIAGMILVNTPGSWQHAYAPLKHAEWIGLTPTDLVFPFFMFIMGIS 65

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGI----LLQGGFSHAPDELTYGVDVRMIRL 141
             ++L++        A  K++ RT+ +   GI    L    F H P        +  IR+
Sbjct: 66  TYISLRKYNFTFSVPAGLKILKRTVIIFLIGIGISWLSILCFQHDP------FPIDQIRI 119

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GV+QR+AL Y + ++V +  K                 + +   + A +L+ Y A+L  
Sbjct: 120 LGVMQRLALGYGVTAIVALLMK-----------------HKYIPYLIAVLLISYFAIL-- 160

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                         +   G V++ T           N +  +DR VLG  H+Y       
Sbjct: 161 --------------ALGNGYVYDET-----------NILSIVDRAVLGQAHIYGGQI--- 192

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                                     +PEGLLS++S+I   +IG   G +++  K    +
Sbjct: 193 -------------------------LDPEGLLSTISAIAHVLIGFCAGKLLMEVKDIHEK 227

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
           L++   +G  L   G  L + +  P+ K++++ S+V VT G  +   + +  ++D   + 
Sbjct: 228 LERLFLIGTILTFAGFLLSYGS--PICKKVWSPSFVLVTCGLGSSFLALLVWIIDIKGYK 285

Query: 382 PWT 384
            W+
Sbjct: 286 NWS 288


>gi|374311063|ref|YP_005057493.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358753073|gb|AEU36463.1| protein of unknown function DUF1624 [Granulicella mallensis
           MP5ACTX8]
          Length = 435

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 101/368 (27%), Positives = 156/368 (42%), Gaps = 83/368 (22%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP----EISHAPWNGCNLADFVMP 78
            Q E++  K  R+ S+D+ RG+ +ALMILV+  G DW     ++ HA WNG  L D V P
Sbjct: 38  SQTERTVSKPGRVLSVDVLRGITIALMILVNDPG-DWDHIFGQLDHAAWNGWTLTDMVFP 96

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELT-YGVDV- 136
            FLF++G +I  +L+    R +               G L    F+ A   L  Y V   
Sbjct: 97  AFLFLMGASIIFSLQARIARGNCK-------------GTLAGHIFARAGKILALYWVLAF 143

Query: 137 --RM---IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
             RM   IR  GVL RIAL YLL SLV + T+ V+     V                A +
Sbjct: 144 FPRMHWTIRWFGVLPRIALCYLLASLVLLATRRVRVLIAIV----------------AFL 187

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
           LV Y  LL    VP      +     D              ++   N   +IDR V   +
Sbjct: 188 LVGYWVLLRWVPVPG-----LGTPMRDI-----------PFMDQNANLASWIDRGVSSWS 231

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
             + H                G L +          +PEGLLS++ ++ +T++G   G  
Sbjct: 232 LRWLH---------------TGTLYRKTR-------DPEGLLSTLPAVATTLLGALAGMW 269

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLH---FTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           +I+ +  +  +++ + +G A +          ++   P+NK L+T S+V + +G  AL  
Sbjct: 270 MINGQRVVNGMRR-MRIGLAAMGVAGVAAGVLWSRWFPINKNLWTSSFVLLMAGWTALAL 328

Query: 369 SAIYALVD 376
           +    L+D
Sbjct: 329 AGCSWLID 336


>gi|355694569|gb|AER99714.1| heparan-alpha-glucosaminide N-acetyltransferase [Mustela putorius
           furo]
          Length = 296

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 78/257 (30%), Positives = 118/257 (45%), Gaps = 42/257 (16%)

Query: 139 IRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVV 194
           +R+ GVLQR+ ++Y +V+++E IF K V +   S       R  ++ W  WL    +  +
Sbjct: 5   VRIPGVLQRLGVTYFVVAVLELIFAKPVPESCASERSCFSLRDIIFSWPQWLFILMLESI 64

Query: 195 YLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           +LAL +   VP      +      D GK  N T G          A GYIDR +LG +H+
Sbjct: 65  WLALTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHI 114

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y HP    S A    +                 ++PEG+L S++SI+   +GV  G +++
Sbjct: 115 YQHP----SSAVLYHT--------------QVAYDPEGILGSINSIVMAFLGVQAGKILL 156

Query: 314 H----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALV 367
           +    TK  L R   W      L+   LT    N   IP+NK L+++SYV   S  A  +
Sbjct: 157 YYKDQTKDILIRFTAWSCF-LGLISVALTKFSENEGFIPINKNLWSVSYVTTLSSFAFFI 215

Query: 368 FSAIYALVDCNIFGPWT 384
              +Y +VD    G WT
Sbjct: 216 LLILYPIVDVK--GLWT 230


>gi|359438686|ref|ZP_09228688.1| hypothetical protein P20311_2740 [Pseudoalteromonas sp. BSi20311]
 gi|359445329|ref|ZP_09235071.1| hypothetical protein P20439_1393 [Pseudoalteromonas sp. BSi20439]
 gi|358026628|dbj|GAA64937.1| hypothetical protein P20311_2740 [Pseudoalteromonas sp. BSi20311]
 gi|358040838|dbj|GAA71320.1| hypothetical protein P20439_1393 [Pseudoalteromonas sp. BSi20439]
          Length = 359

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 88/354 (24%), Positives = 149/354 (42%), Gaps = 85/354 (24%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAIA 89
           R  +LD  RGL +ALMILV+  G  W  +     HA W+GC   D + PFF+FI+G A+ 
Sbjct: 3   RYKALDAMRGLTIALMILVNTPG-SWSHVYAPLLHADWHGCTPTDVIFPFFMFIIGSAMF 61

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            + K+    A A +      L+L+  G ++      A +   +  ++  +R+ GVLQRI 
Sbjct: 62  FSFKKTNSAASASQ-----VLRLVKRGAIIF-AIGLALNIYPFTTNIENLRILGVLQRIG 115

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           ++Y+L S+  +F          + R  +  L       +  +LV Y  LL          
Sbjct: 116 IAYILASICVLF----------LNRRGVLTL-------SVIILVAYWLLL---------- 148

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
                          ++ G         N V  +D  VLG +H++               
Sbjct: 149 ---------------LSVGAENAYTLEHNLVRAVDIAVLGESHLWQGKG----------- 182

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
                            F+PEGL+S++ +++S + G     ++  T    A +K+   + 
Sbjct: 183 ---------------LAFDPEGLISTLPAVVSVLFGFEVTRLLTSTSCQWASIKR---LL 224

Query: 330 FALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGP 382
              ++  +    T+ + P+NK L+T S+V  TSG A +V +    L  C+I  P
Sbjct: 225 VIGVVAVVIGQLTSMVMPINKSLWTSSFVIYTSGIACIVLAFFVWL--CDIVKP 276


>gi|392555555|ref|ZP_10302692.1| hypothetical protein PundN2_08983 [Pseudoalteromonas undina NCIMB
           2128]
          Length = 359

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 88/354 (24%), Positives = 149/354 (42%), Gaps = 85/354 (24%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAIA 89
           R  +LD  RGL +ALMILV+  G  W  +     HA W+GC   D + PFF+FI+G A+ 
Sbjct: 3   RYKALDAMRGLTIALMILVNTPG-SWSHVYAPLLHADWHGCTPTDVIFPFFMFIIGSAMF 61

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
            + K+    A A +      L+L+  G ++      A +   +  ++  +R+ GVLQRI 
Sbjct: 62  FSFKKTNSAASASQ-----VLRLVKRGAIIF-AIGLALNIYPFTTNIENLRILGVLQRIG 115

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           ++Y+L S+  +F          + R  +  L       +  +LV Y  LL          
Sbjct: 116 IAYILASICVLF----------LNRRGVLTL-------SVIILVAYWLLL---------- 148

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
                          ++ G         N V  +D  VLG +H++               
Sbjct: 149 ---------------LSVGAENAYTLEHNLVRAVDIAVLGESHLWQGKG----------- 182

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
                            F+PEGL+S++ +++S + G     ++  T    A +K+   + 
Sbjct: 183 ---------------LAFDPEGLISTLPAVVSVLFGFEVTRLLTSTSCQWASIKR---LL 224

Query: 330 FALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGP 382
              ++  +    T+ + P+NK L+T S+V  TSG A +V +    L  C+I  P
Sbjct: 225 VIGVVAVVIGQLTSMVMPINKSLWTSSFVIYTSGIACIVLAFFVWL--CDIVKP 276


>gi|418023168|ref|ZP_12662153.1| hypothetical protein Sbal625DRAFT_1278 [Shewanella baltica OS625]
 gi|353537051|gb|EHC06608.1| hypothetical protein Sbal625DRAFT_1278 [Shewanella baltica OS625]
          Length = 384

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 97/364 (26%), Positives = 151/364 (41%), Gaps = 94/364 (25%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G N  D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFNFYDLIFPLF 77

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           +F+ GVA+ L+ KR+     + +  ++R  +K LF  +LL   ++H         D   I
Sbjct: 78  IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R   VL RIA ++   +                       L  WH  +   ++V  L +L
Sbjct: 137 RYASVLGRIAFAWFFAA-----------------------LLVWHTSLRTQIIVA-LGIL 172

Query: 200 YGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GINHMYH 255
            G                 YG +        G    L+P  +   Y+D  +L G+++   
Sbjct: 173 LG-----------------YGAMQLWLPFPGGQAGVLSPTESINAYVDSILLPGVSYQGR 215

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-- 313
            P                              +PEGLLS++ +I++ + GV  GH ++  
Sbjct: 216 TP------------------------------DPEGLLSTIPAIVNALTGVFVGHFLVKS 245

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H KG  A++      G  LL FG  L     IP+NK+L+T S+V VTSG + ++ +  YA
Sbjct: 246 HPKGEWAKVGLLAAAGGILLAFGWLLDLV--IPVNKELWTSSFVLVTSGWSMILLAVFYA 303

Query: 374 LVDC 377
           LVD 
Sbjct: 304 LVDV 307


>gi|113971267|ref|YP_735060.1| hypothetical protein Shewmr4_2932 [Shewanella sp. MR-4]
 gi|113885951|gb|ABI40003.1| conserved hypothetical protein [Shewanella sp. MR-4]
          Length = 395

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 155/377 (41%), Gaps = 92/377 (24%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAP 66
           +  D   K      RL SLD  RG  +           AL+IL   AG  W   ++ H+ 
Sbjct: 15  NAQDAAAKKSQSKPRLMSLDALRGFDMFWILGGEALFGALLILTGWAGWQWGDTQMHHSE 74

Query: 67  WNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSH 125
           WNG    D + P F+F+ GVA+ L+ KR+       +  ++R  +K LF  +LL   ++H
Sbjct: 75  WNGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPMHERMPVYRHGIKRLFLLLLLGILYNH 134

Query: 126 APDELTYGV--DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
                  GV  D   +R   VL RIA ++   +L+   T                R    
Sbjct: 135 GWGT---GVPADPEKVRYASVLGRIAFAWFFAALLVWHTS--------------LRTQV- 176

Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
             L+A  +LV Y A+      P  Q                   GV   L+P  +   Y+
Sbjct: 177 --LVALGILVAYGAMQLWLPFPGGQ------------------AGV---LSPTESINAYV 213

Query: 244 DRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
           D  +L G+++    P                              +PEG+LS++ ++++ 
Sbjct: 214 DSLLLPGVSYQGRTP------------------------------DPEGVLSTLPAVVNA 243

Query: 303 IIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           + GV  GH I+  H KG  A++      G   L  G  L     IP+NK+L+T S+V VT
Sbjct: 244 LAGVFVGHFIVKSHPKGEWAKVGLLSVAGGVCLALGWLLD--GVIPVNKELWTSSFVLVT 301

Query: 361 SGAAALVFSAIYALVDC 377
           SG + L+ +  YALVD 
Sbjct: 302 SGWSMLLLALFYALVDV 318


>gi|149176468|ref|ZP_01855081.1| hypothetical protein PM8797T_29827 [Planctomyces maris DSM 8797]
 gi|148844581|gb|EDL58931.1| hypothetical protein PM8797T_29827 [Planctomyces maris DSM 8797]
          Length = 518

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/426 (22%), Positives = 173/426 (40%), Gaps = 79/426 (18%)

Query: 3   EIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALM------------- 49
           E+ AE        ++E DVS +++K     QRL SLD +RG  +  M             
Sbjct: 43  EVSAEVEPAKA--VTEKDVSLKEKKKPETNQRLVSLDAYRGFVMLAMASGGLAIASVVRN 100

Query: 50  ---ILVDHAGGDWP------------EISHAPWNGCNLADFVMPFFLFIVGVAIALALKR 94
              +L  + G  W             ++SH  W G    D + P F+F+VGV++  ++++
Sbjct: 101 SPEVLDQYNGTQWESSWKTLWQTLSYQLSHVEWTGAGFWDLIQPSFMFMVGVSMPFSVRK 160

Query: 95  IPDRADAVKKV----IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
              + D+  K+    IFR + L+  G+ L   FS         V         VL +I L
Sbjct: 161 RRQKGDSTFKIWMHAIFRAILLVALGVFLSSQFSPERGFTYEDVPQTNFTFANVLCQIGL 220

Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
            YL+V     F  +     Q +G  +I   Y W +          LA +  TY+ +    
Sbjct: 221 GYLVV----FFYVNRSFATQMIGVVTILGGY-WFFFYQYMPPEDELAAV-KTYLKE---- 270

Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
           + +KD A++ +      G+ +  N   NA   +DR++L +   Y +P         +D P
Sbjct: 271 VQHKDEAEWSQF----SGIGSAWNKHTNAAAAVDRQLLNMFPRYDNP---------KDDP 317

Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
            +G        W +        L+ + SI + + G+  G ++I  +    ++K  +  G 
Sbjct: 318 DQGD-----TFWVNK--GGYQTLNFIPSIATMLFGLMAGQLLISNRLEKMKVKWLLQAG- 369

Query: 331 ALLIFGLTLHFTNAI-------------PLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
            L+ FG+++    +I             P+ K++++  +   ++G A    +  Y ++D 
Sbjct: 370 -LICFGVSMLLDTSIWPVNINNWEWHLVPIVKRIWSPGWAIFSAGWAFWFLAVFYWIIDV 428

Query: 378 NIFGPW 383
             +  W
Sbjct: 429 KGYKKW 434


>gi|452822118|gb|EME29140.1| heparan-alpha-glucosaminide N-acetyltransferase isoform 2
           [Galdieria sulphuraria]
          Length = 351

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 76/330 (23%), Positives = 143/330 (43%), Gaps = 83/330 (25%)

Query: 64  HAPWNGCNLADFVMPFFLFIVGVAIALALKRIP-------DRADAVKKVIFRTLKLLFWG 116
           H  W   ++AD + PFFLF+VG +I  A +++P       ++  A++ V  RT+KL   G
Sbjct: 14  HESWFSWHMADLIFPFFLFMVGSSIYFAFRKVPREVENSEEKDKALRSVTSRTIKLFLVG 73

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
           +LL    S        G     +R  G+LQRIA+ Y  V+ + +F       +  V +++
Sbjct: 74  VLLNVPLS--------GFRWETLRWMGILQRIAICYGCVAFLFLFV------NSRVIQYA 119

Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
                     + + + +++ +LLYG  VP+                    C +  +L   
Sbjct: 120 ----------LVSVLFLLHTSLLYGLIVPN--------------------CLISERLTRA 149

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
           C+A  Y+D  +LG  H+Y                             H  ++PEG+LS++
Sbjct: 150 CSAQSYLDTMILGGKHLYF----------------------------HLEYDPEGILSTL 181

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL--HFTNAIPLNKQLYTL 354
            + ++T  G+    +    +    R+     +G + +   + L   F +++P++K L+T 
Sbjct: 182 MATINTFAGLEAARLTSSLRYVNQRILWCFLIGSSFVGIEILLVDCFPDSVPISKPLWTA 241

Query: 355 SYVCVTSGAA--ALVFSAIYALVDCNIFGP 382
           S++ +T G +   L F  ++A V   +  P
Sbjct: 242 SFLFLTVGCSFWCLSFCGLWAKVTPRLVQP 271


>gi|452822119|gb|EME29141.1| heparan-alpha-glucosaminide N-acetyltransferase isoform 1
           [Galdieria sulphuraria]
          Length = 356

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 76/330 (23%), Positives = 143/330 (43%), Gaps = 83/330 (25%)

Query: 64  HAPWNGCNLADFVMPFFLFIVGVAIALALKRIP-------DRADAVKKVIFRTLKLLFWG 116
           H  W   ++AD + PFFLF+VG +I  A +++P       ++  A++ V  RT+KL   G
Sbjct: 19  HESWFSWHMADLIFPFFLFMVGSSIYFAFRKVPREVENSEEKDKALRSVTSRTIKLFLVG 78

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
           +LL    S        G     +R  G+LQRIA+ Y  V+ + +F       +  V +++
Sbjct: 79  VLLNVPLS--------GFRWETLRWMGILQRIAICYGCVAFLFLFV------NSRVIQYA 124

Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
                     + + + +++ +LLYG  VP+                    C +  +L   
Sbjct: 125 ----------LVSVLFLLHTSLLYGLIVPN--------------------CLISERLTRA 154

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
           C+A  Y+D  +LG  H+Y                             H  ++PEG+LS++
Sbjct: 155 CSAQSYLDTMILGGKHLYF----------------------------HLEYDPEGILSTL 186

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL--HFTNAIPLNKQLYTL 354
            + ++T  G+    +    +    R+     +G + +   + L   F +++P++K L+T 
Sbjct: 187 MATINTFAGLEAARLTSSLRYVNQRILWCFLIGSSFVGIEILLVDCFPDSVPISKPLWTA 246

Query: 355 SYVCVTSGAA--ALVFSAIYALVDCNIFGP 382
           S++ +T G +   L F  ++A V   +  P
Sbjct: 247 SFLFLTVGCSFWCLSFCGLWAKVTPRLVQP 276


>gi|375253854|ref|YP_005013021.1| hypothetical protein BFO_0041 [Tannerella forsythia ATCC 43037]
 gi|363406758|gb|AEW20444.1| putative membrane protein [Tannerella forsythia ATCC 43037]
          Length = 390

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 91/372 (24%), Positives = 158/372 (42%), Gaps = 93/372 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            ++RL +LDI RG+ +A MI+V++ G     +  + HA W+G    D V PFF+FI+G++
Sbjct: 8   SSRRLLALDILRGITIAGMIMVNNPGSWSFVYAPLGHAAWHGLTPTDLVFPFFMFIMGIS 67

Query: 88  IALALKR--IPDRADAVKKVIFRT--------------LKLLFWGILLQGGFSHAPDELT 131
             ++LK+        A++K+I RT              L    W  L  GG S       
Sbjct: 68  TYISLKKYDFTFSYSAMRKIIRRTAVIFAIGLGLAWLGLTCRTWHGLADGGLSFGARLWQ 127

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
              +   +R+ GV+QR+ALSY   +L+ +  +  +                        +
Sbjct: 128 SVSNFGHLRILGVMQRLALSYGATALIALAIRHHR------------------------I 163

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
             + +ALL G       +T++    A  G  +N T           N +  +DR VLG+N
Sbjct: 164 PYLIVALLGG-------YTVLLL--AGNGLAYNET-----------NILSIVDRAVLGVN 203

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           H Y                      KD         EPEGLLS++ +I   +IG   G  
Sbjct: 204 HTY----------------------KD------MGIEPEGLLSTLPAIAHVLIGFCCGRA 235

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++       ++ +   +G A+   G  L +    P+NK++++ ++V +T G A+ + + +
Sbjct: 236 MLGVTEVRDKMLRLFLIGTAMAFAGWLLSY--GCPINKKIWSPTFVLITCGMASGLLALL 293

Query: 372 YALVDCNIFGPW 383
             ++D      W
Sbjct: 294 IWIIDVKRHTKW 305


>gi|320106288|ref|YP_004181878.1| hypothetical protein AciPR4_1053 [Terriglobus saanensis SP1PR4]
 gi|319924809|gb|ADV81884.1| hypothetical protein AciPR4_1053 [Terriglobus saanensis SP1PR4]
          Length = 394

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 92/338 (27%), Positives = 145/338 (42%), Gaps = 70/338 (20%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLF 82
           E     TQR+ S+D+ RGL VA MILV+  G     +  + HAPWNG    D V P FLF
Sbjct: 3   ETKAAPTQRILSVDVLRGLTVAFMILVNDPGDGHVAYAPLDHAPWNGWTPTDMVFPTFLF 62

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           +VG +I  ++     R D+  ++  + ++   + + +       P +  YG   RM RL 
Sbjct: 63  LVGCSIVFSITSRLKRGDSKSRIALQVIRRTIYLLAINYAIRLIP-QFHYG---RM-RLF 117

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV--LVVYLALLY 200
           GVL RIA+ YL+ +L+ ++ +  +                  W+  A V  LV Y AL+ 
Sbjct: 118 GVLPRIAICYLIAALLFLWLQRAR------------------WIAVAVVTLLVGYWALMR 159

Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
              +P     + +    D    FN             N   YIDR   G N         
Sbjct: 160 FVPIPGAGMPVHDFPLMDQ---FN-------------NLPSYIDR---GFNDF------- 193

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
                TQ     G L +          +PEG+LS++ ++ + ++GV  G ++   + +  
Sbjct: 194 -----TQRFLHTGSLYEKTR-------DPEGILSTLPAVGTALLGVLNGKLL---RSNTP 238

Query: 321 RLK-QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYV 357
            +K   + +G  +L   L   +    P NK L+T SYV
Sbjct: 239 PVKVSAILLGGGVLSIALGYLWNPWFPFNKNLWTSSYV 276


>gi|313203961|ref|YP_004042618.1| hypothetical protein Palpr_1487 [Paludibacter propionicigenes WB4]
 gi|312443277|gb|ADQ79633.1| hypothetical protein Palpr_1487 [Paludibacter propionicigenes WB4]
          Length = 382

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 92/356 (25%), Positives = 146/356 (41%), Gaps = 85/356 (23%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD----WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            R  +LDIFRG+ V  MI+V+ +G      WP + HA WNG    D V P FLF VG A+
Sbjct: 13  SRFTALDIFRGMTVCFMIIVNTSGNGATTYWP-LMHADWNGFTPTDLVFPSFLFAVGNAL 71

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQG--GFSHAPDELTYGVDVRMIRLCGV 144
             A+KR     ++D + K+  RT  +   G L+     F    +       +   R+ GV
Sbjct: 72  GFAMKRWDTMKQSDVLLKIFKRTALIFLIGYLMYWFPFFRLNAESHLILSPISQTRIMGV 131

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
           LQRIAL Y + +L+  +    +                  W+    +L  ++ LL     
Sbjct: 132 LQRIALCYGITALLVYYLGTKRTI----------------WVGVVSLLAYWVLLL----- 170

Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                         +G+         A+ +   NAV  +D  +LG +H+Y    +     
Sbjct: 171 -------------AFGE-------AGAEFSKTGNAVLRLDIWLLGTHHLYGGEGF----- 205

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----LA 320
                                PF+PEG+LS++ ++ + I G   G  +   KG     LA
Sbjct: 206 ---------------------PFDPEGVLSTLPALFNVIAGFAVGRYLQQQKGKSYESLA 244

Query: 321 RLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           +L         + +  L   + + +P+NK+L+T SY  +T G   L+ S I    D
Sbjct: 245 KLLL-----VGIGLLVLAYCWNSWMPINKKLWTSSYAVLTVGLDCLLLSVIIYFTD 295


>gi|383643230|ref|ZP_09955636.1| hypothetical protein SeloA3_10334 [Sphingomonas elodea ATCC 31461]
          Length = 382

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 95/359 (26%), Positives = 157/359 (43%), Gaps = 90/359 (25%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           R  +LD+FRG  + LMILV+ +G     +P++ HA W G  LAD V P FLF +G A++ 
Sbjct: 17  RFLALDVFRGATIFLMILVNTSGPGAEPYPQLVHAKWIGFTLADLVFPTFLFAMGNAMSF 76

Query: 91  ALKRI----PDRADAVKK--VIFRTLKLLFW-GILLQG--GFSHAPDELTYGVDVRMIRL 141
           A ++     P  A   ++  +IF    L++W   + QG  G++  P  LT        R+
Sbjct: 77  AFRKPVATGPFLARLFRRGAIIFVLGYLMYWFPFVEQGPDGWALKPFALT--------RV 128

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GVLQR+AL Y+L  L+                          WL    +L+  +A+L G
Sbjct: 129 PGVLQRLALCYVLAGLMI------------------------RWLKPRQLLLAGIAMLLG 164

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                W   ++   +   G  F+    +  +          ID  +LG  H+Y       
Sbjct: 165 Y----WTILLVFSPA---GMAFDKYANIGTQ----------IDLWLLGPGHLY------- 200

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                         +KD      A F+PEGLL ++ + ++ I G   G  I+        
Sbjct: 201 --------------KKD------AGFDPEGLLGTLPATVNVIAGYLAGLAIVQGGDLRRT 240

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIF 380
           + +   +G AL++ GL   ++   P+ K+L+T SYV +T G   ++ + +  L++   F
Sbjct: 241 VGRMALVGAALVLAGLA--WSPWFPIAKKLWTGSYVLLTVGIDLVLLAGVIGLIEIAGF 297


>gi|312131163|ref|YP_003998503.1| hypothetical protein Lbys_2486 [Leadbetterella byssophila DSM
           17132]
 gi|311907709|gb|ADQ18150.1| hypothetical protein Lbys_2486 [Leadbetterella byssophila DSM
           17132]
          Length = 413

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 103/398 (25%), Positives = 160/398 (40%), Gaps = 127/398 (31%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
           L  QR+ SLD+FRGL + LMI V++ G DW  +     HA W+G    D V PFF+F +G
Sbjct: 3   LAKQRIVSLDVFRGLTMILMITVNNPG-DWSNVYAPLLHAEWHGWTPTDLVFPFFVFAMG 61

Query: 86  VAIALALK-----------RIPDR---------------------ADAVKKVIFRTLKLL 113
           +A+  ++K           +I  R                     A  +  ++FR +   
Sbjct: 62  MALPFSMKPGSGLSKDDFLKILARSARLIALGLFLNFFSKIEFGNAQGITLLLFRLMITG 121

Query: 114 FWGILLQGGFSHAPDELTYGVDVRM--------------IRLCGVLQRIALSYLLVSLVE 159
           F G LL G F       T    + +              +R+ GVLQR+   Y   +++ 
Sbjct: 122 FVGFLLMGNFPTKIKLYTALALLGLMLALAYSGLPHFAQVRIPGVLQRLGTVYFFAAILY 181

Query: 160 I-FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSAD 218
           + F+  VQ                  W +   VLV+Y  LL    VP    T   K    
Sbjct: 182 LAFSLRVQ------------------WGIGLSVLVIYWLLLAYIPVPGSGVTGFEKGE-- 221

Query: 219 YGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKD 278
                              N   +ID  VLG +H+     W  SK               
Sbjct: 222 -------------------NLPAWIDSIVLG-DHV-----WSSSK--------------- 241

Query: 279 APSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT 338
                  P++PEG+LS++ +I+S ++G   G  +   K      K+ +  G  LLI GL 
Sbjct: 242 -------PWDPEGVLSTLPAIISCLLGAWAGVFLREDK------KKLLLTGVILLICGLA 288

Query: 339 LHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             ++   P+NK L+T S+V +T+G  +++ S +  +VD
Sbjct: 289 --WSTFFPINKALWTSSFVLLTAGLGSIIVSLLGFVVD 324


>gi|421097001|ref|ZP_15557700.1| putative membrane protein [Leptospira borgpetersenii str.
           200901122]
 gi|410800246|gb|EKS02307.1| putative membrane protein [Leptospira borgpetersenii str.
           200901122]
          Length = 383

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 101/357 (28%), Positives = 153/357 (42%), Gaps = 102/357 (28%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
            ++KS    +R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PF
Sbjct: 1   MEKKSTQNKERILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPF 59

Query: 80  FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVD 135
           FLF VGV+I ++L  K   +R+     +  R++ L+  G+LL   G +S A         
Sbjct: 60  FLFAVGVSIPISLYSKNGINRSKVWIGICIRSISLILLGLLLNFFGEWSFAE-------- 111

Query: 136 VRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
              +R+ GVLQRI   Y  V SL  IF      K   +    +  ++ W           
Sbjct: 112 ---LRVPGVLQRIGFVYWTVASLYLIF----PGKKVLIFLILVLLVHTW----------- 153

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
               +    +P  + T+  +   D G                     +IDR ++G  H+ 
Sbjct: 154 ----ILTHIIPPGESTVSLEQGKDIG--------------------AWIDRMIIGEKHL- 188

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
               W+ SK                       ++PEGLLS ++SI +++ GV  G ++  
Sbjct: 189 ----WKFSKT----------------------WDPEGLLSGIASIATSLFGVLCGFILF- 221

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFT-------NAIPLNKQLYTLSYVCVTSGAA 364
                  L++ V     L IFGL   FT        ++P+NK L+T SY   T+G +
Sbjct: 222 -------LREGVGKNRVLGIFGLGFLFTFVGLLWDQSLPMNKSLWTGSYAAYTAGLS 271


>gi|393782159|ref|ZP_10370348.1| hypothetical protein HMPREF1071_01216 [Bacteroides salyersiae
           CL02T12C01]
 gi|392674193|gb|EIY67642.1| hypothetical protein HMPREF1071_01216 [Bacteroides salyersiae
           CL02T12C01]
          Length = 387

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 86/372 (23%), Positives = 156/372 (41%), Gaps = 93/372 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           ++RL +LDI RG+ +A MI+V++ G     +  + HA WNG    D V PFF+FI+G++ 
Sbjct: 6   SKRLLALDILRGITIAGMIMVNNPGSWSYVYAPLGHAQWNGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRT-------LKLLFWGILLQGGFSHAPDELTYGVDVRM- 138
            ++L++        A  K++ RT       L L ++ +  +   S + +E+++   +   
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGLAWFSMFCRTWNSLSAEEISFFSRLGQS 125

Query: 139 ------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
                 IR+ GV+QR+AL Y   ++V +  K                       + A +L
Sbjct: 126 IWTFDHIRILGVMQRLALCYGATAIVALTMKHKHIP-----------------YLIATLL 168

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           + Y  LL            +  +  +Y                  N +  +DR VLG  H
Sbjct: 169 IGYFILL------------VTGNGFEYNS---------------TNILSVVDRAVLGEAH 201

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           MY                      KD         +PEGLLS++ +I   +IG   G ++
Sbjct: 202 MY----------------------KD------NGIDPEGLLSTIPAIAHVLIGFCVGKLL 233

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +  K    +L +   +G  L   G  L +    P+NK++++ ++  VT G  +   + + 
Sbjct: 234 MEVKDINEKLGRLFLIGTILTFLGFLLSY--GCPINKKIWSPTFAIVTCGLGSSFLALLI 291

Query: 373 ALVDCNIFGPWT 384
            ++D   +  W+
Sbjct: 292 WIIDVKGYKSWS 303


>gi|152999681|ref|YP_001365362.1| hypothetical protein Shew185_1147 [Shewanella baltica OS185]
 gi|151364299|gb|ABS07299.1| conserved hypothetical protein [Shewanella baltica OS185]
          Length = 384

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 97/364 (26%), Positives = 150/364 (41%), Gaps = 94/364 (25%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G    D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYDLIFPLF 77

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           +F+ GVA+ L+ KR+     + +  ++R  +K LF  +LL   ++H         D   I
Sbjct: 78  IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R   VL RIA ++   +                       L  WH  +   ++V  L +L
Sbjct: 137 RYASVLGRIAFAWFFAA-----------------------LLVWHTSLRTQIIVA-LGIL 172

Query: 200 YGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GINHMYH 255
            G                 YG +        G    L+P  +   Y+D  +L G+++   
Sbjct: 173 LG-----------------YGAMQLWLPFPGGQAGVLSPTESINAYVDSILLPGVSYQGR 215

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-- 313
            P                              +PEGLLS++ +I++ + GV  GH I+  
Sbjct: 216 TP------------------------------DPEGLLSTIPAIVNALTGVFVGHFIVKS 245

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H KG  A++      G  LL FG  L     IP+NK+L+T S+V VTSG + ++ +  YA
Sbjct: 246 HPKGEWAKVGLLAAAGGILLAFGWLLDLV--IPVNKELWTSSFVLVTSGWSMILLAVFYA 303

Query: 374 LVDC 377
           LVD 
Sbjct: 304 LVDV 307


>gi|291516094|emb|CBK65304.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
          Length = 331

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 84/331 (25%), Positives = 139/331 (41%), Gaps = 84/331 (25%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVK------KVIFRTLKLLF 114
           ++ H  WNG    D + P FLFI GV+   +L +   RA  +       KVI R + L+ 
Sbjct: 8   QMGHVSWNGLTQHDTIFPLFLFIAGVSFPFSLSK--QRASGISERRILFKVIRRGMTLIV 65

Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
            G++  G F           D   +R+  VL RI L+++  SL+ ++ K V+ +      
Sbjct: 66  LGMIYNGLFRF---------DFASLRVASVLGRIGLAWMFASLLYMYCK-VRTRA----- 110

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                      + AA VL+ Y  L+Y    PD        D  D              L+
Sbjct: 111 -----------VFAAVVLIGYSLLMYLVVAPD------APDGTD-------------PLS 140

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
              N  G++DR+ L        P    + AC                     F+PEGLLS
Sbjct: 141 VAGNIAGWVDRQWL--------PG---TFAC-------------------GSFDPEGLLS 170

Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA-IPLNKQLYT 353
           ++ +I+S + G+  G  ++  +  L+  +  + M  A +   +     N  IP+NK+L++
Sbjct: 171 TLPAIVSALFGMFTGEFLLRKRSSLSGEQIALCMALAAVAITVIGIIWNCWIPINKKLWS 230

Query: 354 LSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
            S+ CV +G +  +F+  Y L+D   +  WT
Sbjct: 231 SSFTCVVTGYSLGLFALFYYLIDVRGWKRWT 261


>gi|329851309|ref|ZP_08266066.1| hypothetical protein ABI_41500 [Asticcacaulis biprosthecum C19]
 gi|328840155|gb|EGF89727.1| hypothetical protein ABI_41500 [Asticcacaulis biprosthecum C19]
          Length = 384

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 96/355 (27%), Positives = 149/355 (41%), Gaps = 75/355 (21%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGV 86
           +  R  +LDI RGL++  M+L +   G W E    + HA W G    D V P FLF +GV
Sbjct: 7   QGNRWLALDILRGLSIIFMLL-NLNPGSWSEQYGWVLHAKWEGATFIDMVAPVFLFCIGV 65

Query: 87  AIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           AI L+L+R  +  ++     K ++ R   L+  G+ L               D   +R+ 
Sbjct: 66  AIPLSLRRRIEAGESNGQLAKHILNRAGILVLLGLFLNA---------YPAFDWAHMRIP 116

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC-WHWLMAACVLVVYLALLYG 201
           GVLQRI + Y  V+L  +FT   +           FRL     W+    VL+ + ALL  
Sbjct: 117 GVLQRIGVCYGAVALFVLFTARREGG---------FRLNAKAGWIAWTFVLLSWTALLMF 167

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
             VP             +G           + +P  +   Y+DR VL  +HM+  P W  
Sbjct: 168 VPVP------------GFGA---------PRFDPVGSWPAYVDRLVLTTDHMF--PWW-- 202

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                   P +G             F+P+GLLS+     + + G   GH      G  A 
Sbjct: 203 --------PVDG----------KVVFDPDGLLSTWPVCANVLFGALVGHA--RLTGITAP 242

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           + + +  G  L+   + LH T  IP+ K ++T ++   T G + +   A+  LV+
Sbjct: 243 ILKMLVAGGLLMAAAVGLHTT--IPIIKHIWTATFALFTIGFSLVSLGALTLLVE 295


>gi|388456506|ref|ZP_10138801.1| hypothetical protein FdumT_08017 [Fluoribacter dumoffii Tex-KL]
          Length = 372

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 52/138 (37%), Positives = 82/138 (59%), Gaps = 13/138 (9%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +T+R+ SLD+FRGL +ALM+LV+  G   ++  + H  WNGC+LAD V P FLFIVG+  
Sbjct: 7   ETKRILSLDVFRGLTMALMVLVNSLGSRENYKILMHVEWNGCSLADLVFPAFLFIVGITT 66

Query: 89  ALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            ++L+R      +A   + ++ RTL L    +      +  P +    +D+  IR+ G+L
Sbjct: 67  VISLQRHLNDESKAQLYRSILTRTLLL----MFFGLFLNIFPKQ----IDLSTIRIYGIL 118

Query: 146 QRIALSYLLVSLVEIFTK 163
           QRIA  YL+ S++ + T 
Sbjct: 119 QRIAWCYLICSILYLHTS 136



 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 54/99 (54%), Gaps = 4/99 (4%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG-FALLIFGLTLHFTNAI 345
           F+PEG LS++ +I +T+ G+  G  ++     L +    +T+G   L + G   H+    
Sbjct: 193 FDPEGFLSTIPAIATTLSGLLVGQYLLTPGSKLKKSFTLITIGILCLFVAGFWNHY---F 249

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           P+NK L+T ++V  +SG + +VF   + ++D   +  W+
Sbjct: 250 PINKNLWTSTFVLWSSGFSLIVFGLCFFVIDVLGYTKWS 288


>gi|117921549|ref|YP_870741.1| hypothetical protein Shewana3_3111 [Shewanella sp. ANA-3]
 gi|117613881|gb|ABK49335.1| conserved hypothetical protein [Shewanella sp. ANA-3]
          Length = 395

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 99/393 (25%), Positives = 155/393 (39%), Gaps = 98/393 (24%)

Query: 7   ETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHA 55
            TT    +  +  +  +   K      RL SLD  RG  +           AL++L   A
Sbjct: 2   STTAPESITNTGVNAQEAAAKKRQSKPRLMSLDALRGFDMFWILGGEALFGALLMLTGWA 61

Query: 56  GGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKL 112
           G  W   ++ H+ WNG    D + P F+F+ GVA+ L+ KR+       +  ++R  +K 
Sbjct: 62  GWQWGDTQMHHSEWNGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPMQERMPVYRHGIKR 121

Query: 113 LFWGILLQGGFSH-----APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           LF  +LL   ++H     AP       D   +R   VL RIA ++   +L+         
Sbjct: 122 LFLLLLLGILYNHGWGTGAP------ADPEKVRYASVLGRIAFAWFFAALL--------- 166

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
                          WH  +   VLV    L+    V  W                    
Sbjct: 167 --------------VWHTSLRTQVLVALGILVAYGAVQLW---------------LPFPG 197

Query: 228 GVRAKLNPPCNAVGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
           G    L+P  +   Y+D  +L G+++    P                             
Sbjct: 198 GQAGVLSPTESINAYVDSLLLPGVSYQGRTP----------------------------- 228

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNA 344
            +PEG+LS++ ++++ + GV  GH I+  H KG  A++      G   L  G  L     
Sbjct: 229 -DPEGVLSTLPAVVNALAGVFVGHFIVKSHPKGEWAKVGLLSVAGGVCLALGWLLG--GV 285

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           IP+NK+L+T S+V VTSG + L+ +  YALVD 
Sbjct: 286 IPVNKELWTSSFVLVTSGWSMLLLALFYALVDV 318


>gi|393786264|ref|ZP_10374400.1| hypothetical protein HMPREF1068_00680 [Bacteroides nordii
           CL02T12C05]
 gi|392659893|gb|EIY53510.1| hypothetical protein HMPREF1068_00680 [Bacteroides nordii
           CL02T12C05]
          Length = 387

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 83/372 (22%), Positives = 155/372 (41%), Gaps = 93/372 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           ++RL +LDI RG+ +A MI+V++ G     +  + HA WNG    D V PFF+FI+G++ 
Sbjct: 6   SKRLLALDILRGITIAGMIMVNNPGSWSYVYAPLGHAKWNGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRT-------LKLLFWGILLQGGFSHAPDELTY----GVD 135
            ++L++        A  K++ RT       L + ++ +  +   S + +E+++    G  
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLSGEEISFLSRLGQS 125

Query: 136 VRM---IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
           V     IR+ GV+QR+AL Y   +++ +  K                             
Sbjct: 126 VWTFDHIRILGVMQRLALCYGATAIIALTMKH---------------------------- 157

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
             Y+  L  T +  +   +I  +  +Y                  N +  +DR VLG  H
Sbjct: 158 -KYIPYLIVTLLAGYFILLITGNGFEYND---------------TNILSVVDRAVLGEAH 201

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           MY                                 +PEGLLS++ +I   +IG   G ++
Sbjct: 202 MYKDNG----------------------------IDPEGLLSTIPAIAHVLIGFCVGKLL 233

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +  K    +L++   +G  L   G  L +    P+NK++++ ++  VT G  +   + + 
Sbjct: 234 MEVKDINEKLERLFLIGTILTFLGFLLSY--GCPINKKIWSPTFAIVTCGLGSSFLALLI 291

Query: 373 ALVDCNIFGPWT 384
            ++D   +  W+
Sbjct: 292 WIIDVKGYKSWS 303


>gi|336316712|ref|ZP_08571601.1| hypothetical protein Rhein_3024 [Rheinheimera sp. A13L]
 gi|335878877|gb|EGM76787.1| hypothetical protein Rhein_3024 [Rheinheimera sp. A13L]
          Length = 363

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 55/138 (39%), Positives = 79/138 (57%), Gaps = 13/138 (9%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
           + + R  +LD  RGLA+ALMILV+  G  W  +     HAPW+G   AD V P FLF+VG
Sbjct: 1   MSSPRFYALDALRGLAIALMILVNTPG-SWQHVYTPLLHAPWDGFTFADIVFPTFLFVVG 59

Query: 86  VAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
            A+  +LK       ++ +V  R LKL+  G+LL        + + + VD+  +RL GVL
Sbjct: 60  AAMFYSLKTAVLSRQSLWRVSSRALKLIGIGVLL--------NYVPFTVDLAELRLPGVL 111

Query: 146 QRIALSYLLVSLVEIFTK 163
           QRI L+Y L +L+ +  K
Sbjct: 112 QRIGLAYWLAALLILTVK 129


>gi|373948546|ref|ZP_09608507.1| Protein of unknown function DUF2261, transmembrane [Shewanella
           baltica OS183]
 gi|386325609|ref|YP_006021726.1| hypothetical protein [Shewanella baltica BA175]
 gi|333819754|gb|AEG12420.1| Protein of unknown function DUF2261, transmembrane [Shewanella
           baltica BA175]
 gi|373885146|gb|EHQ14038.1| Protein of unknown function DUF2261, transmembrane [Shewanella
           baltica OS183]
          Length = 384

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 95/364 (26%), Positives = 150/364 (41%), Gaps = 94/364 (25%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G    D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYDLIFPLF 77

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           +F+ GVA+ L+ KR+     + +  ++R  +K LF  +LL   ++H         D   I
Sbjct: 78  IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R   VL RIA ++   +L+                        WH  +   ++V  L +L
Sbjct: 137 RYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQIIVA-LGIL 172

Query: 200 YGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GINHMYH 255
            G                 YG +        G    L+P  +   Y+D  +L G+++   
Sbjct: 173 LG-----------------YGAIQLWLPFPGGQAGVLSPTESINAYVDSILLPGVSYQGR 215

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-- 313
            P                              +PEGLLS++ +I++ + GV  GH I+  
Sbjct: 216 TP------------------------------DPEGLLSTIPAIVNALAGVFVGHFIVKS 245

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H KG  A++      G   L+ G  L     IP+NK+L+T S+V VTSG + ++ +  YA
Sbjct: 246 HPKGEWAKVGVLAAAGGIFLVLGWLLDLV--IPVNKELWTSSFVLVTSGWSMILLAVFYA 303

Query: 374 LVDC 377
           LVD 
Sbjct: 304 LVDV 307


>gi|440747989|ref|ZP_20927244.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
 gi|436483731|gb|ELP39771.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
          Length = 382

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 52/134 (38%), Positives = 77/134 (57%), Gaps = 9/134 (6%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
           +R  +LD+ RGL +ALMI+V+  G DW  +     HA W+G  + D V P FLF+VG A+
Sbjct: 13  ERYLALDVLRGLTIALMIVVNTPG-DWSNVFSPLLHADWHGFTITDLVFPTFLFVVGNAM 71

Query: 89  ALALKRIPDRAD-AVKKVIFRTLKLLF---WGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           + ++K++   +  A  K +F+   L+F   WG+     F      +   +D   +RL GV
Sbjct: 72  SFSMKKMEKMSQGAFLKKVFKRAALIFLIGWGLNAFPFFETNETGVVSMIDWSAVRLLGV 131

Query: 145 LQRIALSYLLVSLV 158
           LQRIAL YL+ SLV
Sbjct: 132 LQRIALCYLIASLV 145



 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 53/99 (53%), Gaps = 2/99 (2%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEG+LS+  S+++ I G   G  I     +   +K  +  G  L+  G ++ +  A 
Sbjct: 206 PFDPEGVLSTWPSVVNVIAGFLAGKFIQQIGNNKRTVKALLLAG--LIAVGFSVIWELAF 263

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           P+NK+++T SYV +T G   +V   +  +++   F  WT
Sbjct: 264 PINKKIWTSSYVLLTVGLDLIVLGFLVLIIEVQKFNKWT 302


>gi|423223641|ref|ZP_17210110.1| hypothetical protein HMPREF1062_02296 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638266|gb|EIY32113.1| hypothetical protein HMPREF1062_02296 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 395

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 93/386 (24%), Positives = 161/386 (41%), Gaps = 102/386 (26%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMP 78
           + Q K++   +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V P
Sbjct: 2   NSQTKTN---KRILALDILRGVTIAGMIMVNNPG-TWGHIYAPLRHAEWNGLTPTDLVFP 57

Query: 79  FFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDE 129
           FF+FI+G++  ++LK+   +    A  K++ RT+ +   G+ + G FS        + + 
Sbjct: 58  FFMFIMGISTYISLKKYNFKFSHAAALKILKRTIIIFLIGLAI-GWFSRFCYYWAGSHEG 116

Query: 130 LTYGVDV-------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
           L++G  +         IR+ GV+QR+AL Y   +++ +  K                   
Sbjct: 117 LSFGEQLWASVWTFDRIRILGVMQRLALCYGATAIIALTMKHRHIP-------------- 162

Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
               + A +LV Y  LL                           CG     N   N +  
Sbjct: 163 ---YLIATLLVGYFILL--------------------------MCGNGFAYNET-NILSI 192

Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
           +DR +L   HMY                      KD         +PEGLLS++ SI   
Sbjct: 193 VDRAILTPAHMY----------------------KD------NGIDPEGLLSTIPSIAHV 224

Query: 303 IIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYV 357
           ++G   G +++ +    +R     +    L + G  L FT  +     P+NK++++ ++V
Sbjct: 225 LLGFCVGRMMLDSNKAESREALLNSHLIKLFLVGAILTFTGFLLSYGCPINKKIWSPTFV 284

Query: 358 CVTSGAAALVFSAIYALVDCNIFGPW 383
             T G A+   + +  ++D   +  W
Sbjct: 285 LTTCGLASSFLALLIWIIDVKGYKKW 310


>gi|325954677|ref|YP_004238337.1| hypothetical protein [Weeksella virosa DSM 16922]
 gi|323437295|gb|ADX67759.1| hypothetical protein Weevi_1050 [Weeksella virosa DSM 16922]
          Length = 402

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 95/363 (26%), Positives = 151/363 (41%), Gaps = 101/363 (27%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +KT R  SLD+FRG  +ALMILV++ G     +  + HA W+GC   D V PFFLF VG 
Sbjct: 1   MKTTRYYSLDVFRGATIALMILVNNPGSWSYMFSPLQHASWHGCTPTDLVFPFFLFAVGN 60

Query: 87  AIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQGG--FSHAPDELTYGV----DVRM 138
           A++  +  +  +A  V  KK+I RT+ +   G+ +          +EL +      +   
Sbjct: 61  AMSFGMSHLKLQASNVFWKKIIKRTILIFAIGLFINWWPFLKWENNELVFRAWRESEENG 120

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           +R+ GVLQRIA++    S +  +      +D+ V + SI             +L+ Y AL
Sbjct: 121 VRIMGVLQRIAIANFFASTLAYYY-----RDRVVLKISI------------LILLFYWAL 163

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV----LGINHMY 254
            +                           GV      P +  G+I  K+    +G+ HMY
Sbjct: 164 TF------------------------FLGGVD-----PYSLEGFIGTKIDVHLIGLAHMY 194

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                            EG            PF+PEGL S++ +I   ++G   G V I 
Sbjct: 195 KG---------------EG-----------VPFDPEGLYSTIPAISQILLGYLVG-VYIQ 227

Query: 315 TKGHLARLKQWVTMG-----------FALLIFGLTLHFTNAI--PLNKQLYTLSYVCVTS 361
            +G +    + +              F L  F L + +   +  P NK++++ SYV  T+
Sbjct: 228 KQGDIRWFSRSLPASNLPIYRMLSGLFVLGAFALIMGYIWQLDFPYNKKIWSSSYVIHTT 287

Query: 362 GAA 364
             A
Sbjct: 288 ALA 290


>gi|380693406|ref|ZP_09858265.1| hypothetical protein BfaeM_05407 [Bacteroides faecis MAJ27]
          Length = 371

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 85/363 (23%), Positives = 153/363 (42%), Gaps = 89/363 (24%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             +RL +LD+ RG+ +A MILV+  G     +  + HA W G    D V PFF+FI+G++
Sbjct: 5   SNKRLLALDVMRGITIAGMILVNTPGSWQHTYAPLKHAEWIGLTPTDLVFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGI----LLQGGFSHAPDELTYGVDVRMIRL 141
             ++L++        A  K++ RT+ +   GI    L    F H P        +  IR+
Sbjct: 65  TYISLRKYDFTFSIPAGLKILKRTVIIFLIGIGISWLSILCFQHDP------FPIDQIRI 118

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
            GV+QR+AL Y + +L  +  K                 + +   +   +L+ Y  +L  
Sbjct: 119 LGVMQRLALGYGITALAALLIK-----------------HKYIPYLITVLLIGYFMIL-- 159

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
                         +   G V++ T           N +  +DR VLG  H+Y       
Sbjct: 160 --------------AVGNGYVYDET-----------NVLSIVDRAVLGQAHIYG------ 188

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                                  A  +PEGLLS++S++   +IG   G +++  K    +
Sbjct: 189 ----------------------GAILDPEGLLSTISAVAHVMIGFCAGKLLMEVKDIHEK 226

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
           L++   +G  L   G  L + +  P+ K++++ S+V +T G  +   + +  ++D   + 
Sbjct: 227 LERLFLIGTILTFAGFLLSYGS--PICKKIWSPSFVLITCGMGSSFLALLVWIIDIKGYK 284

Query: 382 PWT 384
            W+
Sbjct: 285 GWS 287


>gi|325103749|ref|YP_004273403.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324972597|gb|ADY51581.1| hypothetical protein Pedsa_1010 [Pedobacter saltans DSM 12145]
          Length = 374

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 50/131 (38%), Positives = 76/131 (58%), Gaps = 6/131 (4%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           R  SLD+ RG  VA MI+V+  G     +  + HAPW+G  + D V P FLF+VG A++ 
Sbjct: 11  RYLSLDVLRGATVAFMIIVNTPGSWSYVYAPLKHAPWHGFTVTDLVFPTFLFVVGNAMSF 70

Query: 91  ALKRIPDRADA--VKKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLCGVLQR 147
            + ++ ++ ++  +KKV  RTLK+   G+ L    F    D++    D   IR+ GVLQR
Sbjct: 71  GMGKLKEQGNSAFLKKVFSRTLKIFLIGLFLNMFPFVKWVDDVLVMKDFTEIRIWGVLQR 130

Query: 148 IALSYLLVSLV 158
           IA+ Y + SL+
Sbjct: 131 IAVCYCIASLL 141



 Score = 41.6 bits (96), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 45/77 (58%), Gaps = 2/77 (2%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEG+LS++ ++++ I+G +F  + I  KG+  +   +  +G  +++      +    
Sbjct: 198 PFDPEGVLSTIPAVVNVILG-YFAGLFIQKKGN-NKSTAFNLIGTGVILLLAASAWNLVF 255

Query: 346 PLNKQLYTLSYVCVTSG 362
           P+NK ++T SYV  T G
Sbjct: 256 PINKPIWTSSYVLYTVG 272


>gi|16552925|dbj|BAB71412.1| unnamed protein product [Homo sapiens]
          Length = 367

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 118/257 (45%), Gaps = 42/257 (16%)

Query: 139 IRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVV 194
           +R+ GVLQR+ ++Y +V+++E+ F K V +   S       R     W  WL+   +  +
Sbjct: 75  VRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGL 134

Query: 195 YLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           +L L +   VP      +      D+GK  N T G          A GYIDR +LG +H+
Sbjct: 135 WLGLTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHL 184

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y HP    S A    +                 ++PEG+L +++SI+   +GV  G +++
Sbjct: 185 YQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILL 226

Query: 314 H----TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALV 367
           +    TK  L R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +
Sbjct: 227 YYKARTKDILIRFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFI 285

Query: 368 FSAIYALVDCNIFGPWT 384
              +Y +VD    G WT
Sbjct: 286 LLVLYPVVDVK--GLWT 300


>gi|336312505|ref|ZP_08567454.1| N-acetylglucosamine transporter, NagX [Shewanella sp. HN-41]
 gi|335864011|gb|EGM69129.1| N-acetylglucosamine transporter, NagX [Shewanella sp. HN-41]
          Length = 384

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 97/379 (25%), Positives = 150/379 (39%), Gaps = 102/379 (26%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNG 69
           ++     +   RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G
Sbjct: 7   NETATIKVTKPRLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHG 66

Query: 70  CNLADFVMPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGF- 123
               D + P F+F+ GVA+ L+ KR+      +R    +  I R L LL  GIL   G+ 
Sbjct: 67  FRFYDLIFPLFIFLSGVALGLSPKRLDKLPLSERLPVYRHGIKRLLLLLLLGILYNHGWG 126

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
           + AP       D   +R   VL RIA ++   +                       L  W
Sbjct: 127 TGAP------ADPEKVRYASVLGRIAFAWFFAA-----------------------LLVW 157

Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAV 240
           H  +   ++V  L +L G                 YG +      + G    L+P  +  
Sbjct: 158 HTSLRTQIIVA-LGILLG-----------------YGAIQLWLPFSGGQAGVLSPTESIN 199

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
            YID  +L                                S+     +PEGLLS++ +++
Sbjct: 200 AYIDSILL-----------------------------PGVSYQGRTLDPEGLLSTIPAVV 230

Query: 301 STIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
           + + GV  GH I+  H +G  A++      G   L  G  L     IP+NK+L+T S+V 
Sbjct: 231 NALAGVFVGHFIVKSHPQGEWAKVGLLAAAGGVCLALGWLLD--GVIPVNKELWTSSFVL 288

Query: 359 VTSGAAALVFSAIYALVDC 377
           VTSG + ++ +  YALVD 
Sbjct: 289 VTSGWSMVLLAVFYALVDV 307


>gi|386392672|ref|ZP_10077453.1| hypothetical protein DesU5LDRAFT_2079 [Desulfovibrio sp. U5L]
 gi|385733550|gb|EIG53748.1| hypothetical protein DesU5LDRAFT_2079 [Desulfovibrio sp. U5L]
          Length = 370

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 94/361 (26%), Positives = 149/361 (41%), Gaps = 87/361 (24%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           +  RLAS+D  RGLAVA MIL ++ G     + E+ HA W+G   ADF+ P FLF+VGV 
Sbjct: 5   RKTRLASVDGLRGLAVAGMILANNPGERGHVYRELQHAVWDGWTAADFIFPLFLFLVGVC 64

Query: 88  IALALKRIPDRADAV----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           +ALA+ R   R        ++V+ R + L   G+L         +     V    +R+ G
Sbjct: 65  VALAVDRDTVRTGEAHRFWRRVLTRAIILFLLGLL---------ENAYLRVSFENLRIPG 115

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           VLQRIA+ YL  + + +       +  + G  S+  +           L+ Y  LL G  
Sbjct: 116 VLQRIAVVYLATAWLHV-------RCGNRGIVSVILV----------TLLGYWLLLAGVP 158

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           VP      +++D                      N  G+ID+ +LG NH++ +       
Sbjct: 159 VPGLGHPSLSRD---------------------VNWEGWIDQLLLG-NHIWKY------- 189

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                                  ++PEG+LS+  +I   ++GV  G  +      +    
Sbjct: 190 --------------------ETTWDPEGVLSTFPAIALGLVGVLCGRWLRLGGLGVG--- 226

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
               +   L +  L L +    P+NK L T S+V    GA  ++ +  Y L+D      W
Sbjct: 227 --RGLAVGLAMLLLGLLWNAWFPVNKSLCTSSFVLFVGGAGVMMLAGCYWLLDMRGNAAW 284

Query: 384 T 384
            
Sbjct: 285 A 285


>gi|429738942|ref|ZP_19272716.1| hypothetical protein HMPREF9151_01157 [Prevotella saccharolytica
           F0055]
 gi|429158431|gb|EKY00988.1| hypothetical protein HMPREF9151_01157 [Prevotella saccharolytica
           F0055]
          Length = 400

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 95/386 (24%), Positives = 163/386 (42%), Gaps = 108/386 (27%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           K+ R+ ++DI RG+ +A MILV++ G +W  I     HA WNG    D V PFF+F++G+
Sbjct: 7   KSSRILAIDILRGITIAGMILVNNPG-NWGRIFAPFEHAEWNGMTPTDLVFPFFMFVMGM 65

Query: 87  AIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH------AP-DELTYGVDV- 136
            I +A+++     +   V K+  R + +   G+ + G F+       +P +E ++G  + 
Sbjct: 66  CIYIAMRKFDFTCNKSTVYKITKRMVLIYLVGLGI-GWFAKFCFRWASPLEEASFGEQLW 124

Query: 137 ------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
                   IRL GVL R+A+ Y + +L+ +    V+ K+                 +   
Sbjct: 125 YMVWPFDSIRLTGVLARLAICYGITALLAV---TVKHKNLP--------------YIIVT 167

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           +LV Y  +L                 A  G  ++ T           N +   DR VL  
Sbjct: 168 LLVGYFIIL----------------MAGNGFAYDET-----------NILSIADRAVLTD 200

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            HMYH                                +PEGLLS++ SI  T++G   G 
Sbjct: 201 VHMYHDNG----------------------------IDPEGLLSTLPSIAHTLLGFMVGS 232

Query: 311 VIIHTKGH------------LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
           ++  T               L+++     +G  L+  G  L +    PLNK++++ +YV 
Sbjct: 233 LLFKTTDEHSEHTDVRTGIILSKVVPLFVVGTILIFSGFLLSY--GCPLNKKVWSPTYVL 290

Query: 359 VTSGAAALVFSAIYALVDCNIFGPWT 384
           VT G A+ + + +  L+D   +  W+
Sbjct: 291 VTCGLASTLLALLIWLIDVKGYRRWS 316


>gi|329963071|ref|ZP_08300851.1| hypothetical protein HMPREF9446_02444 [Bacteroides fluxus YIT
           12057]
 gi|328529112|gb|EGF56042.1| hypothetical protein HMPREF9446_02444 [Bacteroides fluxus YIT
           12057]
          Length = 381

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 95/366 (25%), Positives = 145/366 (39%), Gaps = 96/366 (26%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHA---GGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             QRL +LD+ RGL +A MILV+        +  + HA WNG    D + PFFLF++GV+
Sbjct: 3   NNQRLVALDVMRGLTIAGMILVNTPETWSYVYAPLQHARWNGLTPTDVIFPFFLFMMGVS 62

Query: 88  IALALKRIPDRADA--VKKVIFRTLKLLFWGILL---------------QGGFSHAPDEL 130
           + ++LK+      +  + K+I R+L L   G  +               Q GF   P + 
Sbjct: 63  MYISLKKCSFHLSSHLLMKIIRRSLILFLIGTAIYALATFLGTLRDACRQPGFEGNPWKE 122

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
            +   +   R+ GVLQR+ + Y                    G  SI  L C H      
Sbjct: 123 AFA-SLPGTRIPGVLQRLGVCY--------------------GIGSIIVLTCRH------ 155

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
                       Y+P     I+    A Y   F +       ++ P N +  +DR + G 
Sbjct: 156 -----------RYIPHLAGGIL----AGY---FLILLFGNGFVHSPENILSVVDRTLFGD 197

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           N +              D   +                PEG LS++ SI   +IG   G 
Sbjct: 198 NMI-------------NDGGID----------------PEGALSTLPSIAQVLIGFCIGK 228

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           + I T     +L +    G  +LI G    F+   PLNK+++T S+V VT G A L+   
Sbjct: 229 ICIETPDMREKLNKIFLYGSLMLIVGWL--FSYGCPLNKRVWTPSFVLVTCGFACLLLGI 286

Query: 371 IYALVD 376
           +   +D
Sbjct: 287 LIWYID 292


>gi|255093765|ref|ZP_05323243.1| hypothetical protein CdifC_14056 [Clostridium difficile CIP 107932]
          Length = 505

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/154 (32%), Positives = 81/154 (52%), Gaps = 16/154 (10%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFV 76
            +S+    S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF 
Sbjct: 125 KISNNVVDSKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFA 184

Query: 77  MPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
            PFF+  +GV I ++    LK        +  +  R++ L+ +G  L   +   P     
Sbjct: 185 FPFFVISLGVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP----- 237

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ 166
             D+  +R+ GVLQR+ L Y + SLV +  K + 
Sbjct: 238 --DLNSVRILGVLQRMGLVYFVTSLVYLLLKKLN 269



 Score = 46.2 bits (108), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 323 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 380

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 381 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTP 421


>gi|255315516|ref|ZP_05357099.1| hypothetical protein CdifQCD-7_14229, partial [Clostridium
           difficile QCD-76w55]
          Length = 381

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/154 (32%), Positives = 81/154 (52%), Gaps = 16/154 (10%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFV 76
            +S+    S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF 
Sbjct: 1   KISNNVVDSKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFA 60

Query: 77  MPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
            PFF+  +GV I ++    LK        +  +  R++ L+ +G  L   +   P     
Sbjct: 61  FPFFVISLGVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP----- 113

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ 166
             D+  +R+ GVLQR+ L Y + SLV +  K + 
Sbjct: 114 --DLNSVRILGVLQRMGLVYFVTSLVYLLLKKLN 145



 Score = 45.8 bits (107), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 199 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 256

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 257 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTP 297


>gi|359686994|ref|ZP_09256995.1| hypothetical protein LlicsVM_01380 [Leptospira licerasiae serovar
           Varillal str. MMD0835]
 gi|418756670|ref|ZP_13312858.1| PF07786 family protein [Leptospira licerasiae serovar Varillal str.
           VAR 010]
 gi|384116341|gb|EIE02598.1| PF07786 family protein [Leptospira licerasiae serovar Varillal str.
           VAR 010]
          Length = 391

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 94/358 (26%), Positives = 146/358 (40%), Gaps = 92/358 (25%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG----DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           R+ S+D+ RGL VA MILV++ G      WP + HA W+GC   D V PFFLF VG +I 
Sbjct: 25  RILSIDLLRGLTVAGMILVNNPGTWSNMYWP-LKHAKWDGCTPTDLVFPFFLFAVGASIP 83

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQR 147
            +   + +      K++ R++ L+F G+ L   G +S +            +R  GVLQR
Sbjct: 84  FS---VSNGIQEFPKILKRSVILIFLGLFLNFFGEWSFSN-----------LRFPGVLQR 129

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
           I  +Y   ++        ++K+          L   +W +   +         G   P  
Sbjct: 130 IGFAYFFSAIA------YREKNLKFRIILFLTLLISYWYLQEFIPPP------GAAEPS- 176

Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                 K+  D+G                     ++DR+V G  H+     W+  K    
Sbjct: 177 -----MKEGKDWG--------------------AWLDREVFGQAHL-----WKFGKV--- 203

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-IHTKGHLARLK--- 323
                              ++PEGLL+S +SI S   G+  G  + +H +   + L    
Sbjct: 204 -------------------WDPEGLLTSFTSIASVFCGIFAGEFLKVHLEKKESPLSISG 244

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
           + V   FA+L+ G         P+NK L+T +Y   T+G A L  S    L   N FG
Sbjct: 245 KIVLGAFAVLVVGGVWGIY--YPINKSLWTGTYSLWTAGWALLAVSLFLILEKYNRFG 300


>gi|389809458|ref|ZP_10205319.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter thiooxydans LCS2]
 gi|388441722|gb|EIL97972.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter thiooxydans LCS2]
          Length = 353

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 55/139 (39%), Positives = 78/139 (56%), Gaps = 16/139 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RLASLD  RG  VA M+LV+  G DW  +    +HA WNGC   D V PFFLF+VGV++
Sbjct: 2   KRLASLDALRGCTVAAMLLVNDPG-DWSHVYWPLAHAAWNGCTPTDLVFPFFLFVVGVSV 60

Query: 89  ALA-LKRIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           ALA L R+   A      +  ++R L++L  G+ +            + +    +R  GV
Sbjct: 61  ALAILPRLEQGASPSALTRAAMWRALRILALGVAIN-------LLAAWWLPQAHLRFPGV 113

Query: 145 LQRIALSYLLVSLVEIFTK 163
           LQRIAL +  V+L  ++TK
Sbjct: 114 LQRIALCFAGVALFAVYTK 132



 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 46/96 (47%), Gaps = 7/96 (7%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLS++ S+ ST++G+  G         L R K    +        L   ++  +P 
Sbjct: 188 DPEGLLSTLPSLASTLLGLRAGC-------WLRREKLRTLLLAGAACLLLGALWSWWLPF 240

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
           NK L+T S+V  TSG   L   A + L+D   +  W
Sbjct: 241 NKNLWTPSFVLWTSGWTTLALLAFHMLIDRRGWPAW 276


>gi|389793498|ref|ZP_10196662.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter fulvus Jip2]
 gi|388434056|gb|EIL91012.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter fulvus Jip2]
          Length = 354

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 94/361 (26%), Positives = 147/361 (40%), Gaps = 94/361 (26%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           ++RL SLD  RG  VA M+LV+  G DW  I     HAPW+GC   D V PFFLF+VGV+
Sbjct: 2   SKRLPSLDALRGCTVAAMLLVNDPG-DWGHIYAPLEHAPWHGCTPTDLVFPFFLFVVGVS 60

Query: 88  IALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
            ALAL    ++  A    VK  ++R L++L  G+ +            + +    +R  G
Sbjct: 61  SALALLPRLEQGVAPGALVKAALWRALRILALGVAIN-------LLAAWLLPHAHLRFPG 113

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY-GT 202
           VLQRI + +  V+L  + T+                     W+    +L+ Y  LL  G 
Sbjct: 114 VLQRIGICFAAVALFAVHTRPRTQ-----------------WIAIGGILLGYWGLLLAGG 156

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            V  W   +   DS  +G                     Y+     G  H          
Sbjct: 157 SVAPWVNIVSRTDSVVFGHFV------------------YLIDPATGRGH---------- 188

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                                    +PEGLL+++ S+ +T++G+  G  +   +     +
Sbjct: 189 -------------------------DPEGLLATLPSLATTLLGLRVGCWLRRREIRRLLI 223

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGP 382
                    +L+  L   ++  +PLNK L+T S+V  T+G A L   A + L+D + +  
Sbjct: 224 AG-------VLLLLLGALWSLWLPLNKNLWTSSFVLWTAGWATLALLAFHWLIDRHGWPA 276

Query: 383 W 383
           W
Sbjct: 277 W 277


>gi|374310943|ref|YP_005057373.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358752953|gb|AEU36343.1| protein of unknown function DUF1624 [Granulicella mallensis
           MP5ACTX8]
          Length = 385

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/143 (36%), Positives = 79/143 (55%), Gaps = 18/143 (12%)

Query: 29  HLKTQRLASLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIV 84
            L ++R+ S+D+ RG  +A MILV+ A G+WP     + HA WNGC   D V P FLF+ 
Sbjct: 12  ELTSKRIPSVDVLRGFTLAAMILVN-AAGEWPHAYWPLKHAQWNGCTPTDLVFPTFLFLT 70

Query: 85  GVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           G ++  + +    R    +++   TLK    L F G+LL        + L Y   +  +R
Sbjct: 71  GTSLVFSFRSRLARGVGKRELFLHTLKRSVILFFIGVLL--------NALPY-FHIGTLR 121

Query: 141 LCGVLQRIALSYLLVSLVEIFTK 163
           + GVLQRIAL YL VS++ ++ +
Sbjct: 122 IYGVLQRIALCYLCVSVLYLWNR 144


>gi|160874301|ref|YP_001553617.1| hypothetical protein Sbal195_1181 [Shewanella baltica OS195]
 gi|378707545|ref|YP_005272439.1| hypothetical protein [Shewanella baltica OS678]
 gi|160859823|gb|ABX48357.1| conserved hypothetical protein [Shewanella baltica OS195]
 gi|315266534|gb|ADT93387.1| hypothetical protein Sbal678_1209 [Shewanella baltica OS678]
          Length = 384

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 100/369 (27%), Positives = 151/369 (40%), Gaps = 104/369 (28%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G +  D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDLIFPLF 77

Query: 81  LFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGF-SHAPDELTYGV 134
           +F+ GVA+ L+ KR+      +R    +  I R   LL  GIL   G+ + AP       
Sbjct: 78  IFLSGVALGLSPKRLDKLPMKERLPVYRHGIKRLFLLLLLGILYNHGWGTGAP------A 131

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           D   IR   VL RIA ++   +                       L  WH  +   ++V 
Sbjct: 132 DPEKIRYASVLGRIAFAWFFAA-----------------------LLVWHTSLRTQIIVA 168

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GI 250
            L +L G                 YG +        G  + L+P  +   Y+D  +L G+
Sbjct: 169 -LGILLG-----------------YGAMQLWLPFPGGQASVLSPTESINAYVDSILLPGV 210

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           ++    P                              +PEGLLS++ +I++ + GV  GH
Sbjct: 211 SYQGRTP------------------------------DPEGLLSTIPAIVNALAGVFVGH 240

Query: 311 VII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
            I+  H KG  A++      G   L FG  L     IP+NK+L+T S+V VTSG + ++ 
Sbjct: 241 FIVKSHPKGEWAKVGLLAAAGCVCLAFGWLLDLV--IPVNKELWTSSFVLVTSGWSMILL 298

Query: 369 SAIYALVDC 377
           +  YALVD 
Sbjct: 299 AVFYALVDV 307


>gi|322436067|ref|YP_004218279.1| hypothetical protein AciX9_2466 [Granulicella tundricola MP5ACTX9]
 gi|321163794|gb|ADW69499.1| hypothetical protein AciX9_2466 [Granulicella tundricola MP5ACTX9]
          Length = 391

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 96/355 (27%), Positives = 149/355 (41%), Gaps = 83/355 (23%)

Query: 34  RLASLDIFRGLAVALMILVDH---AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           R+ S+D+ RGL +ALMILV+    AG  +P++ HA WNG   AD V P FLF+ G ++  
Sbjct: 16  RVLSIDVLRGLTIALMILVNDPGDAGCVYPQLQHAEWNGYTAADLVFPNFLFLGGASLVF 75

Query: 91  ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
           +L+    R  DR +  + +  R + L+   ++L          +     +R IR+ GVL 
Sbjct: 76  SLQGRIERGADRWELARGLGRRGVNLIALKLVL---------AMLPSFRLRRIRIFGVLF 126

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           R A+  +   L+ + T ++      VG                       A+L G Y   
Sbjct: 127 RTAVCSVAGGLILLGTLEIPMLVGIVG-----------------------AMLTGYY--- 160

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                       +G++ N        L+P  N    +DRKV    H+ H           
Sbjct: 161 ------GALRISFGRM-NAPL-----LDPENNLAAALDRKV---AHLLH----------- 194

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH---TKGHLARLK 323
                 G L   A    +   +PEGLLSSV ++ +T++G     V+ H   T+G     K
Sbjct: 195 ------GELHTGA--LYNVTHDPEGLLSSVPAVGTTLLGAVAALVMRHPRLTQGQ----K 242

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
             V     +   G+   +  + P+NK L+T SYV V  G + L    IY  +D  
Sbjct: 243 VGVLAASGVGALGVGSVWGRSFPVNKNLWTSSYVLVAGGWSLLALGGIYWCLDVR 297


>gi|319900285|ref|YP_004160013.1| hypothetical protein Bache_0400 [Bacteroides helcogenes P 36-108]
 gi|319415316|gb|ADV42427.1| putative transmembrane protein [Bacteroides helcogenes P 36-108]
          Length = 396

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 92/381 (24%), Positives = 161/381 (42%), Gaps = 103/381 (27%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           ++R+ +LDI RG+ +A MI+V++ G +W  I     HA W G    D V PFF+FI+G++
Sbjct: 7   SKRILALDILRGITIAGMIMVNNPG-NWGHIYAPLEHAEWIGLTPTDLVFPFFMFIMGIS 65

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
             ++LK+        A  K++ RT  +   G+ + G F+        AP EL++G ++  
Sbjct: 66  TYISLKKYDFEFSRSAALKILKRTAIIFLIGLAI-GWFARLCYYWAAAPGELSFGENLWA 124

Query: 137 ------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
                 RM R+ GV+QR+AL Y   S++ +  K                     +L+A  
Sbjct: 125 SVWTFDRM-RILGVMQRLALCYGATSIIALTMKHRHIP----------------YLIAGL 167

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           ++  ++ L+ G                  G  +N T           N +  +DR VL  
Sbjct: 168 LISYFILLMCGN-----------------GFAYNET-----------NILSVVDRAVLTP 199

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            HMY                                 +PEGLLS++ SI   ++G   G 
Sbjct: 200 AHMYKDNG----------------------------IDPEGLLSTIPSIAHVLLGFCVGR 231

Query: 311 VII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVTSGA 363
           +++         R+    +    LL+ G  L F   +     P++K++++ ++V VT G 
Sbjct: 232 MMLGDDRNAKADRMAVLDSHLIKLLLTGAILTFAGFLLSYGCPISKKIWSPTFVLVTCGM 291

Query: 364 AALVFSAIYALVDCNIFGPWT 384
           A+   + +  +VD   +  W+
Sbjct: 292 ASSFLALLIWIVDVKGYRKWS 312


>gi|116327439|ref|YP_797159.1| hypothetical protein LBL_0655 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. L550]
 gi|116120183|gb|ABJ78226.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
           Hardjo-bovis str. L550]
          Length = 369

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 96/342 (28%), Positives = 141/342 (41%), Gaps = 96/342 (28%)

Query: 38  LDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALAL- 92
           +D+FRG+ V  MILV++ G  W  +     HA WNGC   D V PFFLF VG +I ++L 
Sbjct: 1   MDLFRGMTVVGMILVNNPG-SWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVGASIPISLY 59

Query: 93  -KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALS 151
            K   +R      +  R + L     +L G F +   E T+      +R+ GVLQRI   
Sbjct: 60  SKNGINRIRIWIGICIRGISL-----ILLGLFLNFFGEWTF----SELRIPGVLQRIGFV 110

Query: 152 YLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI 211
           Y +V+                   ++F ++         VLV  + +L    V  W  T 
Sbjct: 111 YWVVA-------------------TLFLVFP-----GKKVLVFLIPIL---LVHTWILTH 143

Query: 212 INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPF 271
           I                    L    +   +IDR + G  H+     W+ SK        
Sbjct: 144 IAPPGES-----------MVSLEQGKDIGAWIDRTIFGEKHL-----WKFSKT------- 180

Query: 272 EGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA 331
                          ++PEG LS ++SI +++ GV  G ++   +G   R K  V     
Sbjct: 181 ---------------WDPEGFLSGIASIATSLFGVICGFILFRREG---RGKNRV----- 217

Query: 332 LLIFGLTLHFT-------NAIPLNKQLYTLSYVCVTSGAAAL 366
           L IFGL   FT        ++P+NK L+T SY   T+G A L
Sbjct: 218 LSIFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGFAFL 259


>gi|395213375|ref|ZP_10400182.1| hypothetical protein O71_05742 [Pontibacter sp. BAB1700]
 gi|394456744|gb|EJF11001.1| hypothetical protein O71_05742 [Pontibacter sp. BAB1700]
          Length = 391

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 80/149 (53%), Gaps = 9/149 (6%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNL 72
           + P ++D         +R  SLD+ RGL +ALM++V++ G  W  I     HA W+G  +
Sbjct: 6   TAPPLTDAGLLRPQTYERYLSLDVLRGLTIALMVVVNNPG-SWGSIYAPFKHAAWHGFTV 64

Query: 73  ADFVMPFFLFIVGVAIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQ--GGFSHAPD 128
            D V P FLF+VG A++ ++++   + D+V  +KV+ RT  +   G+ L         P+
Sbjct: 65  TDLVFPSFLFVVGNAMSFSMRKFETQPDSVFLRKVLKRTALIFLIGLFLNLFPFVMRNPE 124

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
                 D   +R+ GVLQRIAL Y + SL
Sbjct: 125 GAIVMKDFTAVRIMGVLQRIALCYFIASL 153


>gi|427384458|ref|ZP_18880963.1| hypothetical protein HMPREF9447_01996 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727719|gb|EKU90578.1| hypothetical protein HMPREF9447_01996 [Bacteroides oleiciplenus YIT
           12058]
          Length = 395

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 93/384 (24%), Positives = 158/384 (41%), Gaps = 100/384 (26%)

Query: 27  KSHLKT-QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFL 81
            S +KT +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+
Sbjct: 2   SSQMKTNKRILALDILRGVTIAGMIMVNNPG-TWGHIYAPLRHAEWNGLTPTDLVFPFFM 60

Query: 82  FIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH------APDE-LTY 132
           FI+G++  ++LK+        A  K++ RT+ +   G+ + G FS        P E + +
Sbjct: 61  FIMGISTYISLKKYNFEFSHAAAMKILKRTIIIFLIGLAI-GWFSKFCYYWTNPSEGIGF 119

Query: 133 GVDV-------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
           G  +         IR+ GV+QR+AL Y   +++ +  K                      
Sbjct: 120 GAQLWESVWTFDRIRILGVMQRLALCYGATAIIALTMKHRHIP----------------- 162

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
            + A +L+ Y  LL                           CG     N   N +  +DR
Sbjct: 163 YLIATLLIGYFILL--------------------------ICGNGFAYN-ETNVLSIVDR 195

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            +L   HMY                      KD         +PEGLLS++ SI   ++G
Sbjct: 196 AILTPAHMY----------------------KD------NGIDPEGLLSTIPSIAHVLLG 227

Query: 306 VHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVT 360
              G +++      +R +   +    L + G  L F   +     P+NK++++ ++V  T
Sbjct: 228 FCVGRMMLDGNKAQSREELLNSHLIKLFLIGTILTFAGFLLSYGCPINKKIWSPTFVLTT 287

Query: 361 SGAAALVFSAIYALVDCNIFGPWT 384
            G A+   + +  ++D   +  W+
Sbjct: 288 CGLASSFLALLIWIIDVKGYKKWS 311


>gi|189464405|ref|ZP_03013190.1| hypothetical protein BACINT_00746 [Bacteroides intestinalis DSM
           17393]
 gi|189438195|gb|EDV07180.1| hypothetical protein BACINT_00746 [Bacteroides intestinalis DSM
           17393]
          Length = 395

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 91/386 (23%), Positives = 163/386 (42%), Gaps = 102/386 (26%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMP 78
           + Q K++   +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V P
Sbjct: 2   NSQTKTN---KRILALDILRGVTIAGMIMVNNPG-TWGHIYAPLRHAEWNGLTPTDLVFP 57

Query: 79  FFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDE 129
           FF+FI+G++  ++LK+        A  K++ RT+ +   G+ + G FS        + + 
Sbjct: 58  FFMFIMGISTYISLKKYNFEFSHAAAMKILKRTIIIFLIGLAI-GWFSRFCYYWAGSHEG 116

Query: 130 LTYGVDV-------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
           L++G  +         IR+ GV+QR+AL Y   +++ +  K                   
Sbjct: 117 LSFGEQLWASVWTFDRIRILGVMQRLALCYGATAIIALTMKHRHIP-------------- 162

Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
             +L+A  ++  ++ L+ G                  G V+N T           N +  
Sbjct: 163 --YLIATLLVGYFILLMCGN-----------------GFVYNET-----------NILSI 192

Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
           +DR +L   HMY                      KD         +PEGLLS++ SI   
Sbjct: 193 VDRAILTPAHMY----------------------KD------NGIDPEGLLSTIPSIAHV 224

Query: 303 IIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYV 357
           ++G   G +++ +     R     +    L + G  L F   +     P+NK++++ ++V
Sbjct: 225 LLGFCVGRMMLDSNKTENRETLLNSHLIKLFLIGTILTFAGFLLSYGCPINKKIWSPTFV 284

Query: 358 CVTSGAAALVFSAIYALVDCNIFGPW 383
             T G A+   + +  ++D   +  W
Sbjct: 285 LTTCGLASSFLALLIWIIDVKGYKKW 310


>gi|47213040|emb|CAF93449.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 297

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 77/255 (30%), Positives = 121/255 (47%), Gaps = 50/255 (19%)

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKD--------VQDKDQSVGRFSIFRLYCWHWLMAAC 190
           +R+ GVLQR+AL+YL+V+ +++            +QD   S G   +      +W    C
Sbjct: 6   LRIPGVLQRLALAYLVVACLDLLVARRFSCVFCVLQDAWWSQGIDILL-----YWPAWVC 60

Query: 191 VLVVYLALLYGTY---VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
           VL++    L+ T+   VPD     +      D G   N T G          A G+IDR 
Sbjct: 61  VLLLESVWLFITFLLPVPDCPTGYLGPGGIGDMGLYPNCTGG----------AAGFIDRW 110

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           +LG  H+Y +P+ +   A                   H P++PEG+L S++SIL   +G+
Sbjct: 111 LLGEKHIYQNPSSQGIYAT------------------HLPYDPEGILGSINSILIAFLGL 152

Query: 307 HFGHVIIHTKG-HLARLKQWVTMGFALLIFG--LTLHFTNA--IPLNKQLYTLSYVCVTS 361
             G +I+H +  H   + +++  GF L I    LT   TN   IP+NK L++LSYV   +
Sbjct: 153 QAGKIILHHRDLHQGVISRFLIWGFLLGIISAVLTNCSTNQGLIPINKNLWSLSYVTTLA 212

Query: 362 GAAALVFSAIYALVD 376
             A ++ + IY  VD
Sbjct: 213 CFAYVLLALIYYTVD 227


>gi|404256028|ref|ZP_10959996.1| hypothetical protein SPAM266_22806 [Sphingomonas sp. PAMC 26621]
          Length = 392

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 96/355 (27%), Positives = 151/355 (42%), Gaps = 81/355 (22%)

Query: 37  SLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           +LD+ RGLAVA MILV   G DW +    + HA WNG  LAD V P FLF VG+A+ L+ 
Sbjct: 2   ALDVLRGLAVAGMILVVSPG-DWGQAYVQLQHANWNGATLADMVFPTFLFSVGIALGLSF 60

Query: 93  KRIPDRADAVKKVIFRTLKLLFWGILLQ----GGFSHAPDELTY------------GVDV 136
            R  + A           + LFW  LL+            E TY            G  +
Sbjct: 61  PRRLETAGD---------RRLFWTRLLRRTALLILLGLLVEATYVWTIAAGAPYPGGPGL 111

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             IR+ G+LQRI L Y L  ++ + T   +D D   G   I  L      +  C++V+ +
Sbjct: 112 AHIRIPGILQRIGLCYGLAGILLLATNR-RDPD---GMIRINPLA-----IVGCIVVILI 162

Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
                     W   I       +  V     GV   L P  N  G++DR +    H++  
Sbjct: 163 GY--------WLLLI-------FVPVPGFGAGV---LTPAGNLPGFVDRTLFTEPHLWPL 204

Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
                  + T   P              A ++PEGLLS++ +  + + G+     +   +
Sbjct: 205 ------GSATAARP--------------ATYDPEGLLSTLPATANVLFGILSAWAL---R 241

Query: 317 GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
            +  R   ++ +  A L+F   L     + +NK+++T S+  ++ G +AL  +A+
Sbjct: 242 RYPDRALGYIAV-VAALLFSAGLALDPLLVINKRIWTSSFAVLSGGVSALALTAL 295


>gi|424666001|ref|ZP_18103037.1| hypothetical protein HMPREF1205_01876 [Bacteroides fragilis HMW
           616]
 gi|404574254|gb|EKA79005.1| hypothetical protein HMPREF1205_01876 [Bacteroides fragilis HMW
           616]
          Length = 387

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 86/373 (23%), Positives = 159/373 (42%), Gaps = 95/373 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL +LD+ RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
            ++L++        A  K++ RT+ +   G+ +     F H  + L+ G D+        
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCHTWNSLS-GEDIPFFSRLGE 124

Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                     +L+AA +
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKHKYIP----------------YLIAALL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
           +  ++ L+ G                  G  +N T           N +  +DR VLG  
Sbjct: 169 IGYFIILITGN-----------------GFEYNST-----------NILAVVDRAVLGEA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                      KD         +PEG+LS++ SI   +IG   G +
Sbjct: 201 HMY----------------------KD------NGIDPEGVLSTIPSIAHVLIGFCVGKL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   + +
Sbjct: 233 LMEVKDINEKIERLFLVGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLALL 290

Query: 372 YALVDCNIFGPWT 384
             ++D   +  W+
Sbjct: 291 VWIIDVKGYTRWS 303


>gi|381169858|ref|ZP_09879020.1| conserved hypothetical protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380689628|emb|CCG35507.1| conserved hypothetical protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 388

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 83/287 (28%), Positives = 118/287 (41%), Gaps = 73/287 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +L  G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNAPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+                          +L    +  V LALL G    
Sbjct: 138 QRIGLCYLAAALLV------------------------RYLPPRGIAPVCLALLLGY--- 170

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
            W F  +                  A+L+   NA   +D  + G +H+Y           
Sbjct: 171 -WAFLYVFGQPG-------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                     RKD        F+PEGLL ++S+ ++ + G   G  +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236


>gi|386312853|ref|YP_006009018.1| N-acetylglucosamine related transporter, NagX [Shewanella
           putrefaciens 200]
 gi|319425478|gb|ADV53552.1| N-acetylglucosamine related transporter, NagX [Shewanella
           putrefaciens 200]
          Length = 384

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 98/369 (26%), Positives = 151/369 (40%), Gaps = 104/369 (28%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G +  D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDLIFPLF 77

Query: 81  LFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGF-SHAPDELTYGV 134
           +F+ GVA+ L+ KR+      DR    +  I R   LL  GIL   G+ + AP       
Sbjct: 78  IFLSGVALGLSPKRLDKLPMKDRLPVYRHGIKRLFLLLLLGILYNHGWGTGAP------A 131

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           D   +R   VL RIA ++   +L+                        WH  +   V+V 
Sbjct: 132 DPEKVRYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQVIVA 168

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GI 250
            L +L G                 YG +        G    L+P  +   Y+D  +L G+
Sbjct: 169 -LGILLG-----------------YGAMQLWLPFPSGQAGVLSPTQSINAYVDSILLPGV 210

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
           ++    P                              +PEGLLS++ ++++ + GV  G+
Sbjct: 211 SYQGRTP------------------------------DPEGLLSTIPAVVNALAGVFVGY 240

Query: 311 VII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
            I+  H +G   ++    T G A L  G  L     IP+NK+L+T S+V VTSG + ++ 
Sbjct: 241 FIVKSHPQGEWVKVGLLATAGGAWLALGWLLD--GVIPVNKELWTSSFVLVTSGWSMILL 298

Query: 369 SAIYALVDC 377
           +  YALVD 
Sbjct: 299 ALFYALVDV 307


>gi|239907232|ref|YP_002953973.1| hypothetical protein DMR_25960 [Desulfovibrio magneticus RS-1]
 gi|239797098|dbj|BAH76087.1| hypothetical membrane protein [Desulfovibrio magneticus RS-1]
          Length = 371

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 147/361 (40%), Gaps = 92/361 (25%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL+S+D  RGLA+A MI+V++ G     +P++ HA W+G  LAD V P FLF+VGV +AL
Sbjct: 8   RLSSVDTLRGLAIAAMIVVNNPGDRRFVYPQLLHAQWHGLTLADVVFPLFLFLVGVCVAL 67

Query: 91  ALKRIPD-------RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           A+   PD       RA   +K++ R   L   G+        + DEL         RL G
Sbjct: 68  AID--PDKPRDAEARARLWRKILPRAAVLFALGLGENAYLRLSFDEL---------RLPG 116

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           VLQRIA+ YL  + ++     +  +  +V              + A  L+ Y  LL    
Sbjct: 117 VLQRIAVVYLAAAWLQ---SRLSSRALAV--------------VGAVTLLGYWLLLAAVP 159

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           VP      +  +                      N  G++D+ VLG +    H +W    
Sbjct: 160 VPGHGHPSLAMEP---------------------NLQGWLDQLVLGRHIWKFHTSW---- 194

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                                   +PEG+LS+  +I   +IGV  G  +        R  
Sbjct: 195 ------------------------DPEGILSTFPAIALGLIGVLAGRWLRRGGDRPGRAG 230

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
                   LL+  + L +    PLNK L T S+V +T G    + +  +A++D      W
Sbjct: 231 L-----LGLLLIAIGLAWDAVFPLNKSLCTSSFVLLTGGLGLAMLAVAHAVLDGRPPAAW 285

Query: 384 T 384
            
Sbjct: 286 A 286


>gi|408369302|ref|ZP_11167083.1| hypothetical protein I215_00330 [Galbibacter sp. ck-I2-15]
 gi|407745048|gb|EKF56614.1| hypothetical protein I215_00330 [Galbibacter sp. ck-I2-15]
          Length = 345

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 91/327 (27%), Positives = 135/327 (41%), Gaps = 83/327 (25%)

Query: 49  MILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA--V 102
           MI+V+  G  W  +    SHA W+G  L D V P FLF+VG A++ ++++    + A  +
Sbjct: 1   MIIVNTPG-SWGSVYRPLSHASWHGFTLTDLVFPTFLFVVGNAMSFSMRKFEQTSQAAFL 59

Query: 103 KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT 162
           KKVI RT  +   G LL         +L    D R+    GVLQRIAL Y   SLV    
Sbjct: 60  KKVIKRTFVIFAIGFLLSWFPFFRDGQLKPLEDARIF---GVLQRIALCYFFASLVI--- 113

Query: 163 KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV 222
                                H+      LV  +  L G       + +I     DY   
Sbjct: 114 ---------------------HYFKIKGALVFSMVALLG-------YHLIMYTMGDY--- 142

Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
                          NA   +D  +LG NH+Y                 EG         
Sbjct: 143 -----------TLEGNAALKLDLWLLGPNHLYQG---------------EG--------- 167

Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFT 342
              PF+PEGLLS++ + ++ I G +F  + +   G   +    + +G A L+F L L + 
Sbjct: 168 --IPFDPEGLLSTLPATVNVIFG-YFAGLFLQQSGKNFKTIALLMIGGATLVF-LALGWD 223

Query: 343 NAIPLNKQLYTLSYVCVTSGAAALVFS 369
              P+NK+++T SYV +T G   +V +
Sbjct: 224 LLFPINKKIWTSSYVLLTVGIDVMVLA 250


>gi|338211253|ref|YP_004655306.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336305072|gb|AEI48174.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 374

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 92/377 (24%), Positives = 152/377 (40%), Gaps = 102/377 (27%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAV-------ALMILVDHAGGDW-------PEISHAPWNG 69
            Q  +  +  RL SLD  RG  +        +  L+  A G W        ++SH  WNG
Sbjct: 2   SQPSTDTRPHRLLSLDALRGFDMFWITGGEEIFHLLAKATG-WTGAIIMAEQLSHPDWNG 60

Query: 70  CNLADFVMPFFLFIVGV----AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
               D + P FLF+ GV    ++ + L+R  DR   ++KVI R L L+  GI+   G   
Sbjct: 61  FRAYDLIFPLFLFLSGVSAPYSLGVRLERGDDRGKMLRKVIQRGLTLVLLGIIYNNGLQI 120

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD-VQDKDQSVGRFSIFRLYCWH 184
            P E         +R   VL RI L+ +   ++ ++T   VQ              Y W 
Sbjct: 121 KPLE--------DMRFPSVLGRIGLAGMFAQIIYLYTSTRVQ--------------YIW- 157

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
                     +++LL G     W F ++              CG    +   CN V Y+D
Sbjct: 158 ----------FVSLLLGY----WAFVMLVPVPG---------CGA-GLMTMECNPVSYLD 193

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R ++                        G L KD     H   +PEGL+S++ +I + ++
Sbjct: 194 RLII-----------------------PGHLHKD----IH---DPEGLVSTIPAIATGLL 223

Query: 305 GVHFGHVIIHTKGHLARLKQWVTM---GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
           G+  G+++   +   +R ++ + +   G   LI G    +    P+NK L+T S+V    
Sbjct: 224 GIFAGNLLRADERSTSRTQKVLVLFVAGILFLIIGKLWDYV--FPINKNLWTSSFVMTVG 281

Query: 362 GAAALVFSAIYALVDCN 378
           G + ++ S  Y ++D  
Sbjct: 282 GWSLILLSLFYWIIDVQ 298


>gi|325922207|ref|ZP_08183994.1| hypothetical protein XGA_3017 [Xanthomonas gardneri ATCC 19865]
 gi|325547326|gb|EGD18393.1| hypothetical protein XGA_3017 [Xanthomonas gardneri ATCC 19865]
          Length = 390

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 100/382 (26%), Positives = 155/382 (40%), Gaps = 94/382 (24%)

Query: 6   AETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEI 62
           +E T     I + P ++ ++E       R  SLD+FRGL + LMILV+ AG     + ++
Sbjct: 2   SEQTPAAAAITASPTLTPKRE-------RFLSLDVFRGLTIFLMILVNTAGPGAQAYAQL 54

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALAL------KRIPDRADAVKKVIFRTLKLLFWG 116
           +HA W G  LAD V P FLF VG A++ AL      +    R      +IF    L++W 
Sbjct: 55  THAAWFGFTLADLVFPSFLFAVGSAMSFALAADTPHRPFLGRVGKRAALIFLCGVLMYWF 114

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
                 F   P        +  +RL GVLQRI L YLL +L+  +               
Sbjct: 115 PF----FHLQPGGGWAFTAIDQLRLTGVLQRIGLCYLLAALLVRYLPP------------ 158

Query: 177 IFRLYCWHWLMAACV--LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                    +  ACV  L+ Y A+LY          +  +  A+  K  N   G R    
Sbjct: 159 -------RGIAPACVALLLGYWAVLY----------VFGQPGAELSKTGN--AGTR---- 195

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
                   +D  + G  H+Y                     RKD        F+PEGLL 
Sbjct: 196 --------LDLWLYGRAHLY---------------------RKD------NGFDPEGLLG 220

Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTL 354
           ++ + ++ + G   G   +  +G  A   +   +   + +  L L +  A PL+K+L++ 
Sbjct: 221 TLPATVNVLAGYLTGR-FLQRRGKTAAATR-TLLLAGVGLVLLALLWNPAWPLSKKLWSG 278

Query: 355 SYVCVTSGAAALVFSAIYALVD 376
           S+V  T G   L    +  L++
Sbjct: 279 SFVACTVGLDLLALGVLVYLLE 300


>gi|313147781|ref|ZP_07809974.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|423280992|ref|ZP_17259903.1| hypothetical protein HMPREF1203_04120 [Bacteroides fragilis HMW
           610]
 gi|313136548|gb|EFR53908.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|404583442|gb|EKA88121.1| hypothetical protein HMPREF1203_04120 [Bacteroides fragilis HMW
           610]
          Length = 387

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 86/373 (23%), Positives = 159/373 (42%), Gaps = 95/373 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL +LD+ RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
            ++L++        A  K++ RT+ +   G+ +     F H  + L+ G D+        
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCHTWNSLS-GEDIPFFSRLGE 124

Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                     +L+AA +
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKHKYIP----------------YLIAALL 168

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
           +  ++ L+ G                  G  +N T           N +  +DR VLG  
Sbjct: 169 IGYFIILITGN-----------------GFEYNST-----------NILAVVDRAVLGEA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                      KD         +PEG+LS++ SI   +IG   G +
Sbjct: 201 HMY----------------------KD------NGIDPEGVLSTIPSIAHVLIGFCVGKL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   + +
Sbjct: 233 LMEVKDINEKIERLFLVGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLALL 290

Query: 372 YALVDCNIFGPWT 384
             ++D   +  W+
Sbjct: 291 VWIIDVKGYTRWS 303


>gi|404450663|ref|ZP_11015643.1| hypothetical protein A33Q_15100 [Indibacter alkaliphilus LW1]
 gi|403763718|gb|EJZ24662.1| hypothetical protein A33Q_15100 [Indibacter alkaliphilus LW1]
          Length = 381

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 50/134 (37%), Positives = 77/134 (57%), Gaps = 10/134 (7%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +R  +LD+ RGL +ALMI+V+  G  W  +     HAPW+G  + D V P FLF+VG A+
Sbjct: 13  ERYLALDVLRGLTIALMIVVNTPG-SWSHMYGPFMHAPWHGFTITDLVFPTFLFVVGNAM 71

Query: 89  ALALKRIPDRADA--VKKVIFRTLKLLF--WGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           + ++K++        ++KV+ R+  +    WG+     F    + L   ++   +RL GV
Sbjct: 72  SFSMKKLEKMGQGLFLRKVLKRSFLIFIIGWGLNAFPFFDQTENGLAM-INWGEVRLLGV 130

Query: 145 LQRIALSYLLVSLV 158
           LQRIAL YL+ SLV
Sbjct: 131 LQRIALCYLIASLV 144



 Score = 41.6 bits (96), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 40/72 (55%), Gaps = 2/72 (2%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEGLLS+  ++++ I G   G  I     +   +K  +  G  L++  + L +    
Sbjct: 205 PFDPEGLLSTFPAVVNVIAGYFAGKFIQQMGNNTKTVKYLLVAG--LILIAVCLAWDPFF 262

Query: 346 PLNKQLYTLSYV 357
           P+NK+L+T SYV
Sbjct: 263 PINKKLWTSSYV 274


>gi|329964617|ref|ZP_08301671.1| hypothetical protein HMPREF9446_03278 [Bacteroides fluxus YIT
           12057]
 gi|328525017|gb|EGF52069.1| hypothetical protein HMPREF9446_03278 [Bacteroides fluxus YIT
           12057]
          Length = 396

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 98/391 (25%), Positives = 163/391 (41%), Gaps = 113/391 (28%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
            S   ++R+ +LDI RG+ +A MI+V++ G  W  I    +HA W G    D V PFF+F
Sbjct: 2   NSQKTSKRILALDILRGITIAGMIMVNNPG-SWAHIYAPLAHAQWIGLTPTDLVFPFFMF 60

Query: 83  IVGVAIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYG 133
           I+G++  ++LK+        A  K++ RT+ +   G+ + G FS        APD L +G
Sbjct: 61  IMGISTYISLKKYNFEFSHAAALKILKRTVIIFLIGMAI-GWFSRFCYYWSSAPDNLGFG 119

Query: 134 VDV--------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
            ++        RM R+ GV+QR+AL Y   S++ +  K                      
Sbjct: 120 ENLWASVWTFDRM-RILGVMQRLALCYGATSIIALTMKHKNIP----------------- 161

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
            + A +L+ Y  LL G                  G  +N T           N +  +DR
Sbjct: 162 YLIATLLIGYFILLLG----------------GNGFAYNET-----------NILSIVDR 194

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            VL   HMY                      KD         +PEG+LS++ +I   ++G
Sbjct: 195 AVLTPAHMY----------------------KD------NGIDPEGILSTIPAIAHVLLG 226

Query: 306 VHFGHVII----HTKG--------HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYT 353
              G +++      KG        HL +L      G  L   G  L +    P++K++++
Sbjct: 227 FCVGRMMLGDSQSAKGDRTNVLDSHLIKL---FLAGTILTFAGFLLSY--GCPISKKIWS 281

Query: 354 LSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
            ++V VT G A+   + +  ++D   +  W+
Sbjct: 282 PTFVLVTCGLASSFLALLIWIIDVKGYKRWS 312


>gi|390989491|ref|ZP_10259788.1| conserved hypothetical protein [Xanthomonas axonopodis pv. punicae
           str. LMG 859]
 gi|372555760|emb|CCF66763.1| conserved hypothetical protein [Xanthomonas axonopodis pv. punicae
           str. LMG 859]
          Length = 388

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 81/287 (28%), Positives = 117/287 (40%), Gaps = 73/287 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +L  G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+  +                         +   +L+ Y ALLY    P
Sbjct: 138 QRIGLCYLAAALLVRYLPQRGIAP-----------------VCLALLLGYWALLYAFGQP 180

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                    A+L+   NA   +D  + G +H+Y           
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                     RKD        F+PEGLL ++S+ ++ + G   G  +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236


>gi|384362003|ref|YP_006199855.1| hypothetical protein CDBI1_13575 [Clostridium difficile BI1]
          Length = 485

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 113 SKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 172

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 173 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVR 223

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 224 ILGVLQRMGLVYFVTSLVYLLLKKLN 249



 Score = 46.2 bits (108), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 303 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 360

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 361 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTP 401


>gi|255651295|ref|ZP_05398197.1| hypothetical protein CdifQCD_14003 [Clostridium difficile
           QCD-37x79]
          Length = 461

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 89  SKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 148

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 149 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVR 199

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 200 ILGVLQRMGLVYFVTSLVYLLLKKLN 225



 Score = 46.2 bits (108), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 279 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 336

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 337 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTP 377


>gi|255518179|ref|ZP_05385855.1| hypothetical protein CdifQCD-_13768 [Clostridium difficile
           QCD-97b34]
          Length = 469

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 97  SKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 156

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 157 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVR 207

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 208 ILGVLQRMGLVYFVTSLVYLLLKKLN 233



 Score = 46.2 bits (108), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 287 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 344

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 345 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTP 385


>gi|254976379|ref|ZP_05272851.1| hypothetical protein CdifQC_13741 [Clostridium difficile QCD-66c26]
          Length = 459

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 87  SKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 146

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 147 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVR 197

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 198 ILGVLQRMGLVYFVTSLVYLLLKKLN 223



 Score = 46.2 bits (108), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 277 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 334

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 335 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTP 375


>gi|262381451|ref|ZP_06074589.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262296628|gb|EEY84558.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 372

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 99/385 (25%), Positives = 146/385 (37%), Gaps = 108/385 (28%)

Query: 25  QEKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPW 67
            E+   + QRL SLD  RG           L VAL  L  +      AG    ++ H  W
Sbjct: 1   MERQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEW 56

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
           NG    D + P FLFI G++   +L++   +        KK++ R + L+F G++  G  
Sbjct: 57  NGLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLL 116

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
           S   D L         R   VL RI L ++  +L  +F +        +           
Sbjct: 117 SFEFDHL---------RCASVLARIGLGWMFAAL--LFVRFGWKARAGI----------- 154

Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
                A +LV Y   +    VPD                     G         N VGYI
Sbjct: 155 ----TALILVGYWLAMAFVPVPD--------------------AGGAGPFTLEGNLVGYI 190

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILST 302
           DR  L                               P   H   F+PEGL S+V +I + 
Sbjct: 191 DRLFL-------------------------------PGRLHETVFDPEGLFSTVPAIATA 219

Query: 303 IIGVHFGHVIIHTKGHLARLKQ---WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           ++G+  G  I   K  L   K+    V  G  LLI GL   ++   P+NK+L+T S+VCV
Sbjct: 220 MLGMFTGEWIKLRKEGLTDRKKVLCLVGAGAVLLIVGLL--WSLVFPINKKLWTSSFVCV 277

Query: 360 TSGAAALVFSAIYALVDCNIFGPWT 384
               +A +F+  + ++D   +  WT
Sbjct: 278 VGAYSAWMFALFFYIIDVLGWRKWT 302


>gi|255656770|ref|ZP_05402179.1| hypothetical protein CdifQCD-2_14006 [Clostridium difficile
           QCD-23m63]
          Length = 481

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 109 SKLMNSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 168

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 169 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNTVR 219

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 220 ILGVLQRMGLVYFVTSLVYLLLKKLN 245



 Score = 45.8 bits (107), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 299 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 356

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 357 PFNKRLWSSSFVLLMAGSYGVLLSIFYFICDIKNKSKIFTP 397


>gi|265765098|ref|ZP_06093373.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
 gi|263254482|gb|EEZ25916.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
          Length = 387

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 85/374 (22%), Positives = 153/374 (40%), Gaps = 97/374 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
            +RL +LD+ RG+ +A MI+V++ G  W  I     HA W G    D V PFF+FI+G++
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPG-SWSYIYAPLGHAAWIGLTPTDLVFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM----- 138
             ++L++        A  K++ RT+ +   G+ +     F    + L+ G D+       
Sbjct: 65  TYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLY 123

Query: 139 --------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
                   IR+ GV+QR+AL Y   +++ +  K                           
Sbjct: 124 ESVWTFGHIRILGVMQRLALCYGATAIIALIMKH-------------------------- 157

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
               Y+  L    +  +   +IN +  +Y                  N +  +DR VLG 
Sbjct: 158 ---KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGE 199

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            HMY                      KD         +PEGLLS++ SI   +IG   G 
Sbjct: 200 AHMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGK 231

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           +++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   + 
Sbjct: 232 LLMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLAL 289

Query: 371 IYALVDCNIFGPWT 384
           +  ++D   +  W+
Sbjct: 290 LVWIIDVRGYTRWS 303


>gi|410028220|ref|ZP_11278056.1| hypothetical protein MaAK2_03415 [Marinilabilia sp. AK2]
          Length = 382

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 51/134 (38%), Positives = 78/134 (58%), Gaps = 9/134 (6%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +R  +LD+ RGL +ALM++V+  G  W  +     HA W+G  + D + P FLF+VG A+
Sbjct: 13  ERYLALDVLRGLTIALMVVVNTPG-SWSHMYAPFMHADWHGFTITDLIFPTFLFVVGNAM 71

Query: 89  ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRMIRLCGV 144
           + ++KR+     +  +KKV  RTL +   G LL    F +   E  Y  ++   +RL GV
Sbjct: 72  SFSMKRMESMGQSLFLKKVFKRTLLIFLIGWLLNAFPFFNYNAETGYSMINWSEVRLLGV 131

Query: 145 LQRIALSYLLVSLV 158
           LQRIAL Y+L +L+
Sbjct: 132 LQRIALCYMLAALI 145



 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 42/75 (56%), Gaps = 2/75 (2%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEG+LS+  S+++ I G   G  I  +  +   +K+ V  G  LL+    L +    
Sbjct: 206 PFDPEGVLSTFPSVVNVIAGFLVGKFIQESGNNTGTVKKLVIWGIILLV--ACLVWDMVF 263

Query: 346 PLNKQLYTLSYVCVT 360
           P+NK+++T  YV +T
Sbjct: 264 PINKKIWTSPYVLLT 278


>gi|282877735|ref|ZP_06286550.1| putative membrane protein [Prevotella buccalis ATCC 35310]
 gi|281300307|gb|EFA92661.1| putative membrane protein [Prevotella buccalis ATCC 35310]
          Length = 403

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 96/385 (24%), Positives = 159/385 (41%), Gaps = 106/385 (27%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           +++R+ ++DI RG+ +A MILV++ G  W  I     HA WNG    D V PFF+F++G+
Sbjct: 8   QSKRILAIDILRGITIAGMILVNNPG-SWAHIFAPLEHAEWNGMTPTDLVFPFFMFVMGM 66

Query: 87  AIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV- 136
            I +++++     +   V K+I RTL L   GI + G FS          ++ T G  + 
Sbjct: 67  CIFISMQKYQFACNRQTVYKIIRRTLLLYLVGIFV-GWFSRFCYRWAFPLEDATLGQQIW 125

Query: 137 ------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
                   IRL GVL R+A+ Y + +L+ I  +          R+               
Sbjct: 126 HTVWSFDTIRLSGVLARLAICYGITALLAITVRH---------RY--------------- 161

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           +L + + LL G       +TI+            + CG         N +  +DR VL  
Sbjct: 162 LLSIVITLLIG-------YTIL------------LFCG-NGFAYDETNILSIVDRAVLTD 201

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            HMYH                                +PEGLLS+  +I  T+IG   G 
Sbjct: 202 AHMYHDNG----------------------------IDPEGLLSTFPAIAHTLIGFLIGK 233

Query: 311 VIIHTKGHLARLKQ-------WVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVC 358
           +    +  +    Q        +     L I G  L F+  +     PLNK++++ ++V 
Sbjct: 234 LAFSKQKTVTATTQNDPKTGLILHNIVPLFIAGTILTFSGLLLAYGCPLNKKIWSPTFVL 293

Query: 359 VTSGAAALVFSAIYALVDCNIFGPW 383
            + G A+ + + +  L+D   +  W
Sbjct: 294 TSCGLASTLLALLIWLIDVKGYKRW 318


>gi|406662851|ref|ZP_11070935.1| hypothetical protein B879_02963 [Cecembia lonarensis LW9]
 gi|405553158|gb|EKB48438.1| hypothetical protein B879_02963 [Cecembia lonarensis LW9]
          Length = 382

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/134 (36%), Positives = 77/134 (57%), Gaps = 9/134 (6%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +R  +LD+ RGL +ALM++V+  G  W  +     HA W+G  + D + P FLF+VG A+
Sbjct: 13  ERYLALDVLRGLTIALMVVVNTPG-SWSHMYAPFMHADWHGFTITDLIFPTFLFVVGNAM 71

Query: 89  ALALKRIPDRADAV--KKVIFRTLKLLFWGILLQG--GFSHAPDELTYGVDVRMIRLCGV 144
           + ++K++      V  KKV  RTL +   G LL      ++ P+     ++   +RL GV
Sbjct: 72  SFSMKKLESMGQQVFLKKVFKRTLLIFLIGWLLNAFPFVNYNPESGYSMINWSEVRLLGV 131

Query: 145 LQRIALSYLLVSLV 158
           LQRIAL Y+L +L+
Sbjct: 132 LQRIALCYMLAALI 145



 Score = 45.4 bits (106), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEG+LS+  S+++ I G   G  I  +  +   +K+ V  G  LL+    L +  A 
Sbjct: 206 PFDPEGILSTFPSVVNVIAGFLVGKFIQDSGNNTGTVKKMVIWGIILLV--ACLIWDMAF 263

Query: 346 PLNKQLYTLSYVCVT 360
           P+NK+++T  YV +T
Sbjct: 264 PINKKIWTSPYVLLT 278


>gi|255013328|ref|ZP_05285454.1| hypothetical protein B2_05430 [Bacteroides sp. 2_1_7]
 gi|410103821|ref|ZP_11298742.1| hypothetical protein HMPREF0999_02514 [Parabacteroides sp. D25]
 gi|409236550|gb|EKN29357.1| hypothetical protein HMPREF0999_02514 [Parabacteroides sp. D25]
          Length = 372

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 95/386 (24%), Positives = 149/386 (38%), Gaps = 110/386 (28%)

Query: 25  QEKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPW 67
            EK   + QRL SLD  RG           L VAL  L  +      AG    ++ H  W
Sbjct: 1   MEKQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEW 56

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
           NG    D + P FLFI G++   +L++   +        KK++ R + L+F G++  G  
Sbjct: 57  NGLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLL 116

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
           S   D L         R   VL RI L ++  +L                   +F  + W
Sbjct: 117 SFEFDHL---------RCASVLARIGLGWMFAAL-------------------LFVRFGW 148

Query: 184 HWLMAACVLVV---YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
                  VL++   +LA+ +   VPD                     G         N V
Sbjct: 149 KVRAGITVLILVGYWLAMAF-VPVPD--------------------VGGAGPFTLEGNLV 187

Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSI 299
           GYIDR  L                               P   H   F+PEGL S+V +I
Sbjct: 188 GYIDRLFL-------------------------------PGRLHETVFDPEGLFSTVPAI 216

Query: 300 LSTIIGVHFGHVI-IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
            + ++G+  G  I +  +G   R K+   +G   ++  + L ++   P+NK+L+T S+VC
Sbjct: 217 ATAMLGMFTGEWIKLRKEGLTDRKKELCLVGAGAVLLIVGLLWSLVFPINKKLWTSSFVC 276

Query: 359 VTSGAAALVFSAIYALVDCNIFGPWT 384
           V    +  +F+  + ++D   +  WT
Sbjct: 277 VVGAYSVWMFALFFYIIDVLGWRKWT 302


>gi|298376668|ref|ZP_06986623.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
 gi|298266546|gb|EFI08204.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
          Length = 372

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 99/385 (25%), Positives = 146/385 (37%), Gaps = 108/385 (28%)

Query: 25  QEKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPW 67
            EK   + QRL SLD  RG           L VAL  L  +      AG    ++ H  W
Sbjct: 1   MEKQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEW 56

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
           NG    D + P FLFI G++   +L++   +        KK++ R + L+F G++  G  
Sbjct: 57  NGLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLL 116

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
           S   D L         R   VL RI L ++  +L  +F +        +           
Sbjct: 117 SFEFDHL---------RCASVLARIGLGWMFAAL--LFVRFGWKARAGI----------- 154

Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
                A +LV Y  ++    VPD                     G         N VGYI
Sbjct: 155 ----TALILVGYWLVMAFVPVPD--------------------AGGAGPFTLEGNLVGYI 190

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILST 302
           DR  L                               P   H   F+PEGL S+V +I + 
Sbjct: 191 DRLFL-------------------------------PGRLHETVFDPEGLFSTVPAIATA 219

Query: 303 IIGVHFGHVIIHTKGHLARLKQ---WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           ++G+  G  I   K  L   K+    V  G  LLI GL   ++   P+NK+L+T S+VCV
Sbjct: 220 MLGMFTGEWIKLGKEGLTDRKKVLCLVGAGAVLLIVGLL--WSLVFPVNKKLWTSSFVCV 277

Query: 360 TSGAAALVFSAIYALVDCNIFGPWT 384
               +  +F+  + ++D   +  WT
Sbjct: 278 VGAYSVWMFALFFYIIDVLGWRKWT 302


>gi|329956032|ref|ZP_08296803.1| hypothetical protein HMPREF9445_01662 [Bacteroides clarus YIT
           12056]
 gi|328524791|gb|EGF51845.1| hypothetical protein HMPREF9445_01662 [Bacteroides clarus YIT
           12056]
          Length = 396

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 90/378 (23%), Positives = 153/378 (40%), Gaps = 98/378 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
            +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 8   NKRILALDILRGVTIAGMIMVNNPG-TWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 66

Query: 88  IALALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSH------APDE-LTYGVDV--- 136
             ++LK+   + + AV   I +   L+F   +  G FS       +P E +++G  +   
Sbjct: 67  TYISLKKYNFEFSRAVGMKILKRTILIFLIGMAIGWFSKFCYYWTSPTEGISFGAQLWES 126

Query: 137 ----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
                 IR+ GV+QR+AL Y   +++ +  K                       + A +L
Sbjct: 127 VWTFDRIRILGVMQRLALCYGATAIIALTVKHRNIP-----------------YLIATLL 169

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
             Y  LL                           CG     N   N +  IDR +L   H
Sbjct: 170 TGYFILL--------------------------VCGNGFAYN-DTNILSVIDRTILTPAH 202

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
           MY                                 +PEGLLS++ SI   ++G   G ++
Sbjct: 203 MYKDNG----------------------------IDPEGLLSTIPSIAHVLLGFCVGRMM 234

Query: 313 IHT-KGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVTSGAAAL 366
           +   K +  R     +    L + G  L F+  +     P+NK++++ ++V  T G A+ 
Sbjct: 235 LENGKANEDRESMLNSHLIKLFLAGTILTFSGFLLSYGCPINKKIWSPTFVLTTCGLASS 294

Query: 367 VFSAIYALVDCNIFGPWT 384
             + +  ++D   +  W+
Sbjct: 295 FLALLIWIIDVKGYKKWS 312


>gi|60679957|ref|YP_210101.1| hypothetical protein BF0369 [Bacteroides fragilis NCTC 9343]
 gi|336407897|ref|ZP_08588393.1| hypothetical protein HMPREF1018_00408 [Bacteroides sp. 2_1_56FAA]
 gi|423248371|ref|ZP_17229387.1| hypothetical protein HMPREF1066_00397 [Bacteroides fragilis
           CL03T00C08]
 gi|423253319|ref|ZP_17234250.1| hypothetical protein HMPREF1067_00894 [Bacteroides fragilis
           CL03T12C07]
 gi|423269643|ref|ZP_17248615.1| hypothetical protein HMPREF1079_01697 [Bacteroides fragilis
           CL05T00C42]
 gi|423272798|ref|ZP_17251745.1| hypothetical protein HMPREF1080_00398 [Bacteroides fragilis
           CL05T12C13]
 gi|60491391|emb|CAH06139.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
 gi|335944976|gb|EGN06793.1| hypothetical protein HMPREF1018_00408 [Bacteroides sp. 2_1_56FAA]
 gi|392657219|gb|EIY50856.1| hypothetical protein HMPREF1067_00894 [Bacteroides fragilis
           CL03T12C07]
 gi|392659584|gb|EIY53202.1| hypothetical protein HMPREF1066_00397 [Bacteroides fragilis
           CL03T00C08]
 gi|392700489|gb|EIY93651.1| hypothetical protein HMPREF1079_01697 [Bacteroides fragilis
           CL05T00C42]
 gi|392708362|gb|EIZ01469.1| hypothetical protein HMPREF1080_00398 [Bacteroides fragilis
           CL05T12C13]
          Length = 387

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 83/373 (22%), Positives = 153/373 (41%), Gaps = 95/373 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL +LD+ RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
            ++L++        A  K++ RT+ +   G+ +     F    + L+ G D+        
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLYE 124

Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                            
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKH--------------------------- 157

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              Y+  L    +  +   +IN +  +Y                  N +  +DR VLG  
Sbjct: 158 --KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGEA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                      KD         +PEGLLS++ SI   +IG   G +
Sbjct: 201 HMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   + +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLALL 290

Query: 372 YALVDCNIFGPWT 384
             ++D   +  W+
Sbjct: 291 VWIIDVRGYTRWS 303


>gi|24375008|ref|NP_719051.1| N-acetylglucosamine locus membrane protein of unknown function
           DUF1624 NagX [Shewanella oneidensis MR-1]
 gi|24349746|gb|AAN56495.1| N-acetylglucosamine locus membrane protein of unknown function
           DUF1624 NagX [Shewanella oneidensis MR-1]
          Length = 395

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 148/372 (39%), Gaps = 110/372 (29%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +           AL+I    AG  W   ++ H+ W+G  L D + P F
Sbjct: 29  RLMSLDALRGFDMFWILGGEALFGALLIFTGWAGWQWGDTQMHHSEWHGFRLYDLIFPLF 88

Query: 81  LFIVGVAIALALKRI---------PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           +F+ GVA+ L+ KR+         P     VK++    L  + +     G  + AP    
Sbjct: 89  IFLSGVALGLSPKRLDKLPLHERLPVYRHGVKRLFLLLLLGILYN---HGWGTGAP---- 141

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
             VD   IR   VL RIA ++   +L+                        WH  +   V
Sbjct: 142 --VDPDKIRYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQV 176

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL 248
           LV  + +L G                 YG +        G    L+P  +   Y+D  +L
Sbjct: 177 LVA-VGILVG-----------------YGAMQLWLPFPGGQAGVLSPTVSINAYVDSLLL 218

Query: 249 -GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
            G+++    P                              +PEG+LS++ ++++ + GV 
Sbjct: 219 PGVSYQGRMP------------------------------DPEGVLSTLPAVVNALAGVF 248

Query: 308 FGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
            GH I+  H KG  A++      G   L  G  L     IP+NK+L+T S+V VTSG + 
Sbjct: 249 VGHFIVKSHPKGEWAKVGLLGAAGGVCLALGWLLD--AVIPVNKELWTSSFVLVTSGWSM 306

Query: 366 LVFSAIYALVDC 377
           L+ +  YALVD 
Sbjct: 307 LLLALFYALVDV 318


>gi|323343595|ref|ZP_08083822.1| transmembrane protein [Prevotella oralis ATCC 33269]
 gi|323095414|gb|EFZ37988.1| transmembrane protein [Prevotella oralis ATCC 33269]
          Length = 384

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 88/361 (24%), Positives = 147/361 (40%), Gaps = 98/361 (27%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVM 77
           D + QQ+K      R+ ++DI RG+ +A MILV++ G D  +  + HA W G    D V 
Sbjct: 2   DTATQQKK------RILAVDILRGMTIAGMILVNNPGTDTVYAPLEHAEWIGLTPTDLVF 55

Query: 78  PFFLFIVGVAIALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHA-----PDEL 130
           PFF+FI+G+   L+LK+   +   +  +K+  R L L   G+ +   F        PD  
Sbjct: 56  PFFMFIMGITTYLSLKKFEFKWSVECGRKIAKRALLLWLIGLAISWLFMFCRGLLDPDMS 115

Query: 131 TYGVDVRM---------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
           +     R+         +RL GVL R+ + Y L ++V +  K                  
Sbjct: 116 SMPFGSRLWASVNTFDQLRLLGVLPRLGICYGLAAVVALSVKHKYIP------------- 162

Query: 182 CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
              WL+A   +  Y+ L                           TC   A  +   N + 
Sbjct: 163 ---WLIAIIFIGYYILL--------------------------ETCNGYA--HDASNILA 191

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
            +D  VLG  H+Y                     R ++P       +PEGLLS+  ++  
Sbjct: 192 IVDDAVLGHGHVY---------------------RWESP-------DPEGLLSTFPALAH 223

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
            +IG   G  ++  +    ++++   +G  L   G  L +  A P++K+L+T ++  VT 
Sbjct: 224 VLIGFCVGRTVMEMQNLNDKIERLFLIGALLTFAGFLLSY--ACPISKKLWTPTFAMVTC 281

Query: 362 G 362
           G
Sbjct: 282 G 282


>gi|225875032|ref|YP_002756491.1| hypothetical protein ACP_3497 [Acidobacterium capsulatum ATCC
           51196]
 gi|225792728|gb|ACO32818.1| putative membrane protein [Acidobacterium capsulatum ATCC 51196]
          Length = 378

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 91/369 (24%), Positives = 158/369 (42%), Gaps = 92/369 (24%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD----WPEISHAPWNGCNLADFVMP 78
            Q + + +  +R+ S+D+ RG+ +A MILV++ G +    W  + HA WNG    D V P
Sbjct: 2   KQHQDTVVNAKRMVSIDLLRGITIAFMILVNNNGDEAHAFW-ALKHAQWNGFTPTDLVFP 60

Query: 79  FFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
            F+F+VG+++  +    L+R   R       + R++ L   G+++  GF +      +G 
Sbjct: 61  TFIFVVGISLVFSTEARLRRGQSRLLIAAHALRRSVILFLLGLVVN-GFPY----FHFGT 115

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
               +R+ GVLQRIA+ YL  SL+ + ++ V  +                 L+    LV 
Sbjct: 116 ----LRIYGVLQRIAICYLFGSLLYLLSRRVWLQA----------------LLFTTALVG 155

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y AL+    VP   + +  +D                 L+P  N V ++DR +L      
Sbjct: 156 YWALMRWVPVPG--YGLPGRDIPF--------------LDPNANLVAWLDRLLL------ 193

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
             P   R  A T+D                    PEGLLS++ ++ + ++G       + 
Sbjct: 194 --PG--RLYAGTRD--------------------PEGLLSTIPAMGTLLLG-------MM 222

Query: 315 TKGHL-----ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           T G L      R K  + +  A +       +    P+NK+++T SYV    G + L F+
Sbjct: 223 TAGWLRSAAAPRRKLMLLLAAAGIALTAGALWGLEFPINKRVWTSSYVLYAGGWSLLAFA 282

Query: 370 AIYALVDCN 378
             + + +  
Sbjct: 283 LCFWMTEVR 291


>gi|423259248|ref|ZP_17240171.1| hypothetical protein HMPREF1055_02448 [Bacteroides fragilis
           CL07T00C01]
 gi|423263781|ref|ZP_17242784.1| hypothetical protein HMPREF1056_00471 [Bacteroides fragilis
           CL07T12C05]
 gi|387776828|gb|EIK38928.1| hypothetical protein HMPREF1055_02448 [Bacteroides fragilis
           CL07T00C01]
 gi|392706047|gb|EIY99170.1| hypothetical protein HMPREF1056_00471 [Bacteroides fragilis
           CL07T12C05]
          Length = 387

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 84/374 (22%), Positives = 153/374 (40%), Gaps = 97/374 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
            +RL +LD+ RG+ +A MI+V++ G  W  +     HA W G    D V PFF+FI+G++
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPG-SWSYVYVPLGHAAWIGLTPTDLVFPFFMFIMGIS 64

Query: 88  IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM----- 138
             ++L++        A  K++ RT+ +   G+ +     F    + L+ G D+       
Sbjct: 65  TYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLY 123

Query: 139 --------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
                   IR+ GV+QR+AL Y   +++ +  K                           
Sbjct: 124 ESVWTFGHIRILGVMQRLALCYGATAIIALIMKH-------------------------- 157

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
               Y+  L    +  +   +IN +  +Y                  N +  +DR VLG 
Sbjct: 158 ---KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGE 199

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            HMY                      KD         +PEGLLS++ SI   +IG   G 
Sbjct: 200 AHMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGK 231

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           +++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   + 
Sbjct: 232 LLMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLAL 289

Query: 371 IYALVDCNIFGPWT 384
           +  ++D   +  W+
Sbjct: 290 LVWIIDVRGYTRWS 303


>gi|374373358|ref|ZP_09631018.1| membrane protein [Niabella soli DSM 19437]
 gi|373234331|gb|EHP54124.1| membrane protein [Niabella soli DSM 19437]
          Length = 375

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 93/359 (25%), Positives = 141/359 (39%), Gaps = 93/359 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           K  R  +LDIFRG+ +  MI+V+  G + +PE+ HA WNG  L D V P FLF VG AIA
Sbjct: 8   KPGRFLALDIFRGMTICFMIIVNTGGPNPFPELRHAQWNGFTLTDLVFPSFLFAVGNAIA 67

Query: 90  LALKRIPDRA--DAVKKVIFRTLKLLFWGILL----------QGGFSHAPDELTYGVDVR 137
            +  +   ++  + + K+I RT  L   G L+          Q      P   T      
Sbjct: 68  FSKSKWDQQSNKEVLTKIIKRTCLLFLIGYLMYWLPFVKIDAQNNIRPFPIGET------ 121

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
             R+ GVLQRIAL Y + +L+  F                          A  +++  + 
Sbjct: 122 --RIFGVLQRIALCYGIGALIIRFAS------------------------ARTIIITSIC 155

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           LL G +        I     DY     V      KL          D  +   +H+Y   
Sbjct: 156 LLLGYW-------FIMMAFGDY----TVNGAAETKL----------DILLFTRDHLY--- 191

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
                         + P R   P    + F      + V+ ++  + G+H    I     
Sbjct: 192 -------------IKDPARAFDPEGFLSTFP-----AIVNVLIGYLAGIH----IRKNPK 229

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
               L +    GF L+  G   +   A P+NK+L+T S+VC+T+G   L+ + I   +D
Sbjct: 230 SYEMLARLAVAGFLLVALGYLWNL--AFPVNKKLWTSSFVCLTTGLDCLIIATILYFID 286


>gi|418515336|ref|ZP_13081517.1| hypothetical protein MOU_00795 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|418520970|ref|ZP_13087016.1| hypothetical protein WS7_08123 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410702946|gb|EKQ61443.1| hypothetical protein WS7_08123 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410708055|gb|EKQ66504.1| hypothetical protein MOU_00795 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
          Length = 388

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 81/287 (28%), Positives = 117/287 (40%), Gaps = 73/287 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +L  G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+  +          +                  +L+ Y ALLY    P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCL-----------------ALLLGYWALLYAFGQP 180

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                    A+L+   NA   +D  + G +H+Y           
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                     RKD        F+PEGLL ++S+ ++ + G   G  +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236


>gi|427400072|ref|ZP_18891310.1| hypothetical protein HMPREF9710_00906 [Massilia timonae CCUG 45783]
 gi|425720812|gb|EKU83727.1| hypothetical protein HMPREF9710_00906 [Massilia timonae CCUG 45783]
          Length = 380

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 104/356 (29%), Positives = 149/356 (41%), Gaps = 93/356 (26%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           RL SLD FRG  +A M+LV++ G DW     +++HA W+G    D + PFFLFI GVA+A
Sbjct: 7   RLTSLDAFRGFTIAAMVLVNNPG-DWGHLHAQLAHAAWHGWTFTDTIFPFFLFIGGVAMA 65

Query: 90  LALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
           L+L R+      +   + K+  R   +   G LL          L    D   +R+ GVL
Sbjct: 66  LSLGRLAAAGAHKPQLLLKLAKRAALIFLIGFLL---------NLIPRFDFDSVRIPGVL 116

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL  LL                                  A  LVVYL     T+  
Sbjct: 117 QRIALCTLL----------------------------------AAPLVVYL-----TWRG 137

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRA-KLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
                 +         +     G+ A  L P  +A  +IDR ++   H++       ++A
Sbjct: 138 QALAVFLLLALYSVLMLLVPVPGIGAGNLEPGRDAGAWIDRALMD-GHLW-------AQA 189

Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
            T D   EG L    P+ C                 S + GV  G +++     L R++Q
Sbjct: 190 KTWDP--EG-LVSTLPAVC-----------------SLLFGVLAGRLLLSA---LPRVEQ 226

Query: 325 --WVTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
             W+ + G A L  G TL     +P+NK L+T S+  + SG A L F A Y L+D 
Sbjct: 227 VVWLMLSGLACLALGSTLD-AVLMPINKSLWTPSFCLLMSGWALLAFGASYWLLDA 281


>gi|53711719|ref|YP_097711.1| hypothetical protein BF0428 [Bacteroides fragilis YCH46]
 gi|52214584|dbj|BAD47177.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
          Length = 387

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 82/373 (21%), Positives = 153/373 (41%), Gaps = 95/373 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL +LD+ RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
            ++L++        A  K++ RT+ +   G+ +     F    + L+ G D+        
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLYE 124

Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                            
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKH--------------------------- 157

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              Y+  L    +  +   +IN +  +Y                  N +  +DR VLG  
Sbjct: 158 --KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGEA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                      KD         +PEGLLS++ SI   +IG   G +
Sbjct: 201 HMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++  K    ++++   +G  L   G  L +    P++K++++ ++  +T G A+   + +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIITCGLASSFLALL 290

Query: 372 YALVDCNIFGPWT 384
             ++D   +  W+
Sbjct: 291 VWIIDVRGYTRWS 303


>gi|371776142|ref|ZP_09482464.1| hypothetical protein AnHS1_01923 [Anaerophaga sp. HS1]
          Length = 369

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/157 (35%), Positives = 85/157 (54%), Gaps = 17/157 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
           KT+R  +LD+ RG+ +ALMI V++ G  W  I     H+ W+GC   D V PFFLF+VGV
Sbjct: 3   KTERYLALDVLRGMTIALMITVNNPG-SWKYIYAPLRHSSWHGCTPTDLVFPFFLFVVGV 61

Query: 87  AIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           ++  +  +  +    ++ K++  RTL +   G+ L    +  P    +  D   +R+ GV
Sbjct: 62  SMFFSFSKYGNTLNKESFKRLGRRTLLIFAIGLFL----NSFPQ---WDRDYSTLRIMGV 114

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
           LQRIAL+Y   SL+ +    V  K   +  FSI  LY
Sbjct: 115 LQRIALAYGFGSLIVL---SVPRKYIPLLGFSILLLY 148



 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 53/99 (53%), Gaps = 3/99 (3%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEGLLS++ ++++ ++G +   V I T   +    Q    G  + I G         
Sbjct: 188 PFDPEGLLSTIPAVVTVLLG-YLTGVFIKTTEKVKIPGQLALYGLIVAIAGRLWGLV--F 244

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           P+NK L+T SYV  T+G AA+ F+ +  ++D   +  WT
Sbjct: 245 PINKPLWTGSYVLYTAGLAAMAFALLIFIIDIKGYKKWT 283


>gi|192359631|ref|YP_001981658.1| hypothetical protein CJA_1162 [Cellvibrio japonicus Ueda107]
 gi|190685796|gb|ACE83474.1| putative membrane protein [Cellvibrio japonicus Ueda107]
          Length = 399

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/142 (37%), Positives = 78/142 (54%), Gaps = 15/142 (10%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLF 82
           + ++  QR  +LD+ RGL +ALMILV+   G W  +     HA W+G    DFV PFFLF
Sbjct: 31  EVYMVKQRFLALDVMRGLTLALMILVN-TPGSWSHVYGPLLHADWHGVTPTDFVFPFFLF 89

Query: 83  IVGVAIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           IVG A+  +++ +     +  ++KV  R L L   GILL            +  DV   R
Sbjct: 90  IVGSAMYFSVRGLAQLSLSQQLRKVGRRVLLLFVMGILLAA--------YPFTADVHDWR 141

Query: 141 LCGVLQRIALSYLLVSLVEIFT 162
           + GVLQRIAL+Y + +L+ ++ 
Sbjct: 142 IMGVLQRIALAYGVAALIVLYA 163


>gi|340786861|ref|YP_004752326.1| protein involved in N-acetyl-D-glucosamine utilization [Collimonas
           fungivorans Ter331]
 gi|340552128|gb|AEK61503.1| protein involved in N-Acetyl-D-glucosamine utilization [Collimonas
           fungivorans Ter331]
          Length = 373

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 97/355 (27%), Positives = 151/355 (42%), Gaps = 96/355 (27%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
           T+RLAS+D  RG  VA M+LV+  G DW  +     H+ W+GC   D V PFFLF+VGV+
Sbjct: 22  TRRLASVDALRGCTVAAMLLVNDPG-DWSHVYAPLEHSAWHGCTPTDLVFPFFLFVVGVS 80

Query: 88  IALALK-RIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM-IRLC 142
            AL ++ R+   A+     +  + R L+++  G+L+        + L + +   + +RL 
Sbjct: 81  TALGIEPRLAQGANPSTLARAALIRALRIVALGLLI--------NLLAWFIMPGVHLRLP 132

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL-YG 201
           GVLQRI L +   +L  I+T+                     W +   +L+ Y  LL  G
Sbjct: 133 GVLQRIGLCFAATALCSIYTRPRT-----------------QWGLIVAILLGYWGLLTLG 175

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
             +  W       DSA +G                   V  ID     ++   H P    
Sbjct: 176 GSLEPWLNLASRSDSALFGHF-----------------VYQID----AVSGRGHDP---- 210

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
                     EG                  LL ++ S+ ++++G+  G  +        R
Sbjct: 211 ----------EG------------------LLGTLPSLATSLLGLCAGRWLREN-----R 237

Query: 322 LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           LK  +    A L+ G TL ++ A+PLNK L+T S+V   +G A L     + LVD
Sbjct: 238 LKPLLFAAVAALVLG-TL-WSLALPLNKNLWTSSFVLWCAGWATLALLLFHWLVD 290


>gi|255307823|ref|ZP_05351994.1| hypothetical protein CdifA_14636 [Clostridium difficile ATCC 43255]
          Length = 483

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 111 SKLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 170

Query: 85  GVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I +++        +   +I     R++ L+ +G  L   +   P       D+  +R
Sbjct: 171 GVTIPISINSKIKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVR 221

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 222 ILGVLQRMGLVYFVTSLVYLLLKKLN 247



 Score = 45.8 bits (107), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 301 FEPDGFLTSIVAISSGMLGCTMGCVLLKENLGEYKKFFKILVMSIILLIGAFI--FNQYF 358

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 359 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTP 399


>gi|255101955|ref|ZP_05330932.1| hypothetical protein CdifQCD-6_14161 [Clostridium difficile
           QCD-63q42]
          Length = 469

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 97  SKLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 156

Query: 85  GVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I +++        +   +I     R++ L+ +G  L   +   P       D+  +R
Sbjct: 157 GVTIPISINSKIKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVR 207

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 208 ILGVLQRMGLVYFVTSLVYLLLKKLN 233



 Score = 45.8 bits (107), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 287 FEPDGFLTSIVAISSGMLGCTMGCVLLKENLGEYKKFFKILVMSIILLIGAFI--FNQYF 344

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 345 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTP 385


>gi|423081105|ref|ZP_17069717.1| hypothetical protein HMPREF1122_00699 [Clostridium difficile
           002-P50-2011]
 gi|423085023|ref|ZP_17073481.1| hypothetical protein HMPREF1123_00624 [Clostridium difficile
           050-P50-2011]
 gi|357550878|gb|EHJ32683.1| hypothetical protein HMPREF1123_00624 [Clostridium difficile
           050-P50-2011]
 gi|357551414|gb|EHJ33204.1| hypothetical protein HMPREF1122_00699 [Clostridium difficile
           002-P50-2011]
          Length = 427

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 55  SKLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 114

Query: 85  GVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I +++        +   +I     R++ L+ +G  L   +   P       D+  +R
Sbjct: 115 GVTIPISINSKIKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVR 165

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SLV +  K + 
Sbjct: 166 ILGVLQRMGLVYFVTSLVYLLLKKLN 191



 Score = 45.8 bits (107), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 245 FEPDGFLTSIVAISSGMLGCTMGCVLLKEDIGEYKKFFKILVMSIILLIGAFI--FNQYF 302

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 303 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTP 343


>gi|301309930|ref|ZP_07215869.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|423340409|ref|ZP_17318148.1| hypothetical protein HMPREF1059_04073 [Parabacteroides distasonis
           CL09T03C24]
 gi|300831504|gb|EFK62135.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|409227844|gb|EKN20740.1| hypothetical protein HMPREF1059_04073 [Parabacteroides distasonis
           CL09T03C24]
          Length = 372

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 99/385 (25%), Positives = 146/385 (37%), Gaps = 108/385 (28%)

Query: 25  QEKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPW 67
            EK   + QRL SLD  RG           L VAL  L  +      AG    ++ H  W
Sbjct: 1   MEKQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEW 56

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
           NG    D + P FLFI G++   +L++   +        KK++ R + L+F G++  G  
Sbjct: 57  NGLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLL 116

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
           S   D L         R   VL RI L ++  +L                   +F  + W
Sbjct: 117 SFEFDHL---------RCASVLARIGLGWMFAAL-------------------LFVRFGW 148

Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
                  VL++    L   +VP         D+   G                 N VGYI
Sbjct: 149 KVRAGITVLILVGYWLAMAFVP-------VPDAGGAGP-----------FTLEGNLVGYI 190

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILST 302
           DR  L                               P   H   F+PEGL S+V +I + 
Sbjct: 191 DRLFL-------------------------------PGRLHETVFDPEGLFSTVPAIATA 219

Query: 303 IIGVHFGHVIIHTKGHLARLKQ---WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           ++G+  G  I   K  L   K+    V  G  LLI GL   ++   P+NK+L+T S+VCV
Sbjct: 220 MLGMFTGEWIKLRKEGLTDRKKVLCLVGAGAVLLIVGLL--WSLVFPINKKLWTSSFVCV 277

Query: 360 TSGAAALVFSAIYALVDCNIFGPWT 384
               +  +F+  + ++D   +  WT
Sbjct: 278 VGAYSVWMFALFFYIIDVLGWRKWT 302


>gi|410099160|ref|ZP_11294132.1| hypothetical protein HMPREF1076_03310 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409219182|gb|EKN12145.1| hypothetical protein HMPREF1076_03310 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 371

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 99/386 (25%), Positives = 158/386 (40%), Gaps = 110/386 (28%)

Query: 23  DQQEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGD--WPEISHAPWN- 68
           ++Q++S    QRL SLD  RG           L VAL  LV     +    ++SHA W  
Sbjct: 2   EKQKQS----QRLLSLDALRGFDMFFIMGGGSLFVALATLVPTPFFESIAAQMSHAKWGA 57

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGGFS 124
           G    D + P FLFI G++   +L++  +R    A   KK+I R + L+  G +  G   
Sbjct: 58  GFTFEDIIFPLFLFIAGISFPFSLEKQRERGMSEAAIYKKIIRRGITLVVLGFVYNG--- 114

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
                    ++    R   VL RI L ++  +L+ + T+ +                   
Sbjct: 115 ------LLNLNFETQRYASVLARIGLGWMFGALIFVNTRTITRV---------------- 152

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           W++AA +L+ Y  LL+    PD         +  + +  N+ C              YID
Sbjct: 153 WIVAA-ILIGYWLLLF-IPAPD------GNGAELFTREGNLAC--------------YID 190

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTI 303
           R +L                               P   H   ++PEG+LS++ +I + +
Sbjct: 191 RLLL-------------------------------PGRLHGGNYDPEGILSTLPAIGTAL 219

Query: 304 IGVHFGHVIIHTKGHLARLKQWVTM---GFALLIFGLT--LHFTNAIPLNKQLYTLSYVC 358
           +G+  G  +   +  L   K+ V M   G  LL+ GL   L F    P+NK L+T S+VC
Sbjct: 220 LGMFTGEFVKLRREGLTETKKVVYMLAVGGCLLVIGLLWGLFF----PINKYLWTSSFVC 275

Query: 359 VTSGAAALVFSAIYALVDCNIFGPWT 384
              G +A++F+  Y +VD      WT
Sbjct: 276 TVGGISAILFAVFYYIVDVKECRGWT 301


>gi|289667572|ref|ZP_06488647.1| hypothetical protein XcampmN_03447, partial [Xanthomonas campestris
           pv. musacearum NCPPB 4381]
          Length = 298

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 53/133 (39%), Positives = 73/133 (54%), Gaps = 5/133 (3%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 22  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 81

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R + ++  G+L+     F   PD       V  +RL GVL
Sbjct: 82  MSFALATNTPPLQFLGRVSKRAVLIVLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 141

Query: 146 QRIALSYLLVSLV 158
           QRI L YL  +L+
Sbjct: 142 QRIGLCYLAAALL 154


>gi|294949094|ref|XP_002786049.1| hypothetical protein Pmar_PMAR023775 [Perkinsus marinus ATCC 50983]
 gi|239900157|gb|EER17845.1| hypothetical protein Pmar_PMAR023775 [Perkinsus marinus ATCC 50983]
          Length = 277

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 75/306 (24%), Positives = 109/306 (35%), Gaps = 99/306 (32%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
           R+ ++D+ RG +   + +VD  G   P I HAPWNG +LAD VMP F+FI          
Sbjct: 31  RIVAVDVMRGRSS--VQIVDVCGKTVPSIGHAPWNGLHLADIVMPGFIFI---------- 78

Query: 94  RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
                                             D LT G+D+   R  G+LQRIA+ Y 
Sbjct: 79  ----------------------------------DTLTLGLDLYTFRAPGILQRIAVCYA 104

Query: 154 LVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
              L+     D+   D   G            L+  C+++            +W   ++ 
Sbjct: 105 AAVLLRKLVSDLSPNDTVKGALKNNSRVLLMGLL--CIII------------NWAIMLLG 150

Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
                           R  L P CN    IDR V G  HMY                   
Sbjct: 151 PQPEGCS---------RGSLTPQCNVASNIDRMVFGPEHMY------------------- 182

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
                +P W     +PEGLLS++ ++ +  +G+  G  I     H   L + V  G  L 
Sbjct: 183 -----SPLW-----DPEGLLSTLPTLATVALGLACGKFIQSRPSH-TELLRLVGCGLLLA 231

Query: 334 IFGLTL 339
           + G+ L
Sbjct: 232 LSGMAL 237


>gi|296452402|ref|ZP_06894103.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
           difficile NAP08]
 gi|296258732|gb|EFH05626.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
           difficile NAP08]
          Length = 481

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 48/150 (32%), Positives = 78/150 (52%), Gaps = 16/150 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S     R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 109 SKFVNSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 168

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 169 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNTVR 219

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQ 170
           + GVLQR+ L Y + SLV +  K +  +  
Sbjct: 220 ILGVLQRMGLVYFVTSLVYLLLKKLNVRSS 249



 Score = 46.2 bits (108), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 299 FEPDGFLTSIVAISSGMLGCMMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 356

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 357 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTP 397


>gi|218131911|ref|ZP_03460715.1| hypothetical protein BACEGG_03534 [Bacteroides eggerthii DSM 20697]
 gi|217986214|gb|EEC52553.1| hypothetical protein BACEGG_03534 [Bacteroides eggerthii DSM 20697]
          Length = 396

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 91/379 (24%), Positives = 161/379 (42%), Gaps = 100/379 (26%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
            +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 8   NKRILALDILRGVTIAGMIMVNNPG-TWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 66

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH------APDE-LTYGVDV-- 136
             ++LK+        A  K++ RT+ +   G+ + G FS       +P E +++G  +  
Sbjct: 67  TYISLKKYNFEFSHAAGIKILKRTILIFLIGMAI-GWFSKFCYYWTSPTEGISFGTQLWE 125

Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                     +L+A  +
Sbjct: 126 SVWTFDRIRILGVMQRLALCYGATAIIALTMKHKNIP----------------YLIATLL 169

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              ++ LL G                  G  +N T           N +  +DR +L   
Sbjct: 170 TGYFILLLCGN-----------------GFAYNDT-----------NILSIVDRTILTPA 201

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                      KD         +PEGLLS++ +I   ++G   G +
Sbjct: 202 HMY----------------------KD------NGIDPEGLLSTIPAIAHVLLGFCVGRM 233

Query: 312 IIH-TKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVTSGAAA 365
           ++   K +  R     +    LL+ G  L F+  +     P+NK++++ ++V  T G A+
Sbjct: 234 MLEGGKANEDRESMLNSHLIKLLLVGTILTFSGFLLSYGCPINKKIWSPTFVLTTCGLAS 293

Query: 366 LVFSAIYALVDCNIFGPWT 384
              + +  ++D   +  W+
Sbjct: 294 SFLALLIWIIDVKGYKKWS 312


>gi|242062184|ref|XP_002452381.1| hypothetical protein SORBIDRAFT_04g024713 [Sorghum bicolor]
 gi|241932212|gb|EES05357.1| hypothetical protein SORBIDRAFT_04g024713 [Sorghum bicolor]
          Length = 96

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 42/101 (41%), Positives = 57/101 (56%), Gaps = 5/101 (4%)

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
           L G+LQRIA++YLL ++ EI+ K   D D   G   + R Y +   +   + + Y  LLY
Sbjct: 1   LMGILQRIAIAYLLAAICEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYTILLY 57

Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
           G YVPDW++ I    S +  K F+V CGVR      CNAVG
Sbjct: 58  GIYVPDWEYKISGPGSTE--KSFSVKCGVRGDTGLACNAVG 96


>gi|196233857|ref|ZP_03132695.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
 gi|196222051|gb|EDY16583.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
          Length = 437

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 103/387 (26%), Positives = 158/387 (40%), Gaps = 71/387 (18%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGL-------AVALMILVDH-------AGGDWPE 61
           I  P+V    E       RL SLD  RG        A A++  +D        AG  W +
Sbjct: 27  IPSPNVPVSPETGQ-PAGRLVSLDALRGFDMFWIVGAGAVIQSLDKMCRTPFTAGLAW-Q 84

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGIL 118
             H  W G +  D + P FLFI+G++I  +L +      +A  + +V  R++ L   G+L
Sbjct: 85  FKHVHWKGLHCYDVIFPLFLFIIGISIVFSLDKALATGGKAQVLTRVARRSVLLFALGVL 144

Query: 119 LQGGFSHA-PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSI 177
             GGF    P+          ++L GVL RIAL YL  +L+  F +  +           
Sbjct: 145 YYGGFMKPWPN----------VQLGGVLPRIALCYLAAALIYTFIRSTRGLLA------- 187

Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC 237
                     AA +L+ Y ALL     PD Q            K        R   + P 
Sbjct: 188 ---------AAAALLIGYWALLAFVPFPDLQLR----------KPVVEEIAERIGSDSPA 228

Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
                +  +V G+   Y      R+ A   D  +  P RK         F  EGLLS++ 
Sbjct: 229 AIAAAVPERVHGLYEEY------RNLANYVDFLYM-PGRK-----AQFYFINEGLLSTIP 276

Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQW-VTMGFALLIFGLTLHFTNAIPLNKQLYTLSY 356
           SI  ++ G   G ++ + K    R   W V  G A ++ G    +   +PL K+++T S+
Sbjct: 277 SIALSLFGAVAGLLLKNQKVLPRRKIAWLVGAGVAFIVLGRV--WAIDLPLIKRIWTSSF 334

Query: 357 VCVTSGAAALVFSAIYALVDCNIFGPW 383
           + V +G +AL+ +  Y +VD   +  W
Sbjct: 335 ILVATGWSALMLALFYYIVDVKQWRKW 361


>gi|423089801|ref|ZP_17078150.1| hypothetical protein HMPREF9945_01335, partial [Clostridium
           difficile 70-100-2010]
 gi|357557565|gb|EHJ39099.1| hypothetical protein HMPREF9945_01335, partial [Clostridium
           difficile 70-100-2010]
          Length = 391

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 77/146 (52%), Gaps = 16/146 (10%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
           S L   R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +
Sbjct: 19  SKLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 78

Query: 85  GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           GV I ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R
Sbjct: 79  GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVR 129

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
           + GVLQR+ L Y + SL  +  K + 
Sbjct: 130 ILGVLQRMGLVYFVTSLAYLLLKKLN 155



 Score = 45.4 bits (106), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 209 FEPDGFLTSIVAISSGMLGCTMGCVLLKENLGEYKKFFKILVMSIILLIGAFI--FNQYF 266

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 267 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTP 307


>gi|289663929|ref|ZP_06485510.1| hypothetical protein XcampvN_12875 [Xanthomonas campestris pv.
           vasculorum NCPPB 702]
          Length = 392

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 53/133 (39%), Positives = 73/133 (54%), Gaps = 5/133 (3%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 22  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 81

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R + ++  G+L+     F   PD       V  +RL GVL
Sbjct: 82  MSFALATNTPPLQFLGRVSKRAVLIVLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 141

Query: 146 QRIALSYLLVSLV 158
           QRI L YL  +L+
Sbjct: 142 QRIGLCYLAAALL 154


>gi|380025576|ref|XP_003696546.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Apis florea]
          Length = 298

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 73/259 (28%), Positives = 122/259 (47%), Gaps = 58/259 (22%)

Query: 139 IRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVV 194
           +R  GVLQ + +SY + +++E IF K         GRF++FR  L  W  WL+ A ++  
Sbjct: 13  LRFPGVLQLLGVSYFVCAILETIFMK----PHSQFGRFAMFRDILESWPQWLIMAGIVTT 68

Query: 195 YLALLY---------GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
           +  + +         G + P  ++          GK  N T G          A GYIDR
Sbjct: 69  HTLITFLLPISNCPKGYFGPGGEYHF-------RGKYMNCTAG----------AAGYIDR 111

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            + G NH Y+H         T++  +   LR D          PEGL++++S+I    +G
Sbjct: 112 LIFG-NHTYNH---------TENFLYGQILRYD----------PEGLMNTISAIFIVYLG 151

Query: 306 VHFGHVIIHTKGHLARLKQWVTMG-FALLIFGLTLHFTNA---IPLNKQLYTLSYVCVTS 361
           VH G +++      +R+ +W     F  ++ G+  +F      IP++K++ TLSYV + S
Sbjct: 152 VHAGKILLLYYQCNSRVIRWFLWTIFTGIVAGILCNFETQGGIIPISKRMMTLSYVLICS 211

Query: 362 GAAALVFSAIYALVDCNIF 380
             A L+++ +Y L+D   F
Sbjct: 212 SFAFLLYALLYVLIDYKQF 230


>gi|150007980|ref|YP_001302723.1| hypothetical protein BDI_1342 [Parabacteroides distasonis ATCC
           8503]
 gi|256840846|ref|ZP_05546354.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|423331513|ref|ZP_17309297.1| hypothetical protein HMPREF1075_01310 [Parabacteroides distasonis
           CL03T12C09]
 gi|149936404|gb|ABR43101.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
 gi|256738118|gb|EEU51444.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|409230083|gb|EKN22951.1| hypothetical protein HMPREF1075_01310 [Parabacteroides distasonis
           CL03T12C09]
          Length = 372

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 94/383 (24%), Positives = 146/383 (38%), Gaps = 104/383 (27%)

Query: 25  QEKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPW 67
            EK   + QRL SLD  RG           L VAL  L  +      AG    ++ H  W
Sbjct: 1   MEKQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEW 56

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
           NG    D + P FLFI G++   +L++   +        KK++ R + L+F G++  G  
Sbjct: 57  NGLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLL 116

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
           S   D L         R   VL RI L ++  +L                   +F  + W
Sbjct: 117 SFEFDHL---------RCASVLARIGLGWMFAAL-------------------LFVRFGW 148

Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
                  VL++    L   +VP         D+   G                 N VGYI
Sbjct: 149 KVRAGITVLILVGYWLAMAFVP-------VPDAGGAG-----------PFTLEGNLVGYI 190

Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILST 302
           DR  L                               P   H   F+PEGL S+V +I + 
Sbjct: 191 DRLFL-------------------------------PGRLHETVFDPEGLFSTVPAIATA 219

Query: 303 IIGVHFGHVI-IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
           ++G+  G  I +  +G   R K    +G   ++  + L ++   P+NK+L+T S+VCV  
Sbjct: 220 MLGMFTGEWIKLRKEGLTDRNKVLCLVGAGAVLLIVGLLWSLVFPINKKLWTSSFVCVVG 279

Query: 362 GAAALVFSAIYALVDCNIFGPWT 384
             +  +F+  + ++D   +  WT
Sbjct: 280 AYSVWMFALFFYIIDVLGWRKWT 302


>gi|126173329|ref|YP_001049478.1| hypothetical protein Sbal_1087 [Shewanella baltica OS155]
 gi|386340088|ref|YP_006036454.1| hypothetical protein [Shewanella baltica OS117]
 gi|125996534|gb|ABN60609.1| conserved hypothetical protein [Shewanella baltica OS155]
 gi|334862489|gb|AEH12960.1| Protein of unknown function DUF2261, transmembrane [Shewanella
           baltica OS117]
          Length = 387

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 101/373 (27%), Positives = 151/373 (40%), Gaps = 109/373 (29%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAG------GDWPEISHAPWNGCNLADFV 76
           RL SLD  RG  +            L+IL   AG      GD  ++ H+ W+G +  D +
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWAGWQWGD-EQMHHSQWHGFHFYDLI 76

Query: 77  MPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGF-SHAPDEL 130
            P F+F+ GVA+ L+ KR+      +R    +  I R   LL  GIL   G+ + AP   
Sbjct: 77  FPLFIFLSGVALGLSPKRLDKLPMKERLPVYRHGIKRLFLLLLLGILYNHGWGTGAP--- 133

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
               D   IR   VL RIA ++   +                       L  WH  +   
Sbjct: 134 ---ADPEKIRYASVLGRIAFAWFFAA-----------------------LLVWHTSLRTQ 167

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKV 247
           ++V  L +L G                 YG +        G    L+P  +   Y+D  +
Sbjct: 168 IIVA-LGILLG-----------------YGAMQLWLPFPGGQAGVLSPTESINAYVDSIL 209

Query: 248 L-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           L G+++    P                              +PEGLLS++ +I++ + GV
Sbjct: 210 LPGVSYQGRTP------------------------------DPEGLLSTIPAIVNALAGV 239

Query: 307 HFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
             GH I+  H KG  A++      G   L FG  L     IP+NK+L+T S+V VTSG +
Sbjct: 240 FVGHFIVKSHPKGEWAKVGLLAAAGCVCLTFGWLLDLV--IPVNKELWTSSFVLVTSGWS 297

Query: 365 ALVFSAIYALVDC 377
            ++ +  YALVD 
Sbjct: 298 MILLALFYALVDV 310


>gi|296877751|ref|ZP_06901777.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
           difficile NAP07]
 gi|296431202|gb|EFH17023.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
           difficile NAP07]
          Length = 370

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 77/146 (52%), Gaps = 16/146 (10%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
             R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +GV I
Sbjct: 2   NSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTI 61

Query: 89  ALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
            ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R+ GV
Sbjct: 62  PISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNTVRILGV 112

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQ 170
           LQR+ L Y + SLV +  K +  +  
Sbjct: 113 LQRMGLVYFVTSLVYLLLKKLNVRSS 138



 Score = 45.8 bits (107), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 188 FEPDGFLTSIVAISSGMLGCMMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 245

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 246 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTP 286


>gi|288929890|ref|ZP_06423732.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
           F0108]
 gi|288328709|gb|EFC67298.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
           F0108]
          Length = 399

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 96/392 (24%), Positives = 163/392 (41%), Gaps = 109/392 (27%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFL 81
            EK+   T R+ S+DI RGL +A MI V++ G     +  + HA WNG    D V PFF+
Sbjct: 1   MEKNK-TTSRILSIDILRGLTIAGMITVNNPGSWSYMYAPLEHAEWNGLTPTDLVFPFFM 59

Query: 82  FIVGVAIALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFS------HAPDE--- 129
            ++G+ I +A+++     +RA   K  I + + L++   L  G F+      ++P E   
Sbjct: 60  CVMGMCIYIAMRKFDFACNRATVYK--IVKRMVLIYLVGLAIGWFAKFCYRWNSPQEGAD 117

Query: 130 ----LTYGV-DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
               L Y V     IRL GVL R+A+ Y + +L+ I    V+ K                
Sbjct: 118 FFSQLWYMVWSFDKIRLTGVLARLAICYGITALLAI---TVRHKHLP------------- 161

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           +++   +L  ++ L+ G                  G  ++ T           N +   D
Sbjct: 162 YIIVGLLLTYFVILMAGN-----------------GFAYDET-----------NILSIAD 193

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R VL   HMYH                                +PEGLLS++ SI  T++
Sbjct: 194 RAVLTDAHMYHDNG----------------------------IDPEGLLSTLPSIAHTLL 225

Query: 305 GVHFGHVII------------HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
           G   G ++              T   L ++     +G +LL  G  L +    P+NK+++
Sbjct: 226 GFIIGSLLFRKADVGEQQLDARTNITLTKVVPLFVVGTSLLFAGYLLSY--GCPINKKVW 283

Query: 353 TLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           + ++V VT G A+++ + +  ++D      W+
Sbjct: 284 SPTFVLVTCGLASMLLALLTWIIDVKGKKSWS 315


>gi|423282312|ref|ZP_17261197.1| hypothetical protein HMPREF1204_00735 [Bacteroides fragilis HMW
           615]
 gi|404581880|gb|EKA86575.1| hypothetical protein HMPREF1204_00735 [Bacteroides fragilis HMW
           615]
          Length = 387

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 85/376 (22%), Positives = 155/376 (41%), Gaps = 101/376 (26%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL +LD+ RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ-----------------GGFSHAPDE 129
            ++L++        A  K++ RT+ +   G+ +                    FS   + 
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLSSEDISFFSRLYES 125

Query: 130 L-TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
           + T+G     IR+ GV+QR+AL Y   +++ +  K                       + 
Sbjct: 126 IWTFG----HIRILGVMQRLALCYGATAIIALIMKHKYIP-----------------YLI 164

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
           A +L+ Y  +L            IN +  +Y                  N +  +DR VL
Sbjct: 165 AILLIGYFIIL------------INGNGFEYNS---------------SNILSIVDRTVL 197

Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
           G  HMY                      KD         +PEGLLS++ SI   +IG   
Sbjct: 198 GEAHMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCV 229

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G +++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   
Sbjct: 230 GKLLMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFL 287

Query: 369 SAIYALVDCNIFGPWT 384
           + +  ++D   +  W+
Sbjct: 288 ALLVWIIDVRGYTRWS 303


>gi|146292182|ref|YP_001182606.1| hypothetical protein Sputcn32_1079 [Shewanella putrefaciens CN-32]
 gi|145563872|gb|ABP74807.1| conserved hypothetical protein [Shewanella putrefaciens CN-32]
          Length = 384

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 92/364 (25%), Positives = 149/364 (40%), Gaps = 94/364 (25%)

Query: 34  RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +            L+IL   AG  W   ++ H+ W+G +  D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDLIFPLF 77

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           +F+ GVA+ L+ KR+     + +  ++R  +K LF  +LL   ++H         D   I
Sbjct: 78  IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R   VL RIA ++   +                       L  WH  +   V+V  L +L
Sbjct: 137 RYASVLGRIAFAWFFAA-----------------------LLVWHTSLRTQVIVA-LGIL 172

Query: 200 YGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GINHMYH 255
            G                 YG +        G     +P  +   Y+D  +L G+++   
Sbjct: 173 LG-----------------YGAMQLWLPFPSGQAGVFSPTQSINAYVDSILLPGVSYQGR 215

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-- 313
            P                              +P+GLLS++ ++++ + GV  G+ I+  
Sbjct: 216 TP------------------------------DPQGLLSTIPAVVNALAGVFVGYFIVKS 245

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
           H +G   ++    T G A L  G  L     IP+NK+L+T S+V VTSG + ++ +  YA
Sbjct: 246 HPQGEWVKVGLLATAGGAWLALGWLLD--GVIPVNKELWTSSFVLVTSGWSMILLALFYA 303

Query: 374 LVDC 377
           LVD 
Sbjct: 304 LVDV 307


>gi|424670170|ref|ZP_18107195.1| hypothetical protein A1OC_03788 [Stenotrophomonas maltophilia
           Ab55555]
 gi|401070628|gb|EJP79142.1| hypothetical protein A1OC_03788 [Stenotrophomonas maltophilia
           Ab55555]
          Length = 355

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 16/139 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VGV++
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65

Query: 89  ALALK-RIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           A ++  R  D   R    + V+ R L++L  G LL          + + +D    R+ GV
Sbjct: 66  AFSVAPRAQDAAARPALARGVLERALRILMAGALLH-------LLIWWALDTHHFRIWGV 118

Query: 145 LQRIALSYLLVSLVEIFTK 163
           LQRIA+   LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137


>gi|375356811|ref|YP_005109583.1| putative transmembrane protein [Bacteroides fragilis 638R]
 gi|383116724|ref|ZP_09937472.1| hypothetical protein BSHG_1191 [Bacteroides sp. 3_2_5]
 gi|251947990|gb|EES88272.1| hypothetical protein BSHG_1191 [Bacteroides sp. 3_2_5]
 gi|301161492|emb|CBW21032.1| putative transmembrane protein [Bacteroides fragilis 638R]
          Length = 387

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 82/373 (21%), Positives = 152/373 (40%), Gaps = 95/373 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +RL +LD+ RG+ +A MI+V++ G     +  + HA W G    D V PFF+FI+G++ 
Sbjct: 6   NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
            ++L++        A  K++ RT+ +   G+ +     F    + L+ G D+        
Sbjct: 66  YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLYE 124

Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                            
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKH--------------------------- 157

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              Y+  L    +  +   +IN +  +Y                  N +  +D  VLG  
Sbjct: 158 --KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDHTVLGEA 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                      KD         +PEGLLS++ SI   +IG   G +
Sbjct: 201 HMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           ++  K    ++++   +G  L   G  L +    P++K++++ ++  VT G A+   + +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLALL 290

Query: 372 YALVDCNIFGPWT 384
             ++D   +  W+
Sbjct: 291 VWIIDVRGYTRWS 303


>gi|326801867|ref|YP_004319686.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326552631|gb|ADZ81016.1| hypothetical protein Sph21_4499 [Sphingobacterium sp. 21]
          Length = 376

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 83/335 (24%), Positives = 136/335 (40%), Gaps = 79/335 (23%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGD----WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           R  +LD+FRG+ +  MI+V+  G      WP ++HA W+G    D V P FLF VG A++
Sbjct: 7   RFTALDVFRGMTICFMIIVNSPGSGATPYWP-LNHATWHGFTPTDLVFPSFLFAVGNALS 65

Query: 90  LALKRIPD-RADAVKKVIFRTLKLLF-WGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
            + ++     +  V   IF+   L+F  G L+     F            +   R+ GVL
Sbjct: 66  FSERKFQYLSSKQVLLTIFKRAALIFLLGFLMYWFPFFKITEQHEIISFPLHETRVFGVL 125

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRIAL YL  +L   + +               R Y   WL  A +L+ ++ LL      
Sbjct: 126 QRIALCYLFTALAVYYVR---------------RKYL-VWLAIALLLIYWVILL------ 163

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                 G  A  +   NA+  +D  +LG +H+YH          
Sbjct: 164 --------------------IFGTDAPYSLEGNAIFKLDLWLLGESHLYHSHG------- 196

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
                                F+PEGLLS++ +I + I G   G  +    G +  L + 
Sbjct: 197 -------------------IIFDPEGLLSTIPAITNAIAGYLVGKYLQEKGGTVQSLGKL 237

Query: 326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           + +G   ++  L   +    P+NK++++ S+  +T
Sbjct: 238 LIIGAIGVLIALV--WNQVFPINKKIWSSSFALLT 270


>gi|317474486|ref|ZP_07933760.1| hypothetical protein HMPREF1016_00739 [Bacteroides eggerthii
           1_2_48FAA]
 gi|316909167|gb|EFV30847.1| hypothetical protein HMPREF1016_00739 [Bacteroides eggerthii
           1_2_48FAA]
          Length = 396

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 90/379 (23%), Positives = 160/379 (42%), Gaps = 100/379 (26%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
            +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+FI+G++
Sbjct: 8   NKRILALDILRGVTIAGMIMVNNPG-TWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 66

Query: 88  IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH------APDE-LTYGVDV-- 136
             ++LK+        A  K++ RT+ +   G+ + G FS       +P E +++G  +  
Sbjct: 67  TYISLKKYNFEFSHAAGIKILKRTILIFLIGMAI-GWFSKFCYYWTSPTEGISFGTQLWE 125

Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
                  IR+ GV+QR+AL Y   +++ +  K                     +L+A  +
Sbjct: 126 SVWTFDRIRILGVMQRLALCYGATAIIALTMKHKNIP----------------YLIATLL 169

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
              ++ LL G                  G  +N T           N +  +DR +L   
Sbjct: 170 TGYFILLLCGN-----------------GFAYNDT-----------NILSIVDRTILTPA 201

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           HMY                      KD         +PEGLLS++ +I   ++G   G +
Sbjct: 202 HMY----------------------KD------NGIDPEGLLSTIPAIAHVLLGFCVGRM 233

Query: 312 IIH-TKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVCVTSGAAA 365
           ++   K +  R     +    L + G  L F+  +     P+NK++++ ++V  T G A+
Sbjct: 234 MLEGGKANEDRESMLNSHLIKLFLVGTILTFSGFLLSYGCPINKKIWSPTFVLTTCGLAS 293

Query: 366 LVFSAIYALVDCNIFGPWT 384
              + +  ++D   +  W+
Sbjct: 294 SFLALLIWIIDVKGYKKWS 312


>gi|319785830|ref|YP_004145305.1| transmembrane protein [Pseudoxanthomonas suwonensis 11-1]
 gi|317464342|gb|ADV26074.1| putative transmembrane protein [Pseudoxanthomonas suwonensis 11-1]
          Length = 357

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 58/141 (41%), Positives = 78/141 (55%), Gaps = 16/141 (11%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGV 86
           + +RLAS+D  RGL VA M+LV++ G DW  +     HA W+GC  AD V PFFL IVGV
Sbjct: 5   RFRRLASVDALRGLTVAAMLLVNNPG-DWGHVYAPLLHADWHGCTPADLVFPFFLAIVGV 63

Query: 87  AIALA-LKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           +IAL  + RI    DRA  ++ V  R L++L  G+LL        D+  Y       R  
Sbjct: 64  SIALGVVPRIEAGADRAGLMRTVAVRPLRILAVGLLLHLLAWWWLDQPHY-------RPW 116

Query: 143 GVLQRIALSYLLVSLVEIFTK 163
           GVLQRI L +L      ++ +
Sbjct: 117 GVLQRIGLCFLGAGAAALYLR 137


>gi|456734835|gb|EMF59605.1| N-acetylglucosamine transporter NagX [Stenotrophomonas maltophilia
           EPM1]
          Length = 355

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 16/139 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VGV++
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65

Query: 89  ALALK-RIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           A ++  R  D   R    + V+ R L++L  G LL          + + +D    R+ GV
Sbjct: 66  AFSVAPRAQDAAARPALARGVLERALRILVAGALLH-------LLIWWALDTHHFRIWGV 118

Query: 145 LQRIALSYLLVSLVEIFTK 163
           LQRIA+   LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137


>gi|399088486|ref|ZP_10753550.1| hypothetical protein PMI01_04686 [Caulobacter sp. AP07]
 gi|398030770|gb|EJL24173.1| hypothetical protein PMI01_04686 [Caulobacter sp. AP07]
          Length = 398

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 104/352 (29%), Positives = 149/352 (42%), Gaps = 66/352 (18%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLF 82
           + +     R+ +LD+ RGLAVA MILV   G     +  + HA W G  LAD V P FLF
Sbjct: 2   DSAKAGGGRIVALDVLRGLAVAGMILVTSPGAWAHAYAPLKHAAWQGWTLADLVFPTFLF 61

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
            VGVAI L++ R+     A   +  +  +     ILL    +  P       D+  +R+ 
Sbjct: 62  CVGVAIGLSVPRLRIGEGASAALWIKVARRTALLILLGLVLNALPR-----FDLAHLRIP 116

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL-YCWHWLMAACVLVVYLALLYG 201
           GVLQRI L Y L S + I     +   Q        RL      L    +LV Y ALL  
Sbjct: 117 GVLQRIGLCYALASAICILPARAEADGQ-------LRLNVGGVVLAVVGLLVGYWALLTF 169

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
           T VP   F +   DS      F                   IDR V  I H++    W  
Sbjct: 170 TPVPG--FGVDRWDSQGALPAF-------------------IDRAVFTIPHLW---PWGT 205

Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT----KG 317
           ++                       ++PEGLLS+  + ++ ++G      +  T    +G
Sbjct: 206 TEGVGVT------------------YDPEGLLSTFPATVNVLLGAVAAAFLARTGDGRQG 247

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGA--AALV 367
               L   + +G AL++ GL L     +P+NK+L+T S+   +SGA  AALV
Sbjct: 248 RGRVLAALLALGAALIVAGLALD--PIVPVNKRLWTPSFALFSSGASLAALV 297


>gi|190575857|ref|YP_001973702.1| transmembrane protein [Stenotrophomonas maltophilia K279a]
 gi|190013779|emb|CAQ47415.1| putative transmembrane protein [Stenotrophomonas maltophilia K279a]
          Length = 355

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 16/139 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VGV++
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65

Query: 89  ALALK-RIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           A ++  R  D   R    + V+ R L++L  G LL          + + +D    R+ GV
Sbjct: 66  AFSVAPRAQDAAARPALARGVLERALRILVAGALLH-------LLIWWALDTHHFRIWGV 118

Query: 145 LQRIALSYLLVSLVEIFTK 163
           LQRIA+   LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137


>gi|84625357|ref|YP_452729.1| hypothetical protein XOO_3700 [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188575197|ref|YP_001912126.1| hypothetical protein PXO_04319 [Xanthomonas oryzae pv. oryzae
           PXO99A]
 gi|84369297|dbj|BAE70455.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188519649|gb|ACD57594.1| membrane protein, putative [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 388

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/133 (39%), Positives = 71/133 (53%), Gaps = 5/133 (3%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +   G+L+     F   PD       V  +RL GVL
Sbjct: 78  MSFALATNMPHLQFLGRVSKRAALIALCGVLMYWFPFFHLQPDGGWAFTTVDQVRLTGVL 137

Query: 146 QRIALSYLLVSLV 158
           QRI L YL  +L+
Sbjct: 138 QRIGLCYLAAALL 150


>gi|126700401|ref|YP_001089298.1| membrane protein [Clostridium difficile 630]
 gi|115251838|emb|CAJ69673.1| putative membrane protein [Clostridium difficile 630]
          Length = 370

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 47/142 (33%), Positives = 76/142 (53%), Gaps = 16/142 (11%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
             R+ S+DI RGL++ALMI+ ++ G     +P++ HA W+G  LADF  PFF+  +GV I
Sbjct: 2   NSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTI 61

Query: 89  ALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
            ++    LK        +  +  R++ L+ +G  L   +   P       D+  +R+ GV
Sbjct: 62  PISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVRILGV 112

Query: 145 LQRIALSYLLVSLVEIFTKDVQ 166
           LQR+ L Y + SLV +  K + 
Sbjct: 113 LQRMGLVYFVTSLVYLLLKKLN 134



 Score = 45.1 bits (105), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 188 FEPDGFLTSIVAISSGMLGCTMGCVLLKEDIGEYKKFFKILVMSIILLIGAFI--FNQYF 245

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 246 PFNKRLWSSSFVLLMAGSYGILLSIFYFICDIKNKSKIFTP 286


>gi|21241481|ref|NP_641063.1| hypothetical protein XAC0710 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21106824|gb|AAM35599.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 388

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 80/287 (27%), Positives = 116/287 (40%), Gaps = 73/287 (25%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           K +R  SLD+FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
           ++ AL         + +V  R   +L  G+L+     F   PD       V  +RL  VL
Sbjct: 78  MSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTCVL 137

Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
           QRI L YL  +L+  +          +                  +L+ Y ALLY    P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCL-----------------ALLLGYWALLYAFGQP 180

Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
                                    A+L+   NA   +D  + G +H+Y           
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205

Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                     RKD        F+PEGLL ++S+ ++ + G   G  +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236


>gi|91794054|ref|YP_563705.1| hypothetical protein Sden_2703 [Shewanella denitrificans OS217]
 gi|91716056|gb|ABE55982.1| conserved hypothetical protein [Shewanella denitrificans OS217]
          Length = 400

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 150/376 (39%), Gaps = 86/376 (22%)

Query: 25  QEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDWP--EISHAPWNGCN 71
           Q  + L   RL SLD  RG           L  AL  L   AG +    ++ H+ W+G  
Sbjct: 25  QTSTSLNKPRLKSLDALRGFDMFWIIGGEGLFAALFTLTGWAGWNIASRQMQHSQWHGFT 84

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDEL 130
           L D + P F+F+ GVA+ L+ KR+  +A AV   +++   K L   I L   ++H     
Sbjct: 85  LYDLIFPLFIFLSGVALGLSPKRLDQQAFAVALPLYQHACKRLILLIALGILYNHGWGT- 143

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
               D+  IR   VL RI  ++   +++   T+      Q +   SI  LY         
Sbjct: 144 GIPADLDKIRYSSVLARIGFAWFFAAMLVWHTR---LSIQVIVSVSIIGLYT-------- 192

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
                LA LY                        V  G   +     +   Y+D  +   
Sbjct: 193 -----LAQLY----------------------LPVPGGQAGQFTLDASINTYVDGLL--- 222

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
                     R     QD P                 +PEG+LS+V ++++ ++GV  G 
Sbjct: 223 ----------RPGIAYQDRP----------------LDPEGILSTVPAVINAMVGVFAGQ 256

Query: 311 VII--HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
            II  H++G  A++   +  G  LL+    L     IP+NK L+T S+V VTSG + L  
Sbjct: 257 FIIRAHSRGDWAKVGVLIACGVLLLVLAWLLE--PMIPVNKDLWTTSFVLVTSGWSLLFL 314

Query: 369 SAIYALVDCNIFGPWT 384
           S  Y ++D   +  WT
Sbjct: 315 SLFYVIIDVLKWQKWT 330


>gi|410663435|ref|YP_006915806.1| hypothetical protein M5M_04345 [Simiduia agarivorans SA1 = DSM
           21679]
 gi|409025792|gb|AFU98076.1| hypothetical protein M5M_04345 [Simiduia agarivorans SA1 = DSM
           21679]
          Length = 356

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 88/338 (26%), Positives = 140/338 (41%), Gaps = 89/338 (26%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
           QR  +LD  RGL +ALMI+V+  G  W  +     HA W G    D V PFFLFIVG ++
Sbjct: 3   QRYIALDALRGLTLALMIVVNTPG-SWAHVYGPLLHADWMGWTFTDLVFPFFLFIVGASL 61

Query: 89  ALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
             + K +    RAD ++K+I R+L L+              +   + V +  +RL GVLQ
Sbjct: 62  YFSQKGMASLTRADQLRKIIRRSLLLIV--------LGVLLEYYPFIVSLHELRLPGVLQ 113

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
           RI L++ + +L+ +F              + F L C        +L  Y  LL+ ++ P 
Sbjct: 114 RIGLAFGVAALLVVFVP------------ARFNLVC-----IGAILASYQVLLWLSHEP- 155

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                    S ++                  N V  +D  V G +H++            
Sbjct: 156 --------YSLEH------------------NLVRQVDLWVFGESHLWAGKG-------- 181

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
                               F+PEGLLS+  S+ + + G      +    G   R  Q+ 
Sbjct: 182 ------------------LAFDPEGLLSTWPSVATVLAGFETARWL--RSGRQLRYLQFG 221

Query: 327 TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
             G   ++  +T  +  A+P+NK L+T  +V +T+G A
Sbjct: 222 LWGAGGVVLLMT--YALALPINKSLWTPGFVLLTAGLA 257


>gi|224025514|ref|ZP_03643880.1| hypothetical protein BACCOPRO_02254, partial [Bacteroides
           coprophilus DSM 18228]
 gi|224018750|gb|EEF76748.1| hypothetical protein BACCOPRO_02254 [Bacteroides coprophilus DSM
           18228]
          Length = 298

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 48/136 (35%), Positives = 80/136 (58%), Gaps = 11/136 (8%)

Query: 33  QRLASLDIFRGLAVALMILVD-HAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
           +RL S+DIFRG+ +  MILV+  AGG +  + H P  G  +AD V P F+FI+G ++ L+
Sbjct: 17  KRLLSIDIFRGITIFFMILVNTQAGGSFDFLIHIPGYGWRIADLVYPSFIFIMGASMYLS 76

Query: 92  LKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
           +++  +    D  K +  RT+ +   GI+    F+  P    +  ++  +R+ GVLQRIA
Sbjct: 77  MRKYVEAPPTDLYKHIFRRTVLIFLMGII----FNWIP----FDQNLLDVRILGVLQRIA 128

Query: 150 LSYLLVSLVEIFTKDV 165
           + YL+ SL+ I  + +
Sbjct: 129 IVYLICSLLVIKVRSI 144



 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 56/95 (58%), Gaps = 6/95 (6%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL-ARLKQWVTMGFALLIFGLTLHFTNAIP 346
           +PEGLLSS+ S+++ IIG     ++  T+  L  RL +  T+  +++     LH+T  +P
Sbjct: 188 DPEGLLSSIPSVVNAIIGYVSARIL--TENELKERLIKMSTIAISMIALAYILHWT-ILP 244

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
           + K  ++ S+  +TSG + L +  ++ +  C+++G
Sbjct: 245 IYKTYWSSSFGLLTSGISLLAWIVVHLI--CDVWG 277


>gi|194367192|ref|YP_002029802.1| hypothetical protein Smal_3420 [Stenotrophomonas maltophilia
           R551-3]
 gi|194349996|gb|ACF53119.1| conserved hypothetical protein [Stenotrophomonas maltophilia
           R551-3]
          Length = 355

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 16/139 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VGV++
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65

Query: 89  ALALK-RIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           A ++  R  D   R    + V+ R L++L  G LL          + + +D    R+ GV
Sbjct: 66  AFSVAPRALDVALRPALARGVLERALRILVAGALLH-------LLIWWALDTHHFRIWGV 118

Query: 145 LQRIALSYLLVSLVEIFTK 163
           LQRIA+   LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137


>gi|116748970|ref|YP_845657.1| hypothetical protein Sfum_1534 [Syntrophobacter fumaroxidans MPOB]
 gi|116698034|gb|ABK17222.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB]
          Length = 374

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 87/355 (24%), Positives = 143/355 (40%), Gaps = 76/355 (21%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
            S     RLASLD FRG  +A MILV+  G     + ++ HA WNG   AD + P FLF+
Sbjct: 2   NSPTTNTRLASLDAFRGAVIAGMILVNSPGRWVYTYSQLKHAQWNGWTFADTIFPAFLFV 61

Query: 84  VGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           VGV++  +  R  +  +   +++ +  +      LL    +   D   +G +   +R+ G
Sbjct: 62  VGVSMVFSFSRRRECEEPAWRLVLQVFRRTSLIFLLGLLLNVMLD--FHGSN---LRIPG 116

Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
           VLQRIA  Y + SL+ + T               FR                     G  
Sbjct: 117 VLQRIAACYFVASLIVLGTG--------------FR---------------------GQA 141

Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
           +       +     ++  V  +  GV   L P  N   Y+D  +L   HM+ H       
Sbjct: 142 IWALGLLALYWLLMEFYPVPGIGAGV---LEPGRNFASYVDSLLLD-GHMWSH------- 190

Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
                               +  ++PEG++S++ ++ ST+ GV  GH +  T    A+ K
Sbjct: 191 --------------------YRTWDPEGIISTIPAVSSTLFGVLTGHFLRST--FSAKAK 228

Query: 324 QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
               +G    +  L    +  +P+NK ++T SY    +G +    +  Y L+D  
Sbjct: 229 TAGMLGAGAALLALGRFCSIWLPINKNIWTSSYSIFMTGLSLAGLAVFYWLIDVK 283


>gi|392548092|ref|ZP_10295229.1| hypothetical protein PrubA2_17028 [Pseudoalteromonas rubra ATCC
           29570]
          Length = 373

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 94/369 (25%), Positives = 144/369 (39%), Gaps = 100/369 (27%)

Query: 31  KTQRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS--HAPWNGCNLADFVM 77
             +RLASLD  RG+ +           AL +L    G    E    H+ W+G    D + 
Sbjct: 4   NKKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQGWKIFEAQTLHSAWHGFTFYDLIF 63

Query: 78  PFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
           P F+F+ GVA+ L  KRI      +R     K I R   L  +G+L   G+         
Sbjct: 64  PLFIFLSGVAMGLRPKRIDHLPMAERKPIYIKAIKRLGLLCLFGVLYNHGWGTGIPA--- 120

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
             D   IR   VL RIA+++   +++             VG                 +L
Sbjct: 121 --DFGEIRYASVLGRIAIAWFFCAMLVWHCSLKTTALTGVG-----------------IL 161

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL-GIN 251
           + Y  LL   ++P                   V  G   +L P  +   ++D+ +L GI 
Sbjct: 162 LAYWLLL--CFIP-------------------VPGGSAGELTPAGSWNAWVDQALLPGIT 200

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           +                               + P +PEG+LSS  +I++ I GV  G +
Sbjct: 201 YQ------------------------------NRPVDPEGILSSFPAIVNAIAGVFAGQL 230

Query: 312 IIHTKGHLARLKQWVTMG--FALLIFGLTLHFTNAI--PLNKQLYTLSYVCVTSGAAALV 367
           I  +     +L QW   G  FA  I  L L +   +  P+NK+L+T S+V VT G +A+ 
Sbjct: 231 IAQSD----KLGQWQVAGRLFAAGIVSLALGWLWDLQFPVNKELWTSSFVLVTVGWSAIF 286

Query: 368 FSAIYALVD 376
            +  + LVD
Sbjct: 287 LAVFFTLVD 295


>gi|167764222|ref|ZP_02436349.1| hypothetical protein BACSTE_02607 [Bacteroides stercoris ATCC
           43183]
 gi|167698338|gb|EDS14917.1| hypothetical protein BACSTE_02607 [Bacteroides stercoris ATCC
           43183]
          Length = 396

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 92/386 (23%), Positives = 158/386 (40%), Gaps = 103/386 (26%)

Query: 27  KSHLKT-QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFL 81
            S +KT +R+ +LDI RG+ +A MI+V++ G  W  I     HA WNG    D V PFF+
Sbjct: 2   SSTVKTNKRILALDILRGVTIAGMIMVNNPG-TWAHIYAPLRHAEWNGLTPTDLVFPFFM 60

Query: 82  FIVGVAIALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFSH------APDE-LT 131
           FI+G++  ++LK+      RA  + K++ RT+ +   G+ + G FS       +P E + 
Sbjct: 61  FIMGISTYISLKKYNFEFSRAAGM-KILKRTILIFLIGMGI-GWFSRFCYYWTSPTEGIG 118

Query: 132 YGVDV-------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
           +G  +         IR+ GV+QR+AL Y   +++ +  K                     
Sbjct: 119 FGAQLWEAAWTFDRIRILGVMQRLALCYGATAIIALTMKHRNIP---------------- 162

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
             + A +L  Y  LL                           CG     N   N +  +D
Sbjct: 163 -YLIATLLTGYFILL--------------------------VCGNGFAYN-DTNILSVVD 194

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +L   HMY                                 +PEGLLS++ +I   ++
Sbjct: 195 RAILTPAHMYKDNG----------------------------IDPEGLLSTIPAIAHVLL 226

Query: 305 GVHFGHVIIH-TKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTLSYVC 358
           G   G +++   K   +R     +    L + G  L F   +     P+NK++++ ++V 
Sbjct: 227 GFCVGRMMLEGGKADESRESMLNSHLIKLFLAGTILTFAGFLLSYGCPINKKIWSPTFVL 286

Query: 359 VTSGAAALVFSAIYALVDCNIFGPWT 384
            T G A+   + +  ++D   +  W+
Sbjct: 287 ATCGLASSFLALLIWIIDVKGYKKWS 312


>gi|260910302|ref|ZP_05916976.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
           F0295]
 gi|260635554|gb|EEX53570.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
           F0295]
          Length = 399

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 96/390 (24%), Positives = 161/390 (41%), Gaps = 105/390 (26%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFL 81
            EK+   T R+ S+DI RGL +A MI V++ G     +  + HA WNG    D V PFF+
Sbjct: 1   MEKNK-TTSRILSIDILRGLTIAGMITVNNPGSWSYMYAPLEHAEWNGLTPTDLVFPFFM 59

Query: 82  FIVGVAIALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFS------HAPDE--- 129
            ++G+ I +A+ +     +RA   K  I + + L++   L  G F+      + P E   
Sbjct: 60  CVMGMCIYIAMSKFNFACNRATVYK--ILKRMVLIYLVGLAIGWFAKFCYRWNNPQEGAD 117

Query: 130 ----LTYGV-DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
               L Y V     IRL GVL R+A+ Y + +L+ I    V+ K                
Sbjct: 118 FFSQLWYMVWSFDKIRLTGVLARLAVCYGITALLAI---TVRHKHLP------------- 161

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           +++   +L  ++ L+ G                  G  ++ T           N +  +D
Sbjct: 162 YIVGGLLLAYFVILMAGN-----------------GFAYDET-----------NILSIVD 193

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R VL   HMYH                                +PEGLLS++ SI  T++
Sbjct: 194 RAVLTDAHMYHDNG----------------------------IDPEGLLSTLPSIAHTLL 225

Query: 305 GVHFGHVIIH-----TKGHLARLKQWVTMGFALLIFGLTLHFTNAI-----PLNKQLYTL 354
           G   G ++        +   AR    +T    L + G +L F   +     P+NK++++ 
Sbjct: 226 GFIIGGMLFRKADAGVQQLDARANITLTKVVPLFVVGTSLLFAGYLLSYGCPINKKVWSP 285

Query: 355 SYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           ++V VT G A+++ +    ++D      W+
Sbjct: 286 TFVLVTCGLASMLLALFTWIIDVKGKKQWS 315


>gi|223936398|ref|ZP_03628310.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223894916|gb|EEF61365.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 427

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 102/389 (26%), Positives = 165/389 (42%), Gaps = 100/389 (25%)

Query: 33  QRLASLDIFRG-----------LAVALMILVDHAGGDWP--EISHAPWNGCNLADFVMPF 79
           QRL S+D  RG           L  AL  L  +   D+   ++ H  W G +  D + P 
Sbjct: 24  QRLMSVDALRGFDMFWIIGADSLVYALHRLSQNRVTDFLGLQLDHCDWAGFHFYDLIFPL 83

Query: 80  FLFIVGVAIALALKRIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHA-PDELTYGVD 135
           F+FI+GV++  +L +      RA+AVK+V  R+  L    ++  GG   A PD       
Sbjct: 84  FVFIMGVSVVFSLTKAIQQLGRAEAVKRVFRRSALLFVVALIYSGGVRSAWPD------- 136

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
              IRL GVL RIAL Y +  L+  F K          R  +         +AA +L+ Y
Sbjct: 137 ---IRLLGVLNRIALCYFVGGLIFCFFKP---------RAMV--------AIAAALLIGY 176

Query: 196 LALLYGTYVP----------------DWQFTIINKDS--ADYGKVF-NVTCGVRAKLNPP 236
            +++  T+VP                D     I +D+  +D  K+F N T  V AK +  
Sbjct: 177 WSIM--TFVPIRDIRMAHYKEKHELVDNDVDKIMQDTGVSDPAKIFYNTTNWVTAKYDMG 234

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
            N   ++D K LG                          RK    W     +PEGLLS++
Sbjct: 235 YNVANHLDFKYLG-------------------------GRKYDTYW-----DPEGLLSTI 264

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLK--QWVTMGFALLIFGLTLHFTNAIPLNKQLYTL 354
            ++ +T +      +++ +  +  R K    +++G A +I G    ++   P+ K+++T 
Sbjct: 265 PAV-ATCLLGILAGLLLRSTNYCDRWKVIYLLSLGAAGVILGFL--WSIQFPVVKKIWTS 321

Query: 355 SYVCVTSGAAALVFSAIYALVDCNIFGPW 383
           S+V V  G +A++    Y +VD   +  W
Sbjct: 322 SFVLVAGGFSAILLGIFYQVVDVWKYQKW 350


>gi|392308231|ref|ZP_10270765.1| hypothetical protein PcitN1_06167 [Pseudoalteromonas citrea NCIMB
           1889]
          Length = 375

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 95/370 (25%), Positives = 150/370 (40%), Gaps = 104/370 (28%)

Query: 33  QRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS----HAPWNGCNLADFVM 77
           +RLASLD  RG+ +           AL +L    G  W        H+ W+G    D + 
Sbjct: 8   KRLASLDALRGMDMFWILGGQSIFAALFVLTGWQG--WKAFEAHTVHSAWHGFTFYDLIF 65

Query: 78  PFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
           P F+F+ GVA+ L+ KRI      +R     K + R   L   G+L   G+         
Sbjct: 66  PLFIFLSGVAMGLSPKRIDHLPFSERRGYYNKALKRLFLLSALGVLYNHGWGTGIP---- 121

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
            V +  IR   VL RIA+++    L+   T       +++G             +   VL
Sbjct: 122 -VALGEIRYASVLGRIAIAWFFCMLLVWHT-----SLRTLG------------FIGGGVL 163

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL-GIN 251
           + Y  LL   ++P                   V  G    L+   +   ++D  +L GI+
Sbjct: 164 IGYWLLL--CFIP-------------------VPGGTAGDLSAAGSWNAWVDAMLLPGIS 202

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
           +              Q+ P                 +PEG+LSS+ +I++ I GV  G +
Sbjct: 203 Y--------------QNRP----------------VDPEGVLSSLPAIVNAIAGVFAGRL 232

Query: 312 IIHTKGHLARLKQWVTMG----FALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
           I   +     + QW T+     F  +  GL   +    P+NK+L+T S+V VT G +A++
Sbjct: 233 IAQAQ----HVGQWHTVARLFVFGFVALGLGWLWDIHFPVNKELWTSSFVLVTVGWSAIL 288

Query: 368 FSAIYALVDC 377
            +A YALVD 
Sbjct: 289 LAAFYALVDV 298


>gi|311031971|ref|ZP_07710061.1| hypothetical protein Bm3-1_15792 [Bacillus sp. m3-13]
          Length = 370

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 80/345 (23%), Positives = 137/345 (39%), Gaps = 90/345 (26%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDH--AGGDWPEISHAPWNGCNLADFVMPFFL 81
            +  +H+  +R  S+D+ RG+ V + + V     G ++  + HA W G  + D V P FL
Sbjct: 1   METNNHITKKRYRSIDVTRGIVVLVSVFVSALPGGAEYDFLRHAYWYGLTITDLVFPAFL 60

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
            + G+ +A+  ++     D ++    RT  L+ +G+L            ++  D+  +R 
Sbjct: 61  TVYGIGLAIVYRKGVRWKDLLR----RTFLLVLYGLLFN-------LIASWSFDLSTLRF 109

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV--VYLALL 199
            GVLQ  A++ L V ++    K                   W  ++A  +++   YL++L
Sbjct: 110 TGVLQLFAITGLGVVVLSYLAKG------------------WKSMLALGMVIATAYLSIL 151

Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTC--GVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
                                 + +V C  GV  +    CN  G +D  V G  HMY   
Sbjct: 152 V---------------------ISSVGCEGGVPQR---DCNPSGVVDVLVFGEKHMYAQG 187

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
                                        F+PEG+LS  S++ +   G   G V+   K 
Sbjct: 188 --------------------------EKGFDPEGILSIFSALSNVAFGFAVGLVLNGRKQ 221

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
            L R+      G ++ +  L   F N I LNK+L+T S+  + SG
Sbjct: 222 ILQRV-----FGISIGLISLAFIFNNFIELNKRLWTPSFAILASG 261


>gi|445497063|ref|ZP_21463918.1| putative transmembrane protein [Janthinobacterium sp. HH01]
 gi|444787058|gb|ELX08606.1| putative transmembrane protein [Janthinobacterium sp. HH01]
          Length = 353

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 50/144 (34%), Positives = 75/144 (52%), Gaps = 16/144 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            R+ S+D  RGL VA M+LV+ AG DW    P + HA W+GC   DF+ P F+ IVG++I
Sbjct: 1   MRINSIDAVRGLTVAAMLLVNDAG-DWSHVYPWLEHAEWHGCTPPDFIFPIFMLIVGISI 59

Query: 89  ALALKRIPDRADAV----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
            LAL    D   A     + V+ R ++++  G+ L     H    L   ++ R  RL GV
Sbjct: 60  NLALSPRLDAGAATAPLARSVLLRAVRIVLLGLAL-----HVVAMLL--LNGRGFRLFGV 112

Query: 145 LQRIALSYLLVSLVEIFTKDVQDK 168
           LQR  + +    L+ I  +  + +
Sbjct: 113 LQRTGICFAAAGLLAIHVRGARAQ 136



 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 49/95 (51%), Gaps = 7/95 (7%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEG+L ++ S+ + I+G+  G  +        + ++    G A ++ G    ++ A+P 
Sbjct: 188 DPEGILGTLPSLATVILGLRAGDWL-----RSGQTRKLALAGVAAMLAGGI--WSLALPF 240

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGP 382
           NKQL+T S+V  T G   L  +  + L+D   + P
Sbjct: 241 NKQLWTSSFVLWTGGFGMLAIALAHQLIDVRAWPP 275


>gi|440804580|gb|ELR25457.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acanthamoeba
           castellanii str. Neff]
          Length = 446

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 95/375 (25%), Positives = 150/375 (40%), Gaps = 97/375 (25%)

Query: 22  SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG-------------DWPEISHAPWN 68
           +D    S  K  RL SLD+FRG+ +  MILVD+ G                P     P N
Sbjct: 47  TDASCPSAPKKPRLQSLDVFRGVTMLGMILVDNQGNFDHVVRPLDESIVRHPAPPRPPTN 106

Query: 69  G---CNLADFVMPFFLFIVGVAIALALKRIPDRADAVK---KVIFRTLKLLFWGILLQGG 122
                + AD    +  F V +A+     +IPDR   +K   +V+ R   L   G+LL   
Sbjct: 107 ARSWVDPADHCAQWDGFAVALAMNGFWDKIPDRRGKIKAWARVLQRIGTLFVVGLLLNAF 166

Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
            S+  D+  +       R+ G   RIAL Y  V+++ + T  +  +              
Sbjct: 167 GSNPWDKWPH----WHFRIMGC--RIALCYGTVTVLFLATSTIVQR-------------- 206

Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
              ++  C   +Y+ L+YG  VP                     CG R  L P CNA G+
Sbjct: 207 ---VVMLCFTAIYVGLMYGLDVP--------------------KCG-RGNLTPGCNAGGF 242

Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
           IDR + G                                W   P +PEGLLS++++ L+ 
Sbjct: 243 IDRSIFG-------------------------------DWMIRPNDPEGLLSTLTATLTC 271

Query: 303 IIGVHFGHVI-IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
            +G+ FG ++  +    L  + +WV +   L+   L L     +P+NK+++++ +  +  
Sbjct: 272 YLGLEFGRILHKYRANQLELVCRWVMLALGLIGLALFLWLW--MPINKKMWSVPFALMMG 329

Query: 362 GAAALVFSAIYALVD 376
           G   LV    Y LVD
Sbjct: 330 GIGGLVIFICYYLVD 344


>gi|357624248|gb|EHJ75102.1| putative heparan-alpha-glucosaminide N-acetyltransferase [Danaus
           plexippus]
          Length = 340

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 116/283 (40%), Gaps = 37/283 (13%)

Query: 98  RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
           R +A+ +V  R+L L   GI L           +   +   +R  GVLQR+A  YL+V  
Sbjct: 19  RVNALGQVARRSLLLSLIGICLG----------SVNTNWSYVRFPGVLQRLAAMYLIVGS 68

Query: 158 VEI-FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS 216
           +E  F +  Q+       F         WL    ++ + L +      P           
Sbjct: 69  LECAFMRTSQNIIPGRSLFRDIAAGWQQWLATVLMVAIQLCITLTVAAPG-----CPVGY 123

Query: 217 ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLR 276
           +  G +     G  +  N      GYIDR +LG NH+Y H              F+   R
Sbjct: 124 SGPGGLHRTATGDFSLQNCTGGIAGYIDRLILGPNHLYQH------------GTFKSIYR 171

Query: 277 KDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LIF 335
              P       +PEG+L  +S +L    G H   +++      AR+ +WV       ++ 
Sbjct: 172 TQLPH------DPEGILGILSGVLVVQAGAHAARIMLVYNHARARIMRWVFWSVMFGVVG 225

Query: 336 GLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           GL   F++   IP+NK L+++SY  VTS  A  + + +Y +VD
Sbjct: 226 GLLCKFSDGGYIPVNKNLWSVSYCLVTSSMAFFIQAILYFVVD 268


>gi|410463501|ref|ZP_11317013.1| hypothetical protein B193_1525 [Desulfovibrio magneticus str.
           Maddingley MBC34]
 gi|409983383|gb|EKO39760.1| hypothetical protein B193_1525 [Desulfovibrio magneticus str.
           Maddingley MBC34]
          Length = 371

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 52/136 (38%), Positives = 75/136 (55%), Gaps = 17/136 (12%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           T RL S+D  RGLA+A MI+V++ G     +P++ HA W+G  LAD V P FLF+VGV +
Sbjct: 6   TSRLLSVDALRGLAIAAMIVVNNPGDRRFIYPQLLHAHWHGLTLADVVFPLFLFLVGVCV 65

Query: 89  ALAL-----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           ALA+     +    RA   +K++ R   L   G+        + DEL         R+ G
Sbjct: 66  ALAIDLDKARDAKGRARLWRKILPRAAVLFALGLGETAYLRLSFDEL---------RIPG 116

Query: 144 VLQRIALSYLLVSLVE 159
           VLQRIA+ YL  + ++
Sbjct: 117 VLQRIAVVYLAAAWLQ 132


>gi|343087500|ref|YP_004776795.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342356034|gb|AEL28564.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 386

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 91/364 (25%), Positives = 146/364 (40%), Gaps = 98/364 (26%)

Query: 33  QRLASLDIFRGL-------AVALMILVDHAGG----DW--PEISHAPWNGCNLADFVMPF 79
           +RL S+D  RG        A A ++L+    G    DW   +  H  WNG +  DF+ P 
Sbjct: 25  KRLLSIDALRGFDMLLIAGAGAFLVLLKGKTGIPAIDWIAGQFYHPAWNGFSFYDFIFPL 84

Query: 80  FLFIVGVAIALALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FLFI GV++  +L +  +    +    KK   R L L+  GIL    + ++P  +    +
Sbjct: 85  FLFIAGVSLTFSLNKGRNLGMSKPTLYKKTFSRMLVLILLGIL----YKNSPVPI---FE 137

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
              IR   VL RI ++  + +LV +      D  + +G             +A  +LV+Y
Sbjct: 138 PSQIRYGSVLGRIGIATFVTTLVYL----NFDFYKRLG-------------IAMAILVLY 180

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
            A L+   VP             YG            L+   N VG+ DR  +       
Sbjct: 181 YAALFLIPVP------------GYGA---------GDLSIEGNLVGWFDRTFM------- 212

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
                            G L+++        ++  GLL+ + ++  TI G   G ++   
Sbjct: 213 ----------------PGILKQEI-------YDELGLLTQIPALCLTIFGTLAGEILTKA 249

Query: 316 KGHLARLKQWVTMGFALLIFGLT--LHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
                ++KQ    G   L  GL   LHF    P+NK L++ S++ +TSG A L     Y 
Sbjct: 250 WLDTKKIKQLAIAGVISLTLGLIWDLHF----PINKHLWSSSFILLTSGMAFLTLLLFYV 305

Query: 374 LVDC 377
           ++D 
Sbjct: 306 VIDV 309


>gi|339021122|ref|ZP_08645235.1| hypothetical protein ATPR_1543 [Acetobacter tropicalis NBRC 101654]
 gi|338751776|dbj|GAA08539.1| hypothetical protein ATPR_1543 [Acetobacter tropicalis NBRC 101654]
          Length = 377

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 91/332 (27%), Positives = 136/332 (40%), Gaps = 75/332 (22%)

Query: 42  RGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKR 94
           RG  +  M++V++ G     W  + HA WNGC  AD V PFFLF++G  I  A    L+ 
Sbjct: 2   RGATIVFMVIVNNPGDWNRVWSPLDHAAWNGCTPADLVFPFFLFLMGCVIPFAFDRRLRE 61

Query: 95  IPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
              R+  V  + +R L L+   +LL    F H          V  +R  GVL RIAL Y+
Sbjct: 62  GAQRSQLVSHIAWRGLALVGLKLLLSLYPFFH----------VTHLRFFGVLTRIALCYV 111

Query: 154 LVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
               + + ++        +G                 +L+ Y A+LY   VP   +    
Sbjct: 112 AAVSLYLCSRKTGFLVSVIG----------------LILLAYWAILYALPVPGLGWP--- 152

Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
                 GK F       A L+   N   ++DR+       + H      K          
Sbjct: 153 ------GKDF-------AFLDLNRNMAAWLDRQFSAWCQTWLHTGILYEKT--------- 190

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ-WVTMGFAL 332
                        ++PEGLLS++ +I +T+ GV  G V        AR    ++  G A 
Sbjct: 191 -------------WDPEGLLSTLPAIATTLSGVLAGQVFRRKDISPARRPMLFMAAGGAS 237

Query: 333 LIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           ++ GL   +    PLNK L+T S+  V+SGAA
Sbjct: 238 ILVGLL--WGQVFPLNKSLWTSSFTLVSSGAA 267


>gi|218260820|ref|ZP_03475939.1| hypothetical protein PRABACTJOHN_01603 [Parabacteroides johnsonii
           DSM 18315]
 gi|423344001|ref|ZP_17321714.1| hypothetical protein HMPREF1077_03144 [Parabacteroides johnsonii
           CL02T12C29]
 gi|218224343|gb|EEC96993.1| hypothetical protein PRABACTJOHN_01603 [Parabacteroides johnsonii
           DSM 18315]
 gi|409213863|gb|EKN06876.1| hypothetical protein HMPREF1077_03144 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 371

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 97/379 (25%), Positives = 153/379 (40%), Gaps = 108/379 (28%)

Query: 31  KTQRLASLDIFRG-----------LAVALMILVDH----AGGDWPEISHAPWNGCNLADF 75
           +++RL SLD  RG           L VAL  L  +      GD  ++ H  W+G    D 
Sbjct: 6   QSRRLLSLDALRGFDMFFIMGGASLFVALATLFPNPFFQVIGD--QMHHVKWDGLTHHDT 63

Query: 76  VMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           + P FLFI G++   +L++  ++    AD  +K+I R L L+  G +  G  +       
Sbjct: 64  IFPLFLFIAGISFPFSLEKQREQGKTDADIYRKIIRRGLTLVVLGFVYNGLLNF------ 117

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
              D    R   VL RI L+++  +L+ + T+ +                   W+  A +
Sbjct: 118 ---DFEHQRYASVLGRIGLAWMFGALIFVNTRTITRV----------------WITVA-I 157

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL-GI 250
           LV Y  LL     PD           +   VF +            + VGY+DR +L G 
Sbjct: 158 LVGYWLLLAFVPAPD----------GNGAGVFTM----------EGSLVGYVDRLLLPGR 197

Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
            H+  H                               +PEG+LS+V ++ + ++G+  G 
Sbjct: 198 LHLTVH-------------------------------DPEGILSTVPAVATALLGMFTGE 226

Query: 311 VIIHTKGHLARLKQ---WVTMGFALLIFGL--TLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
            I   +  L   K+    V  G  LL  GL  +L F    P+NK L+T S+VCV    + 
Sbjct: 227 FIKMQREGLTDKKKVGGLVIAGAVLLAVGLLWSLFF----PINKNLWTSSFVCVVGAYSV 282

Query: 366 LVFSAIYALVDCNIFGPWT 384
            +F+  + +VD   F  WT
Sbjct: 283 WMFALFFYVVDVLEFRKWT 301


>gi|224537871|ref|ZP_03678410.1| hypothetical protein BACCELL_02758 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|423227284|ref|ZP_17213748.1| hypothetical protein HMPREF1062_05934 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|224520557|gb|EEF89662.1| hypothetical protein BACCELL_02758 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|392624424|gb|EIY18516.1| hypothetical protein HMPREF1062_05934 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 373

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/127 (38%), Positives = 70/127 (55%), Gaps = 12/127 (9%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
             RL +LD+ RG+ +A MILV++ G  W  +     HA W+G    D V PFF+FI+GVA
Sbjct: 5   NNRLLALDVIRGITIAGMILVNNPG-SWQSVYAPLQHARWHGLTPTDLVYPFFMFIMGVA 63

Query: 88  IALALKRIPDRADAVK-KVIFRTLKLLFWGILLQGGFSHAPDELTYGV-DVRMIRLCGVL 145
           I  +L++       V  K+I RT+ L   GI L         +L YG      +R+ GV+
Sbjct: 64  IHFSLRKFDKLNTTVSLKIIRRTVALFAVGIALD-----CFSKLCYGTFSWEHLRILGVM 118

Query: 146 QRIALSY 152
           QR+AL+Y
Sbjct: 119 QRLALAY 125



 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 39/128 (30%), Positives = 56/128 (43%), Gaps = 24/128 (18%)

Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
           N +  ID K+LG  H+    A   S A                      FEPEGLLS++ 
Sbjct: 167 NLIAVIDVKLLGAGHLIKETAEGGSFA----------------------FEPEGLLSTIP 204

Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYV 357
                + G   G +I     +  R++Q    G  LL  G  L + +  P+NK+L+T SY 
Sbjct: 205 CWAHVLFGTFVGSLITGIAENKERIRQIALFGTVLLFAGFLLQYLD--PINKKLWTASYT 262

Query: 358 CVTSGAAA 365
            +TSG A+
Sbjct: 263 LITSGTAS 270


>gi|295136516|ref|YP_003587192.1| hypothetical protein ZPR_4697 [Zunongwangia profunda SM-A87]
 gi|294984531|gb|ADF54996.1| conserved hypothetical protein [Zunongwangia profunda SM-A87]
          Length = 371

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 53/144 (36%), Positives = 77/144 (53%), Gaps = 12/144 (8%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +   R  SLDI RG+ VALMILV++ G  W  I     HA W+G  L D V P FLF+VG
Sbjct: 1   MARSRYLSLDILRGMTVALMILVNNPG-SWATIYAPFKHAAWHGFTLTDLVFPTFLFVVG 59

Query: 86  VAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAP-----DELTYGVDVRMIR 140
            A++ + K++   + +  + + +T K      L+  G S+ P     D      ++  IR
Sbjct: 60  NAMSFSFKKM--NSWSTPEFLTKTFKRAAIIFLIGLGLSYYPFVRRTDGEFILKNILDIR 117

Query: 141 LCGVLQRIALSYLLVSLVEIFTKD 164
           + GVLQRIA+ YLL ++   F K 
Sbjct: 118 IMGVLQRIAVCYLLAAIAIRFLKK 141



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 54/99 (54%), Gaps = 2/99 (2%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           PF+PEGLLS + ++++ I G   G  I  +      + Q V  G A++   L L +   +
Sbjct: 195 PFDPEGLLSCLPAVVNVIFGYLAGRFIQQSVNKKKLVIQLVIAGLAMI--SLALVWDVIL 252

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           P+NK ++T SYV +++G   ++  A+  +++   F  W+
Sbjct: 253 PVNKPIWTSSYVILSTGWDFIILGALIGILEIAEFKSWS 291


>gi|404404699|ref|ZP_10996283.1| hypothetical protein AJC13_04673 [Alistipes sp. JC136]
          Length = 376

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 91/372 (24%), Positives = 152/372 (40%), Gaps = 97/372 (26%)

Query: 33  QRLASLDIFRG-----------LAVALMILVDHAGGDW--PEISHAPWNGCNLADFVMPF 79
           QRL SLD  RG           L VAL  L   +  DW   ++ HA W+G    D + P 
Sbjct: 12  QRLLSLDALRGFDMFFIMGFAGLVVALCKLRPGSFADWMSAQMGHAAWDGFFHHDTIFPL 71

Query: 80  FLFIVGVAIALALKRIPDRADAVK------KVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           FLFI G++   +L +   R   V+      KVI R L L+  G++  G F+         
Sbjct: 72  FLFIAGISFPFSLAK--QREKGVRERSIYTKVIRRGLTLVALGLVYNGLFN--------- 120

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           +D   +RL  VL RI L+++  +++ I             RF I         +AA +LV
Sbjct: 121 LDFATLRLPSVLGRIGLAWMFAAMLFI-------------RFGIRTRIA----LAAVILV 163

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
            Y  LL        QF            V          L    N VGYIDR ++   H+
Sbjct: 164 GYGLLL--------QF------------VAAPDAAGAGPLTEAGNIVGYIDRTIMP-AHL 202

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
           Y +                              F+PEGLLS++ +I++ ++G+  G  + 
Sbjct: 203 YGNRG----------------------------FDPEGLLSTLPAIVTAMLGMFTGEFVR 234

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIY 372
            ++  ++  ++ + M     +  +     + + P+NK+L+T ++V      +  +F+  Y
Sbjct: 235 RSEEQISGSRKALRMAAGAALLLVLALCLDPLQPVNKKLWTPAFVFAAGAYSLGMFALFY 294

Query: 373 ALVDCNIFGPWT 384
            ++D   +  W+
Sbjct: 295 YIIDVRQWRRWS 306


>gi|390946357|ref|YP_006410117.1| hypothetical protein Alfi_1078 [Alistipes finegoldii DSM 17242]
 gi|390422926|gb|AFL77432.1| hypothetical protein Alfi_1078 [Alistipes finegoldii DSM 17242]
          Length = 369

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 88/373 (23%), Positives = 150/373 (40%), Gaps = 97/373 (26%)

Query: 32  TQRLASLDIFRGLAVALMI----LVDHAGGDWP---------EISHAPWNGCNLADFVMP 78
            +RL SLD  RG+ +  ++    LV      WP         ++ HA WNG  + D + P
Sbjct: 4   NRRLLSLDTLRGVDMFFIMGFSGLVTSLCALWPGSFTDMLASQMQHAAWNGLTIQDTIFP 63

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKV---IFRT-LKLLFWGILLQGGFSHAPDELTYGV 134
            FLFI GVA   +L +   R    K++   IFR  L L   G++  G F          +
Sbjct: 64  LFLFIAGVAFPFSLAKQRARGFGRKRILDRIFRRGLILALLGMVYNGLFE---------L 114

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           +   +R+  VL RI L+++  +L+ ++   V+ +                  +A  +L+ 
Sbjct: 115 NFSSLRIASVLGRIGLAWMFAALLCVYC-SVRTRIA----------------VAGIILIG 157

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y  LL     PD          AD              L+      G+IDR+ L  + +Y
Sbjct: 158 YSLLLGLVVAPD------APVGAD-------------PLSVEGCLAGWIDRQYLPGHILY 198

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                                           F+PEG+LS++ +++S + G+  G  ++ 
Sbjct: 199 ------------------------------GAFDPEGILSTLPAVVSALFGMFTGEFLLD 228

Query: 315 TKGHLARLKQWVTMGFALLIFGLT---LHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
            +  L+    W     A+    +T   L +   +P+NK L++ S+ CV SG +  + +  
Sbjct: 229 GRRGLS--GSWKAFYMAVAALAITTAGLCWNLIMPVNKNLWSSSFTCVVSGYSLGMTALF 286

Query: 372 YALVDCNIFGPWT 384
           Y L+D   +  WT
Sbjct: 287 YYLIDVCGYKRWT 299


>gi|334366956|ref|ZP_08515871.1| putative membrane protein [Alistipes sp. HGB5]
 gi|313156833|gb|EFR56273.1| putative membrane protein [Alistipes sp. HGB5]
          Length = 370

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 88/373 (23%), Positives = 150/373 (40%), Gaps = 97/373 (26%)

Query: 32  TQRLASLDIFRGLAVALMI----LVDHAGGDWP---------EISHAPWNGCNLADFVMP 78
            +RL SLD  RG+ +  ++    LV      WP         ++ HA WNG  + D + P
Sbjct: 5   NRRLLSLDTLRGVDMFFIMGFSGLVTSLCALWPGSFTDMLASQMQHAAWNGLTIQDTIFP 64

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKV---IFRT-LKLLFWGILLQGGFSHAPDELTYGV 134
            FLFI GVA   +L +   R    K++   IFR  L L   G++  G F          +
Sbjct: 65  LFLFIAGVAFPFSLAKQRARGFGRKRILDRIFRRGLILALLGMVYNGLFE---------L 115

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           +   +R+  VL RI L+++  +L+ ++   V+ +                  +A  +L+ 
Sbjct: 116 NFSSLRIASVLGRIGLAWMFAALLCVYC-SVRTRIA----------------VAGIILIG 158

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y  LL     PD          AD              L+      G+IDR+ L  + +Y
Sbjct: 159 YSLLLGLVVAPD------APVGAD-------------PLSVEGCLAGWIDRQYLPGHILY 199

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                                           F+PEG+LS++ +++S + G+  G  ++ 
Sbjct: 200 ------------------------------GAFDPEGILSTLPAVVSALFGMFTGEFLLD 229

Query: 315 TKGHLARLKQWVTMGFALLIFGLT---LHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
            +  L+    W     A+    +T   L +   +P+NK L++ S+ CV SG +  + +  
Sbjct: 230 GRRGLS--GSWKAFYMAVAALAITTAGLCWNLIMPVNKNLWSSSFTCVVSGYSLGMTALF 287

Query: 372 YALVDCNIFGPWT 384
           Y L+D   +  WT
Sbjct: 288 YYLIDVCGYKRWT 300


>gi|343083133|ref|YP_004772428.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342351667|gb|AEL24197.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 381

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/126 (38%), Positives = 66/126 (52%), Gaps = 6/126 (4%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            R  SLD+ RGL +ALM++V+  G     +  ++HA W+G  L D V P FLF+VG A++
Sbjct: 12  TRYQSLDVLRGLTLALMVIVNTPGDGSTSFGPLTHADWHGLTLTDLVFPSFLFVVGNAMS 71

Query: 90  LALKRIPDRADAV--KKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLCGVLQ 146
            +L +   +       KV  RT  +   G+LL    F    D      D   IR+ GVLQ
Sbjct: 72  FSLGKFKLKGGKAYFSKVFKRTALIFIIGLLLTAFPFFRVNDSGVVPYDFTSIRILGVLQ 131

Query: 147 RIALSY 152
           RIAL Y
Sbjct: 132 RIALCY 137



 Score = 42.0 bits (97), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 25/97 (25%), Positives = 52/97 (53%), Gaps = 2/97 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLLS+  ++++ ++G   G + I  +G       W+ M FA+++  +   +    P
Sbjct: 206 FDPEGLLSTFPAMVNVLLGYWVG-LQIQKRGGDIETVLWLAM-FAVILLVVGYLWDYGFP 263

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
           +NK+++T S+  VT G + L  + +  +++      W
Sbjct: 264 INKKIWTSSFTLVTVGYSTLTLALLMFILEVRSIKGW 300


>gi|284041428|ref|YP_003391358.1| heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
           DSM 74]
 gi|283820721|gb|ADB42559.1| Heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
           DSM 74]
          Length = 381

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 80/331 (24%), Positives = 130/331 (39%), Gaps = 87/331 (26%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWG 116
           + SH  WNG    D + P F+F+ GV+   +    L +  D+A   +K+I R L L+  G
Sbjct: 60  QFSHPAWNGFRAYDLIFPLFMFMAGVSTPFSVGSRLDQGTDKAKIARKIISRGLILVVLG 119

Query: 117 ILLQGG-FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
           I+   G F+   +++         R   VL RI L+ +   L+ ++ +            
Sbjct: 120 IIYNNGLFNRVFEDM---------RFPSVLGRIGLAGMFAQLIYLYFRPRAQ-------- 162

Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
                Y W       +L+ Y AL+    VP                     CG    L  
Sbjct: 163 -----YIWF----VGLLLGYWALMMLVPVPG--------------------CGA-GVLTM 192

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
            CN   +IDR ++   H+Y                             H   +PEGL S+
Sbjct: 193 ECNLASFIDRMLVP-GHLYKT--------------------------IH---DPEGLFST 222

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF--ALLIFGLTLHFTNAIPLNKQLYT 353
           + +I +T++G+ F    + T G     K  + +G   A ++ G    F    P+NK L+T
Sbjct: 223 LPAIDNTLLGI-FAGTFLRTHGRTGNQKTALLLGAGAAFVLLGWLWDFV--FPINKNLWT 279

Query: 354 LSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
            S+V VT G + L+ +  Y ++D      WT
Sbjct: 280 SSFVLVTGGLSLLLLAVFYWVIDVKGIKRWT 310


>gi|428299602|ref|YP_007137908.1| hypothetical protein Cal6303_2987 [Calothrix sp. PCC 6303]
 gi|428236146|gb|AFZ01936.1| hypothetical protein Cal6303_2987 [Calothrix sp. PCC 6303]
          Length = 104

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 49/81 (60%), Gaps = 3/81 (3%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            + RL SLD+FRG+A+A MILV++ G     +P + HA W+GC   D V PFFLFIVG+A
Sbjct: 10  NSNRLVSLDVFRGIAIASMILVNNPGSWDSIYPPLEHAEWHGCTPTDLVFPFFLFIVGMA 69

Query: 88  IALALKRIPDRADAVKKVIFR 108
           +  +  +         +V +R
Sbjct: 70  MPFSFAKYTKENRPTARVYWR 90


>gi|291514403|emb|CBK63613.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
          Length = 376

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 89/378 (23%), Positives = 149/378 (39%), Gaps = 93/378 (24%)

Query: 25  QEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW--PEISHAPWNGCN 71
           Q      +QRL SLD  RG           L  AL  L      DW   ++ HA WNG  
Sbjct: 4   QPTQPAASQRLLSLDALRGFDMLFIMGFAGLVTALCKLCPGEFSDWMTAQMGHADWNGFF 63

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAP 127
             D + P FLFI G++   +L +  ++  + +    KVI R L L+  G +  G F    
Sbjct: 64  HHDTIFPLFLFIAGISFPFSLAKQREKGMSERSIYLKVIRRGLTLVALGFVYSGLFK--- 120

Query: 128 DELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
                 +D   +RL  VL RI L+++  +L+ +   +   + ++V              +
Sbjct: 121 ------LDFATLRLPSVLGRIGLAWMFAALLFV---NFNVRTRAV--------------I 157

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
           AA +L+ Y  LL     PD                     G    L    N VGY+DR V
Sbjct: 158 AAAILLGYGLLLQFVAAPD--------------------AGGAGPLTLEGNIVGYVDRIV 197

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
                M  H    R                         F+PEGLLS++ +I++ ++G+ 
Sbjct: 198 -----MPSHLLGGRG------------------------FDPEGLLSTLPAIVTAMLGMF 228

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAAL 366
            G  +  ++   +  ++ + M    +         + + P+NK+L+T ++V      +  
Sbjct: 229 TGEFVRRSEERTSGSRKTLWMAAGAVALLALALCLDPLQPINKKLWTPAFVFAAGAYSLG 288

Query: 367 VFSAIYALVDCNIFGPWT 384
           +F+  Y ++D   +  WT
Sbjct: 289 MFALFYYIIDVCQWRRWT 306


>gi|116331948|ref|YP_801666.1| hypothetical protein LBJ_2457 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. JB197]
 gi|116125637|gb|ABJ76908.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
           Hardjo-bovis str. JB197]
          Length = 363

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 132/329 (40%), Gaps = 82/329 (24%)

Query: 44  LAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALAL--KRIPD 97
           + V  MILV++ G  W  +     HA WNGC   D V PFFLF VG +I ++L  K   +
Sbjct: 1   MTVVGMILVNNPG-SWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVGASIPISLYSKNGIN 59

Query: 98  RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
           R      +  R + L     +L G F +   E T+      +R+ GVLQRI   Y +V+ 
Sbjct: 60  RIRIWIGICIRGISL-----ILLGLFLNFFGEWTF----SELRIPGVLQRIGFVYWVVA- 109

Query: 158 VEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA 217
                             ++F ++         VLV  + +L    V  W  T I     
Sbjct: 110 ------------------TLFLVFP-----GKKVLVFLIPIL---LVHTWILTHIAPPGE 143

Query: 218 DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRK 277
                          L    +   +IDR + G  H+     W+ SK              
Sbjct: 144 S-----------MVSLEQGKDIGAWIDRTIFGEKHL-----WKFSKT------------- 174

Query: 278 DAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGL 337
                    ++PEG LS ++SI +++ GV  G ++   +G   + +     G   L   +
Sbjct: 175 ---------WDPEGFLSGIASIATSLFGVICGFILFRREGR-GKNRVLSIFGLGFLFTFV 224

Query: 338 TLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            L +  ++P+NK L+T SY   T+G A L
Sbjct: 225 GLLWDRSLPMNKSLWTGSYAVYTTGFAFL 253


>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 1105

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/131 (38%), Positives = 71/131 (54%), Gaps = 5/131 (3%)

Query: 33  QRLASLDIFRGLAVALMILVD--HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R  SLD+FRGL + LMILV+   AG D + ++ H PW G   AD V P FLF VG A++
Sbjct: 737 ERFLSLDVFRGLTIFLMILVNTPGAGADAFVQLRHTPWFGFTAADLVFPSFLFAVGNAMS 796

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQR 147
            AL R       ++++  R+  +   G L+       H  D     + +   R+ GVLQR
Sbjct: 797 FALDRGQPLGAFLRRIGKRSALIFLLGFLMYWFPFVHHGADGSWSFIAIDQTRVPGVLQR 856

Query: 148 IALSYLLVSLV 158
           IAL Y L +L+
Sbjct: 857 IALCYALGALL 867


>gi|373953356|ref|ZP_09613316.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
 gi|373889956|gb|EHQ25853.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
          Length = 404

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 76/332 (22%), Positives = 134/332 (40%), Gaps = 87/332 (26%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF-----RTLKLLFW 115
           ++ H+PWNG    D + P F+FI G+++  +  R   ++ +  K I+     RT+ L+  
Sbjct: 79  QLHHSPWNGFTFYDLIFPLFIFIAGISMPFSYNRQVAQSPSSNKQIYVRLIKRTVLLILL 138

Query: 116 GILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
           G ++ G    A  + T        R   VL RIAL+    +++ + +             
Sbjct: 139 GTVVNGALHFAGYQQT--------RFASVLGRIALACFFAAVIYLNSS------------ 178

Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
                  W  +  A +L+ Y  L+    VP             +G       GV   L P
Sbjct: 179 -----LRWQIIWFAVILLGYWLLMALVPVPG------------HG------AGV---LTP 212

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
             N   +ID+  L                        G L +         ++PEGLLS+
Sbjct: 213 GANLSAWIDQHFL-----------------------PGKLHRKV-------YDPEGLLST 242

Query: 296 VSSILSTIIGVHFGHVIIHTKGH-LARLKQWVTM---GFALLIFGLTLHFTNAIPLNKQL 351
           + +I + ++G+  GH +    G  L+ LK+   M   G +L++  L  +   A P+NK +
Sbjct: 243 IPAIATAMMGIFTGHFLQWEPGERLSPLKKIGIMAAAGISLILIALIWNM--AFPINKNM 300

Query: 352 YTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
           +T S+     G + L+F+  Y ++D   +  W
Sbjct: 301 WTSSFTLYAGGWSLLLFTLFYGIIDVAGYKKW 332


>gi|408821750|ref|ZP_11206640.1| hypothetical protein PgenN_01470 [Pseudomonas geniculata N1]
          Length = 355

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 75/139 (53%), Gaps = 16/139 (11%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VGV++
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65

Query: 89  ALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
           A ++      +  R    + V+ R L++L          +     + + +D    R+ GV
Sbjct: 66  AFSVAPRALDVSARPALARGVLERALRILL-------AGALLHLLIWWALDTHHFRIWGV 118

Query: 145 LQRIALSYLLVSLVEIFTK 163
           LQRIA+   LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137


>gi|344208862|ref|YP_004794003.1| hypothetical protein [Stenotrophomonas maltophilia JV3]
 gi|343780224|gb|AEM52777.1| Protein of unknown function DUF2261, transmembrane
           [Stenotrophomonas maltophilia JV3]
          Length = 360

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 78/147 (53%), Gaps = 20/147 (13%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
           K  +  +RLAS+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF
Sbjct: 6   KGSMPPRRLASIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLF 64

Query: 83  IVGVAIALALKRIPDRADAVKK------VIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           +VGV++A ++   P   DA  +      V+ R L++L          +     + + +  
Sbjct: 65  LVGVSMAFSVA--PRALDAAARPALARGVLERALRILL-------AGALLHLLIWWALHT 115

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTK 163
              R+ GVLQRIA+   LV ++ ++ +
Sbjct: 116 HHFRIWGVLQRIAVCAALVGVLAVYAR 142



 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 51/89 (57%), Gaps = 7/89 (7%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLS++ ++ ST++G+  G ++    G  A L     +G A  + GL L     +PL
Sbjct: 198 DPEGLLSTLGALASTVLGLIAGGLL--RNGRSAALAG---LGVATAVLGLLL--ATVLPL 250

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           NKQL+T SYV  T G AAL     + L+D
Sbjct: 251 NKQLWTPSYVLWTGGLAALALWLGHVLID 279


>gi|334364999|ref|ZP_08513969.1| putative membrane protein [Alistipes sp. HGB5]
 gi|313158791|gb|EFR58176.1| putative membrane protein [Alistipes sp. HGB5]
          Length = 383

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 88/371 (23%), Positives = 147/371 (39%), Gaps = 94/371 (25%)

Query: 31  KTQRLASLDIFRGLAVALMI----LVDHAGGDWPE---------ISHAPWNGCNLADFVM 77
           +++RL SLD  RG  +  ++    LV    G WP          +SH  W+G    D + 
Sbjct: 20  QSERLMSLDALRGFDMLFIMGFASLVVAVCGLWPSAVTDAAAASMSHVAWDGFAHHDTIF 79

Query: 78  PFFLFIVGVAI--ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           P FLFI GV+   ++A +R    ++     K++ R L L+  G++  G F          
Sbjct: 80  PLFLFIAGVSFPYSVAKQRAGGMSEGRIYAKIVRRGLTLVVLGMVYNGLFK--------- 130

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           +D   +R+  VL RI L++   S+  +   +   K ++               +A  VL 
Sbjct: 131 LDFENLRIASVLGRIGLAW---SIAAVLYLNFGVKTRAA--------------IAVAVLA 173

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
            Y AL                       V          L    N  GYIDR+ L     
Sbjct: 174 GYGAL--------------------SALVAAPDAAGAGPLTFEGNLAGYIDRQFL----- 208

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
                              G L        +  F+PEGLLS+V ++++ ++G+  G  + 
Sbjct: 209 ------------------PGKL-------IYGSFDPEGLLSTVPAVVTAMLGMFTGEFVR 243

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
            +     R   W+    A L+    L F+  +P+NK+L++ ++VCV    +  +F+  Y 
Sbjct: 244 RSDIRGGRKTLWMAAAAAALLA-AGLAFSGVLPVNKKLWSSTFVCVVGAYSLGMFALFYY 302

Query: 374 LVDCNIFGPWT 384
           L+D   +  WT
Sbjct: 303 LIDVRGWRRWT 313


>gi|338212226|ref|YP_004656281.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336306047|gb|AEI49149.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 369

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 94/370 (25%), Positives = 151/370 (40%), Gaps = 98/370 (26%)

Query: 33  QRLASLDIFRGLAVALMI-----LVDHAGGDW---------PEISHAPWNGCNLADFVMP 78
            RLASLD  RG  + LMI      +   GG            +  H  W+G    DF+ P
Sbjct: 8   SRLASLDALRGFDM-LMISGGGAFLSLMGGKTDSALLNAVAAQFHHPDWDGFTFYDFIFP 66

Query: 79  FFLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
            FLF+ GV+++++LK      IP +   ++KV  R L L F G+L +    +AP ++   
Sbjct: 67  LFLFMAGVSLSISLKNGIAKGIP-QYKLMEKVFKRMLILFFLGLLDK----NAPIDI--- 118

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           +D   IR   VL RI ++  LV+++ +   +   K Q +  F+I  LY      AA +L+
Sbjct: 119 LDPAHIRYGTVLGRIGIATFLVAILYL---NTGWKTQLIVAFTILGLYY-----AALMLI 170

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
                                   DYG            L+   N VG+IDR  +     
Sbjct: 171 ---------------------SVGDYGG---------GNLSFEGNLVGWIDRAFM----- 195

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
              P   R K  T D                       + + +S+   TI G   G +++
Sbjct: 196 ---PG--RLKQTTYDEL--------------------AMTTQLSATCLTIFGSLAGKILL 230

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
                  +L +   MG   +  GL   +    P+NK L++ S++ +T+G A+L+ +  Y 
Sbjct: 231 DKTTANIKLIRLAGMGVIGIAAGLA--WATVFPINKHLWSSSFILLTAGMASLLVALFYG 288

Query: 374 LVDCNIFGPW 383
           ++D   F  W
Sbjct: 289 IMDVLKFTKW 298


>gi|395761203|ref|ZP_10441872.1| hypothetical protein JPAM2_05565 [Janthinobacterium lividum PAMC
           25724]
          Length = 373

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 73/137 (53%), Gaps = 9/137 (6%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
           + +QR  +LD+ RGL VALMI+V+  G DW  +     HA W+G  L D V P FLF+VG
Sbjct: 3   IASQRYLALDVLRGLTVALMIVVNTPG-DWGSVYAPFLHAEWHGFTLTDLVFPSFLFVVG 61

Query: 86  VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRMIRL 141
            A+A  L +  + A    + K+  R+  +   G LL    F    D   +    +   R+
Sbjct: 62  NALAFVLGKYENLAHGAVLAKLCKRSALIFLLGFLLYWFPFFKIDDAGQFAWSSLSQTRI 121

Query: 142 CGVLQRIALSYLLVSLV 158
            GVLQRIA+ YL  +L+
Sbjct: 122 PGVLQRIAVCYLAAALI 138


>gi|357628855|gb|EHJ78009.1| putative heparan-alpha-glucosaminide N-acetyltransferase [Danaus
           plexippus]
          Length = 275

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 58/172 (33%), Positives = 86/172 (50%), Gaps = 34/172 (19%)

Query: 2   SEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
           +E+ +ET     ++ +E  V     +S     RL SLDIFRG+A+ALM           +
Sbjct: 59  NELGSETR----VLTTEASVPRSPTRS-----RLRSLDIFRGIAIALM--------QANK 101

Query: 62  ISHAPWNGCNLADFVMPFFLFIVGVAIALALK-----RIPDRADAVKKVIFRTLKLLFWG 116
            SHA WNG  +AD V P+F F +G A+ L+L       +P R +A+ +V  R+L L   G
Sbjct: 102 FSHAVWNGLTVADLVFPWFAFTMGEAMVLSLNARLRTSLP-RVNALGQVARRSLLLSLIG 160

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQD 167
           I L           +   +   +R  GVLQR+A  YL+V  +E  F +  Q+
Sbjct: 161 ICLG----------SVNTNWSYVRFPGVLQRLAAMYLIVGSLECAFMRTSQN 202


>gi|374384982|ref|ZP_09642493.1| hypothetical protein HMPREF9449_00879 [Odoribacter laneus YIT
           12061]
 gi|373227040|gb|EHP49361.1| hypothetical protein HMPREF9449_00879 [Odoribacter laneus YIT
           12061]
          Length = 382

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 86/371 (23%), Positives = 150/371 (40%), Gaps = 84/371 (22%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWPE-----ISHAPWNG 69
           D  + +H   +RL SLD  RG  +  ++        L   A  +W        +H  W G
Sbjct: 2   DTMKSTHSAAKRLESLDALRGFDLFFLVALGPLMNSLARAADAEWFNNWMGIFNHVSWEG 61

Query: 70  CNLADFVMPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSH 125
            +  D +MP FLF+ G+++  AL R   +PD+   ++++  R L L  +G++ QG     
Sbjct: 62  FSPWDLIMPLFLFMSGISMPFALARYKSMPDKRPLLRRLGKRILLLWIFGMICQGNLLGL 121

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
            PD        ++      LQ IA  YL+ +L+ +FT+    + Q +             
Sbjct: 122 NPD--------KIYLYSNTLQAIAAGYLITALLFLFTR---RRTQLI------------- 157

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
             A  +L++Y A +        QF  ++     YG              P  N   +ID+
Sbjct: 158 -TAVLLLLIYWAAM--------QFIQVD----GYGG---------GNYTPQGNLAEWIDK 195

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            VLG           R  A   D       +     W H  +    +LSS++  ++ + G
Sbjct: 196 VVLGR---------FRDTAQLVDG------KVVVAEWYHYTW----ILSSLNFGVTVLTG 236

Query: 306 VHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
           +  G++         +LK +  +G  ++  G   +F   +P+ K ++T S V V+SG   
Sbjct: 237 LFAGYIAKDKIEEKRKLKLYFGIGATMVTIGWLWNFQ--MPVIKTIWTSSMVLVSSGYCF 294

Query: 366 LVFSAIYALVD 376
           L+    Y  +D
Sbjct: 295 LLMGLFYYWID 305


>gi|390946391|ref|YP_006410151.1| hypothetical protein Alfi_1113 [Alistipes finegoldii DSM 17242]
 gi|390422960|gb|AFL77466.1| hypothetical protein Alfi_1113 [Alistipes finegoldii DSM 17242]
          Length = 366

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 88/371 (23%), Positives = 146/371 (39%), Gaps = 94/371 (25%)

Query: 31  KTQRLASLDIFRGLAVALMI----LVDHAGGDWPE---------ISHAPWNGCNLADFVM 77
           +++RL SLD  RG  +  ++    LV    G WP          +SH  W+G    D + 
Sbjct: 3   QSERLMSLDALRGFDMLFIMGFASLVVAVCGLWPSAVTDAAAASMSHVAWDGFAHHDTIF 62

Query: 78  PFFLFIVGVAI--ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           P FLFI GV+   ++A +R    ++     K++ R L L+  G++  G F          
Sbjct: 63  PLFLFIAGVSFPYSVAKQRAGGMSEGRIYAKIVRRGLTLVVLGMVYNGLFK--------- 113

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           +D   +R+  VL RI L++   S+  +   +   K ++               +A  VL 
Sbjct: 114 LDFENLRIASVLGRIGLAW---SIAAVLYLNFGVKTRAA--------------IAVAVLA 156

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
            Y AL                       V          L    N  GYIDR+ L     
Sbjct: 157 GYGAL--------------------SALVAAPDAAGAGPLTFEGNLAGYIDRQFL----- 191

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
                              G L        +  F+PEGLLS+V ++++ ++G+  G  + 
Sbjct: 192 ------------------PGKL-------IYGSFDPEGLLSTVPAVVTAMLGMFTGEFVR 226

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
                  R   W+    A L+    L F+  +P+NK+L++ ++VCV    +  +F+  Y 
Sbjct: 227 RGDIRGGRKTLWMAAAAAALLA-AGLAFSGVLPVNKKLWSSTFVCVVGAYSLGMFALFYY 285

Query: 374 LVDCNIFGPWT 384
           L+D   +  WT
Sbjct: 286 LIDVRGWRRWT 296


>gi|297567057|ref|YP_003686029.1| hypothetical protein [Meiothermus silvanus DSM 9946]
 gi|296851506|gb|ADH64521.1| Protein of unknown function DUF2261, transmembrane [Meiothermus
           silvanus DSM 9946]
          Length = 377

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 55/153 (35%), Positives = 80/153 (52%), Gaps = 27/153 (17%)

Query: 18  EPDVSDQQEKSHLKTQ----RLASLDIFRGLAVALMILVDHAGGDWPE---ISHAPWN-G 69
            P   DQQ ++   ++    RL SLD+FRGL + LM+LV++   D      ++HAPW  G
Sbjct: 7   NPPTQDQQTETPFPSRKTAMRLGSLDVFRGLTILLMLLVNNVALDANTPYLLTHAPWKGG 66

Query: 70  CNLADFVMPFFLFIVGVAIALAL-----KRIPD-RADAVKKVIFRTLKLLFWGILLQGGF 123
             LAD V P+FL  VGVAI  A      K +P  R D   K+I R++ L   G+L+    
Sbjct: 67  VYLADLVFPWFLLAVGVAIPFAAASFRKKNLPSWRYDL--KIIQRSIVLFGLGLLIVSSI 124

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVS 156
           +  P    + +D        VLQ IA++YL+ +
Sbjct: 125 ARRP---VFALD--------VLQLIAMAYLVAA 146


>gi|320333679|ref|YP_004170390.1| hypothetical protein [Deinococcus maricopensis DSM 21211]
 gi|319754968|gb|ADV66725.1| hypothetical protein Deima_1072 [Deinococcus maricopensis DSM
           21211]
          Length = 376

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 73/215 (33%), Positives = 105/215 (48%), Gaps = 40/215 (18%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDW---PEISHAPW-NGCNLADFVMPFFLFIVGV 86
           +  RLA+LD +RGL V LM+LV++   DW    E+ HAPW  G  LAD V P+FLF  G 
Sbjct: 24  RGARLAALDAWRGLTVLLMLLVNNVALDWRTPKELMHAPWGGGATLADLVFPWFLFCAGT 83

Query: 87  AIALAL---KRIPDRADA-VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           A+  +L   +R   R  A V+K++ RT+ L   G++L    +H    LT+G+        
Sbjct: 84  ALPFSLASARRAGVRGWALVRKLLTRTVLLYLVGVVLVSAVAH---RLTFGL-------- 132

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQ IAL+ LL                 VG   +         +AA +LV Y A+L  T
Sbjct: 133 GVLQLIALASLL---------GAAGAQLRVGARMV---------LAAALLVGYAAVLLLT 174

Query: 203 YVPDWQFTII--NKDSADY-GKVFNVTCGVRAKLN 234
            VP     ++   +++  Y  + F    GVR  L+
Sbjct: 175 PVPGVGAGVLEETRNAVQYLNQTFLAPLGVRGLLS 209


>gi|198277541|ref|ZP_03210072.1| hypothetical protein BACPLE_03763 [Bacteroides plebeius DSM 17135]
 gi|198270039|gb|EDY94309.1| hypothetical protein BACPLE_03763 [Bacteroides plebeius DSM 17135]
          Length = 338

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/119 (34%), Positives = 72/119 (60%), Gaps = 7/119 (5%)

Query: 49  MILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIP--DRADAVKK 104
           MILV++AGG   +  + H+ WNG    D V PFFLF+VG++  ++L++      ++ ++K
Sbjct: 1   MILVNNAGGPVSYAPLRHSVWNGLTPCDLVFPFFLFMVGISTYISLRKFNFGPTSEVIRK 60

Query: 105 VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTK 163
           ++ RT  ++  G+ +   F +A +   + +D   +R+ GVLQRI L Y +VSL+ I+  
Sbjct: 61  IVRRTFLIILIGLAID-WFGYACNGNFFPIDT--LRIPGVLQRIGLCYGIVSLMVIYIN 116



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 60/108 (55%), Gaps = 3/108 (2%)

Query: 278 DAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGL 337
           +A  +  +P +PEGL  ++S++  T+IG   G +++       R+ + +T    +LI G 
Sbjct: 159 EAHLYKKSPIDPEGLAGTLSAVAHTLIGFMCGRLLLEKISVNKRIVKLITAAVMMLIIGY 218

Query: 338 TLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC-NIFGPWT 384
            L     +P+NK++++ ++V VT G  +L+ + +  ++D  NI   WT
Sbjct: 219 VLSI--WMPINKRVWSTTFVLVTCGWGSLLLALLMYVIDVKNINKGWT 264


>gi|150007979|ref|YP_001302722.1| transmembrane protein [Parabacteroides distasonis ATCC 8503]
 gi|423331514|ref|ZP_17309298.1| hypothetical protein HMPREF1075_01311 [Parabacteroides distasonis
           CL03T12C09]
 gi|149936403|gb|ABR43100.1| putative transmembrane protein [Parabacteroides distasonis ATCC
           8503]
 gi|409230084|gb|EKN22952.1| hypothetical protein HMPREF1075_01311 [Parabacteroides distasonis
           CL03T12C09]
          Length = 378

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 94/365 (25%), Positives = 152/365 (41%), Gaps = 87/365 (23%)

Query: 31  KTQRLASLDIFRG-----LAVALMIL--VDHAGGDWP-------EISHAPWNGCNLADFV 76
           K  RL SLD+ RG     L V  M+L  + HA  D P         SH  W G +  D V
Sbjct: 5   KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAI-DMPWFNSFMWGFSHVEWEGFSTWDLV 63

Query: 77  MPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTY 132
           MP FLF+ GV++  +L R   +PD+    +++  R L L  +G++ QG   +  PD    
Sbjct: 64  MPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQGNLLALDPD---- 119

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACV 191
               R+      LQ IA+ YL+ SL+ ++ +  VQ     +G            + AA +
Sbjct: 120 ----RVYLYSNTLQSIAMGYLIASLLFLYVRIRVQ-----IG------------IAAALL 158

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
           L+ + A+ +         T+ N     Y               P  N   +IDR VLG  
Sbjct: 159 LIFWGAMEF--------ITVGNYGGGSY--------------TPDSNLAEWIDRTVLG-- 194

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
                    R +        +G   ++        +    +LSS++  ++ + G+  G+ 
Sbjct: 195 ---------RFR--------DGATVENGEVIFATWYRYTWILSSLNFGVTVLTGLFAGY- 236

Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
           I+  K +  RLK  +  G  L +      +   +P+ K+L+T S V V+SG   L+    
Sbjct: 237 ILKNKLYSERLKLRMLFGIGLGMVIAGWLWGIELPVIKKLWTSSMVLVSSGYCFLLMGLF 296

Query: 372 YALVD 376
           Y  +D
Sbjct: 297 YYWID 301


>gi|154492358|ref|ZP_02031984.1| hypothetical protein PARMER_01992 [Parabacteroides merdae ATCC
           43184]
 gi|423722056|ref|ZP_17696232.1| hypothetical protein HMPREF1078_00295 [Parabacteroides merdae
           CL09T00C40]
 gi|154087583|gb|EDN86628.1| hypothetical protein PARMER_01992 [Parabacteroides merdae ATCC
           43184]
 gi|409242758|gb|EKN35518.1| hypothetical protein HMPREF1078_00295 [Parabacteroides merdae
           CL09T00C40]
          Length = 370

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 88/380 (23%), Positives = 151/380 (39%), Gaps = 103/380 (27%)

Query: 27  KSHLKTQRLASLDIFRG-----------LAVALMILVDHA-----GGDWPEISHAPWNGC 70
           +   +++RL SLD  RG           L VAL  L  ++     GG   ++ H  W+G 
Sbjct: 2   EKQKQSRRLLSLDALRGFDMFFIMGGASLFVALATLFPNSFFQAIGG---QMDHVEWDGL 58

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGGFSHA 126
              D + P FLFI G++   +L++  ++    +D  K+++ R + L+  G +  G     
Sbjct: 59  THHDTIFPLFLFIAGISFPFSLEKQREQGKSESDIYKRIVRRGITLVLLGCVYNGLLQF- 117

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
                   D   +R   VL RI L ++  +L+                F  FR     W+
Sbjct: 118 --------DFANLRCASVLARIGLGWMFAALL----------------FVHFRTSVRAWI 153

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
            A  +LV Y   +    VP           A+ G                 N VGY+DR 
Sbjct: 154 -AGTILVGYWVWIAFIPVP----------GAEAGP-----------FTLEGNWVGYVDRL 191

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF-EPEGLLSSVSSILSTIIG 305
           +L                               P   H  F +PEGLLS++  I++ ++G
Sbjct: 192 LL-------------------------------PGRLHQGFFDPEGLLSTLPGIVTAMLG 220

Query: 306 VHFGHVI-IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           +  G  I +  +G   + K    +   +L+  + + ++   P+NK+L++ S+VCV    +
Sbjct: 221 MFTGEFIKLRKEGLTDKRKVAGLVLAGVLLLLVGVLWSLVFPINKKLWSSSFVCVVGAYS 280

Query: 365 ALVFSAIYALVDCNIFGPWT 384
             +F+  Y +VD   +  WT
Sbjct: 281 VWMFALFYYIVDVLGWRKWT 300


>gi|333382729|ref|ZP_08474395.1| hypothetical protein HMPREF9455_02561 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332828330|gb|EGK01039.1| hypothetical protein HMPREF9455_02561 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 389

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 46/151 (30%), Positives = 79/151 (52%), Gaps = 19/151 (12%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           + RL SLD+ RG+ +A MI+V+++G     +  + H  W+G    D V PFF+FI+G++ 
Sbjct: 6   SGRLLSLDVLRGITIAGMIMVNNSGSGEYTYAPLKHVAWDGLTPTDLVFPFFMFIMGIST 65

Query: 89  ALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQG-GFSH------APDELTYG------ 133
            ++L++     +   + K++ RT+ +   G+ L   G S        PD L +       
Sbjct: 66  YISLRKFNFEFNTPTLLKILKRTIVIFLIGLGLSWLGLSFRTYHMLEPDNLGFWERFFRA 125

Query: 134 -VDVRMIRLCGVLQRIALSYLLVSLVEIFTK 163
             D   +R  GV+QR+AL+Y   S++ I  K
Sbjct: 126 ITDFGHLRTLGVMQRLALTYGAASIIAITVK 156



 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 59/123 (47%), Gaps = 28/123 (22%)

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
           V  +D+ +LGINHMY            +DS                  +PEGLLS++ +I
Sbjct: 189 VSIVDQAILGINHMY------------KDSGLA--------------IDPEGLLSTIPAI 222

Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
              +IG   G +I++TK +  R+ Q   +G  L   G  L +    P+NK+++T ++V  
Sbjct: 223 AHVLIGFCCGALIMNTKDNDKRISQLFIVGTILTFAGFLLSY--GCPINKKIWTPTFVLA 280

Query: 360 TSG 362
           T G
Sbjct: 281 TCG 283


>gi|423345097|ref|ZP_17322786.1| hypothetical protein HMPREF1060_00458 [Parabacteroides merdae
           CL03T12C32]
 gi|409222883|gb|EKN15820.1| hypothetical protein HMPREF1060_00458 [Parabacteroides merdae
           CL03T12C32]
          Length = 370

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 88/380 (23%), Positives = 151/380 (39%), Gaps = 103/380 (27%)

Query: 27  KSHLKTQRLASLDIFRG-----------LAVALMILVDHA-----GGDWPEISHAPWNGC 70
           +   +++RL SLD  RG           L VAL  L  ++     GG   ++ H  W+G 
Sbjct: 2   EKQKQSRRLLSLDALRGVDMFFIMGGASLFVALATLFPNSFFQAIGG---QMDHVEWDGL 58

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGGFSHA 126
              D + P FLFI G++   +L++  ++    +D  K+++ R + L+  G +  G     
Sbjct: 59  THHDTIFPLFLFIAGISFPFSLEKQREQGKSESDIYKRIVRRGITLVLLGCVYNGLLQF- 117

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
                   D   +R   VL RI L ++  +L+                F  FR     W+
Sbjct: 118 --------DFANLRCASVLARIGLGWMFAALL----------------FVHFRTSVRAWI 153

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
            A  +LV Y   +    VP           A+ G                 N VGY+DR 
Sbjct: 154 -AGVILVGYWVWIAFIPVP----------GAEAGP-----------FTLEGNWVGYVDRL 191

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF-EPEGLLSSVSSILSTIIG 305
           +L                               P   H  F +PEGLLS++  I++ ++G
Sbjct: 192 LL-------------------------------PGRLHQGFFDPEGLLSTLPGIVTAMLG 220

Query: 306 VHFGHVI-IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           +  G  I +  +G   + K    +   +L+  + + ++   P+NK+L++ S+VCV    +
Sbjct: 221 MFTGEFIKLRKEGLTDKRKVAGLVLAGVLLLLVGVLWSLVFPINKKLWSSSFVCVVGAYS 280

Query: 365 ALVFSAIYALVDCNIFGPWT 384
             +F+  Y +VD   +  WT
Sbjct: 281 VWMFALFYYIVDVLGWRKWT 300


>gi|442611023|ref|ZP_21025729.1| N-acetylglucosamine related transporter, NagX [Pseudoalteromonas
           luteoviolacea B = ATCC 29581]
 gi|441746951|emb|CCQ11791.1| N-acetylglucosamine related transporter, NagX [Pseudoalteromonas
           luteoviolacea B = ATCC 29581]
          Length = 373

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 93/365 (25%), Positives = 148/365 (40%), Gaps = 96/365 (26%)

Query: 33  QRLASLDIFRGLAV-----------ALMILVDHAGGDWPE--ISHAPWNGCNLADFVMPF 79
           +RLASLD  RG  +           AL +L    G    E    H+ W+G    D + P 
Sbjct: 6   KRLASLDALRGFDMMWILGGQGIFAALFVLTGWTGWRTFEAHTVHSDWHGFTFYDLIFPL 65

Query: 80  FLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
           F+F+ GVA+ L+ KRI      +R    +K + R + L   GIL   G+           
Sbjct: 66  FIFLSGVAMGLSPKRIDHLPMSERTPIYRKALKRFVLLCLLGILYNHGWGTGIPA----- 120

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           D   IR   VL RIA ++L+ +L+             V  FS+ ++      +   +L+ 
Sbjct: 121 DFSEIRYSSVLGRIAFAWLICALL-------------VWHFSLKQVAY----IGLSILIT 163

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL-GINHM 253
           Y   L   Y+P     +   D+ D              +    NA  +ID+  L G+ + 
Sbjct: 164 YWIWL--CYIP-----VPGGDAGDL------------SIGGSWNA--WIDQNFLPGVRY- 201

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
                        QD P                 +PEG+LSS+ ++++ I G+  G +I 
Sbjct: 202 -------------QDRP----------------VDPEGILSSLPAVVNAIAGLFAGQLIK 232

Query: 314 HT--KGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
               KG    +    + G  +L   L   +    P+NK+L+T S+  VT G +A++ +  
Sbjct: 233 RAPEKGEWKCVALLFSGG--VLFIALGWLWDLVFPVNKELWTSSFTLVTIGWSAILLAVF 290

Query: 372 YALVD 376
           Y LVD
Sbjct: 291 YVLVD 295


>gi|254524630|ref|ZP_05136685.1| putative heparan-alpha-glucosaminide N-acetyltransferase
           (transmembrane protein 76) [Stenotrophomonas sp. SKA14]
 gi|219722221|gb|EED40746.1| putative heparan-alpha-glucosaminide N-acetyltransferase
           (transmembrane protein 76) [Stenotrophomonas sp. SKA14]
          Length = 355

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 46/141 (32%), Positives = 75/141 (53%), Gaps = 20/141 (14%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
           +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VGV++
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65

Query: 89  ALALKRIPDRADAVKK------VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           A ++   P   DA  +      V+ R L++L          +     + + +     R+ 
Sbjct: 66  AFSVA--PRALDAAARPALARGVLERALRILL-------AGALLHLLIWWALHTHHFRIW 116

Query: 143 GVLQRIALSYLLVSLVEIFTK 163
           GVLQRIA+   LV ++ ++ +
Sbjct: 117 GVLQRIAVCAALVGVLAVYAR 137



 Score = 45.1 bits (105), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 34/89 (38%), Positives = 49/89 (55%), Gaps = 7/89 (7%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLS++ ++ ST++G+      +   G  A L     +G A  + GL L     +PL
Sbjct: 193 DPEGLLSTLGALASTVLGLLA--GGLLRNGRSAALGG---LGVATAVLGLLL--ATVLPL 245

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           NKQL+T SYV  T G AAL     + L+D
Sbjct: 246 NKQLWTPSYVLWTGGLAALALWLGHVLID 274


>gi|315498708|ref|YP_004087512.1| hypothetical protein Astex_1695 [Asticcacaulis excentricus CB 48]
 gi|315416720|gb|ADU13361.1| hypothetical protein Astex_1695 [Asticcacaulis excentricus CB 48]
          Length = 378

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 58/152 (38%), Positives = 73/152 (48%), Gaps = 33/152 (21%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW--------PEISHAPW 67
           +SEP  +         TQRL SLD+ RGL V  MILV+   G +        P + HA W
Sbjct: 1   MSEPKTA---------TQRLPSLDVLRGLTVIGMILVNATAGMYYGLQAKVFPLLLHAHW 51

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIP-----DRADAVKKVIFRTLKLLFWGILLQ-- 120
            G  +AD V P FL +VG++I +AL R       D A A +K+  R L+L   G LL   
Sbjct: 52  EGLKIADVVFPAFLTMVGLSIPMALNRAKMTTGLDVAQA-RKIGGRVLRLFLIGWLLSNL 110

Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
           G  +H         D    R  GVLQRI L Y
Sbjct: 111 GWLAH--------FDGEPWRFWGVLQRIGLVY 134


>gi|255037955|ref|YP_003088576.1| hypothetical protein Dfer_4208 [Dyadobacter fermentans DSM 18053]
 gi|254950711|gb|ACT95411.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 371

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 85/378 (22%), Positives = 145/378 (38%), Gaps = 98/378 (25%)

Query: 27  KSHLKTQRLASLDIFRGLAVALM--------ILVDHAGGDWP-----EISHAPWNGCNLA 73
           K    +QRL SLD  RG  +  +        +L    G  W      + +H  WNG    
Sbjct: 2   KDSAPSQRLLSLDTLRGFDMFWISGGEEIFHVLAKVTGWSWAIVLAHQFTHPDWNGFRAY 61

Query: 74  DFVMPFFLFIVGVAIALAL-----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
           D + P FLF+ GV+   +L     K +P  +  V+KVI R + L+F GI+   G      
Sbjct: 62  DLIFPTFLFMAGVSTPFSLGSRLEKGVPP-SQLVRKVIQRGIILVFLGIIYNNG------ 114

Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
              +  +   +R   VL RI L+ +   ++ ++                   Y   W+  
Sbjct: 115 --IFETEWSQMRYPSVLARIGLAGMFAQIIYLYFG-----------------YRARWIWF 155

Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
             +L+ Y   +    VP                     CG    L   CN   Y D  ++
Sbjct: 156 GGLLIGYYLFMMFYPVPG--------------------CGA-GLLTIDCNPASYFDSLII 194

Query: 249 -GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
            G  H+  H                               +PEGL+S++ +I + ++G+ 
Sbjct: 195 PGRLHLTIH-------------------------------DPEGLVSTIPAIATGLMGIF 223

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLT-LHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            G ++  +   L++  + + + FA ++  L  L +    P+NK L+T S+V    G + L
Sbjct: 224 AGELLRTSHEVLSQKSKVIYLVFAGVVSLLVCLVWDYFFPINKNLWTSSFVLCAGGFSTL 283

Query: 367 VFSAIYALVDCNIFGPWT 384
           + +  Y +VD   +  WT
Sbjct: 284 LLALFYWIVDVLNYRKWT 301


>gi|423345098|ref|ZP_17322787.1| hypothetical protein HMPREF1060_00459 [Parabacteroides merdae
           CL03T12C32]
 gi|409222884|gb|EKN15821.1| hypothetical protein HMPREF1060_00459 [Parabacteroides merdae
           CL03T12C32]
          Length = 376

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 89/365 (24%), Positives = 151/365 (41%), Gaps = 92/365 (25%)

Query: 33  QRLASLDIFRGLAVALMILVD---HAGG---DWP-------EISHAPWNGCNLADFVMPF 79
           QRL SLD+ RG  +  ++ ++   H+ G   D P         SH  W G +  D VMP 
Sbjct: 6   QRLESLDVLRGFDLFCLVALEGVLHSLGRAIDAPWYNDFLWGFSHVQWEGFSSWDLVMPL 65

Query: 80  FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGVD 135
           F+F+ GV++  AL R   +PD+    ++++ R   L  +G++ QG      PD       
Sbjct: 66  FMFMAGVSMPFALSRYKAMPDKWAVYRRIVKRVALLWIFGMMCQGNLLGLDPD------- 118

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
            R+      LQ IA+ YL+ +++ +  +  VQ     +G              A  +L+ 
Sbjct: 119 -RIYLYSNTLQAIAMGYLISAMLFLHVRLSVQ-----IGT-------------AVALLLA 159

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y   +        QF  ++     YG              P  N   ++DR VLG     
Sbjct: 160 YWGAM--------QFITVD----GYGG---------GNYTPDGNLAEWVDRTVLGR---- 194

Query: 255 HHPAWRRSKACTQDSP--FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                 R  A  ++    F G  R    +W         +LSS++  ++ + GV  GH++
Sbjct: 195 -----FRDAAVVENGQIVFAGSYRY---TW---------ILSSLNFGVTVLTGVFAGHIL 237

Query: 313 IHTKGHLARLKQWV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
              K  + R  +W   +G   ++  L   +   +P+ K+++T S V V+SG   L+    
Sbjct: 238 ---KSGMDRKHKWQWLLGIGAIMVALGWLWGLQLPVIKKIWTSSMVLVSSGYCFLLMGLF 294

Query: 372 YALVD 376
           Y  +D
Sbjct: 295 YYWID 299


>gi|329851798|ref|ZP_08266479.1| hypothetical protein ABI_45670 [Asticcacaulis biprosthecum C19]
 gi|328839647|gb|EGF89220.1| hypothetical protein ABI_45670 [Asticcacaulis biprosthecum C19]
          Length = 398

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 91/367 (24%), Positives = 153/367 (41%), Gaps = 86/367 (23%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIA 89
           R  +LDI RGL +  M+L ++AG DW  I     HA W+G  L D V P F+  VG+++ 
Sbjct: 27  RFEALDILRGLFIIGMLLANNAG-DWSHIYTPLDHAEWHGFTLTDMVFPGFMTCVGLSMT 85

Query: 90  LALKR----IPDRADAVKKVIFRTLKL--------LFWGILLQGGFSHAPDELTYGVDVR 137
           L+L R    +  +A     ++  +L+         LF  +L Q  F H            
Sbjct: 86  LSLGRRQKTLNSQAGGKAALLVHSLRRAAILVGIGLFLNLLPQFDFEHW----------- 134

Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
             RL GVLQRI + Y + S + +       +   +       L+   +L+A  +L+ Y+ 
Sbjct: 135 --RLPGVLQRIGICYAIASGLVVLHSHQNQQGGLILHSRALALWGVGFLVAYTLLLKYVP 192

Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           +  G     W                       A  + P     ++D +VLG+NH+    
Sbjct: 193 VPDGAGANQWD----------------------AIHSWPA----WVDMQVLGVNHV---- 222

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
            W  +K                       ++PEGLLSSV +  + + G+  G + I+T+ 
Sbjct: 223 -WSGAKT----------------------YDPEGLLSSVPATSNILFGILMG-LYINTRT 258

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
                     +G  L++  L L   + +P+ K+L+T S+V ++ G A  V + +  ++D 
Sbjct: 259 PRNAWGGVAIIGVLLML--LALVLDSYVPIIKKLWTPSFVLLSCGFAFTVLAVLMVVMDR 316

Query: 378 NIFGPWT 384
             F  W 
Sbjct: 317 LGFKRWA 323


>gi|256840847|ref|ZP_05546355.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|298376669|ref|ZP_06986624.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
 gi|256738119|gb|EEU51445.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|298266547|gb|EFI08205.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
          Length = 378

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 95/364 (26%), Positives = 155/364 (42%), Gaps = 85/364 (23%)

Query: 31  KTQRLASLDIFRG-----LAVALMIL--VDHAGGDWP-------EISHAPWNGCNLADFV 76
           K  RL SLD+ RG     L V  M+L  + HA  D P         SH  W G +  D V
Sbjct: 5   KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAI-DMPWFNSFMWGFSHVEWEGFSTWDLV 63

Query: 77  MPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTY 132
           MP FLF+ GV++  +L R   +PD+    +++  R L L  +G++ QG   +  PD    
Sbjct: 64  MPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQGNLLALDPD---- 119

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
               R+      LQ IA+ YL+ SL+ +    V+ + Q +G             +AA +L
Sbjct: 120 ----RVYLYSNTLQSIAMGYLIASLLFL---HVRIRVQ-IG-------------IAASLL 158

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           +++    +GT     +F  +     +YG              P  N   +IDR VLG   
Sbjct: 159 LIF----WGTM----EFITV----GNYGG---------GSYTPDSNLAEWIDRTVLG--- 194

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                   R +        +G   ++        +    +LSS++  ++ + G+  G+ I
Sbjct: 195 --------RFR--------DGATVENGEVIFATWYRYTWILSSLNFGVTVLTGLFAGY-I 237

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +  K +  RLK  +  G  L +      +   +P+ K+L+T S V V+SG   L+    Y
Sbjct: 238 LKNKLYSERLKLRMLFGIGLGMVIAGWLWGIELPVIKKLWTSSMVLVSSGYCFLLMGLFY 297

Query: 373 ALVD 376
             +D
Sbjct: 298 YWID 301


>gi|260790699|ref|XP_002590379.1| hypothetical protein BRAFLDRAFT_76652 [Branchiostoma floridae]
 gi|229275571|gb|EEN46390.1| hypothetical protein BRAFLDRAFT_76652 [Branchiostoma floridae]
          Length = 347

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 128/330 (38%), Gaps = 98/330 (29%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
           S P+ S+Q        +RL SLD FRG                                 
Sbjct: 40  STPE-SEQGLTEKKARERLRSLDTFRG--------------------------------- 65

Query: 77  MPFFLFIVGVAIALALKRIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
              F+FI+G ++AL+ + +  R      V +VI R+ KL   G  L  G  H  +     
Sbjct: 66  ---FVFIMGTSMALSFRGMRKRTSTRRVVFRVITRSAKLFLVGFFLNAG--HGRN----- 115

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH--------- 184
            D+  +R+ GVLQR++++YL+   +E F    +       R +   L   H         
Sbjct: 116 -DLGTVRVPGVLQRLSIAYLVSGFIECFVGKERKSSDERSRLTNPTLQKIHNALRDIVDN 174

Query: 185 WLMAACVLVVYLALLYGTYV------PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
           W      L++ +  L  T++      P            D  +  N T G          
Sbjct: 175 WAAWLLHLLILVIHLIITFLLPVPGCPTGYLGPGGPLLGDGVEYLNCTGG---------- 224

Query: 239 AVGYIDRKVLGINHMYHHPAWR---RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
           A GYIDR +LG +HMY  P  R   ++K                       F+PEG+L S
Sbjct: 225 AAGYIDRLILG-SHMYQTPTVRVFYKTKVA---------------------FDPEGILGS 262

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
           +++I +  +G+  G ++++ K H +R+ +W
Sbjct: 263 LTTIFNCFLGLQAGKILVYYKEHSSRIIRW 292


>gi|408673239|ref|YP_006872987.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
 gi|387854863|gb|AFK02960.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
          Length = 370

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 87/381 (22%), Positives = 142/381 (37%), Gaps = 105/381 (27%)

Query: 27  KSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLA 73
            S   TQRL SLD  RG  +           AL     H    W   ++SH  WNG    
Sbjct: 2   SSSSSTQRLYSLDALRGFDMFWIMGGEDFFHALSEATHHPAAIWIATQLSHVAWNGFRFY 61

Query: 74  DFVMPFFLFIVGV----AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           D + P FLFI GV    ++   +++  D+   ++++I R L L+  G++   G       
Sbjct: 62  DLIFPLFLFISGVSTPYSVGREIEKGIDKQAILRRIIKRGLILVLLGVIYNNGLQIK--- 118

Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
                ++  IR   VL RI L+Y                                  M A
Sbjct: 119 -----ELSQIRFPSVLGRIGLAY----------------------------------MFA 139

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKV--FNVTCG-VRAKLNPPCNAVGYIDRK 246
           C++ VY +         WQ+   +     Y  +  FN   G V   L    N   ++DR 
Sbjct: 140 CIIYVYAS-------QRWQYVWFSGLLIGYYLIMKFNAAPGFVAGDLTIQGNFASWVDRM 192

Query: 247 VL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
           ++ G  H+  H                               +PEGL S++ +I + ++G
Sbjct: 193 LVPGRLHLKIH-------------------------------DPEGLFSTIPAIGTGLLG 221

Query: 306 VHFGHVI--IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGA 363
           +  G+ +  + TK    +      +G   LI G +  +    P+NK L+T S++    G 
Sbjct: 222 IFAGNYLKNVTTKTPTQKAATLAILGLVCLIIGGS--WGIVFPINKNLWTSSFMLYAGGC 279

Query: 364 AALVFSAIYALVDCNIFGPWT 384
           + L+ S  Y ++D   +  W 
Sbjct: 280 SLLLLSLFYFIIDVLQYQRWA 300


>gi|332666399|ref|YP_004449187.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332335213|gb|AEE52314.1| Protein of unknown function DUF2261, transmembrane
           [Haliscomenobacter hydrossis DSM 1100]
          Length = 369

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 86/371 (23%), Positives = 145/371 (39%), Gaps = 98/371 (26%)

Query: 33  QRLASLDIFRGL---------AVALMILVDHAGGDWP----EISHAPWNGCNLADFVMPF 79
           QRL SLD  RG          AV   +        W     ++SH  W+G  L D + P 
Sbjct: 8   QRLYSLDALRGFDMFWIMGAEAVVHSLATATGSSVWEAAAHQLSHPDWHGFRLYDLIFPL 67

Query: 80  FLFIVGVAIALALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           FLF+ GVA   ++ R  +    +   + +VI R L L+  GI+   G    P        
Sbjct: 68  FLFLAGVATPYSVGRDLENGKPKQQLLLRVIRRGLVLVLLGIIYNNGLVLKP-------- 119

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
           +  IR   VL RI L+Y+  +++ ++TK +                          ++ +
Sbjct: 120 LAEIRFPSVLGRIGLAYMFANIIYLYTKQLGQ------------------------IIWF 155

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGV-RAKLNPPCNAVGYIDRKVLGINHMY 254
            ALL G ++                  FN   G     L+   N   Y+DR ++   H+Y
Sbjct: 156 CALLIGYWLL---------------LRFNAAPGFPMGDLSMEGNFASYLDRLIIP-GHLY 199

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                                        H P   EGL S + +I + ++G++ G+++ H
Sbjct: 200 LD--------------------------IHDP---EGLTSCIPAIGTALLGIYAGNLLKH 230

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAI-PLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             G +   K+ + +    ++  +     N + P+NK L+T S+V  T G + L+ SA Y 
Sbjct: 231 --GTMTPPKKALILAIMGVVALVLAQLWNLVFPINKNLWTSSFVLQTGGCSLLLLSAFYY 288

Query: 374 LVDCNIFGPWT 384
           ++D   +  W 
Sbjct: 289 VIDVLGYRRWA 299


>gi|399028715|ref|ZP_10729871.1| hypothetical protein PMI10_01698 [Flavobacterium sp. CF136]
 gi|398073551|gb|EJL64721.1| hypothetical protein PMI10_01698 [Flavobacterium sp. CF136]
          Length = 382

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 81/378 (21%), Positives = 142/378 (37%), Gaps = 104/378 (27%)

Query: 34  RLASLDIFRGLAVALMILVDH------AGGDWP-------EISHAPWNGCNLADFVMPFF 80
           RL SLD  RG  +  ++  +H           P       ++ H  W G    D + P F
Sbjct: 9   RLVSLDALRGFVMFWIMSGEHIIHALAKAAPIPVFVWMSSQLHHTEWEGITFYDMIFPIF 68

Query: 81  LFIVGVAIALALKR----------IPDRADAVKKVIFRTLK----LLFWGILLQGGFSHA 126
           LF+ GV++  + ++          +   A   KK+    LK    L+F G ++ G     
Sbjct: 69  LFVAGVSMPYSFEKKMSIAGVNTPMELPAKEKKKIYLSMLKRTCILIFLGFIVNGLLRFD 128

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
             + T        R   VL RI L++    ++ +   +   K Q +            W 
Sbjct: 129 GYDQT--------RFASVLGRIGLAWFFAGIIYL---NFNLKKQII------------WF 165

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
           +   +LV Y   +    VPD+   ++ K+ +                       GYIDR 
Sbjct: 166 IG--ILVGYYLAMKLIPVPDFGAGVLTKEGS---------------------LEGYIDRM 202

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
            L        P    SK                       ++PEGL S++ ++ + ++G+
Sbjct: 203 FL--------PGRLHSKV----------------------YDPEGLFSTIPAVATALLGM 232

Query: 307 HFGHVIIHTKGHLARLKQWVTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G  +     H +  K+ + M   A+++ G+ L +    P+NK L++ S+VC   G + 
Sbjct: 233 FLGTFLKIKANHFSTNKKILIMVASAIVLIGIGLIWNYDFPINKHLWSSSFVCFVGGCSI 292

Query: 366 LVFSAIYALVDCNIFGPW 383
           L F+  Y ++D   F  W
Sbjct: 293 LFFTFFYLIIDVLGFHKW 310


>gi|301309931|ref|ZP_07215870.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|423340410|ref|ZP_17318149.1| hypothetical protein HMPREF1059_04074 [Parabacteroides distasonis
           CL09T03C24]
 gi|300831505|gb|EFK62136.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|409227845|gb|EKN20741.1| hypothetical protein HMPREF1059_04074 [Parabacteroides distasonis
           CL09T03C24]
          Length = 378

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 93/364 (25%), Positives = 153/364 (42%), Gaps = 85/364 (23%)

Query: 31  KTQRLASLDIFRG-----LAVALMIL--VDHAGGDWP-------EISHAPWNGCNLADFV 76
           K  RL SLD+ RG     L V  M+L  + HA  D P         SH  W G +  D V
Sbjct: 5   KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAI-DMPWFNSFMWGFSHVEWEGFSTWDLV 63

Query: 77  MPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTY 132
           MP FLF+ GV++  +L R   +PD+    +++  R L L  +G++ QG   +  PD    
Sbjct: 64  MPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQGNLLALDPD---- 119

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
               R+      LQ IA+ YL+ SL+ +    V+ + Q +G             +AA +L
Sbjct: 120 ----RVYLYSNTLQSIAMGYLIASLLFL---HVRIRVQ-IG-------------IAASLL 158

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           +++   +        +F  +     +YG              P  N   +IDR VLG   
Sbjct: 159 LIFWGAM--------EFITV----GNYGG---------GSYTPDSNLAEWIDRTVLG--- 194

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                   R +        +G   ++        +    +LSS++  ++ + G+  G+ I
Sbjct: 195 --------RFR--------DGATVENGEVIFATWYRYTWILSSLNFGVTVLTGLFAGY-I 237

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +  K +  RLK  +  G  L +      +   +P+ K+L+T S V V+SG   L+    Y
Sbjct: 238 LKNKLYSERLKLRMLFGIGLGMVIAGWLWGIELPVIKKLWTSSMVLVSSGYCFLLMGLFY 297

Query: 373 ALVD 376
             +D
Sbjct: 298 YWID 301


>gi|255013329|ref|ZP_05285455.1| putative transmembrane protein [Bacteroides sp. 2_1_7]
 gi|410103820|ref|ZP_11298741.1| hypothetical protein HMPREF0999_02513 [Parabacteroides sp. D25]
 gi|409236549|gb|EKN29356.1| hypothetical protein HMPREF0999_02513 [Parabacteroides sp. D25]
          Length = 378

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 94/364 (25%), Positives = 155/364 (42%), Gaps = 85/364 (23%)

Query: 31  KTQRLASLDIFRG-----LAVALMIL--VDHAGGDWP-------EISHAPWNGCNLADFV 76
           K  RL SLD+ RG     L V  M+L  + HA  D P         SH  W G +  D V
Sbjct: 5   KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAI-DMPWFNSFMWGFSHVEWEGFSTWDLV 63

Query: 77  MPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTY 132
           MP FLF+ GV++  +L R   +PD+    +++  R + L  +G++ QG   +  PD    
Sbjct: 64  MPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVVLLWVFGMMCQGNLLALDPD---- 119

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
               R+      LQ IA+ YL+ SL+ +    V+ + Q +G             +AA +L
Sbjct: 120 ----RVYLYSNTLQSIAMGYLIASLLFL---HVRIRVQ-IG-------------IAASLL 158

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           +++    +GT     +F  +     +YG              P  N   +IDR VL    
Sbjct: 159 LIF----WGTM----EFITV----GNYGG---------GSYTPDSNLAEWIDRTVL---- 193

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                  RR +        +G   ++        +    +LSS++  ++ + G+  G+ I
Sbjct: 194 -------RRFR--------DGATVENGEVIFATWYRYTWILSSLNFGVTVLTGLFAGY-I 237

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +  K +  RLK  +  G  L +      +   +P+ K+L+T S V V+SG   L+    Y
Sbjct: 238 LKNKLYSERLKLRMLFGIGLGMVIAGWLWGIELPVIKKLWTSSMVLVSSGYCFLLMGLFY 297

Query: 373 ALVD 376
             +D
Sbjct: 298 YWID 301


>gi|58583544|ref|YP_202560.1| hypothetical protein XOO3921 [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|58428138|gb|AAW77175.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
           10331]
          Length = 362

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 48/124 (38%), Positives = 65/124 (52%), Gaps = 5/124 (4%)

Query: 40  IFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIP 96
           +FRGL + LMILV+ AG     + +++HA W G  LAD V P FLF VG A++ AL    
Sbjct: 1   MFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSAMSFALATNM 60

Query: 97  DRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLL 154
                + +V  R   +   G+L+     F   PD       V  +RL GVLQRI L YL 
Sbjct: 61  PHLQFLGRVSKRAALIALCGVLMYWFPFFHLQPDGGWAFTTVDQVRLTGVLQRIGLCYLA 120

Query: 155 VSLV 158
            +L+
Sbjct: 121 AALL 124


>gi|291514624|emb|CBK63834.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
          Length = 352

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 77/331 (23%), Positives = 134/331 (40%), Gaps = 84/331 (25%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV---IFRT-LKLLFWG 116
           ++ HA WNG  + D + P FLFI GVA   +L +   R    K++   IFR  L L   G
Sbjct: 29  QMQHAAWNGLTIQDTIFPLFLFIAGVAFPFSLAKQRARGFGRKRILDRIFRRGLILALLG 88

Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
           ++  G F          ++   +R+  VL RI L+++  +L+ ++   V+ +        
Sbjct: 89  MVYNGLFE---------LNFSSLRIASVLGRIGLAWMFAALLCVYC-SVRTRIA------ 132

Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
                     +A  +L+ Y  LL     PD          AD              L+  
Sbjct: 133 ----------VAGIILIGYSLLLGLVVAPD------APVGAD-------------PLSVE 163

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
               G+IDR+ L  + +Y                                F+PEG+LS++
Sbjct: 164 GCLAGWIDRQYLPGHILY------------------------------GAFDPEGILSTL 193

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT---LHFTNAIPLNKQLYT 353
            +++S + G+  G  ++  +  L+    W     A+    +T   L +   +P+NK L++
Sbjct: 194 PAVVSALFGMFTGEFLLDGRRGLS--GSWKAFYMAVAALAITTAGLCWNLIMPVNKNLWS 251

Query: 354 LSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
            S+ CV SG +  + +  Y L+D   +  WT
Sbjct: 252 SSFTCVVSGYSLGMTALFYYLIDVCGYKRWT 282


>gi|254446502|ref|ZP_05059978.1| hypothetical protein VDG1235_4753 [Verrucomicrobiae bacterium
           DG1235]
 gi|198260810|gb|EDY85118.1| hypothetical protein VDG1235_4753 [Verrucomicrobiae bacterium
           DG1235]
          Length = 394

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 146/356 (41%), Gaps = 87/356 (24%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
           +K +RL +LD  RG  +  MI+V+  G  W  +     HA W+G    D V PFFLF VG
Sbjct: 1   MKRERLLALDALRGFTIIGMIIVNSPG-SWSHVYSPLLHASWHGVTPTDLVFPFFLFFVG 59

Query: 86  VAIALAL------KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           V+IALA       KR  +R    +K+ +R  K+   G+ L        +E+         
Sbjct: 60  VSIALAYSGKRGTKR--ERVGKYRKIFWRVAKIFALGLFLNLWPYFYFEEM--------- 108

Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
           R+ GVLQRIAL + + +++ + T+  Q                W   + A +L+ Y ALL
Sbjct: 109 RVAGVLQRIALVFGVCAILFLNTRWKQQ--------------LW---VGASILLGYWALL 151

Query: 200 YGTYVPDWQFTIINKDSAD-------YGKVFNVTCGVRAK------LNPPCNAVGYIDRK 246
               VP      +N  + +       YG    V+   R +        P  N   ++DR 
Sbjct: 152 VWVPVP---LDEVNAGALETGIVERSYGTEVAVSVEARGETSIAGNFEPGVNIAAWVDRV 208

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
           +L          W R+                        ++PEGLLS+V ++ + I G+
Sbjct: 209 LL------PGGMWERT------------------------WDPEGLLSTVPAVATGIFGM 238

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
             G +I+       R+     +G   L+ G    +    P NK L++ S+V    G
Sbjct: 239 LVGALILGVGDPYRRVSWVFFVGVVALLIGSAWSWV--FPYNKNLWSSSFVLYAGG 292


>gi|403236334|ref|ZP_10914920.1| hypothetical protein B1040_11244 [Bacillus sp. 10403023]
          Length = 373

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 81/361 (22%), Positives = 145/361 (40%), Gaps = 79/361 (21%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAG-GDWPEISHAPWNGCNLADFVMPFFLF 82
            +  ++    R+ SLD+ RG+ V   + +     G +   +HA W G  L DF++P F+ 
Sbjct: 1   MKNTNNSARSRIHSLDMARGIIVVFSVFLSSLPYGSYDFATHASWYGLTLVDFILPCFIT 60

Query: 83  IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           + GV +A+A +    +    K+ I RT+KL+ +G+L           + +  D+  +R  
Sbjct: 61  VFGVGMAIAYQ----KGVKWKRFISRTIKLILFGLLFN-------IIVAWSFDLSTLRFT 109

Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
           GVLQ  AL  +   L+  F K  +    S+    IF +Y          +++Y+      
Sbjct: 110 GVLQMYALLGIGTVLITRFIK--RPITVSLVGLLIFSIY--------GAILLYMGQTCEG 159

Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
            +P                             P CN    +D  V G +H+Y        
Sbjct: 160 SLPQ----------------------------PGCNPSWLVDPVVFGESHIY-------- 183

Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
                             S     F+PEG+ +  S++ + ++G   G +II  K   A  
Sbjct: 184 ------------------SLGERGFDPEGIPAIFSALGNVLLGFAAGRIIILYKEKGA-- 223

Query: 323 KQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGP 382
             W  +   +L+  L       +P  K+L+T S+  VT+GA +L+ + ++ + D     P
Sbjct: 224 -GWRLLLHGVLLLALAFLVEAFLPFGKRLWTPSFGLVTAGATSLLLALLHIIFDRKYTSP 282

Query: 383 W 383
           +
Sbjct: 283 F 283


>gi|262381452|ref|ZP_06074590.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262296629|gb|EEY84559.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 378

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 93/364 (25%), Positives = 152/364 (41%), Gaps = 85/364 (23%)

Query: 31  KTQRLASLDIFRG-----LAVALMIL--VDHAGGDWP-------EISHAPWNGCNLADFV 76
           K  RL SLD+ RG     L V  M+L  + HA  D P         SH  W G +  D V
Sbjct: 5   KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAI-DMPWFNSFMWGFSHVEWEGFSTWDLV 63

Query: 77  MPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTY 132
           MP FLF+ GV++  +L R   +PD+    +++  R + L  +G++ QG   +  PD    
Sbjct: 64  MPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVVLLWVFGMMCQGNLLALDPD---- 119

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
               R+      LQ IA+ YL+ SL+ +    V+ + Q +G            + AA +L
Sbjct: 120 ----RVYLYSNTLQSIAMGYLIASLLFL---HVRIRVQ-IG------------IAAALLL 159

Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           + + A+ +         T+ N     Y               P  N   +IDR VLG   
Sbjct: 160 IFWGAMEF--------ITVGNYGGGSY--------------TPDSNLAEWIDRTVLG--- 194

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
                   R +        +G   ++        +    +LSS++  ++ + G+  G+ I
Sbjct: 195 --------RFR--------DGATVENGEVIFATWYRYTWILSSLNFGVTVLTGLFAGY-I 237

Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIY 372
           +  K +  RLK  +  G  L +      +   +P+ K+L+T S V V+SG   L+    Y
Sbjct: 238 LKNKLYSERLKLRMLFGIGLGMVIAGWLWGIELPVIKKLWTSSMVLVSSGYCFLLMGLFY 297

Query: 373 ALVD 376
             +D
Sbjct: 298 YWID 301


>gi|119774084|ref|YP_926824.1| hypothetical protein Sama_0947 [Shewanella amazonensis SB2B]
 gi|119766584|gb|ABL99154.1| conserved hypothetical protein [Shewanella amazonensis SB2B]
          Length = 378

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 92/377 (24%), Positives = 148/377 (39%), Gaps = 100/377 (26%)

Query: 24  QQEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW----PEISHAPWN 68
           Q  ++     RL SLD  RG           L +AL  L   +   W     E+ H+ W+
Sbjct: 2   QATQTKAAKPRLMSLDALRGFDMFWILGGEKLFIALFALTGWS--FWQLADAEMHHSEWH 59

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAP 127
           G    D + P F+F+ GVA+ L+ KR+   A A +  I+R  +K LF  + L   ++H  
Sbjct: 60  GFTFYDLIFPLFIFLSGVALGLSPKRLDKLAPAERNPIYRHAVKRLFLLLALGVLYNHG- 118

Query: 128 DELTYGVDVRM----IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
               +G  +      +R   VL RIA ++   +L+                        W
Sbjct: 119 ----WGTGIPAHSDEVRYASVLGRIAFAWFFAALL-----------------------VW 151

Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
           H  +   +    LA+L+G               A       V  G    L P  +   ++
Sbjct: 152 HTSLRTQIATA-LAILFGY--------------AAIQLWLPVPGGQAGVLTPSGSINAWV 196

Query: 244 DRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
           D   L GI +              Q  P++                PEG+LS++ +I++ 
Sbjct: 197 DTHFLPGITY--------------QHRPYD----------------PEGILSTLPAIVNA 226

Query: 303 IIGVHFGHVIIH--TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           ++GV  G  I+    +G  A+       G   L+ G +L   + +P+NK L+T S+V VT
Sbjct: 227 LMGVFVGRFIVKPDARGDWAKAGILTGAGGLSLVLGWSLD--SVLPVNKDLWTSSFVLVT 284

Query: 361 SGAAALVFSAIYALVDC 377
           +G   L  +  Y LVD 
Sbjct: 285 TGWNLLFLALFYVLVDV 301


>gi|430745463|ref|YP_007204592.1| hypothetical protein Sinac_4725 [Singulisphaera acidiphila DSM
           18658]
 gi|430017183|gb|AGA28897.1| hypothetical protein Sinac_4725 [Singulisphaera acidiphila DSM
           18658]
          Length = 391

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 94/387 (24%), Positives = 156/387 (40%), Gaps = 92/387 (23%)

Query: 14  LIISEPDVSDQQEKSHLK-TQRLASLDIFRG-----------LAVALMILVDHAGGD--W 59
           L+ +E  + D    +  K ++RL S+D  RG           LA AL    D + G    
Sbjct: 7   LLTAETPLMDSDSIAAPKPSERLLSIDALRGFDMLWIIGGERLAKALARWSDSSAGKVVQ 66

Query: 60  PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRI--PDRADAVKKVIFRTLKLLFWGI 117
            ++ HA W+G  L D + P FLF+VG  +  +L ++    R    +++  RTL L   G+
Sbjct: 67  EQLEHAEWHGFRLNDLIFPLFLFLVGTVLPFSLGKLQGQGRGAEYRRIARRTLLLFALGL 126

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSI 177
           L  G             D   +R+ GVLQRIAL Y + +L+ ++        + V     
Sbjct: 127 LCNG---------VLKFDWANLRVAGVLQRIALCYGIAALISLWF-----SRRGVA---- 168

Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC 237
                   ++   +LV Y AL+     P          + DY              +   
Sbjct: 169 --------ILLVLILVGYWALMANVGAP-------GHTAGDY--------------SISG 199

Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
           N  G+IDR+ L                        G + K      +   + EGLL+++ 
Sbjct: 200 NLAGWIDRQFL-----------------------PGKIMKS----YYGYGDNEGLLTTIP 232

Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYV 357
           ++ + ++GV  GH +   +G   ++   V  G   LI G+   +    P+NK L+T  +V
Sbjct: 233 AVGTALLGVLAGHWLRSQRGPWQKVAGLVAAGVLSLIVGVA--WGERFPINKILWTSPFV 290

Query: 358 CVTSGAAALVFSAIYALVDCNIFGPWT 384
            V  G + L+ +  YA++D   F  W 
Sbjct: 291 LVAGGLSLLLLALFYAVIDVLRFRRWA 317


>gi|224027055|ref|ZP_03645421.1| hypothetical protein BACCOPRO_03816, partial [Bacteroides
           coprophilus DSM 18228]
 gi|224020291|gb|EEF78289.1| hypothetical protein BACCOPRO_03816 [Bacteroides coprophilus DSM
           18228]
          Length = 373

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 87/384 (22%), Positives = 147/384 (38%), Gaps = 109/384 (28%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP----------------EISHAPWN 68
           + K  +K+QRL SLD+ RG     M  +   G  +                 ++ H PW+
Sbjct: 5   RMKKEVKSQRLQSLDVLRGFD---MFFIMGGGALFAGLATCCPIPFFQAIARQMEHVPWH 61

Query: 69  GCNLADFVMPFFLFIVGVAIALALKRIP----DRADAVKKVIFRTLKLLFWGILLQGGFS 124
           G    D + P FLFI G++   +L++        A   +KVI R L L+  G +  G   
Sbjct: 62  GVAFEDMIFPLFLFIAGISFPYSLEKQKACGMSSAAIYRKVIRRGLVLVLLGCIYNG--- 118

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
                     D   +R   VL RI LS++  +L+ +   +V+ + + +G  ++  L+  +
Sbjct: 119 ------LLDFDFAHLRYASVLGRIGLSWMFAALLFL---NVRTRVR-MGVVAL--LFIGY 166

Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           W + AC                                F    G     +   + VGY+D
Sbjct: 167 WALLAC--------------------------------FPAPDGNGDPFSMEGSLVGYVD 194

Query: 245 RKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
           R  L G  ++  H                               +PEGLL  + ++ +  
Sbjct: 195 RMFLPGQLYLGIH-------------------------------DPEGLLGIIPAVGTAS 223

Query: 304 IGVHFGHVIIHTKGHLARLKQWVTM---GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           +G+  G  I   +  L  L++ V +   G  L++ G         P+NK L+T S+ C+ 
Sbjct: 224 LGMLTGEWI--KREGLPELRKVVLLACAGVMLVVVGWIWDLV--FPINKNLWTSSFACLV 279

Query: 361 SGAAALVFSAIYALVDCNIFGPWT 384
            G + L+F+  Y L+D      WT
Sbjct: 280 GGISMLLFALFYYLIDVRHCHRWT 303


>gi|345322030|ref|XP_003430524.1| PREDICTED: hypothetical protein LOC100681967 [Ornithorhynchus
           anatinus]
          Length = 530

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 51/80 (63%), Gaps = 4/80 (5%)

Query: 44  LAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL----KRIPDRA 99
           L++ LM+ V++ GG +    HAPWNG  +AD VMP+F+FI+G ++ALA     +R  +R 
Sbjct: 130 LSLTLMVFVNYGGGGYWFFEHAPWNGLTVADLVMPWFVFILGTSVALAFYAMRRRGVNRV 189

Query: 100 DAVKKVIFRTLKLLFWGILL 119
             ++K+ +RT  L+  G+  
Sbjct: 190 QLLRKLTWRTAVLMIIGLFF 209


>gi|224064476|ref|XP_002301495.1| predicted protein [Populus trichocarpa]
 gi|222843221|gb|EEE80768.1| predicted protein [Populus trichocarpa]
          Length = 136

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 30/65 (46%), Positives = 38/65 (58%), Gaps = 1/65 (1%)

Query: 199 LYGTYVPDWQFTIINKDSADY-GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
           LYG Y PDW+F + +     Y      V CGV+  L PPCNA G IDR  LG + +Y HP
Sbjct: 68  LYGLYDPDWEFEVPSTHLFGYKSGTKTVNCGVKGSLEPPCNAAGLIDRFFLGEHPLYQHP 127

Query: 258 AWRRS 262
            +RR+
Sbjct: 128 VYRRT 132



 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 37/45 (82%)

Query: 49 MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
          MILVD AGG +P I+H+PW G  L+DFVMPFFLF+VG++I+L  K
Sbjct: 1  MILVDDAGGAFPCINHSPWFGVTLSDFVMPFFLFVVGLSISLVFK 45


>gi|386719962|ref|YP_006186288.1| N-acetylglucosamine related transporter, NagX [Stenotrophomonas
           maltophilia D457]
 gi|384079524|emb|CCH14124.1| N-acetylglucosamine related transporter, NagX [Stenotrophomonas
           maltophilia D457]
          Length = 352

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 75/144 (52%), Gaps = 20/144 (13%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
           +  +RL S+D  RG+ VA M+LV++ G DW  +     H+ W+GC   D V PFFLF+VG
Sbjct: 1   MPPRRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVG 59

Query: 86  VAIALALKRIPDRADAVKK------VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
           V++A ++   P   DA  +      V+ R L++L          +     + + +     
Sbjct: 60  VSMAFSVA--PRALDAAARPALARGVLERALRILL-------AGALLHLLIWWALHTHHF 110

Query: 140 RLCGVLQRIALSYLLVSLVEIFTK 163
           R+ GVLQRIA+    V ++ ++ +
Sbjct: 111 RIWGVLQRIAVCAASVGVLAVYAR 134



 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 49/95 (51%), Gaps = 7/95 (7%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPL 347
           +PEGLLS++ ++ ST++G+  G ++ + +          T    LL+          +PL
Sbjct: 190 DPEGLLSTLGALASTVLGLIAGGLLRNGRAAALAGLGVATAVLGLLL-------ATVLPL 242

Query: 348 NKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGP 382
           NKQL+T SYV  T G AAL     + L+D   + P
Sbjct: 243 NKQLWTPSYVLWTGGLAALALWLGHVLIDQKRWPP 277


>gi|127512051|ref|YP_001093248.1| hypothetical protein Shew_1118 [Shewanella loihica PV-4]
 gi|126637346|gb|ABO22989.1| conserved hypothetical protein [Shewanella loihica PV-4]
          Length = 387

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 98/392 (25%), Positives = 160/392 (40%), Gaps = 115/392 (29%)

Query: 18  EPDVSDQQEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW----PEI 62
           EP  +D   K   K  RL SLD  RG           L  AL++     G  W     ++
Sbjct: 2   EPK-TDTHPKPAAKP-RLMSLDALRGFDMFWILGGEALFAALLVWTGWQG--WRIADAQM 57

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGI 117
            H+ W+G    D + P F+F+ GVA+ L+ KR+     P+R    +  I R + LL +G+
Sbjct: 58  HHSQWHGFTFYDLIFPLFIFLSGVALGLSPKRLDSLPWPERLPLYRHAIKRLMLLLLFGV 117

Query: 118 LLQGGFS----HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
           L   G+      A DE         +R   VL RIA ++   +L+ ++  D++ +     
Sbjct: 118 LYNHGWGTGMPMAADE---------VRYASVLGRIAFAWFFAALL-VWHTDLKTQI---- 163

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
                       L+A  +L+ Y A+     VP                      GV    
Sbjct: 164 ------------LVAIGILLGYAAMQLWLPVPG---------------------GVAGDF 190

Query: 234 NPPCNAVGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGL 292
           +   +   Y+D K L GI++        + +A                       +PEG+
Sbjct: 191 SISGSINAYVDGKFLPGISY--------QGRAT----------------------DPEGI 220

Query: 293 LSSVSSILSTIIGVHFGHVIIHT-------KGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           LS++ +I++ + GV  G  I+ +       KG  A++   +  G   L  G  ++    I
Sbjct: 221 LSTLPAIVNALAGVFTGRFIVKSHPKWGGVKGEWAKVGILLLAGLLSLGLGWLMN--PYI 278

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           P+NK L+T S+V VT G + ++ +  YALVD 
Sbjct: 279 PVNKDLWTSSFVMVTLGWSLILLAIFYALVDV 310


>gi|423722057|ref|ZP_17696233.1| hypothetical protein HMPREF1078_00296 [Parabacteroides merdae
           CL09T00C40]
 gi|409242759|gb|EKN35519.1| hypothetical protein HMPREF1078_00296 [Parabacteroides merdae
           CL09T00C40]
          Length = 376

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 85/363 (23%), Positives = 147/363 (40%), Gaps = 88/363 (24%)

Query: 33  QRLASLDIFRGLAVALMILVD---HAGG---DWPE-------ISHAPWNGCNLADFVMPF 79
           QRL SLD+ RG  +  ++ ++   H  G   D P         SH  W G +  D VMP 
Sbjct: 6   QRLESLDVLRGFDLFCLVALEGVLHPLGRAIDAPWYNDFLWCFSHVQWEGFSSWDLVMPL 65

Query: 80  FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGVD 135
           F+F+ GV++  AL R   +PD+    ++++ R   L  +G++ QG      PD       
Sbjct: 66  FMFMAGVSMPFALSRYKVMPDKWAVYRRIVKRVALLWIFGMMCQGNLLGLDPD------- 118

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
            R+      LQ IA+ YL+ +++ +  +  VQ     +G              A  +L+ 
Sbjct: 119 -RIYLYSNTLQAIAMGYLISAMLFLHVRLSVQ-----IGT-------------AVALLLA 159

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y   +        QF  ++     YG              P  N   ++DR VLG     
Sbjct: 160 YWGAM--------QFITVD----GYGG---------GNYTPDGNLAEWVDRTVLGR---- 194

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                 R  A  ++                  +    +LSS++  ++ + GV  GH++  
Sbjct: 195 -----FRDAAVVENGQIV----------FAESYRYTWILSSLNFGVTVLTGVFAGHIL-- 237

Query: 315 TKGHLARLKQWV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
            K  + R  +W   +G  +++  L   +   +P+ K+++T S V V+SG   L+    Y 
Sbjct: 238 -KSGMDRKHKWQWLLGIGVIMVALGWLWGLQLPVIKKIWTSSMVLVSSGYCFLLMGLFYY 296

Query: 374 LVD 376
            +D
Sbjct: 297 WID 299


>gi|288800484|ref|ZP_06405942.1| conserved hypothetical protein [Prevotella sp. oral taxon 299 str.
           F0039]
 gi|288332697|gb|EFC71177.1| conserved hypothetical protein [Prevotella sp. oral taxon 299 str.
           F0039]
          Length = 409

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 90/382 (23%), Positives = 158/382 (41%), Gaps = 96/382 (25%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE--------------- 61
           S  ++ ++    H  + RL SLD+ RG  +AL++L       W E               
Sbjct: 25  SSTEIFNKATAPH--SGRLLSLDLLRGADLALLVLFQPIIYQWVEASEPTPGSFGEMVFG 82

Query: 62  -ISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPD-RADAVKKVIFRTLK--LLFW-- 115
            I+H PW G    D +MP F+F+ G+ I  ++ +    +  A K  ++R LK  ++ W  
Sbjct: 83  QITHVPWEGFCFWDIIMPLFMFMSGITIPFSMGKYQQGKVKADKGFLWRLLKRFVVLWVL 142

Query: 116 GILLQGGFSHAPDELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
           G++ QG      + L +  D R+I L    LQ IA+ Y++V+L+ ++T     + Q    
Sbjct: 143 GMIAQG------NLLLF--DPRLIHLYSNTLQSIAVGYVMVALLFVYTS---WRTQLA-- 189

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                      ++AAC LV Y+A+                  A +G++            
Sbjct: 190 -----------VVAAC-LVGYVAVF-----------------AIWGQM---------DFT 211

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
              N    IDR+VLG         WR           +G L K    +    +    +LS
Sbjct: 212 IDANICEEIDRQVLG--------HWR-----------DGVLWKGDNWYWDDSYHYTWILS 252

Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTL 354
           S++ +++  +G   G ++      L +++  V  G AL+     LH    +P+ K +++ 
Sbjct: 253 SLNFVVTVYLGCVAGMILRMKLQDLKKVRILVFSGVALIALSFLLH--PVVPIIKHIWST 310

Query: 355 SYVCVTSGAAALVFSAIYALVD 376
           S   ++ G   L+ +  Y  VD
Sbjct: 311 SMTFLSGGICLLLIALAYYWVD 332


>gi|404485011|ref|ZP_11020215.1| hypothetical protein HMPREF9448_00625 [Barnesiella intestinihominis
           YIT 11860]
 gi|404340016|gb|EJZ66447.1| hypothetical protein HMPREF9448_00625 [Barnesiella intestinihominis
           YIT 11860]
          Length = 440

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 47/145 (32%), Positives = 71/145 (48%), Gaps = 25/145 (17%)

Query: 33  QRLASLDIFRGLAVALMIL-----------VDHA--GGDWPEISHAPWNGCNLADFVMPF 79
           QRLASLDI RG  + L++            VD +       +  H  W G    D VMP 
Sbjct: 75  QRLASLDILRGFDLFLLVFLQPVLVSLGACVDSSVMNAVLYQFDHEVWEGFRFWDLVMPL 134

Query: 80  FLFIVGVAIALAL---KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           FLF+ GV++  +    +R+  R    KKV+ R + L   G+++QG           G+D+
Sbjct: 135 FLFMTGVSMPFSFSKYERVESRRFIYKKVLRRFVILFLLGMVVQGNL--------LGLDL 186

Query: 137 RMIRL-CGVLQRIALSYLLVSLVEI 160
           + IRL    LQ IA  YL+ +L+++
Sbjct: 187 KYIRLYSNTLQAIAAGYLIAALIQL 211


>gi|146300862|ref|YP_001195453.1| hypothetical protein Fjoh_3117 [Flavobacterium johnsoniae UW101]
 gi|146155280|gb|ABQ06134.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
          Length = 380

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 88/382 (23%), Positives = 142/382 (37%), Gaps = 111/382 (29%)

Query: 34  RLASLDIFRGLAVALMILVDH------AGGDWP-------EISHAPWNGCNLADFVMPFF 80
           RL SLD+ RG  +  ++  +H           P       ++ HA WNG    D + P F
Sbjct: 8   RLISLDVLRGFVMFWIMSGEHIIHALAKAAPIPIFIWMSSQLHHAEWNGITFYDMIFPVF 67

Query: 81  LFIVGVAIALA---------LKRIPDRADAVKKVIF-----RTLKLLFWGILLQGGFSHA 126
           LF+ GV++  +         +K   D   A K+ I+     RT  LL  G ++ G     
Sbjct: 68  LFVAGVSMPFSFEKKMKLAGVKEPKDLPKAEKRKIYLSMLRRTCILLVLGFVVNGLLRFD 127

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
             + T        R   VL RI L++    ++ +   +   K Q +  F I         
Sbjct: 128 GFDQT--------RFASVLGRIGLAWFFAGIIYL---NFDFKKQLICFFGI--------- 167

Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
                L+ Y A +    VP++   ++ K+ +                       GYIDR 
Sbjct: 168 -----LIGYYAAMKWIPVPNFGAGVLTKEGS---------------------LEGYIDRL 201

Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIG 305
            L                               P   H+  ++PEG+ S++ +I + ++G
Sbjct: 202 FL-------------------------------PGRLHSTVYDPEGIFSTIPAIATALLG 230

Query: 306 VHFGHVIIHTKGHLARLKQWVTMGFA---LLIFGLTLHFTNAIPLNKQLYTLSYVCVTSG 362
           V  G   +  K   +   + + M  A   L+I GL        P+NK L+T S+VC   G
Sbjct: 231 VFIG-TFLKAKCPFSINIKLLLMALAAVVLIIAGLIWDIN--FPINKHLWTSSFVCFVGG 287

Query: 363 AAALVFSAIYALVDCNIFGPWT 384
            + L F   YA++D   F  W 
Sbjct: 288 FSILFFVFFYAIIDLLGFQKWA 309


>gi|322785719|gb|EFZ12357.1| hypothetical protein SINV_16151 [Solenopsis invicta]
          Length = 111

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 42/73 (57%)

Query: 20  DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           +V D+      K  R+ ++D FRG++   MI V+   G +  + HA WNG  L D V P 
Sbjct: 33  NVKDESSNKEPKKNRVKAIDTFRGISTLFMIFVNDGSGSYTVLEHATWNGLLLGDLVFPC 92

Query: 80  FLFIVGVAIALAL 92
           F++I+GV + +AL
Sbjct: 93  FIWIMGVCVPIAL 105


>gi|189463416|ref|ZP_03012201.1| hypothetical protein BACCOP_04135 [Bacteroides coprocola DSM
          17136]
 gi|189429845|gb|EDU98829.1| hypothetical protein BACCOP_04135 [Bacteroides coprocola DSM
          17136]
          Length = 82

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 45/67 (67%), Gaps = 3/67 (4%)

Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
          K QRL SLD+ RG+ +A MI+V++ G     +  ++HA WNG    D V PFF+FI+G++
Sbjct: 3  KAQRLISLDVLRGITIAGMIIVNNPGSWKHVYTPLTHAVWNGLTPTDLVFPFFMFIMGIS 62

Query: 88 IALALKR 94
            ++LK+
Sbjct: 63 TYISLKK 69


>gi|53715734|ref|YP_101726.1| hypothetical protein BF4455 [Bacteroides fragilis YCH46]
 gi|423271955|ref|ZP_17250924.1| hypothetical protein HMPREF1079_04006 [Bacteroides fragilis
           CL05T00C42]
 gi|423276040|ref|ZP_17254983.1| hypothetical protein HMPREF1080_03636 [Bacteroides fragilis
           CL05T12C13]
 gi|52218599|dbj|BAD51192.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
 gi|392696310|gb|EIY89506.1| hypothetical protein HMPREF1079_04006 [Bacteroides fragilis
           CL05T00C42]
 gi|392699545|gb|EIY92721.1| hypothetical protein HMPREF1080_03636 [Bacteroides fragilis
           CL05T12C13]
          Length = 375

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 147/371 (39%), Gaps = 93/371 (25%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S   + RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
             D VMP FLF+ G ++  +  +    PD+    +K+I R + L  +G+++QG       
Sbjct: 62  FWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL----- 116

Query: 129 ELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
               G+D + + L    LQ IA  YL+ +++++                 FR   W  ++
Sbjct: 117 ---LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIV 156

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +LV+Y   +  T++ D                            P  N    +DR V
Sbjct: 157 TALLLVIYWIPM--TFLGD--------------------------FTPEGNFAEKVDRLV 188

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH-APFEPEGLLSSVSSILSTIIGV 306
           LG     H                +G    +  SW   A +    + SS++   + ++G 
Sbjct: 189 LG-----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGA 229

Query: 307 HFGHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   
Sbjct: 230 FAGKIMKAGKDNRRKVVQTLLIIGISLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCF 287

Query: 366 LVFSAIYALVD 376
           L+  A Y  +D
Sbjct: 288 LLMGAFYYRID 298


>gi|265767324|ref|ZP_06094990.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
 gi|263252629|gb|EEZ24141.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
          Length = 375

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 147/371 (39%), Gaps = 93/371 (25%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S   + RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
             D VMP FLF+ G ++  +  +    PD+    +K+I R + L  +G+++QG       
Sbjct: 62  FWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL----- 116

Query: 129 ELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
               G+D + + L    LQ IA  YL+ +++++                 FR   W  ++
Sbjct: 117 ---LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIV 156

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +LV+Y   +  T++ D                            P  N    +DR V
Sbjct: 157 TALLLVIYWIPM--TFLGD--------------------------FTPEGNFAEKVDRLV 188

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH-APFEPEGLLSSVSSILSTIIGV 306
           LG     H                +G    +  SW   A +    + SS++   + ++G 
Sbjct: 189 LG-----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGA 229

Query: 307 HFGHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   
Sbjct: 230 FAGKIMKAGKDNRRKVVQTLLIIGISLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCF 287

Query: 366 LVFSAIYALVD 376
           L+  A Y  +D
Sbjct: 288 LLMGAFYYRID 298


>gi|154492357|ref|ZP_02031983.1| hypothetical protein PARMER_01991 [Parabacteroides merdae ATCC
           43184]
 gi|154087582|gb|EDN86627.1| hypothetical protein PARMER_01991 [Parabacteroides merdae ATCC
           43184]
          Length = 376

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/362 (23%), Positives = 145/362 (40%), Gaps = 86/362 (23%)

Query: 33  QRLASLDIFRGLAVALMILVD---HAGG---DWPE-------ISHAPWNGCNLADFVMPF 79
           QRL SLD+ RG  +  ++ ++   H  G   D P         SH  W+G +  D VMP 
Sbjct: 6   QRLESLDVLRGFDLFCLVALEGVLHPLGRAIDAPWYNDFLWCFSHVQWDGFSSWDLVMPL 65

Query: 80  FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGVD 135
           F+F+ GV++  AL R   +PD+    ++++ R   L  +G++ QG      PD       
Sbjct: 66  FMFMAGVSMPFALSRYKVMPDKWAVYRRIVKRVALLWIFGMMCQGNLLGLDPD------- 118

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
            R+      LQ IA+ YL+ +++ +             R S+             V+ + 
Sbjct: 119 -RIYLYSNTLQAIAMGYLISAMLFLHV-----------RLSV---------QIGTVVALL 157

Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
           LA     Y    QF  ++     YG              P  N   ++DR VLG      
Sbjct: 158 LA-----YWGAMQFITVD----GYGG---------GNYTPDGNLAEWVDRTVLGR----- 194

Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
                R  A  ++                  +    +LSS++  ++ + GV  GH++   
Sbjct: 195 ----FRDAAVVENGQIV----------FAESYRYTWILSSLNFGVTVLTGVFAGHIL--- 237

Query: 316 KGHLARLKQWV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           K  + R  +W   +G   ++  L   +   +P+ K+++T S V V+SG   L+    Y  
Sbjct: 238 KSGMDRKHKWQWLLGIGAIMVALGWLWGLQLPVIKKIWTSSMVLVSSGYCFLLMGLFYYW 297

Query: 375 VD 376
           +D
Sbjct: 298 ID 299


>gi|383753678|ref|YP_005432581.1| hypothetical protein SELR_08500 [Selenomonas ruminantium subsp.
           lactilytica TAM6421]
 gi|381365730|dbj|BAL82558.1| hypothetical protein SELR_08500 [Selenomonas ruminantium subsp.
           lactilytica TAM6421]
          Length = 384

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 47/136 (34%), Positives = 67/136 (49%), Gaps = 12/136 (8%)

Query: 31  KTQRLASLDIFRGLAVALMILVD---HAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
             +RLA++DIFRGLA+A+M+LV+   +    WP + HAPW G  +AD   P F+FI+GV+
Sbjct: 7   NKRRLAAIDIFRGLAIAIMLLVNALPNFEQAWPLLVHAPWAGLTIADLAFPGFVFIMGVS 66

Query: 88  IALALKRIPDRADAVKKVIFRTLKLLF---------WGILLQGGFSHAPDELTYGVDVRM 138
            +L   +        K  I     LL          + ++LQ  F   P        V  
Sbjct: 67  ASLWFPKHEQDGSGEKFCIILKRSLLLILLGFFLCQFPLVLQHVFQPEPGGSLIKDIVEH 126

Query: 139 IRLCGVLQRIALSYLL 154
            R+ GVLQR+ L Y  
Sbjct: 127 GRIPGVLQRLGLVYFF 142


>gi|313145390|ref|ZP_07807583.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|423279948|ref|ZP_17258861.1| hypothetical protein HMPREF1203_03078 [Bacteroides fragilis HMW
           610]
 gi|424661980|ref|ZP_18099017.1| hypothetical protein HMPREF1205_02366 [Bacteroides fragilis HMW
           616]
 gi|313134157|gb|EFR51517.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|404578291|gb|EKA83026.1| hypothetical protein HMPREF1205_02366 [Bacteroides fragilis HMW
           616]
 gi|404584284|gb|EKA88949.1| hypothetical protein HMPREF1203_03078 [Bacteroides fragilis HMW
           610]
          Length = 377

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 89/362 (24%), Positives = 144/362 (39%), Gaps = 86/362 (23%)

Query: 33  QRLASLDIFRGLAVALMILVD-------------HAGGDWPEISHAPWNGCNLADFVMPF 79
           QRL SLD  RGL +  ++ +              H  G    + H  W G +  D +MP 
Sbjct: 7   QRLESLDALRGLDLFFLVALGPLLRTLVRAIDSPHLDGVNWCLRHVDWIGFSPWDLIMPL 66

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLK--LLFW--GILLQGG-FSHAPDELTYGV 134
           FLF+ G++I  AL R    AD   K+I+R  K  LL W  G++ QG   S  PD L    
Sbjct: 67  FLFMSGISIPFALSRFKGEADK-SKLIYRLCKRVLLLWIFGMMCQGNLLSFDPDHLYLYT 125

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           +         LQ IA  Y+  +L+ ++T     K Q V              +   +L++
Sbjct: 126 N--------TLQSIATGYIAAALLFLYTG---RKTQIV--------------LCVALLLI 160

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y A +        +F  I+     YG              P  N   +IDR VLG     
Sbjct: 161 YWAAM--------KFISID----GYGG---------GNYTPEGNLAEWIDRTVLG----- 194

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                 R +        +G   ++        +    +LSS++ I++ + G+  G++   
Sbjct: 195 ------RFR--------DGASVENGTIIFAEGYYYTWILSSLNFIVTVMTGLFAGYIAKD 240

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
                 +L+ +  +G  +++ G T       P+ K ++T S V V+SG   L+    Y  
Sbjct: 241 ATEGKHKLRLYFGIGAGMVLAGWTWGLV--FPVIKTIWTSSMVLVSSGYCFLLMGLFYYW 298

Query: 375 VD 376
           +D
Sbjct: 299 ID 300


>gi|423282787|ref|ZP_17261672.1| hypothetical protein HMPREF1204_01210 [Bacteroides fragilis HMW
           615]
 gi|404581658|gb|EKA86354.1| hypothetical protein HMPREF1204_01210 [Bacteroides fragilis HMW
           615]
          Length = 375

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 147/371 (39%), Gaps = 93/371 (25%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S   + RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSPPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
             D VMP FLF+ G ++  +  +    PD+    +K+I R + L  +G+++QG       
Sbjct: 62  FWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL----- 116

Query: 129 ELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
               G+D + + L    LQ IA  YL+ +++++                 FR   W  ++
Sbjct: 117 ---LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIV 156

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +LV+Y   +  T++ D                            P  N    +DR V
Sbjct: 157 TALLLVIYWIPM--TFLGD--------------------------FTPEGNFAEKVDRLV 188

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH-APFEPEGLLSSVSSILSTIIGV 306
           LG     H                +G    +  SW   A +    + SS++   + ++G 
Sbjct: 189 LG-----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGA 229

Query: 307 HFGHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   
Sbjct: 230 FAGKIMKAGKDNRRKVVQTLLIIGISLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCF 287

Query: 366 LVFSAIYALVD 376
           L+  A Y  +D
Sbjct: 288 LLMGAFYYRID 298


>gi|336411649|ref|ZP_08592112.1| hypothetical protein HMPREF1018_04130 [Bacteroides sp. 2_1_56FAA]
 gi|335941083|gb|EGN02943.1| hypothetical protein HMPREF1018_04130 [Bacteroides sp. 2_1_56FAA]
          Length = 375

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 81/371 (21%), Positives = 145/371 (39%), Gaps = 93/371 (25%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S   + RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
             D VMP FLF+ G ++  +  +    PD+    +K+I R + L  +G+++QG       
Sbjct: 62  FWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL----- 116

Query: 129 ELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
               G+D + + L    LQ IA  YL+ +++++                 FR   W  ++
Sbjct: 117 ---LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIV 156

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +L++Y       ++P                             P  N    +DR V
Sbjct: 157 TALLLLIY-------WIP---------------------MTFLGDFTPEGNFAEKVDRLV 188

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH-APFEPEGLLSSVSSILSTIIGV 306
           LG     H                +G    +  SW   A +    + SS++   + ++G 
Sbjct: 189 LG-----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGA 229

Query: 307 HFGHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   
Sbjct: 230 FAGKIMKAGKDNRRKVVQTLLIIGLSLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCF 287

Query: 366 LVFSAIYALVD 376
           L+  A Y  +D
Sbjct: 288 LLMGAFYYRID 298


>gi|383119755|ref|ZP_09940493.1| hypothetical protein BSHG_3425 [Bacteroides sp. 3_2_5]
 gi|423252290|ref|ZP_17233284.1| hypothetical protein HMPREF1066_04294 [Bacteroides fragilis
           CL03T00C08]
 gi|423252861|ref|ZP_17233792.1| hypothetical protein HMPREF1067_00436 [Bacteroides fragilis
           CL03T12C07]
 gi|251944624|gb|EES85099.1| hypothetical protein BSHG_3425 [Bacteroides sp. 3_2_5]
 gi|392647563|gb|EIY41262.1| hypothetical protein HMPREF1066_04294 [Bacteroides fragilis
           CL03T00C08]
 gi|392659230|gb|EIY52856.1| hypothetical protein HMPREF1067_00436 [Bacteroides fragilis
           CL03T12C07]
          Length = 375

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 81/371 (21%), Positives = 145/371 (39%), Gaps = 93/371 (25%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S   + RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
             D VMP FLF+ G ++  +  +    PD+    +K+I R + L  +G+++QG       
Sbjct: 62  FWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL----- 116

Query: 129 ELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
               G+D + + L    LQ IA  YL+ +++++                 FR   W  ++
Sbjct: 117 ---LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIV 156

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +L++Y       ++P                             P  N    +DR V
Sbjct: 157 TALLLLIY-------WIP---------------------MTFLGDFTPEGNFAEKVDRLV 188

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH-APFEPEGLLSSVSSILSTIIGV 306
           LG     H                +G    +  SW   A +    + SS++   + ++G 
Sbjct: 189 LG-----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGA 229

Query: 307 HFGHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   
Sbjct: 230 FAGKIMKAGKDNRRKVVQTLLIIGISLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCF 287

Query: 366 LVFSAIYALVD 376
           L+  A Y  +D
Sbjct: 288 LLMGAFYYRID 298


>gi|392402534|ref|YP_006439146.1| Protein of unknown function DUF2261, transmembrane [Turneriella
           parva DSM 21527]
 gi|390610488|gb|AFM11640.1| Protein of unknown function DUF2261, transmembrane [Turneriella
           parva DSM 21527]
          Length = 396

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 140/349 (40%), Gaps = 93/349 (26%)

Query: 37  SLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
           SLD+ RGL +ALMI+V++ G DW      + HA W+G   AD V P FLF+ G A AL +
Sbjct: 5   SLDLLRGLTIALMIIVNNPG-DWKAMFAVLRHAEWHGFLGADIVFPLFLFVAGYAAALKI 63

Query: 93  KRI--------PDRADAVK-----------KVIFRTLKLLFWGILLQG-GFSHAPD-ELT 131
            R+        P  A A+             ++ R   L   G+ L        PD E +
Sbjct: 64  DRLYGPTTAGGPHCASALTLEERELPAYYLPLMRRAAILFLIGLFLNAWPLGLLPDTEFS 123

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
           +G     +R+ GVLQRIA+  L+  ++    + V+                    + + +
Sbjct: 124 FG----HLRVLGVLQRIAICVLVGGVLLRRVETVRG-------------------LVSVL 160

Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
           ++++L   +   VP  +F         +G+ F +            N    +D  VL   
Sbjct: 161 ILIFLLYEFCMRVPLVKFA-----DVFFGRSFELQ----------NNFARLVDISVLPEA 205

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG---VHF 308
            +Y                    +RK A       F+PEGLL+S+++I++ + G      
Sbjct: 206 MLYK-------------------VRKIA-------FDPEGLLTSLTAIMTFLFGGLAFRL 239

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYV 357
               I T  H  R       G A +   L     N  P+NK L+T+ +V
Sbjct: 240 ALPCIATTQHKFRNFLQSRWGLASICAVLGFALANFEPINKNLWTIPFV 288


>gi|380512476|ref|ZP_09855883.1| hypothetical protein XsacN4_14717 [Xanthomonas sacchari NCPPB 4393]
          Length = 384

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 49/126 (38%), Positives = 66/126 (52%), Gaps = 15/126 (11%)

Query: 33  QRLASLDIFRGLAVALMILVD--HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R  SLD+FRGL + LMILV+   AG D + ++ H PW G   AD V P FLF VG A++
Sbjct: 16  ERFLSLDVFRGLTIFLMILVNTPGAGADAFVQLRHTPWFGFTAADLVFPSFLFAVGNAMS 75

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILL-------QGGFSHAPDELTYGVDVRMIRLC 142
            AL R       +++V  R+  +   G L+       QG   H    LT    +   R+ 
Sbjct: 76  FALDRGQPLGAFLRRVGKRSALIFLLGFLMYWFPFVHQGADGHW--SLT---AIDQTRVP 130

Query: 143 GVLQRI 148
           GVLQRI
Sbjct: 131 GVLQRI 136



 Score = 38.9 bits (89), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 41/74 (55%), Gaps = 2/74 (2%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLL ++ + ++ I G +   + +   G  AR  +W+ +    L+  L L +    P
Sbjct: 207 FDPEGLLGTLPATVNVIAG-YLTGLYVRRVGKQARTVRWLWLAGGALVL-LALAWQPWFP 264

Query: 347 LNKQLYTLSYVCVT 360
           L K+L+T S+V +T
Sbjct: 265 LAKKLWTGSFVLLT 278


>gi|223936396|ref|ZP_03628308.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223894914|gb|EEF61363.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 383

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 91/396 (22%), Positives = 153/396 (38%), Gaps = 110/396 (27%)

Query: 13  PLIISE-PDVSDQQEKSHLKTQ----RLASLDIFRGLAVALMILVDHAGGDWPEI----- 62
           P   SE P +S+Q   +    Q    R+ S+D  RG  +  ++  D     + +I     
Sbjct: 4   PTSTSEAPALSNQAGSTATLNQKANTRIISIDALRGFDMFWIMGGDQLVRSFQKIDDSAP 63

Query: 63  --------SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPD---RADAVKKVIFRTLK 111
                    H  W G +  D + P F+F+ GV+I  ++ R+ +   R  AVK++ FR++ 
Sbjct: 64  THALANQMEHCEWAGFHFYDLIFPLFVFLAGVSIVFSITRLIEHSGRVAAVKRIAFRSVI 123

Query: 112 LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQS 171
           L  +GI   GG S+           + I L GVL RIA++Y   +L+  F +        
Sbjct: 124 LFLFGIFYMGGVSNG---------FKNIYLAGVLHRIAVAYFFAALLFCFFRPKA----- 169

Query: 172 VGRFSIFRLYCWHWLMAACV--LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGV 229
                         L+A C+  LV Y ALL    VP      +   S D GK        
Sbjct: 170 --------------LIAICIGLLVGYWALLTFVPVPG-----VGAASYDQGK-------- 202

Query: 230 RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEP 289
                   N   Y+D+  L                      FEG                
Sbjct: 203 --------NLAYYLDQHYL------------------PGQKFEGT--------------- 221

Query: 290 EGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT-MGFALLIFGLTLHFTNAIPLN 348
             LLS++ ++ + ++G+  G ++ +      +   W+   G   L+ GL   ++   P+ 
Sbjct: 222 --LLSTMPAVANCLLGIFAGLLLTNKTVDDQKKVYWLLGSGITSLVIGLI--WSIQFPII 277

Query: 349 KQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           K L+T +YV +  G +A++    Y +++   F  W 
Sbjct: 278 KLLWTSTYVLLACGYSAILLGLFYQIIEIWKFQKWA 313


>gi|375360501|ref|YP_005113273.1| putative transmembrane protein [Bacteroides fragilis 638R]
 gi|301165182|emb|CBW24752.1| putative transmembrane protein [Bacteroides fragilis 638R]
          Length = 373

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 83/369 (22%), Positives = 147/369 (39%), Gaps = 91/369 (24%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S   + RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALA-LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
             D VMP FLF+ G ++  +  K  PD+    +K+I R + L  +G+++QG         
Sbjct: 62  FWDLVMPLFLFMTGASMPFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL------- 114

Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
             G+D + + L    LQ IA  YL+ +++++                 FR   W  ++ A
Sbjct: 115 -LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIVTA 156

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
            +L++Y   +  T++ D                            P  N    +DR VLG
Sbjct: 157 LLLLIYWIPM--TFLGD--------------------------FTPEGNFAEKVDRLVLG 188

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH-APFEPEGLLSSVSSILSTIIGVHF 308
                H                +G    +  SW   A +    + SS++   + ++G   
Sbjct: 189 -----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGAFA 229

Query: 309 GHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
           G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   L+
Sbjct: 230 GKIMKAGKDNRRKVVQTLLIIGISLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCFLL 287

Query: 368 FSAIYALVD 376
             A Y  +D
Sbjct: 288 MGAFYYRID 296


>gi|254784997|ref|YP_003072425.1| hypothetical protein TERTU_0813 [Teredinibacter turnerae T7901]
 gi|237684955|gb|ACR12219.1| putative membrane protein [Teredinibacter turnerae T7901]
          Length = 354

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 62/124 (50%), Gaps = 9/124 (7%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            +R  +LD  RG+ +A+MILV+  G     +P + HA W+G    DFV PFFLFIVG A+
Sbjct: 2   NERSLALDALRGITLAMMILVNTPGSWSHVYPPLLHANWHGVTPTDFVFPFFLFIVGCAL 61

Query: 89  ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
             +     +R +    +    LK +F   +L            Y       RL GVLQRI
Sbjct: 62  FFS-----NRKNHQLDIYTHALK-IFRRTVLLLLAGLGLHAYLYSGTFAEFRLPGVLQRI 115

Query: 149 ALSY 152
           AL+Y
Sbjct: 116 ALAY 119



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 49/88 (55%), Gaps = 5/88 (5%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F+PEGLLS++ +I++ + G     +I+         +Q V +  A L  G+ L     +P
Sbjct: 183 FDPEGLLSTLPAIVTVLSGYEATRIIVERTT-----QQKVLVIIAALAIGMALLLHPWVP 237

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           +NK L+T SYV +TSG A LV  A+  L
Sbjct: 238 INKYLWTSSYVLLTSGVAVLVLVALMQL 265


>gi|423344000|ref|ZP_17321713.1| hypothetical protein HMPREF1077_03143 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409213862|gb|EKN06875.1| hypothetical protein HMPREF1077_03143 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 376

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 84/362 (23%), Positives = 145/362 (40%), Gaps = 86/362 (23%)

Query: 33  QRLASLDIFRGLAVALMILVD---HAGG-----DWPE-----ISHAPWNGCNLADFVMPF 79
           +RL SLD+ RG  +  ++ ++   H  G      W        SH  W G +  D VMP 
Sbjct: 6   KRLESLDVLRGFDLFCLVALEGILHPLGRAIDASWYNDFLWGFSHVQWEGFSSWDLVMPL 65

Query: 80  FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGVD 135
           F+F+ GV++  AL R   +PD+    +++I R   L  +G++ QG      PD       
Sbjct: 66  FMFMAGVSMPFALSRYKAMPDKWAVYRRIIKRVALLWIFGMMCQGNLLGLDPD------- 118

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
            R+      LQ IA+ YL+ +++ +  +  VQ     +G              A  +L+V
Sbjct: 119 -RIYLYSNTLQAIAMGYLIAAMLFLHVRLSVQ-----IGT-------------AVALLLV 159

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y   +        QF  ++     YG              P  N   ++DR VLG     
Sbjct: 160 YWGAM--------QFITVD----GYGG---------GNYTPDGNLAEWVDRTVLGR---- 194

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                 R  A  ++                  +    +LSS++  ++ + G+  G  I+ 
Sbjct: 195 -----FRDAAVVENGQVV----------FAESYRYTWILSSLNFGVTVLTGLFAGQ-ILK 238

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           +     R  QW+ +G  + +  L   +   +P+ K+++T S V V+SG   L+    Y  
Sbjct: 239 SAMDQKRKWQWL-LGIGVAMVALGWLWGLQLPVIKKIWTSSMVLVSSGYCFLLMGVFYYW 297

Query: 375 VD 376
           +D
Sbjct: 298 ID 299


>gi|60683670|ref|YP_213814.1| hypothetical protein BF4252 [Bacteroides fragilis NCTC 9343]
 gi|423259842|ref|ZP_17240765.1| hypothetical protein HMPREF1055_03042 [Bacteroides fragilis
           CL07T00C01]
 gi|423267497|ref|ZP_17246478.1| hypothetical protein HMPREF1056_04165 [Bacteroides fragilis
           CL07T12C05]
 gi|60495104|emb|CAH09923.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
 gi|387775880|gb|EIK37984.1| hypothetical protein HMPREF1055_03042 [Bacteroides fragilis
           CL07T00C01]
 gi|392696971|gb|EIY90158.1| hypothetical protein HMPREF1056_04165 [Bacteroides fragilis
           CL07T12C05]
          Length = 375

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 81/371 (21%), Positives = 144/371 (38%), Gaps = 93/371 (25%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCN 71
           ++ S     RLASLDI RG  + L++        L       W      +  H  W G  
Sbjct: 2   KKPSSTPAPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFR 61

Query: 72  LADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
             D VMP FLF+ G ++  +  +    PD+    +K+I R + L  +G+++QG       
Sbjct: 62  FWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL----- 116

Query: 129 ELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
               G+D + + L    LQ IA  YL+ +++++                 FR   W  ++
Sbjct: 117 ---LGLDPKHLYLYSNTLQAIATGYLIAAIIQLHCN--------------FR---WQLIV 156

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
            A +L++Y       ++P                             P  N    +DR V
Sbjct: 157 TALLLLIY-------WIP---------------------MTFLGDFTPEGNFAEKVDRLV 188

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH-APFEPEGLLSSVSSILSTIIGV 306
           LG     H                +G    +  SW   A +    + SS++   + ++G 
Sbjct: 189 LG-----HFR--------------DGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGA 229

Query: 307 HFGHVIIHTKGHLARLKQ-WVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAA 365
             G ++   K +  ++ Q  + +G +L+ F L   ++  +P+ K+L+T S    + G   
Sbjct: 230 FAGKIMKAGKDNRRKVVQTLLIIGISLIAFSLI--WSLQMPIIKRLWTSSMTLFSGGLCF 287

Query: 366 LVFSAIYALVD 376
           L+  A Y  +D
Sbjct: 288 LLMGAFYYRID 298


>gi|433678126|ref|ZP_20510025.1| Heparan-alpha-glucosaminide N-acetyltransferase [Xanthomonas
           translucens pv. translucens DSM 18974]
 gi|430816762|emb|CCP40477.1| Heparan-alpha-glucosaminide N-acetyltransferase [Xanthomonas
           translucens pv. translucens DSM 18974]
          Length = 384

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 67/121 (55%), Gaps = 5/121 (4%)

Query: 33  QRLASLDIFRGLAVALMILVD--HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R  SLD+FRGL + LMIL +   AG D + ++ HAPW G   AD   P FLF+VG A++
Sbjct: 16  ERFLSLDVFRGLMIFLMILGNTPGAGADAFVQLRHAPWLGFTAADVGFPSFLFVVGNAMS 75

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRMIRLCGVLQR 147
            AL R       +++V  R+  +   G L+    F H   + ++  + +   R+ GVLQR
Sbjct: 76  FALDRSQPLGAFLRRVGKRSALIFLLGFLMYWFPFVHQGADGSWSFIAIDQTRVPGVLQR 135

Query: 148 I 148
           I
Sbjct: 136 I 136


>gi|406980095|gb|EKE01754.1| hypothetical protein ACD_21C00059G0003 [uncultured bacterium]
          Length = 295

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 65/239 (27%), Positives = 101/239 (42%), Gaps = 70/239 (29%)

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           IR+ GVLQRIA+ Y + SL+ +++K                   W       +LV Y  L
Sbjct: 28  IRIPGVLQRIAVCYFIASLIVLYSK-----------------VAWQIAWTVLLLVGYWVL 70

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
           +    VP +   I                     L P  N   Y+D K+L      H   
Sbjct: 71  IKFVPVPGFGAGI---------------------LEPTGNLAWYLDVKLL------HGHT 103

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
           W        ++P  G             F+PEG+LS++ +I + + GV  G ++   K  
Sbjct: 104 WV-------NAPAPG-------------FDPEGILSTLPAIATMLFGVLTGQLL---KSS 140

Query: 319 LARLKQWVTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             +   W+ + G AL+  GL +  ++ +P+NK L+T SY   TSG A++VF   Y L+D
Sbjct: 141 FTQKTVWMLIFGGALIFLGLVM--SHWLPINKNLWTSSYAVFTSGMASVVFGCCYWLID 197


>gi|313225183|emb|CBY20977.1| unnamed protein product [Oikopleura dioica]
          Length = 335

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 78/328 (23%), Positives = 129/328 (39%), Gaps = 68/328 (20%)

Query: 42  RGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA 101
           RG+A+ +MI V++ GG +    HA W G  +AD  MP+F+F++GV++  +   +      
Sbjct: 2   RGIAIGIMIFVNYGGGGYWFFDHAVWFGLTVADLAMPWFMFMMGVSLTFSFNSM------ 55

Query: 102 VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF 161
           VKKV+  +  L+                 T+G         GVLQR A+ Y +VS +++ 
Sbjct: 56  VKKVLRLSYNLV---------------NPTFGT------FPGVLQRFAICYAVVSPLQLA 94

Query: 162 TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADY-G 220
            +    +                WL+   +  ++L L +G          +        G
Sbjct: 95  QQSTSLR----------------WLVILGLEGLWLGLTFGMAEAGCPAGYLGPGGLHRDG 138

Query: 221 KVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAP 280
           +  N + G        C     I++K         HP +        D  + G L+   P
Sbjct: 139 QFRNCSGGADKGCLIYCRE---INQKAHIYGDSTSHPVFW------HDEYWFGDLQAYDP 189

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLH 340
                    EG+L S++SIL   +G+  G +    +    R  +    G  L   G  L 
Sbjct: 190 ---------EGILGSINSILIVFLGLQAGRIFNFYETFQQRAIRLSVWGTVLTAVGGALT 240

Query: 341 FTNA------IPLNKQLYTLSYVCVTSG 362
             N       IP+ K L+TLS+V V +G
Sbjct: 241 GLNQFQEGSNIPIAKNLWTLSFVLVMAG 268


>gi|390958852|ref|YP_006422609.1| hypothetical protein Terro_3042 [Terriglobus roseus DSM 18391]
 gi|390413770|gb|AFL89274.1| Protein of unknown function (DUF1624) [Terriglobus roseus DSM
           18391]
          Length = 406

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 17/112 (15%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW---------NGCNLAD 74
           +  ++ +K QR+ SLDIFRGL +ALMI V+       EI   PW         N     D
Sbjct: 3   ETTRATVKPQRIQSLDIFRGLNIALMIFVNELH----EIKGLPWWTYHAPGAANVMTYVD 58

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGG 122
            V P FL IVG+++ LAL+    R D   +    V+ R++ L+  G++LQ  
Sbjct: 59  MVFPAFLVIVGMSLPLALQARIRRGDETPQLIWYVVLRSVALIVLGLILQNA 110


>gi|29348589|ref|NP_812092.1| hypothetical protein BT_3180 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340494|gb|AAO78286.1| putative transmembrane protein [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 376

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 73/372 (19%), Positives = 149/372 (40%), Gaps = 92/372 (24%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG------DWP-------EISHAPWNGC 70
             + S   T RLASLDI RG  + L++             + P       +  H  W G 
Sbjct: 1   MNKLSEKNTTRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGF 60

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHA 126
              D VMP FLF+ G ++  +L +    + +     ++++ R   L  +G+++QG     
Sbjct: 61  RFWDLVMPLFLFMTGASMPFSLSKYVGMSGSYWLVYRRILRRVFLLFIFGMIVQGNL--- 117

Query: 127 PDELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
                 G+D   I L    LQ IA+ YL+ +++++                    + + W
Sbjct: 118 -----LGLDSSHIYLYSNTLQSIAVGYLIAAVIQLH-------------------FSFRW 153

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
            +   +L++++  +  T++ D                            P  N    +DR
Sbjct: 154 QIGITLLLLFIYWIPMTFLGD--------------------------FTPAGNFAEQVDR 187

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTII 304
            VLG           R +        +G    +  +W  +P +    + SS++  ++ ++
Sbjct: 188 CVLG-----------RFR--------DGVFWNEDGTWSFSPYYNYTWIWSSLTFGVTVML 228

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G   G ++   K +  ++ Q +++   +L+ GL + ++  +P+ K+L+T S   ++ G  
Sbjct: 229 GAFAGKIMKEGKANRKKVVQTLSV-IGVLLVGLAMLWSLQMPIIKRLWTGSMTLLSGGYC 287

Query: 365 ALVFSAIYALVD 376
            L+ +  Y  +D
Sbjct: 288 FLLMALFYYWID 299


>gi|218260819|ref|ZP_03475938.1| hypothetical protein PRABACTJOHN_01602 [Parabacteroides johnsonii
           DSM 18315]
 gi|218224342|gb|EEC96992.1| hypothetical protein PRABACTJOHN_01602 [Parabacteroides johnsonii
           DSM 18315]
          Length = 376

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 85/362 (23%), Positives = 146/362 (40%), Gaps = 86/362 (23%)

Query: 33  QRLASLDIFRGLAVALMILVD---HAGG-----DWPE-----ISHAPWNGCNLADFVMPF 79
           +RL SLD+ RG  +  ++ ++   H  G      W        SH  W G +  D VMP 
Sbjct: 6   KRLESLDVLRGFDLFCLVALEGILHPLGRAIDASWYNDFLWGFSHVQWEGFSSWDLVMPL 65

Query: 80  FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           F+F+ GV++  AL R   +PD+    +++I R   L  +G++ QG           G+D 
Sbjct: 66  FMFMAGVSMPFALSRYKAMPDKWAVYRRIIKRVALLWIFGMMCQGNL--------LGLDP 117

Query: 137 RMIRL-CGVLQRIALSYLLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
             I L    LQ IA+ YL+ +++ +  +  VQ     +G              A  +L+V
Sbjct: 118 GRIYLYSNTLQAIAMGYLIAAMLFLHVRLSVQ-----IGT-------------AVALLLV 159

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y   +        QF  ++     YG              P  N   ++DR VLG     
Sbjct: 160 YWGAM--------QFITVD----GYGG---------GNYTPDGNLAEWVDRTVLGR---- 194

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                 R  A  ++                  +    +LSS++  ++ + G+  G  I+ 
Sbjct: 195 -----FRDAAVVENGQVV----------FAESYRYTWILSSLNFGVTVLTGLFAGQ-ILK 238

Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL 374
           +     R  QW+ +G  + +  L   +   +P+ K+++T S V V+SG   L+    Y  
Sbjct: 239 SAMDQKRKWQWL-LGIGVAMVALGWLWGLQLPVIKKIWTSSMVLVSSGYCFLLMGVFYYW 297

Query: 375 VD 376
           +D
Sbjct: 298 ID 299


>gi|410099161|ref|ZP_11294133.1| hypothetical protein HMPREF1076_03311 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409219183|gb|EKN12146.1| hypothetical protein HMPREF1076_03311 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 377

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 93/375 (24%), Positives = 147/375 (39%), Gaps = 98/375 (26%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWPE-----ISHAPWNGCN 71
            EK+  K  RL SLD  RG  +  ++        L   A   W        +H  W G +
Sbjct: 1   MEKTTYK--RLESLDALRGFDLFFLVALGPLAHSLARAADVGWLNDCMWAFNHVQWEGFS 58

Query: 72  LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK---LLFW--GILLQGG-FSH 125
             D +MP FLF+ G ++  AL R    +D  KK +FR L    LL W  G++ QG     
Sbjct: 59  PWDLIMPLFLFMSGASMPFALSRFKGVSD--KKTLFRRLGKRILLLWIFGMMCQGNLLGF 116

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
            PD        R+      LQ IA  YL+ +++ ++T     +   +G            
Sbjct: 117 DPD--------RIYLYSNTLQSIAAGYLITAVLFLYT----SRRTQIG------------ 152

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
            +A  +L+VY A +        QF  I      YG              P  N   +IDR
Sbjct: 153 -VAVALLLVYWAAM--------QFITI----GSYGG---------GNYTPEGNLAEWIDR 190

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
            VLG         +R             P       W H  +    +LS+++  ++ + G
Sbjct: 191 TVLG--------RFRDGAKVVDGEVVFAP-------WYHYTW----ILSTLNFGVTVLTG 231

Query: 306 VHFGHVIIHTKGHLARLKQWVTMGFALL----IFGLTLHFTNAIPLNKQLYTLSYVCVTS 361
           +  G++       + +LK +   G A++    ++GL +      P+ K ++T S V V+S
Sbjct: 232 LFAGYIAKDKTSDIHKLKWYFGAGAAMVAAGWLWGLQM------PVIKTIWTSSMVLVSS 285

Query: 362 GAAALVFSAIYALVD 376
           G   L+    Y L+D
Sbjct: 286 GYCFLLMGLFYYLID 300


>gi|413922900|gb|AFW62832.1| hypothetical protein ZEAMMB73_935848 [Zea mays]
          Length = 1241

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 36/60 (60%)

Query: 222  VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
            V  V CGVR   +  CNAVG IDRK+LGI H+Y  P + RSK   +++      R+ AP+
Sbjct: 966  VLQVKCGVRGDTSSGCNAVGMIDRKILGIQHLYGRPVYARSKNYRKNTLAASSSRRKAPA 1025


>gi|157374353|ref|YP_001472953.1| hypothetical protein Ssed_1214 [Shewanella sediminis HAW-EB3]
 gi|157316727|gb|ABV35825.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3]
          Length = 378

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 55/93 (59%), Gaps = 4/93 (4%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
           P +PEG+LS++ ++ + + GV  GH II  H KG   ++   +  G A L  G  L F  
Sbjct: 210 PLDPEGILSTIPAVANALAGVFVGHFIIKPHPKGEWFKVVYMLVAGAAFLGLGWLLDFI- 268

Query: 344 AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            +P+NK+L+T S+  VT G + ++ +  YA+VD
Sbjct: 269 -VPVNKELWTSSFTLVTIGWSLILLTVFYAIVD 300



 Score = 45.8 bits (107), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 73/156 (46%), Gaps = 29/156 (18%)

Query: 24  QQEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW--PEISHAPWNGC 70
           +  ++ +  +RL SLD  RG           L   L+      G  W   ++ H+ W+G 
Sbjct: 2   EVAQAKVSKRRLMSLDALRGFDMFWILGGEVLFAGLLAWTGWQGWQWFDTQMHHSEWHGF 61

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV------IFRTLKLLFWGILLQGGFS 124
              D + P F+F+ GVA+ L+ KR+ D+    K++      + R L LLF+GIL   G+ 
Sbjct: 62  TFYDLIFPLFIFLSGVALGLSPKRL-DKLPIAKRMPLYIHAVKRLLLLLFFGILYNHGWG 120

Query: 125 HAPDELTYGVDVRM--IRLCGVLQRIALSYLLVSLV 158
                   GV V +  +R   VL RIA ++   +++
Sbjct: 121 T-------GVPVVLDEVRYASVLGRIAFAWFFAAIL 149


>gi|380693009|ref|ZP_09857868.1| hypothetical protein BfaeM_03398 [Bacteroides faecis MAJ27]
          Length = 376

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 80/372 (21%), Positives = 151/372 (40%), Gaps = 92/372 (24%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG------DWP-------EISHAPWNGC 70
             +KS   T RLASLDI RG  + L++             + P       +  H  W G 
Sbjct: 1   MSKKSENNTSRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGF 60

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF----WGILLQGGFSHA 126
              D VMP FLF+ G ++  +L +    + + + V  R LK +F    +G+++QG     
Sbjct: 61  RFWDLVMPLFLFMTGASMPFSLSKYIGTSGSYRPVYRRILKRVFLLFVFGMIVQGNL--- 117

Query: 127 PDELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
                 G+D + I L    LQ IA+ YL+ +++++                    + + W
Sbjct: 118 -----LGLDGKHIYLYSNTLQSIAVGYLIAAVIQLH-------------------FSFKW 153

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
            +   +L++++  +  T++ D                            P  N    +DR
Sbjct: 154 QIGITLLLLFVYWIPMTFLGD--------------------------FTPAGNFAEQVDR 187

Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTII 304
            VLG           R +        +G    +  +W  +P +    + SS++  ++ ++
Sbjct: 188 WVLG-----------RFR--------DGVYWNEDGTWNFSPYYNYTWIWSSLTFGVTVML 228

Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAA 364
           G   G ++   K    R+ Q +++   LLI GL + ++  +P+ K+L+T S   ++ G  
Sbjct: 229 GAFAGKIMKEGKADRKRVVQVLSVVGVLLI-GLAMLWSLQMPVIKRLWTGSMTLLSGGYC 287

Query: 365 ALVFSAIYALVD 376
            L+ +  Y  +D
Sbjct: 288 FLLMALFYYWID 299


>gi|383124758|ref|ZP_09945419.1| hypothetical protein BSIG_1496 [Bacteroides sp. 1_1_6]
 gi|251841090|gb|EES69171.1| hypothetical protein BSIG_1496 [Bacteroides sp. 1_1_6]
          Length = 376

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 72/368 (19%), Positives = 147/368 (39%), Gaps = 92/368 (25%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG------DWP-------EISHAPWNGCNLAD 74
           S   T RLASLDI RG  + L++             + P       +  H  W G    D
Sbjct: 5   SENNTSRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGFRFWD 64

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
            VMP FLF+ G ++  +L +    + +     ++++ R   L  +G+++QG         
Sbjct: 65  LVMPLFLFMTGASMPFSLSKYVGMSGSYWPVYRRILRRVFLLFIFGMIVQGNL------- 117

Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
             G+D   I L    LQ IA+ Y + +++++                    + + W +  
Sbjct: 118 -LGLDSSHIYLYSNTLQSIAVGYFIAAVIQLH-------------------FSFRWQIGI 157

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
            +L++++  +  T++ D                            P  N    +DR VLG
Sbjct: 158 TLLLLFIYWIPMTFLGD--------------------------FTPAGNFAEQVDRWVLG 191

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHF 308
                      R +        +G    +  +W  +P +    + SS++  ++ ++G   
Sbjct: 192 -----------RFR--------DGVFWNEDGTWSFSPYYNYTWIWSSLTFGVTVMLGAFA 232

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G ++   K +  ++ Q +++   +L+ GL + ++  +P+ K+L+T S   ++ G   L+ 
Sbjct: 233 GKIMKEGKANRKKVVQTLSV-IGVLLVGLAMLWSLQMPIIKRLWTGSMTLLSGGYCFLLM 291

Query: 369 SAIYALVD 376
           +  Y  +D
Sbjct: 292 ALFYYWID 299


>gi|223937685|ref|ZP_03629587.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223893657|gb|EEF60116.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 413

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 91/410 (22%), Positives = 146/410 (35%), Gaps = 98/410 (23%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
           M+  +A+ T   P   S P  +  +     K  RL SLD +RG  + LM         W 
Sbjct: 1   MNPEEAQATLASPTQESRPARTVPE-----KATRLISLDAYRGFVMLLM--ASEGFNMWR 53

Query: 61  ----------------EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKK 104
                           +  H  W GC L D + P F+F+VGVA+  +L     +  +   
Sbjct: 54  MAEQNPNSSFWQFLKYQTEHVDWRGCALWDLIQPSFMFMVGVAMPFSLASRRAKGQSFNT 113

Query: 105 V----IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
           +    ++R++ L+F GI L+    H     TY           VL +I L Y  + L+  
Sbjct: 114 MLGHTLWRSIALVFIGIFLRSVGRHQ----TY------FTFEDVLTQIGLGYTFLFLLAW 163

Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVPDWQFTIINKDSAD 218
               VQ                     A  +LV Y A   LY   V D+          D
Sbjct: 164 TKLRVQFTA------------------AMLILVGYWAAFALYPLPVNDF----------D 195

Query: 219 YGKV-----FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
           Y KV     ++   G  A  +   N    +D+  L +                   P E 
Sbjct: 196 YQKVGIPANWHHLTGFAAHWDKNTNLAAAVDQWFLNL------------------FPREH 237

Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
           P   +   +    F P        S+ + I G+  G  +       ++++  V  G A L
Sbjct: 238 PFVFNGGGYLTLSFVP--------SLATMIFGLLAGQFMREQSTQSSKVRLLVGAGIACL 289

Query: 334 IFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
             G  L  T   P  K+++T S+V  ++G   ++ +  Y ++D   +  W
Sbjct: 290 ALGAVLDMTGICPSVKRIWTPSWVIFSTGWTCILLATFYGIIDWQGYKRW 339


>gi|195167204|ref|XP_002024424.1| GL15027 [Drosophila persimilis]
 gi|194107797|gb|EDW29840.1| GL15027 [Drosophila persimilis]
          Length = 493

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 27/50 (54%), Positives = 34/50 (68%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFF 80
           + +RL SLD FRGL++ LMI V+  GG +  I HA WNG +LAD V PF 
Sbjct: 181 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADLVFPFL 230


>gi|182412825|ref|YP_001817891.1| hypothetical protein Oter_1003 [Opitutus terrae PB90-1]
 gi|177840039|gb|ACB74291.1| conserved hypothetical protein [Opitutus terrae PB90-1]
          Length = 411

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 104/390 (26%), Positives = 163/390 (41%), Gaps = 105/390 (26%)

Query: 34  RLASLDIFRG-------------LAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFF 80
           RL S+D  RG             LA+  M L         ++ H  W G    D + P F
Sbjct: 11  RLVSVDALRGFDMFWILGADALVLALGAMSLSPTLRALAGQLEHKDWAGFAFYDLIFPLF 70

Query: 81  LFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHA-PDELTYGVDV 136
           +FIVGV+   +L  +     RA AVK+++ RTL LL +GI   GG +H  PD        
Sbjct: 71  VFIVGVSTVFSLTSLVAREGRAAAVKRILRRTLLLLAFGIFYNGGLAHQWPD-------- 122

Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
             +RL GVLQRIAL+Y    L+ +F K               R+      + A +LV Y 
Sbjct: 123 --VRLVGVLQRIALAYAAAGLLFVFFKP--------------RVLA---AVTALLLVGYW 163

Query: 197 ALLYGTYVP--DWQF---TIIN------KDS------ADYGKVFN-VTCGVRAKLNPPCN 238
           ALL  T++P  D Q     +++      +D+      A   ++F+  T  V  +  P  N
Sbjct: 164 ALL--TFIPIRDVQLDRAALVSHLPDAPRDAQGFPAEAQVRQLFDQTTARVTGRFEPGLN 221

Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
              ++D   L      H+                              ++PEGLLS++ +
Sbjct: 222 LTNHLDYVYLPGARYDHY------------------------------YDPEGLLSTLPA 251

Query: 299 ILSTIIGVHFGHVIIHTK--GH---LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYT 353
           I + ++G+  G ++  T   GH   +      V    A  ++G+        P+ K+L+T
Sbjct: 252 IATCLLGIFAGLLLRRTDIGGHDKVVTLALAGVAALAAGWLWGIQF------PIIKKLWT 305

Query: 354 LSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
            SYV V  G + L+ +A Y ++D   +  W
Sbjct: 306 SSYVLVAGGWSLLLLAAFYYVIDVRQWRRW 335


>gi|343082821|ref|YP_004772116.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342351355|gb|AEL23885.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 367

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 46/153 (30%), Positives = 68/153 (44%), Gaps = 26/153 (16%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVD------HAGGDWP-------EISHAPWNGCNLA 73
           +  +K  R+ S+D  RG  +  +I  D      H GG  P       + SH  W G    
Sbjct: 3   EQAIKPNRILSIDALRGFDMLFIIFADRFFALLHKGGQTPFTGFLANQFSHPDWFGSTFY 62

Query: 74  DFVMPFFLFIVGVAIALAL-KRIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
           D +MP FLF+VG  I  +L KR+ +   +A   KK+  R L L F G ++QG        
Sbjct: 63  DIIMPLFLFMVGAVIPFSLSKRMQENTGKAQIYKKLFKRVLILFFLGWIVQGNL------ 116

Query: 130 LTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIF 161
               +D+   ++    LQ IA+ Y    L  I+
Sbjct: 117 --LALDINTFKIFSNTLQAIAVGYFFSCLAFIY 147


>gi|167623085|ref|YP_001673379.1| hypothetical protein Shal_1151 [Shewanella halifaxensis HAW-EB4]
 gi|167353107|gb|ABZ75720.1| conserved hypothetical protein [Shewanella halifaxensis HAW-EB4]
          Length = 398

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/171 (29%), Positives = 74/171 (43%), Gaps = 32/171 (18%)

Query: 15  IISEPDVSDQQEKSHLKTQ---------RLASLDIFRGLAV-----------ALMILVDH 54
           II++P  S  +    L+TQ         RL SLD  RG  +           AL++L   
Sbjct: 4   IITKPQSSLIESHLKLQTQSIAKSEAKPRLKSLDALRGFDMFWILGGEAIFAALLVLTGW 63

Query: 55  AGGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAV-----KKVIF 107
           AG  W   ++ H+ WNG    D + P F+F+ GVA+ L+ KR+            K  I 
Sbjct: 64  AGFKWFDGQMHHSVWNGFTFYDLIFPLFIFLSGVALGLSPKRLDKLPLPPRLPLYKHAIK 123

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
           R   LL +G++   G+            V  IR   VL RIA ++   +L+
Sbjct: 124 RLFLLLLFGVIYNHGWGTGAS-----FAVGDIRYASVLGRIAFAWFFCALL 169



 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 58/99 (58%), Gaps = 4/99 (4%)

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLT 338
           ++ +A  +PEG+LS++ ++++ + GV  GH I+  H KG   +L      G AL+  G  
Sbjct: 225 TYQNAAVDPEGILSTIPAVVNGLFGVFVGHFIVKPHVKGEWFKLAVLAASGVALVALGWL 284

Query: 339 LHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           +  +  IP+NK L+T S+  V+SG + L  +  YA++D 
Sbjct: 285 I--SPLIPVNKTLWTSSFTLVSSGWSILFLALFYAVIDV 321


>gi|395804714|ref|ZP_10483949.1| hypothetical protein FF52_22629 [Flavobacterium sp. F52]
 gi|395433102|gb|EJF99060.1| hypothetical protein FF52_22629 [Flavobacterium sp. F52]
          Length = 380

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 82/391 (20%), Positives = 146/391 (37%), Gaps = 117/391 (29%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDH------AGGDWP-------EISHAPWNGCNLAD 74
           S+    RL SLD  RG  +  ++  +H           P       ++ HA WNG    D
Sbjct: 2   SNPTNGRLISLDALRGFVMFWIMSGEHIIHALAKAAPIPIFLWMSSQLHHAEWNGITFYD 61

Query: 75  FVMPFFLFIVGVAIALALKR--------IPDRADAVKK------VIFRTLKLLFWGILLQ 120
            + P FLF+ GV++  + ++         P    + +K      ++ RT+ L+  G ++ 
Sbjct: 62  MIFPVFLFVAGVSMPYSFEKKMNLAGVSTPQELPSKEKRKIYLSMLRRTIILVVLGFVVN 121

Query: 121 G-----GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
           G     GF H              R   VL RI +++    ++ +   +   K Q +   
Sbjct: 122 GLLRFDGFDHT-------------RFASVLGRIGIAWFFAGMIYL---NFDFKKQFI--- 162

Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
                    W +   +L+ Y A +    VP++   ++ K+ +                  
Sbjct: 163 ---------WFLG--ILMGYYAAMKWIPVPEFGAGVLTKEGS------------------ 193

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLS 294
                GYIDR  L                               P   H+  ++PEG+ S
Sbjct: 194 ---LEGYIDRLFL-------------------------------PGRLHSTVYDPEGIFS 219

Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF-ALLIFGLTLHFTNAIPLNKQLYT 353
           ++ +I + ++GV F    +  K   +   + + M   A+L+    L +    P+NK L+T
Sbjct: 220 TLPAISTALLGV-FTGTFLKAKNQFSINAKLILMALTAVLLIIAGLIWDIDFPINKHLWT 278

Query: 354 LSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
            S+VC   G + L F   Y ++D + F  W 
Sbjct: 279 SSFVCFVGGFSILFFIFFYLIIDLSGFHKWA 309


>gi|298386962|ref|ZP_06996516.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|298260112|gb|EFI02982.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 376

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 72/368 (19%), Positives = 147/368 (39%), Gaps = 92/368 (25%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG------DWP-------EISHAPWNGCNLAD 74
           S   T RLASLDI RG  + L++             + P       +  H  W G    D
Sbjct: 5   SENNTSRLASLDILRGFDLFLLVFFQPVFAALARQLNLPFLNDILYQFDHEVWEGFRFWD 64

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
            VMP FLF+ G ++  +L +    + +     ++++ R   L  +G+++QG         
Sbjct: 65  LVMPLFLFMTGASMPFSLSKYVGMSGSYWPVYRRILRRVFLLFIFGMIVQGNL------- 117

Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
             G+D   I L    LQ IA+ Y + +++++                    + + W +  
Sbjct: 118 -LGLDSSHIYLYSNTLQSIAVGYFIAAVIQLH-------------------FSFRWQIGI 157

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
            +L++++  +  T++ D                            P  N    +DR VLG
Sbjct: 158 TLLLLFIYWIPMTFLGD--------------------------FTPAGNFAEQVDRCVLG 191

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHF 308
                      R +        +G    +  +W  +P +    + SS++  ++ ++G   
Sbjct: 192 -----------RFR--------DGVFWNEDGTWSFSPYYNYTWIWSSLTFGVTVMLGAFA 232

Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVF 368
           G ++   K +  ++ Q +++   +L+ GL + ++  +P+ K+L+T S   ++ G   L+ 
Sbjct: 233 GKIMKEGKANRKKVVQTLSV-IGVLLVGLAMLWSLQMPIIKRLWTGSMTLLSGGYCFLLM 291

Query: 369 SAIYALVD 376
           +  Y  +D
Sbjct: 292 ALFYYWID 299


>gi|440747820|ref|ZP_20927075.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
 gi|436483562|gb|ELP39602.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
          Length = 372

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 142/371 (38%), Gaps = 95/371 (25%)

Query: 32  TQRLASLDIFRGLAVALMILVD--------HAGGDWPEI-----SHAPWNGCNLADFVMP 78
           ++RL S+D  RG  + ++   D             W ++      H  W G    DF+ P
Sbjct: 9   SKRLVSIDALRGFDMLMICGADAFFRSLEGKTSFAWVDVLARQFEHPEWIGFTFYDFIFP 68

Query: 79  FFLFIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
            FLF+ GV+I  +L +       + +  KK + RTL L+  G+L +    +AP       
Sbjct: 69  LFLFVAGVSIPFSLGKSLAENVSKREIYKKALSRTLLLIGLGMLDK----NAPFPF---F 121

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
           D   IRL  VL RI ++      V +F        Q +G             +   VL+ 
Sbjct: 122 DWEQIRLGSVLGRIGIA----GFVTVFLFLNFPARQRLG-------------IVGLVLIA 164

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
           Y A ++   VP +                         L+   N  G+IDR  L      
Sbjct: 165 YYAAVFLIPVPGFG---------------------AGNLSFEGNLAGWIDRTFL------ 197

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
                              P R          F+  G+L++  +I  TI+G   G ++  
Sbjct: 198 -------------------PGRL-----LQGSFDELGILTTFPAICLTILGAQAGEIL-- 231

Query: 315 TKGHLARLKQWV-TMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
               L+  ++ V T+ F ++  GL L +    P+ K+++T S++ + +G A +  +  Y 
Sbjct: 232 RNAQLSEQQKVVRTLLFGVVCIGLALIWHLHFPIFKRMWTSSFILLNAGMAFVALAGFYW 291

Query: 374 LVDCNIFGPWT 384
           L+D   F  W+
Sbjct: 292 LIDMLHFRKWS 302


>gi|406832166|ref|ZP_11091760.1| hypothetical protein SpalD1_11017 [Schlesneria paludicola DSM
           18645]
          Length = 413

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 95/397 (23%), Positives = 156/397 (39%), Gaps = 85/397 (21%)

Query: 14  LIISEPDVSDQQEKSHLK-------TQRLASLDIFRGLAVALMI-----LVDHAGGDWPE 61
           +I++ P+ S+ +  + L+         RL S+D +RG  + LM+     L D A    PE
Sbjct: 1   MIVTIPNKSEIEGPATLELPAGGAAPSRLVSVDAYRGWVMLLMMAEVLRLRDVAKAL-PE 59

Query: 62  I----------SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKK----VIF 107
                      SH  W GC L D + P F F+VGVA+ L+L+R       + +      +
Sbjct: 60  SRLWAFLAQQQSHVTWVGCVLHDMIQPSFSFLVGVALPLSLRRRSLSGQPLWQRTAHAAW 119

Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
           R+L L+  G+ L+          T+    R       L +I L YL + L+ + ++  Q 
Sbjct: 120 RSLVLILLGVFLRS---------THSTQTRWT-FEDTLSQIGLGYLFLYLLSLCSRRAQ- 168

Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY-VPDWQFTIINKDSADYGKVFNVT 226
                            W     +LV Y  L +  Y +P   F        D+ K     
Sbjct: 169 -----------------WAALISILVGYW-LFFALYPLPGADF--------DWAK----- 197

Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
             ++   N P +  G+     L  N     PAW          P + P   +   +    
Sbjct: 198 --IKGDPNSPSHLSGFAAHWNLNTN-----PAWAFDTWFLNLFPRQAPFTHNGGGYSTLN 250

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIP 346
           F P        ++ + I+G+  G VI+  +    +L   V+ G   L  GL L      P
Sbjct: 251 FIP--------TLATMILGLLAGGVIVSDRSTRFKLIWLVSAGLIGLASGLILDAAGLCP 302

Query: 347 LNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
           + K+++T S+V  + G   L+ +A YA+VD   +  W
Sbjct: 303 IVKKIWTPSWVLFSGGWCFLILAAWYAVVDVAKWQSW 339


>gi|340617673|ref|YP_004736126.1| hypothetical protein zobellia_1684 [Zobellia galactanivorans]
 gi|339732470|emb|CAZ95738.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
          Length = 346

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/325 (23%), Positives = 122/325 (37%), Gaps = 97/325 (29%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALAL-KRIP--DRADAVKKVIFRTLKLLFWGI 117
           ++ H PWNG    D + PFF+FIVGVA+  +L KR+   D+    K ++ R   L  +G 
Sbjct: 30  QLHHHPWNGLRFWDLIQPFFMFIVGVAMPFSLRKRLASGDKKGVTKHILRRCFLLFAFGA 89

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQ---SVGR 174
           LL   +SHA           +  L  VL ++A + L+   V     ++ +K Q   SVG 
Sbjct: 90  LLHCVYSHA----------LVWELWNVLVQLAFTILIAYAV----MNLSNKTQIGISVG- 134

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                            L+V   +LY  Y P+  +    ++   +G              
Sbjct: 135 -----------------LLVLTEVLYRAYNPEAPYV---QNHESFG-------------- 160

Query: 235 PPCNAVGYIDRKVLG-INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
                  Y+D  V+G +N  Y                           W    F P    
Sbjct: 161 ------AYVDMLVMGQVNDGY---------------------------WVFVNFIPTA-- 185

Query: 294 SSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYT 353
                   TI GV  G V++       ++K ++  G  L++ G  + F    P+ K++ T
Sbjct: 186 ------AHTIWGVICGKVLLSHIPSTQKIKPFLIWGAVLVVLGFAMDFVGITPIVKRIAT 239

Query: 354 LSYVCVTSGAAALVFSAIYALVDCN 378
            S+   + G A L  S  Y L+D  
Sbjct: 240 SSFTLASGGIAILTLSLFYWLIDIK 264


>gi|440731410|ref|ZP_20911431.1| membrane protein [Xanthomonas translucens DAR61454]
 gi|440373102|gb|ELQ09871.1| membrane protein [Xanthomonas translucens DAR61454]
          Length = 384

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 66/121 (54%), Gaps = 5/121 (4%)

Query: 33  QRLASLDIFRGLAVALMILVD--HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +R  SLD+FRGL + LMIL +   AG D + ++ HAPW G   AD   P FLF+VG A++
Sbjct: 16  ERFLSLDVFRGLMIFLMILGNTPGAGADAFVQLRHAPWLGFTAADVGFPSFLFVVGNAMS 75

Query: 90  LALKRIPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRMIRLCGVLQR 147
            AL R       + +V  R+  +   G L+    F H   + ++  + +   R+ GVLQR
Sbjct: 76  FALDRSQPLGAFLCRVGKRSALIFLLGFLMYWFPFVHQGADGSWSFIAIDQTRVPGVLQR 135

Query: 148 I 148
           I
Sbjct: 136 I 136


>gi|399069322|ref|ZP_10749357.1| Protein of unknown function (DUF1624), partial [Caulobacter sp.
           AP07]
 gi|398045229|gb|EJL37978.1| Protein of unknown function (DUF1624), partial [Caulobacter sp.
           AP07]
          Length = 233

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 79/303 (26%), Positives = 122/303 (40%), Gaps = 92/303 (30%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAG--------GDWPEISHAPWNGCNLADFVMPF 79
           S  K  RLASLD+ RGL +  MI+V+ A           +  + HA W G   AD V P 
Sbjct: 3   SRPKAARLASLDVLRGLTIVGMIVVNTASYLHYVSGYAVFAGLEHAEWRGFTAADAVFPA 62

Query: 80  FLFIVGVAIALALKRIP------DR------ADAVKKVIFRTLKLLFWGILLQGGFSHAP 127
           F+F+ GV+I LAL  +       +R        A+++++ R+ +L   G++L   +  A 
Sbjct: 63  FVFMTGVSIPLALGPLALGDGPIERGMAGLDGAALRRLLVRSGRLFLLGLILSNLYWMAT 122

Query: 128 DELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
            E        + R  GVLQR+AL+        +  K +  + + +              +
Sbjct: 123 PESV------LFRPMGVLQRLALA---FLAAAVLYKTLGPRARMI--------------L 159

Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
           A  +L +Y           W  T++               G    L P  N VG+ DR V
Sbjct: 160 AVAILALY-----------WPLTLLPFPD-----------GTTDLLRPGANFVGWFDRAV 197

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
           LG  H Y H                GPL           ++PEGLLS++ ++   ++GV 
Sbjct: 198 LG-AHTYVH----------------GPLG----------YDPEGLLSTLPAVAQALLGVA 230

Query: 308 FGH 310
            G 
Sbjct: 231 AGQ 233


>gi|423280893|ref|ZP_17259805.1| hypothetical protein HMPREF1203_04022 [Bacteroides fragilis HMW
           610]
 gi|404583534|gb|EKA88212.1| hypothetical protein HMPREF1203_04022 [Bacteroides fragilis HMW
           610]
          Length = 375

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 78/365 (21%), Positives = 145/365 (39%), Gaps = 99/365 (27%)

Query: 34  RLASLDIFRGLAVALMI--------LVDHAGGDW-----PEISHAPWNGCNLADFVMPFF 80
           RLASLDI RG  + L++        L       W      +  H  W G    D VMP F
Sbjct: 11  RLASLDILRGFDLFLLVFFQPVLWALAHQLNAPWLNSILSQFDHEVWEGFRFWDLVMPLF 70

Query: 81  LFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           LF+ G ++  +  +    PD+    +K+I R + L  +G+++QG           G++ +
Sbjct: 71  LFMTGASMPFSFSKFKDDPDKGTIYRKIIRRFILLFIFGMIVQGNL--------LGLNPK 122

Query: 138 MIRL-CGVLQRIALSYLLVSLVEI---FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
            + L    LQ IA  YL+ +++ +   F + +                    ++ A +L+
Sbjct: 123 YLYLYSNTLQAIATGYLIAAIILLHCSFRRQL--------------------IVTALLLL 162

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           +Y A +  T++ D                            P  N    ID+ VLG    
Sbjct: 163 IYWAPM--TFLGD--------------------------FTPEGNFAEKIDKLVLG---- 190

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHFGHVI 312
                  R +        +G    +  +W  +P +    + SS++   + ++G   G ++
Sbjct: 191 -------RFR--------DGVYWNEDGTWNFSPHYNYTWIWSSLTFGATVMLGTFAGKIM 235

Query: 313 IHTKGHLAR-LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
              K +  + ++  + +G AL+ F L   ++  +P+ K+L+T S    + G   L+  A 
Sbjct: 236 KAGKDNRRKVVRTLLVIGIALVAFSLV--WSLQMPIIKRLWTSSMTLFSGGLCFLLMGAF 293

Query: 372 YALVD 376
           Y  +D
Sbjct: 294 YYWID 298


>gi|392544017|ref|ZP_10291154.1| hypothetical protein PpisJ2_19642 [Pseudoalteromonas piscicida JCM
           20779]
          Length = 377

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 60/108 (55%), Gaps = 8/108 (7%)

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF----ALLIFG 336
           S+ + P +PEG+LSS+ +I++ I GV  G  I + +       +W T+G      +L+  
Sbjct: 204 SYQNRPVDPEGVLSSLPAIVNAIAGVFAGRAIANAQTQ----GEWKTVGILAGSGVLVLA 259

Query: 337 LTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           L   +    P+NK+L+T S+V VT G +A++ +  YA+VD   F  W 
Sbjct: 260 LGWLWDMQFPVNKELWTSSFVLVTVGWSAILLAVFYAIVDVLSFQRWA 307



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 63/148 (42%), Gaps = 27/148 (18%)

Query: 31  KTQRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS----HAPWNGCNLADF 75
           K +RLASLD  RG+ +           AL +L    G  W        H+PW+G    D 
Sbjct: 6   KPKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQG--WKAFEAHTLHSPWHGFTFYDL 63

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF-----WGILLQGGFSHAPDEL 130
           + P F+F+ GVA+ L+ KRI       +K  +            +G+L   G+       
Sbjct: 64  IFPLFIFLSGVAMGLSPKRIDHLPFNERKPFYLKALKRLLLLCAFGVLYNHGWGTGIP-- 121

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLV 158
              +D   IR   VL RIA ++   +L+
Sbjct: 122 ---MDPDGIRYASVLGRIAFAWFFCALL 146


>gi|409198223|ref|ZP_11226886.1| hypothetical protein MsalJ2_14356 [Marinilabilia salmonicolor JCM
           21150]
          Length = 394

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 43/120 (35%), Positives = 64/120 (53%), Gaps = 11/120 (9%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           T R+ S+DI R + VALM+ V+   G      W   + A  +G  LAD V P FLF VG+
Sbjct: 7   THRIKSIDILRAITVALMVFVNDLPGIRDIPQWLGHASAGHDGMFLADIVFPLFLFWVGM 66

Query: 87  AIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
           +I LA+     + D+    V+ ++ RT  L+F G+L+    S    E+T G+D  +  LC
Sbjct: 67  SIPLAVDGRQKKGDSDLTIVRHILKRTFSLVFIGVLMVNT-SRMSVEVT-GIDRNLWALC 124


>gi|294139796|ref|YP_003555774.1| hypothetical protein SVI_1025 [Shewanella violacea DSS12]
 gi|293326265|dbj|BAJ00996.1| conserved hypothetical protein [Shewanella violacea DSS12]
          Length = 378

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 57/94 (60%), Gaps = 4/94 (4%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
           P +PEG+LS++ ++ + + GV  GH I+  H +G  A++   +  G  +L  G  ++   
Sbjct: 210 PLDPEGILSTIPAVANAMAGVFVGHFIVKEHKQGEWAKVVCLLLSGAFILALGWWVNLI- 268

Query: 344 AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
            IP+NK L+T S+V VT+G + ++ +  YALVD 
Sbjct: 269 -IPVNKDLWTSSFVLVTTGWSIILLAIFYALVDV 301



 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 66/145 (45%), Gaps = 25/145 (17%)

Query: 33  QRLASLDIFRG-----------LAVALMILVDHAGGDWP--EISHAPWNGCNLADFVMPF 79
           +RL SLD  RG           L   L+      G  W   ++ H+ W+G    D + P 
Sbjct: 11  RRLMSLDALRGFDMFWILGGEALFAGLLAWSSWQGWQWADAQMHHSQWHGFTFYDLIFPL 70

Query: 80  FLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGF-SHAPDELTYG 133
           F+F+ GVA+ L+ KR     I  R    K  + R   LLF+G+L   G+ + AP      
Sbjct: 71  FIFLSGVALGLSPKRLDKLPIAQRMPLYKHSVKRLFLLLFFGVLYNHGWGTGAP------ 124

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLV 158
           V +  +R   VL RIA ++   +++
Sbjct: 125 VAIDEVRYASVLGRIAFAWFFAAML 149


>gi|392551353|ref|ZP_10298490.1| hypothetical protein PspoU_08780 [Pseudoalteromonas spongiae
           UST010723-006]
          Length = 379

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 74/160 (46%), Gaps = 34/160 (21%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS--HAPW 67
           +SD  +K+    +RLASLD  RG  +           AL +L   AG    E    H+ W
Sbjct: 1   MSDTNKKTK---KRLASLDALRGFDMFWILGGEKIFAALFVLTGWAGWKVAEAQTLHSQW 57

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIP-----DRADAVKKVIFRTLKLLFWGILLQGG 122
           +G    D + P F+F+ GVA+ L+ KRI      DR     K   R L L F+G+L   G
Sbjct: 58  HGFTFYDLIFPLFIFLSGVAMGLSPKRIDHLPFVDRKPIYIKAFKRLLLLCFFGVLYNHG 117

Query: 123 FSHA----PDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
           +       P+E         +R   VL RIA+++ + +++
Sbjct: 118 WGTGVPLNPEE---------VRYASVLGRIAVAWFVAAML 148



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 8/97 (8%)

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF----ALLIFGLTL 339
           + P +PEG+LSS+ +I + + GV  G  I   +       +W T G      L+   +  
Sbjct: 209 NKPMDPEGILSSIPAIANALFGVIAGRYIKQAQER----GEWKTAGILFAAGLVALAVGW 264

Query: 340 HFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            +    P+NK L+T S+V VT G + ++ +  YA+VD
Sbjct: 265 LWNMVFPVNKDLWTSSFVMVTVGWSFILLAVFYAVVD 301


>gi|313149262|ref|ZP_07811455.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|313138029|gb|EFR55389.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
          Length = 375

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 78/365 (21%), Positives = 143/365 (39%), Gaps = 99/365 (27%)

Query: 34  RLASLDIFRGLAVALMI--------LVDHAGGDW-----PEISHAPWNGCNLADFVMPFF 80
           RLASLDI RG  + L++        L       W      +  H  W G    D VMP F
Sbjct: 11  RLASLDILRGFDLFLLVFFQPVLWALAHQLNAPWLNSILSQFDHEVWEGFRFWDLVMPLF 70

Query: 81  LFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
           LF+ G ++  +  +    PD+    +K+I R + L  +G+++QG           G++ +
Sbjct: 71  LFMTGASMPFSFSKFKDDPDKGPIYRKIIRRFILLFIFGMIVQGNL--------LGLNPK 122

Query: 138 MIRL-CGVLQRIALSYLLVSLVEI---FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
            + L    LQ IA  YL+ +++ +   F + +                    ++ A +L+
Sbjct: 123 YLYLYSNTLQAIATGYLIAAIILLHCSFRRQL--------------------IVTALLLL 162

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
           +Y A +  T++ D                            P  N    ID+ VLG  H 
Sbjct: 163 IYWAPM--TFLGD--------------------------FTPEGNFAEKIDKLVLG--HF 192

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHFGHVI 312
                W                  +  +W  +P +    + SS++   + ++G   G ++
Sbjct: 193 RDGVYW-----------------NEDGTWNFSPHYNYTWIWSSLTFGATVMLGTFAGKIM 235

Query: 313 IHTKGHLAR-LKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAI 371
              K +  + ++  + +G AL+ F L   ++  +P+ K+L+T S    + G   L+  A 
Sbjct: 236 KAGKDNRRKVVRTLLVIGIALVAFSLV--WSLQMPIIKRLWTSSMTLFSGGLCFLLMGAF 293

Query: 372 YALVD 376
           Y  +D
Sbjct: 294 YYWID 298


>gi|305665830|ref|YP_003862117.1| hypothetical protein FB2170_06080 [Maribacter sp. HTCC2170]
 gi|88710601|gb|EAR02833.1| hypothetical protein FB2170_06080 [Maribacter sp. HTCC2170]
          Length = 346

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 74/322 (22%), Positives = 119/322 (36%), Gaps = 95/322 (29%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALAL-KRIP--DRADAVKKVIFRTLKLLFWGI 117
           ++ H PWNG    D + PFF+FIVGVA+  +L KR+    R  A + ++ R   L  +G 
Sbjct: 30  QLHHHPWNGLRFWDLIQPFFMFIVGVAMPFSLRKRLASGSRKSATRHILKRCFLLFAFGA 89

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQ---SVGR 174
           LL   +SHA           +  L  VL ++A +     L+  F   + +K Q   S+G 
Sbjct: 90  LLHCVYSHA----------LVWELWNVLVQLAFTI----LIAYFIMSLSNKAQIGISIG- 134

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                            L++   +LY TY P   +    +    +G              
Sbjct: 135 -----------------LLLLTEILYRTYNPSAPYV---QGHESFGS------------- 161

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
                  +ID  V+G                          + ++  W    F P     
Sbjct: 162 -------FIDMLVMG--------------------------KTNSGYWVVVNFLPTA--- 185

Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTL 354
                  TI GV  G +++ +     +LK  V  G  LLI G ++   N  P+ K++ T 
Sbjct: 186 -----AHTIWGVVCGKLLLSSATDQNKLKPIVIWGVVLLILGYSMDLLNITPIVKRIATT 240

Query: 355 SYVCVTSGAAALVFSAIYALVD 376
           S+   + G   L  +  Y  +D
Sbjct: 241 SFTMASGGWCLLTLALFYWWID 262


>gi|307700906|ref|ZP_07637931.1| putative membrane protein [Mobiluncus mulieris FB024-16]
 gi|307613901|gb|EFN93145.1| putative membrane protein [Mobiluncus mulieris FB024-16]
          Length = 442

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 92/384 (23%), Positives = 148/384 (38%), Gaps = 96/384 (25%)

Query: 2   SEIKAETTHHHPLIISEPDV--SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG--- 56
           ++ +A TT       +EP+   ++Q E    K  R+ SLD+ RG    L++ V  A    
Sbjct: 56  TQSEAATTRQ-----TEPNTGETNQAETKPAKPGRITSLDVGRGWF--LIMSVTSAAWLL 108

Query: 57  --GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF 114
              DW  + HAPW G    D + P F+ + G+ +A A     +R     KV  R + +L 
Sbjct: 109 PRPDW--LIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH---NRVS--FKVTLRRIVVLV 161

Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
              LL  G S          D    R  G LQ  A+   +++   +F ++          
Sbjct: 162 VVGLLYNGVSSGQ------WDPATFRFTGPLQVYAVIVAIIATCHLFARN---------- 205

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                       MA   +   +A+L    +  W  T                 GV   L+
Sbjct: 206 -----------WMAWAGITAGVAVLQTGLLTWWAGT--------------CPSGV---LS 237

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
           P CN  G  DR +LG  HMY                + G L  D          PEGL++
Sbjct: 238 PSCNPSGMWDRALLGA-HMY----------------YGGFLGHD----------PEGLVA 270

Query: 295 SVSSILSTIIGVHFGHVIIHTK--GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
              ++L+   G   GH+ + ++  G      + + +  A+ +FGL L  T  +P  K+L+
Sbjct: 271 ITGALLTAAAGTTAGHLALSSRRLGWKTGPVKLLALAAAMSVFGLIL--TIWVPAFKRLW 328

Query: 353 TLSYVCVTSGAAALVFSAIYALVD 376
           T S+  +      L+F+  +   D
Sbjct: 329 TPSFSLIAGAVGVLIFAVAFLCFD 352


>gi|269978070|ref|ZP_06185020.1| putative membrane protein [Mobiluncus mulieris 28-1]
 gi|269933579|gb|EEZ90163.1| putative membrane protein [Mobiluncus mulieris 28-1]
          Length = 442

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 92/384 (23%), Positives = 148/384 (38%), Gaps = 96/384 (25%)

Query: 2   SEIKAETTHHHPLIISEPDV--SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG--- 56
           ++ +A TT       +EP+   ++Q E    K  R+ SLD+ RG    L++ V  A    
Sbjct: 56  TQSEAATTRQ-----TEPNTGETNQTETKPAKPGRITSLDVGRGWF--LIMSVTSAAWLL 108

Query: 57  --GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF 114
              DW  + HAPW G    D + P F+ + G+ +A A     +R     KV  R + +L 
Sbjct: 109 PRPDW--LIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH---NRVS--FKVTLRRIVVLV 161

Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
              LL  G S          D    R  G LQ  A+   +++   +F ++          
Sbjct: 162 VVGLLYNGVSSGQ------WDPATFRFTGPLQVYAVIVAIIATCHLFARN---------- 205

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                       MA   +   +A+L    +  W  T                 GV   L+
Sbjct: 206 -----------WMAWAGITAGVAVLQTGLLTWWAGT--------------CPSGV---LS 237

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
           P CN  G  DR +LG  HMY                + G L  D          PEGL++
Sbjct: 238 PSCNPSGMWDRALLGA-HMY----------------YGGFLGHD----------PEGLVA 270

Query: 295 SVSSILSTIIGVHFGHVIIHTK--GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
              ++L+   G   GH+ + ++  G      + + +  A+ +FGL L  T  +P  K+L+
Sbjct: 271 ITGALLTAAAGTTAGHLALSSRRLGWKTGPVKLLALAAAMSVFGLIL--TIWVPAFKRLW 328

Query: 353 TLSYVCVTSGAAALVFSAIYALVD 376
           T S+  +      L+F+  +   D
Sbjct: 329 TPSFSLIAGAVGVLIFAVAFLCFD 352


>gi|227875179|ref|ZP_03993321.1| possible heparan-alpha-glucosaminide N-acetyltransferase
           [Mobiluncus mulieris ATCC 35243]
 gi|227844084|gb|EEJ54251.1| possible heparan-alpha-glucosaminide N-acetyltransferase
           [Mobiluncus mulieris ATCC 35243]
          Length = 399

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 91/384 (23%), Positives = 148/384 (38%), Gaps = 96/384 (25%)

Query: 2   SEIKAETTHHHPLIISEPDV--SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG--- 56
           ++ +A TT       +EP+   ++Q E    K  R+ SLD+ RG    L++ V  A    
Sbjct: 13  TQSEAATTRQ-----TEPNTGETNQAETKPAKPGRITSLDVGRGWF--LIMSVTSAAWLL 65

Query: 57  --GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF 114
              DW  + HAPW G    D + P F+ + G+ +A A     +R     KV  R + +L 
Sbjct: 66  PRPDW--LIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH---NRVS--FKVTLRRIVVLV 118

Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
              LL  G S          D    R  G LQ  A+   +++   +F ++          
Sbjct: 119 VVGLLYNGVSSGQ------WDPATFRFTGPLQVYAVIVTIIATCHLFARN---------- 162

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                       MA   +   +A+L    +  W  T                 GV   L+
Sbjct: 163 -----------WMAWAGITAGVAVLQTGLLTWWAGT--------------CPSGV---LS 194

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
           P CN  G  DR +LG  HMY                + G L  D          PEGL++
Sbjct: 195 PSCNPSGMWDRALLG-AHMY----------------YGGFLGHD----------PEGLVA 227

Query: 295 SVSSILSTIIGVHFGHVIIHTK--GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
              ++L+   G   GH+ + ++  G      + + +  A+ +FGL L+    +P  K+L+
Sbjct: 228 ITGALLTAAAGTTAGHLALSSRRLGWKTGPVKLLALAAAMSVFGLILNIW--VPAFKRLW 285

Query: 353 TLSYVCVTSGAAALVFSAIYALVD 376
           T S+  +      L+F+  +   D
Sbjct: 286 TPSFSLIAGAVGVLIFAVAFLCFD 309


>gi|260684359|ref|YP_003215644.1| hypothetical protein CD196_2626 [Clostridium difficile CD196]
 gi|260688018|ref|YP_003219152.1| hypothetical protein CDR20291_2673 [Clostridium difficile R20291]
 gi|260210522|emb|CBA65033.1| putative membrane protein [Clostridium difficile CD196]
 gi|260214035|emb|CBE06182.1| putative membrane protein [Clostridium difficile R20291]
          Length = 352

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 62/125 (49%), Gaps = 17/125 (13%)

Query: 50  ILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADA 101
           ++V +  G W    P++ HA W+G  LADF  PFF+  +GV I ++    LK        
Sbjct: 1   MIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTIPISINSKLKNNKSTLSI 60

Query: 102 VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF 161
           +  +  R++ L+ +G  L   +   P       D+  +R+ GVLQR+ L Y + SLV + 
Sbjct: 61  ILSIFKRSILLILFGFFLN--YLGNP-------DLNSVRILGVLQRMGLVYFVTSLVYLL 111

Query: 162 TKDVQ 166
            K + 
Sbjct: 112 LKKLN 116



 Score = 45.8 bits (107), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           FEP+G L+S+ +I S ++G   G V++    G   +  + + M   LLI      F    
Sbjct: 170 FEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FNQYF 227

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD----CNIFGP 382
           P NK+L++ S+V + +G+  ++ S  Y + D      IF P
Sbjct: 228 PFNKRLWSSSFVLLMAGSYGMLLSIFYFICDIKNKSKIFTP 268


>gi|343086706|ref|YP_004776001.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342355240|gb|AEL27770.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 368

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 15/96 (15%)

Query: 31  KTQRLASLDIFRGLAVALMI-----LVDHAGGDWPEIS----------HAPWNGCNLADF 75
             +RL SLD +RG+ + L++     L     G +PE+S          H PWNG    D 
Sbjct: 8   SNKRLVSLDAYRGITMFLLVAESARLYGAFEGLFPEVSGWQMFFTQFTHHPWNGLRFWDL 67

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
           + PFF+FIVGVA+  +L +  ++    +KV    LK
Sbjct: 68  IQPFFMFIVGVAMPFSLNKRLEKQGDRRKVTLHILK 103


>gi|345881756|ref|ZP_08833266.1| hypothetical protein HMPREF9431_01930 [Prevotella oulorum F0390]
 gi|343918415|gb|EGV29178.1| hypothetical protein HMPREF9431_01930 [Prevotella oulorum F0390]
          Length = 380

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 80/372 (21%), Positives = 145/372 (38%), Gaps = 99/372 (26%)

Query: 31  KTQRLASLDIFRGLAVALMILVDH-----------AGGD-----WPEISHAPWNGCNLAD 74
           + QRL SLDI RG  +A+++L+             A G        ++SH PW G    D
Sbjct: 9   QPQRLLSLDILRGADLAMLVLIQPILFRALKTAHPAEGTIGHFIMGQLSHLPWEGFCFWD 68

Query: 75  FVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK--LLFW--GILLQGGFSHAPDEL 130
            +MP F+F+ G+ I  A+ R    A    +  +R +K  ++ W  G+++QG         
Sbjct: 69  IIMPLFMFMSGITIPFAMSRYKRGARIDGQFYWRIIKRFVVLWVLGMVVQGNL------- 121

Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
               D+R + L    LQ IA+ Y+ V+ + +F      + Q V                +
Sbjct: 122 -LAFDLRQLHLFSNTLQSIAVGYVAVAFLFVFCS---LRTQIVA--------------VS 163

Query: 190 CVLVVYLAL--LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
              + Y+A+  L+G +     FTI                          N    IDR V
Sbjct: 164 LSFIAYIAIFALWGHF----DFTI------------------------DANICEAIDRAV 195

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGV 306
           LG                     F   ++    +W   P +    ++SS++ I++  +G 
Sbjct: 196 LG--------------------RFRDGVQWQGDTWHWDPTYHYTWIMSSLNFIVTVYLGT 235

Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
             G+++  T+  + + +  +  G A++   L +H    IP+ K +++ S      G   +
Sbjct: 236 LAGYLLKSTRTAMQKFRWLMWGGVAMIAVSLLMH--PWIPIIKHIWSSSMTLFAGGICFV 293

Query: 367 VFSAIYALVDCN 378
           +    Y ++D  
Sbjct: 294 LMGLFYYIIDVK 305


>gi|255037019|ref|YP_003087640.1| hypothetical protein Dfer_3263 [Dyadobacter fermentans DSM 18053]
 gi|254949775|gb|ACT94475.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 380

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 91/389 (23%), Positives = 149/389 (38%), Gaps = 99/389 (25%)

Query: 16  ISEPDVSDQQE-KSHLKTQRLASLDIFRGLAVALMI-----LVDHAGGDW---------P 60
           +  P V  ++E +      RLAS+D  RG  + LMI      +   GG            
Sbjct: 1   MEAPSVVVKKEVRPSSSPGRLASIDALRGFDM-LMIAGGGQFIATLGGKTGISFIDAVAA 59

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALAL-----KRIPDRADAVKKVIFRTLKLLFW 115
           +  H  WNG    DF+ P FLF+ G ++A ++     K IP       KV  R L L+  
Sbjct: 60  QFEHPAWNGFTFYDFIFPLFLFLAGTSLAFSVTGGLAKGIPPSVIR-NKVFKRMLILIAL 118

Query: 116 GILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
           GIL +    +AP ++    D   IR   VL RI L+  + +++ +     Q         
Sbjct: 119 GILDK----NAPMDI---FDPAHIRYGSVLGRIGLATFISAILYMKFGTNQ--------- 162

Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
              RLY     +   +LV Y   L    VP +        S D              L  
Sbjct: 163 ---RLY-----IGVGILVAYYLTLILVPVPGF-------GSGD--------------LTF 193

Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
             N VG+IDR  +                        G L++         ++   LL+ 
Sbjct: 194 EGNLVGWIDRNFM-----------------------PGILKQ-------GTYDELALLTQ 223

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLS 355
             ++  T+ G   G V++      +++ + +      +  G+  +F  A P+NK L++ S
Sbjct: 224 FPALCLTLFGTVAGDVLLRENRGNSKIGKLLLFASTGISIGILWNF--AFPINKHLWSSS 281

Query: 356 YVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           ++ +TSG A  + +  Y ++D   F  W 
Sbjct: 282 FIMLTSGMAFAMLALFYWIIDVKGFQKWA 310


>gi|430747657|ref|YP_007206786.1| hypothetical protein Sinac_7036 [Singulisphaera acidiphila DSM
           18658]
 gi|430019377|gb|AGA31091.1| hypothetical protein Sinac_7036 [Singulisphaera acidiphila DSM
           18658]
          Length = 418

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 77/380 (20%), Positives = 140/380 (36%), Gaps = 76/380 (20%)

Query: 16  ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMIL----VDHAGGDWPEIS-------- 63
           ++ P   +        ++RLAS+D FRG  + L++     +      +P+          
Sbjct: 15  VNAPKPPESSGSGSAPSRRLASIDAFRGFVMFLLLAEWLKLPQVAKSFPKSELWALLSRH 74

Query: 64  --HAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGI 117
             H  W GC+L D + P F F+VGVA+  ++     R  +  ++     +R L L+  GI
Sbjct: 75  QQHVEWVGCSLHDLIQPSFSFLVGVALPFSIASRLARGQSTTRMAGHAFWRALVLVLLGI 134

Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSI 177
            L+          + G D         L +I L Y  + L+ +  +  +D+         
Sbjct: 135 FLR----------SMGKDRTNFTFEDTLTQIGLGYGFLFLLGL--RPARDQ--------- 173

Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKD-SADYGKVFNVTCGVRAKLNPP 236
                  W+    +LV Y         P  +F   + D   D+    +   G  +  N  
Sbjct: 174 -------WIALVVILVGYWGAFALYPAPGTEFDYTSVDVPKDWAHNLS---GFASHWNKN 223

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
            NA    D   L +                   P + P   +   +          LS +
Sbjct: 224 SNAAWAFDTWFLNL------------------FPRKAPFIDNRGGYAT--------LSFI 257

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSY 356
            ++ + I G+  G+V+   +    +L      G   L  G  L   +  P+ K+++T S+
Sbjct: 258 PTLATMIFGLIAGNVLKDDRKAWMKLGWLTAAGILGLFLGAALGELDFCPVVKRIWTPSW 317

Query: 357 VCVTSGAAALVFSAIYALVD 376
              ++G   L+ +A YA+VD
Sbjct: 318 TLFSAGWCFLILAAFYAVVD 337


>gi|320105641|ref|YP_004181231.1| hypothetical protein AciPR4_0402 [Terriglobus saanensis SP1PR4]
 gi|319924162|gb|ADV81237.1| hypothetical protein AciPR4_0402 [Terriglobus saanensis SP1PR4]
          Length = 406

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 54/106 (50%), Gaps = 17/106 (16%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW---------NGCNLADFVMPFF 80
           L  QR+ SLDIFRGL +ALMI V+       EI   PW         +     D V P F
Sbjct: 10  LAPQRILSLDIFRGLNIALMIFVNELA----EIKGLPWWTYHAPGKVDVMTYVDMVFPGF 65

Query: 81  LFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGG 122
           LFI+G+AI LAL     + D+    +  +  R+  LL  GI+L+ G
Sbjct: 66  LFILGMAIPLALNARIRKGDSPATLLGYIALRSAALLVLGIILENG 111


>gi|306818439|ref|ZP_07452162.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35239]
 gi|304648612|gb|EFM45914.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35239]
          Length = 399

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 92/384 (23%), Positives = 150/384 (39%), Gaps = 96/384 (25%)

Query: 2   SEIKAETTHHHPLIISEPDV--SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG--- 56
           ++ +A TT       +EP+   ++Q E    K  R+ SLD+ RG    L++ V  A    
Sbjct: 13  TQSEAATTRQ-----TEPNTGETNQTETKPAKPGRITSLDVGRGWF--LIMSVTSAAWLL 65

Query: 57  --GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF 114
              DW  + HAPW G    D + P F+ + G+ +A A     +R     KV  R + +L 
Sbjct: 66  PRPDW--LIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH---NRVS--FKVTLRRIVVLV 118

Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
              LL  G S          D    R  G LQ  A+   +++   +F ++          
Sbjct: 119 VVGLLYNGVSSGQ------WDPATFRFTGPLQVYAVIVAIIATCHLFARN---------- 162

Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
                     W++ A +    +A+L    +  W  T                 GV   L+
Sbjct: 163 ----------WMVWAGI-TAGVAVLQTGLLTWWAGT--------------CPSGV---LS 194

Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
           P CN  G  DR +LG  HMY                + G L  D          PEGL++
Sbjct: 195 PSCNPSGMWDRALLG-AHMY----------------YGGFLGHD----------PEGLVA 227

Query: 295 SVSSILSTIIGVHFGHVIIHTK--GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
              ++L+   G   GH+ + ++  G      + + +  A+ +FGL L  T  +P  K+L+
Sbjct: 228 ITGALLTAAAGTTAGHLALSSRRLGWKTGPVKLLALAAAMSVFGLIL--TIWVPAFKRLW 285

Query: 353 TLSYVCVTSGAAALVFSAIYALVD 376
           T S+  +      L+F+  +   D
Sbjct: 286 TPSFSLIAGAVGVLIFAVAFLCFD 309


>gi|406834451|ref|ZP_11094045.1| hypothetical protein SpalD1_22506 [Schlesneria paludicola DSM
           18645]
          Length = 358

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 74/326 (22%), Positives = 133/326 (40%), Gaps = 90/326 (27%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKR-------IPDRADAVKKVIFRTLKLL 113
           ++ H  W+G +  D + P FLF+VGV +  +L +       +P+R+    ++I RTL L+
Sbjct: 26  QLEHVKWDGFHFYDLIFPLFLFLVGVVLPFSLTKYQTAGELVPNRSGVYARIIRRTLLLI 85

Query: 114 FWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
             G++  G            +D    R  GVLQRI + Y   +L  +    +  + Q+  
Sbjct: 86  ALGLIGNG---------ILQLDFTNFRWPGVLQRIGICYFFAALAVL---HLGIRSQA-- 131

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
                       L+A  ++  Y ALL     P +Q              F++T       
Sbjct: 132 ------------LLAIGIIAGYWALLRFVPAPGFQ-------------PFDLTM------ 160

Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF-EPEGL 292
               + VGY+DR ++                               P   H    + EG+
Sbjct: 161 --EGSLVGYVDRLLI-------------------------------PGKLHYKLGDNEGV 187

Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM-GFALLIFGLTLHFTNAIPLNKQL 351
           LS++ +I + ++GV  GH +   +    R  +++ +   A++       ++ + PLNK L
Sbjct: 188 LSTIPAIGTALLGVLTGHWL---RSSYTRFHKFLGLCAGAIVCLAAGYLWSFSFPLNKIL 244

Query: 352 YTLSYVCVTSGAAALVFSAIYALVDC 377
           +T S+V VT G +  + +  Y L+D 
Sbjct: 245 WTSSFVLVTGGWSLTLLAVFYLLIDV 270


>gi|311746093|ref|ZP_07719878.1| hypothetical protein ALPR1_06685 [Algoriphagus sp. PR1]
 gi|126576311|gb|EAZ80589.1| hypothetical protein ALPR1_06685 [Algoriphagus sp. PR1]
          Length = 367

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 55/105 (52%), Gaps = 19/105 (18%)

Query: 31  KTQRLASLDIFRGLAVALMIL--------VDHAGGD-------WPEISHAPWNGCNLADF 75
           K+ RL SLD FRGL + L+I         +  A  D       + + +H PWNG    D 
Sbjct: 7   KSGRLVSLDTFRGLTMFLLIAEAAFVYESLLEAFPDPGILNSFFTQFTHHPWNGLRFWDL 66

Query: 76  VMPFFLFIVGVAIALAL-KRI---PDRADAVKKVIFRTLKLLFWG 116
           + PFF+FIVGVA+  +L KR+    +R++  K ++ R   L  +G
Sbjct: 67  IQPFFMFIVGVAMPFSLNKRLENQENRSEVTKHILKRCFYLFLFG 111


>gi|157960931|ref|YP_001500965.1| hypothetical protein Spea_1103 [Shewanella pealeana ATCC 700345]
 gi|157845931|gb|ABV86430.1| conserved hypothetical protein [Shewanella pealeana ATCC 700345]
          Length = 394

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/168 (30%), Positives = 79/168 (47%), Gaps = 30/168 (17%)

Query: 15  IISEPDVSDQQEKSHLKTQ-----RLASLDIFRGLAV-----------ALMILVDHAGGD 58
           II +P VS     +   ++     RL SLD  RG  +           AL++L   AG +
Sbjct: 4   IIDKPRVSIASVAAESVSKPAAKPRLKSLDALRGFDMFWILGGEAIFAALLLLTGWAGFN 63

Query: 59  W--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKR-----IPDRADAVKKVIFRTLK 111
           W   ++ H+ W+G    D + P F+F+ GVA+ L+ KR     +P R    +  I R L 
Sbjct: 64  WFDSQMHHSTWHGFTFYDLIFPLFIFLSGVALGLSPKRLDKLPLPQRMPLYQHAIKRLLL 123

Query: 112 LLFWGILLQGGF-SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
           LL +G++   G+ + AP  L        IR   VL RIA ++   +L+
Sbjct: 124 LLLFGVIYNHGWGTGAPFAL------GDIRYASVLGRIAFAWFFCALL 165



 Score = 44.7 bits (104), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 51/100 (51%), Gaps = 12/100 (12%)

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHF 341
           +A  +PEG+LS+V ++++ + GV  GH I+    KG      +W  +    L     L  
Sbjct: 224 NAAVDPEGILSTVPAVVNGLFGVFVGHFIVKPQVKG------EWFKVAILALSGLGLLVL 277

Query: 342 TNAI----PLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
             A+    P+NK L+T S+  VTSG + L  +  Y ++D 
Sbjct: 278 GWAVSPWNPVNKTLWTSSFTLVTSGWSILFLALFYTIIDV 317


>gi|171914858|ref|ZP_02930328.1| hypothetical protein VspiD_26815 [Verrucomicrobium spinosum DSM
           4136]
          Length = 379

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 80/332 (24%), Positives = 131/332 (39%), Gaps = 80/332 (24%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGI 117
           ++ H  W G    D + P FLF+VGV+I L++ R+     R+ A+ +++ R+  L   G+
Sbjct: 29  QLQHVEWEGFRFYDAIFPLFLFLVGVSIVLSVDRMVARVGRSRALARIVRRSALLFAVGV 88

Query: 118 LLQGGFSHA-PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
              GG +   PD          ++L GVL RIAL YL+ + + +    +  K   +    
Sbjct: 89  FYYGGIARPWPD----------VQLSGVLPRIALCYLVAATLYVL---LPRKGIVIA--- 132

Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
                      AAC+L  +  +L   +VP       N D                 +  P
Sbjct: 133 ----------TAACLLGYWALML---FVP-----FPNVD-----------------IKTP 157

Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
             A   ++ K            +  + + T+ +  EG         C  P     L  S 
Sbjct: 158 AGARKQVEAKT-------REELFAGAASTTKGTFKEGLNLAHYVDACWLPGRKRNLYYSN 210

Query: 297 SSILSTI-------IGVHFGHVIIHTKGHLARLKQW-----VTMGFALLIFGLTLHFTNA 344
             +LSTI        GV  G V+ H +    R  +W     V  G A ++ GL       
Sbjct: 211 EGLLSTIPAVASTLFGVLAGWVLTHGR----RSGRWKVGWLVGSGLAGVVIGLLWGLE-- 264

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            P+ K+L+T S+  V +G +A++    Y +VD
Sbjct: 265 FPVIKRLWTSSFCMVAAGFSAVLLGLFYLVVD 296


>gi|409203840|ref|ZP_11232043.1| hypothetical protein PflaJ_21058 [Pseudoalteromonas flavipulchra
           JG1]
          Length = 377

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 56/104 (53%)

Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLH 340
           S+ + P +PEG+LSS+ +I++ I GV  G  I + +         +  G  +L   L   
Sbjct: 204 SYQNRPVDPEGVLSSLPAIVNAIAGVFAGRAIANAQTQGEWKIVGILAGSGVLALALGWL 263

Query: 341 FTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           +    P+NK+L+T S+V VT G +A++ +  YA+VD   F  W 
Sbjct: 264 WDMQFPVNKELWTSSFVLVTVGWSAILLAVFYAIVDVLSFQRWA 307



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 27/148 (18%)

Query: 31  KTQRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS----HAPWNGCNLADF 75
           K +RLASLD  RG+ +           AL +L    G  W        H+PW+G    D 
Sbjct: 6   KPKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQG--WKAFEAHTLHSPWHGFTFYDL 63

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF-----WGILLQGGFSHAPDEL 130
           + P F+F+ GVA+ L+ KRI       +K  +            +G+L   G+       
Sbjct: 64  IFPLFIFLSGVAMGLSPKRIDHLPFNERKSFYLKALKRLLLLCAFGVLYNHGWGTGIP-- 121

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLV 158
              +D   +R   VL RIA ++   +L+
Sbjct: 122 ---MDPDGVRYASVLGRIAFAWFFCALL 146


>gi|338211620|ref|YP_004655673.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336305439|gb|AEI48541.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 393

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 82/371 (22%), Positives = 140/371 (37%), Gaps = 90/371 (24%)

Query: 34  RLASLDIFRGLAVALMIL----VDHAGGDWPEIS----------HAPWNGCNLADFVMPF 79
           R++S+D +RG  + LM+       H     P+ S          H  W GC+L D + P 
Sbjct: 8   RISSVDAYRGFVMFLMMAEVLEFGHISKALPDSSFWAFLAYNQDHVEWVGCSLHDLIQPS 67

Query: 80  FLFIVGVA----IALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           F F+VGVA    IA  + +  +        + R+L L+F GI L+   S   ++  +  +
Sbjct: 68  FSFLVGVALPYSIASRMAKGQNFGSMFGHTVQRSLILIFLGIFLR---SMHREQTNFTFE 124

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
                    L +I L Y ++  +   T   Q                   L+A   ++V 
Sbjct: 125 -------DTLTQIGLGYPVLFWLGFKTFRTQ-------------------LIALSTILVG 158

Query: 196 LALLYGTY-VPDWQFTIINKDSADYGKVFNVTC-------GVRAKLNPPCNAVGYIDRKV 247
             L +  Y +P   F        DY  V  VT        G  A  N   NA        
Sbjct: 159 YWLFFAAYPLPGADF--------DYASV-GVTADWEHNLQGFAAHWNKNTNA-------- 201

Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
                     AW   +      P E P  ++   +    F P        ++ + ++G+ 
Sbjct: 202 ----------AWSFDQWFMNLFPREKPFIRNGGGYSTLSFIP--------TLGTMVLGLM 243

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
            G  +      + ++K++  +G  LL+ G  L++    P  K+++T ++V  + G   L+
Sbjct: 244 AGQWLKSDTAPMEKVKRFAALGVGLLVVGSLLNWLGICPNVKRIWTPTWVLFSGGWCFLL 303

Query: 368 FSAIYALVDCN 378
            +A Y LVD  
Sbjct: 304 LAAFYWLVDIQ 314


>gi|291295418|ref|YP_003506816.1| hypothetical protein [Meiothermus ruber DSM 1279]
 gi|290470377|gb|ADD27796.1| conserved hypothetical protein [Meiothermus ruber DSM 1279]
          Length = 399

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 75/157 (47%), Gaps = 17/157 (10%)

Query: 14  LIISEPDVS---DQQEKSHLKTQRLASLDIFRGLAVALMILVDH-----AGGDWPEISHA 65
           ++I++P  +   + + K+     RL +LD  RGL V LM+LV++     A  D  ++ HA
Sbjct: 1   MMIAQPSTAIAVESKNKATPAGARLLALDGLRGLTVFLMLLVNNLALQEATPD--QLVHA 58

Query: 66  PWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
           P+ G  LAD V P+FLF +G AI  A     D+           +      I L G F  
Sbjct: 59  PFGGVTLADLVFPWFLFCMGAAIPYAASSF-DKQKLPLWRRLLRILRRTSLIFLLGLF-- 115

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT 162
               LT  +    +   GVLQ IAL+Y L +L  + +
Sbjct: 116 ----LTSALARTPVFALGVLQLIALAYCLAALFYLIS 148


>gi|284036950|ref|YP_003386880.1| hypothetical protein Slin_2036 [Spirosoma linguale DSM 74]
 gi|283816243|gb|ADB38081.1| hypothetical protein Slin_2036 [Spirosoma linguale DSM 74]
          Length = 404

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 69/132 (52%), Gaps = 16/132 (12%)

Query: 16  ISEPDVSDQQE-KSHLKTQRLASLDIFRGLAVALMILVDH-----AGGDWPEISHAP--W 67
           +++ + +D+Q  +S   + R+ ++DI R + + LMI V+      A  DW E  H P   
Sbjct: 1   MTQLETADRQAYRSSSVSMRVDAIDILRAMTMILMIFVNDLWSLTAIPDWLE--HVPHGV 58

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
           +G  LAD V P FLFIVG+++  A+     + D     V  +I R++ LL  G+ L  G 
Sbjct: 59  DGIGLADVVFPGFLFIVGMSLPFAMNARRQKGDTNSALVSHIIMRSIALLVMGVFLVNG- 117

Query: 124 SHAPDELTYGVD 135
             + D+   G++
Sbjct: 118 -ESIDQKATGIN 128


>gi|392965639|ref|ZP_10331058.1| hypothetical protein BN8_02168 [Fibrisoma limi BUZ 3]
 gi|387844703|emb|CCH53104.1| hypothetical protein BN8_02168 [Fibrisoma limi BUZ 3]
          Length = 411

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 60/112 (53%), Gaps = 9/112 (8%)

Query: 15  IISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDH-----AGGDWPEISHAPWNG 69
           +I +  V+  + +  L  +R+ S+DIFR L +  MI V+         DW E S A  + 
Sbjct: 1   MIDQQVVASPETRRQLPEKRVHSIDIFRALTMLFMIFVNDLWTLIGIPDWLEHSPADVDF 60

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGI 117
             LAD V P FLFIVG++I  A++    + D+    ++ ++ R++ LL  G+
Sbjct: 61  LGLADVVFPCFLFIVGMSIPFAIQGRLAKGDSYGLIIRHIVVRSVALLIMGV 112


>gi|424665544|ref|ZP_18102580.1| hypothetical protein HMPREF1205_01419 [Bacteroides fragilis HMW
           616]
 gi|404574617|gb|EKA79366.1| hypothetical protein HMPREF1205_01419 [Bacteroides fragilis HMW
           616]
          Length = 375

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 68/146 (46%), Gaps = 25/146 (17%)

Query: 32  TQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCNLADFVMP 78
           + RLASLDI RG  + L++        L       W      +  H  W G    D VMP
Sbjct: 9   SPRLASLDILRGFDLFLLVFFQPVLWALAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMP 68

Query: 79  FFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
            FLF+ G ++  +  +    PD+    +K++ R + L  +G+++QG           G++
Sbjct: 69  LFLFMTGASMPFSFSKFKDDPDKGPIYRKILKRFILLFIFGMIVQGNL--------LGLN 120

Query: 136 VRMIRL-CGVLQRIALSYLLVSLVEI 160
            + + L    LQ IA  YL+ +++++
Sbjct: 121 PKYLYLYSNTLQAIATGYLIAAIIQL 146


>gi|170725675|ref|YP_001759701.1| hypothetical protein Swoo_1314 [Shewanella woodyi ATCC 51908]
 gi|169811022|gb|ACA85606.1| conserved hypothetical protein [Shewanella woodyi ATCC 51908]
          Length = 378

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 55/93 (59%), Gaps = 6/93 (6%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFA-LLIFGLTLHFTNA 344
           +PEG+LS++ +I++ + GV  GH I+  H KG   ++   VTM  A  L+ G        
Sbjct: 212 DPEGILSTLPAIVNAMAGVFVGHFIVKEHAKGEWYKV---VTMLIAGALVLGCGWLLNLV 268

Query: 345 IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
           IP+NK L+T S+V VT+G + ++ +  YA VD 
Sbjct: 269 IPVNKDLWTSSFVLVTTGWSMILLAVFYAAVDV 301



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 46/155 (29%), Positives = 72/155 (46%), Gaps = 27/155 (17%)

Query: 25  QEKSHLKT--QRLASLDIFRG-----------LAVALMILVDHAGGDWP--EISHAPWNG 69
            E +  KT  +RL SLD  RG           L   L++     G  W   ++ H+ W+G
Sbjct: 1   MEATQAKTPKRRLMSLDALRGFDMFWILGGEALFAGLLLWTGWHGWQWADAQMHHSQWHG 60

Query: 70  CNLADFVMPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGF- 123
               D + P F+F+ GVA+ L+ KR+       R    K  + R L LLF+G+L   G+ 
Sbjct: 61  FTFYDLIFPLFIFLSGVALGLSPKRLDKLPMAQRMPLYKHSVKRLLLLLFFGVLYNHGWG 120

Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
           + AP      V +  +R   VL RIA ++   +++
Sbjct: 121 TGAP------VAIDEVRYASVLGRIAFAWFFAAML 149


>gi|410657728|ref|YP_006910099.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp.
           DCA]
 gi|409020083|gb|AFV02114.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp.
           DCA]
          Length = 370

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 81/352 (23%), Positives = 142/352 (40%), Gaps = 84/352 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           T R+ +LD  R L+V L+ L     G      I+HAPW G    DF  P F+ + G ++A
Sbjct: 5   TNRIKALDFARALSVLLLFLTFVPEGPLYGAYITHAPWFGYTAIDFAFPAFVTLSGTSMA 64

Query: 90  LALKR-IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           +A ++ +P       ++I R   L+  G++           +++   +  +R  GVLQ +
Sbjct: 65  IAYRKHVP-----WVRLIRRFFVLIIIGLIFN-------SLVSWEFHLSQLRFTGVLQVL 112

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           A + ++ +L+      V  K              W W   A +L++              
Sbjct: 113 AFTGIMTTLIT----RVSGK--------------WFWPFTAGLLIL-------------- 140

Query: 209 FTIINKDSADYGKVFNVTCGVRAKL-NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                  +A  G +   +      L +P  N  G ID  +   +H+Y H           
Sbjct: 141 -------AAYLGILLYTSQSFPGSLPSPDHNLSGMIDPFIFTKSHLYVH----------- 182

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                            A ++PEG+ +  S+I ST+ G   G  +     ++ R      
Sbjct: 183 ---------------GDAGYDPEGICTLFSAIASTLFGYTAG--LFLNNKNIGR-NFLKI 224

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNI 379
           +  A ++  LT   +N IP+ K+L+T S+V ++SGA  LV +  + + D  I
Sbjct: 225 LALAAVLLLLTPLLSNFIPIGKRLWTPSFVTLSSGATILVLAFAHLIWDPQI 276


>gi|375148919|ref|YP_005011360.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361062965|gb|AEW01957.1| Protein of unknown function DUF2261, transmembrane [Niastella
           koreensis GR20-10]
          Length = 397

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 53/94 (56%), Gaps = 9/94 (9%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGD-----WPEISHAPWNGCNLADFVMPFFLFIVGV 86
           TQRLAS+D+FR L + LMI V+  G       W E + A  +G  LAD V P FLFIVG+
Sbjct: 5   TQRLASIDVFRALTMLLMIFVNDLGTLKNIPLWLEHTKANEDGMGLADTVFPAFLFIVGL 64

Query: 87  AIALAL----KRIPDRADAVKKVIFRTLKLLFWG 116
           +I  A+     +   +++ +  ++ R+  LL  G
Sbjct: 65  SIPFAIGNRWAKGASQSNILGHILIRSFALLVMG 98


>gi|374312990|ref|YP_005059420.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358755000|gb|AEU38390.1| protein of unknown function DUF1624 [Granulicella mallensis
           MP5ACTX8]
          Length = 408

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 55/98 (56%), Gaps = 9/98 (9%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-----WPEISHAPWNGCNLADFVMPFFLFIVG 85
           +T R+AS+DIFRGL +A+MI V+   G      W   + A  +     D V PFFLFI+G
Sbjct: 14  RTTRVASIDIFRGLTMAIMIFVNDLDGVQGLPWWTHHAKANIDVMTYVDMVFPFFLFIIG 73

Query: 86  VAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILL 119
           +++ LA    LK+ P        V+ R++ L+  G++L
Sbjct: 74  LSMPLAIRQRLKKNPSIPQLWLHVLIRSVSLVALGVIL 111


>gi|393787642|ref|ZP_10375774.1| hypothetical protein HMPREF1068_02054 [Bacteroides nordii
           CL02T12C05]
 gi|392658877|gb|EIY52507.1| hypothetical protein HMPREF1068_02054 [Bacteroides nordii
           CL02T12C05]
          Length = 373

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 83/369 (22%), Positives = 136/369 (36%), Gaps = 102/369 (27%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEI-------------SHAPWNGCNLADFVMP 78
            +RLASLD+ RG  +  ++++      W EI             +H  W G    D +MP
Sbjct: 5   NKRLASLDLLRGFDLFCLLMLQPILMTWLEIENNPSLDPITNQFTHVEWQGVAFWDLIMP 64

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGV 134
            F+F+ G+ I  A+ +        +   FR  K    L F G ++QG            +
Sbjct: 65  LFMFMSGITIPFAMSKYKQGEKIDRHFYFRLFKRFFVLFFLGWVVQGNL--------LAL 116

Query: 135 DVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
           D+R   +    LQ IA+ Y++ +L                      LY W      C   
Sbjct: 117 DIRQFHIFANTLQAIAVGYVVAAL----------------------LYVW------C--- 145

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
                   ++     FT++   +  Y  VF  T G      P  N    IDR VLG    
Sbjct: 146 --------SFRTQISFTVLCFIT--YLLVF-ATIG-DMNYEPGTNIAEEIDRCVLG---- 189

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEG------LLSSVSSILSTIIGVH 307
                                LR D  +W +  +  +       +LSS++ I++ ++G  
Sbjct: 190 --------------------SLR-DGVTWTNGTWSFDSSYHYTWILSSLNFIVTVMLGSF 228

Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALV 367
            GH++   K  + RLK  +  G  L+   L +      P+ K++++ S      G   L+
Sbjct: 229 AGHILRLRKEPVQRLKVLLLTGGFLVAVALLMD--PLFPIIKRIWSSSMTLFYGGVCFLL 286

Query: 368 FSAIYALVD 376
               Y L+D
Sbjct: 287 MGIFYYLID 295


>gi|375255119|ref|YP_005014286.1| hypothetical protein BFO_1396 [Tannerella forsythia ATCC 43037]
 gi|363406141|gb|AEW19827.1| putative membrane protein [Tannerella forsythia ATCC 43037]
          Length = 375

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 83/363 (22%), Positives = 136/363 (37%), Gaps = 91/363 (25%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEI-------------SHAPWNGCNLADFVMPF 79
           +RL SLD+ RG  +  ++++      W EI             +H  W G    D +MP 
Sbjct: 8   KRLVSLDLLRGFDLFCLLMLQPILMTWLEIADNPAWAPLARQFTHVEWRGVAFWDLIMPL 67

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGG-FSHAPDELTYGV 134
           F+F+ G+ +  AL +    A        + LK    L F G ++QG   +  P+      
Sbjct: 68  FMFMSGITVPFALSKYKRGAKPGHSFYLKLLKRFVILFFLGWIVQGNLLALDPN------ 121

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
             R       LQ IA+ Y++ +   +             RFS FR+      + A VL  
Sbjct: 122 --RFHIFANTLQAIAVGYVVTAFCYV-------------RFS-FRVQ-----LGATVLFF 160

Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
              LL                      VF    G+     P  N    IDR VLG     
Sbjct: 161 IAYLL----------------------VFATVGGM--NWEPGTNIAEEIDRCVLG----- 191

Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHFGHVII 313
                 R +        +G + +   SW   P +    +LSS++ +++ + G   GH++ 
Sbjct: 192 ------RFR--------DGIITEADGSWKFDPAYHYTWILSSLNFVVTVMTGSFAGHILR 237

Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             K    RL + +  G +L++  L +      PL K++++ S      G   L+    Y 
Sbjct: 238 LRKTARQRLMRLLITGVSLVVAALLMD--PVFPLIKRIWSSSMTLFYGGVCFLLMGLFYY 295

Query: 374 LVD 376
           +VD
Sbjct: 296 VVD 298


>gi|393784535|ref|ZP_10372698.1| hypothetical protein HMPREF1071_03566 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665516|gb|EIY59040.1| hypothetical protein HMPREF1071_03566 [Bacteroides salyersiae
           CL02T12C01]
          Length = 378

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 74/148 (50%), Gaps = 26/148 (17%)

Query: 31  KTQRLASLDIFRGLAVALMIL---VDHAGG---DWP-------EISHAPWNGCNLADFVM 77
            + RLASLDI RG  + L++    V  A G   D P       +  H  W G +L D VM
Sbjct: 10  NSSRLASLDILRGFDLFLLVFFQPVFVALGQQLDLPFLNRLVYQFDHEAWVGFHLWDLVM 69

Query: 78  PFFLFIVGVAIALALKRIPDRADA---VKKVIFRTLKLLF-WGILLQGGFSHAPDELTYG 133
           P FLF+ G ++  +L +    +     V + IFR + LLF +G+++QG           G
Sbjct: 70  PLFLFMTGASMPFSLSKYKISSAGCQFVYRRIFRRVVLLFLFGMIVQGNL--------LG 121

Query: 134 VDVRMIRL-CGVLQRIALSYLLVSLVEI 160
            D + I L    LQ IA+ YL+ +++++
Sbjct: 122 FDSQHIYLYSNTLQAIAVGYLIAAIIQL 149


>gi|308050627|ref|YP_003914193.1| hypothetical protein Fbal_2917 [Ferrimonas balearica DSM 9799]
 gi|307632817|gb|ADN77119.1| conserved hypothetical protein [Ferrimonas balearica DSM 9799]
          Length = 366

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 93/377 (24%), Positives = 142/377 (37%), Gaps = 127/377 (33%)

Query: 33  QRLASLDIFRG-----------LAVALMILVDHAGGDWP-------EISHAPWNGCNLAD 74
           QRL +LD  RG           L  AL++L       WP       ++ H+PW+G    D
Sbjct: 6   QRLQALDALRGFDMFWIIGGEKLFAALLLLTG-----WPLWQVAADQMLHSPWHGFTFYD 60

Query: 75  FVMPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGFS----H 125
            + P F+F+ GV I L  + +      DR    +K + R L L   G+L   G+      
Sbjct: 61  LIFPLFIFLSGVTIGLQRQSLIGIAWSDRQPHYRKALKRLLLLALLGVLYNHGWGTGMPM 120

Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH- 184
           A DE         IR   VL RI +++ L +++                        WH 
Sbjct: 121 ALDE---------IRYASVLGRIGMAWFLAAMI-----------------------AWHL 148

Query: 185 -W---LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
            W   LMA  VL +   L  G +            SA Y    ++               
Sbjct: 149 PWRSQLMAGLVLALGYGLAQGLW-----------GSAPYSSELSLNT------------- 184

Query: 241 GYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
            ++D+  L GI +              Q++P                 +PEGL+S +S+ 
Sbjct: 185 -WVDQHWLPGITY--------------QNAP----------------LDPEGLMSHLSAA 213

Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           L+ ++GV  G+ +   +    R    V    AL    L       +P+NK L+TLS+V V
Sbjct: 214 LNALVGVWAGYWLRQPRSVWQRAG--VLALVALASLLLGWLLHPLVPVNKTLWTLSFVLV 271

Query: 360 TSGAAALVFSAIYALVD 376
           T G + L  +   A+VD
Sbjct: 272 TVGWSGLFLAGFLAVVD 288


>gi|212557932|gb|ACJ30386.1| Conserved hypothetical protein [Shewanella piezotolerans WP3]
          Length = 387

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 57/102 (55%), Gaps = 4/102 (3%)

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHF 341
           +A  +PEG+LS++ ++++ ++GV  GH I+  H KG   +L   V     L +  L    
Sbjct: 217 NAAVDPEGILSTIPAVVNGLLGVFVGHFIVKTHAKGEWYKLA--VMCLLGLGLLALGWLI 274

Query: 342 TNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPW 383
           +  IP+NK L+T S+V VTSG + L+ +  Y ++D      W
Sbjct: 275 SPIIPVNKTLWTSSFVLVTSGWSVLLLTLFYGVIDVMKLSRW 316



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 47/160 (29%), Positives = 75/160 (46%), Gaps = 28/160 (17%)

Query: 20  DVSDQQEKSHLKTQ-RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHA 65
           +   Q E    K + RL SLD  RG  +           AL++L    G  W   ++ H+
Sbjct: 6   NTQSQTEHGPKKNKVRLKSLDALRGFDMFWILGGEAIFAALIVLTGWGGLHWLDKQMHHS 65

Query: 66  PWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV------IFRTLKLLFWGILL 119
            W+G    D + P F+F+ GVA+ L+ KR+ D+   V+++      + R L LL  G++ 
Sbjct: 66  AWHGFTFYDLIFPLFIFLSGVALGLSPKRL-DKLPMVQRMPLYQHAVKRLLLLLLLGVIY 124

Query: 120 QGGF-SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
             G+ + AP  L        IR   VL RIA ++   +L+
Sbjct: 125 NHGWGTGAPMALGD------IRYASVLGRIAFAWFFCALL 158


>gi|322785713|gb|EFZ12351.1| hypothetical protein SINV_07836 [Solenopsis invicta]
          Length = 106

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 46/84 (54%)

Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
           L  +++I    +GV  G +++  KG   R+ +W+         G  LHFTN IP+NK L+
Sbjct: 10  LRILTAIFQVFLGVQAGVILMIYKGWKERVIRWLLWAVFYGCMGCILHFTNIIPINKSLW 69

Query: 353 TLSYVCVTSGAAALVFSAIYALVD 376
           +LS+V V++  A    S  Y L+D
Sbjct: 70  SLSFVFVSTSFALAFLSGCYLLID 93


>gi|403717790|ref|ZP_10942873.1| hypothetical protein KILIM_074_00050 [Kineosphaera limosa NBRC
           100340]
 gi|403208927|dbj|GAB97556.1| hypothetical protein KILIM_074_00050 [Kineosphaera limosa NBRC
           100340]
          Length = 461

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 95/246 (38%), Gaps = 51/246 (20%)

Query: 14  LIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDH--AGGDWPEISHAPWNGCN 71
           L  ++             ++R  SLD+ RGL + + + V+       W E  HA W G +
Sbjct: 61  LPATQSSTKSPPPAQSFPSRRFISLDVARGLMLVVSVAVNAWITAPAWFE--HAAWAGVH 118

Query: 72  LADFVMPFFLFIVGVAIALA-LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
             D V P F+ + G  +A A  +RIP     ++ ++ R + L   G+      +HA    
Sbjct: 119 PVDLVFPTFVALSGAGLAFAYARRIP-----LRPLLSRVIVLALAGLAYN---AHAQYLS 170

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
           T  +D    R+ GVLQ  A   L+++L+                  + R + W W +   
Sbjct: 171 TGQLDWATFRIPGVLQLYAAIVLVIALLHF----------------VLRRW-WAWPLFTI 213

Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
           V     AL             +N+        F   C   A L P CN  G  D  + G+
Sbjct: 214 VAATCFAL------------ALNR--------FAAGCPGGA-LTPECNPSGLFDPALFGV 252

Query: 251 NHMYHH 256
            H+YH 
Sbjct: 253 EHIYHQ 258


>gi|300123407|emb|CBK24680.2| unnamed protein product [Blastocystis hominis]
          Length = 349

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 59/139 (42%), Gaps = 23/139 (16%)

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
           YID K+LG  HMYH  A                            FE EG+L  +++I  
Sbjct: 157 YIDLKLLGEKHMYHGSAVVYVYGGNH-------------------FECEGILGMMNAIFL 197

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL----HFTNAIPLNKQLYTLSYV 357
           T +G     V    K    +L  ++ +G  LL+F   L     +   +P+NK  +  S++
Sbjct: 198 TYLGTWIPWVFRTVKKQKNQLLVYLGIGAGLLLFSGILCGFKQYDGYMPINKNKWNTSFI 257

Query: 358 CVTSGAAALVFSAIYALVD 376
            +TSG   L F  IY LVD
Sbjct: 258 AITSGTGFLAFGLIYLLVD 276


>gi|456985619|gb|EMG21386.1| putative membrane protein [Leptospira interrogans serovar
           Copenhageni str. LT2050]
          Length = 296

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 62/134 (46%), Gaps = 28/134 (20%)

Query: 233 LNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGL 292
           L P  +   +IDR V G NH+     W+ SK                       ++PEG 
Sbjct: 68  LEPGKDIGAWIDRNVFGENHL-----WKFSKT----------------------WDPEGF 100

Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
            S +SSI ++++GV  G  I+ +K +  + +     GF  L   + L +   +P+NK L+
Sbjct: 101 FSGISSITTSLLGVFCGS-ILSSKTNETKKQILSIFGFGTLFVLVGLLWNQNLPMNKSLW 159

Query: 353 TLSYVCVTSGAAAL 366
           T SYV  T+G A L
Sbjct: 160 TGSYVIYTAGLAFL 173


>gi|456985620|gb|EMG21387.1| hypothetical protein LEP1GSC150_0590 [Leptospira interrogans
          serovar Copenhageni str. LT2050]
          Length = 77

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/57 (45%), Positives = 35/57 (61%), Gaps = 3/57 (5%)

Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
          ++ L   R+ SLD+FRG+ VA MILV++ G     +  + HA WNGC   D V PFF
Sbjct: 2  ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFF 58


>gi|317503636|ref|ZP_07961655.1| conserved hypothetical protein, partial [Prevotella salivae DSM
          15606]
 gi|315665261|gb|EFV04909.1| conserved hypothetical protein [Prevotella salivae DSM 15606]
          Length = 59

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 35/55 (63%), Gaps = 2/55 (3%)

Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFI 83
          KT R+ ++DI RG+ +A MILV++ GG   +  + HA W G    D V PFF+FI
Sbjct: 5  KTSRIEAVDILRGITIAGMILVNNPGGQPVYTPLEHAEWLGLTPTDLVFPFFMFI 59


>gi|456861512|gb|EMF80162.1| hypothetical protein LEP1GSC188_2620 [Leptospira weilii serovar
          Topaz str. LT2116]
          Length = 88

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/60 (46%), Positives = 35/60 (58%), Gaps = 5/60 (8%)

Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFF 80
          ++KS     R+ SLD+FRG+ V  MILV++ G  W  I     HA WNGC   D V PFF
Sbjct: 2  EKKSTQNKDRILSLDLFRGMTVIGMILVNNPGS-WSYIYSPLKHAKWNGCTPTDLVFPFF 60


>gi|374372786|ref|ZP_09630447.1| hypothetical protein NiasoDRAFT_3432 [Niabella soli DSM 19437]
 gi|373234862|gb|EHP54654.1| hypothetical protein NiasoDRAFT_3432 [Niabella soli DSM 19437]
          Length = 357

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 53/102 (51%), Gaps = 18/102 (17%)

Query: 35  LASLDIFRGLAVALMIL--------VDHA-GGDWPE-----ISHAPWNGCNLADFVMPFF 80
           + SLD  RGL + L+ L        + HA  G W E       H PW+G +  D + P F
Sbjct: 1   MLSLDFMRGLIMVLLALESTGLYEHLSHASAGTWFEGIMQQFFHHPWHGLHFWDLIQPGF 60

Query: 81  LFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGIL 118
           +F+ GVA+A +L++   R      ++KK + R+  L FWG+L
Sbjct: 61  MFMAGVAMAYSLQKQKQRDYTWNRSLKKTLRRSGWLFFWGVL 102


>gi|373954275|ref|ZP_09614235.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
 gi|373890875|gb|EHQ26772.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
          Length = 397

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 67/132 (50%), Gaps = 17/132 (12%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            R+ S+DIFR + + LMI V+   G     +W + +    +G  LAD V P FLFIVG++
Sbjct: 7   NRVHSIDIFRAVTMFLMIFVNDIDGVPGVPEWIKHAGERTDGLGLADIVFPAFLFIVGLS 66

Query: 88  IALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
           I  A++    R D+  K    ++ R L L+F       GF HA  E TY  D  +I    
Sbjct: 67  IPHAIQSRISRGDSKTKIAAYIVMRALALIF------IGFIHANME-TYS-DTAVIAQPW 118

Query: 144 VLQRIALSYLLV 155
              +I LS+ L+
Sbjct: 119 WEIQITLSFFLI 130


>gi|94985055|ref|YP_604419.1| hypothetical protein Dgeo_0949 [Deinococcus geothermalis DSM 11300]
 gi|94555336|gb|ABF45250.1| hypothetical protein Dgeo_0949 [Deinococcus geothermalis DSM 11300]
          Length = 573

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 49/79 (62%), Gaps = 5/79 (6%)

Query: 15  IISEP-DVSDQQEKSHLKTQ-RLASLDIFRGLAVALMILVDH-AGGDW--PEISHAPWNG 69
           ++S+P   +       ++T+ RL +LD +RGL V LM+LV++ A GD   P++ HAP+ G
Sbjct: 197 VLSDPAPTTSAGGAGPVQTRVRLTALDAWRGLTVLLMLLVNNVALGDLTPPQLQHAPFGG 256

Query: 70  CNLADFVMPFFLFIVGVAI 88
             L D V P+FLF  G A+
Sbjct: 257 LTLTDLVFPWFLFCAGAAL 275


>gi|359686399|ref|ZP_09256400.1| hypothetical protein Lsan2_17893 [Leptospira santarosai str.
           2000030832]
          Length = 329

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 74/297 (24%), Positives = 121/297 (40%), Gaps = 88/297 (29%)

Query: 75  FVMPFFLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDEL 130
           F +PFFLF VG +I ++L  K   +R+D    +  R+  L+  G+ L   G +S A    
Sbjct: 9   FGVPFFLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE--- 65

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
                   +R+ GVLQRI   Y +V SL  +F        + +  FS+  L    W++  
Sbjct: 66  --------LRIPGVLQRIGFVYWVVASLCLVF------PGKKILVFSVPILLIHTWILTQ 111

Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
             L            P  +  +  +   D G                     +IDR + G
Sbjct: 112 IAL------------PG-ESVVSLEQGKDIG--------------------AWIDRTIFG 138

Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
             H+     WR SK                       ++PEG LS V+S+++T+ GV  G
Sbjct: 139 EKHL-----WRFSKT----------------------WDPEGFLSGVASVVTTLFGVLCG 171

Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAAL 366
            ++        R ++   +G  +L   + L +  ++P+NK L+T SY   T+G + L
Sbjct: 172 FIL------FLRERKNKILGLGILFSFVGLLWDLSLPMNKSLWTGSYSVYTAGLSFL 222


>gi|153805867|ref|ZP_01958535.1| hypothetical protein BACCAC_00106 [Bacteroides caccae ATCC 43185]
 gi|149130544|gb|EDM21750.1| hypothetical protein BACCAC_00106 [Bacteroides caccae ATCC 43185]
          Length = 377

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 25/144 (17%)

Query: 32  TQRLASLDIFRGLAVALMIL---VDHAGG---DWP-------EISHAPWNGCNLADFVMP 78
           + RLASLDI RG  + L++    V  A G   ++P       +  H  W G    D VMP
Sbjct: 11  SGRLASLDILRGFDLFLLVFFQPVFVALGQRLNFPWLNDILYQFDHESWIGFRFWDLVMP 70

Query: 79  FFLFIVGVAIALA---LKRIPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGV 134
            FLF+ G ++  +    K  P++    +K+I R + L  +G+++QG      PD L    
Sbjct: 71  LFLFMTGASMPFSFSKFKNAPNKWHIYRKIIKRFVLLFIFGMIVQGNLLGLNPDSLY--- 127

Query: 135 DVRMIRLCGVLQRIALSYLLVSLV 158
                     LQ IA  YL+ +++
Sbjct: 128 -----LYSNTLQAIATGYLIAAII 146


>gi|410660784|ref|YP_006913155.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp. CF]
 gi|409023140|gb|AFV05170.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp. CF]
          Length = 368

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 79/352 (22%), Positives = 140/352 (39%), Gaps = 84/352 (23%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           T R+ +LD  R L+V L+ L     G      I+HAPW G    DF  P F+ + G ++A
Sbjct: 5   TNRIKALDFARALSVLLLFLTFVPEGPLYGAYITHAPWFGYTAIDFAFPAFVTLSGTSMA 64

Query: 90  LALKR-IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
           +  ++ +P       ++I R   LL  G++           +++   +  +R  GVLQ +
Sbjct: 65  IVYRKHVP-----WVRLIRRFFVLLIIGLIFN-------SLVSWEFQLSELRFTGVLQVL 112

Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
           A + ++ +L+      V  K              W W   A +L++              
Sbjct: 113 AFTGIMTALIT----RVSGK--------------WFWPFTAGLLIL-------------- 140

Query: 209 FTIINKDSADYGKVFNVTCGVRAKL-NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
                  +A  G +   +      L +P  N  G  D  +   +H+Y H           
Sbjct: 141 -------AAYLGILLYTSQSFPGSLPSPDHNLSGMTDPFIFTKSHLYVH----------- 182

Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
                            A ++PEG+ +  S+I ST+ G   G  +     ++ R      
Sbjct: 183 ---------------GDAGYDPEGICTLFSAIASTLFGYTAG--LFLNNKNIGR-DFLKI 224

Query: 328 MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNI 379
           +  A ++  LT   +N IP+ K+L+T S+V ++SGA  L+ +  + + D  I
Sbjct: 225 LALAAVLLLLTPLLSNFIPIGKRLWTPSFVTLSSGATILILAFAHLIWDPQI 276


>gi|423219681|ref|ZP_17206177.1| hypothetical protein HMPREF1061_02950 [Bacteroides caccae
           CL03T12C61]
 gi|392624886|gb|EIY18964.1| hypothetical protein HMPREF1061_02950 [Bacteroides caccae
           CL03T12C61]
          Length = 375

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 25/144 (17%)

Query: 32  TQRLASLDIFRGLAVALMIL---VDHAGG---DWP-------EISHAPWNGCNLADFVMP 78
           + RLASLDI RG  + L++    V  A G   ++P       +  H  W G    D VMP
Sbjct: 9   SGRLASLDILRGFDLFLLVFFQPVFVALGQRLNFPWLNDILYQFDHESWIGFRFWDLVMP 68

Query: 79  FFLFIVGVAIALA---LKRIPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGV 134
            FLF+ G ++  +    K  P++    +K+I R + L  +G+++QG      PD L    
Sbjct: 69  LFLFMTGASMPFSFSKFKNAPNKWHIYRKIIKRFVLLFIFGMIVQGNLLGLNPDSLY--- 125

Query: 135 DVRMIRLCGVLQRIALSYLLVSLV 158
                     LQ IA  YL+ +++
Sbjct: 126 -----LYSNTLQAIATGYLIAAII 144


>gi|224025513|ref|ZP_03643879.1| hypothetical protein BACCOPRO_02253, partial [Bacteroides
           coprophilus DSM 18228]
 gi|224018749|gb|EEF76747.1| hypothetical protein BACCOPRO_02253 [Bacteroides coprophilus DSM
           18228]
          Length = 377

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 57/117 (48%), Gaps = 20/117 (17%)

Query: 25  QEKSHLKTQRLASLDIFRGLAV--------ALMILVDHAG-GDWPE------ISHAPWNG 69
           +    ++ +RL SLDI RGL +          M L    G  +W +       +H  W G
Sbjct: 1   KNMRKIQKERLESLDILRGLDLFILVGFQSVFMYLAQATGENNWIKTIFDVLFTHVEWEG 60

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVK-KVIFRTLK--LLFW--GILLQG 121
            +L D VMP FLF+ G +I  A+ R   + + +  K+ +R LK  +L W  G ++QG
Sbjct: 61  FHLWDQVMPLFLFMAGTSIPYAMARYKRKEEEISGKLFYRVLKRVVLLWIFGAIVQG 117


>gi|255036257|ref|YP_003086878.1| hypothetical protein Dfer_2495 [Dyadobacter fermentans DSM 18053]
 gi|254949013|gb|ACT93713.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 379

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 73/322 (22%), Positives = 119/322 (36%), Gaps = 59/322 (18%)

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGG 122
           SH PW GC+L D + P F F+VGVA+  ++     +  +V  +   T++     ILL   
Sbjct: 32  SHVPWVGCSLHDLIQPSFSFLVGVALPYSMASRASKDQSVATMWAHTIRRSLILILLGIF 91

Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
                 E T             L +I L Y ++  + + ++  Q +D  V          
Sbjct: 92  LRSMHSEQT------NFTFEDTLTQIGLGYPILFALGLASEKTQ-RDALV---------- 134

Query: 183 WHWLMAACVLVVYLALLYGTYVP----DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
             W     +L+ Y  +     +P    DW  T    D     K      G  A  N   N
Sbjct: 135 --WGALGIILIGYAGVFAAYPLPGPGFDWSQTGTTADWEHNLK------GFAAHWNKNTN 186

Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGL--LSSV 296
           A    DR  L +                ++ PFE              F   G   LS +
Sbjct: 187 AAWAFDRWFLNL--------------FPREKPFE--------------FNGGGYSTLSFI 218

Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSY 356
            ++ + I+G+  G  +         LK++      L +  L  HFT   P+ K+++T ++
Sbjct: 219 PTLGTMILGLIAGKWLKTAVSSTWLLKRYAITAGVLFLLALAFHFTGLNPIVKRIWTPAW 278

Query: 357 VCVTSGAAALVFSAIYALVDCN 378
              + G A L+ +A Y +VD  
Sbjct: 279 TLFSGGWAFLLLAAFYFVVDVK 300


>gi|189463412|ref|ZP_03012197.1| hypothetical protein BACCOP_04131 [Bacteroides coprocola DSM 17136]
 gi|189429841|gb|EDU98825.1| hypothetical protein BACCOP_04131 [Bacteroides coprocola DSM 17136]
          Length = 219

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 57/263 (21%), Positives = 102/263 (38%), Gaps = 81/263 (30%)

Query: 102 VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF 161
           +KKVI RT+ L   G+ +                 + +R+ GVLQR+AL Y    ++   
Sbjct: 4   IKKVIKRTILLFLVGLFINWCAKGM-------CSFQELRIPGVLQRLALCYFFTVII--- 53

Query: 162 TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGK 221
             ++ +K                          Y+  L    +  +Q  ++  +   YG 
Sbjct: 54  CTNIHEK--------------------------YIPALITVLLLIYQIILVTGNGFVYG- 86

Query: 222 VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
                         P N +  ID+ +LG +H+Y+                          
Sbjct: 87  --------------PQNIIAVIDQYILGASHLYNDHG----------------------- 109

Query: 282 WCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
                 +PEG+LS++ SI  T+IG   G + I  +   ++L++   +G  LL  G    F
Sbjct: 110 -----IDPEGILSTIPSISHTLIGYCIGKICIEKENIHSKLEKLFLIGTVLLFAGYL--F 162

Query: 342 TNAIPLNKQLYTLSYVCVTSGAA 364
           +   P+NK++++ +YV +T GA 
Sbjct: 163 SYGCPINKKIWSPTYVFMTCGAG 185


>gi|404403948|ref|ZP_10995532.1| hypothetical protein AJC13_00860 [Alistipes sp. JC136]
          Length = 369

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 43/149 (28%), Positives = 68/149 (45%), Gaps = 30/149 (20%)

Query: 32  TQRLASLDIFRGLAVALMI----LVDHAGGDWPE---------ISHAPWNGCNLADFVMP 78
            QRL SLD  RG  +  ++    LV      WP          + HA W+G    D + P
Sbjct: 4   NQRLLSLDALRGFDMLFIMGFSGLVASLCALWPNPFTDAVAGSMGHAAWDGLTHHDTIFP 63

Query: 79  FFLFIVGVAIALALKRIPDRADA------VKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
            FLFI GV+   +L +   RA+       + KVI R + L+  G++  G F         
Sbjct: 64  LFLFIAGVSFPFSLAK--QRANGLGERAILGKVIRRGVTLVVLGLVYNGLFK-------- 113

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIF 161
            +D   +R+  VL RI L+++  +++ I+
Sbjct: 114 -LDFASLRVASVLGRIGLAWMFAAILYIY 141



 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 55/102 (53%), Gaps = 1/102 (0%)

Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQWVTMGFALLIFGLTLHFT 342
           +  F+PEGLLS+V +I+S + G+  G  +   + G     K       A  I  + + ++
Sbjct: 198 YKTFDPEGLLSTVPAIVSAMFGMFTGEFLRRERPGLTGDRKALYMALAAAAITLVGIAWS 257

Query: 343 NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
             +P+NK+L++ S+ CV +G +  +F+  Y L+D   +  WT
Sbjct: 258 GVMPINKKLWSSSFTCVVTGYSLGMFALFYYLIDVRGWKRWT 299


>gi|408370371|ref|ZP_11168148.1| hypothetical protein I215_05677 [Galbibacter sp. ck-I2-15]
 gi|407744129|gb|EKF55699.1| hypothetical protein I215_05677 [Galbibacter sp. ck-I2-15]
          Length = 394

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 58/110 (52%), Gaps = 9/110 (8%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDH----AGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
           + + R+ S+DI RG+ + LM+ V+         W   +    +G  LAD+V P FLF+VG
Sbjct: 1   MNSNRIMSIDIMRGITLFLMLFVNDLFIPGVPKWLVHTQEWEDGMGLADWVFPGFLFMVG 60

Query: 86  VAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELT 131
           ++I  A+K   ++  +  +    VI RTL LL  GIL+    S    ELT
Sbjct: 61  LSIPYAMKARKNKGQSNLRLWSHVIMRTLSLLLIGILMV-NISRVNPELT 109


>gi|319902718|ref|YP_004162446.1| hypothetical protein Bache_2925 [Bacteroides helcogenes P 36-108]
 gi|319417749|gb|ADV44860.1| hypothetical protein Bache_2925 [Bacteroides helcogenes P 36-108]
          Length = 380

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 41/152 (26%), Positives = 70/152 (46%), Gaps = 25/152 (16%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMIL---VDHAGG---DWP-------EISHAPWNGC 70
           QQ+   + + RLASLD+ RG  + L++    V  + G   + P       +  H  W G 
Sbjct: 6   QQDSLKISSSRLASLDVLRGFDLFLLVFFQPVLMSLGQQLNLPFMDVVLYQFDHEVWEGF 65

Query: 71  NLADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGG-FSHA 126
              D +MP FLF+ GV++  +  +    PD+    +K+  R L L   G+++QG      
Sbjct: 66  RFWDLIMPLFLFMTGVSMPFSFAKYQSSPDKCIIYRKIFRRVLLLFLLGMVVQGNLLGLN 125

Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
           P  + +  +         LQ IA+ YL+  ++
Sbjct: 126 PKHIYFYTN--------TLQAIAVGYLIAGMI 149


>gi|255035026|ref|YP_003085647.1| hypothetical protein Dfer_1233 [Dyadobacter fermentans DSM 18053]
 gi|254947782|gb|ACT92482.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 401

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 51/104 (49%), Gaps = 9/104 (8%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPF 79
             K    + RL S+D+FR + + LMI V+      A   W E S A  +   L+D V P 
Sbjct: 1   MNKVASSSLRLDSIDVFRAVTMLLMIFVNDFWTLEAVPKWLEHSKAEEDAMGLSDVVFPA 60

Query: 80  FLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILL 119
           FLFIVG++I  A+     + D     ++ +  RT  LL  GI +
Sbjct: 61  FLFIVGLSIPFAISNRRKKGDGNALIIRHIAERTFALLLMGIFI 104


>gi|317505448|ref|ZP_07963366.1| ABC superfamily ATP binding cassette transporter permease subunit
           [Prevotella salivae DSM 15606]
 gi|315663361|gb|EFV03110.1| ABC superfamily ATP binding cassette transporter permease subunit
           [Prevotella salivae DSM 15606]
          Length = 380

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 70/152 (46%), Gaps = 29/152 (19%)

Query: 31  KTQRLASLDIFRGLAVALMILVDH-----------AGGD-----WPEISHAPWNGCNLAD 74
           K  RL SLDI RG  +A+++LV             A G        ++ H PW G    D
Sbjct: 9   KPNRLLSLDILRGADLAMLVLVQPILLKALETMQPAEGTVGHFIMGQLLHLPWEGFCFWD 68

Query: 75  FVMPFFLFIVGVAIALALKRIP--DRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
            +MP F+F+ G+ I  A+ R    +R D    ++++ R + L   G++ QG         
Sbjct: 69  IIMPLFMFMSGITIPFAMARYKRGERIDGSFYRRILKRFVVLWILGMVCQGNL------- 121

Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEIF 161
               D++ + L    LQ IA+ Y+ V+ + +F
Sbjct: 122 -LAFDLQQLHLYSNTLQSIAVGYVAVAFLYVF 152


>gi|15806610|ref|NP_295325.1| hypothetical protein DR_1602 [Deinococcus radiodurans R1]
 gi|6459373|gb|AAF11168.1|AE002004_7 hypothetical protein DR_1602 [Deinococcus radiodurans R1]
          Length = 388

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 68/134 (50%), Gaps = 22/134 (16%)

Query: 34  RLASLDIFRGLAVALMILVDHA--GGDWP-EISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL +LD +RGL V LM+LV++   G   P ++SHA + G  L D V P+FLF  G A+  
Sbjct: 33  RLTALDAWRGLTVLLMLLVNNVALGDSTPRQLSHAHFGGLTLTDLVFPWFLFCAGAALPF 92

Query: 91  ALKRIPDRADAVKKVIFRTL----KLLFWGILLQGGF--SHAPDELTYGVDVRMIRLCGV 144
           +   + ++A      ++R L     LL+    L G F  S     LT G+        GV
Sbjct: 93  SAAAM-NKAGVTGWPLYRRLLERAALLY----LMGAFVTSVTSHRLTLGL--------GV 139

Query: 145 LQRIALSYLLVSLV 158
           LQ IAL+    +L+
Sbjct: 140 LQLIALASFFAALL 153


>gi|300123408|emb|CBK24681.2| unnamed protein product [Blastocystis hominis]
          Length = 213

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 41/70 (58%), Gaps = 7/70 (10%)

Query: 16  ISEPDVSDQQEKSHLKTQ----RLASLDIFRGLAVALMILVDHAGGDWP-EISHAPWNGC 70
           ++EP +  Q+EK  + +Q    R+ S+D+FRG+ + +MI  ++  G +   + HA W+G 
Sbjct: 144 VNEPLI--QKEKQSVVSQPMKSRVQSIDVFRGITICIMIFANYGAGQYSHSLMHAAWDGI 201

Query: 71  NLADFVMPFF 80
             ADF  P +
Sbjct: 202 TFADFAFPLY 211


>gi|88859970|ref|ZP_01134609.1| hypothetical protein PTD2_18200 [Pseudoalteromonas tunicata D2]
 gi|88817964|gb|EAR27780.1| hypothetical protein PTD2_18200 [Pseudoalteromonas tunicata D2]
          Length = 378

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 27/144 (18%)

Query: 31  KTQRLASLDIFRGLAV-----------ALMILVDHAGGDWP----EISHAPWNGCNLADF 75
           + +RLASLD  RG+ +           AL IL    G  W     +  H+ W+G    D 
Sbjct: 9   QKRRLASLDALRGMDMFWILGGEKIFAALFILTGWTG--WQVAHGQTLHSNWHGFTFYDL 66

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIF-RTLKLLF----WGILLQGGFSHAPDEL 130
           + P F+F+ GVA+ L+ KRI       ++V + + LK LF    +G+L   G+       
Sbjct: 67  IFPLFIFLAGVAMGLSPKRIDHLPFQERRVYYAKALKRLFLLAGFGVLYNHGWGTGIP-- 124

Query: 131 TYGVDVRMIRLCGVLQRIALSYLL 154
               ++  IR   VL RIA+++ +
Sbjct: 125 ---FNLEEIRYASVLGRIAIAWFV 145



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 51/93 (54%), Gaps = 8/93 (8%)

Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM----GFALLIFGLTLHFTN 343
           +PEGLLS+V ++++ ++GV  G +I        +  +W T+    G  L    L   +  
Sbjct: 212 DPEGLLSNVPAVVNALMGVFAGRLIAKAN----QFGEWKTVSYLFGAGLTSLALGWAWNM 267

Query: 344 AIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             P+NK L+T S+V VT G + +  ++ YA+VD
Sbjct: 268 VFPVNKDLWTSSFVLVTVGWSLIFLASFYAVVD 300


>gi|325106033|ref|YP_004275687.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324974881|gb|ADY53865.1| hypothetical protein Pedsa_3330 [Pedobacter saltans DSM 12145]
          Length = 397

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 56/96 (58%), Gaps = 8/96 (8%)

Query: 32  TQRLASLDIFRGLAVALMILVD--HAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           + R+ S+DI RGL + LM+ V+  +  G   W   + A  +G  LAD+V P FLF+VG++
Sbjct: 6   SVRILSIDIMRGLTLFLMLFVNDLYEPGVPKWLVHTKANVDGMGLADWVFPGFLFMVGLS 65

Query: 88  IALALKRIPDRADAVKK----VIFRTLKLLFWGILL 119
           I  A+K    + ++  K    ++ R L LLF GIL+
Sbjct: 66  IPYAVKARKAKGESGFKIFVHILLRALSLLFIGILM 101


>gi|292609605|ref|XP_002660455.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Danio
           rerio]
          Length = 292

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 31/47 (65%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           +RL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD V P+
Sbjct: 246 RRLRSLDTFRGLSLVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPW 292


>gi|134025078|gb|AAI35092.1| Unknown (protein for IMAGE:7224994) [Danio rerio]
          Length = 291

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 31/47 (65%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
           +RL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD V P+
Sbjct: 245 RRLRSLDTFRGLSLVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPW 291


>gi|374373619|ref|ZP_09631279.1| hypothetical protein NiasoDRAFT_2435 [Niabella soli DSM 19437]
 gi|373234592|gb|EHP54385.1| hypothetical protein NiasoDRAFT_2435 [Niabella soli DSM 19437]
          Length = 397

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 55/188 (29%), Positives = 83/188 (44%), Gaps = 31/188 (16%)

Query: 34  RLASLDIFRGLAVALMILVD--HAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           R+ S+DI RG+ + LM+ V+  +  G   W   + A  +   LAD+V P FLF+VG++I 
Sbjct: 8   RIRSIDIMRGITLCLMLFVNDLYEPGVPHWLVHTKAETDSMGLADWVFPGFLFMVGLSIP 67

Query: 90  LALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDEL--------------- 130
            A+     + D     V  ++FR++ LL  G+L+  G    P                  
Sbjct: 68  FAIDSRRRKGDEWPQLVLHILFRSVSLLIIGLLMLNGGRVNPQLTGMPVLLWKSLVYLCI 127

Query: 131 -----TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD--KDQSVGRFSIFRLYCW 183
                TY V+ RM     +LQ   +  LL  LV IF   +    K    G + I  L  W
Sbjct: 128 FLVWNTYPVNKRMKPFFILLQLAGIGGLL-YLVWIFKAGIPGAIKWMETGWWGILGLIGW 186

Query: 184 HWLMAACV 191
            +L AA +
Sbjct: 187 GYLTAALI 194


>gi|326663866|ref|XP_696425.5| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Danio rerio]
          Length = 170

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 54/104 (51%), Gaps = 17/104 (16%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA- 344
           PF+PEG+L S++SIL   +G+  G +++H +    + +Q +T     L++GL L   +A 
Sbjct: 2   PFDPEGVLGSINSILMAFLGLQAGKILLHYRD---QHRQIIT---RFLMWGLILGIISAV 55

Query: 345 ----------IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
                     IP+NK L++LSYV   S  A +     Y  VD  
Sbjct: 56  LTKCSRNDGFIPVNKNLWSLSYVTTLSCFAFVALVFFYYTVDVK 99


>gi|326799399|ref|YP_004317218.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326550163|gb|ADZ78548.1| hypothetical protein Sph21_1988 [Sphingobacterium sp. 21]
          Length = 398

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 56/101 (55%), Gaps = 16/101 (15%)

Query: 29  HLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW--------NGCNLADFVMPFF 80
            + ++R+ S+DI RGL + LM+ V+    D  E     W        +G  LAD+V P F
Sbjct: 5   KVASERILSVDIMRGLTLLLMLFVN----DLFEPGVPAWLLHTKVDVDGMGLADWVFPGF 60

Query: 81  LFIVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGI 117
           LFIVGV++  A++   ++ ++ +++I     RTL LL  G+
Sbjct: 61  LFIVGVSVPYAIRSRLNKGESKRQIIGHIAVRTLSLLIIGV 101


>gi|315126665|ref|YP_004068668.1| hypothetical protein PSM_A1588 [Pseudoalteromonas sp. SM9913]
 gi|315015179|gb|ADT68517.1| conserved hypothetical protein [Pseudoalteromonas sp. SM9913]
          Length = 307

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 66/304 (21%), Positives = 123/304 (40%), Gaps = 82/304 (26%)

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
           +FI+G A+  + K+    A+A +      L+L+  G ++      A +   +   +  +R
Sbjct: 1   MFIIGSAMFFSFKKTNSTANASQ-----VLRLIKRGAIIFA-IGLALNMYPFTTSIENLR 54

Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
           + GVLQRI ++Y+L S+  +           + R  +  L       +  +L+ Y  LL 
Sbjct: 55  ILGVLQRIGIAYILASICVLL----------LNRRGVISL-------SVVILIAYWLLLL 97

Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
               P+  +T+ N                        N V  +D  VLG +H++      
Sbjct: 98  SVG-PNNAYTLEN------------------------NLVRTVDIAVLGESHLWQGKG-- 130

Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
                                     F+PEGLLS++ S++S + G     ++  +     
Sbjct: 131 ------------------------LAFDPEGLLSTLPSVVSVLFGFEVTRLLTSSSSQWT 166

Query: 321 RLKQWVTMGFALLIFGLTLHFTNAI--PLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
            +K+ + +G    + G+ L    ++  P+NK L+T ++V  TSG A +V +    L  C+
Sbjct: 167 SIKRLLVIG----VVGIALGQLGSLIMPINKSLWTSTFVIYTSGIACIVLAFFVWL--CD 220

Query: 379 IFGP 382
           I  P
Sbjct: 221 IVKP 224


>gi|187735009|ref|YP_001877121.1| transmembrane protein [Akkermansia muciniphila ATCC BAA-835]
 gi|187425061|gb|ACD04340.1| putative transmembrane protein [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 47/147 (31%), Positives = 63/147 (42%), Gaps = 25/147 (17%)

Query: 28  SHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW--PEISHAPWNGCNLAD 74
           S  + QR+A++D  RG           L VA + L      +W     +H  W G    D
Sbjct: 5   SDTRPQRIAAIDALRGFDMFFLTGGLALVVAGINLFYDRSPEWLVKHSTHVAWEGFAAWD 64

Query: 75  FVMPFFLFIVGVAIALAL-KRIPDRA--DAVKKVIFRTLKLLFWGILLQGG-FSHAPDEL 130
            VMP FLFIVG A+  +  KRI          KV  R + L   G+++QG   S  P   
Sbjct: 65  LVMPLFLFIVGTAMPFSFSKRIGSEPLWKIYLKVARRVVVLFLLGMVVQGNLLSFEPS-- 122

Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSL 157
                 RM   C  LQ IA  YL+ ++
Sbjct: 123 ------RMSLYCNTLQAIASGYLIAAI 143


>gi|377572860|ref|ZP_09801940.1| hypothetical protein MOPEL_003_01300 [Mobilicoccus pelagius NBRC
           104925]
 gi|377538518|dbj|GAB47105.1| hypothetical protein MOPEL_003_01300 [Mobilicoccus pelagius NBRC
           104925]
          Length = 439

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 73/292 (25%), Positives = 114/292 (39%), Gaps = 81/292 (27%)

Query: 31  KTQRLASLDIFRGLAVALMILVDH--AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           +  RL SLD+ RG+ + + ++V+      +W E  HA W G +  D V P F+ + G  +
Sbjct: 8   RGGRLESLDVCRGVMLVVSVVVNAWFTAPEWFE--HAAWTGVHPVDLVFPAFVTLSGAGM 65

Query: 89  ALAL-KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG-VDVRMIRLCGVLQ 146
           A+A  +R+P     V + + R L L   G+     F+ A   L  G VDV  +R  GVLQ
Sbjct: 66  AIAFARRVP-----VARQVRRVLVLTAAGL----AFAVAGQVLGTGAVDVATLRFTGVLQ 116

Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
             A   L + LV +  +                   WHWL AA V+    A L  ++   
Sbjct: 117 LYAFLVLALGLVAVVVR-----------------RWWHWLAAAAVVAGAQAWLLASWASS 159

Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
                + K                      CN  G +D  V G  HMY            
Sbjct: 160 CPGGALTKA---------------------CNPSGVVDAAVFG-PHMY------------ 185

Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-IHTKG 317
                 G L  D          PEG +++  ++++ ++G   G ++  H +G
Sbjct: 186 ----VMGRLGHD----------PEGFVAAAGALVTALVGAAAGRLMWEHRRG 223


>gi|260911058|ref|ZP_05917694.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
           F0295]
 gi|260634862|gb|EEX52916.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
           F0295]
          Length = 409

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 73/326 (22%), Positives = 132/326 (40%), Gaps = 86/326 (26%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF--RTLK--LLFW- 115
           +I+H PW G    D +MP F+F+ G+ I  ++ +   R ++   V F  R LK  ++ W 
Sbjct: 83  QITHVPWQGFCFWDIIMPLFMFMSGITIPFSMAKY-QRGESKAGVGFLLRLLKRFVVLWV 141

Query: 116 -GILLQGGFSHAPDELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
            G+++QG            +D R + L    LQ IA+ Y++V+L+ ++T           
Sbjct: 142 LGMVVQGNL--------LALDARQLHLYSNTLQSIAVGYVVVALLFVYTS---------- 183

Query: 174 RFSIFRLYCWHWLMA--ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRA 231
                    W   +A  A  +V Y+A+                  A +G++         
Sbjct: 184 ---------WRTQLAVVASCMVAYVAVF-----------------AIWGQM--------- 208

Query: 232 KLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPL-RKDAPSWCHAPFEPE 290
                 N    IDR+VLG         WR           +G +   D   W  A +   
Sbjct: 209 DFTIDANICEEIDRQVLG--------RWR-----------DGVIWNGDQWQW-DATYHYT 248

Query: 291 GLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQ 350
            +LSS++ + +  +G   G V+  ++   ++L+  +  G  L++    L  +  +P+ K 
Sbjct: 249 WILSSLNFVGTVYLGYLAGVVLRTSQSGTSKLRTLLLSGVGLIVLAFAL--SPVVPIIKH 306

Query: 351 LYTLSYVCVTSGAAALVFSAIYALVD 376
           +++ S      G   L+ +A Y  VD
Sbjct: 307 IWSTSMTFFAGGICFLLMAAAYYWVD 332


>gi|296121958|ref|YP_003629736.1| hypothetical protein Plim_1707 [Planctomyces limnophilus DSM 3776]
 gi|296014298|gb|ADG67537.1| conserved hypothetical protein [Planctomyces limnophilus DSM 3776]
          Length = 378

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 72/307 (23%), Positives = 118/307 (38%), Gaps = 62/307 (20%)

Query: 63  SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGIL 118
            H  W GC+L D V P F+F+VGV+I  +L     +         + ++R++ L+  GI 
Sbjct: 32  EHVAWRGCSLWDMVQPSFMFLVGVSIPWSLAAQKSKNVSTGQGWVRAVWRSVLLVVLGIF 91

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
           L      + D               VL +I L YL+V  V   ++ +             
Sbjct: 92  LISNNKPSTD----------FSFVNVLTQIGLGYLVVYAVAQSSRPLAP----------- 130

Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
                   MAA +L  Y  LL+  +    Q  I +++ A             A+LN P  
Sbjct: 131 -------FMAAAILAAYW-LLFALWPVVSQGPIGSQELA-------------AQLNLP-- 167

Query: 239 AVGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGL--LSS 295
               IDR+ L G    ++      + A + D  F     ++ P      F   G   L+ 
Sbjct: 168 ----IDRQQLAGFGAHWNQ---HLNPAASFDRWFLNLFPRETPF----VFNRGGYQTLNF 216

Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLS 355
           + SI + ++G+  G  I    G      + V  G  + + G  L  T   PL K+++T S
Sbjct: 217 IPSIATMLLGLAAGQTIREVPGAFPAAIRLVLGGVVIALLGWLLDITGVCPLVKRIWTPS 276

Query: 356 YVCVTSG 362
           +   + G
Sbjct: 277 WTLWSGG 283


>gi|440749360|ref|ZP_20928608.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
 gi|436482365|gb|ELP38488.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
          Length = 401

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/95 (38%), Positives = 50/95 (52%), Gaps = 9/95 (9%)

Query: 33  QRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           +R+ S+D+FR + + LMI V+          W     A  +G  LAD V P FL IVG++
Sbjct: 11  RRVYSIDVFRAITMMLMIFVNDLWTLEGIPAWLGHVDAKEDGMGLADVVFPAFLVIVGLS 70

Query: 88  IALAL-KRIPDR---ADAVKKVIFRTLKLLFWGIL 118
           I  AL KRI      A  +K + FRTL LL  G+ 
Sbjct: 71  IPFALSKRIEKGERLAGTLKHIFFRTLALLTMGVF 105


>gi|332668157|ref|YP_004450945.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332336971|gb|AEE54072.1| Protein of unknown function DUF2261, transmembrane
           [Haliscomenobacter hydrossis DSM 1100]
          Length = 387

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 17/103 (16%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFF 80
           +K QRL S+DI R + + LMI V+    D   ++H P W        +G  L+D V P F
Sbjct: 1   MKNQRLPSIDILRAVTMLLMIFVN----DLWSLTHVPHWLLHTAAEEDGMGLSDVVFPAF 56

Query: 81  LFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILL 119
           LFIVG++I  ALK    +   +   +  ++ RT  LL  G+ +
Sbjct: 57  LFIVGLSIPHALKARLEKGASKGSVMLHILSRTFALLVMGLFM 99


>gi|333379187|ref|ZP_08470911.1| hypothetical protein HMPREF9456_02506 [Dysgonomonas mossii DSM
           22836]
 gi|332885455|gb|EGK05704.1| hypothetical protein HMPREF9456_02506 [Dysgonomonas mossii DSM
           22836]
          Length = 395

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 53/96 (55%), Gaps = 9/96 (9%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVG 85
           K +R+AS+DI+R L +  MI V+          W E + A  +    +D V P FLFI+G
Sbjct: 8   KPKRIASIDIYRALTMFFMIFVNDLWSVSNVPHWLEHAAANEDMLGFSDIVFPSFLFILG 67

Query: 86  VAIALALKRIPDRADA----VKKVIFRTLKLLFWGI 117
           ++I LA++    + D+    +K +I R++ LL  G+
Sbjct: 68  MSIPLAIEIRKKKGDSNSGILKHIIIRSIALLVMGL 103


>gi|345517559|ref|ZP_08797028.1| hypothetical protein BSFG_03809 [Bacteroides sp. 4_3_47FAA]
 gi|254837353|gb|EET17662.1| hypothetical protein BSFG_03809 [Bacteroides sp. 4_3_47FAA]
          Length = 359

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 65/144 (45%), Gaps = 25/144 (17%)

Query: 32  TQRLASLDIFRGLAVALMILVD------HAGGDWP-------EISHAPWNGCNLADFVMP 78
           + RL SLD+ RGL + L++             D+P       +  H  W G    D VMP
Sbjct: 7   SSRLDSLDMLRGLDLFLLVFFQPVLMSFGQQTDFPWMTSILYQFEHEVWVGFRFWDLVMP 66

Query: 79  FFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
            FLF+ GV++  +  +   I DR    +K+  R L L   G+++QG           G+D
Sbjct: 67  LFLFMTGVSMPFSFAKYRDISDRNAVYRKITRRFLLLFLLGMVVQGNL--------LGLD 118

Query: 136 VRMIRLC-GVLQRIALSYLLVSLV 158
              I L    LQ IA  YL+ +L+
Sbjct: 119 WEHIYLYNNTLQAIAAGYLIAALL 142


>gi|393788826|ref|ZP_10376952.1| hypothetical protein HMPREF1068_03232 [Bacteroides nordii
           CL02T12C05]
 gi|392653932|gb|EIY47582.1| hypothetical protein HMPREF1068_03232 [Bacteroides nordii
           CL02T12C05]
          Length = 376

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 64/145 (44%), Gaps = 26/145 (17%)

Query: 34  RLASLDIFRGLAVALMILVD------HAGGDWP-------EISHAPWNGCNLADFVMPFF 80
           RLASLDI RG  + L++             + P       +  H  W G    D VMP F
Sbjct: 11  RLASLDILRGFDLFLLVFFQPVFVALARQLNLPFLDEVLYQFDHEVWEGFRFWDLVMPLF 70

Query: 81  LFIVGVAIALALKRIP----DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           LF+ G ++  +L +      D     ++++ R + L  +G+++QG           G D 
Sbjct: 71  LFMTGASMPFSLSKYKTASVDYWPVYRRILKRVILLFIFGMIVQGNL--------LGFDS 122

Query: 137 RMIRL-CGVLQRIALSYLLVSLVEI 160
           + I      LQ IA+ Y + +++++
Sbjct: 123 KHIYFYSNTLQSIAVGYFIAAVIQL 147


>gi|167764058|ref|ZP_02436185.1| hypothetical protein BACSTE_02441 [Bacteroides stercoris ATCC
           43183]
 gi|167698174|gb|EDS14753.1| hypothetical protein BACSTE_02441 [Bacteroides stercoris ATCC
           43183]
          Length = 394

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 60/114 (52%), Gaps = 15/114 (13%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLF 82
            +L  QR+A++D+FR L + LM+ V+   G      W   + A  +    +D + P FLF
Sbjct: 2   KNLTLQRVAAVDVFRALTMFLMLFVNDIPGLKNVPHWLMHAAADEDMLGFSDTIFPAFLF 61

Query: 83  IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSHA 126
            +G++++ A++    + D   +VI    +RT+ L+  G+       ++GG SH+
Sbjct: 62  CMGMSVSFAIQNRYKKGDTTTQVIAHIFWRTVALIAMGLFSLNSGGIEGGLSHS 115


>gi|256424049|ref|YP_003124702.1| hypothetical protein Cpin_5069 [Chitinophaga pinensis DSM 2588]
 gi|256038957|gb|ACU62501.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
          Length = 390

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 49/94 (52%), Gaps = 9/94 (9%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
           +QRL S+D FR L +  MI V+   G     +W +   A  +G   AD V P FLFIVG+
Sbjct: 5   SQRLLSIDAFRALTMLTMIFVNDVSGVKNIPEWIDHVKAQDDGMGFADTVFPAFLFIVGL 64

Query: 87  AIALALKRIPDRADAV----KKVIFRTLKLLFWG 116
           +I  A+ +   + D+       ++ R+L ++  G
Sbjct: 65  SIPFAIGKRISKQDSFFSIESHILLRSLAMIVMG 98


>gi|427384705|ref|ZP_18881210.1| hypothetical protein HMPREF9447_02243 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727966|gb|EKU90825.1| hypothetical protein HMPREF9447_02243 [Bacteroides oleiciplenus YIT
           12058]
          Length = 398

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 15/113 (13%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLF 82
            +L  QR+A++D+FR L + LM+ V+   G      W E +    +    +D + P FLF
Sbjct: 6   KNLAPQRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLEHAEMNEDMMGFSDTIFPAFLF 65

Query: 83  IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
            +G++++ A++    + D   +VI    +RT+ L+  G+       ++GG SH
Sbjct: 66  CMGMSVSFAIQNRYRKGDTTLQVIAHVFWRTVALIAMGLFSLNSGGIEGGLSH 118


>gi|237722081|ref|ZP_04552562.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|293373568|ref|ZP_06619919.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|299145142|ref|ZP_07038210.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
 gi|229448950|gb|EEO54741.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|292631466|gb|EFF50093.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|298515633|gb|EFI39514.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
          Length = 377

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/145 (26%), Positives = 64/145 (44%), Gaps = 26/145 (17%)

Query: 34  RLASLDIFRGLAVALMIL-------------VDHAGGDWPEISHAPWNGCNLADFVMPFF 80
           RLASLDI RG  + L++              +        +  H  W G    D VMP F
Sbjct: 12  RLASLDILRGFDLFLLVFFQPVFVALARQMNMSFLDSILYQFDHEVWEGFRFWDLVMPLF 71

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF----WGILLQGGFSHAPDELTYGVDV 136
           LF+ G ++  +L +      +   V  R LK +F    +G+++QG           G+D 
Sbjct: 72  LFMTGASMPFSLSKYIGTTGSYWPVYRRILKRVFLLFIFGMIVQGNL--------LGLDA 123

Query: 137 RMIRL-CGVLQRIALSYLLVSLVEI 160
             + L    LQ IA+ YL+ +++++
Sbjct: 124 THLYLYSNTLQSIAVGYLIAAVIQL 148


>gi|430744438|ref|YP_007203567.1| hypothetical protein Sinac_3623 [Singulisphaera acidiphila DSM
           18658]
 gi|430016158|gb|AGA27872.1| hypothetical protein Sinac_3623 [Singulisphaera acidiphila DSM
           18658]
          Length = 454

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 55/102 (53%), Gaps = 7/102 (6%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW---PEISHAPWNGCNLADFVMPFFLF 82
           E +  KT R+ S+D FRG  VA M +V+  GG     P + H   N  + AD +MP FLF
Sbjct: 50  EAAGTKTGRIVSMDQFRGYTVAGMCVVNFLGGLQAIHPVLKHNN-NYFSYADTIMPSFLF 108

Query: 83  IVGVAIAL-ALKRIPDRADAV--KKVIFRTLKLLFWGILLQG 121
             G +  L ALKR+     A   ++ ++R+L L+   +++ G
Sbjct: 109 ACGFSYRLTALKRLDQFGPAAMYRRFVWRSLGLVLLSLMMYG 150


>gi|456861595|gb|EMF80245.1| hypothetical protein LEP1GSC188_2621 [Leptospira weilii serovar
           Topaz str. LT2116]
          Length = 179

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 59/124 (47%), Gaps = 30/124 (24%)

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
           +IDR ++G  H+     W+ SK                       ++PEGLLS V+SI +
Sbjct: 82  WIDRVIIGEKHL-----WKFSKT----------------------WDPEGLLSGVASIAT 114

Query: 302 TIIGVHFGHVI-IHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT 360
           ++ GV  G ++ +   G  +R+     +GF     GL   +  ++P+NK L+T SY   T
Sbjct: 115 SLFGVLCGFILFLRESGGRSRVFSIFGLGFLFTFVGLL--WDQSLPMNKSLWTGSYAAYT 172

Query: 361 SGAA 364
           +G A
Sbjct: 173 AGLA 176


>gi|408674314|ref|YP_006874062.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
 gi|387855938|gb|AFK04035.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
          Length = 391

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 81/367 (22%), Positives = 135/367 (36%), Gaps = 68/367 (18%)

Query: 33  QRLASLDIFRGLAVALMIL-VDHAGG---DWPEIS----------HAPWNGCNLADFVMP 78
            RL S DI+RG  + LM+  V H G      PE S          H  W GC+L D + P
Sbjct: 5   NRLTSADIYRGFVMLLMMAEVLHFGKVSEALPESSFWAFLAFHQDHVEWVGCSLHDLIQP 64

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
            F F+VGV +  ++ R   + +         LK     ILL           TY      
Sbjct: 65  SFSFLVGVVLPYSIARRLTQREGTNAAFLHALKRSLILILLGIFLRSQYKSQTY------ 118

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
                 L +I + Y  + L+   ++ VQ                   + A  +++V   L
Sbjct: 119 FTFEDTLSQIGMGYPFLFLLAFRSQKVQ-------------------ISALIIILVGYWL 159

Query: 199 LYGTY-VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
            +  Y +PD  F  +N   A   +  +      A  N   N     DR  L +       
Sbjct: 160 AFALYPLPDANFDYVNAGVAKDWE--HNLSDFSAHWNKNTNFAWAFDRWFLNL------- 210

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
            + R KA      F G                   LS + ++ + I+G+  G+++     
Sbjct: 211 -FPREKAFL----FNGG--------------GYATLSFIPTLGTMILGLLAGNILKSETK 251

Query: 318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDC 377
              +LKQ++ +G + L+ G+  +     P  K+++T ++V  + G   L  +  Y ++D 
Sbjct: 252 ADEKLKQFIILGVSGLMLGIIFNRLGICPNVKRIWTPTWVLFSGGLCFLFLAFFYWIIDV 311

Query: 378 NIFGPWT 384
                W 
Sbjct: 312 RGKSDWA 318


>gi|329960675|ref|ZP_08299018.1| conserved domain protein [Bacteroides fluxus YIT 12057]
 gi|328532548|gb|EGF59342.1| conserved domain protein [Bacteroides fluxus YIT 12057]
          Length = 394

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 63/118 (53%), Gaps = 23/118 (19%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMP 78
            +L  QR+A++D+FR L + LM+ V+    D P + + P W        +    +D + P
Sbjct: 2   KNLTPQRVAAVDVFRALTMFLMLFVN----DIPGLKNVPHWLMHAKIDEDMLGFSDTIFP 57

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSHA 126
            FLF +G++++LA++    + +   +VI    +RT+ LL  G+       ++GG SH+
Sbjct: 58  AFLFCMGMSVSLAIQNRYKKGNTTLQVISHIFWRTIALLAMGLFSLNSGGIEGGLSHS 115


>gi|115770385|ref|XP_001180412.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Strongylocentrotus purpuratus]
          Length = 78

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 47/78 (60%), Gaps = 6/78 (7%)

Query: 49  MILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----V 102
           M+L   A GD  +  +SHA W+G  +ADF+ P+F+FI+G +I L++  +  +  +     
Sbjct: 1   MLLAGGAYGDGHYWFVSHAIWSGITVADFMFPWFVFIMGTSIHLSINILLSKGQSYPSIY 60

Query: 103 KKVIFRTLKLLFWGILLQ 120
           KK++ R++ L   G+ +Q
Sbjct: 61  KKLVSRSITLFIMGVCIQ 78


>gi|283781521|ref|YP_003372276.1| hypothetical protein Psta_3761 [Pirellula staleyi DSM 6068]
 gi|283439974|gb|ADB18416.1| conserved hypothetical protein [Pirellula staleyi DSM 6068]
          Length = 417

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 86/399 (21%), Positives = 142/399 (35%), Gaps = 100/399 (25%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS------------- 63
           + P ++     + L   RL SLD +RG    +M+ +   G   P+++             
Sbjct: 4   AAPSLAASTPAATLPA-RLLSLDAYRGF---VMLAMASRGFGIPKVAALPQFASHPTWQF 59

Query: 64  ------HAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLL 113
                 H  W G    D + P F+F+VGVA+A +      + D   K+    IFR + L+
Sbjct: 60  LAGQLDHVAWVGSCFWDLIQPSFMFMVGVAMAYSCAARVSKGDPYWKMLLHAIFRAMVLI 119

Query: 114 FWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
             G+ L+   S++ D+  +           V  +I L YL + L+      VQ       
Sbjct: 120 ALGVFLR---SNSSDQTNF-------TFMDVTSQIGLGYLPLFLLWGRKFWVQATA---- 165

Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
                         A  +LV Y AL          F +    +ADY              
Sbjct: 166 --------------AIVILVGYFAL----------FALYPLPAADYNYA----------- 190

Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS--------PFEGPLRKDAPSWCHA 285
                AVG +D   +       H  W ++     D         P   P   D   +   
Sbjct: 191 -----AVG-VDENWVHFQGFQRH--WEKNANVAHDVDVKVLNWFPRPEPFMFDRGGYNTL 242

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
            F P        SI + I G+  G  +      L +    V  G A+L+ G  L+     
Sbjct: 243 NFIP--------SIATMIFGLIAGEWLRGGASGLKKFGILVGAGLAMLLIGWGLNELGVC 294

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           P+ K+++T S+   ++G   L+ +A Y +++   F  W 
Sbjct: 295 PVVKRIWTPSWAIYSTGWTCLLLAAFYGVIELIQFRFWA 333


>gi|389866878|ref|YP_006369119.1| hypothetical protein MODMU_5285 [Modestobacter marinus]
 gi|388489082|emb|CCH90660.1| conserved transmembrane protein of unknown function [Modestobacter
           marinus]
          Length = 327

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 41/71 (57%), Gaps = 4/71 (5%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           +RL  +D+ RGLAV  M++VD+ G      +  HA W+G ++AD V P FL + GV  ++
Sbjct: 2   RRLHGVDVLRGLAVVGMLVVDNRGNASIATQWHHAAWDGLHVADVVFPAFLLVAGV--SM 59

Query: 91  ALKRIPDRADA 101
              R  DR  A
Sbjct: 60  PFSRRADRPRA 70


>gi|149437198|ref|XP_001516670.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Ornithorhynchus anatinus]
          Length = 176

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 52/100 (52%), Gaps = 11/100 (11%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----LARLKQW-VTMGFALLIFGLTLHF 341
           ++PEG+L +++SI+   +GV  G +++  K      + R   W V MG   LI G+   F
Sbjct: 9   YDPEGILGTINSIVMAFLGVQAGKILLFYKEQHRQIMLRFLTWSVVMG---LISGVLTKF 65

Query: 342 TNA---IPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCN 378
           +     +P+NK L+++SYV   S  A +    IY  VD  
Sbjct: 66  SQNEGFVPINKNLWSISYVTTLSCFAFVALLLIYYFVDVK 105


>gi|225013130|ref|ZP_03703543.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
 gi|225002750|gb|EEG40733.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
          Length = 365

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 14/88 (15%)

Query: 32  TQRLASLDIFRGLAVALMIL-VDHAGGDWPEIS-------------HAPWNGCNLADFVM 77
           +QRL SLD FRG+ + L++    H  G + +               H  W G +  D + 
Sbjct: 8   SQRLRSLDFFRGVVMFLLVAEFSHLFGVFMKTENETITAAADFLFHHVQWEGLHFWDLIQ 67

Query: 78  PFFLFIVGVAIALALKRIPDRADAVKKV 105
           PFF+FIVGV+I  +     ++ D+ K++
Sbjct: 68  PFFMFIVGVSIPYSYANRLEKGDSEKQI 95


>gi|333382416|ref|ZP_08474086.1| hypothetical protein HMPREF9455_02252 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332828727|gb|EGK01419.1| hypothetical protein HMPREF9455_02252 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 394

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 56/108 (51%), Gaps = 9/108 (8%)

Query: 31  KTQRLASLDIFRGLAVALMILVD---HAGG--DWPEISHAPWNGCNLADFVMPFFLFIVG 85
           K  R+AS+DIFR L +  MI V+      G   W E + A  +    +D V P FLFI+G
Sbjct: 8   KPVRVASIDIFRALTMFFMIFVNDFWSVSGVPHWLEHAAASEDMLGFSDVVFPSFLFILG 67

Query: 86  VAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDE 129
           ++I LA++    + +  K++++    R++ LL  G+      S   DE
Sbjct: 68  MSIPLAMESRMKKGETKKQILWHIVVRSVALLVMGLFTVNLESGVADE 115


>gi|374309893|ref|YP_005056323.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358751903|gb|AEU35293.1| hypothetical protein AciX8_0944 [Granulicella mallensis MP5ACTX8]
          Length = 399

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 83/368 (22%), Positives = 140/368 (38%), Gaps = 82/368 (22%)

Query: 33  QRLASLDIFRGLAVALMIL----VDHAGGDWPE----------ISHAPWNGCNLADFVMP 78
           QR +++D +RG  +ALM+            +P+           SH  W G +L D + P
Sbjct: 13  QRNSAVDAYRGFVMALMLAEVFRFAFVAKSFPDNFLLHILAYNQSHVEWTGMSLHDMIQP 72

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGILLQGGFSHAPDELTYGV 134
            F F+VGVA+  +L+    + ++ K +    I+R+  L+  GI L+   S A D      
Sbjct: 73  SFTFLVGVALPYSLRSRRRKGESFKYMLGHTIWRSFLLVALGIFLRSIHSTATD------ 126

Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
                     L +I L Y    LV +                  R   W W+    VL  
Sbjct: 127 ----FTFEDTLTQIGLGYTFAFLVAL------------------RPARWLWITLGTVLFS 164

Query: 195 Y-LAL-LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
           Y LA  LY    P++ ++ +   S  +  +F    G  +  N   N     D   L   +
Sbjct: 165 YWLAWALYPAPGPNFPYSSVGVPSEWHQHLF---TGFMSHWNKNSNLGQAFDVWFL---N 218

Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGL--LSSVSSILSTIIGVHFGH 310
           ++  P           +PFE              F   G   LS + ++ + ++G+  G 
Sbjct: 219 LFPRP-----------TPFE--------------FNDGGYLTLSFIPTLGTMLLGLAAGR 253

Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
            ++     +   K+ +     LL  GL  HF    P+ K+++T S+   + G   L  +A
Sbjct: 254 WLLEATPKIP-FKKLLLAAAVLLGSGLLFHFAGICPIVKRIWTPSWTLFSGGVCFLFLAA 312

Query: 371 IYALVDCN 378
              +VD  
Sbjct: 313 FSWVVDVK 320


>gi|386821099|ref|ZP_10108315.1| hypothetical protein JoomaDRAFT_3082 [Joostella marina DSM 19592]
 gi|386426205|gb|EIJ40035.1| hypothetical protein JoomaDRAFT_3082 [Joostella marina DSM 19592]
          Length = 395

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 50/95 (52%), Gaps = 8/95 (8%)

Query: 33  QRLASLDIFRGLAVALMILVDH----AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
            R+ S+DI RGL + LM+ V+         W   S A  +   LAD+V P FLF+VG++I
Sbjct: 5   TRILSIDIMRGLTLFLMLFVNDLFEPGVPKWLVHSKATEDAMGLADWVFPGFLFMVGLSI 64

Query: 89  ALAL----KRIPDRADAVKKVIFRTLKLLFWGILL 119
             A     K+     + +K ++ RTL LL  G+ +
Sbjct: 65  PFAFLSRRKKGEGDLEILKHILVRTLSLLLIGVFM 99


>gi|168705120|ref|ZP_02737397.1| hypothetical protein GobsU_36644 [Gemmata obscuriglobus UQM 2246]
          Length = 387

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 44/92 (47%), Gaps = 10/92 (10%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG------GDWPEISHAPWNGC 70
           +EP        +     RLASLD FRG  V  M+LV+  G       D P ++H     C
Sbjct: 3   AEPPDPKGAAGAPASAPRLASLDQFRGYTVLGMLLVNFVGSFAVIKADVPVLAHHH-TYC 61

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADAV 102
           + AD +MP FLF VG A  L   R   R DAV
Sbjct: 62  SYADTIMPQFLFAVGFAFRLTFAR---RRDAV 90


>gi|224536805|ref|ZP_03677344.1| hypothetical protein BACCELL_01681 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521571|gb|EEF90676.1| hypothetical protein BACCELL_01681 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 394

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 61/113 (53%), Gaps = 15/113 (13%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPE-ISHAPWNG--CNLADFVMPFFLF 82
            +L  QR+A++D+FR L + LM+ V+   G  + P  + HA  N      +D + P FLF
Sbjct: 2   KNLTPQRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLKHAEMNEDMLGFSDTIFPAFLF 61

Query: 83  IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
            +G++++ A++    + D   +VI    +RT+ L+  G+       ++GG SH
Sbjct: 62  CMGMSVSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIEGGISH 114


>gi|305666718|ref|YP_003863005.1| hypothetical protein FB2170_10666 [Maribacter sp. HTCC2170]
 gi|88708942|gb|EAR01176.1| hypothetical protein FB2170_10666 [Maribacter sp. HTCC2170]
          Length = 395

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 58/114 (50%), Gaps = 19/114 (16%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADF 75
            EKS  KT R+AS+D+ R L + LMI V+    D+  ++  P W        +    +D 
Sbjct: 1   MEKS--KTLRIASIDVLRALTMLLMIWVN----DFWTLTQVPKWLTHAKPNEDYLGFSDI 54

Query: 76  VMPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
           + P FLFIVG++I  A+     +   R+   K ++ R++ LL  G+ +    +H
Sbjct: 55  IFPLFLFIVGLSIPFAINNRMAKGEPRSIMFKHIVIRSISLLIIGVFMVNYETH 108


>gi|423223322|ref|ZP_17209791.1| hypothetical protein HMPREF1062_01977 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638858|gb|EIY32689.1| hypothetical protein HMPREF1062_01977 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 394

 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 61/113 (53%), Gaps = 15/113 (13%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPE-ISHAPWNG--CNLADFVMPFFLF 82
            +L  QR+A++D+FR L + LM+ V+   G  + P  + HA  N      +D + P FLF
Sbjct: 2   KNLTPQRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLKHAEMNEDMLGFSDTIFPAFLF 61

Query: 83  IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
            +G++++ A++    + D   +VI    +RT+ L+  G+       ++GG SH
Sbjct: 62  CMGMSVSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIEGGISH 114


>gi|395803959|ref|ZP_10483200.1| hypothetical protein FF52_18830 [Flavobacterium sp. F52]
 gi|395433603|gb|EJF99555.1| hypothetical protein FF52_18830 [Flavobacterium sp. F52]
          Length = 396

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 44/93 (47%), Gaps = 5/93 (5%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMP 78
            + K +L  QR+ S+D  RG+ + +MI V+          W +   A  +     D V P
Sbjct: 1   MKIKENLFNQRIVSIDSLRGITIFVMIFVNELASIQNVPQWMKHMPADADAMTFVDLVFP 60

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
            FLFIVG++I  A      + D+ K +   TLK
Sbjct: 61  AFLFIVGMSIPFAFNARLIKGDSPKTIWTHTLK 93


>gi|270294981|ref|ZP_06201182.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270274228|gb|EFA20089.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 394

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 88/191 (46%), Gaps = 34/191 (17%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFF 80
           L  QR+A++D+FR L + LM+ V+    D P + + P W        +    +D + P F
Sbjct: 4   LTLQRIAAVDVFRALTMFLMLFVN----DIPGLKNVPHWLMHARMDEDMMGFSDTIFPAF 59

Query: 81  LFIVGVAIALALKRIPDRAD----AVKKVIFRTLKLLFWGIL------LQGGFSHAPDE- 129
           LF +G++++ A++    + D     V  + +RT+ L+  G+       ++GG SH     
Sbjct: 60  LFCMGMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSHPWFSI 119

Query: 130 -------LTYGVDVRMIRLCGVL--QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
                  LT+GV  + +    VL         LL++ + I+ KD+  K   +  + I  L
Sbjct: 120 LMVIGFFLTWGVYPKAVGTKKVLFTAMKTAGVLLLAFLVIY-KDMNGKPFQISWWGILGL 178

Query: 181 YCWHWLMAACV 191
             W + + A +
Sbjct: 179 IGWTYAVCAGI 189


>gi|429727718|ref|ZP_19262477.1| hypothetical protein HMPREF9998_00424 [Peptostreptococcus
           anaerobius VPI 4330]
 gi|429151771|gb|EKX94625.1| hypothetical protein HMPREF9998_00424 [Peptostreptococcus
           anaerobius VPI 4330]
          Length = 463

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 71/148 (47%), Gaps = 18/148 (12%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHA----PWNGCNLADFVMPFFLFIVGVAI 88
            R+ S+D  RGL V L + + + G +  +IS+A     WNG  L D ++P FL ++G +I
Sbjct: 47  MRVQSIDYMRGLLVILSMFMINQGLE-NQISYAFQNSKWNGMTLLDILVPMFLLVIGSSI 105

Query: 89  ALALKR----IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
              +K+      D    VK    +++ +   G++    +  A D          +RL G 
Sbjct: 106 PFYVKKHYEENEDLRHIVKMSFIKSIIVFVIGLIFSCIYYPANDY---------VRLTGP 156

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSV 172
           +Q +A  Y++  L+ I    ++ K+ ++
Sbjct: 157 IQMMAFVYIMSLLLYIGFLKMRIKNNAL 184


>gi|445497064|ref|ZP_21463919.1| hypothetical protein Jab_2c06620 [Janthinobacterium sp. HH01]
 gi|444787059|gb|ELX08607.1| hypothetical protein Jab_2c06620 [Janthinobacterium sp. HH01]
          Length = 381

 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 12/108 (11%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVG 85
           K  R+ ++D FRG+ + +MI V+   G      W E + A  +     D V P FLFIVG
Sbjct: 5   KPARVLAIDAFRGITILVMIFVNTLAGVRGMPAWMEHAPADADAMTFPDVVFPAFLFIVG 64

Query: 86  VAIALALKRIPDRADAV----KKVIFRTLKLLFWGILL---QGGFSHA 126
           ++I  A+ +     D      + V+ R   LL  G+ +   +GG++ A
Sbjct: 65  MSIPFAMAQRQAAGDTPAARWRHVLARAAGLLVLGVFMVNAEGGYNEA 112


>gi|431799248|ref|YP_007226152.1| hypothetical protein Echvi_3932 [Echinicola vietnamensis DSM 17526]
 gi|430790013|gb|AGA80142.1| Protein of unknown function (DUF1624) [Echinicola vietnamensis DSM
           17526]
          Length = 412

 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 57/128 (44%), Gaps = 26/128 (20%)

Query: 4   IKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS 63
           +   T    P  +SEP +         + +R  ++D+FR + + LMI V+     W    
Sbjct: 3   VNNPTKTQRPSKVSEPII---------EAKRSYAIDVFRAVTMLLMIFVNDL---WTLEG 50

Query: 64  HAPWNG--------CNLADFVMPFFLFIVGVAIALAL-----KRIPDRADAVKKVIFRTL 110
           +  W G           +D + P FLFIVG++I  AL     KRIP +    + +I R L
Sbjct: 51  YPDWLGHAAVGEDRLGFSDVIFPAFLFIVGLSIPFALQNRFRKRIP-KIKLAEHIILRGL 109

Query: 111 KLLFWGIL 118
            LL  GI 
Sbjct: 110 ALLVMGIF 117


>gi|372268269|ref|ZP_09504317.1| hypothetical protein AlS89_10220 [Alteromonas sp. S89]
          Length = 365

 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 63/146 (43%), Gaps = 23/146 (15%)

Query: 31  KTQRLASLDIFRGLAVALMI------LVDHAGGDWP-------EISHAPWNGCNLADFVM 77
           K QRLAS+D  RG  +  +I      L   A   W        ++ H PW+G    D + 
Sbjct: 4   KKQRLASVDALRGFDMFWIIGGEALFLPLFALTGWSIFQFGHAQMQHTPWHGFTFYDLIF 63

Query: 78  PFFLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
           P F+F+ GV + LA K      +  RA   +K   R L L+  GIL   G+         
Sbjct: 64  PLFIFLSGVTLGLANKSLRGLPVSQRAPVYRKATKRLLLLVLLGILYNHGWGTGIPA--- 120

Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLV 158
             D+  IR   VL RI  ++   +++
Sbjct: 121 --DLSEIRYASVLARIGFAWFFAAMI 144



 Score = 45.1 bits (105), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 43/75 (57%), Gaps = 3/75 (4%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVII-HTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           ++PEG+LS++ +I + + GV  G  +  H   H A LK     G A L+ G   H  +  
Sbjct: 199 YDPEGVLSTIPAIANALFGVFAGRWLSKHAGDHKAILKGLFAAGVACLVAGYVWH--SVY 256

Query: 346 PLNKQLYTLSYVCVT 360
           P+NK+L+T S+V +T
Sbjct: 257 PVNKELWTSSFVLIT 271


>gi|332662942|ref|YP_004445730.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332331756|gb|AEE48857.1| Protein of unknown function DUF2261, transmembrane
          [Haliscomenobacter hydrossis DSM 1100]
          Length = 394

 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 14/73 (19%)

Query: 34 RLASLDIFRGLAVALMIL----VDHAGGDWPEI----------SHAPWNGCNLADFVMPF 79
          RL S+D++RGL + LM+       H    +P+           SH PW GC+L D + P 
Sbjct: 9  RLGSVDVYRGLVMFLMMAEVLEFGHVAKAFPDSGFWAFLHFHQSHVPWVGCSLHDLIQPS 68

Query: 80 FLFIVGVAIALAL 92
          F F+VGVA+  +L
Sbjct: 69 FSFLVGVALPYSL 81


>gi|289422375|ref|ZP_06424221.1| hypothetical protein HMPREF0631_1471 [Peptostreptococcus anaerobius
           653-L]
 gi|289157210|gb|EFD05829.1| hypothetical protein HMPREF0631_1471 [Peptostreptococcus anaerobius
           653-L]
          Length = 463

 Score = 46.6 bits (109), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 34/148 (22%), Positives = 73/148 (49%), Gaps = 18/148 (12%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHA----PWNGCNLADFVMPFFLFIVGVAI 88
            R+ S+D  RGL V L + + + G +  +IS+A     WNG  L D ++P FL ++G +I
Sbjct: 47  MRVQSIDYMRGLLVILSMFMINQGLE-NQISYAFQNSKWNGMTLNDILVPMFLLVIGSSI 105

Query: 89  ALALKRIPDRADAVKKVI----FRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
              +K+  +  + ++ ++     +++ +   G++    +  A D          +RL G 
Sbjct: 106 PFYVKKHYEENEDIRHIVKMSFIKSIIVFLIGLIFSCIYYPANDY---------VRLTGP 156

Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSV 172
           +Q +   Y++  L+ I    ++ K+ ++
Sbjct: 157 IQMMVFVYIMSLLLYIGFLKMRIKNNAL 184


>gi|189464971|ref|ZP_03013756.1| hypothetical protein BACINT_01315 [Bacteroides intestinalis DSM
           17393]
 gi|189437245|gb|EDV06230.1| hypothetical protein BACINT_01315 [Bacteroides intestinalis DSM
           17393]
          Length = 394

 Score = 46.6 bits (109), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 58/113 (51%), Gaps = 15/113 (13%)

Query: 28  SHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLF 82
            +L  QR+A++D+FR L + LM+ V+   G      W E +    +    +D + P FLF
Sbjct: 2   KNLAPQRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLEHADINEDMMGFSDTIFPAFLF 61

Query: 83  IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
            +G++++ A++    + D   +VI    +RT+ L+  G+       + GG SH
Sbjct: 62  CMGMSVSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIAGGISH 114


>gi|456890764|gb|EMG01555.1| putative membrane protein [Leptospira borgpetersenii str.
           200701203]
          Length = 270

 Score = 46.6 bits (109), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 60/132 (45%), Gaps = 42/132 (31%)

Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
           +IDR++ G  H+     W+ SK                       ++PEG LS ++SI +
Sbjct: 77  WIDRRIFGEKHL-----WKFSKT----------------------WDPEGFLSGIASIAT 109

Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFT-------NAIPLNKQLYTL 354
           ++ GV  G ++   +G   R K  V     L IFGL   FT        ++P+NK L+T 
Sbjct: 110 SLFGVICGFILFRREG---RGKNRV-----LSIFGLGFLFTFVGLLWDRSLPMNKSLWTG 161

Query: 355 SYVCVTSGAAAL 366
           SY   T+G + L
Sbjct: 162 SYAVYTTGLSFL 173


>gi|254445881|ref|ZP_05059357.1| hypothetical protein VDG1235_4128 [Verrucomicrobiae bacterium
           DG1235]
 gi|198260189|gb|EDY84497.1| hypothetical protein VDG1235_4128 [Verrucomicrobiae bacterium
           DG1235]
          Length = 394

 Score = 46.6 bits (109), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 9/96 (9%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            R+ S+DIFRGL + LMI V+          W E +    +    +D + P FLFIVG++
Sbjct: 3   SRIHSIDIFRGLTMLLMIWVNDFWSLTNVPTWLEHAPGDADAMGFSDIIFPAFLFIVGLS 62

Query: 88  IALALKRIPDRADA----VKKVIFRTLKLLFWGILL 119
           I  AL+    + D+    +  ++ R+  LL  G L+
Sbjct: 63  IPFALRSRLAKGDSKPTIITHILARSFALLLMGFLM 98


>gi|302759308|ref|XP_002963077.1| hypothetical protein SELMODRAFT_404643 [Selaginella moellendorffii]
 gi|300169938|gb|EFJ36540.1| hypothetical protein SELMODRAFT_404643 [Selaginella moellendorffii]
          Length = 293

 Score = 46.6 bits (109), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 320 ARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           AR+  ++     LL  G  LH    I +NK LY+ SY+C T+GAA  VF  +Y LVD
Sbjct: 8   ARIINFIVPAVILLALGYVLHLL-GIKMNKPLYSFSYMCFTAGAAGAVFCLLYILVD 63


>gi|423304305|ref|ZP_17282304.1| hypothetical protein HMPREF1072_01244 [Bacteroides uniformis
           CL03T00C23]
 gi|423310581|ref|ZP_17288565.1| hypothetical protein HMPREF1073_03315 [Bacteroides uniformis
           CL03T12C37]
 gi|392681752|gb|EIY75109.1| hypothetical protein HMPREF1073_03315 [Bacteroides uniformis
           CL03T12C37]
 gi|392684891|gb|EIY78211.1| hypothetical protein HMPREF1072_01244 [Bacteroides uniformis
           CL03T00C23]
          Length = 394

 Score = 46.6 bits (109), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 59/115 (51%), Gaps = 23/115 (20%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFF 80
           L  QR+A++D+FR L + LM+ V+    D P + + P W        +    +D + P F
Sbjct: 4   LTLQRIAAVDVFRALTMFLMLFVN----DIPRLKNVPHWLMHARMDEDMMGFSDTIFPAF 59

Query: 81  LFIVGVAIALALKRIPDRAD----AVKKVIFRTLKLLFWGIL------LQGGFSH 125
           LF +G++++ A++    + D     V  + +RT+ L+  G+       ++GG SH
Sbjct: 60  LFCMGMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114


>gi|146302719|ref|YP_001197310.1| hypothetical protein Fjoh_4992 [Flavobacterium johnsoniae UW101]
 gi|146157137|gb|ABQ07991.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
          Length = 395

 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 43/90 (47%), Gaps = 5/90 (5%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFL 81
           K +L  QR+ S+D  RG+ + +MI V+          W +   A  +     D V P FL
Sbjct: 4   KENLYNQRIISIDALRGITIFVMIFVNELASIQNVPQWMKHMPADADAMTFVDLVFPAFL 63

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLK 111
           FIVG+++  A      + D+ K +   TLK
Sbjct: 64  FIVGMSVPFAFNARLIKGDSPKVIWTHTLK 93


>gi|456890770|gb|EMG01561.1| hypothetical protein LEP1GSC123_2562 [Leptospira borgpetersenii
          str. 200701203]
          Length = 74

 Score = 46.2 bits (108), Expect = 0.024,   Method: Composition-based stats.
 Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 3/46 (6%)

Query: 38 LDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
          +D+FRG+ V  MILV++ G     +  + HA WNGC   D V PFF
Sbjct: 1  MDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFF 46


>gi|320450186|ref|YP_004202282.1| hypothetical protein TSC_c11130 [Thermus scotoductus SA-01]
 gi|320150355|gb|ADW21733.1| putative membrane protein [Thermus scotoductus SA-01]
          Length = 334

 Score = 46.2 bits (108), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 43/75 (57%), Gaps = 8/75 (10%)

Query: 32  TQRLASLDIFRGLAVALMILVDH-AGGDWPEISHAPWNG-CNLADFVMPFFLFIVGVAIA 89
           + R  +LD FRGL VALM+ V++   G  P + H P+ G   LAD V P++L  +G AI 
Sbjct: 4   SARSLALDAFRGLTVALMLFVNNLPPGAPPYLEHGPFGGSVYLADLVFPWYLLAMGAAIP 63

Query: 90  LALKRIPDRADAVKK 104
            +      RA A+++
Sbjct: 64  FS------RASALRR 72


>gi|403174292|ref|XP_003333277.2| hypothetical protein PGTG_14197 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375170913|gb|EFP88858.2| hypothetical protein PGTG_14197 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 386

 Score = 46.2 bits (108), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 79/360 (21%), Positives = 141/360 (39%), Gaps = 94/360 (26%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE-ISHAPW--NGCNLADFVMPFFLFI 83
           +S +  +R  S+D+ RGL    M+LV+ AG   P  +SH          AD + P F+F 
Sbjct: 19  ESDILAKRDRSIDVLRGLTCLAMVLVNTAGPVRPSWLSHPTSIHQSITFADTLFPCFVFT 78

Query: 84  VGVAIALALKRIPD-RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG-VDVRMIRL 141
            G+A A + K   + R  ++K+ + R +KL   GI               G ++++  R+
Sbjct: 79  SGLASAQSKKNEQNGRNPSLKRTLIRAIKLNLIGIAYNNLIPRLAGLHGDGLLNLKTYRI 138

Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
             VL  I +S L+ +L                          +W++   +    L L   
Sbjct: 139 PSVLGTIGISSLVCTLES------------------------YWIIKPPIFPAILGL--- 171

Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP--P--CNAVGYIDRKVLGINHMYHHP 257
                W                       ++L P  P   ++  ++DR++ G +H+Y   
Sbjct: 172 ----SWYMI--------------------SRLRPFDPLHLSSQAWLDRRIFGPSHLY--- 204

Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI-LSTIIGVHFGHVIIHTK 316
                       P  G             F+PEG+L+++ ++ +S+++G +    I    
Sbjct: 205 -----------DPVPG-------------FDPEGILAALLTVPISSLLGSYLARPI---- 236

Query: 317 GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
            +LA ++    +G    I G  L      P +K L+T +YV  T+  + L +S   ALV 
Sbjct: 237 QNLATIETPQLIGLG--ILGSLLTGLLPSPASKLLWTTTYVGQTTATSLLYWSFSRALVS 294


>gi|418750786|ref|ZP_13307072.1| putative membrane protein [Leptospira licerasiae str. MMD4847]
 gi|404273389|gb|EJZ40709.1| putative membrane protein [Leptospira licerasiae str. MMD4847]
          Length = 322

 Score = 46.2 bits (108), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 72/312 (23%), Positives = 119/312 (38%), Gaps = 87/312 (27%)

Query: 76  VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYG 133
           + PFFLF VG +I  +   + +      K++ R++ L+F G+ L   G +S +       
Sbjct: 1   MFPFFLFAVGASIPFS---VSNGIQEFPKILKRSVILIFLGLFLNFFGEWSFSN------ 51

Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
                +R  GVLQRI  +Y   ++        ++K+          L   +W +   +  
Sbjct: 52  -----LRFPGVLQRIGFAYFFSAIA------YREKNLKFRIILFLTLLISYWYLQEFIPP 100

Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
                  G   P        K+  D+G                     ++DR+V G  H+
Sbjct: 101 P------GAAEPS------MKEGKDWG--------------------AWLDREVFGQAHL 128

Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI- 312
                W+  K                       ++PEGLL+S +SI S   G+  G  + 
Sbjct: 129 -----WKFGKV----------------------WDPEGLLTSFTSIASVFCGIFAGEFLK 161

Query: 313 IHTKGHLARLK---QWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFS 369
           +H +   + L    + V   FA+L+ G    +    P+NK L+T +Y   T+G A L  S
Sbjct: 162 VHLEKKESPLSISGKIVLGAFAVLVVGGV--WGIYYPINKSLWTGTYSLWTAGWALLAVS 219

Query: 370 AIYALVDCNIFG 381
               L   N FG
Sbjct: 220 LFLILEKYNRFG 231


>gi|149178821|ref|ZP_01857402.1| phosphoribosylaminoimidazole synthetase [Planctomyces maris DSM
           8797]
 gi|148842362|gb|EDL56744.1| phosphoribosylaminoimidazole synthetase [Planctomyces maris DSM
           8797]
          Length = 405

 Score = 45.8 bits (107), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 44/87 (50%), Gaps = 5/87 (5%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
            +R+ SLD FRG  VA M LV++ G     P +       C+ AD +MP FLF VG A  
Sbjct: 14  NKRIVSLDQFRGYTVAGMFLVNYMGFFVVCPVVLKHHNTYCSYADTIMPHFLFAVGFAFR 73

Query: 90  LALKRIPDRADAVK---KVIFRTLKLL 113
           L   R    A AV    +V+ R L L+
Sbjct: 74  LTFGRRVQTAGAVSAYARVVRRLLGLV 100


>gi|317477968|ref|ZP_07937151.1| hypothetical protein HMPREF1007_00267 [Bacteroides sp. 4_1_36]
 gi|316905882|gb|EFV27653.1| hypothetical protein HMPREF1007_00267 [Bacteroides sp. 4_1_36]
          Length = 394

 Score = 45.8 bits (107), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 59/115 (51%), Gaps = 23/115 (20%)

Query: 30  LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFF 80
           L  QR+A++D+FR L + LM+ V+    D P + + P W        +    +D + P F
Sbjct: 4   LTLQRIAAVDVFRALTMFLMLFVN----DIPGLKNVPHWLMHARMDEDMMGFSDTIFPAF 59

Query: 81  LFIVGVAIALALKRIPDRAD----AVKKVIFRTLKLLFWGIL------LQGGFSH 125
           LF +G++++ A++    + D     V  + +RT+ L+  G+       ++GG SH
Sbjct: 60  LFCMGMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114


>gi|149277363|ref|ZP_01883505.1| hypothetical protein PBAL39_10746 [Pedobacter sp. BAL39]
 gi|149232240|gb|EDM37617.1| hypothetical protein PBAL39_10746 [Pedobacter sp. BAL39]
          Length = 396

 Score = 45.4 bits (106), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 54/110 (49%), Gaps = 16/110 (14%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG--DWPE-ISHAPW--NGCNLADFVMPFFLFIVGVA 87
           QRL S+D  R L + LMI V+      D P  + HAP   N   LAD V P FL IVG++
Sbjct: 9   QRLVSIDALRALVMLLMIFVNDLWSLIDIPGWLEHAPGDANYMGLADVVFPAFLVIVGLS 68

Query: 88  IALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELTYG 133
           +  A+     + D  +     +++RT+ LL        GF H   E TYG
Sbjct: 69  VPYAIDSRRRKGDGNRAIFLHIVYRTIALLV------MGFFHVNME-TYG 111


>gi|423213223|ref|ZP_17199752.1| hypothetical protein HMPREF1074_01284 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693683|gb|EIY86913.1| hypothetical protein HMPREF1074_01284 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 469

 Score = 45.4 bits (106), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 16/105 (15%)

Query: 32  TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAPWN--------GCNLADFVMPF 79
             R  +LD  RG A+  M+L    V H    W   +  P +        G    D V PF
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPDHVFNPLLPGITWVDLVFPF 61

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FLF +G A   ++K+  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 62  FLFAMGAAFPFSIKKRAEKGDSKLKLVYEAGKRGIQLTFFAIFIQ 106


>gi|149276664|ref|ZP_01882807.1| hypothetical protein PBAL39_14829 [Pedobacter sp. BAL39]
 gi|149232333|gb|EDM37709.1| hypothetical protein PBAL39_14829 [Pedobacter sp. BAL39]
          Length = 359

 Score = 45.4 bits (106), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 63/140 (45%), Gaps = 28/140 (20%)

Query: 37  SLDIFRGLAVALM--------ILVDHAGGDWP------EISHAPWNGCNLADFVMPFFLF 82
           SLD+ RGL + L+        + + H    WP      +  H PW+G    D V P F+F
Sbjct: 2   SLDVMRGLIMILLCAESCLLYVSLQHLNPAWPASGLVEQFFHHPWHGLRFWDLVQPAFMF 61

Query: 83  IVGVAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
           + G A+ ++  R  ++  +  +    ++ R+LKL   G+ L   ++  P          +
Sbjct: 62  MAGAAMYISYSRKLEKGSSWSQNWNHILIRSLKLFLCGVGLHCVYAGKP----------V 111

Query: 139 IRLCGVLQRIALSYLLVSLV 158
             L  VL ++A + +L  L+
Sbjct: 112 WELWNVLTQLAFTSILAYLI 131


>gi|392965134|ref|ZP_10330554.1| hypothetical protein BN8_01612 [Fibrisoma limi BUZ 3]
 gi|387846517|emb|CCH52600.1| hypothetical protein BN8_01612 [Fibrisoma limi BUZ 3]
          Length = 531

 Score = 45.4 bits (106), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 47/97 (48%), Gaps = 14/97 (14%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAVALM---ILVDHAGGD-------WPEI----SHAP 66
           V+  ++   L   RL S+D +RG  + LM   IL  H   +       W  +    SH  
Sbjct: 133 VTAPRDSGGLAGDRLMSMDAYRGFVMLLMAAEILQFHRLHEAFPNSVLWGLLAYHQSHVE 192

Query: 67  WNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVK 103
           W GC+L D + P F F+VGVA+  +L    +R  +V+
Sbjct: 193 WAGCSLHDLIQPSFSFLVGVALPYSLAARLNRGQSVR 229


>gi|387789753|ref|YP_006254818.1| hypothetical protein Solca_0510 [Solitalea canadensis DSM 3403]
 gi|379652586|gb|AFD05642.1| hypothetical protein Solca_0510 [Solitalea canadensis DSM 3403]
          Length = 389

 Score = 45.1 bits (105), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 50/107 (46%), Gaps = 9/107 (8%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            RL S+D+ R L + LMI V+          W E      +G  L+D + P FLFIVG++
Sbjct: 6   NRLGSIDVIRALTMFLMIFVNDLWSLVNVPKWLEHVDVQTDGMGLSDVIFPAFLFIVGLS 65

Query: 88  IALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
           I  +++    + D+    +K +  R+  LL  G       S+ P  L
Sbjct: 66  IPFSVENRIKKGDSTIQLLKHIFIRSFALLVIGFFHVNLESYNPGAL 112


>gi|406831132|ref|ZP_11090726.1| hypothetical protein SpalD1_05831 [Schlesneria paludicola DSM
           18645]
          Length = 508

 Score = 45.1 bits (105), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 10/108 (9%)

Query: 34  RLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD FRG  +  M+LV+  GG     P I     + C+ AD +MP FLF  G A+ L
Sbjct: 14  RLTSLDQFRGYTMLGMLLVNFIGGYKAVSPRILLHTHDYCSYADTIMPHFLFAAGFALRL 73

Query: 91  ALKRIPDRAD------AVKKVIFRTLKLLFW-GILLQGGFSHAPDELT 131
           +L R  +         A+++++   L  + W G    GG  H  + +T
Sbjct: 74  SLGRRMEAGGKMPWGRAIRRILGLALVAIIWYGYCDWGGVVHKFNTMT 121


>gi|407789242|ref|ZP_11136344.1| hypothetical protein B3C1_03120 [Gallaecimonas xiamenensis 3-C-1]
 gi|407207220|gb|EKE77163.1| hypothetical protein B3C1_03120 [Gallaecimonas xiamenensis 3-C-1]
          Length = 364

 Score = 45.1 bits (105), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 52/91 (57%), Gaps = 2/91 (2%)

Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAI 345
           P +PEGLLS++ ++++ ++GV  G ++   +   ++          L +  L   ++   
Sbjct: 198 PADPEGLLSTLPAVVNALLGVVAGGLLRSPRQPWSKAVL--LAALGLGLLALGYLWSLVF 255

Query: 346 PLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
           PLNK L+T S+V VTSG +AL+ +  Y L+D
Sbjct: 256 PLNKTLWTSSFVLVTSGWSALLLALFYVLID 286



 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 14/49 (28%), Positives = 28/49 (57%)

Query: 61  EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRT 109
           +++H+ W+G    D + P F+F+ GV + LA KR        ++ ++R+
Sbjct: 48  QMAHSDWHGLTAYDGIFPLFIFLSGVTLGLADKRASALGGGARRALYRS 96


>gi|423287389|ref|ZP_17266240.1| hypothetical protein HMPREF1069_01283 [Bacteroides ovatus
           CL02T12C04]
 gi|392672504|gb|EIY65971.1| hypothetical protein HMPREF1069_01283 [Bacteroides ovatus
           CL02T12C04]
          Length = 470

 Score = 44.7 bits (104), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 50/106 (47%), Gaps = 17/106 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP-----WN----GCNLADFVMP 78
             R  +LD  RG A+  M+L    V H    W   +  P     +N    G    D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FFLF +G A   ++K+  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 62  FFLFAMGAAFPFSIKKRAEKGDSKLKLVYEAGKRGIQLTFFAIFIQ 107


>gi|329849634|ref|ZP_08264480.1| hypothetical protein ABI_25290 [Asticcacaulis biprosthecum C19]
 gi|328841545|gb|EGF91115.1| hypothetical protein ABI_25290 [Asticcacaulis biprosthecum C19]
          Length = 410

 Score = 44.7 bits (104), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 26  EKSHLKT-------QRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLA 73
           EK   KT        R+ ++D+ R L + LMI V+          W E   +  +G  L+
Sbjct: 6   EKGRAKTMPNKNQFSRVGAIDLVRALTMVLMIFVNDLWSLKGVPVWLEHVASGVDGMGLS 65

Query: 74  DFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGG 122
           D V P FLFIVG+++  A+     R D++   +     R++ LL  G+ L  G
Sbjct: 66  DVVFPAFLFIVGLSLPFAVSSRQARGDSLGSTVLHILGRSVALLVMGVFLVNG 118


>gi|338209612|ref|YP_004653659.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336303425|gb|AEI46527.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 398

 Score = 44.7 bits (104), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 50/98 (51%), Gaps = 9/98 (9%)

Query: 29  HLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFI 83
            +  +R+ S+D FR L + LMI V+          W E + A  +    +D + P FLFI
Sbjct: 2   QITLKRVPSIDAFRALTMLLMIFVNDFWSLSGIPYWLEHAKAEEDFLGFSDIIFPCFLFI 61

Query: 84  VGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGI 117
           +G+AI  A++    + D     V+ +I R++ L+  GI
Sbjct: 62  LGMAIPFAVQNRIAKGDTRWQIVRHIILRSVALIVMGI 99


>gi|406831133|ref|ZP_11090727.1| hypothetical protein SpalD1_05836 [Schlesneria paludicola DSM
          18645]
          Length = 415

 Score = 44.3 bits (103), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 39/69 (56%), Gaps = 4/69 (5%)

Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
           + RL SLD FRG  +  MILV++ G      P +     + C+ AD +MP F F VG A
Sbjct: 11 SSPRLTSLDQFRGYTMVGMILVNYLGAYKEVTPRLFRHTNDYCSYADTIMPHFFFAVGFA 70

Query: 88 IALAL-KRI 95
          + L+L KRI
Sbjct: 71 MRLSLGKRI 79


>gi|313148038|ref|ZP_07810231.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|313136805|gb|EFR54165.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
          Length = 385

 Score = 44.3 bits (103), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 55/99 (55%), Gaps = 17/99 (17%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-WNG--------CNLADFVMPFFLFI 83
           QR+A++D+FR L + LM+ V+    D P + + P W G           +D + P FLF 
Sbjct: 7   QRVAAVDVFRALTMFLMLFVN----DIPGLRNIPHWLGHAAMTEDMLGFSDTIFPAFLFC 62

Query: 84  VGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL 118
           +G++I+ A++    + D++ +VI    +RT+ L+  G+ 
Sbjct: 63  MGMSISFAVQNRYQKGDSLLQVIMHIFWRTVALVVMGLF 101


>gi|424665794|ref|ZP_18102830.1| hypothetical protein HMPREF1205_01669 [Bacteroides fragilis HMW
           616]
 gi|404574047|gb|EKA78798.1| hypothetical protein HMPREF1205_01669 [Bacteroides fragilis HMW
           616]
          Length = 385

 Score = 44.3 bits (103), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 55/99 (55%), Gaps = 17/99 (17%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-WNG--------CNLADFVMPFFLFI 83
           QR+A++D+FR L + LM+ V+    D P + + P W G           +D + P FLF 
Sbjct: 7   QRVAAVDVFRALTMFLMLFVN----DIPGLRNIPHWLGHAAMTEDMLGFSDTIFPAFLFC 62

Query: 84  VGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL 118
           +G++I+ A++    + D++ +VI    +RT+ L+  G+ 
Sbjct: 63  MGMSISFAVQNRYQKGDSLLQVIMHIFWRTVALVVMGLF 101


>gi|299149192|ref|ZP_07042253.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
 gi|298512859|gb|EFI36747.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
          Length = 470

 Score = 44.3 bits (103), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 20/123 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP-----WN----GCNLADFVMP 78
             R  +LD  RG A+  M+L    V H    W   +  P     +N    G    D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGV 134
           FFLF +G A   ++++  ++ D+  K+++    R ++L F+ I +Q  +   P  L+   
Sbjct: 62  FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQHFY---PYMLSSPQ 118

Query: 135 DVR 137
           D+R
Sbjct: 119 DIR 121


>gi|336405631|ref|ZP_08586307.1| hypothetical protein HMPREF0127_03620 [Bacteroides sp. 1_1_30]
 gi|335937114|gb|EGM99020.1| hypothetical protein HMPREF0127_03620 [Bacteroides sp. 1_1_30]
          Length = 470

 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 50/106 (47%), Gaps = 17/106 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP-----WN----GCNLADFVMP 78
             R  +LD  RG A+  M+L    V H    W   +  P     +N    G    D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FFLF +G A   ++++  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 62  FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQ 107


>gi|293369241|ref|ZP_06615831.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|336417197|ref|ZP_08597524.1| hypothetical protein HMPREF1017_04632 [Bacteroides ovatus
           3_8_47FAA]
 gi|423297813|ref|ZP_17275873.1| hypothetical protein HMPREF1070_04538 [Bacteroides ovatus
           CL03T12C18]
 gi|292635666|gb|EFF54168.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|335936517|gb|EGM98443.1| hypothetical protein HMPREF1017_04632 [Bacteroides ovatus
           3_8_47FAA]
 gi|392664450|gb|EIY57988.1| hypothetical protein HMPREF1070_04538 [Bacteroides ovatus
           CL03T12C18]
          Length = 470

 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 20/123 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP-----WN----GCNLADFVMP 78
             R  +LD  RG A+  M+L    V H    W   +  P     +N    G    D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGV 134
           FFLF +G A   ++++  ++ D+  K+++    R ++L F+ I +Q  +   P  L+   
Sbjct: 62  FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQHFY---PYMLSSPQ 118

Query: 135 DVR 137
           D+R
Sbjct: 119 DIR 121


>gi|160883830|ref|ZP_02064833.1| hypothetical protein BACOVA_01803 [Bacteroides ovatus ATCC 8483]
 gi|156110915|gb|EDO12660.1| hypothetical protein BACOVA_01803 [Bacteroides ovatus ATCC 8483]
          Length = 470

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 50/106 (47%), Gaps = 17/106 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP-----WN----GCNLADFVMP 78
             R  +LD  RG A+  M+L    V H    W   +  P     +N    G    D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FFLF +G A   ++++  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 62  FFLFAMGTAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQ 107


>gi|295087641|emb|CBK69164.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
          Length = 470

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 50/106 (47%), Gaps = 17/106 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP-----WN----GCNLADFVMP 78
             R  +LD  RG A+  M+L    V H    W   +  P     +N    G    D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FFLF +G A   ++++  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 62  FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQ 107


>gi|406835226|ref|ZP_11094820.1| hypothetical protein SpalD1_26403 [Schlesneria paludicola DSM
           18645]
          Length = 508

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 54/119 (45%), Gaps = 22/119 (18%)

Query: 10  HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW-- 67
           H  PL    PD             R+ S+D FRG AVA MI V+  GG    + H+ +  
Sbjct: 3   HDAPLTTDSPD-------------RVISMDQFRGYAVAAMIFVNFVGGF--GVVHSVFKH 47

Query: 68  --NGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVK---KVIFRTLKLLFWGILLQG 121
             N  + AD +M  F+F+VG +  L + R   R   +      + R+L L+F   LL G
Sbjct: 48  NDNYLSYADTIMANFMFMVGFSFRLTMLRRLKRMSWLATCWSYVRRSLLLVFVSTLLYG 106


>gi|430744193|ref|YP_007203322.1| hypothetical protein Sinac_3363 [Singulisphaera acidiphila DSM
           18658]
 gi|430015913|gb|AGA27627.1| hypothetical protein Sinac_3363 [Singulisphaera acidiphila DSM
           18658]
          Length = 368

 Score = 43.9 bits (102), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 51/108 (47%), Gaps = 8/108 (7%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
           S P ++     S     R+ SLD FRG  V  M+ V+  G     P +     + C+ AD
Sbjct: 8   SAPTLTSTSAPSG---SRIVSLDQFRGYTVVGMLFVNFLGNFDALPAVFKHHNSYCSYAD 64

Query: 75  FVMPFFLFIVGVAIALA-LKRIPDR--ADAVKKVIFRTLKLLFWGILL 119
            +MP F F VG A  L  L+R+       AV  V+ R+L L+  G ++
Sbjct: 65  TIMPQFFFAVGFAYRLTFLRRLETSGIGGAVAAVLRRSLGLILLGFVI 112


>gi|373850799|ref|ZP_09593600.1| Protein of unknown function DUF2261, transmembrane [Opitutaceae
           bacterium TAV5]
 gi|372476964|gb|EHP36973.1| Protein of unknown function DUF2261, transmembrane [Opitutaceae
           bacterium TAV5]
          Length = 401

 Score = 43.5 bits (101), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 9/101 (8%)

Query: 31  KTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
              R+AS+DI R L + LMI+V+          W   S +  +G  +AD V P FLF+VG
Sbjct: 9   NAGRVASIDILRALTMVLMIIVNDLFTLKNTPAWLGHSASGVDGIGVADVVFPAFLFLVG 68

Query: 86  VAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGG 122
           +++  AL+   ++ D   ++++    R+  L+  G+ L  G
Sbjct: 69  LSLPHALEARRNKGDTGLRLVWHVAVRSFALIVMGVFLVNG 109


>gi|423281270|ref|ZP_17260181.1| hypothetical protein HMPREF1203_04398 [Bacteroides fragilis HMW
           610]
 gi|404583178|gb|EKA87860.1| hypothetical protein HMPREF1203_04398 [Bacteroides fragilis HMW
           610]
          Length = 385

 Score = 43.5 bits (101), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 54/99 (54%), Gaps = 17/99 (17%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-WNG--------CNLADFVMPFFLFI 83
           QR+A++D+FR L + LM+ V+    D P + + P W G           +D + P FLF 
Sbjct: 7   QRVAAVDVFRALTMFLMLFVN----DIPGLRNIPHWLGHAAMTEDMLGFSDTIFPAFLFC 62

Query: 84  VGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL 118
           +G++I+ A++    + D+  +VI    +RT+ L+  G+ 
Sbjct: 63  MGMSISFAVQNRYQKGDSPLQVIMHIFWRTVALIVMGLF 101


>gi|336417194|ref|ZP_08597521.1| hypothetical protein HMPREF1017_04629 [Bacteroides ovatus
           3_8_47FAA]
 gi|423297816|ref|ZP_17275876.1| hypothetical protein HMPREF1070_04541 [Bacteroides ovatus
           CL03T12C18]
 gi|335936514|gb|EGM98440.1| hypothetical protein HMPREF1017_04629 [Bacteroides ovatus
           3_8_47FAA]
 gi|392664453|gb|EIY57991.1| hypothetical protein HMPREF1070_04541 [Bacteroides ovatus
           CL03T12C18]
          Length = 466

 Score = 43.5 bits (101), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 53/111 (47%), Gaps = 17/111 (15%)

Query: 32  TQRLASLDIFRGLAVALMIL------------VDHAGGDWPEISH-APWNGCNLADFVMP 78
           T+R  +LD  RG A+  M+L            + HA    P+ +  A  +G    D V P
Sbjct: 2   TKRAYALDALRGYAIITMVLSATVAWNSLPGWMYHAQTPPPDRAFDASLSGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSH 125
           FFLF +G A   ++K+  ++ D   ++++  +K    L F+ I +Q  + H
Sbjct: 62  FFLFAMGAAFPFSIKKRFEKGDTKLRLVYEAIKRGAQLTFFAIFIQHFYPH 112


>gi|196234160|ref|ZP_03132993.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
 gi|196221811|gb|EDY16348.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
          Length = 417

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 80/373 (21%), Positives = 131/373 (35%), Gaps = 84/373 (22%)

Query: 34  RLASLDIFRGLAVALMIL-------VDHAGGD---W----PEISHAPWNGCNLADFVMPF 79
           RL S+D +RGL + L++        V  A  D   W     +  H  W G  L D + P 
Sbjct: 32  RLGSIDAYRGLVMFLLLAEQFRTASVAKALPDSSFWRFLATQQEHVTWTGAVLHDMIQPS 91

Query: 80  FLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
           F F+VGVA+  ++     R  + +        R L L+  GI L+    H+    T+   
Sbjct: 92  FSFLVGVALPFSIGNRRARGQSPEATTGHAFLRALILVLLGIFLRST-GHSQTNFTFE-- 148

Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
                    L +I L Y  + L+ + +  VQ                  W+    +LV Y
Sbjct: 149 -------DTLTQIGLGYGFLYLIALRSVRVQ------------------WIALVVILVGY 183

Query: 196 LALLYGTYVP----DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
                   +P    DW    +  D A          G  A  N   N     DR  L + 
Sbjct: 184 WLAFARWPLPGEDFDWVHAGVTPDFAGNAS------GFAAHWNKNTNLAWAFDRWFLNL- 236

Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
                             P E P   +   +    F P        ++ + I+G+  G V
Sbjct: 237 -----------------FPREKPFLFNQGGYATLSFIP--------TLGTMILGLLAGEV 271

Query: 312 IIHTKGHLARLKQWVTMGFALLI-FGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSA 370
           I   +    ++  W+ M   + +  G  L  T   P+ K+++T S+V  + G      + 
Sbjct: 272 IRSPRRDSEKMG-WLLMAAVVCVGLGWLLGATGVCPVVKRVWTPSWVLFSGGWCFFATAL 330

Query: 371 IYALVDCNIFGPW 383
            Y ++D + +  W
Sbjct: 331 FYRIIDLSGYRRW 343


>gi|406831131|ref|ZP_11090725.1| hypothetical protein SpalD1_05826 [Schlesneria paludicola DSM
          18645]
          Length = 520

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 38/65 (58%), Gaps = 2/65 (3%)

Query: 32 TQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
          + RL SLD FRG  +  M+LV++ G     P+I     + C+ AD +MP FLF  G A+ 
Sbjct: 13 SARLTSLDQFRGYTMLGMLLVNYLGSYHVCPQILKHSHDYCSYADTIMPQFLFAAGFAMR 72

Query: 90 LALKR 94
          L+L +
Sbjct: 73 LSLGK 77


>gi|436836802|ref|YP_007322018.1| hypothetical protein FAES_3417 [Fibrella aestuarina BUZ 2]
 gi|384068215|emb|CCH01425.1| hypothetical protein FAES_3417 [Fibrella aestuarina BUZ 2]
          Length = 401

 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 47/99 (47%), Gaps = 9/99 (9%)

Query: 33  QRLASLDIFRGLAVALMILVDH-----AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
            R+ S+D+ R L + LMI V+      A   W E      +G  LAD V P FLFIVG++
Sbjct: 16  TRVDSIDVLRALTMVLMIFVNDLWSLTAIPGWLEHVPEGADGIGLADVVFPAFLFIVGLS 75

Query: 88  IALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGG 122
           I  A++    R    A     V+ R   LL  G+ L  G
Sbjct: 76  IPFAIQHRRTRHETDAQIAGHVLTRAAALLVMGLWLVNG 114


>gi|436833713|ref|YP_007318929.1| Protein of unknown function DUF2261,transmembrane [Fibrella
           aestuarina BUZ 2]
 gi|384065126|emb|CCG98336.1| Protein of unknown function DUF2261,transmembrane [Fibrella
           aestuarina BUZ 2]
          Length = 415

 Score = 42.7 bits (99), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 48/93 (51%), Gaps = 16/93 (17%)

Query: 27  KSHLKTQRLASLDIFRGLAVALM----ILVDH------AGGDWPEI----SHAPWNGCNL 72
           ++     RL S+D +RG  + LM    +  DH        G W  +    SH  W+GC+L
Sbjct: 23  QTEFPAGRLLSVDAYRGFVMLLMMGEVLHFDHLHEAFPGSGFWALLAYHQSHVDWSGCSL 82

Query: 73  ADFVMPFFLFIVGVAI--ALALKRIPDRADAVK 103
            D + P F F+VGVA+  ++A +++  ++  V+
Sbjct: 83  HDLIQPSFSFLVGVALPYSIASRQLKGQSAGVQ 115


>gi|419719054|ref|ZP_14246346.1| acyltransferase [Lachnoanaerobaculum saburreum F0468]
 gi|383304805|gb|EIC96198.1| acyltransferase [Lachnoanaerobaculum saburreum F0468]
          Length = 555

 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 9/98 (9%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP--EISHAPWNGCNLAD----FVMPF 79
           EK  +   R+  LD+ + L+ +++IL+  +   +   E+    W G  + +    F +P 
Sbjct: 203 EKKRVIKDRIIGLDVLKILSASMIILIHSSANLYNNHEVGTLAWKGGLILNVIPRFAVPA 262

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGI 117
           FL I G   AL L R  D+  A+KK ++  + L  W I
Sbjct: 263 FLMISG---ALLLGRKTDQRKAIKKAVYAGIALAIWSI 297


>gi|315650733|ref|ZP_07903787.1| conserved hypothetical protein [Lachnoanaerobaculum saburreum DSM
           3986]
 gi|315487007|gb|EFU77335.1| conserved hypothetical protein [Lachnoanaerobaculum saburreum DSM
           3986]
          Length = 555

 Score = 42.7 bits (99), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 9/98 (9%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP--EISHAPWNGCNLAD----FVMPF 79
           EK  +   R+  LD+ + L+ +++IL+  +   +   E+    W G  + +    F +P 
Sbjct: 203 EKKRVIKDRIIGLDVLKILSASMIILIHSSANLYNNHEVGTLAWKGGLILNVIPRFAVPA 262

Query: 80  FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGI 117
           FL I G   AL L R  D+  A+KK ++  + L  W I
Sbjct: 263 FLMISG---ALLLGRKTDQQKAIKKAVYAGVALAIWSI 297


>gi|428319838|ref|YP_007117720.1| hypothetical protein Osc7112_5038 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428243518|gb|AFZ09304.1| hypothetical protein Osc7112_5038 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 482

 Score = 42.7 bits (99), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 44/93 (47%), Gaps = 13/93 (13%)

Query: 32  TQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFVMP 78
           +QR  +LD  RG+AV  M+L            + HA    P+ I +    G    D V P
Sbjct: 14  SQRADALDALRGIAVLAMVLSGTIARKTLPAWMYHAQLPPPDHIFNNKLPGLTWVDLVFP 73

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
           FFLF +G AI LAL R   +    KKVI   L+
Sbjct: 74  FFLFAMGAAIPLALSRRIAKGWDTKKVILSILQ 106


>gi|320104555|ref|YP_004180146.1| hypothetical protein Isop_3032 [Isosphaera pallida ATCC 43644]
 gi|319751837|gb|ADV63597.1| hypothetical protein Isop_3032 [Isosphaera pallida ATCC 43644]
          Length = 399

 Score = 42.7 bits (99), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 57/202 (28%), Positives = 84/202 (41%), Gaps = 19/202 (9%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
            R  +LD FRG  VA MI+V+  GG    P I       C+ AD +MP F   VG A   
Sbjct: 15  SRWDALDQFRGYTVAGMIVVNFVGGLAAVPAILKHHNTYCSYADTIMPQFFLAVGFAYRW 74

Query: 91  ALKRIPDRAD---AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
                 +R     AV+  + R L LL  G L+ G    A        D++ + + GVL +
Sbjct: 75  TFLNRLERGGWQAAVRHALGRNLGLLLVGFLMYGLDGKAESW----SDLKALGIRGVLIQ 130

Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
                   +LV I    +      + R    RL    WL+ +C L V L+    ++  +W
Sbjct: 131 AFQRDFFQTLVHIAIASLWVLP-VINRSVWIRL---TWLLGSCTLHVVLS---SSFYYEW 183

Query: 208 QFTIINKDSADYGKVFNVTCGV 229
              + N+   D G +  +T  V
Sbjct: 184 ---VTNRPGIDGGPLGFLTWTV 202


>gi|284035350|ref|YP_003385280.1| hypothetical protein Slin_0417 [Spirosoma linguale DSM 74]
 gi|283814643|gb|ADB36481.1| Protein of unknown function DUF2261, transmembrane [Spirosoma
          linguale DSM 74]
          Length = 389

 Score = 42.4 bits (98), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 37/73 (50%), Gaps = 14/73 (19%)

Query: 34 RLASLDIFRGLAVALM----ILVDHAGGDWPEI----------SHAPWNGCNLADFVMPF 79
          RL S+D +RG  + LM    +  DH    +P+           SH  W GC+L D + P 
Sbjct: 4  RLMSMDAYRGFVMVLMAAEMLQFDHLHETFPDSAFWAFLAHHQSHVAWAGCSLHDLIQPS 63

Query: 80 FLFIVGVAIALAL 92
          F F+VGVA+  ++
Sbjct: 64 FSFLVGVALLFSM 76


>gi|404404857|ref|ZP_10996441.1| hypothetical protein AJC13_05463 [Alistipes sp. JC136]
          Length = 392

 Score = 42.4 bits (98), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 59/115 (51%), Gaps = 12/115 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDH--AGGDWPE-ISHAPW--NGCNLADFVMPFFLFIVG 85
           +  R+AS+D+FRGL +  M+ V+   +  D P  + HA    +    +D + P FLFI+G
Sbjct: 4   QRNRIASIDVFRGLTMFFMLWVNSFWSLSDVPHWLQHAARGEDMLGFSDTIFPAFLFIMG 63

Query: 86  VAIALALKRIPDRADAVKKVIF----RTLKLLFWGIL---LQGGFSHAPDELTYG 133
            ++ LA+     + D+  K+++    RT  L+  G+L       FS A   L+ G
Sbjct: 64  ASVPLAVGSRRAKGDSTVKIVWHVFTRTFALVVMGLLTVNFGDAFSAAGTGLSRG 118


>gi|237717694|ref|ZP_04548175.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|299149194|ref|ZP_07042255.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
 gi|229453013|gb|EEO58804.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|298512861|gb|EFI36749.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
          Length = 466

 Score = 42.4 bits (98), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 20/123 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL------------VDHAGGDWPEISH-APWNGCNLADFVMP 78
            +R  +LD  RG A+  M+L            + HA    P+ +  A  +G    D V P
Sbjct: 2   NKRAYALDALRGYAIITMVLSATVAWNSLPGWMYHAQTPPPDRAFDASLSGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGV 134
           FFLF +G A   ++K+  ++ D   ++++  +K    L F+ I +Q  + H    L+   
Sbjct: 62  FFLFAMGAAFPFSIKKRFEKGDTKLRLVYEAIKRGVQLTFFAIFIQHFYPHV---LSNPQ 118

Query: 135 DVR 137
           DVR
Sbjct: 119 DVR 121


>gi|312131791|ref|YP_003999131.1| hypothetical protein Lbys_3117 [Leadbetterella byssophila DSM
           17132]
 gi|311908337|gb|ADQ18778.1| hypothetical protein Lbys_3117 [Leadbetterella byssophila DSM
           17132]
          Length = 361

 Score = 42.0 bits (97), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 47/98 (47%), Gaps = 20/98 (20%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFFL 81
           K  RL S+DIFR L +  MI V+    D   + + P W        +G   +D + P FL
Sbjct: 7   KKNRLLSIDIFRALTMFFMIFVN----DLFTVKNVPKWMLHTEMHEDGMGFSDVIFPIFL 62

Query: 82  FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILL 119
            IVG++I  A      +AD  K +  RT  LL  G+ L
Sbjct: 63  LIVGMSIPFA------KADW-KGIGMRTFALLVMGVFL 93


>gi|186472139|ref|YP_001859481.1| hypothetical protein Bphy_3279 [Burkholderia phymatum STM815]
 gi|184194471|gb|ACC72435.1| conserved hypothetical protein [Burkholderia phymatum STM815]
          Length = 418

 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 67/146 (45%), Gaps = 22/146 (15%)

Query: 23  DQQEKSHLKTQ----RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
            +Q+K+  + Q    RL  LD FRGL V L I+VDH GG    +S A  +   L D    
Sbjct: 43  KKQKKAEKRMQIPKNRLIELDFFRGL-VLLFIVVDHIGGS--ILSRATLHAYALCD-AAE 98

Query: 79  FFLFIVGVAIALALKRIPDR---ADAVKKVIFRTLKL----LFWGILL------QGGFS- 124
            F+F+ G A A A   +  R   ADA  +   R+L+L    L   +L+         FS 
Sbjct: 99  VFVFLGGFATATAYASLAKRHTEADARNRFFKRSLELYRAFLVTAVLMLVVSAVMSAFSI 158

Query: 125 HAPDELTYGVDVRMIRLCGVLQRIAL 150
            AP+  T  +D  M     VL+ I L
Sbjct: 159 DAPNMATTDLDDMMDTPTAVLRDILL 184


>gi|326798253|ref|YP_004316072.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326549017|gb|ADZ77402.1| hypothetical protein Sph21_0826 [Sphingobacterium sp. 21]
          Length = 368

 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 23/120 (19%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMI---------LVDHAGGDWPE-----ISHAPWNGC 70
            E+  ++T R+ SLD+ RGL + L+          L       WP+       H  W+G 
Sbjct: 1   MEEKKVQT-RILSLDVMRGLIMILLAAESCELYTALSSTYSSGWPQGIIHHFFHHEWHGL 59

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADAV------KKVIFRTLKLLFWGILLQGGFS 124
              D V P F+FI G ++ L+ +R   +A  V      K V +R+ KL   G+ L   +S
Sbjct: 60  YFWDLVQPAFMFIAGTSLYLSFQR--KQAAGVSWSSHFKSVAWRSAKLFLCGVALHCVYS 117


>gi|300867270|ref|ZP_07111930.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300334747|emb|CBN57096.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 486

 Score = 42.0 bits (97), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 46/104 (44%), Gaps = 13/104 (12%)

Query: 21  VSDQQEKSHLKTQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPW 67
           V+ Q   +    +R  +LD  RG+AV  M+L            + HA    P  + +   
Sbjct: 7   VNPQDMSTPAVNKRADALDALRGIAVLAMVLSGTIARKTLPAWMYHAQEPPPSHLFNPKL 66

Query: 68  NGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
            G    D V PFFLF +G AI LAL R   +    KKVI   L+
Sbjct: 67  AGLTWVDLVFPFFLFAMGAAIPLALSRRIAKGWDTKKVILSILQ 110


>gi|210622217|ref|ZP_03293007.1| hypothetical protein CLOHIR_00953 [Clostridium hiranonis DSM 13275]
 gi|210154351|gb|EEA85357.1| hypothetical protein CLOHIR_00953 [Clostridium hiranonis DSM 13275]
          Length = 483

 Score = 41.6 bits (96), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 40/184 (21%), Positives = 77/184 (41%), Gaps = 20/184 (10%)

Query: 3   EIKAETTHHHPLIISEPDVSDQQEKSHLK----TQRLASLDIFRGLAVALMILVDHAG-- 56
           E+  E +     +I+  +    Q +  +      +R  ++++  G+AV  +I     G  
Sbjct: 80  ELSREESSTEQTVINRGEKEQPQAREVVTGDPLKRRYTTVELIMGVAVIAIICSSGIGVL 139

Query: 57  GDWPE-ISHAPWNGCNLADFVMPFFL----FIVGVAIALALKRIPDRADAVKKVIFRTLK 111
           G+ P  ++ + WNG +  D  +P  L    F++   + L +KR     +   K +   + 
Sbjct: 140 GEMPAFLAFSKWNGISFGDLGLPLLLASVCFMIPTEVELDVKRKKSFKEICIKKVKVGII 199

Query: 112 LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQS 171
           L   GIL+          L    +    R+ G+LQ IA+ Y+L SL+ +  +    K   
Sbjct: 200 LFVIGILIN---------LIGAWNFNSFRIMGILQMIAVVYMLGSLLYVLFRRFNFKSSV 250

Query: 172 VGRF 175
           +  F
Sbjct: 251 IAVF 254


>gi|331002841|ref|ZP_08326355.1| hypothetical protein HMPREF0491_01217 [Lachnospiraceae oral taxon
           107 str. F0167]
 gi|330413330|gb|EGG92698.1| hypothetical protein HMPREF0491_01217 [Lachnospiraceae oral taxon
           107 str. F0167]
          Length = 557

 Score = 41.2 bits (95), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 52/99 (52%), Gaps = 11/99 (11%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP---EISHAPWNGCNLAD----FVMP 78
           E+  + T+RL  LD+ + ++ ALMI++ HA  +     +I  + W G  + +    F +P
Sbjct: 205 EEKKINTERLIGLDLLKIIS-ALMIILIHASANIYNNHDIGSSVWFGGLILNVIPRFAVP 263

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGI 117
            FL I G   AL L R  +   AV+K ++  L L+ W +
Sbjct: 264 TFLMISG---ALLLGRSTEPRKAVRKALYAGLALVVWSV 299


>gi|320107689|ref|YP_004183279.1| hypothetical protein AciPR4_2506 [Terriglobus saanensis SP1PR4]
 gi|319926210|gb|ADV83285.1| hypothetical protein AciPR4_2506 [Terriglobus saanensis SP1PR4]
          Length = 419

 Score = 41.2 bits (95), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 53/113 (46%), Gaps = 20/113 (17%)

Query: 33  QRLASLDIFRGLAVALMI----LVDHAGGDWPEI----------SHAPWNGCNLADFVMP 78
           QR  ++D +RGL + LM+      +     +P            SH  W G  L D + P
Sbjct: 33  QRNVAVDAYRGLVMLLMMGEVMQFEVVARSFPSSTIWRILSFNQSHVQWVGMGLHDMIQP 92

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGILLQGGFSHAP 127
            F F+VGVA+  +L+    +  + +K+    I+R+  L+  GI L+    H+P
Sbjct: 93  SFTFLVGVALPYSLRSRQKKGQSFQKIVGHTIWRSFLLVALGIFLRS--IHSP 143


>gi|298384739|ref|ZP_06994299.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|298263018|gb|EFI05882.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 473

 Score = 41.2 bits (95), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 17/106 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFVMP 78
             R  +LD  RG A+  M+L            + HA    P+ I +    G    D V P
Sbjct: 2   NNRAYALDALRGYAIITMVLSATIVTQVLPGWMSHAQTPPPDHIFNPSLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
           FFLF +G A   ++ +  ++ D+  K+++    R ++L F+ I +Q
Sbjct: 62  FFLFAMGAAFPFSIGKRAEKGDSKLKLVYEAVKRGVQLTFFAIFIQ 107


>gi|29345856|ref|NP_809359.1| hypothetical protein BT_0446 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383122991|ref|ZP_09943678.1| hypothetical protein BSIG_0267 [Bacteroides sp. 1_1_6]
 gi|29337749|gb|AAO75553.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251841914|gb|EES69994.1| hypothetical protein BSIG_0267 [Bacteroides sp. 1_1_6]
          Length = 469

 Score = 41.2 bits (95), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 56/123 (45%), Gaps = 20/123 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFVMP 78
             R  +LD  RG A+  M+L            + HA    P+ I +    G    D V P
Sbjct: 2   NNRAYALDALRGYAIITMVLSATIVTQVLPGWMSHAQTPPPDHIFNPSLPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGV 134
           FFLF +G A   ++ +  ++ D+  K+++    R ++L F+ I +Q  +   P  L+   
Sbjct: 62  FFLFAMGAAFPFSIGKRAEKGDSKLKLVYEAVKRGVQLTFFAIFIQHFY---PYVLSSPQ 118

Query: 135 DVR 137
           D+R
Sbjct: 119 DIR 121


>gi|293369243|ref|ZP_06615833.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|292635668|gb|EFF54170.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
          Length = 466

 Score = 41.2 bits (95), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 20/123 (16%)

Query: 32  TQRLASLDIFRGLAVALMIL------------VDHAGGDWPEISH-APWNGCNLADFVMP 78
            +R  +LD  RG A+  M+L            + HA    P+ +  A  +G    D V P
Sbjct: 2   NKRAYALDALRGYAIITMVLSATVAWNSLPGWMYHAQTPPPDRAFDASLSGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGV 134
           FFLF +G A   ++K+  ++ D   ++++  +K    L F+ I +Q  +   P  L+   
Sbjct: 62  FFLFAMGAAFPFSIKKRFEKGDTKLRLVYEAIKRGVQLTFFAIFIQHFY---PYVLSNPQ 118

Query: 135 DVR 137
           DVR
Sbjct: 119 DVR 121


>gi|189463407|ref|ZP_03012192.1| hypothetical protein BACCOP_04126 [Bacteroides coprocola DSM 17136]
 gi|189429836|gb|EDU98820.1| hypothetical protein BACCOP_04126 [Bacteroides coprocola DSM 17136]
          Length = 467

 Score = 41.2 bits (95), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 54/123 (43%), Gaps = 28/123 (22%)

Query: 32  TQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFVMP 78
            QR  +LD  RG A+  M+L            + HA    PE I +    G    D V P
Sbjct: 2   NQRALALDALRGYAIITMVLSATIISSILPGWMSHAQTPPPEHIFNPEIPGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRA--------DAVKKVIFRTLKLLFWGILLQGGFSH---AP 127
           FFLF +G A   ++ R  ++         DA+K    R ++L F+ I +Q  + +   +P
Sbjct: 62  FFLFAMGAAFPFSIGRHAEKGRSKLMLCYDAIK----RGIQLTFFAIFIQHFYPYVISSP 117

Query: 128 DEL 130
            +L
Sbjct: 118 QDL 120


>gi|255532593|ref|YP_003092965.1| hypothetical protein Phep_2702 [Pedobacter heparinus DSM 2366]
 gi|255345577|gb|ACU04903.1| hypothetical protein Phep_2702 [Pedobacter heparinus DSM 2366]
          Length = 390

 Score = 41.2 bits (95), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 5/73 (6%)

Query: 34  RLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
           R  ++D+ R L + LMI V+  G       W +   A  +G   AD + P FLFIVG+++
Sbjct: 8   RFQAVDVLRALTMFLMIFVNDVGSVKYLPHWVDHVEADVDGMGFADTIFPAFLFIVGLSL 67

Query: 89  ALALKRIPDRADA 101
             AL+   ++  +
Sbjct: 68  PFALQSRMNKGKS 80


>gi|317503638|ref|ZP_07961656.1| transmembrane protein, partial [Prevotella salivae DSM 15606]
 gi|315665239|gb|EFV04888.1| transmembrane protein [Prevotella salivae DSM 15606]
          Length = 322

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 55/238 (23%), Positives = 88/238 (36%), Gaps = 74/238 (31%)

Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
           IRL GV  R+A  Y+L S V I       K + +  F           + A  +  YL L
Sbjct: 70  IRLVGVFPRLAFCYMLASFVAI-----SMKHKYIPYF-----------IGALFVGYYLIL 113

Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
            YG           N  + D   + NV                 ID  V G  H+Y    
Sbjct: 114 CYG-----------NGFAHDASNICNV-----------------IDEAVFGRQHLYQ--- 142

Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
           W                  D P       +PEGLLS++ ++   +IG   G  ++     
Sbjct: 143 W------------------DIP-------DPEGLLSNLPALGHVLIGFCVGRSLMSATSI 177

Query: 319 LARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             ++++    G  L   G  L +    P++K+L+T ++  VT G A+   + +  ++D
Sbjct: 178 NEKIEKLFLFGALLTFAGFLLSY--GCPISKKLWTPTFALVTCGLASTTLAMLSWIID 233


>gi|291515652|emb|CBK64862.1| hypothetical protein AL1_27050 [Alistipes shahii WAL 8301]
          Length = 466

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 43/95 (45%), Gaps = 16/95 (16%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEIS---------------HAPWNGCNLADFV 76
             R  ++DI RGLA+  M+L  +   + P++                 A   G    D V
Sbjct: 2   NNRAYAVDILRGLAIVGMVLSGYIAWN-PDLPAWLFHAQLPPPSFVFDASVAGITWVDLV 60

Query: 77  MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
            PFFLF +G A  L+L R  +R D  ++++   LK
Sbjct: 61  FPFFLFSMGAAFPLSLGRRLNRGDPPRRIVVSILK 95


>gi|262381364|ref|ZP_06074502.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262296541|gb|EEY84471.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 410

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 56/114 (49%), Gaps = 9/114 (7%)

Query: 32  TQRLASLDIFRGLAVALMILVDH--AGGDWPE-ISHAPW--NGCNLADFVMPFFLFIVGV 86
           TQR  ++DI R + + +MI V+      D P  + HA +  +   LAD V P FLF VG+
Sbjct: 9   TQRNIAIDILRAVTMCVMIFVNDFWTVHDVPHYLEHAAYGEDFMGLADVVFPAFLFAVGM 68

Query: 87  AIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           +I  A++R   +  + +  I     RTL LL  G  +    +   D+  + + V
Sbjct: 69  SIPFAIERRYAKGMSGESTILHILSRTLALLIMGAFIVNSEAGMADDALFPIGV 122


>gi|344238550|gb|EGV94653.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cricetulus
           griseus]
          Length = 423

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 10/88 (11%)

Query: 24  QQEKSHLKT--QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
           Q E  H      RL  +D FRG+A+ LM+ V++ GG +    H+ WN  +     +P  L
Sbjct: 188 QPETRHTSALPYRLRCVDTFRGIALILMVFVNYGGGKYWYFKHSSWN-VSWDKVRIPGVL 246

Query: 82  -------FIVGVAIALALKRIPDRADAV 102
                  F+V V   +  K +PDR   V
Sbjct: 247 QRLGVTYFVVAVLELIFSKPVPDRCALV 274


>gi|302796998|ref|XP_002980260.1| hypothetical protein SELMODRAFT_419933 [Selaginella moellendorffii]
 gi|300151876|gb|EFJ18520.1| hypothetical protein SELMODRAFT_419933 [Selaginella moellendorffii]
          Length = 312

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 23/34 (67%)

Query: 343 NAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             I +NK LY+ SY+C T+GAA  VF  +Y LVD
Sbjct: 38  QGIKMNKPLYSFSYMCFTAGAAGAVFCLLYILVD 71


>gi|406830436|ref|ZP_11090030.1| hypothetical protein SpalD1_02314 [Schlesneria paludicola DSM
           18645]
          Length = 380

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 57/123 (46%), Gaps = 18/123 (14%)

Query: 34  RLASLDIFRGLAVALMILVD---HAGGDWPE----------ISHAPWNGCNLADFVMPFF 80
           R+ASLD  RG A+A++++      A  + P+          +SH    G +L D   P F
Sbjct: 27  RVASLDTLRGFAIAILLIATPLVSALKEVPQSATRDMLVWQLSHVKGEGISLFDVGWPAF 86

Query: 81  LFIVGVAIALALKRIPDRAD----AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           L I GV++  +L R  +R +    A   ++ ++L    +   + GGFS  P    Y  DV
Sbjct: 87  LIIAGVSLNFSLARRLERGETRFAAWLDLVRKSLLCALFAFFVHGGFS-IPWAKVYFADV 145

Query: 137 RMI 139
             I
Sbjct: 146 LFI 148


>gi|323343607|ref|ZP_08083834.1| hypothetical protein HMPREF0663_10369 [Prevotella oralis ATCC
           33269]
 gi|323095426|gb|EFZ38000.1| hypothetical protein HMPREF0663_10369 [Prevotella oralis ATCC
           33269]
          Length = 468

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 17/111 (15%)

Query: 30  LKTQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFV 76
           +K +R  +LD  RG A+  MIL            + HA    P+ + +    G    D +
Sbjct: 1   MKQERAHALDALRGYAIMTMILSATEAFRVLPAWMYHAQVPPPDHVFNPSIYGITWVDLI 60

Query: 77  MPFFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGILLQGGF 123
            PFFLF +G AI L+L R      +++K+      R LKL F+ I +   F
Sbjct: 61  FPFFLFSMGAAIPLSLGRQYKAGASLRKLCRKSAIRWLKLAFFAIFIYHTF 111


>gi|347736758|ref|ZP_08869314.1| hypothetical protein AZA_88975 [Azospirillum amazonense Y2]
 gi|346919651|gb|EGY01093.1| hypothetical protein AZA_88975 [Azospirillum amazonense Y2]
          Length = 175

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 24/92 (26%), Positives = 47/92 (51%), Gaps = 2/92 (2%)

Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
           +S++ ++ + ++GV  G  +       AR++  +  G   ++ G       A P+NK L+
Sbjct: 1   MSTLPAVATALLGVQAGRWLARPVPLPARIRGLIVAGVVGVLLGEVWDL--AFPINKNLW 58

Query: 353 TLSYVCVTSGAAALVFSAIYALVDCNIFGPWT 384
           T S+V +T+G A L   A+  +V+   +  WT
Sbjct: 59  TSSFVALTAGMACLCLGALLWVVEVRGWRRWT 90


>gi|387791847|ref|YP_006256912.1| hypothetical protein Solca_2702 [Solitalea canadensis DSM 3403]
 gi|379654680|gb|AFD07736.1| hypothetical protein Solca_2702 [Solitalea canadensis DSM 3403]
          Length = 487

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 40/130 (30%), Positives = 57/130 (43%), Gaps = 25/130 (19%)

Query: 16  ISEPDVSDQQEKSHLKTQ-----RLASLDIFRGLAVALMIL-------------VDHAGG 57
           I+E + S + E S +K Q     R  +LD  RGLA+  M+              + H   
Sbjct: 4   ITEIEAS-KPEASFVKAQIDKAPRSFALDALRGLAIIGMVFSGVFPHEALWPGYMFHGQV 62

Query: 58  DWPEISHAPW-NGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----L 112
             P+  + P   G    D V PFFLF +G A  LA+ +     D  K VI   LK    L
Sbjct: 63  GPPDFKYTPEVPGITWVDLVFPFFLFSMGAAFPLAMNKKIQEGDQ-KGVILNVLKRFALL 121

Query: 113 LFWGILLQGG 122
           +F+ I+L+  
Sbjct: 122 VFFAIVLRNA 131


>gi|345856865|ref|ZP_08809325.1| hypothetical protein DOT_0678 [Desulfosporosinus sp. OT]
 gi|344330006|gb|EGW41324.1| hypothetical protein DOT_0678 [Desulfosporosinus sp. OT]
          Length = 237

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 9/94 (9%)

Query: 27  KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
            S + +QR   +D+ R LA+ LM+L  H   D  E +     G N+ D+  P + F++G 
Sbjct: 8   NSRISSQRYEEIDVLRALAIGLMVLF-HLAYDLKEFA-----GVNI-DYQAPLW-FVIGK 59

Query: 87  AIALALKRIPDRADAV-KKVIFRTLKLLFWGILL 119
             AL    I   +    K  + R LK+LFWG+++
Sbjct: 60  TSALLFIFISGLSSGFSKSSVRRGLKVLFWGMVV 93


>gi|325102778|ref|YP_004272432.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324971626|gb|ADY50610.1| hypothetical protein Pedsa_0021 [Pedobacter saltans DSM 12145]
          Length = 466

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 47/111 (42%), Gaps = 30/111 (27%)

Query: 29  HLKTQRLASLDIFRGLAVALMILVDHAGGD----W--------------PEISHAPWNGC 70
            L  +R +SLD  RG+A+ LM+L           W              PEI    W   
Sbjct: 2   KLTVKRDSSLDSLRGIAIILMVLSGSIAFSILPGWMYHAQVPPPGHKFMPEIPGITW--- 58

Query: 71  NLADFVMPFFLFIVGVAIALALKRIPDRADA-------VKKVIFRTLKLLF 114
              D V PFFLF +G AI LA+K+  + + +       VK+ +  T   LF
Sbjct: 59  --VDLVFPFFLFSMGAAIPLAMKKKIENSSSLNIFISIVKRFVLLTFFALF 107


>gi|256423178|ref|YP_003123831.1| hypothetical protein Cpin_4173 [Chitinophaga pinensis DSM 2588]
 gi|256038086|gb|ACU61630.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
          Length = 349

 Score = 40.0 bits (92), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 14/100 (14%)

Query: 31  KTQRLASLDIFRGLAVALMI----LVDHAGGDW------PEISHAPWNGCNLADFVMPFF 80
            + RL SLD+ RGL + L+      V  +  +W       +  H PW+G    D V P F
Sbjct: 3   NSGRLLSLDVMRGLIMILLAGESCRVYESLHEWHDNAFIRQFFHHPWHGLRFWDLVQPAF 62

Query: 81  LFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWG 116
           + + G A+ ++    L++    +   K ++ R+LKL   G
Sbjct: 63  MLMAGTAMYISYQSKLRKGVSWSQNFKHILIRSLKLFLLG 102


>gi|423299515|ref|ZP_17277540.1| hypothetical protein HMPREF1057_00681 [Bacteroides finegoldii
           CL09T03C10]
 gi|408473324|gb|EKJ91846.1| hypothetical protein HMPREF1057_00681 [Bacteroides finegoldii
           CL09T03C10]
          Length = 467

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 49/106 (46%), Gaps = 18/106 (16%)

Query: 33  QRLASLDIFRGLAVALMILVDH-AGGDWPE-ISHAPWN------------GCNLADFVMP 78
           +R  SLD FRG A+  M+L    A G  P  + HA               G    D V P
Sbjct: 2   KRAISLDAFRGYAIVTMVLSGTIASGVLPGWMYHAQMGPRSNYIFDPQLYGITWVDLVFP 61

Query: 79  FFLFIVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGILLQ 120
           FFLF +G AI  ++    ++ + + K+I     R ++L F+ I +Q
Sbjct: 62  FFLFAMGAAIPFSVGGKIEKGENLWKIIGECVLRGIRLAFFAIFIQ 107


>gi|160891390|ref|ZP_02072393.1| hypothetical protein BACUNI_03840 [Bacteroides uniformis ATCC 8492]
 gi|156858797|gb|EDO52228.1| hypothetical protein BACUNI_03840 [Bacteroides uniformis ATCC 8492]
          Length = 421

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 55/114 (48%), Gaps = 10/114 (8%)

Query: 15  IISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNG 69
           I+ + D+SD+   S    +R  ++D+ R L +  MI V+     H    W E +    + 
Sbjct: 8   ILKQDDMSDKTVYS----RRNPAIDMLRALTMFTMIFVNDFWKVHDVPHWLEHAVYGEDF 63

Query: 70  CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGF 123
             LAD V P FLF VG++I  A++R   +  + +  +   L   F  +L+ G F
Sbjct: 64  MGLADIVFPCFLFAVGMSIPYAIERRYAKGFSAESTLGHILSRTF-ALLVMGAF 116


>gi|420256474|ref|ZP_14759317.1| hypothetical protein PMI06_09785 [Burkholderia sp. BT03]
 gi|398043145|gb|EJL36078.1| hypothetical protein PMI06_09785 [Burkholderia sp. BT03]
          Length = 373

 Score = 39.7 bits (91), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 46/140 (32%), Positives = 62/140 (44%), Gaps = 18/140 (12%)

Query: 25  QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
           Q+       RL  LD FRGL V + I+VDH GG    +S A  +   L D     F+F+ 
Sbjct: 4   QKGMQTSKNRLIELDFFRGL-VLIFIVVDHIGGS--ILSRATLHAYALCD-AAEVFVFLG 59

Query: 85  GVAIALALKRIPDR---ADAVKKVIFRTLKL----LFWGILL------QGGFS-HAPDEL 130
           G A A A   +  R   ADA  +   R+L+L    L   +L+         FS  AP+  
Sbjct: 60  GFATATAYASLAKRHTEADARNRFFKRSLELYRAFLITAVLMLLVSAVMSAFSIDAPNMA 119

Query: 131 TYGVDVRMIRLCGVLQRIAL 150
           T  +D  M     VL+ I L
Sbjct: 120 TTDLDDMMDTPTAVLRDILL 139


>gi|377812665|ref|YP_005041914.1| hypothetical protein BYI23_B004200 [Burkholderia sp. YI23]
 gi|357937469|gb|AET91027.1| hypothetical protein BYI23_B004200 [Burkholderia sp. YI23]
          Length = 389

 Score = 39.7 bits (91), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 60/127 (47%), Gaps = 18/127 (14%)

Query: 23  DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
           +   +   K+QRL  LD FRGL V L+I++DH GG    +S    +   L D     F+F
Sbjct: 6   NTPSRMQQKSQRLVELDFFRGL-VLLIIVIDHIGGSM--LSRFTLHSFALND-AAEVFVF 61

Query: 83  IVGVAIALALKRIPD-RADAVKKVIF--RTLKL-----------LFWGILLQGGFSHAPD 128
           + G A A A   + + R+++  +V F  R  +L           L    +L+  F HAP+
Sbjct: 62  LGGFATATAYVSLAERRSESAARVRFLKRAFELYRAFVVTAVLMLVASFVLRPLFGHAPN 121

Query: 129 ELTYGVD 135
              + +D
Sbjct: 122 LALHDLD 128


>gi|416973167|ref|ZP_11937347.1| OpgC protein, partial [Burkholderia sp. TJI49]
 gi|325520603|gb|EGC99672.1| OpgC protein [Burkholderia sp. TJI49]
          Length = 141

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 48/95 (50%), Gaps = 8/95 (8%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGG 122
           +A   + +R D  A ++   R    ++   L+  G
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEIYRAFLVTAG 95


>gi|189468533|ref|ZP_03017318.1| hypothetical protein BACINT_04936 [Bacteroides intestinalis DSM
           17393]
 gi|189436797|gb|EDV05782.1| hypothetical protein BACINT_04936 [Bacteroides intestinalis DSM
           17393]
          Length = 393

 Score = 39.3 bits (90), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGG--DWPE-ISHAPWNG--CNLADFVMPFFLFIVGVA 87
           QR+A++D+FR L +  M+ V+   G  + P  + HA  N      +D + P FLF +G++
Sbjct: 7   QRVAAVDVFRALTMFFMLFVNDIPGLKNVPHWLMHAEMNEDMMGFSDTIFPAFLFCMGMS 66

Query: 88  IALALKRIPDRADAVKKVI 106
           I  A++    + D   ++I
Sbjct: 67  IPFAIQNRVKKGDTALQII 85


>gi|372221469|ref|ZP_09499890.1| hypothetical protein MzeaS_04069 [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 378

 Score = 39.3 bits (90), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 29/143 (20%)

Query: 233 LNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGL 292
           L    NA+  +D  +LG  H+Y                     +KD+      PF+PEGL
Sbjct: 174 LEMGSNAITRLDLFLLGEGHVY---------------------KKDS-----VPFDPEGL 207

Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLY 352
           LS++ SI++ ++G +F  V +   G+  +    V M    LI+ L   +    P++K+L+
Sbjct: 208 LSTLPSIVN-VLGGYFAGVYLKKNGNTFK-SIAVLMVAGFLIYTLGQWWNLIFPISKKLW 265

Query: 353 TLSYVCVTSGAAALVFSA-IYAL 374
           T S+   T G    + +  +YA+
Sbjct: 266 TSSFALHTIGLDLFILATLVYAI 288


>gi|312796297|ref|YP_004029219.1| OpgC protein [Burkholderia rhizoxinica HKI 454]
 gi|312168072|emb|CBW75075.1| OpgC [Burkholderia rhizoxinica HKI 454]
          Length = 330

 Score = 38.9 bits (89), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 44/88 (50%), Gaps = 9/88 (10%)

Query: 17  SEPDVSDQQEKSHL---KTQRLASLDIFRGLAVALMILVDHAGGDW-PEISHAPWNGCNL 72
           S P +      S+L    T RLA LD FRGL V L+I+VDH GG     ++   +  C+ 
Sbjct: 3   SRPALPFHPSPSYLMQPSTARLAELDFFRGL-VLLIIVVDHIGGSMLSRVTLHTYALCDA 61

Query: 73  ADFVMPFFLFIVGVAIALALKRIPDRAD 100
           A+     F+F+ G A A+    +  R D
Sbjct: 62  AE----VFVFLGGYATAIGWTTLAARCD 85


>gi|406834557|ref|ZP_11094151.1| hypothetical protein SpalD1_23036 [Schlesneria paludicola DSM
           18645]
          Length = 534

 Score = 38.9 bits (89), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 35/67 (52%), Gaps = 6/67 (8%)

Query: 32  TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCN----LADFVMPFFLFIVGVA 87
           T+RL SLD FRG  VA MILV+  G     + H+ +   N     AD +MP F F VG +
Sbjct: 75  TERLVSLDQFRGYTVAGMILVNFIGSF--AVVHSIFKHNNNYFSYADSIMPGFHFAVGYS 132

Query: 88  IALALKR 94
             L   R
Sbjct: 133 YRLTFLR 139


>gi|333380436|ref|ZP_08472127.1| hypothetical protein HMPREF9455_00293 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826431|gb|EGJ99260.1| hypothetical protein HMPREF9455_00293 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 469

 Score = 38.9 bits (89), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 25/133 (18%)

Query: 34  RLASLDIFRGLAVALMIL------------VDHAGGDWPEISHAPWN-GCNLADFVMPFF 80
           R  +LD  RG A+  M+L            + HA    P     P   G    D V PFF
Sbjct: 4   RSCALDALRGYAIVTMVLSGAVVYGVLPGWMYHAQVPPPTHVFNPAAPGITWVDLVFPFF 63

Query: 81  LFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGF----SHAPDELTY 132
           LF +G A   ++++  +R ++  K+I+    R+++L F+ I ++  +    S+  D   +
Sbjct: 64  LFAMGSAFPFSIRKRLERGESKLKLIYDALKRSIQLTFFAIFIRHFYPYVLSNPEDARAW 123

Query: 133 GVDVRMIRLCGVL 145
           G+ +    LC VL
Sbjct: 124 GLSL----LCFVL 132


>gi|313220553|emb|CBY31402.1| unnamed protein product [Oikopleura dioica]
          Length = 194

 Score = 38.9 bits (89), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 6/82 (7%)

Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNA-- 344
           ++PEG+L S++SIL   +G+  G +    +    R  +    G  L   G  L   N   
Sbjct: 46  YDPEGILGSINSILIVFLGLQAGRIFNFYETFQQRAIRLSVWGTVLTAVGGALTGLNQFQ 105

Query: 345 ----IPLNKQLYTLSYVCVTSG 362
               IP+ K L+TLS+V V +G
Sbjct: 106 EGSNIPIAKNLWTLSFVLVMAG 127


>gi|421867873|ref|ZP_16299526.1| OpgC protein [Burkholderia cenocepacia H111]
 gi|358072286|emb|CCE50404.1| OpgC protein [Burkholderia cenocepacia H111]
          Length = 382

 Score = 38.9 bits (89), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 9/100 (9%)

Query: 17  SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADF 75
           + P        +  +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+ 
Sbjct: 9   TRPAPEGAPMTAPARAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE- 66

Query: 76  VMPFFLFIVGVAIALALKRIPDRAD---AVKKVIFRTLKL 112
               F+F+ G A A+A   + +R D   A ++ I R  ++
Sbjct: 67  ---VFVFLGGFATAIAYNSLAERHDEAAARQRFIRRAFEI 103


>gi|115358787|ref|YP_775925.1| hypothetical protein Bamb_4038 [Burkholderia ambifaria AMMD]
 gi|115284075|gb|ABI89591.1| conserved hypothetical protein [Burkholderia ambifaria AMMD]
          Length = 384

 Score = 38.5 bits (88), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 23  RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 77

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 78  IAYNSLAERHDEAAARQRFIRRAFEI 103


>gi|172063517|ref|YP_001811168.1| hypothetical protein BamMC406_4496 [Burkholderia ambifaria MC40-6]
 gi|171996034|gb|ACB66952.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6]
          Length = 367

 Score = 38.5 bits (88), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|358344082|ref|XP_003636122.1| Magnesium transporter NIPA2 [Medicago truncatula]
 gi|355502057|gb|AES83260.1| Magnesium transporter NIPA2 [Medicago truncatula]
          Length = 328

 Score = 38.5 bits (88), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 23/32 (71%)

Query: 116 GILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
           G +  GG+ H   +LT+GVD++ IRL G+LQR
Sbjct: 105 GGVFTGGYVHRVSDLTFGVDLKQIRLMGILQR 136


>gi|402568588|ref|YP_006617932.1| hypothetical protein GEM_3848 [Burkholderia cepacia GG4]
 gi|402249785|gb|AFQ50238.1| hypothetical protein GEM_3848 [Burkholderia cepacia GG4]
          Length = 365

 Score = 38.5 bits (88), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|170703183|ref|ZP_02893992.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10]
 gi|170131915|gb|EDT00434.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10]
          Length = 367

 Score = 38.5 bits (88), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|171315634|ref|ZP_02904868.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5]
 gi|171099166|gb|EDT43939.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5]
          Length = 367

 Score = 38.1 bits (87), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|319900329|ref|YP_004160057.1| hypothetical protein Bache_0445 [Bacteroides helcogenes P 36-108]
 gi|319415360|gb|ADV42471.1| hypothetical protein Bache_0445 [Bacteroides helcogenes P 36-108]
          Length = 414

 Score = 38.1 bits (87), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 52/120 (43%), Gaps = 9/120 (7%)

Query: 26  EKSHLKTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPFF 80
             S   ++R  ++D+ R L +  MI V+     H    W E +    +   LAD V P F
Sbjct: 8   NTSATYSRRNLAIDMLRALTMFTMIFVNDFWKVHDIPRWLEHAGYGEDFMGLADVVFPCF 67

Query: 81  LFIVGVAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
           LF VG++I  A++R   +  + +     +  RT  LL  G  +         E+ Y + V
Sbjct: 68  LFAVGMSIPYAIERRYAKGFSAESTLGHIFLRTFALLVMGAFITNSEYRLSPEVPYPIGV 127


>gi|134293811|ref|YP_001117547.1| hypothetical protein Bcep1808_5131 [Burkholderia vietnamiensis G4]
 gi|387904835|ref|YP_006335173.1| OpgC protein [Burkholderia sp. KJ006]
 gi|134136968|gb|ABO58082.1| conserved hypothetical protein [Burkholderia vietnamiensis G4]
 gi|387579727|gb|AFJ88442.1| OpgC protein [Burkholderia sp. KJ006]
          Length = 367

 Score = 38.1 bits (87), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|206563639|ref|YP_002234402.1| hypothetical protein BCAM1788 [Burkholderia cenocepacia J2315]
 gi|444363993|ref|ZP_21164352.1| OpgC protein [Burkholderia cenocepacia BC7]
 gi|444373170|ref|ZP_21172575.1| OpgC protein [Burkholderia cenocepacia K56-2Valvano]
 gi|198039679|emb|CAR55648.1| putative membrane protein [Burkholderia cenocepacia J2315]
 gi|443592216|gb|ELT61038.1| OpgC protein [Burkholderia cenocepacia K56-2Valvano]
 gi|443593862|gb|ELT62568.1| OpgC protein [Burkholderia cenocepacia BC7]
          Length = 365

 Score = 38.1 bits (87), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|161520584|ref|YP_001584011.1| hypothetical protein Bmul_4038 [Burkholderia multivorans ATCC
           17616]
 gi|189353228|ref|YP_001948855.1| OpgC protein [Burkholderia multivorans ATCC 17616]
 gi|221209834|ref|ZP_03582815.1| putative membrane protein [Burkholderia multivorans CGD1]
 gi|421474568|ref|ZP_15922594.1| OpgC protein [Burkholderia multivorans CF2]
 gi|160344634|gb|ABX17719.1| conserved hypothetical protein [Burkholderia multivorans ATCC
           17616]
 gi|189337250|dbj|BAG46319.1| OpgC protein [Burkholderia multivorans ATCC 17616]
 gi|221170522|gb|EEE02988.1| putative membrane protein [Burkholderia multivorans CGD1]
 gi|400231859|gb|EJO61520.1| OpgC protein [Burkholderia multivorans CF2]
          Length = 365

 Score = 38.1 bits (87), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDESAARQRFIRRAFEI 86


>gi|107026114|ref|YP_623625.1| hypothetical protein Bcen_3760 [Burkholderia cenocepacia AU 1054]
 gi|116692702|ref|YP_838235.1| hypothetical protein Bcen2424_4608 [Burkholderia cenocepacia
           HI2424]
 gi|105895488|gb|ABF78652.1| conserved hypothetical protein [Burkholderia cenocepacia AU 1054]
 gi|116650702|gb|ABK11342.1| conserved hypothetical protein [Burkholderia cenocepacia HI2424]
          Length = 365

 Score = 38.1 bits (87), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|167584370|ref|ZP_02376758.1| hypothetical protein BuboB_03474 [Burkholderia ubonensis Bu]
          Length = 365

 Score = 38.1 bits (87), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           K  R A LD FRGL V L+I++DH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   KPGRYAELDFFRGL-VLLVIVIDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|170738048|ref|YP_001779308.1| hypothetical protein Bcenmc03_5696 [Burkholderia cenocepacia MC0-3]
 gi|254248202|ref|ZP_04941522.1| hypothetical protein BCPG_03029 [Burkholderia cenocepacia PC184]
 gi|124874703|gb|EAY64693.1| hypothetical protein BCPG_03029 [Burkholderia cenocepacia PC184]
 gi|169820236|gb|ACA94818.1| conserved hypothetical protein [Burkholderia cenocepacia MC0-3]
          Length = 365

 Score = 38.1 bits (87), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|373954327|ref|ZP_09614287.1| hypothetical protein Mucpa_2712 [Mucilaginibacter paludis DSM
          18603]
 gi|373890927|gb|EHQ26824.1| hypothetical protein Mucpa_2712 [Mucilaginibacter paludis DSM
          18603]
          Length = 473

 Score = 38.1 bits (87), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 34/76 (44%), Gaps = 14/76 (18%)

Query: 33 QRLASLDIFRGLAVALMIL-------------VDHAGGDWPEISHAP-WNGCNLADFVMP 78
          QR  SLD  RG A+ LM+L             + HA    P     P   G    D V P
Sbjct: 11 QRANSLDALRGTAILLMVLSGSIAFGGILPGWMYHAQVPPPAHQFKPDLPGITWVDLVFP 70

Query: 79 FFLFIVGVAIALALKR 94
          FFLF +G AI LAL +
Sbjct: 71 FFLFAMGAAIPLALVK 86


>gi|421468975|ref|ZP_15917475.1| OpgC protein [Burkholderia multivorans ATCC BAA-247]
 gi|400230841|gb|EJO60585.1| OpgC protein [Burkholderia multivorans ATCC BAA-247]
          Length = 365

 Score = 38.1 bits (87), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDESAARQRFIRRAFEI 86


>gi|78062083|ref|YP_371991.1| hypothetical protein Bcep18194_B1233 [Burkholderia sp. 383]
 gi|77969968|gb|ABB11347.1| conserved hypothetical protein [Burkholderia sp. 383]
          Length = 365

 Score = 38.1 bits (87), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDEAAARQRFIRRAFEI 86


>gi|221196140|ref|ZP_03569187.1| putative membrane protein [Burkholderia multivorans CGD2M]
 gi|221202813|ref|ZP_03575832.1| putative membrane protein [Burkholderia multivorans CGD2]
 gi|221176747|gb|EEE09175.1| putative membrane protein [Burkholderia multivorans CGD2]
 gi|221182694|gb|EEE15094.1| putative membrane protein [Burkholderia multivorans CGD2M]
          Length = 365

 Score = 38.1 bits (87), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)

Query: 31  KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
           +  R A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F+ G A A
Sbjct: 6   RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60

Query: 90  LALKRIPDRAD---AVKKVIFRTLKL 112
           +A   + +R D   A ++ I R  ++
Sbjct: 61  IAYNSLAERHDESAARQRFIRRAFEI 86


>gi|83716709|ref|YP_439651.1| opgC protein [Burkholderia thailandensis E264]
 gi|83650534|gb|ABC34598.1| opgC protein, putative [Burkholderia thailandensis E264]
          Length = 431

 Score = 37.7 bits (86), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 48/93 (51%), Gaps = 9/93 (9%)

Query: 24  QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW-PEISHAPWNGCNLADFVMPFFLF 82
           ++  +    QR A LD FRGL V L+I+VDH GG     ++   +  C+ A+     F+F
Sbjct: 45  RRRMNAAPAQRYAELDFFRGL-VLLVIVVDHIGGSMLSRVTLHAYALCDAAE----VFVF 99

Query: 83  IVGVAIALALKRIPDR---ADAVKKVIFRTLKL 112
           + G A A+A   +  R   A A ++ I R  ++
Sbjct: 100 LGGFATAIAYNSLAARHTEAAARQRFIKRAFEI 132


>gi|390575261|ref|ZP_10255366.1| hypothetical protein WQE_42474 [Burkholderia terrae BS001]
 gi|389932764|gb|EIM94787.1| hypothetical protein WQE_42474 [Burkholderia terrae BS001]
          Length = 367

 Score = 37.7 bits (86), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 18/132 (13%)

Query: 33  QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
            RL  LD FRGL V + I+VDH GG    +S A  +   L D     F+F+ G A A A 
Sbjct: 6   NRLIELDFFRGL-VLIFIVVDHIGGS--ILSRATLHAYALCD-AAEVFVFLGGFATATAY 61

Query: 93  KRIPDR---ADAVKKVIFRTLKL----LFWGILL------QGGFS-HAPDELTYGVDVRM 138
             +  R   ADA  +   R+L+L    L   +L+         FS  AP+  T  +D  M
Sbjct: 62  ASLAKRHTEADARNRFFKRSLELYRAFLITAVLMLLVSAVMSAFSIDAPNMATTDLDDMM 121

Query: 139 IRLCGVLQRIAL 150
                VL+ I L
Sbjct: 122 DTPTAVLRDILL 133


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.327    0.141    0.457 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,914,411,162
Number of Sequences: 23463169
Number of extensions: 238983224
Number of successful extensions: 725787
Number of sequences better than 100.0: 942
Number of HSP's better than 100.0 without gapping: 625
Number of HSP's successfully gapped in prelim test: 317
Number of HSP's that attempted gapping in prelim test: 722204
Number of HSP's gapped (non-prelim): 1693
length of query: 384
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 240
effective length of database: 8,980,499,031
effective search space: 2155319767440
effective search space used: 2155319767440
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 78 (34.7 bits)