BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 014142
         (430 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359481929|ref|XP_002268831.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Vitis vinifera]
 gi|297739972|emb|CBI30154.3| unnamed protein product [Vitis vinifera]
          Length = 489

 Score =  620 bits (1598), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 308/426 (72%), Positives = 345/426 (80%), Gaps = 20/426 (4%)

Query: 3   DLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKR 62
           D + VEEGLG        ++D  +  + I K++         EQKGE Q   L++QKSKR
Sbjct: 4   DAKRVEEGLGHVH-----KEDISEKADKIEKDESSATPAQSVEQKGEEQ--PLIKQKSKR 56

Query: 63  VATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK 122
           VATLDAFRGLT+VLMILVDDAGG+YARIDHSPWNGCTLADFVMPFFLFIVGVA+ALALKK
Sbjct: 57  VATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLFIVGVAVALALKK 116

Query: 123 VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVV 182
           +P+I+ AVKKI  RTLKLLFWGI+LQGGYSHAPD LSYGVDMKHIRW GILQRIA+VY V
Sbjct: 117 IPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWFGILQRIAVVYFV 176

Query: 183 VALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSD 242
           VALIETLTTKRRP V++  H SI +AY+WQWIGGF+AF+IY+ITTY+LYVP+WSF    D
Sbjct: 177 VALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFVAFLIYMITTYALYVPDWSFVIDQD 236

Query: 243 HGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRE 302
           H  K+Y VKCGMRGHLGPACNAVGYVDR++WGINHLYS PVW+RL+ACTLSSPNSGP RE
Sbjct: 237 HEAKRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYSQPVWTRLKACTLSSPNSGPFRE 296

Query: 303 DAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAI 362
           DAPSWC APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH+ RLK WVSMG  LLI+AI
Sbjct: 297 DAPSWCYAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHAERLKQWVSMGIVLLIVAI 356

Query: 363 ILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAWSTRYPS 422
           ILHFT+AIPINKQLYSFSYVCFTAGAA    GI  S F   +D          W  R P 
Sbjct: 357 ILHFTDAIPINKQLYSFSYVCFTAGAA----GIVLSAFYLVID---------VWGFRTPF 403

Query: 423 RICKWM 428
              +W+
Sbjct: 404 LFLEWI 409


>gi|147817637|emb|CAN64496.1| hypothetical protein VITISV_004036 [Vitis vinifera]
          Length = 511

 Score =  608 bits (1568), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 308/448 (68%), Positives = 345/448 (77%), Gaps = 42/448 (9%)

Query: 3   DLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKR 62
           D + VEEGLG        ++D  +  + I K++         EQKGE Q   L++QKSKR
Sbjct: 4   DAKRVEEGLGHVH-----KEDISEKADKIEKDESSATPAQSVEQKGEEQ--PLIKQKSKR 56

Query: 63  VATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK 122
           VATLDAFRGLT+VLMILVDDAGG+YARIDHSPWNGCTLADFVMPFFLFIVGVA+ALALKK
Sbjct: 57  VATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLFIVGVAVALALKK 116

Query: 123 VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ-------- 174
           +P+I+ AVKKI  RTLKLLFWGI+LQGGYSHAPD LSYGVDMKHIRW GILQ        
Sbjct: 117 IPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWFGILQVFPLPLFT 176

Query: 175 --------------RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAF 220
                         RIA+VY VVALIETLTTKRRP V++  H SI +AY+WQWIGGF+AF
Sbjct: 177 GKSIPSSSLSGFLQRIAVVYFVVALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFVAF 236

Query: 221 VIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYS 280
           +IY+ITTY+LYVP+WSF    DH  K+Y VKCGMRGHLGPACNAVGYVDR++WGINHLYS
Sbjct: 237 LIYMITTYALYVPDWSFVIDQDHEAKRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYS 296

Query: 281 DPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHF 340
            PVW+RL+ACTLSSPNSGP REDAPSWC APFEPEGLLSTISAILSGTIGIHYGHVLIHF
Sbjct: 297 QPVWTRLKACTLSSPNSGPFREDAPSWCYAPFEPEGLLSTISAILSGTIGIHYGHVLIHF 356

Query: 341 KGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
           KGH+ RLK WVSMG  LLI+AIILHFT+AIPINKQLYSFSYVCFTAGAA    GI  S F
Sbjct: 357 KGHAERLKQWVSMGIVLLIVAIILHFTDAIPINKQLYSFSYVCFTAGAA----GIVXSAF 412

Query: 401 IPEMDWDECDACFRAWSTRYPSRICKWM 428
              +D          W  R P    +W+
Sbjct: 413 YLVID---------VWGFRTPFLFLEWI 431


>gi|255556868|ref|XP_002519467.1| conserved hypothetical protein [Ricinus communis]
 gi|223541330|gb|EEF42881.1| conserved hypothetical protein [Ricinus communis]
          Length = 519

 Score =  607 bits (1565), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 302/401 (75%), Positives = 332/401 (82%), Gaps = 13/401 (3%)

Query: 1   MADLRIVEEGLGRTQLVEQEQDDGKDSEN--------GINKEKGLERSEVQDEQKGELQL 52
           M D R +EEGL   ++  + Q +   SE         G+  EK L  S V  EQ+GE QL
Sbjct: 14  MEDPRKLEEGLAHAKVANENQQEQHLSEKLDKTHDGGGVIPEKELTSSTVLVEQEGE-QL 72

Query: 53  QQLLQ----QKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFF 108
           QQ  Q    QK+KRVATLDAFRGLTVVLMILVD+AG +YARIDHSPWNGCTLADFVMPFF
Sbjct: 73  QQPEQLPVKQKTKRVATLDAFRGLTVVLMILVDNAGESYARIDHSPWNGCTLADFVMPFF 132

Query: 109 LFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           LFIVGVAIALALK++P+   AVKKI  RTLKLLFWGI+LQGGYSHAP  LSYGVDMK IR
Sbjct: 133 LFIVGVAIALALKRIPRKRDAVKKISLRTLKLLFWGILLQGGYSHAPVDLSYGVDMKLIR 192

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
           WCGILQRIALVY+ VALIETLT K R  VL+P H SIFTAY+WQWIGGFIAF+IY+ITTY
Sbjct: 193 WCGILQRIALVYMFVALIETLTIKERQTVLQPNHFSIFTAYRWQWIGGFIAFLIYMITTY 252

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
           +LYVP+WSF+ + D+   +Y VKCGMRGHLGPACNAVGYVDRE+WGINHLY  PVWSRL+
Sbjct: 253 ALYVPDWSFTAYDDNRPTRYTVKCGMRGHLGPACNAVGYVDREVWGINHLYQYPVWSRLK 312

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
           ACT SSP +GPLR DAPSWC APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS RLK
Sbjct: 313 ACTFSSPATGPLRADAPSWCLAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSERLK 372

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            WVSMG GL +IAIILHFT+AIPINKQLYSFSYVCFTAGAA
Sbjct: 373 QWVSMGLGLFLIAIILHFTDAIPINKQLYSFSYVCFTAGAA 413


>gi|224069583|ref|XP_002326379.1| predicted protein [Populus trichocarpa]
 gi|222833572|gb|EEE72049.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  579 bits (1493), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 297/433 (68%), Positives = 339/433 (78%), Gaps = 28/433 (6%)

Query: 7   VEEGLGRTQLVEQEQDDG-------KDSENGINKEKGLERSEVQD---EQKGELQLQQLL 56
           +EEGLG T LV    D+          ++ G + EK  ER  V D   E++G+ Q   ++
Sbjct: 1   MEEGLGHTALVANIDDENIHLSEKEGKTDGGDDNEKE-ERRAVHDHLAEREGDRQ--PVV 57

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           +QKSKRVATLDAFRGLT+VLMILVDDAGG Y RIDHSPWNGCTLADFVMPFFLFIVGVAI
Sbjct: 58  KQKSKRVATLDAFRGLTIVLMILVDDAGGVYPRIDHSPWNGCTLADFVMPFFLFIVGVAI 117

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL-QR 175
           ALA K++PK   AVKKII RTLKLLFWG++LQGGYSHAP  L+YGVDMK IRW GIL QR
Sbjct: 118 ALAFKRIPKRRDAVKKIILRTLKLLFWGVLLQGGYSHAPSDLAYGVDMKLIRWFGILQQR 177

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW 235
           IALVY+VVALIE L  K R   +EP H +IFTAY+WQWI GFI+FVIY++TT++LYVP+W
Sbjct: 178 IALVYMVVALIEALIPKNR-QTIEPDHFTIFTAYRWQWIAGFISFVIYMVTTFALYVPDW 236

Query: 236 SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSP 295
           SF+   DH  ++Y V+CGMRGHLGPACNAVGYVDRE+WGINHLY  PVWSRL+ACTLSSP
Sbjct: 237 SFTVDEDHERRRYTVECGMRGHLGPACNAVGYVDREVWGINHLYQYPVWSRLKACTLSSP 296

Query: 296 NSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF 355
            SGP R+DAPSWCRAPFEPEGLLS+ISAILSGTIGIHYGHVLIHFKGH+ RL+ WVSMG 
Sbjct: 297 GSGPFRKDAPSWCRAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHAERLRQWVSMGV 356

Query: 356 GLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRA 415
            LLI+AIILHFT+AIPINKQLYSFSYVCFTAGAA    GI  S F   +D          
Sbjct: 357 ILLIVAIILHFTDAIPINKQLYSFSYVCFTAGAA----GIVFSGFYVLID---------V 403

Query: 416 WSTRYPSRICKWM 428
           W  R P    +W+
Sbjct: 404 WGLRPPFLFLEWI 416


>gi|356503734|ref|XP_003520659.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 508

 Score =  576 bits (1484), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 296/438 (67%), Positives = 336/438 (76%), Gaps = 25/438 (5%)

Query: 3   DLRIVEEGLGRTQLVEQEQDDGKD-----SENGINKEKGLERSEVQDEQKGEL------Q 51
           D + +EEGL      +  +DD K      + NG +  +  + +  +   +GE       Q
Sbjct: 4   DPKRMEEGLNSALNGDGNKDDLKKRATIKTSNGGSIFEHDKDTMAKPVAEGESVQQIAEQ 63

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFI 111
            Q  ++QK+KRVATLDAFRGLT+VLMILVDDAG AY RIDHSPWNGCTLADFVMPFFLFI
Sbjct: 64  EQPPVKQKTKRVATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFLFI 123

Query: 112 VGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
           VGVAIALALK++ KI  +VKKII RTLKLLFWGIILQGGYSHAPD L YGV+MK IRWCG
Sbjct: 124 VGVAIALALKRISKIKHSVKKIILRTLKLLFWGIILQGGYSHAPDDLEYGVNMKFIRWCG 183

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           ILQRIALVY VVALIET TTK RP  L   HLSIF AY+WQW GGF+AF+IY+ITT+SLY
Sbjct: 184 ILQRIALVYCVVALIETFTTKLRPTTLASGHLSIFAAYKWQWFGGFVAFLIYMITTFSLY 243

Query: 232 VPNWSFSEH-SDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
           VP+WSF +H +    K+Y V CGMRGHLGPACNAVG+VDR++WG+NHLYS PVW RL+AC
Sbjct: 244 VPDWSFVDHFNGDEPKRYTVICGMRGHLGPACNAVGHVDRQVWGVNHLYSQPVWRRLKAC 303

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
           T SSP SGP R+DAPSWC APFEPEGLLS+ISAILSGTIGIHYGHVLIHFKGHS RLK W
Sbjct: 304 TFSSPGSGPFRDDAPSWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSERLKQW 363

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECD 410
           VSMGF LLIIAIILHFT+A+PINKQLYSFSYVCFTAGAA    GI  S F   +D     
Sbjct: 364 VSMGFVLLIIAIILHFTDALPINKQLYSFSYVCFTAGAA----GIVFSGFYILID----- 414

Query: 411 ACFRAWSTRYPSRICKWM 428
                W  R P    +W+
Sbjct: 415 ----VWGLRTPFLFLEWI 428


>gi|356570776|ref|XP_003553560.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
           N-acetyltransferase-like [Glycine max]
          Length = 509

 Score =  565 bits (1455), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 282/396 (71%), Positives = 320/396 (80%), Gaps = 13/396 (3%)

Query: 7   VEEGLGRTQLVEQEQDDGK-----DSENG---INKEKGLER----SEVQDEQKGELQLQQ 54
           +EEG+         +DD K      + NG   I  +KG       +E +  QK   Q Q 
Sbjct: 8   MEEGINSALNGGGNKDDLKRRVTIKTSNGGSVIEHDKGTMAKPYGAESESVQKIAEQEQP 67

Query: 55  LLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           +++QK+KR+ATLDAFRGLT+VLMILVDDAG AY RIDHSPWNGCTLADFVMPFFLFIVG+
Sbjct: 68  VVKQKTKRIATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFLFIVGI 127

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           AIALALK++ KI  AVKKII RTLKLLFWGIILQGGYSHAPD L YGV+MK IRWCGILQ
Sbjct: 128 AIALALKRIAKIKHAVKKIILRTLKLLFWGIILQGGYSHAPDDLEYGVNMKFIRWCGILQ 187

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RIALVY VVALIET TTK RP  L   HLSIFTAY+WQW GGF+AF+IY+ITT++LYVP+
Sbjct: 188 RIALVYCVVALIETFTTKLRPTTLASGHLSIFTAYKWQWFGGFVAFIIYMITTFTLYVPH 247

Query: 235 WSFSEH-SDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
           WSF +H +    K+Y V CGMRGHLGPACNAVG+VDR++WG+NHLYS PVW RL+     
Sbjct: 248 WSFLDHFNGDEPKRYTVICGMRGHLGPACNAVGHVDRQVWGVNHLYSQPVWRRLKMTIDY 307

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
           SP SGP R+DAPSWCR+PFEPEGLLS+ISAILSGTIGIHYGH+LIHFKGHS RLK WV M
Sbjct: 308 SPASGPFRDDAPSWCRSPFEPEGLLSSISAILSGTIGIHYGHILIHFKGHSERLKQWVLM 367

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           GF LLIIAIILHFT+A+PINKQLYSFSYVCFTAGAA
Sbjct: 368 GFVLLIIAIILHFTDALPINKQLYSFSYVCFTAGAA 403


>gi|356548323|ref|XP_003542552.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 419

 Score =  544 bits (1401), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 259/352 (73%), Positives = 297/352 (84%), Gaps = 13/352 (3%)

Query: 77  MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR 136
           M+LVDDAGGAY RIDHSPWNGCTLADFVMPFFLFIVGVAIALALK++PK+  AVKKII R
Sbjct: 1   MVLVDDAGGAYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVKYAVKKIILR 60

Query: 137 TLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN 196
           TLKLLFWGI+LQGGYSHAPD LSYGVDM+ IRWCGILQRIALVY VVALIET TTK RP+
Sbjct: 61  TLKLLFWGILLQGGYSHAPDDLSYGVDMRFIRWCGILQRIALVYCVVALIETYTTKLRPS 120

Query: 197 VLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRG 256
            L+P HLSIFTAY+WQW+GGF+AFVIY++T +SLYVP+WSF +++    K+Y V+CGMRG
Sbjct: 121 TLKPGHLSIFTAYRWQWLGGFVAFVIYMVTIFSLYVPDWSFVDYNSDKPKRYTVECGMRG 180

Query: 257 HLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEG 316
           HLGPACNAVGYVDR++WG+NHLYS PVW+RL+ACTLSSP  GPLR++AP+WCRAPFEPEG
Sbjct: 181 HLGPACNAVGYVDRQVWGVNHLYSQPVWTRLKACTLSSPAEGPLRKNAPAWCRAPFEPEG 240

Query: 317 LLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQL 376
            LS++ AILSGTIGIHYGHVLIHFKGH  RLK W+SMGF LL + +ILHFT+AIPINKQL
Sbjct: 241 FLSSVLAILSGTIGIHYGHVLIHFKGHFERLKQWLSMGFVLLTLGLILHFTDAIPINKQL 300

Query: 377 YSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWM 428
           YSFSYVCFTAGAA    GI  S+F   +D          W  R P    +W+
Sbjct: 301 YSFSYVCFTAGAA----GIVFSVFYLLID---------VWGLRTPFLFLEWI 339


>gi|449440411|ref|XP_004137978.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
 gi|449517341|ref|XP_004165704.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 488

 Score =  520 bits (1340), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 274/415 (66%), Positives = 315/415 (75%), Gaps = 19/415 (4%)

Query: 18  EQEQDDGKDSENGINKEKGLE--RSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVV 75
           +Q+ D+ K+  +    +  ++    E + E    +  +  L+QK+KRVATLDAFRGLT+V
Sbjct: 9   KQKDDEEKNMSSHTTSDVIIDCHTKEEEKEVAPTIVEEAQLRQKTKRVATLDAFRGLTIV 68

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF 135
           LMILVDDAGGAY+RIDHSPWNGCTLADFVMPFFLFIVGVAIALA K++  I   V KI  
Sbjct: 69  LMILVDDAGGAYSRIDHSPWNGCTLADFVMPFFLFIVGVAIALAFKRIGSIKQGVMKISL 128

Query: 136 RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           RT+KL+FWG+ILQGGYSHAPD L YGVDMKHIRWCGILQRIALVY VVA+IE  TT  +P
Sbjct: 129 RTIKLVFWGLILQGGYSHAPDDLEYGVDMKHIRWCGILQRIALVYFVVAMIEAFTTIGKP 188

Query: 196 NV-LEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSE-HSDHGVKKYIVKCG 253
            V L+  H SIFTAY+W  IGGF AF+IYIITTY+LYVPNWSFS    D  +  Y V CG
Sbjct: 189 RVVLDHGHFSIFTAYRW--IGGFAAFIIYIITTYALYVPNWSFSVLEDDQLLHHYTVVCG 246

Query: 254 MRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFE 313
           +RGHLGPACNAVG+VDR++WGINHLYS PVW R + CT S+P+ GPLR+DA SWC APFE
Sbjct: 247 VRGHLGPACNAVGHVDRQVWGINHLYSYPVWIRHKDCTFSAPDEGPLRDDAASWCLAPFE 306

Query: 314 PEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPIN 373
           PEGLLS++SAILSGTIGIHYGHVL+HFK HS RLK WVSMGFG  II IILHFTNAIPIN
Sbjct: 307 PEGLLSSVSAILSGTIGIHYGHVLLHFKTHSQRLKQWVSMGFGFFIIGIILHFTNAIPIN 366

Query: 374 KQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWM 428
           KQLYS SYVCFTAGAA    GI  S F   +D          W  R P    +W+
Sbjct: 367 KQLYSLSYVCFTAGAA----GIVFSGFYILID---------VWGFRKPFLFLEWI 408


>gi|357152403|ref|XP_003576108.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Brachypodium distachyon]
          Length = 498

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 257/420 (61%), Positives = 315/420 (75%), Gaps = 16/420 (3%)

Query: 12  GRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRG 71
           GR ++ EQ   D    E+G    K ++ +    E++    +++  ++KS RVA LDAFRG
Sbjct: 12  GRAKVSEQHAIDVAIVEHGSGDGKSIDAAGANAEKERLAVVEEPQKKKSTRVAALDAFRG 71

Query: 72  LTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK 131
           LT+V+MILVDDAG +Y R+DHSPWNGCTLADFVMPFFLFIVGVAIA A+K+VP +  AVK
Sbjct: 72  LTIVVMILVDDAGSSYERMDHSPWNGCTLADFVMPFFLFIVGVAIAFAMKRVPNMGAAVK 131

Query: 132 KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTT 191
           K+  RTLK++FWG++LQGGYSHAPD L+YGVDMK IRWCGILQRIALVY  VALIE  TT
Sbjct: 132 KVSVRTLKMIFWGLLLQGGYSHAPDDLAYGVDMKMIRWCGILQRIALVYFAVALIEVFTT 191

Query: 192 KRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV---KKY 248
           K RP  +     +IF AY+WQW+G FI  VIY+ITT+SLYVP+WSF  H+D  +   K++
Sbjct: 192 KVRPTTVRSGPYAIFDAYRWQWLGAFIVLVIYMITTFSLYVPDWSFVYHNDGDINDGKRF 251

Query: 249 IVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWC 308
            V+CG+RGHL PACNAVG++DR++WGINHLYS PVW R + CT SSP +G LR+DAP+WC
Sbjct: 252 TVQCGVRGHLDPACNAVGFIDRQVWGINHLYSQPVWIRTKDCTFSSPETGKLRDDAPAWC 311

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTN 368
             PFEPEGLLS+IS+I+SGTIGIHYGHVLIHFK H  RL HW+SMGF LL++ I+LHFTN
Sbjct: 312 LGPFEPEGLLSSISSIISGTIGIHYGHVLIHFKTHKERLTHWLSMGFALLLLGILLHFTN 371

Query: 369 AIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWM 428
           AIPINKQLYSFSY+CFT GAA    GI  S F   +D          W  R P    +W+
Sbjct: 372 AIPINKQLYSFSYICFTGGAA----GIVLSAFYALID---------VWGLRVPFLFLEWI 418


>gi|242067981|ref|XP_002449267.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor]
 gi|241935110|gb|EES08255.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor]
          Length = 512

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 264/432 (61%), Positives = 309/432 (71%), Gaps = 38/432 (8%)

Query: 20  EQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMIL 79
           EQ         +N   G    +V+ E+      +++ ++KS+RVA LDAFRGLT+VLMIL
Sbjct: 16  EQHQHAIDVGHVNHGDGKGEEDVEKERVA--VAEEVPKKKSRRVAALDAFRGLTIVLMIL 73

Query: 80  VDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK 139
           VDDAGGAY RIDHSPWNGCTLADFVMPFFLFIVGVAIA ALK+VP +  AVK+I  RTLK
Sbjct: 74  VDDAGGAYERIDHSPWNGCTLADFVMPFFLFIVGVAIAFALKRVPNMGNAVKRITIRTLK 133

Query: 140 LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ--------------------RIALV 179
           +LFWG++LQGGYSHAPD LSYGVDMK IRW GILQ                    RIALV
Sbjct: 134 MLFWGVLLQGGYSHAPDDLSYGVDMKKIRWMGILQLYIYHGNNLDSFLFFTLGHQRIALV 193

Query: 180 YVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSE 239
           Y +VALIE  T K RP  +     +IF A++WQW+GGFIAFVIY++TT+SLYVP+WS+  
Sbjct: 194 YFIVALIEAFTVKVRPTTVRSGPYAIFNAHRWQWLGGFIAFVIYMVTTFSLYVPDWSYVY 253

Query: 240 HSDHGV---KKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPN 296
           H+D  V   K++ VKCG+R  L  ACNAVGYVDR++WGINHLY+ PVW R + CT SSPN
Sbjct: 254 HNDGDVNDGKQFTVKCGVRASLEQACNAVGYVDRQVWGINHLYTQPVWIRSKDCTSSSPN 313

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFG 356
            GPLR DAP WC APFEPEGLLS+IS++LSGTIGIHYGHVLIHFK H  RLKHW+  GF 
Sbjct: 314 MGPLRADAPEWCLAPFEPEGLLSSISSVLSGTIGIHYGHVLIHFKTHKERLKHWLVTGFS 373

Query: 357 LLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAW 416
           LL++AIILHFTNAIPINKQLYSFSYVCFT GAA    GI  S F   +D          W
Sbjct: 374 LLVLAIILHFTNAIPINKQLYSFSYVCFTGGAA----GIVLSAFYILID---------VW 420

Query: 417 STRYPSRICKWM 428
             R P    +W+
Sbjct: 421 GLRKPFLFLEWI 432


>gi|224125166|ref|XP_002319516.1| predicted protein [Populus trichocarpa]
 gi|222857892|gb|EEE95439.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 207/332 (62%), Positives = 260/332 (78%), Gaps = 5/332 (1%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 121
           RVA+LD +RGLTV LMILVDDAGG + +I H+PWNGC LADFVMPFFLFIVG+AI LA K
Sbjct: 32  RVASLDIYRGLTVALMILVDDAGGEWPKIGHAPWNGCNLADFVMPFFLFIVGMAIPLAFK 91

Query: 122 KVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYV 181
           ++   + AV+++I RTLKLLFWGI+LQGG+SHAPD L+YGVDMK IRWCGILQRIA  Y+
Sbjct: 92  RITSRHHAVRRVIVRTLKLLFWGIMLQGGFSHAPDKLTYGVDMKKIRWCGILQRIAFAYL 151

Query: 182 VVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS--- 238
           VVAL+E  T K++   L P  LSI+  Y  QW+ G    VIY+   Y  YVP+W F+   
Sbjct: 152 VVALMEIFTKKKQTRELPPGWLSIYKLYSSQWLMGACILVIYLAVIYGTYVPHWQFTVND 211

Query: 239 -EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNS 297
            + +D+G K + V+C +RG L P CNAVG++DRE+ GINH+Y  P W R EACT +SP  
Sbjct: 212 RDSADYG-KVFTVECAVRGKLDPPCNAVGFIDREILGINHMYQHPAWKRSEACTENSPYE 270

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL 357
           GP R  APSWC+APFEPEG+LS+ISA+LS  IG+H+GHVL++ +GH+ARLKHW+ MGF L
Sbjct: 271 GPFRTSAPSWCKAPFEPEGILSSISAVLSTIIGVHFGHVLVYMRGHAARLKHWIVMGFAL 330

Query: 358 LIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           LI+ ++LHFT+AIP+NKQLY+FSYVC T+GAA
Sbjct: 331 LILGLVLHFTHAIPLNKQLYTFSYVCVTSGAA 362


>gi|357510831|ref|XP_003625704.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
 gi|355500719|gb|AES81922.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
          Length = 444

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 214/304 (70%), Positives = 241/304 (79%), Gaps = 16/304 (5%)

Query: 126 INGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVAL 185
           IN  VKKII RTLKLLFWGI+LQGGYSHAPD L YGV+MK IRWCGILQRIALVY +VAL
Sbjct: 76  INYTVKKIILRTLKLLFWGILLQGGYSHAPDELVYGVNMKFIRWCGILQRIALVYCIVAL 135

Query: 186 IETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHG- 244
           IET TTK RP  L P  ++IFTAY+W   GGF+AF+IY+ITT++LYVPNWSF +H ++  
Sbjct: 136 IETFTTKLRPTTLSPGRIAIFTAYKW--FGGFMAFLIYMITTFALYVPNWSFVDHVNNDE 193

Query: 245 VKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDA 304
            K+Y V CGMRGHLGPACNAVGYVDR+ WG+NHLYS PVW RL+ACT SSP+ GP R+DA
Sbjct: 194 PKRYTVICGMRGHLGPACNAVGYVDRQTWGVNHLYSQPVWRRLKACTFSSPSEGPFRDDA 253

Query: 305 PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL 364
           PSWC APFEPEGLLS+ISAILSGTIGIHYGHVLIHFK HS RLK W SMGF LL++AIIL
Sbjct: 254 PSWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKSHSERLKQWFSMGFVLLVVAIIL 313

Query: 365 HFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRI 424
           HFT+AIPINKQLYSFSYVCFTAGAA    GI  SI    +D          W  R P   
Sbjct: 314 HFTDAIPINKQLYSFSYVCFTAGAA----GIIFSILYILID---------VWGIRTPFLF 360

Query: 425 CKWM 428
            +W+
Sbjct: 361 LEWI 364


>gi|359487632|ref|XP_003633626.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
           N-acetyltransferase-like [Vitis vinifera]
          Length = 499

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 211/336 (62%), Positives = 255/336 (75%), Gaps = 5/336 (1%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           QK+KR+A+LD FRGLTV LMILVDDAGG +  I H+PWNGC LADFVMPFFLFIVGVAIA
Sbjct: 59  QKTKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWNGCNLADFVMPFFLFIVGVAIA 118

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
           LALK++P    A+KK+  RTLKLLFWG++LQG ++  PD L+YGVDMK IRWCGILQ IA
Sbjct: 119 LALKRIPDRLMAIKKVTLRTLKLLFWGLLLQGSFTQDPDKLTYGVDMKKIRWCGILQXIA 178

Query: 178 LVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF 237
           L Y+VVAL+E  T K +   L P   SIF  Y W W+ G    ++Y+  +Y  YVP+W F
Sbjct: 179 LAYLVVALLEITTKKAQAKDLSPGQFSIFKLYCWHWLMGACVLIVYMAVSYGTYVPDWHF 238

Query: 238 SEH----SDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
           + H    +D+G K   V CG RG L P CN VGY+DRE+ G+NH+Y  P W+R +AC   
Sbjct: 239 TVHDRDSADYG-KVLTVACGARGKLDPPCNVVGYIDREILGMNHMYQHPAWTRSKACNEY 297

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
           SP+ GP R+DAPSWC APFEPEG+LS+ISAILS  IG+H+GHVL+H KGHS RLKHWV M
Sbjct: 298 SPDKGPFRKDAPSWCYAPFEPEGILSSISAILSTIIGVHFGHVLMHLKGHSDRLKHWVVM 357

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           GF LL++ I LHFT AIP+NKQLY+FSYVC T+GAA
Sbjct: 358 GFALLVLGITLHFTGAIPLNKQLYTFSYVCVTSGAA 393


>gi|242059773|ref|XP_002459032.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor]
 gi|241931007|gb|EES04152.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor]
          Length = 481

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 208/382 (54%), Positives = 264/382 (69%), Gaps = 13/382 (3%)

Query: 12  GRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRG 71
           G +  V + +  G  +E    +  G    E  D +K          ++S+RVA+LD FRG
Sbjct: 3   GSSSKVVELKPHGDVAEEDPARRGGPGTDEADDNEKAP--------RRSRRVASLDVFRG 54

Query: 72  LTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK 131
           LTV LMILVD AGG +  I H+PWNGC LADFVMPFFLFIVG+AI L+LK++P    AV+
Sbjct: 55  LTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAIPLSLKRIPDRGRAVR 114

Query: 132 KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTT 191
           +++ RTLKLLFWGI+LQG YSHAPD L+YGVDMKH+RW GILQRIAL Y+VVA++E +T 
Sbjct: 115 RVVIRTLKLLFWGILLQGRYSHAPDELTYGVDMKHVRWGGILQRIALAYLVVAVLEIVTK 174

Query: 192 KRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS----EHSDHGVKK 247
             +         SIF  Y  QWI      VIY+   Y +YVP+W F     +  ++G K 
Sbjct: 175 DAKIQDQSSSGFSIFRMYLSQWIVACCILVIYLALVYGIYVPDWEFRVRNVDSPNYG-KV 233

Query: 248 YIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
             V CG RG L P CNAVGY+DR++ GINH+Y  P W R  ACT  SP+ G  R DAP+W
Sbjct: 234 LTVTCGTRGILDPPCNAVGYIDRKVLGINHMYQKPAWRRHRACTDDSPHEGHFRNDAPAW 293

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFT 367
           C APFEPEG+LS++SA+LS  IG+HYGHVL+H K H+ RL+ WV+MG  LL++ IILHF+
Sbjct: 294 CVAPFEPEGILSSLSAVLSTIIGVHYGHVLVHMKSHTDRLRQWVTMGICLLVLGIILHFS 353

Query: 368 NAIPINKQLYSFSYVCFTAGAA 389
           +AIP+NKQLY+FSY+C TAGAA
Sbjct: 354 HAIPLNKQLYTFSYICVTAGAA 375


>gi|449454063|ref|XP_004144775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
 gi|449490878|ref|XP_004158735.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 490

 Score =  424 bits (1089), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 218/385 (56%), Positives = 278/385 (72%), Gaps = 13/385 (3%)

Query: 17  VEQEQDDGKDSENGINKEKGLERSEVQDEQKGE-------LQLQQLLQQ--KSKRVATLD 67
           +    D G +S N I++   +   E++ +           +    LL +  KSKR+A+LD
Sbjct: 1   MSPTMDHGNNSPNEISQPL-ISMEEIKPDSTSHHPHRLISVDSDALLPKPVKSKRLASLD 59

Query: 68  AFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKIN 127
            FRGLTV LMILVDDAGG +  I H+PW GC LADFVMPFFLFIVG+AIALALK++P   
Sbjct: 60  IFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQL 119

Query: 128 GAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIE 187
            A++K+  RTLKLLFWG++LQGGYSHAPD L+YGVD++ IR  GILQRIAL Y+VVA +E
Sbjct: 120 MAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVE 179

Query: 188 TLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV-- 245
            L+ K + NV    H SIF +Y W W+ G    V+Y    Y +YVP+W F+      V  
Sbjct: 180 VLSRKTQSNVQPFNHFSIFKSYFWNWLVGACILVVYFALLYGIYVPDWQFTVTDSESVYY 239

Query: 246 -KKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDA 304
            + + V CG+RG+L P CNAVGY+DR++ GINHLY+ P W R EACT +SP +G  R++A
Sbjct: 240 GRNFTVACGVRGNLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNA 299

Query: 305 PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL 364
           PSWC APFEPEG+LS+ISAILS  IG+H+GHVLIHF+ HSARLK WV+MGF LLI+ ++L
Sbjct: 300 PSWCFAPFEPEGILSSISAILSTIIGVHFGHVLIHFQDHSARLKQWVTMGFTLLILGLVL 359

Query: 365 HFTNAIPINKQLYSFSYVCFTAGAA 389
           HFT+AIP+NKQLY+FSYVC T+GAA
Sbjct: 360 HFTHAIPLNKQLYTFSYVCVTSGAA 384


>gi|302754694|ref|XP_002960771.1| hypothetical protein SELMODRAFT_75452 [Selaginella moellendorffii]
 gi|300171710|gb|EFJ38310.1| hypothetical protein SELMODRAFT_75452 [Selaginella moellendorffii]
          Length = 493

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 219/380 (57%), Positives = 270/380 (71%), Gaps = 14/380 (3%)

Query: 20  EQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMIL 79
           E  D K+ +   ++E   +R   ++    E Q   +  +K  R+ATLD FRGLTV LM+L
Sbjct: 12  EGHDTKEHKISFHEE---DRHVKKESLLNEDQAVAVPLKKPVRIATLDVFRGLTVALMVL 68

Query: 80  VDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK 139
           VDDAGG + RI+HSPWNGCTLAD VMPFFLFIVGVAIALALK++P    A +K++ RTLK
Sbjct: 69  VDDAGGEWPRINHSPWNGCTLADLVMPFFLFIVGVAIALALKRIPDQVAATQKVVIRTLK 128

Query: 140 LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLE 199
           LLFWG++LQGG+SHAPD LSYGVDM+ IRWCGILQRIA  Y++VAL+E  TTK R   L 
Sbjct: 129 LLFWGLLLQGGFSHAPDDLSYGVDMRKIRWCGILQRIAFGYLIVALVEIATTKSRSLELP 188

Query: 200 PRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDH----GVKKYI------ 249
                IF  Y+W W       +IY    Y LYVP+W F + S H     + K++      
Sbjct: 189 KGQFGIFKLYKWHWACALAVVIIYHSVAYGLYVPDWHFID-SGHRFVVSLAKFVFSSQIN 247

Query: 250 VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCR 309
           V+CG+RG +GPACNAVG++DR + GINHLY  P W+R ++C L SP  G    +AP+WC+
Sbjct: 248 VQCGVRGDIGPACNAVGHIDRTILGINHLYQSPEWTRTQSCDLDSPAEGDPPANAPAWCK 307

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNA 369
           APFEPEG+LS+ISAILS  IGIHYGHVLIHFKGH  R+ HW      LL++A ILHFT+A
Sbjct: 308 APFEPEGILSSISAILSCIIGIHYGHVLIHFKGHMKRVLHWTIPAAALLVLATILHFTHA 367

Query: 370 IPINKQLYSFSYVCFTAGAA 389
           IP+NKQLYSFSYVCFTAGAA
Sbjct: 368 IPLNKQLYSFSYVCFTAGAA 387


>gi|302804288|ref|XP_002983896.1| hypothetical protein SELMODRAFT_119497 [Selaginella moellendorffii]
 gi|300148248|gb|EFJ14908.1| hypothetical protein SELMODRAFT_119497 [Selaginella moellendorffii]
          Length = 493

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 220/382 (57%), Positives = 273/382 (71%), Gaps = 13/382 (3%)

Query: 19  QEQDDGKDS-ENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLM 77
            E+ +G D+ E+ I+  +     + +    G+ Q   +  +K  R+ATLD FRGLTV LM
Sbjct: 8   HEKLEGHDTKEHKISFHEEDHHVKKESLLNGD-QAVAVPLKKPVRIATLDVFRGLTVALM 66

Query: 78  ILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRT 137
           +LVDDAGG + RI+HSPWNGCTLAD VMPFFLFIVGVAIALALK++P    A +K++ RT
Sbjct: 67  VLVDDAGGEWPRINHSPWNGCTLADLVMPFFLFIVGVAIALALKRIPDQVAATQKVVIRT 126

Query: 138 LKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNV 197
           LKLLFWG++LQGG+SHAPD LSYGVDM+ IRWCGILQRIA  Y++VAL+E  TTK R   
Sbjct: 127 LKLLFWGLLLQGGFSHAPDDLSYGVDMRKIRWCGILQRIAFGYLIVALVEIATTKSRSLE 186

Query: 198 LEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDH----GVKKYI---- 249
           L   H  IF  Y+W W       +IY    Y LYVP+W F + S H     + K++    
Sbjct: 187 LPKGHFGIFKLYKWHWACALAVVIIYHSVAYGLYVPDWHFID-SGHRFVVSLAKFVFSSQ 245

Query: 250 --VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
             V+CG+RG +GPACNAVG++DR + GINHLY  P W+R ++C L SP  G    +AP+W
Sbjct: 246 INVQCGVRGDIGPACNAVGHIDRTILGINHLYQSPEWTRTQSCDLDSPAEGDPPANAPAW 305

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFT 367
           C+APFEPEG+LS+ISAILS  IGIHYGHVLIHFKGH  R+ HW      LL++A ILHFT
Sbjct: 306 CKAPFEPEGILSSISAILSCIIGIHYGHVLIHFKGHMKRVLHWTIPAAALLVLATILHFT 365

Query: 368 NAIPINKQLYSFSYVCFTAGAA 389
           +AIP+NKQLYSFSYVCFTAGAA
Sbjct: 366 HAIPLNKQLYSFSYVCFTAGAA 387


>gi|326493552|dbj|BAJ85237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326511587|dbj|BAJ91938.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 486

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 210/377 (55%), Positives = 264/377 (70%), Gaps = 9/377 (2%)

Query: 21  QDDGKDSENGINKEKGLERS-EVQDE--QKGELQLQQLLQQKSKRVATLDAFRGLTVVLM 77
           Q+  K    G  ++    R+ E  D+    GE +  +     S+RVA+LD FRGLTV LM
Sbjct: 5   QNVAKRHAPGAEEDPDRHRTHEAADDLDDDGEKKASRPSSSSSRRVASLDVFRGLTVALM 64

Query: 78  ILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRT 137
           ILVD AGG +  I H+PW+GC LADFVMPFFLFIVG+AI L+LK++P    AV++++ RT
Sbjct: 65  ILVDGAGGEWPVIGHAPWHGCNLADFVMPFFLFIVGMAIPLSLKRIPDRGWAVRRVVIRT 124

Query: 138 LKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRR-PN 196
           LKLLFWGI+LQGGYSHAPD L+YGVDMKHIRWCGILQRIAL Y+VVA+IE  T   R  +
Sbjct: 125 LKLLFWGILLQGGYSHAPDELTYGVDMKHIRWCGILQRIALAYLVVAVIEIATKDARVQD 184

Query: 197 VLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS----EHSDHGVKKYIVKC 252
                  S+F  Y  QWI      +IY+   Y +YVP+W F+    +  ++G K   V C
Sbjct: 185 QSSSGFFSVFRLYLSQWIVACCILLIYLSLVYGVYVPDWEFTVRNVDSPNYG-KVLTVTC 243

Query: 253 GMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPF 312
           G RG+L P CNAVGY+DR++ GINHLY  P W R   CT  SP+ GP + DAP+WC +PF
Sbjct: 244 GTRGNLSPPCNAVGYIDRKVLGINHLYQKPAWRRHRDCTDDSPHEGPFKRDAPAWCASPF 303

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPI 372
           EPEGLLS+ SA+LS  IG+HYGHVL+H K H  RLK WV+MG  LL++ IILHF++AIP+
Sbjct: 304 EPEGLLSSFSAVLSTIIGVHYGHVLVHMKSHMDRLKQWVTMGVALLLLGIILHFSHAIPL 363

Query: 373 NKQLYSFSYVCFTAGAA 389
           NKQLY+ SY+C TAGAA
Sbjct: 364 NKQLYTLSYICVTAGAA 380


>gi|212723192|ref|NP_001131974.1| uncharacterized protein LOC100193372 [Zea mays]
 gi|194693076|gb|ACF80622.1| unknown [Zea mays]
 gi|413951397|gb|AFW84046.1| hypothetical protein ZEAMMB73_047978 [Zea mays]
          Length = 484

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 202/354 (57%), Positives = 258/354 (72%), Gaps = 11/354 (3%)

Query: 46  QKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVM 105
           +  E    +   + S+RVA+LD FRGLTV LMILVD AGG +  I H+PWNGC LADFVM
Sbjct: 26  RTDEADANEKAPRPSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVM 85

Query: 106 PFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMK 165
           PFFLFIVG+A+ LALK++P    AV++++ RTLKLLFWGI+LQGGYSHAPD L+YGVDM+
Sbjct: 86  PFFLFIVGMAVPLALKRIPDRGRAVRRVVVRTLKLLFWGILLQGGYSHAPDELAYGVDMR 145

Query: 166 HIRWCGILQRIALVYVVVALIETLTT------KRRPNVLEPRHLSIFTAYQWQWIGGFIA 219
           H+RW GILQRIAL Y+VVA++E +T       + +P     R   +F  Y  QWI     
Sbjct: 146 HVRWGGILQRIALAYLVVAVLEMVTKDGAKVHQDQPPGSSGRFSRVFRMYLSQWIVACCI 205

Query: 220 FVIYIITTYSLYVPNWSF----SEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGI 275
            V+Y+   Y +YVP+W F    ++  D+G K   V+CG RG L P CNAVGY+DR + GI
Sbjct: 206 LVVYLSLAYGVYVPDWEFRVRNADSPDYG-KVLTVRCGTRGALDPPCNAVGYIDRRVLGI 264

Query: 276 NHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGH 335
           NH+Y  P W R  ACT  SP+ GP REDAP+WC APFEPEG+LS++SA+LS  +G+HYGH
Sbjct: 265 NHMYQKPAWRRHRACTDDSPHEGPFREDAPAWCVAPFEPEGILSSLSAVLSTVVGVHYGH 324

Query: 336 VLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           VL+H K H+ RL+ WV+MG  LL++ IILHF++AIP+NKQLY+FSY+C TAGAA
Sbjct: 325 VLVHMKSHTDRLRQWVTMGVALLVLGIILHFSHAIPLNKQLYTFSYICVTAGAA 378


>gi|18421151|ref|NP_568500.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis
           thaliana]
 gi|14334592|gb|AAK59474.1| unknown protein [Arabidopsis thaliana]
 gi|26983902|gb|AAN86203.1| unknown protein [Arabidopsis thaliana]
 gi|332006336|gb|AED93719.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis
           thaliana]
          Length = 472

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 209/364 (57%), Positives = 256/364 (70%), Gaps = 8/364 (2%)

Query: 34  EKGLERSEVQ-----DEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYA 88
           E  +ERS  Q      E       ++ L     R+A+LD FRGLTV LMILVDDAGG + 
Sbjct: 3   EIKVERSHDQHLLEPKEDTSSSYTRRSLAGNRPRLASLDIFRGLTVALMILVDDAGGDWP 62

Query: 89  RIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQ 148
            I H+PWNGC LADFVMPFFLFIVGV+IAL+LK++     A KK+ FRT KLLFWG++LQ
Sbjct: 63  MIAHAPWNGCNLADFVMPFFLFIVGVSIALSLKRISNKFEACKKVGFRTCKLLFWGLLLQ 122

Query: 149 GGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTA 208
           GG+SHAPD L+YGVD+  +R+CGILQRIAL Y+VVAL+E  T       L     SIF +
Sbjct: 123 GGFSHAPDELTYGVDVTMMRFCGILQRIALSYLVVALVEIFTKDSHEENLSTGRFSIFKS 182

Query: 209 YQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV---KKYIVKCGMRGHLGPACNAV 265
           Y W WI      VIY+ T Y  YVP+W F  +    V   K   V CG+RG L P CNAV
Sbjct: 183 YYWHWIVAASVLVIYLATLYGTYVPDWEFVVYDKDSVLYGKILSVSCGVRGKLNPPCNAV 242

Query: 266 GYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAIL 325
           GYVDR++ GINH+Y  P W R +ACT  SP  G +R+DAPSWCRAPFEPEG+LS+ISAIL
Sbjct: 243 GYVDRQVLGINHMYHHPAWRRSKACTDDSPYEGAIRQDAPSWCRAPFEPEGILSSISAIL 302

Query: 326 SGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFT 385
           S  IG+H+GH+++H KGHSARLKHW+S G  LL + + LHFT+ +P+NKQLYSFSY+C T
Sbjct: 303 STIIGVHFGHIILHLKGHSARLKHWISTGLVLLALGLTLHFTHLMPLNKQLYSFSYICVT 362

Query: 386 AGAA 389
           +GAA
Sbjct: 363 SGAA 366


>gi|356572978|ref|XP_003554642.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 464

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 217/362 (59%), Positives = 263/362 (72%), Gaps = 10/362 (2%)

Query: 31  INKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARI 90
           I  E  L  SE       EL L      K+KRVA+LD FRGLTV LMILVDDAGG +  I
Sbjct: 4   IKGEHSLNVSE-------ELPLSDKNLPKTKRVASLDIFRGLTVALMILVDDAGGQWPMI 56

Query: 91  DHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGG 150
            H+PWNGC LADFVMPFFLFIVG+AI LALK++P    AVKK+I RTLKLLFWG++LQGG
Sbjct: 57  GHAPWNGCNLADFVMPFFLFIVGMAIPLALKRIPNRLLAVKKVIVRTLKLLFWGLLLQGG 116

Query: 151 YSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQ 210
           +SHAPD L+YGVDMKHIRWCGILQRIAL Y+VVAL+E  +   +    EP HLSIF  Y 
Sbjct: 117 FSHAPDNLTYGVDMKHIRWCGILQRIALAYLVVALVEIFSRSAQARDPEPTHLSIFKLYY 176

Query: 211 WQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV---KKYIVKCGMRGHLGPACNAVGY 267
           W W+ G     +Y+   Y ++VP+W F+ H+   +       V CG+RG L P CNAVGY
Sbjct: 177 WHWLVGACILAVYLALLYGIHVPDWQFTVHNPDSIYNGTTLTVTCGVRGKLDPPCNAVGY 236

Query: 268 VDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSG 327
           +DRE+ GINH+Y  P W R EACT +SP  GP +++APSWC APFEPEG+LS+ISAILS 
Sbjct: 237 IDREVIGINHMYKRPAWRRSEACTENSPYEGPFKKNAPSWCYAPFEPEGILSSISAILST 296

Query: 328 TIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            IG+H+GHVLIH + H +RLKHW+ +G  LL   +ILHFT+AIP+NKQLY+ SYVC T+G
Sbjct: 297 IIGLHFGHVLIHLQDHPSRLKHWLLLGLALLTSGLILHFTHAIPLNKQLYTLSYVCVTSG 356

Query: 388 AA 389
           AA
Sbjct: 357 AA 358


>gi|357126662|ref|XP_003565006.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Brachypodium distachyon]
          Length = 485

 Score =  413 bits (1062), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 208/377 (55%), Positives = 261/377 (69%), Gaps = 14/377 (3%)

Query: 26  DSENGINKEKGLERSEVQDEQK---------GELQLQQLLQQKSKRVATLDAFRGLTVVL 76
           D + G+ K +G    E   +++          +    +   ++S+RVA+LD FRGLTV L
Sbjct: 4   DGQGGVAKRRGAAADEEDPDRRPGAEAAAADKDGDDDEKAARRSRRVASLDVFRGLTVAL 63

Query: 77  MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR 136
           MILVD AGG +  I H+PW+GC LADFVMPFFLFIVG+AI L+LK++P    AV++++ R
Sbjct: 64  MILVDGAGGEWPVIGHAPWDGCNLADFVMPFFLFIVGMAIPLSLKRIPDRGRAVRRVVIR 123

Query: 137 TLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN 196
           TLKLLFWGI+LQGGYSHAPD L+YGVDMKHIRWCGILQRIA  Y+VVA+IE  T      
Sbjct: 124 TLKLLFWGILLQGGYSHAPDELAYGVDMKHIRWCGILQRIAFAYLVVAVIEIATKDANIQ 183

Query: 197 VLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS----EHSDHGVKKYIVKC 252
                  SIF  Y  QWI      +IY+   Y +YVP+W F     +  ++G K   V C
Sbjct: 184 DQSSSGFSIFRMYFSQWIVACCILLIYLSLVYGIYVPDWEFRVRNVDSPNYG-KVLTVTC 242

Query: 253 GMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPF 312
           G RG L P CNAVGY+DR++ GINHLY  P W R  ACT  SP+ GP + DAP+WC +PF
Sbjct: 243 GTRGKLSPPCNAVGYIDRKVLGINHLYQKPAWRRHRACTDDSPHEGPFKSDAPAWCASPF 302

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPI 372
           EPEGLLS+ SA+LS  IG+HYGHVL+H K H  RLK WV+MG  LL++ IILHF++AIP+
Sbjct: 303 EPEGLLSSFSAVLSTIIGVHYGHVLVHMKSHMDRLKQWVTMGIALLLLGIILHFSHAIPL 362

Query: 373 NKQLYSFSYVCFTAGAA 389
           NKQLY+FSY+C TAGAA
Sbjct: 363 NKQLYTFSYICVTAGAA 379


>gi|356504028|ref|XP_003520801.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 465

 Score =  413 bits (1062), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 216/357 (60%), Positives = 264/357 (73%), Gaps = 12/357 (3%)

Query: 45  EQKGE--LQLQQLLQQ-------KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPW 95
           E KGE  L + Q L +       K+KRVA+LD FRGLTV LMILVDDAG  +  I H+PW
Sbjct: 3   EIKGEHSLNVSQELPEVSDKNLPKTKRVASLDIFRGLTVALMILVDDAGEQWPMIGHAPW 62

Query: 96  NGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAP 155
           NGC LADFVMPFFLFIVG+AI LALK++P    AVKK+I RTLKLLFWG++LQGG+SHAP
Sbjct: 63  NGCNLADFVMPFFLFIVGMAIPLALKRIPNRLLAVKKVIVRTLKLLFWGLLLQGGFSHAP 122

Query: 156 DALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIG 215
           D L+YGVDMKHIRWCGILQRIAL Y+VVAL+E  +   +    EP HLSIF  Y W W+ 
Sbjct: 123 DNLTYGVDMKHIRWCGILQRIALAYLVVALVEIFSRSTQARDPEPTHLSIFNLYYWHWLV 182

Query: 216 GFIAFVIYIITTYSLYVPNWSFSEHSDHGV---KKYIVKCGMRGHLGPACNAVGYVDREL 272
           G    V+Y+   Y ++VP+W F+ H+   +       V CG+RG L P CNAVGY+DRE+
Sbjct: 183 GACILVVYLALLYGIHVPDWGFTVHNPDSIYNGTTLTVTCGVRGKLDPPCNAVGYIDREV 242

Query: 273 WGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIH 332
            GINH+Y  P W R EACT +SP  GP +++APSWC APFEPEG+LS+ISAILS  IG+H
Sbjct: 243 LGINHMYKRPAWRRSEACTENSPYEGPFKKNAPSWCYAPFEPEGILSSISAILSTIIGLH 302

Query: 333 YGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +GHVLIH + H +RLKHW+ +G  LL   +ILHFT+AIP+NKQLY+ SYVC T+GAA
Sbjct: 303 FGHVLIHLQDHPSRLKHWLLLGLALLTSGLILHFTHAIPLNKQLYTLSYVCVTSGAA 359


>gi|297812935|ref|XP_002874351.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320188|gb|EFH50610.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 202/346 (58%), Positives = 251/346 (72%), Gaps = 4/346 (1%)

Query: 48  GELQLQQLLQQKSK-RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMP 106
            E+++++ L   ++ R+A+LD FRGLTV LMILVDDAGG +  I H+PWNGC LADFVMP
Sbjct: 2   AEIKVERSLAGNNRQRLASLDIFRGLTVALMILVDDAGGDWPMIAHAPWNGCNLADFVMP 61

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKH 166
           FFLFIVGV+IAL+LK++     A KK+ FRT KLLFWG++LQGG+SHAPD LSYGVD+  
Sbjct: 62  FFLFIVGVSIALSLKRISNKFEACKKVCFRTCKLLFWGLLLQGGFSHAPDELSYGVDVTM 121

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIIT 226
           +R+CGILQRIAL Y+VVALIE  T       L    LSIF +Y   WI G    VIY+ T
Sbjct: 122 MRFCGILQRIALSYLVVALIEIFTKDLHEENLSTGRLSIFKSYYCHWIVGVSVLVIYLAT 181

Query: 227 TYSLYVPNWSFSEHSDHGV---KKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPV 283
            Y  YVP+W F  +    +   K   V CG+RG L P CNAVGYVDR++  INH+Y  P 
Sbjct: 182 LYGTYVPDWEFVVNDKDSILYGKIQSVSCGVRGKLNPPCNAVGYVDRQVLVINHMYHHPA 241

Query: 284 WSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH 343
           W R +A T  SP  G LR+DAPSWC APFEPEG+LS+ISAILS  IG+H+GH++IH +GH
Sbjct: 242 WRRSKAFTDDSPYEGALRQDAPSWCHAPFEPEGILSSISAILSTIIGVHFGHIIIHLQGH 301

Query: 344 SARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            ARLKHW+S G   L + + LHFT+ +P+NKQLYSFSY+C T+GAA
Sbjct: 302 LARLKHWISTGLVFLTLGLTLHFTHLMPLNKQLYSFSYICVTSGAA 347


>gi|413951398|gb|AFW84047.1| hypothetical protein ZEAMMB73_047978 [Zea mays]
          Length = 503

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 202/373 (54%), Positives = 258/373 (69%), Gaps = 30/373 (8%)

Query: 46  QKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVM 105
           +  E    +   + S+RVA+LD FRGLTV LMILVD AGG +  I H+PWNGC LADFVM
Sbjct: 26  RTDEADANEKAPRPSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVM 85

Query: 106 PFFLFIVGVAIALALK-------------------KVPKINGAVKKIIFRTLKLLFWGII 146
           PFFLFIVG+A+ LALK                   ++P    AV++++ RTLKLLFWGI+
Sbjct: 86  PFFLFIVGMAVPLALKVRRRRRSSRPSVVHAMHAHRIPDRGRAVRRVVVRTLKLLFWGIL 145

Query: 147 LQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTT------KRRPNVLEP 200
           LQGGYSHAPD L+YGVDM+H+RW GILQRIAL Y+VVA++E +T       + +P     
Sbjct: 146 LQGGYSHAPDELAYGVDMRHVRWGGILQRIALAYLVVAVLEMVTKDGAKVHQDQPPGSSG 205

Query: 201 RHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF----SEHSDHGVKKYIVKCGMRG 256
           R   +F  Y  QWI      V+Y+   Y +YVP+W F    ++  D+G K   V+CG RG
Sbjct: 206 RFSRVFRMYLSQWIVACCILVVYLSLAYGVYVPDWEFRVRNADSPDYG-KVLTVRCGTRG 264

Query: 257 HLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEG 316
            L P CNAVGY+DR + GINH+Y  P W R  ACT  SP+ GP REDAP+WC APFEPEG
Sbjct: 265 ALDPPCNAVGYIDRRVLGINHMYQKPAWRRHRACTDDSPHEGPFREDAPAWCVAPFEPEG 324

Query: 317 LLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQL 376
           +LS++SA+LS  +G+HYGHVL+H K H+ RL+ WV+MG  LL++ IILHF++AIP+NKQL
Sbjct: 325 ILSSLSAVLSTVVGVHYGHVLVHMKSHTDRLRQWVTMGVALLVLGIILHFSHAIPLNKQL 384

Query: 377 YSFSYVCFTAGAA 389
           Y+FSY+C TAGAA
Sbjct: 385 YTFSYICVTAGAA 397


>gi|222619812|gb|EEE55944.1| hypothetical protein OsJ_04649 [Oryza sativa Japonica Group]
          Length = 846

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 202/338 (59%), Positives = 249/338 (73%), Gaps = 5/338 (1%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
           L   S+RVA+LD FRGLTV LMILVD AGG +  I H+PWNGC LADFVMPFFLFIVG+A
Sbjct: 404 LAGTSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMA 463

Query: 116 IALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
           I L+LK++P    AV++++ RTLKLLFWGI+LQGGYSHAPD LSYGVDMKH+RWCGILQR
Sbjct: 464 IPLSLKRIPDRGRAVRRVVLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHVRWCGILQR 523

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW 235
           IAL Y+VVA++E +T   +         SIF  Y  QWI      VIY+   Y +YVP+W
Sbjct: 524 IALAYLVVAVLEIVTKNAKVQDQSSSGFSIFRMYFSQWIVACCILVIYLSLVYGIYVPDW 583

Query: 236 SFS----EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
            F     ++ + G K   V CG RG L P CNAVGY+DR++ GINH+Y  P W R + CT
Sbjct: 584 DFRVSDVKNPNFG-KILTVTCGTRGKLSPPCNAVGYIDRKVLGINHMYHRPAWRRHKDCT 642

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV 351
             SP+ GP + D+P+WC APFEPEGLLS++SA+LS  IG+HYGHVL+H K H+ RLK W 
Sbjct: 643 DDSPHEGPFKTDSPAWCYAPFEPEGLLSSLSAVLSTIIGVHYGHVLVHMKSHTDRLKQWS 702

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            MG  LLI+ + LHF++AIP+NKQLY+FSY+C TAGAA
Sbjct: 703 IMGITLLILGLTLHFSHAIPLNKQLYTFSYICVTAGAA 740


>gi|296089693|emb|CBI39512.3| unnamed protein product [Vitis vinifera]
          Length = 481

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 201/336 (59%), Positives = 242/336 (72%), Gaps = 23/336 (6%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           QK+KR+A+LD FRGLTV LMILVDDAGG +  I H+PWNGC LADFVMPFFLFIVGVAIA
Sbjct: 59  QKTKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWNGCNLADFVMPFFLFIVGVAIA 118

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
           LALK++P    A+KK+  RTLKLLFWG++LQG ++  PD L+YGVDMK IRWCGILQ  A
Sbjct: 119 LALKRIPDRLMAIKKVTLRTLKLLFWGLLLQGSFTQDPDKLTYGVDMKKIRWCGILQAQA 178

Query: 178 LVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF 237
                               L P   SIF  Y W W+ G    ++Y+  +Y  YVP+W F
Sbjct: 179 ------------------KDLSPGQFSIFKLYCWHWLMGACVLIVYMAVSYGTYVPDWHF 220

Query: 238 SEH----SDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
           + H    +D+G K   V CG RG L P CN VGY+DRE+ G+NH+Y  P W+R +AC   
Sbjct: 221 TVHDRDSADYG-KVLTVACGARGKLDPPCNVVGYIDREILGMNHMYQHPAWTRSKACNEY 279

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
           SP+ GP R+DAPSWC APFEPEG+LS+ISAILS  IG+H+GHVL+H KGHS RLKHWV M
Sbjct: 280 SPDKGPFRKDAPSWCYAPFEPEGILSSISAILSTIIGVHFGHVLMHLKGHSDRLKHWVVM 339

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           GF LL++ I LHFT AIP+NKQLY+FSYVC T+GAA
Sbjct: 340 GFALLVLGITLHFTGAIPLNKQLYTFSYVCVTSGAA 375


>gi|357511851|ref|XP_003626214.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
 gi|355501229|gb|AES82432.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
          Length = 483

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 212/370 (57%), Positives = 261/370 (70%), Gaps = 26/370 (7%)

Query: 40  SEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVV----------------LMILVDDA 83
           SEV+     EL       +K KRVA+LD FRGLTV                 LMILVDDA
Sbjct: 14  SEVEPVSAKELP------KKVKRVASLDIFRGLTVADGDLTVFVAVKYRAKQLMILVDDA 67

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFW 143
           GG +  I H+PWNGC LADFVMPFFLFIVG+AI L+LKK+P    AVKK+I RTLKLLFW
Sbjct: 68  GGEWPAIGHAPWNGCNLADFVMPFFLFIVGMAIPLSLKKIPNKLLAVKKVIVRTLKLLFW 127

Query: 144 GIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP-NVLEPRH 202
           G++LQGGYSHAPD LSYGVDMKHIRWCGILQRIAL Y+VVAL+E ++  R+  +  EP +
Sbjct: 128 GLLLQGGYSHAPDHLSYGVDMKHIRWCGILQRIALAYLVVALVEIISRSRQDRDDPEPTN 187

Query: 203 LSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV---KKYIVKCGMRGHLG 259
           LSIFT Y W W+      V+Y+   Y ++VP+W F+ H+   +     + V CG+RG L 
Sbjct: 188 LSIFTLYYWHWLVAACILVVYMPLLYGIHVPDWQFTVHNPDSIYNGTTFTVTCGVRGKLD 247

Query: 260 PACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLS 319
           P CNAVGY+DRE+ GINH+Y  P   R EACT+  P  GP ++ AP+WC APFEPEG+LS
Sbjct: 248 PPCNAVGYIDREVLGINHVYKKPASRRSEACTVKPPYEGPFKKTAPAWCYAPFEPEGILS 307

Query: 320 TISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSF 379
           +ISAILS  IG+HYGHVLIH + H +RLK W+ +G  LL +  ILHF++ IP+NKQLY+ 
Sbjct: 308 SISAILSTIIGLHYGHVLIHLQDHLSRLKQWILLGLALLTLGFILHFSHVIPLNKQLYTL 367

Query: 380 SYVCFTAGAA 389
           SYVC T+GAA
Sbjct: 368 SYVCVTSGAA 377


>gi|115442029|ref|NP_001045294.1| Os01g0931100 [Oryza sativa Japonica Group]
 gi|57899654|dbj|BAD87323.1| unknown protein [Oryza sativa Japonica Group]
 gi|57900117|dbj|BAD88179.1| unknown protein [Oryza sativa Japonica Group]
 gi|113534825|dbj|BAF07208.1| Os01g0931100 [Oryza sativa Japonica Group]
 gi|215697092|dbj|BAG91086.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 488

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 206/378 (54%), Positives = 262/378 (69%), Gaps = 18/378 (4%)

Query: 29  NGINKEKGLERSEVQDEQKGELQLQQLL-------------QQKSKRVATLDAFRGLTVV 75
            G+ +  G    E  D ++G+   ++               ++ S+RVA+LD FRGLTV 
Sbjct: 6   QGVARRHGAVAEEDPDRRRGDRSGKEGDDDGVDVDEKAPPPRRTSRRVASLDVFRGLTVA 65

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF 135
           LMILVD AGG +  I H+PWNGC LADFVMPFFLFIVG+AI L+LK++P    AV++++ 
Sbjct: 66  LMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAIPLSLKRIPDRGRAVRRVVL 125

Query: 136 RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           RTLKLLFWGI+LQGGYSHAPD LSYGVDMKH+RWCGILQRIAL Y+VVA++E +T   + 
Sbjct: 126 RTLKLLFWGILLQGGYSHAPDDLSYGVDMKHVRWCGILQRIALAYLVVAVLEIVTKNAKV 185

Query: 196 NVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS----EHSDHGVKKYIVK 251
                   SIF  Y  QWI      VIY+   Y +YVP+W F     ++ + G K   V 
Sbjct: 186 QDQSSSGFSIFRMYFSQWIVACCILVIYLSLVYGIYVPDWDFRVSDVKNPNFG-KILTVT 244

Query: 252 CGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAP 311
           CG RG L P CNAVGY+DR++ GINH+Y  P W R + CT  SP+ GP + D+P+WC AP
Sbjct: 245 CGTRGKLSPPCNAVGYIDRKVLGINHMYHRPAWRRHKDCTDDSPHEGPFKTDSPAWCYAP 304

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 371
           FEPEGLLS++SA+LS  IG+HYGHVL+H K H+ RLK W  MG  LLI+ + LHF++AIP
Sbjct: 305 FEPEGLLSSLSAVLSTIIGVHYGHVLVHMKSHTDRLKQWSIMGITLLILGLTLHFSHAIP 364

Query: 372 INKQLYSFSYVCFTAGAA 389
           +NKQLY+FSY+C TAGAA
Sbjct: 365 LNKQLYTFSYICVTAGAA 382


>gi|224123608|ref|XP_002330163.1| predicted protein [Populus trichocarpa]
 gi|222871619|gb|EEF08750.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 199/334 (59%), Positives = 250/334 (74%), Gaps = 5/334 (1%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALA 119
            +RVA+LD FRGLTV LMILVDDAGG + ++ H+PW+G  LADFVMPFFLFIVG+AI L 
Sbjct: 30  ERRVASLDIFRGLTVALMILVDDAGGEWPKMGHAPWHGSNLADFVMPFFLFIVGMAIPLT 89

Query: 120 LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALV 179
            K +   + AVKK+I RTLKLLFWGI+LQGG+SHAPD LSYGVDMK IRWCGILQRIA  
Sbjct: 90  FKGITSRDHAVKKMIVRTLKLLFWGIMLQGGFSHAPDKLSYGVDMKKIRWCGILQRIAFA 149

Query: 180 YVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS- 238
           Y+V+AL+E  T K +   L P  LSIF  Y  QW+ G    V+Y+   Y +YVP+W F+ 
Sbjct: 150 YLVMALMEIFTKKDQTKDLPPGRLSIFRLYGSQWLVGACILVVYLAVIYGMYVPHWQFTV 209

Query: 239 ---EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSP 295
              E SD+G K + V+C +RG L PACNA+ Y+DR++ GINHLY  P W R EACT +S 
Sbjct: 210 NDEESSDYG-KVFTVECAVRGKLDPACNAIAYIDRKILGINHLYQHPAWKRSEACTEASL 268

Query: 296 NSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF 355
              P +  AP+WC+APFEP+G+LS+IS++LS   G H+GHV +H KG +ARLKHW  MG 
Sbjct: 269 YEAPFQTSAPTWCKAPFEPDGILSSISSVLSTITGAHFGHVHVHLKGDTARLKHWTVMGL 328

Query: 356 GLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            LLI+ ++LHFT+A+P+NKQLY+FSYVC T+GAA
Sbjct: 329 ALLILGLVLHFTHAMPLNKQLYTFSYVCVTSGAA 362


>gi|62701854|gb|AAX92927.1| hypothetical protein LOC_Os11g14080 [Oryza sativa Japonica Group]
 gi|77549602|gb|ABA92399.1| D8Ertd354e protein, putative [Oryza sativa Japonica Group]
 gi|125576749|gb|EAZ17971.1| hypothetical protein OsJ_33516 [Oryza sativa Japonica Group]
          Length = 447

 Score =  358 bits (919), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 180/289 (62%), Positives = 214/289 (74%), Gaps = 14/289 (4%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           ++KSKRVA LDAFRGLT+VLMILVDDAGGAY R+DHSPWNGCTLADFVMPFFLFIVGVAI
Sbjct: 55  RKKSKRVAALDAFRGLTIVLMILVDDAGGAYERMDHSPWNGCTLADFVMPFFLFIVGVAI 114

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
           A ALK+VPK+  AVKKI  RTLK+LFWG++LQGGYSHAPD LSYGVDMK IRWCGILQRI
Sbjct: 115 AFALKRVPKLGAAVKKITIRTLKMLFWGLLLQGGYSHAPDDLSYGVDMKKIRWCGILQRI 174

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           ALVY VVALIE  TTK RP  +     +IF AY+WQW+GGF+A  IY++TT+SLYVP+WS
Sbjct: 175 ALVYFVVALIEAFTTKVRPTTVRSGPYAIFHAYRWQWLGGFVALFIYMVTTFSLYVPDWS 234

Query: 237 FSEHSDHGV---KKYI---------VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVW 284
           +  H+D  V   K++          V+CG+RGHL PACNAVGYVDR +WGINHLY+ PVW
Sbjct: 235 YVYHNDGDVNDGKQFTVLLAVFPDHVQCGVRGHLDPACNAVGYVDRVVWGINHLYTQPVW 294

Query: 285 SRLEACTLSS--PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGI 331
            R +   + S   N  PL   + S+   P   +    +     +G  G+
Sbjct: 295 IRSKFNIIDSVRDNWDPLWTCSRSFQAIPINKQLYSLSYVCFTAGAAGV 343



 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/61 (45%), Positives = 32/61 (52%), Gaps = 13/61 (21%)

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKW 427
            AIPINKQLYS SYVCFTAGAA    G+  S F   +D          W  R P    +W
Sbjct: 320 QAIPINKQLYSLSYVCFTAGAA----GVVLSAFYILID---------VWGLRTPFLFLEW 366

Query: 428 M 428
           +
Sbjct: 367 I 367


>gi|359473865|ref|XP_002275105.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Vitis vinifera]
 gi|296085565|emb|CBI29297.3| unnamed protein product [Vitis vinifera]
          Length = 444

 Score =  338 bits (868), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 172/382 (45%), Positives = 241/382 (63%), Gaps = 7/382 (1%)

Query: 11  LGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFR 70
           +G  + V +++D      N          S + +  + +   +       +R+ +LD FR
Sbjct: 1   MGMYETVRRDEDPLVLDANTPENLSADVESSLLNSPRSDGSGRGGGNASKRRLVSLDVFR 60

Query: 71  GLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAV 130
           GLTV +MILVDDAGG    I+HSPWNG TLADFVMPFFLFIVGV++ALA K +     A 
Sbjct: 61  GLTVAIMILVDDAGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALAYKNLSSGYLAT 120

Query: 131 KKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLT 190
           K  + R LKLL +G+ LQGGY H  + L+YGVD++ IR  GILQRIA+ Y + A+ E + 
Sbjct: 121 KMAVVRALKLLVFGLFLQGGYFHGLNNLTYGVDIEQIRLAGILQRIAVAYFLAAVCE-IW 179

Query: 191 TKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS---EHSDHGVKK 247
            K   NV      S+   YQ+QW    +  V Y    Y LYVP+W +S   E S   +K 
Sbjct: 180 LKGDSNVKSGS--SLLKKYQFQWAVVLVLTVAYCSLLYGLYVPDWEYSIPSETSSSALKI 237

Query: 248 YIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
           + VKCG+R   GPACNAVG +DR + GI HLY  P+++R++ C+++SP+ GPL  +AP+W
Sbjct: 238 FKVKCGVRSDTGPACNAVGMIDRNVLGIQHLYKRPIYARMKQCSINSPDYGPLPPNAPTW 297

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFT 367
           C+APF+PEGLLS++ AI++  +G+HYGH+++HFK H  R+ HW+     LL++   L F 
Sbjct: 298 CQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHKDRILHWIVPSSCLLVLGFALDFF 357

Query: 368 NAIPINKQLYSFSYVCFTAGAA 389
             + +NK LY+ SY+C TAGAA
Sbjct: 358 -GMHVNKALYTLSYMCVTAGAA 378


>gi|115485801|ref|NP_001068044.1| Os11g0543500 [Oryza sativa Japonica Group]
 gi|77551354|gb|ABA94151.1| expressed protein [Oryza sativa Japonica Group]
 gi|113645266|dbj|BAF28407.1| Os11g0543500 [Oryza sativa Japonica Group]
 gi|125577433|gb|EAZ18655.1| hypothetical protein OsJ_34172 [Oryza sativa Japonica Group]
 gi|215701389|dbj|BAG92813.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 448

 Score =  332 bits (850), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 162/330 (49%), Positives = 215/330 (65%), Gaps = 2/330 (0%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +R+ +LD FRG+TV LMILVDD GG    I HSPW+G TLADFV PFFLFIVGV++A A 
Sbjct: 54  QRLVSLDVFRGITVALMILVDDVGGIVPAISHSPWDGVTLADFVFPFFLFIVGVSLAFAY 113

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           KKVP    A KK + R +KL   G+ILQGG+ H    L+YGVD++ IR  G+LQRIA+ Y
Sbjct: 114 KKVPDKMLATKKAMLRAVKLFIVGLILQGGFFHGIHELTYGVDIRKIRLMGVLQRIAIAY 173

Query: 181 VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEH 240
           +VVAL E    +            + T Y  Q   G +  V Y++  Y L+VP+W +   
Sbjct: 174 LVVALCEIWLRRVSSGGNIGSGSMLITRYHHQMFVGLVLVVTYLVILYGLHVPDWEYEVT 233

Query: 241 S-DHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGP 299
           S D  VK ++VKCG++G  GP CNAVG +DR + GI HLY+ PV+ + E C+++SP +GP
Sbjct: 234 SPDSTVKHFLVKCGVKGDTGPGCNAVGMIDRSVLGIQHLYAHPVYLKTEQCSMASPRNGP 293

Query: 300 LREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLI 359
           L  +APSWC APF+PEGLLS++ AI++  IG+  GHV++HFK H+ R+K W  +   LL 
Sbjct: 294 LPPNAPSWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKKHNERIKRWSILSLCLLT 353

Query: 360 IAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +   LH    + +NK LYS SY C T G A
Sbjct: 354 LGFSLHLF-GLHMNKSLYSLSYTCVTTGTA 382


>gi|255543288|ref|XP_002512707.1| conserved hypothetical protein [Ricinus communis]
 gi|223548668|gb|EEF50159.1| conserved hypothetical protein [Ricinus communis]
          Length = 426

 Score =  328 bits (840), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 174/367 (47%), Positives = 234/367 (63%), Gaps = 14/367 (3%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 121
           R+ +LD FRGLTV LMILVD AGG    I+HSPWNG TLAD VMPFFLFIVGV++ L  K
Sbjct: 51  RLLSLDVFRGLTVALMILVDYAGGILPAINHSPWNGLTLADLVMPFFLFIVGVSLGLTYK 110

Query: 122 KVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYV 181
           K+P    A +K I RTLKLL  G  LQGGY H  + L+YGV+++ +R  GILQRIA+ Y+
Sbjct: 111 KLPCKAVATRKAILRTLKLLTLGFFLQGGYLHGLNDLTYGVNVEKLRLMGILQRIAIAYL 170

Query: 182 VVALIET-LTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF--- 237
           V AL E  L      +       S+   Y++QW    +    Y+   Y LYVP+W +   
Sbjct: 171 VGALCEIWLKGDDHVDSCS----SLLRKYRFQWAMALVLISTYLSLIYGLYVPDWEYQIP 226

Query: 238 SEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNS 297
           +E S    K ++VKCG+RG+ GPACNAVG +DR   GI HLY  PV++R + C+++SP+ 
Sbjct: 227 AEASSSPAKIFLVKCGVRGNTGPACNAVGLIDRTTLGIQHLYGKPVYARTKLCSINSPDY 286

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL 357
           GPL  DAPSWC+APF+PEG+LS++ A+++  IG+HYGH+++HFK H  R+ HW+     L
Sbjct: 287 GPLPADAPSWCQAPFDPEGILSSVMAVVTCLIGLHYGHIIVHFKDHRNRMLHWMIPSICL 346

Query: 358 LIIAIILHFTNAIPINKQLYSFSYVCFTAGAA----DGCLGITNSIFIPEMDWDECDACF 413
           + + + L F   + +NK LYSFSY+  TAGAA     G   + +  F  E+D D C    
Sbjct: 347 IGLGLALDFL-GMHVNKALYSFSYMSVTAGAAGILFTGIYKLAHD-FGVEVDGDACTDDL 404

Query: 414 RAWSTRY 420
              S +Y
Sbjct: 405 YNCSLQY 411


>gi|242065256|ref|XP_002453917.1| hypothetical protein SORBIDRAFT_04g021400 [Sorghum bicolor]
 gi|241933748|gb|EES06893.1| hypothetical protein SORBIDRAFT_04g021400 [Sorghum bicolor]
          Length = 439

 Score =  327 bits (839), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 166/380 (43%), Positives = 239/380 (62%), Gaps = 8/380 (2%)

Query: 11  LGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFR 70
           +GR +LV    DD      G + E G   S+                 +  R+ +LD FR
Sbjct: 1   MGRYELVRS--DDAAAGATGTDLECGASASKASTTSPAAPSTTSP-AARQPRLVSLDVFR 57

Query: 71  GLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAV 130
           G+TV+LMI+VDDAGG    ++HSPW+G T+ADFVMPFFLFIVGV++ LA K+VP    A 
Sbjct: 58  GITVLLMIIVDDAGGFLPSLNHSPWDGVTIADFVMPFFLFIVGVSLTLAYKRVPDKLEAT 117

Query: 131 KKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLT 190
           KK + R LKL   G++LQGG+ H   +L++GVD+  IR  GILQRIA+ Y++ A+ E + 
Sbjct: 118 KKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAIAYLLAAICE-IW 176

Query: 191 TKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK-YI 249
            K   +V       +   Y++Q   G +  + Y I  Y +YVP+W +        +K + 
Sbjct: 177 LKGDDDV--DSGYGLLRRYRYQLFVGLVLSIAYTILLYGIYVPDWEYKISGPGSTEKSFS 234

Query: 250 VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCR 309
           VKCG+RG  GPACNAVG VDR + GI+HLY  PV++R + C+++ P +GPL  DAPSWC+
Sbjct: 235 VKCGVRGDTGPACNAVGMVDRTILGIDHLYRRPVYARTKECSINYPENGPLPPDAPSWCQ 294

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNA 369
           APF+PEGLLS++ AI++  IG+ +GH++IHF+ H  R+ +W+   F +L +A ++ F+  
Sbjct: 295 APFDPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHRGRITNWLIPSFSMLALAFLMDFSG- 353

Query: 370 IPINKQLYSFSYVCFTAGAA 389
           + +NK LY+ SY   TAGAA
Sbjct: 354 MRMNKPLYTISYTLATAGAA 373


>gi|302759310|ref|XP_002963078.1| hypothetical protein SELMODRAFT_78688 [Selaginella moellendorffii]
 gi|300169939|gb|EFJ36541.1| hypothetical protein SELMODRAFT_78688 [Selaginella moellendorffii]
          Length = 401

 Score =  327 bits (839), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 165/353 (46%), Positives = 223/353 (63%), Gaps = 20/353 (5%)

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF 135
           +MILVD+AGG +  I+HSPWNG TLAD VMPFFLFIVGVA+AL  KK+P    + +K I 
Sbjct: 1   MMILVDNAGGEWPAINHSPWNGVTLADLVMPFFLFIVGVALALVYKKIPSKLDSTRKAIL 60

Query: 136 RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           R+LKL F G+ LQGGY H  + LSYGVD+  IRWCGILQRIA VYV+VAL E    +   
Sbjct: 61  RSLKLFFLGVFLQGGYFHGENDLSYGVDLTLIRWCGILQRIAFVYVIVALCEVWLPR--- 117

Query: 196 NVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMR 255
             ++  +  I   Y + WI   +   +Y+   Y L VP+W F   ++  +    V CG R
Sbjct: 118 --VQGSYFGIMQNYLFHWIFVVVTLTVYLSLLYGLKVPHWQFELPNNRNITM-TVTCGTR 174

Query: 256 GHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPE 315
            +L PACNAVGYVDR++ G+NHL   PV+ R E+C+++SP+ GPL  DAP WC APF+PE
Sbjct: 175 SNLDPACNAVGYVDRQILGVNHLDQQPVFIRTESCSINSPDYGPLPADAPVWCHAPFDPE 234

Query: 316 GLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQ 375
           G+LS++SAI++  IG+HYGH ++  K H  R+ +++     LL +  +LH    I +NK 
Sbjct: 235 GILSSVSAIVTCFIGLHYGHFIVQCKEHKQRIINFIVPAVILLALGYVLHLL-GIKMNKP 293

Query: 376 LYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWM 428
           LYSFSY+CFTAGAA     +   +                +  RYP+ + +WM
Sbjct: 294 LYSFSYMCFTAGAAGAVFCLLYIL-------------VDVYDIRYPTLLLEWM 333


>gi|218185886|gb|EEC68313.1| hypothetical protein OsI_36402 [Oryza sativa Indica Group]
          Length = 450

 Score =  326 bits (836), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 163/330 (49%), Positives = 216/330 (65%), Gaps = 2/330 (0%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +R+ +LD FRG+TV LMILVDD GG    I HSPW+G TLADFV PFFLFIVGV++A A 
Sbjct: 56  QRLVSLDVFRGITVALMILVDDVGGIVPAISHSPWDGVTLADFVFPFFLFIVGVSLAFAY 115

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           KKVP    A KK + R +KL   G+ILQGG+ H    L+YGVD++ IR  G+LQRIA+ Y
Sbjct: 116 KKVPDKMLATKKAMLRAVKLFIVGLILQGGFFHGIHELTYGVDIRKIRLMGVLQRIAIAY 175

Query: 181 VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEH 240
           +VVAL E    +            + T Y  Q   G +  V Y++  Y L+VP+W +   
Sbjct: 176 LVVALCEIWLRRVSSGGDIGSGSMLITRYHHQMFVGLVLVVTYLVILYGLHVPDWEYEVT 235

Query: 241 S-DHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGP 299
           S D  VK ++VKCG++G  GP CNAVG +DR + GI HLY+ PV+ + E C++ SP +GP
Sbjct: 236 SLDSTVKHFLVKCGVKGDTGPGCNAVGMIDRSVLGIQHLYAHPVYLKTEQCSMDSPRNGP 295

Query: 300 LREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLI 359
           L  +APSWC APF+PEGLLS++ AI++  IG+  GHV++HFK H+ R+K W ++   LL 
Sbjct: 296 LPPNAPSWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKKHNERIKRWSTLSLCLLT 355

Query: 360 IAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +   LH    + +NK LYS SY C TAG A
Sbjct: 356 LGFSLHLF-GLHMNKSLYSLSYTCVTAGTA 384


>gi|302796996|ref|XP_002980259.1| hypothetical protein SELMODRAFT_112263 [Selaginella moellendorffii]
 gi|300151875|gb|EFJ18519.1| hypothetical protein SELMODRAFT_112263 [Selaginella moellendorffii]
          Length = 401

 Score =  326 bits (835), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 164/353 (46%), Positives = 222/353 (62%), Gaps = 20/353 (5%)

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF 135
           +MILVD+AGG +  I+HSPWNG TLAD VMPFFLFIVGVA+AL  KK+P    + +K I 
Sbjct: 1   MMILVDNAGGEWPAINHSPWNGVTLADLVMPFFLFIVGVALALVYKKIPSKLDSTRKAIL 60

Query: 136 RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           R+LKL F G+ LQGGY H  + LSYGVD+  IRWCGILQRIA VY+VVAL E    +   
Sbjct: 61  RSLKLFFLGVFLQGGYFHGENDLSYGVDLTLIRWCGILQRIAFVYLVVALCEVWLPR--- 117

Query: 196 NVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMR 255
             ++  +      Y + WI   +   +Y+   Y L VP+W F   ++  +    V CG R
Sbjct: 118 --VQGSYFGFMQNYLFHWIFVVVTLTVYLSLLYGLKVPDWQFELPNNRNITM-TVTCGTR 174

Query: 256 GHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPE 315
            +L P CNAVGYVDR++ G+NHL   PV+ R E+C+++SP+ GPL  DAP WC APF+PE
Sbjct: 175 SNLDPPCNAVGYVDRQILGVNHLDQRPVFIRTESCSINSPDYGPLPADAPVWCHAPFDPE 234

Query: 316 GLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQ 375
           G+LS++SAI++  IG+HYGH ++  K H  R+ +++   F LL +  +LH    I +NK 
Sbjct: 235 GILSSVSAIVTCFIGLHYGHFIVQCKEHKQRIINFIVPAFILLALGYVLHLL-GIKMNKP 293

Query: 376 LYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWM 428
           LYSFSY+CFTAGAA     +   +                +  RYP+ + +WM
Sbjct: 294 LYSFSYMCFTAGAAGAVFCLLYIL-------------VDVYDIRYPTLLLEWM 333


>gi|168035930|ref|XP_001770461.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678169|gb|EDQ64630.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 487

 Score =  325 bits (832), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 180/386 (46%), Positives = 247/386 (63%), Gaps = 25/386 (6%)

Query: 24  GKDSENGINKEKGLERSEVQD-------EQKGELQLQQLLQQKSKRVATLDAFRGLTVVL 76
           G+D E  + +E GL R+EV+        E  G         +KS R+A+LD FRGL++ +
Sbjct: 3   GEDVE--VLEEAGL-RAEVEPVLLQPLVEGGGGSGYAATAAEKSPRLASLDVFRGLSIAV 59

Query: 77  MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKING-AVKKIIF 135
           MILVD+AGG +  I+HSPW G TLADFVMPFFLFIVGVA+AL  K++ +    A +K + 
Sbjct: 60  MILVDNAGGVWPSINHSPWTGITLADFVMPFFLFIVGVALALTYKRITRDKKVASQKALG 119

Query: 136 RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           RT KLL  G+++QGGY H     SYGVD++ IRWCG+LQRIAL Y+VVAL E    +RR 
Sbjct: 120 RTAKLLIVGLVIQGGYFHGLHDTSYGVDLERIRWCGVLQRIALAYMVVALCEIWAPRRRQ 179

Query: 196 NVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF---SEHSDHGVKKYIVK- 251
           +V    + +IF  Y + W         Y+   Y +YVP+W F   +  +   +   +V+ 
Sbjct: 180 DV-SNDNFAIFKTYHFHWAVAAAIVATYLALLYGVYVPDWDFIPPTVLNSTALHVSVVRV 238

Query: 252 --------CGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRED 303
                   CG+RG++GPACNAVGY+DR + G++HLY  PV+ R  AC+++SP+ GPL   
Sbjct: 239 NGSMSEVHCGVRGNIGPACNAVGYLDRTILGVSHLYQRPVFRRTPACSVNSPDYGPLPSG 298

Query: 304 APSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAII 363
           AP WC+APF+PEGLLS++SA+ S  +G+H+GHVL+H K H ARL  W+ M   +L+I  +
Sbjct: 299 APDWCKAPFDPEGLLSSLSAVGSCFLGLHFGHVLVHRKEHIARLWDWMIMSL-VLLIVGL 357

Query: 364 LHFTNAIPINKQLYSFSYVCFTAGAA 389
           L     +P NK LYS SY+ FT GAA
Sbjct: 358 LLHLLGVPFNKPLYSVSYMLFTGGAA 383


>gi|224057870|ref|XP_002299365.1| predicted protein [Populus trichocarpa]
 gi|222846623|gb|EEE84170.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score =  323 bits (829), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 167/331 (50%), Positives = 222/331 (67%), Gaps = 7/331 (2%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 121
           R+ +LD FRGLTV LMILVDDAGG    I+HSPWNG TLAD VMPFFLFIVGV++ L  K
Sbjct: 1   RLVSLDVFRGLTVALMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFIVGVSLGLTYK 60

Query: 122 KVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYV 181
           K+     A +K I RTLKLL  G+ LQGG+ H  + L+YGVDM  IRW GILQRIA+ Y+
Sbjct: 61  KLSCKAVATRKAILRTLKLLIIGLFLQGGFLHGLNDLTYGVDMTQIRWMGILQRIAIGYL 120

Query: 182 VVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS--- 238
           V A+ E +  K   +V     LS+   YQ+QW    +   IY+   Y L+VP+W +    
Sbjct: 121 VGAMCE-IWLKGGNHVTS--GLSMLRKYQFQWAAVLMFVTIYLSLLYGLHVPDWEYQIPV 177

Query: 239 EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSG 298
             S    K + VKCG+RGH GPACNA G +DR + GI HLY  P+++R + C+++SP  G
Sbjct: 178 AASASTPKIFPVKCGVRGHTGPACNAGGMIDRTILGIQHLYRKPIYARTKPCSINSPGYG 237

Query: 299 PLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLL 358
           PL  DAPSWC+APF+PEGLLS++ AI++  +G+HYGH+++HFK H  R  HW+ +     
Sbjct: 238 PLPPDAPSWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRTLHWM-VPSTCF 296

Query: 359 IIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           ++  ++     + +NK LY+FSY+C TAGAA
Sbjct: 297 LVLGLVLDLLGMHVNKALYTFSYMCVTAGAA 327


>gi|326505544|dbj|BAJ95443.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 429

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 157/330 (47%), Positives = 222/330 (67%), Gaps = 5/330 (1%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +R+ +LD FRG+TV+LMI+VDDAG     ++HSPW G T+ADFVMPFFLFIVGVA+ALA 
Sbjct: 38  QRLVSLDVFRGITVLLMIIVDDAGSFLPAMNHSPWEGVTIADFVMPFFLFIVGVALALAY 97

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           K+VP    A +K   R LKL   G++LQGG+ H   +L++GVD+  IR  GILQRIA+ Y
Sbjct: 98  KRVPDKLDATRKATLRALKLFCVGLVLQGGFFHGVRSLTFGVDIAQIRLMGILQRIAIAY 157

Query: 181 VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS-E 239
           +V AL + +  K   +V     L +   Y++Q + G +  + Y+   Y  YVP+W +   
Sbjct: 158 LVTALCQ-IWLKGDDDV--DSGLDLIKRYKYQLLAGLLITITYMALLYGTYVPDWEYRIS 214

Query: 240 HSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGP 299
                 K + V+CG+RG  GP CNAVG +DR++ GI HLY  PV++R + C++ SP +GP
Sbjct: 215 GPGFTEKTFTVRCGVRGDSGPGCNAVGMIDRKILGIQHLYGRPVYARSQQCSIDSPQNGP 274

Query: 300 LREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLI 359
           L  DAPSWC+APF+PEGLLS++ AI++  IG+ YGH+++HF+ H  R+ HW+   FG+L+
Sbjct: 275 LPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHKERIMHWLVPSFGMLV 334

Query: 360 IAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +A  + F   + +NK LY+ SY   TAGAA
Sbjct: 335 LAFAMDFFG-MHMNKPLYTLSYTLCTAGAA 363


>gi|242075654|ref|XP_002447763.1| hypothetical protein SORBIDRAFT_06g015200 [Sorghum bicolor]
 gi|241938946|gb|EES12091.1| hypothetical protein SORBIDRAFT_06g015200 [Sorghum bicolor]
          Length = 446

 Score =  322 bits (824), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 164/386 (42%), Positives = 241/386 (62%), Gaps = 13/386 (3%)

Query: 11  LGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQL------LQQKSKRVA 64
           +G  +LV    DD   +   ++ E G   +   D+     +             + +R+ 
Sbjct: 1   MGGYELVRS--DDAPAAAIAVDLEAGGTTAPCGDDYPKRRRGSSTPSPPAPASTRPQRLV 58

Query: 65  TLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVP 124
           +LD FRG+TV+LMI+VDDAG     ++HSPW+G T+ADFVMPFFLFIVGVA+ALA K+VP
Sbjct: 59  SLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVP 118

Query: 125 KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVA 184
               A KK + R LKL   G++LQGG+ H   +L++GVD++ IR  GILQRIA+ Y++ A
Sbjct: 119 DKLDATKKAVLRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLLTA 178

Query: 185 LIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHG 244
           L E +  K   +V       +   Y++Q + G +  + Y+   Y  YVP+W +       
Sbjct: 179 LCE-IWLKGDEDV--DYGYDLLKRYRYQLLVGAVVAITYMCLLYGTYVPDWEYQTSGPGS 235

Query: 245 VKK-YIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRED 303
           ++K + VKCG+RG   P CNAVG +DR++ GI HLY  PV++R + C++ SP +GPL  D
Sbjct: 236 IEKSFFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQNGPLPPD 295

Query: 304 APSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAII 363
           APSWC+APF+PEGLLS++ AI++  IG+ YGH+++HF+ H  R+ +W+   F +L++A  
Sbjct: 296 APSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHRERIMNWLIPSFSMLVLAFA 355

Query: 364 LHFTNAIPINKQLYSFSYVCFTAGAA 389
           + F   + +NK LYS SY   TAGAA
Sbjct: 356 MDFFG-LHMNKPLYSLSYTLATAGAA 380


>gi|219885579|gb|ACL53164.1| unknown [Zea mays]
 gi|413937084|gb|AFW71635.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
          Length = 482

 Score =  321 bits (822), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 161/359 (44%), Positives = 229/359 (63%), Gaps = 7/359 (1%)

Query: 34  EKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHS 93
           E G   S+V                + +R+A+LD FRG+TV+LMI+VDDAGG    ++HS
Sbjct: 62  ESGSGASKVAGAPPPSPAPSASPAARQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHS 121

Query: 94  PWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSH 153
           PW+G T+ADF+MPFFLFIVGV++ LA K+VP    A +K + R LKL   G++LQGG+ H
Sbjct: 122 PWDGVTVADFIMPFFLFIVGVSLTLAYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFH 181

Query: 154 APDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQW 213
              +L++GVD+  IR  GILQRIA+ Y++ A+ E +  K   +V       +   Y++Q 
Sbjct: 182 GVHSLTFGVDLTKIRLMGILQRIAIAYLLAAVCE-IWLKGDDDV--DSGYGLLRRYRYQL 238

Query: 214 IGGFIAFVIYIITTYSLYVPNWSFS---EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDR 270
             G +  + Y I  Y +YVP+W +      S    K + VKCG+RG  GPACNAVG VDR
Sbjct: 239 FVGLVLSIAYSILLYGMYVPDWEYQIAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDR 298

Query: 271 ELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIG 330
            + GI+HLY  PV++R + C++  P +GPL  DAPSWC+APF+PEGLLS++ AI++  IG
Sbjct: 299 TVLGIDHLYRRPVYARTKECSIDYPENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIG 358

Query: 331 IHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           + +GHV+IHF+ H  R+  W+   F +L +A ++ F   + +NK LY+ SY   TAGAA
Sbjct: 359 LQFGHVIIHFEKHRGRIASWLVPSFSMLALAFVMDFVG-MRMNKPLYTMSYTLATAGAA 416


>gi|413937082|gb|AFW71633.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
          Length = 441

 Score =  320 bits (821), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 161/359 (44%), Positives = 229/359 (63%), Gaps = 7/359 (1%)

Query: 34  EKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHS 93
           E G   S+V                + +R+A+LD FRG+TV+LMI+VDDAGG    ++HS
Sbjct: 21  ESGSGASKVAGAPPPSPAPSASPAARQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHS 80

Query: 94  PWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSH 153
           PW+G T+ADF+MPFFLFIVGV++ LA K+VP    A +K + R LKL   G++LQGG+ H
Sbjct: 81  PWDGVTVADFIMPFFLFIVGVSLTLAYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFH 140

Query: 154 APDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQW 213
              +L++GVD+  IR  GILQRIA+ Y++ A+ E +  K   +V       +   Y++Q 
Sbjct: 141 GVHSLTFGVDLTKIRLMGILQRIAIAYLLAAVCE-IWLKGDDDV--DSGYGLLRRYRYQL 197

Query: 214 IGGFIAFVIYIITTYSLYVPNWSFS---EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDR 270
             G +  + Y I  Y +YVP+W +      S    K + VKCG+RG  GPACNAVG VDR
Sbjct: 198 FVGLVLSIAYSILLYGMYVPDWEYQIAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDR 257

Query: 271 ELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIG 330
            + GI+HLY  PV++R + C++  P +GPL  DAPSWC+APF+PEGLLS++ AI++  IG
Sbjct: 258 TVLGIDHLYRRPVYARTKECSIDYPENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIG 317

Query: 331 IHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           + +GHV+IHF+ H  R+  W+   F +L +A ++ F   + +NK LY+ SY   TAGAA
Sbjct: 318 LQFGHVIIHFEKHRGRIASWLVPSFSMLALAFVMDFVG-MRMNKPLYTMSYTLATAGAA 375


>gi|226509496|ref|NP_001144452.1| uncharacterized protein LOC100277415 [Zea mays]
 gi|195642330|gb|ACG40633.1| hypothetical protein [Zea mays]
          Length = 441

 Score =  320 bits (821), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 161/359 (44%), Positives = 229/359 (63%), Gaps = 7/359 (1%)

Query: 34  EKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHS 93
           E G   S+V                + +R+A+LD FRG+TV+LMI+VDDAGG    ++HS
Sbjct: 21  ESGSGASKVAGAPPPSPAPSASPAARQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHS 80

Query: 94  PWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSH 153
           PW+G T+ADF+MPFFLFIVGV++ LA K+VP    A +K + R LKL   G++LQGG+ H
Sbjct: 81  PWDGVTVADFIMPFFLFIVGVSLTLAYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFH 140

Query: 154 APDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQW 213
              +L++GVD+  IR  GILQRIA+ Y++ A+ E +  K   +V       +   Y++Q 
Sbjct: 141 GVHSLTFGVDLTKIRLMGILQRIAIAYLLAAVCE-IWLKGDDDV--DSGYGLLRRYRYQL 197

Query: 214 IGGFIAFVIYIITTYSLYVPNWSFS---EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDR 270
             G +  + Y I  Y +YVP+W +      S    K + VKCG+RG  GPACNAVG VDR
Sbjct: 198 FVGLVLSIAYSILLYGMYVPDWEYQIAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDR 257

Query: 271 ELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIG 330
            + GI+HLY  PV++R + C++  P +GPL  DAPSWC+APF+PEGLLS++ AI++  IG
Sbjct: 258 TVLGIDHLYRRPVYARTKECSIDYPENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIG 317

Query: 331 IHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           + +GHV+IHF+ H  R+  W+   F +L +A ++ F   + +NK LY+ SY   TAGAA
Sbjct: 318 LQFGHVIIHFEKHRGRITSWLVPSFSMLALAFVMDFVG-MRMNKPLYTMSYTLATAGAA 375


>gi|224033113|gb|ACN35632.1| unknown [Zea mays]
 gi|413918233|gb|AFW58165.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
          Length = 444

 Score =  318 bits (814), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 154/330 (46%), Positives = 222/330 (67%), Gaps = 5/330 (1%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +R+ +LD FRG+TV+LMI+VDDAG     ++HSPW+G T+ADFVMPFFLFIVGVA+ALA 
Sbjct: 53  QRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 112

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           K+VP    A +K + R LKL   G++LQGG+ H   +LS+GVD++ IR  G+LQRIA+ Y
Sbjct: 113 KRVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAY 172

Query: 181 VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEH 240
           ++ AL E      R +        +   Y++Q   G +  + Y+   Y  YVP+W +   
Sbjct: 173 LLTALCEIWI---RGDEDVDYGYDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTS 229

Query: 241 SDHGVKKYI-VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGP 299
           +    +K++ VKCG+RG   P CNAVG +DR++ GI HLY  PV++R + C++ SP +GP
Sbjct: 230 APGSTEKHLFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQNGP 289

Query: 300 LREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLI 359
           L  DAPSWC+APF+PEGLLS++ AI++  IG+ YGHV++HF+ H  R+ +W+   F +L+
Sbjct: 290 LPSDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSMLV 349

Query: 360 IAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +A  + F   + +NK LY+ SY   TAGAA
Sbjct: 350 LAFAMDFFG-LHMNKPLYTLSYTLGTAGAA 378


>gi|226494648|ref|NP_001146383.1| uncharacterized protein LOC100279961 [Zea mays]
 gi|219886923|gb|ACL53836.1| unknown [Zea mays]
 gi|413918231|gb|AFW58163.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
          Length = 469

 Score =  318 bits (814), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 154/330 (46%), Positives = 222/330 (67%), Gaps = 5/330 (1%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +R+ +LD FRG+TV+LMI+VDDAG     ++HSPW+G T+ADFVMPFFLFIVGVA+ALA 
Sbjct: 78  QRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 137

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           K+VP    A +K + R LKL   G++LQGG+ H   +LS+GVD++ IR  G+LQRIA+ Y
Sbjct: 138 KRVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAY 197

Query: 181 VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEH 240
           ++ AL E      R +        +   Y++Q   G +  + Y+   Y  YVP+W +   
Sbjct: 198 LLTALCEIWI---RGDEDVDYGYDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTS 254

Query: 241 SDHGVKKYI-VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGP 299
           +    +K++ VKCG+RG   P CNAVG +DR++ GI HLY  PV++R + C++ SP +GP
Sbjct: 255 APGSTEKHLFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQNGP 314

Query: 300 LREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLI 359
           L  DAPSWC+APF+PEGLLS++ AI++  IG+ YGHV++HF+ H  R+ +W+   F +L+
Sbjct: 315 LPSDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSMLV 374

Query: 360 IAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +A  + F   + +NK LY+ SY   TAGAA
Sbjct: 375 LAFAMDFFG-LHMNKPLYTLSYTLGTAGAA 403


>gi|219886509|gb|ACL53629.1| unknown [Zea mays]
 gi|414587417|tpg|DAA37988.1| TPA: hypothetical protein ZEAMMB73_167983 [Zea mays]
          Length = 438

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 156/330 (47%), Positives = 221/330 (66%), Gaps = 5/330 (1%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +R+ +LD FRG+TV+LMI+VDDAG     ++HSPW+G T+ADFVMPFFLFIVGVA+ALA 
Sbjct: 47  QRLVSLDVFRGITVLLMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 106

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           K+VP    A KK + R LKL   G++LQGG+ H   +L++GVD++ IR  GILQRIA+ Y
Sbjct: 107 KRVPDKLDATKKAVLRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAY 166

Query: 181 VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEH 240
           ++ AL E +  K   +V       +   Y++Q   G I  + Y+   Y  YV +W +   
Sbjct: 167 LLTALCE-IWLKGDEDV--DYGYDLLKRYRYQLFVGAIVGITYMSLLYGTYVRDWEYQTS 223

Query: 241 SDHGVKK-YIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGP 299
               ++K + VKCG+RG   P CNAVG +DR + GI HLY  PV++R + C++ SP +GP
Sbjct: 224 GPGSIEKSFFVKCGVRGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQNGP 283

Query: 300 LREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLI 359
           L  DAPSWC+APF+PEGLLS++ AI++  IG+ YGHV++HF+ H  R+ +W+   F +L+
Sbjct: 284 LPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERIMNWLIPSFSMLV 343

Query: 360 IAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +A  + F   + +NK LY+ SY   TAGAA
Sbjct: 344 LAFAMDFLG-LRMNKPLYTLSYTLATAGAA 372


>gi|116309454|emb|CAH66526.1| H0502B11.6 [Oryza sativa Indica Group]
 gi|218194797|gb|EEC77224.1| hypothetical protein OsI_15768 [Oryza sativa Indica Group]
          Length = 448

 Score =  315 bits (807), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 158/322 (49%), Positives = 217/322 (67%), Gaps = 5/322 (1%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +R+ +LD FRG+TV+LMILVDDAG     I+HSPW+G TLADFVMPFFLFIVGVA+ALA 
Sbjct: 57  QRLVSLDVFRGITVLLMILVDDAGAFLPAINHSPWDGVTLADFVMPFFLFIVGVALALAY 116

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           K+VP    A +K I R LKL   G++LQGG+ H   +L++G+DM+ IR  GILQRIA+ Y
Sbjct: 117 KRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGIDMEKIRLMGILQRIAIAY 176

Query: 181 VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEH 240
           +V AL E +  K   +V       +    ++Q   G I  + Y+   Y  YVP+W +   
Sbjct: 177 IVTALCE-IWLKGDDDV--DSGFDLLKRNRYQLFIGLIVMITYMGFLYGTYVPDWEYRIS 233

Query: 241 SDHGVKK-YIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGP 299
                +K + VKC +RG  GP CNAVG +DR++ GI HLY  PV++R + C+++SP +GP
Sbjct: 234 VPGSTEKSFFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRPVYARSKQCSINSPQNGP 293

Query: 300 LREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLI 359
           LR DAPSWC+APF+PEGLLS++ AI++  IG+ YGHV++HF+ H  R+  W+   F +LI
Sbjct: 294 LRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSMLI 353

Query: 360 IAIILHFTNAIPINKQLYSFSY 381
           +A  L F   + +NK LY+ SY
Sbjct: 354 LAFSLDFF-GMHMNKPLYTVSY 374


>gi|115458212|ref|NP_001052706.1| Os04g0404900 [Oryza sativa Japonica Group]
 gi|113564277|dbj|BAF14620.1| Os04g0404900 [Oryza sativa Japonica Group]
 gi|222628804|gb|EEE60936.1| hypothetical protein OsJ_14685 [Oryza sativa Japonica Group]
          Length = 447

 Score =  315 bits (807), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 158/322 (49%), Positives = 217/322 (67%), Gaps = 5/322 (1%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +R+ +LD FRG+TV+LMILVDDAG     I+HSPW+G TLADFVMPFFLFIVGVA+ALA 
Sbjct: 56  QRLVSLDVFRGITVLLMILVDDAGAFLPAINHSPWDGVTLADFVMPFFLFIVGVALALAY 115

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           K+VP    A +K I R LKL   G++LQGG+ H   +L++G+DM+ IR  GILQRIA+ Y
Sbjct: 116 KRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGIDMEKIRLMGILQRIAIAY 175

Query: 181 VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEH 240
           +V AL E +  K   +V       +    ++Q   G I  + Y+   Y  YVP+W +   
Sbjct: 176 IVTALCE-IWLKGDDDV--DSGFDLLKRNRYQLFIGLIVMITYMGFLYGTYVPDWEYRIS 232

Query: 241 SDHGVKK-YIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGP 299
                +K + VKC +RG  GP CNAVG +DR++ GI HLY  PV++R + C+++SP +GP
Sbjct: 233 VPGSTEKSFFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRPVYARSKQCSINSPQNGP 292

Query: 300 LREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLI 359
           LR DAPSWC+APF+PEGLLS++ AI++  IG+ YGHV++HF+ H  R+  W+   F +LI
Sbjct: 293 LRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSMLI 352

Query: 360 IAIILHFTNAIPINKQLYSFSY 381
           +A  L F   + +NK LY+ SY
Sbjct: 353 LAFSLDFF-GMHMNKPLYTVSY 373


>gi|255548527|ref|XP_002515320.1| conserved hypothetical protein [Ricinus communis]
 gi|223545800|gb|EEF47304.1| conserved hypothetical protein [Ricinus communis]
          Length = 460

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 165/399 (41%), Positives = 235/399 (58%), Gaps = 19/399 (4%)

Query: 1   MADLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKS 60
           M+ L  V E   R  L+    D+ +       KE+ +  S    +++  L         +
Sbjct: 1   MSALITVAEDEQRQSLLHHYNDEDE-------KEEEIAPSSSSSDEREALPPPT----PN 49

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +R+ +LD FRGLT+ LMILVDDAGGA+  I+HSPW G TLADFVMPFFLF VGV+I+L  
Sbjct: 50  QRLMSLDVFRGLTIALMILVDDAGGAFPSINHSPWFGVTLADFVMPFFLFGVGVSISLVF 109

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           KK+   + A KK++ RT+KL   G++LQGGY H  + L+YG+D+  IRW G+LQRI++ Y
Sbjct: 110 KKISSKSVATKKVMLRTIKLFLLGVLLQGGYFHGRNHLTYGIDVLKIRWLGVLQRISIGY 169

Query: 181 VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEH 240
           +  ++ E        + +    L+    Y  QW+   I   +Y    Y L+VPNW F   
Sbjct: 170 LFASISEIWLVN---HCIVDSPLAFMKKYYAQWMVSLILCSLYTCLLYFLFVPNWEFEAS 226

Query: 241 S----DHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPN 296
           S     +G     V CG+RG L P CNAVG +DR L G +HLY  PV+ R + C+++SP+
Sbjct: 227 SINLFGYGSGTQTVICGVRGSLEPPCNAVGLIDRFLLGEHHLYQRPVYRRTKQCSVNSPD 286

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFG 356
            GPL  ++P WC APF+PEG+LS++ A ++  +G+ +GHVL+H K H  R+  W+   F 
Sbjct: 287 YGPLPPNSPPWCLAPFDPEGILSSLMAAVTCLLGLQFGHVLVHLKDHMQRILVWLISSFS 346

Query: 357 LLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           LL+   +L     IP +K LY+ SY C T GA+   L I
Sbjct: 347 LLVTGFVLKLI-GIPFSKPLYTLSYTCITTGASGLLLTI 384


>gi|356527477|ref|XP_003532336.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 463

 Score =  314 bits (804), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 232/355 (65%), Gaps = 6/355 (1%)

Query: 37  LERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWN 96
           L  + + +EQ    + Q   Q KS R+ +LD FRGLTV LMILVDDAGG    ++HSPWN
Sbjct: 47  LHINNIIEEQHIIARHQPQPQPKSPRLVSLDVFRGLTVALMILVDDAGGLIPALNHSPWN 106

Query: 97  GCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPD 156
           G TLAD+VMPFFLFIVGV++AL  KK+     A +K   R LKLL  G+ LQGGY H  +
Sbjct: 107 GLTLADYVMPFFLFIVGVSLALTYKKLSCGVDASRKASLRALKLLVLGLFLQGGYFHRVN 166

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
            L+YGVD+K IRW GILQRI + Y+V AL E            P   S+   Y++QW   
Sbjct: 167 DLTYGVDLKQIRWMGILQRIGVAYLVAALCEIWLKSDDTVNSGP---SLLRKYRYQWAVA 223

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGV--KKYIVKCGMRGHLGPACNAVGYVDRELWG 274
            I   +Y+   Y LYVP+W +   ++     K + VKCG+RG+ GPACNAVG +DR + G
Sbjct: 224 LILSFLYLCLLYGLYVPDWVYQIQTEPSSEPKTFSVKCGVRGNTGPACNAVGMIDRTILG 283

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
           I+HLY  P+++R+  C+++SPN GPL  DAP+WC+APF+PEGLLS++ AI++  IG+HYG
Sbjct: 284 IHHLYQRPIYARMPECSINSPNYGPLPPDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYG 343

Query: 335 HVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           H+++HFK H  R+ +W+     L++  + L     + INK LYS SY C TAGAA
Sbjct: 344 HIIVHFKDHRVRIIYWMIPTSCLVVFGLALDLF-GMHINKVLYSLSYTCVTAGAA 397


>gi|147844298|emb|CAN82113.1| hypothetical protein VITISV_031338 [Vitis vinifera]
          Length = 401

 Score =  313 bits (803), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 164/382 (42%), Positives = 231/382 (60%), Gaps = 21/382 (5%)

Query: 11  LGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFR 70
           +G  + V +++D      N          S + +  + +   +       +R+ +LD FR
Sbjct: 1   MGMYETVRRDEDPLVLDANTPENLSADVESSLLNSPRSDGSGRGGGNASKRRLVSLDVFR 60

Query: 71  GLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAV 130
           GLTV +MILVDDAGG    I+HSPWNG TLADFVMPFFLFIVGV++ALA K +     A 
Sbjct: 61  GLTVAIMILVDDAGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALAYKNLSSGYLAT 120

Query: 131 KKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLT 190
           K              +  GGY H  + L+YGVD++ IR  GILQRIA+ Y + A+ E + 
Sbjct: 121 K--------------MASGGYFHGLNNLTYGVDIEQIRLAGILQRIAVAYFLAAVCE-IW 165

Query: 191 TKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS---EHSDHGVKK 247
            K   NV      S+   YQ+QW    +  V Y    Y LYVP+W +S   E S   +K 
Sbjct: 166 LKGDXNVKSGS--SLLKKYQFQWAVVLVLTVAYCSLLYGLYVPDWEYSIPSETSSSALKI 223

Query: 248 YIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
           + VKCG+R   GPACNAVG +DR + GI HLY  P+++R++ C+++SP+ GPL  +AP+W
Sbjct: 224 FKVKCGVRSDTGPACNAVGMIDRNVLGIQHLYKRPIYARMKQCSINSPDYGPLPPNAPTW 283

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFT 367
           C+APF+PEGLLS++ AI++  +G+HYGH+++HFK H  R+ HW+     LL++   L F 
Sbjct: 284 CQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHKDRILHWIVPSSCLLVLGFALDFF 343

Query: 368 NAIPINKQLYSFSYVCFTAGAA 389
             + +NK LY+ SY+C TAGAA
Sbjct: 344 -GMHVNKALYTLSYMCVTAGAA 364


>gi|356569086|ref|XP_003552737.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 461

 Score =  313 bits (803), Expect = 9e-83,   Method: Compositional matrix adjust.
 Identities = 180/385 (46%), Positives = 243/385 (63%), Gaps = 11/385 (2%)

Query: 7   VEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATL 66
           VE  L  +Q+ E          + I +   L    + +EQ+   + Q   Q KS R+ +L
Sbjct: 20  VEMALPHSQISESRS---ATVSSPIGQTTPLHIHNIIEEQRIISRHQP--QPKSPRLVSL 74

Query: 67  DAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKI 126
           D FRGLTV LMILVDDAGG    ++HSPWNG TLAD+VMPFFLFIVGV++AL+ KK+   
Sbjct: 75  DVFRGLTVALMILVDDAGGLIPALNHSPWNGLTLADYVMPFFLFIVGVSLALSYKKLSCG 134

Query: 127 NGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI 186
             A +K   R LKLL  G+ LQGGY H  + L++GVD+K IRW GILQRIA+ Y+VVAL 
Sbjct: 135 VDASRKASLRALKLLALGLFLQGGYFHRVNDLTFGVDIKQIRWMGILQRIAVAYLVVALC 194

Query: 187 ETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV- 245
           E            P   S+   Y++QW    I   +Y+   Y LYVP+W +   ++    
Sbjct: 195 EIWLKSDDTVNSGP---SLLRKYRYQWAVALILSFLYLCLLYGLYVPDWVYQIQTEPSAE 251

Query: 246 -KKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDA 304
            K + VKCG+RG+ GPACN VG +DR + GI HLY  P+++R+  C+++SPN GPL  DA
Sbjct: 252 PKTFSVKCGVRGNTGPACNVVGMIDRMILGIQHLYKRPIYARMPECSINSPNYGPLPPDA 311

Query: 305 PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL 364
           P+WC+APF+PEGLLS++ AI++  IG+HYGH+++HFK H  R+ +W+     LL+  + L
Sbjct: 312 PAWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIIVHFKDHRVRIIYWMIPTSCLLVFGLAL 371

Query: 365 HFTNAIPINKQLYSFSYVCFTAGAA 389
                + INK LYS SY C TAGAA
Sbjct: 372 DLF-GMHINKVLYSLSYTCVTAGAA 395


>gi|449446789|ref|XP_004141153.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 494

 Score =  313 bits (802), Expect = 9e-83,   Method: Compositional matrix adjust.
 Identities = 169/354 (47%), Positives = 236/354 (66%), Gaps = 13/354 (3%)

Query: 44  DEQKGELQLQQLLQQKSK--RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLA 101
           DE +    ++ +L+   +  R+ +LD FRG+TV LMI+VD AGG    I+HSPW+G TLA
Sbjct: 80  DEPQFSSSVRPILRSSDQCHRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWDGLTLA 139

Query: 102 DFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG 161
           D VMPFFLFIVGV++ALA KK+P    A +K + RTLKLLF G+ LQGG+ H  + L+YG
Sbjct: 140 DLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGVNNLTYG 199

Query: 162 VDMKHIRWCGILQRIALVYVVVALIET-LTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAF 220
           VD++ IRW GILQRIA+ Y + AL E  L      N       ++   YQ Q +   +  
Sbjct: 200 VDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVN----SETALRRKYQLQLVAAVVLT 255

Query: 221 VIYIITTYSLYVPNW-----SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGI 275
           ++Y+  +Y LYVP+W     S +       K + VKCG RG  GPACNAVG +DR+++GI
Sbjct: 256 MLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSVKCGTRGDTGPACNAVGMIDRKIFGI 315

Query: 276 NHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGH 335
            HLY  P+++R E C++++P+ GPL  DAPSWC+APF+PEGLLST+ A+++  +G+HYGH
Sbjct: 316 QHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGH 375

Query: 336 VLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +++HFK H  R+ HW+     L+++AI L F   + INK LY+ SY+  TAGAA
Sbjct: 376 IIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFL-GMHINKVLYTVSYMSVTAGAA 428


>gi|326512130|dbj|BAJ96046.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score =  313 bits (802), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 162/343 (47%), Positives = 213/343 (62%), Gaps = 16/343 (4%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           QQK +RVA+LD FRGLTV +MILVDDAGGA+  I+H+PW G T+ADFVMP FLFI+GV+ 
Sbjct: 48  QQKPQRVASLDVFRGLTVAMMILVDDAGGAWPGINHAPWLGVTVADFVMPAFLFIIGVSA 107

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
           AL  KK P      KK   R +KL   G+ILQGGY H    L+YGVD+  IRW G+LQRI
Sbjct: 108 ALVFKKTPNKIATSKKAACRAIKLFILGVILQGGYIHGRHKLTYGVDLDQIRWLGVLQRI 167

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           A+ Y + A+ E        N      +S    Y  +WI   I   +YI   + LYVPNW 
Sbjct: 168 AIGYFLAAISEIWLVN---NTSVDSPVSFVKKYFMEWIMAIIISALYIGLVFGLYVPNWE 224

Query: 237 F----------SEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
           F          +  +D G K   ++CG+ G LGP CNAVG+VDR L G +HLY +PV+ R
Sbjct: 225 FKVQTSSSTFSNPSNDVGFKT--IQCGLTGSLGPPCNAVGFVDRVLLGESHLYKNPVYKR 282

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
            + C+++SP+ GPL  +AP WC APF+PEGLLST+ A +S  +G+H+GHVLIH K HS R
Sbjct: 283 TKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAAVSCFVGLHFGHVLIHCKTHSQR 342

Query: 347 LKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +  W+     L +   +L     +P +K LY+ SY+  T G +
Sbjct: 343 MMSWLLASTVLTVSGFLLQLL-GMPFSKPLYTVSYMLLTGGVS 384


>gi|212724122|ref|NP_001131867.1| uncharacterized protein LOC100193245 [Zea mays]
 gi|194692766|gb|ACF80467.1| unknown [Zea mays]
 gi|413948803|gb|AFW81452.1| hypothetical protein ZEAMMB73_255914 [Zea mays]
 gi|413948804|gb|AFW81453.1| hypothetical protein ZEAMMB73_255914 [Zea mays]
          Length = 492

 Score =  311 bits (796), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 163/385 (42%), Positives = 228/385 (59%), Gaps = 22/385 (5%)

Query: 13  RTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGL 72
           R  L+ Q ++    +E    +   L+ +  Q E+          Q+K +RVA+LD FRG 
Sbjct: 18  RRPLLAQNEEIHLYTEPPNRQHPPLDAAATQLEE----------QRKPERVASLDVFRGF 67

Query: 73  TVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKK 132
           TV +MILVDDAGGA+  I+H+PW G T+ADFVMP FLFI+GV+ AL  KK+     A KK
Sbjct: 68  TVAMMILVDDAGGAWPGINHAPWFGVTVADFVMPAFLFIIGVSAALVFKKMANKTAATKK 127

Query: 133 IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK 192
              R  KL   G+ILQGGY H    L+YGVD+ HIRW G+LQRIA+ Y V A+ E     
Sbjct: 128 AAIRASKLFILGVILQGGYIHGRHKLTYGVDLDHIRWLGVLQRIAIGYFVAAMSEIWLVN 187

Query: 193 RRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHS--------DHG 244
              N L    +     Y  +W       V+Y+   + LYV NW F   +         + 
Sbjct: 188 ---NNLVDSPVPFVKKYFIEWFMAIAITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNS 244

Query: 245 VKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDA 304
           ++  +++CG+RG LGP CNAVG VDR L G NHLY +PV+ R + C+++SP+ GPL  +A
Sbjct: 245 IETKMIQCGVRGSLGPPCNAVGLVDRVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNA 304

Query: 305 PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL 364
           P WC APF+PEGLLST+ A+++  +G+ +GHVLIH K HS R+  W+ +   +L I+  L
Sbjct: 305 PDWCLAPFDPEGLLSTLMAVVTCFVGLFFGHVLIHCKNHSQRMLIWL-LASVVLTISAYL 363

Query: 365 HFTNAIPINKQLYSFSYVCFTAGAA 389
                +P +K LY+ +Y+  T G +
Sbjct: 364 VLLLGMPFSKPLYTVNYMLLTGGVS 388


>gi|195652797|gb|ACG45866.1| hypothetical protein [Zea mays]
          Length = 492

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 162/385 (42%), Positives = 227/385 (58%), Gaps = 22/385 (5%)

Query: 13  RTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGL 72
           R  L+ Q ++    +E    +   L+ +  Q E+          Q+K +RVA+LD FRG 
Sbjct: 18  RRPLLAQNEEIHLYTEPPNRQHPPLDAAATQLEE----------QRKPERVASLDVFRGF 67

Query: 73  TVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKK 132
           TV + ILVDDAGGA+  I+H+PW G T+ADFVMP FLFI+GV+ AL  KK+     A KK
Sbjct: 68  TVAMXILVDDAGGAWPGINHAPWFGVTVADFVMPAFLFIIGVSAALVFKKMANKTAATKK 127

Query: 133 IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK 192
              R  KL   G+ILQGGY H    L+YGVD+ HIRW G+LQRIA+ Y V A+ E     
Sbjct: 128 AAIRASKLFILGVILQGGYIHGRHKLTYGVDLDHIRWLGVLQRIAIGYFVAAMSEIWLVN 187

Query: 193 RRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHS--------DHG 244
              N L    +     Y  +W       V+Y+   + LYV NW F   +         + 
Sbjct: 188 ---NNLVDSPVPFVKKYFIEWFMAIAITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNS 244

Query: 245 VKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDA 304
           ++  +++CG+RG LGP CNAVG VDR L G NHLY +PV+ R + C+++SP+ GPL  +A
Sbjct: 245 IETKMIQCGVRGSLGPPCNAVGLVDRVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNA 304

Query: 305 PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL 364
           P WC APF+PEGLLST+ A+++  +G+ +GHVLIH K HS R+  W+ +   +L I+  L
Sbjct: 305 PDWCLAPFDPEGLLSTLMAVVTCFVGLFFGHVLIHCKNHSQRMLIWL-LASVVLTISAYL 363

Query: 365 HFTNAIPINKQLYSFSYVCFTAGAA 389
                +P +K LY+ +Y+  T G +
Sbjct: 364 VLLLGMPFSKPLYTVNYMLLTGGVS 388


>gi|413920627|gb|AFW60559.1| hypothetical protein ZEAMMB73_831897 [Zea mays]
          Length = 343

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 160/258 (62%), Positives = 189/258 (73%), Gaps = 16/258 (6%)

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRIALVY  VALIE LT K RP  +     +IF AY+WQW+GG +AFV+Y++TT+SLYVP
Sbjct: 19  QRIALVYFFVALIEALTVKVRPTTVRSGPYAIFDAYRWQWLGGLVAFVVYMVTTFSLYVP 78

Query: 234 NWSFSEHSDHGV---KKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
           +WSF  H++  V   K++ VKCG+R  L  ACNAVGYVDR++WGINHLY+ PVW R + C
Sbjct: 79  DWSFVYHNEGDVNDGKQFTVKCGVRASLEQACNAVGYVDRQVWGINHLYTQPVWIRSKDC 138

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
           T SSPN GPLR DAP+WC APFEPEGLLS+IS++LSGTIGIHYGHVLIHFK H  RLKHW
Sbjct: 139 TSSSPNMGPLRSDAPAWCLAPFEPEGLLSSISSVLSGTIGIHYGHVLIHFKTHKERLKHW 198

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECD 410
           +  GF LL++ IILHFTNAIPINKQLYSFSYVCFT GAA    GI  S F   +D     
Sbjct: 199 LLTGFSLLVLGIILHFTNAIPINKQLYSFSYVCFTGGAA----GIVLSAFYILID----- 249

Query: 411 ACFRAWSTRYPSRICKWM 428
                W  R P    +W+
Sbjct: 250 ----VWGLRTPFLFLEWI 263


>gi|388508176|gb|AFK42154.1| unknown [Lotus japonicus]
          Length = 467

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 161/335 (48%), Positives = 224/335 (66%), Gaps = 6/335 (1%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           Q +S+R+ ++D FRGLTV LMILVDDAGG    ++HSPW+G T+ADFVMP FLFIVG+++
Sbjct: 71  QSQSQRLVSIDVFRGLTVALMILVDDAGGLLPALNHSPWDGLTIADFVMPLFLFIVGLSL 130

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
           AL  KK+     A +K I R LKLL  G+ LQGGY H  + L++GVDMK IR  GILQRI
Sbjct: 131 ALTYKKLSCPVIATRKAILRALKLLALGLFLQGGYFHRINDLTFGVDMKQIRLMGILQRI 190

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           A+ Y++ AL E      + + +     S+   Y++QW   F+    Y+   Y LYVP+W 
Sbjct: 191 AIAYLLTALCEIWL---KCDDIVKSGSSLLRKYRYQWAVAFVLSGFYLCLLYGLYVPDWE 247

Query: 237 FSEHSDHGV--KKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
           +   +D     K + VKCG+    GPACN VG +DR++ GI HLY  P+++R+  C+++S
Sbjct: 248 YQIPTDSSSVPKTFSVKCGVWADTGPACNVVGMIDRKILGIQHLYRRPIYARMPECSINS 307

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
           P+ GPL  DAP+WC+APF+PEGLLS++ AI++  IG+HYGH+++H+K H  R+ HW+   
Sbjct: 308 PDYGPLPPDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIIVHYKDHRVRIIHWMIPT 367

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
             L++    LH    + +NK LYSFSY C TAGAA
Sbjct: 368 SCLIVFGFALHLF-GMHVNKVLYSFSYTCVTAGAA 401


>gi|357134575|ref|XP_003568892.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Brachypodium distachyon]
          Length = 495

 Score =  308 bits (788), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 157/343 (45%), Positives = 212/343 (61%), Gaps = 16/343 (4%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           ++K  RVA+LD FRGLTV +MILVDDAGGA+  I+H+PW G T+ADFVMP FLFI+GV+ 
Sbjct: 45  ERKPHRVASLDVFRGLTVAMMILVDDAGGAWPGINHAPWLGVTVADFVMPAFLFIIGVSA 104

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
           AL  K+        KK  +R  KL   G+ILQGGY H    L+YGVD+ HIRW G+LQRI
Sbjct: 105 ALVFKRTQNKIATSKKAAYRAFKLFILGVILQGGYIHGRHNLTYGVDLDHIRWLGVLQRI 164

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           A+ Y + A+ E        N+     +S    Y  +W+   +   +YI   + LYVPNW 
Sbjct: 165 AIGYFLAAMSEIWLVN---NISVDSPVSFVKKYFMEWVMAIMISALYISLIFGLYVPNWE 221

Query: 237 FSEHSDH----------GVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
           F   + +          G K   V+CG+RG LGP CNAVG+VDR L G NHLY +PV+ R
Sbjct: 222 FKVQTSNLTFSNGSNEIGFKT--VQCGLRGSLGPPCNAVGFVDRVLLGENHLYKNPVYKR 279

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
            + C+++SP+ G L  +AP WC APF+PEGLLST+ A +S  +G+H+GHVLIH + HS R
Sbjct: 280 TKECSVNSPDYGALPPNAPDWCLAPFDPEGLLSTLMAAVSCFVGLHFGHVLIHCQNHSQR 339

Query: 347 LKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +  W+     L     +L  +  +P +K LY+ SY+  T G +
Sbjct: 340 MLSWLLASTVLTASGFLLQLS-GMPFSKPLYTVSYMLLTGGVS 381


>gi|186530230|ref|NP_199601.2| uncharacterized protein [Arabidopsis thaliana]
 gi|332008203|gb|AED95586.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 440

 Score =  305 bits (780), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 172/383 (44%), Positives = 238/383 (62%), Gaps = 16/383 (4%)

Query: 13  RTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGL 72
           RT+L   E     D  +   +EK    S +Q  +   L          +R+ +LD FRGL
Sbjct: 2   RTKLTMYEAIKDNDDNDHQWREKKDIESALQISRSSSLP------PDKERLVSLDVFRGL 55

Query: 73  TVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKK 132
           TV  MILVDD GG    I+HSPW+G TLADFVMPFFLFIVGV++A A K +     A +K
Sbjct: 56  TVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATRK 115

Query: 133 IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK 192
            + R+LKLL  G+ LQGG+ H  + L+YG+D++ IR  GILQRIA+ Y+VVAL E +  K
Sbjct: 116 ALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCE-IWLK 174

Query: 193 RRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS---EHSDHGVKKYI 249
              NV     LS+   Y++ W+  F+   IY+   Y LYVP+W +    E     +  ++
Sbjct: 175 GNHNV--SSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGSTLTTFL 232

Query: 250 ---VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPS 306
              VKCG+RGH GP CNAVG +DR   GI HLY  PV++R + C+++ PN+GPL  DAPS
Sbjct: 233 NLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPS 292

Query: 307 WCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHF 366
           WC+APF+PEGLLS++ A ++  +G+HYGH++IHFK H  RL  W+   F  L++  +   
Sbjct: 293 WCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWILRSF-CLLMLGLALN 351

Query: 367 TNAIPINKQLYSFSYVCFTAGAA 389
              + +NK LY+ SY+C T+GA+
Sbjct: 352 LFGMHLNKPLYTLSYMCVTSGAS 374


>gi|357149263|ref|XP_003575052.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Brachypodium distachyon]
          Length = 432

 Score =  302 bits (773), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 155/356 (43%), Positives = 228/356 (64%), Gaps = 9/356 (2%)

Query: 31  INKEKGLERSEVQDEQKGELQLQQLLQQKS----KRVATLDAFRGLTVVLMILVDDAGGA 86
           +  + G   +   D + G  +   L    S    +R+ +LD FRG+TV+LMI+VDDAGG 
Sbjct: 7   VRSDDGAAAATTPDLESGASKASPLPTPVSPAARQRLVSLDVFRGITVLLMIIVDDAGGF 66

Query: 87  YARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGII 146
              ++HSPW+G T+ DFVMPFFLFIVGV++ LA K+VP+   A KK + R LKL   G++
Sbjct: 67  LPALNHSPWDGVTIGDFVMPFFLFIVGVSLTLAYKRVPERLEATKKAVLRALKLFCLGLV 126

Query: 147 LQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIF 206
           LQGG+ H   +L++GVD+  IR  GILQRIA+ Y++ A+ E      + N    R L + 
Sbjct: 127 LQGGFFHGVRSLTFGVDITEIRLMGILQRIAIAYLIAAICEIWL---KGNDEVDRGLDLL 183

Query: 207 TAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK-YIVKCGMRGHLGPACNAV 265
             Y++Q   G +  V+Y +  Y +YVP+W +        +K  +VKCG+RG  GP CNAV
Sbjct: 184 RRYRYQLFVGLLLSVMYTVLLYGIYVPDWEYQITGPGSTEKSLLVKCGVRGDTGPGCNAV 243

Query: 266 GYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAIL 325
           G VDR + GI+HLY  PV++R + C++  P +GPL  DAPSWC+APF+PEGLLS++ AI+
Sbjct: 244 GMVDRTMLGIDHLYRRPVYARTKECSIDYPENGPLPPDAPSWCQAPFDPEGLLSSVMAIV 303

Query: 326 SGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSY 381
           +  +G+ +GHV+IHF+ H  R+ +W+   F +L +A ++ F   + +NK LY+ SY
Sbjct: 304 TCLMGLQFGHVIIHFEKHKERIINWLIPSFSMLALAFLMDFI-GMRMNKPLYTISY 358


>gi|168007055|ref|XP_001756224.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692734|gb|EDQ79090.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 411

 Score =  301 bits (770), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 163/321 (50%), Positives = 211/321 (65%), Gaps = 19/321 (5%)

Query: 77  MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKING---AVKKI 133
           MILVD AGG +  I+HSPW+G TLADFV+PFFLFIVGVA+AL  KK+  IN    A +K 
Sbjct: 1   MILVDYAGGIWPAINHSPWDGVTLADFVLPFFLFIVGVALALTYKKI--INEKQLASQKA 58

Query: 134 IFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
           I R+LKL+  G+ +QGGY H     SYGVD++ IRWCG+LQRIAL Y+VVAL E    + 
Sbjct: 59  IGRSLKLVIVGLFIQGGYFHGVHNTSYGVDLESIRWCGVLQRIALAYMVVALCEIWAPRG 118

Query: 194 RPNVLE-----PRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKY 248
             + +       R    F A             IY++  Y +YVP+W F   +D  V  +
Sbjct: 119 HYDSMNVYIKSTRRFGTFRAVA------AAIVAIYLVLLYGVYVPDWEFVSAADSTV--F 170

Query: 249 IVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWC 308
            VKCG+RG +GP+CN VGY+DR L G++HLY   V+ R  AC++ SP+ GPL   AP WC
Sbjct: 171 QVKCGVRGDVGPSCNVVGYLDRTLLGLSHLYQKAVYRRAPACSVLSPDYGPLPAGAPVWC 230

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTN 368
           +APF+PEGLLS++SAI+S  +G+H+GHVL+H K H+ARLK WV M   LL+   +LH   
Sbjct: 231 KAPFDPEGLLSSMSAIVSCFLGLHFGHVLVHHKEHNARLKDWVLMSLTLLVTGALLHVLG 290

Query: 369 AIPINKQLYSFSYVCFTAGAA 389
            +P NK LYS SY+ FT GAA
Sbjct: 291 -MPWNKPLYSVSYMLFTGGAA 310


>gi|115462187|ref|NP_001054693.1| Os05g0155700 [Oryza sativa Japonica Group]
 gi|54291854|gb|AAV32222.1| unknown protein [Oryza sativa Japonica Group]
 gi|113578244|dbj|BAF16607.1| Os05g0155700 [Oryza sativa Japonica Group]
 gi|215694847|dbj|BAG90038.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218196128|gb|EEC78555.1| hypothetical protein OsI_18526 [Oryza sativa Indica Group]
 gi|222630256|gb|EEE62388.1| hypothetical protein OsJ_17178 [Oryza sativa Japonica Group]
          Length = 491

 Score =  300 bits (769), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 155/343 (45%), Positives = 214/343 (62%), Gaps = 14/343 (4%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           ++K +RVA+LD FRGLTV +MILVDDAGGA+  ++HSPW G T+ADFVMP FLFI+GV+ 
Sbjct: 49  ERKPRRVASLDVFRGLTVAMMILVDDAGGAWPGMNHSPWLGVTVADFVMPAFLFIIGVSA 108

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
           AL  KK P    A KK   R +KL   G+ILQGGY H    L+YG+D+ HIRW G+LQRI
Sbjct: 109 ALVFKKTPNKTVATKKAAIRAIKLFILGVILQGGYIHGRHNLTYGIDLDHIRWLGVLQRI 168

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           A+ Y + A+ E        N+     +S    Y  +WI   +   +Y+     LYV NW 
Sbjct: 169 AIGYFLAAISEIWLVN---NISVDSAISFVKKYFMEWIVAVMISALYVGLLLGLYVSNWE 225

Query: 237 FSEHSDHGVKKY----------IVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
           F   + + +             +++CG+RG LGP CNAVG+VDR L G NHLY +PV+ R
Sbjct: 226 FKVQTSNSILTIPTPGNEIGMKMIQCGVRGSLGPPCNAVGFVDRVLLGENHLYKNPVYKR 285

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
            + C+++SP+ GPL  +AP WC APF+PEGLLST+ A ++  +G+H+GHVL+H K HS R
Sbjct: 286 TKECSVNSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAAVTCFVGLHFGHVLVHCKDHSPR 345

Query: 347 LKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +  W+     L +   +L     +P +K LY+ SY+  T G +
Sbjct: 346 MLLWLLASTVLTVSGFLLQLL-GMPFSKPLYTVSYMLLTGGVS 387


>gi|224072443|ref|XP_002303734.1| predicted protein [Populus trichocarpa]
 gi|222841166|gb|EEE78713.1| predicted protein [Populus trichocarpa]
          Length = 381

 Score =  300 bits (769), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 156/317 (49%), Positives = 214/317 (67%), Gaps = 7/317 (2%)

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF 135
           LMILVDDAGG    I+HSPWNG TLAD VMPFFLF+VGV++ L  KK+P    A +K I 
Sbjct: 3   LMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFMVGVSLGLTYKKLPSKAVATRKAIL 62

Query: 136 RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           R LKLL  G+ LQGG+ H  + L++GVDM  IRW GILQRIA+ Y++ A+ E +  K   
Sbjct: 63  RALKLLVIGLFLQGGFLHGLNDLTFGVDMVQIRWMGILQRIAIGYLIGAMCE-IWLKGDN 121

Query: 196 NVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS---EHSDHGVKKYIVKC 252
           +V     LS+   YQ QW    +   +Y+   Y LYVP+W +      S    K + VKC
Sbjct: 122 HVAS--GLSMLRKYQLQWGAVVVLVSLYLSLLYGLYVPDWEYEIPVAASSSSPKIFRVKC 179

Query: 253 GMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPF 312
           G+RG  G ACNAVG +DR + GI HLY  P+++R +AC+++SP+ GPL  DAPSWC+APF
Sbjct: 180 GVRGTTGSACNAVGMIDRTVLGIQHLYRKPIYARTKACSINSPDYGPLPPDAPSWCQAPF 239

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPI 372
           +PEGLLS++ AI++  +G+HYGH+++HFK H  R+ HW+      +++ ++L  +  + +
Sbjct: 240 DPEGLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRILHWMVPSTCFVVLGLVLDLS-GMHV 298

Query: 373 NKQLYSFSYVCFTAGAA 389
           NK LY+FSY+C TAGAA
Sbjct: 299 NKALYTFSYMCVTAGAA 315


>gi|297791891|ref|XP_002863830.1| hypothetical protein ARALYDRAFT_494835 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309665|gb|EFH40089.1| hypothetical protein ARALYDRAFT_494835 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 432

 Score =  300 bits (768), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 164/340 (48%), Positives = 226/340 (66%), Gaps = 10/340 (2%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
           L    +R+ +LD FRGLTV LMILVDD G     I+HSPW+G TLADFVMPFFLFIVGV+
Sbjct: 31  LPPDKERLVSLDVFRGLTVALMILVDDVGEILPSINHSPWDGVTLADFVMPFFLFIVGVS 90

Query: 116 IALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
           +A A K +     A +K + R+LKLL  G+ LQGG+ H  + L+YG+D++ IR+ GILQR
Sbjct: 91  LAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRFMGILQR 150

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW 235
           IA+ Y+V AL E +  K   NV     LS+   Y++ W+  F+   IY+   Y LYV +W
Sbjct: 151 IAIAYLVAALCE-IWLKGNHNV--SSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVSDW 207

Query: 236 SF---SEHSDHGVKKYI---VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
            +   +E     +  ++   VKCG+RGH GP CNAVG +DR   GI HLY  PV++R + 
Sbjct: 208 EYQISTEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQ 267

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
           C++SSPN+GPL  DAPSWC+APF+PEGLLS++ AI++  +G+HYGH++IHFK H  RL  
Sbjct: 268 CSISSPNNGPLPPDAPSWCQAPFDPEGLLSSLMAIVTCLVGLHYGHIIIHFKDHKKRLNQ 327

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           W+   F  L++  +      + +NK LY+ SY+C T+GA+
Sbjct: 328 WILRSF-CLLMLGLALNLFGMHLNKPLYTLSYMCVTSGAS 366


>gi|125533951|gb|EAY80499.1| hypothetical protein OsI_35679 [Oryza sativa Indica Group]
          Length = 444

 Score =  299 bits (766), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 161/290 (55%), Positives = 200/290 (68%), Gaps = 19/290 (6%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           ++KSKRVA LDAFRGLT+VLMILVDDAGGAY R+DHSPWNGCTLADFVMPFFLFIVGVAI
Sbjct: 55  RKKSKRVAALDAFRGLTIVLMILVDDAGGAYERMDHSPWNGCTLADFVMPFFLFIVGVAI 114

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
           A ALK+VPK+  AVKKI  RTLK+LFWG++LQGGYSHAPD LSYGVDMK IRWCGILQ +
Sbjct: 115 AFALKRVPKLGAAVKKITIRTLKMLFWGLLLQGGYSHAPDDLSYGVDMKKIRWCGILQNL 174

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQ-WIGGFIAFVIYIITTYSLYVPNW 235
            +++      ++    R  +     H S  +AY     +GGF+A  IY++TT+SLYVP+W
Sbjct: 175 LVLFDNAE--DSFGVLRGCSDRGIHHKS--SAYDSAVRLGGFVALFIYMVTTFSLYVPDW 230

Query: 236 SFSEHSDHGV---KKYIV---------KCGMRGHLGPACNAVGYVDRELWGINHLYSDPV 283
           S+  H+D  V   K++ V         +CG+RGHL PACNAVGYVDR +WGINHLY+ PV
Sbjct: 231 SYIYHNDGDVNDGKQFTVLLAVFPDHVQCGVRGHLDPACNAVGYVDRVVWGINHLYTQPV 290

Query: 284 WSRLEACTLSS--PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGI 331
           W R +   + S   N  PL   + S+   P   +    +     +G  G+
Sbjct: 291 WIRSKFNIVDSVRDNWDPLWTRSRSFQAIPINKQLYSLSYVCFTAGAAGV 340



 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/61 (45%), Positives = 32/61 (52%), Gaps = 13/61 (21%)

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKW 427
            AIPINKQLYS SYVCFTAGAA    G+  S F   +D          W  R P    +W
Sbjct: 317 QAIPINKQLYSLSYVCFTAGAA----GVVLSAFYILID---------VWGLRTPFLFLEW 363

Query: 428 M 428
           +
Sbjct: 364 I 364


>gi|224131042|ref|XP_002320987.1| predicted protein [Populus trichocarpa]
 gi|222861760|gb|EEE99302.1| predicted protein [Populus trichocarpa]
          Length = 481

 Score =  298 bits (764), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 165/373 (44%), Positives = 229/373 (61%), Gaps = 21/373 (5%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALA 119
           ++R+ +LD FRGLTV LMILVDDAGGA+  I+HSPW G TLADFVMPFFLF+VGV+I+L 
Sbjct: 48  TQRLLSLDVFRGLTVALMILVDDAGGAFPCINHSPWFGVTLADFVMPFFLFVVGVSISLV 107

Query: 120 LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALV 179
            KKV     A KK+I RT+KL   G++LQGGY H    L+YGVD+  IRW G+LQRI++ 
Sbjct: 108 FKKVSSKPMATKKVIQRTIKLFLLGLLLQGGYFHGRHNLTYGVDVGKIRWMGVLQRISIG 167

Query: 180 YVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSE 239
           Y+  A+ E        ++     ++    Y  QW+  F+    Y+   Y LYVP+W F  
Sbjct: 168 YLFAAMSEIWLVD---SITVDSPMAFVKKYYIQWMVAFLFCTFYMCLLYGLYVPDWEFEV 224

Query: 240 HS----DHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSP 295
            S    +H     IV CG+RG L P CNAVG +DR   G +HLY  PV+ R + C+++SP
Sbjct: 225 PSTNLFEHEFGTKIVNCGVRGSLEPPCNAVGLIDRFFLGEHHLYQHPVYRRTKHCSVNSP 284

Query: 296 NSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF 355
           + GPL  ++P WC APF+PEG+LS++ A ++  +G+ +GH+L+HFKGH  RL  W    F
Sbjct: 285 DYGPLPPNSPGWCLAPFDPEGILSSLMAAITCFLGLQFGHILVHFKGHMQRLCLWSVCSF 344

Query: 356 GLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRA 415
            +LI   +      +P+ K LY+ SY+C TAGA+   L +T   +I ++           
Sbjct: 345 IILITGYVFELL-GVPLCKPLYTLSYMCITAGASG--LALTIIFYIVDVK---------- 391

Query: 416 WSTRYPSRICKWM 428
              R P+ I +WM
Sbjct: 392 -HFRKPTMILQWM 403


>gi|242071239|ref|XP_002450896.1| hypothetical protein SORBIDRAFT_05g020800 [Sorghum bicolor]
 gi|241936739|gb|EES09884.1| hypothetical protein SORBIDRAFT_05g020800 [Sorghum bicolor]
          Length = 455

 Score =  298 bits (762), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 163/332 (49%), Positives = 214/332 (64%), Gaps = 3/332 (0%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           + +R+A+LD FRG+TVVLMILVDD GG    I HSPW+G TLADFV PFFLFIVGV++A 
Sbjct: 60  RGQRLASLDVFRGITVVLMILVDDVGGLVPAISHSPWDGVTLADFVFPFFLFIVGVSLAF 119

Query: 119 ALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIAL 178
           A K+VP    A KK + R  KL   G++LQGGY H    LSYGVD+  IR  GILQRIA+
Sbjct: 120 AYKRVPNKTLATKKALIRASKLFLLGLLLQGGYFHTIHDLSYGVDLHKIRLMGILQRIAI 179

Query: 179 VYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS 238
            Y  VAL E        +     ++ I   Y+ Q   G +  V Y +  Y +YVP+W + 
Sbjct: 180 AYFAVALCEIWLRGGASDNGAGGYVLI-RRYRHQLFVGLVLTVTYTVLLYGMYVPDWEYV 238

Query: 239 EHS-DHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNS 297
             S D  +K ++VKCG+RG  GP CNAVG +DR + GI HLY+ PV+ +   C+++SP +
Sbjct: 239 VTSPDTTLKNFMVKCGVRGDTGPGCNAVGMIDRCVLGIQHLYAHPVYLKTAQCSINSPRN 298

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL 357
           GPL  DAP+WC APF+PEGLLS++ AI++  IG+  GHV++HFK HS R+  W      L
Sbjct: 299 GPLPSDAPTWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKQHSKRIVRWSIPSLIL 358

Query: 358 LIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           LI+ + L     + +NK LYS SY C T G+A
Sbjct: 359 LILGVSLDLF-GMHMNKSLYSLSYTCVTTGSA 389


>gi|38346153|emb|CAE02025.2| OSJNBb0118P14.13 [Oryza sativa Japonica Group]
          Length = 415

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 157/336 (46%), Positives = 216/336 (64%), Gaps = 15/336 (4%)

Query: 57  QQKSKRVATLDAFRG--LTVV--------LMILVDDAGGAYARIDHSPWNGCTLADFVMP 106
           Q++  +V+ +D   G  LT V        LMILVDDAG     I+HSPW+G TLADFVMP
Sbjct: 10  QRQGIQVSVVDVAEGQWLTCVHLFMPEMPLMILVDDAGAFLPAINHSPWDGVTLADFVMP 69

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKH 166
           FFLFIVGVA+ALA K+VP    A +K I R LKL   G++LQGG+ H   +L++G+DM+ 
Sbjct: 70  FFLFIVGVALALAYKRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGIDMEK 129

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIIT 226
           IR  GILQRIA+ Y+V AL E +  K   +V       +    ++Q   G I  + Y+  
Sbjct: 130 IRLMGILQRIAIAYIVTALCE-IWLKGDDDV--DSGFDLLKRNRYQLFIGLIVMITYMGF 186

Query: 227 TYSLYVPNWSFSEHSDHGVKK-YIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWS 285
            Y  YVP+W +        +K + VKC +RG  GP CNAVG +DR++ GI HLY  PV++
Sbjct: 187 LYGTYVPDWEYRISVPGSTEKSFFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRPVYA 246

Query: 286 RLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA 345
           R + C+++SP +GPLR DAPSWC+APF+PEGLLS++ AI++  IG+ YGHV++HF+ H  
Sbjct: 247 RSKQCSINSPQNGPLRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHKE 306

Query: 346 RLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSY 381
           R+  W+   F +LI+A  L F   + +NK LY+ SY
Sbjct: 307 RIMKWLIPSFSMLILAFSLDFF-GMHMNKPLYTVSY 341


>gi|195642128|gb|ACG40532.1| hypothetical protein [Zea mays]
          Length = 379

 Score =  296 bits (757), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 146/317 (46%), Positives = 211/317 (66%), Gaps = 5/317 (1%)

Query: 74  VVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKI 133
           ++LMI+VDDAG     ++HSPW+G T+ADFVMPFFLFIVGVA+ALA K+VP    A KK 
Sbjct: 1   MLLMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKLDATKKA 60

Query: 134 IFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
           + R LKL   G++LQGG+ H   +L++GVD++ IR  GILQRIA+ Y++ AL E +  K 
Sbjct: 61  VLRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLLTALCE-IWLKG 119

Query: 194 RPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK-YIVKC 252
             +V       +   Y++Q + G +  + Y+   Y  YVP+  +       ++K + VKC
Sbjct: 120 DEDV--DYGYDLLKRYRYQLLVGAVVAITYMSLLYGTYVPDCEYQTSGPGSIEKSFFVKC 177

Query: 253 GMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPF 312
           G+RG   P CNAVG +DR + GI HLY  PV++R + C++ SP +GPL  DAPSWC+APF
Sbjct: 178 GVRGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQNGPLPPDAPSWCQAPF 237

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPI 372
           +PEGLLS++ AI++  IG+ YGH+++HF+ H  R+ +W+   F +L++A  + F   + +
Sbjct: 238 DPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHRERIMNWLIPSFSMLVLAFAMDFL-GLRM 296

Query: 373 NKQLYSFSYVCFTAGAA 389
           NK LY+ SY   TAGAA
Sbjct: 297 NKPLYTLSYTLATAGAA 313


>gi|212723180|ref|NP_001132467.1| uncharacterized protein LOC100193923 [Zea mays]
 gi|194694464|gb|ACF81316.1| unknown [Zea mays]
 gi|414587418|tpg|DAA37989.1| TPA: hypothetical protein ZEAMMB73_167983 [Zea mays]
          Length = 391

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 148/315 (46%), Positives = 208/315 (66%), Gaps = 5/315 (1%)

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF 135
           LMI+VDDAG     ++HSPW+G T+ADFVMPFFLFIVGVA+ALA K+VP    A KK + 
Sbjct: 15  LMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKLDATKKAVL 74

Query: 136 RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           R LKL   G++LQGG+ H   +L++GVD++ IR  GILQRIA+ Y++ AL E +  K   
Sbjct: 75  RALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLLTALCE-IWLKGDE 133

Query: 196 NVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK-YIVKCGM 254
           +V       +   Y++Q   G I  + Y+   Y  YV +W +       ++K + VKCG+
Sbjct: 134 DV--DYGYDLLKRYRYQLFVGAIVGITYMSLLYGTYVRDWEYQTSGPGSIEKSFFVKCGV 191

Query: 255 RGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEP 314
           RG   P CNAVG +DR + GI HLY  PV++R + C++ SP +GPL  DAPSWC+APF+P
Sbjct: 192 RGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQNGPLPPDAPSWCQAPFDP 251

Query: 315 EGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINK 374
           EGLLS++ AI++  IG+ YGHV++HF+ H  R+ +W+   F +L++A  + F   + +NK
Sbjct: 252 EGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERIMNWLIPSFSMLVLAFAMDFL-GLRMNK 310

Query: 375 QLYSFSYVCFTAGAA 389
            LY+ SY   TAGAA
Sbjct: 311 PLYTLSYTLATAGAA 325


>gi|224080634|ref|XP_002306188.1| predicted protein [Populus trichocarpa]
 gi|222849152|gb|EEE86699.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score =  294 bits (753), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 160/360 (44%), Positives = 219/360 (60%), Gaps = 22/360 (6%)

Query: 49  ELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFF 108
           +++ Q    +K+ RVA+LD FRGL V LM+LVD  G     I HSPWNG  LADFVMPFF
Sbjct: 11  DIEEQPRTSKKTPRVASLDVFRGLCVFLMMLVDYGGAIVPIIAHSPWNGLHLADFVMPFF 70

Query: 109 LFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           LFI GV++AL  K+VP    A +K + R ++L   G+ILQGGY H  + L+YGVDMK IR
Sbjct: 71  LFIAGVSLALVYKRVPNRIEATRKAVLRAVELFLLGVILQGGYFHGINFLTYGVDMKRIR 130

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
           W GILQRI++ Y+  AL E   + R       R +S   +Y W W   F    IY+   Y
Sbjct: 131 WLGILQRISIGYIFAALCEIWLSCR-----SRRDVSFLKSYYWHWGAAFSLSAIYLGLLY 185

Query: 229 SLYVPNWSFSEH--------SDHGVKKYI--VKCGMRGHLGPACNAVGYVDRELWGINHL 278
            LYVP+W F           ++H     +  VKC +RG LGPACN+ G +DR + GI+HL
Sbjct: 186 GLYVPDWQFEMSNATSSVFPTNHSYVYMLTQVKCSVRGDLGPACNSAGMIDRYVLGIDHL 245

Query: 279 YSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI 338
           Y  PV+  L+ C +S+  +G + E APSWC APF+PEG+LS+I+A ++  IG+ YGH L 
Sbjct: 246 YKKPVYRNLKECNMST--NGQVPESAPSWCHAPFDPEGVLSSITAAVACIIGLQYGHSLA 303

Query: 339 HFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNS 398
           H + H  R+++W+     LL++ ++L      P+NK LY+F Y+  T  +A    GIT S
Sbjct: 304 HLQDHKQRMQNWILFSLSLLLVGLLLAVVGD-PVNKSLYTFGYMLITCASA----GITYS 358


>gi|125582342|gb|EAZ23273.1| hypothetical protein OsJ_06967 [Oryza sativa Japonica Group]
          Length = 423

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 151/363 (41%), Positives = 225/363 (61%), Gaps = 8/363 (2%)

Query: 28  ENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAY 87
            N  NK K   R + Q  +     ++ +L +       L  F  L   LMI+VDDAG   
Sbjct: 2   SNSTNKRK--RRKDTQKSRMCAASMRTVLVRSPSSDKILKIFVWL-FQLMIIVDDAGAFL 58

Query: 88  ARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIIL 147
             ++HSPW+G T+ADFVMPFFLF+VG+++ LA K+VP    A KK + R LKL   G++L
Sbjct: 59  PALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVLRALKLFCLGLVL 118

Query: 148 QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFT 207
           QGG+ H   +L++GVD+  IR  GILQRIA+ Y++ A+ E +  K   +V     L +  
Sbjct: 119 QGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICE-IWLKGDDDV--DCGLDVIR 175

Query: 208 AYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK-YIVKCGMRGHLGPACNAVG 266
            Y++Q +   +   +Y +    +YVP+W +        +K + V+CG+RG  GPACNAVG
Sbjct: 176 RYRYQLVVALLLSTMYTVILNGVYVPDWEYQISGPGSTEKSFSVRCGVRGDTGPACNAVG 235

Query: 267 YVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILS 326
            +DR + GI+HLY  PV++R + C+++ P +GPL  DAPSWC+APF+PEGLLS++ AI++
Sbjct: 236 MLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPFDPEGLLSSVMAIVT 295

Query: 327 GTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTA 386
             IG+ +GH++IHF+ H  R+ +W+   F +L +A  + F   I +NK LY+ SY   T+
Sbjct: 296 CLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDFI-GIRMNKPLYTISYALATS 354

Query: 387 GAA 389
           GAA
Sbjct: 355 GAA 357


>gi|224103167|ref|XP_002312951.1| predicted protein [Populus trichocarpa]
 gi|222849359|gb|EEE86906.1| predicted protein [Populus trichocarpa]
          Length = 419

 Score =  291 bits (745), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 157/358 (43%), Positives = 221/358 (61%), Gaps = 21/358 (5%)

Query: 49  ELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFF 108
           +++ Q    +K  R A+LD FRGL V LM+LVD  G     I HSPWNG  LAD VMPFF
Sbjct: 11  DIEEQLHTSKKPPRAASLDVFRGLCVFLMMLVDYGGAIIPIIAHSPWNGLHLADSVMPFF 70

Query: 109 LFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           LFI GV++AL  KKVP    A  K + + +KL   G+++QGGY H  ++L+YGVDMK IR
Sbjct: 71  LFIAGVSLALVYKKVPNRIEATWKAVLKAIKLFLLGVVIQGGYFHGINSLTYGVDMKRIR 130

Query: 169 WCGILQRIALVYVVVALIET-LTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227
           W GILQ+I++ Y+V AL E  L+ + R      R +S   +Y W W   F    IY+   
Sbjct: 131 WLGILQKISVGYIVAALCEIWLSCRTR------RGVSFLKSYYWHWCVAFSLSAIYLGLL 184

Query: 228 YSLYVPNWSF-------SEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYS 280
           Y LYVP+W F       S    +    Y+VKC +RG LGPACN+ G +DR + GI+HLY 
Sbjct: 185 YGLYVPDWQFEMSNATSSVFPTNHSNVYMVKCSLRGDLGPACNSAGMIDRYILGIDHLYK 244

Query: 281 DPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHF 340
            PV+  L+ C +S+   G + +++ SWC APF+PEG+LS+++A ++  IG+ YGH+L H 
Sbjct: 245 KPVYRNLKECNMST--DGQVPDNSASWCHAPFDPEGVLSSLTAAVTCIIGLQYGHLLAHL 302

Query: 341 KGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNS 398
           + H  R+++W    F LL++ ++L      P+NK LY+FSY+  T+ +A    GIT S
Sbjct: 303 QDHKGRMENWTLFSFSLLVVGLLLVVIGD-PVNKSLYTFSYMLITSASA----GITYS 355


>gi|449528551|ref|XP_004171267.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 380

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 157/319 (49%), Positives = 215/319 (67%), Gaps = 11/319 (3%)

Query: 77  MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR 136
           MI+VD AGG    I+HSPW+G TLAD VMPFFLFIVGV++ALA KK+P    A +K + R
Sbjct: 1   MIVVDYAGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLR 60

Query: 137 TLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIET-LTTKRRP 195
           TLKLLF G+ LQGG+ H  + L+YGVD++ IRW GILQRIA+ Y + AL E  L      
Sbjct: 61  TLKLLFLGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYV 120

Query: 196 NVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW-----SFSEHSDHGVKKYIV 250
           N       ++   YQ Q +   +  ++Y+  +Y LYVP+W     S +       K + V
Sbjct: 121 N----SETALRRKYQLQLVAAVVLTMLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSV 176

Query: 251 KCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRA 310
           KCG RG  GPACNAVG +DR+++GI HLY  P+++R E C++++P+ GPL  DAPSWC+A
Sbjct: 177 KCGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQA 236

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           PF+PEGLLST+ A+++  +G+HYGH+++HFK H  R+ HW+     L+++AI L F   +
Sbjct: 237 PFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFL-GM 295

Query: 371 PINKQLYSFSYVCFTAGAA 389
            INK LY+ SY+  TAGAA
Sbjct: 296 HINKVLYTVSYMSVTAGAA 314


>gi|326510109|dbj|BAJ87271.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 304

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 139/300 (46%), Positives = 197/300 (65%), Gaps = 4/300 (1%)

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF 135
           LMI+VDDAG     ++HSPW G T+ADFVMPFFLFIVGVA+ALA K+VP    A +K   
Sbjct: 8   LMIIVDDAGSFLPAMNHSPWEGVTIADFVMPFFLFIVGVALALAYKRVPDKLDATRKATL 67

Query: 136 RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           R LKL   G++LQGG+ H   +L++GVD+  IR  GILQRIA+ Y+V AL + +  K   
Sbjct: 68  RALKLFCVGLVLQGGFFHGVRSLTFGVDIAQIRLMGILQRIAIAYLVTALCQ-IWLKGDD 126

Query: 196 NVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS-EHSDHGVKKYIVKCGM 254
           +V     L +   Y++Q + G +  + Y+   Y  YVP+W +         K + V+CG+
Sbjct: 127 DV--DSGLDLIKRYKYQLLAGLLITITYMALLYGTYVPDWEYRISGPGFTEKTFTVRCGV 184

Query: 255 RGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEP 314
           RG  GP CNAVG +DR++ GI HLY  PV++R + C++ SP +GPL  DAPSWC+APF+P
Sbjct: 185 RGDSGPGCNAVGMIDRKILGIQHLYGRPVYARSQQCSIDSPQNGPLPPDAPSWCQAPFDP 244

Query: 315 EGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINK 374
           EGLLS++ AI++  IG+ YGH+++HF+ H  R+ HW+   FG+L++A  + F     +N 
Sbjct: 245 EGLLSSVMAIVTCLIGLQYGHIIVHFQKHKERIMHWLVPSFGMLVLAFAMDFFGKGVVNS 304


>gi|357442361|ref|XP_003591458.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
 gi|355480506|gb|AES61709.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
          Length = 476

 Score =  288 bits (737), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 161/374 (43%), Positives = 229/374 (61%), Gaps = 22/374 (5%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALA 119
           ++R+ +LD FRGLTV LMILVDD G A+  ++HSPW G TLADFVMPFFLF VGV+IAL 
Sbjct: 42  NQRLVSLDVFRGLTVALMILVDDVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIALV 101

Query: 120 LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALV 179
            KKV     A KKII RT+KL   G++LQGGY H    L+YG+D+  +RW G+LQRI++ 
Sbjct: 102 FKKVSSKQNATKKIISRTIKLFLLGLLLQGGYFHGRGNLTYGLDLTKLRWFGVLQRISIG 161

Query: 180 YVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSE 239
           Y + ++ E        N+L     +    Y  QWI   +   +Y+   Y LYVPNW F E
Sbjct: 162 YFLASMSEIWLVNG--NILVDSPAAFVRKYSIQWIFSILLCSVYLCLLYGLYVPNWEF-E 218

Query: 240 HSD---HGVKKYI--VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
           HS+    G    I  V C MRG L P CNAVG++DR + G +H+Y  PV+ R + C+++S
Sbjct: 219 HSNLLWPGRVSTIQNVHCDMRGSLDPPCNAVGFIDRLILGEDHMYQRPVYRRTKECSVNS 278

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
           P+ GPL  D+P WC APF+PEG+LS++ A ++  +G+ +GH+L+ F+ H  R+  W    
Sbjct: 279 PDYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFVGLQFGHILVIFQAHKQRVLLWSVFS 338

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFR 414
           F LL++  +L     IP++K LY+ S++  TAGA+   L +T   +I ++          
Sbjct: 339 FSLLVVGYVLEIL-GIPLSKALYTLSFMFITAGASG--LVLTAIYYIVDIK--------- 386

Query: 415 AWSTRYPSRICKWM 428
               R P+ + +WM
Sbjct: 387 --QLRKPTVLLQWM 398


>gi|218190872|gb|EEC73299.1| hypothetical protein OsI_07466 [Oryza sativa Indica Group]
          Length = 454

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 153/366 (41%), Positives = 229/366 (62%), Gaps = 10/366 (2%)

Query: 29  NGINKEK-GLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTV---VLMILVDDAG 84
           NG  K   GLE S     ++  +Q ++L +    R  + D    + V    LMI+VDDAG
Sbjct: 28  NGCEKSLLGLEESVTVLVRRPGIQ-ERLFRAVLVRSPSSDKILKIFVWLFQLMIIVDDAG 86

Query: 85  GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWG 144
                ++HSPW+G T+ADFVMPFFLF+VG+++ LA K+VP    A KK + R LKL   G
Sbjct: 87  AFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVLRALKLFCLG 146

Query: 145 IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLS 204
           ++LQGG+ H   +L++GVD+  IR  GILQRIA+ Y++ A+ E +  K   +V     L 
Sbjct: 147 LVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICE-IWLKGDDDV--DCGLD 203

Query: 205 IFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK-YIVKCGMRGHLGPACN 263
           +   Y++Q +   +   +Y +    +YVP+W +        +K + V+CG+RG  GPACN
Sbjct: 204 VIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQISGPGSTEKSFSVRCGVRGDTGPACN 263

Query: 264 AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISA 323
           AVG +DR + GI+HLY  PV++R + C+++ P +GPL  DAPSWC+APF+PEGLLS++ A
Sbjct: 264 AVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPFDPEGLLSSVMA 323

Query: 324 ILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVC 383
           I++  IG+ +GH++IHF+ H  R+ +W+   F +L +A  + F   I +NK LY+ SY  
Sbjct: 324 IVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDFI-GIRMNKPLYTISYAL 382

Query: 384 FTAGAA 389
            T+GAA
Sbjct: 383 ATSGAA 388


>gi|334188248|ref|NP_001190487.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008209|gb|AED95592.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 435

 Score =  288 bits (736), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 178/444 (40%), Positives = 248/444 (55%), Gaps = 54/444 (12%)

Query: 13  RTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGL 72
           RT+L   E     D  +   +EK    S +Q  +   L   +      +R+ +LD FRGL
Sbjct: 2   RTKLTMYEAIKDNDDNDHQWREKKDIESALQISRSSSLPPDK------ERLVSLDVFRGL 55

Query: 73  TVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKK 132
           TV  MILVDD GG    I+HSPW+G TLADFVMPFFLFIVGV++A A K +     A +K
Sbjct: 56  TVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATRK 115

Query: 133 IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK 192
            + R+LKLL  G+ LQGG+ H  + L+YG+D++ IR  GILQRIA+ Y+VVAL E +  K
Sbjct: 116 ALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCE-IWLK 174

Query: 193 RRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF---SEHSDHGVKKYI 249
              NV     LS+   Y++ W+  F+   IY+   Y LYVP+W +    E     +  ++
Sbjct: 175 GNHNV--SSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGSTLTTFL 232

Query: 250 ---VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPS 306
              VKCG+RGH GP CNAVG +DR   GI HLY  PV++R + C+++ PN+GPL  DAPS
Sbjct: 233 NLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPS 292

Query: 307 WCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK------------------------- 341
           WC+APF+PEGLLS++ A ++  +G+HYGH++IHFK                         
Sbjct: 293 WCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKRNGSKGQVYNEPSISIRPFFFILSE 352

Query: 342 ------------GHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
                        H  RL  W+   F  L++  +      + +NK LY+ SY+C T+GA+
Sbjct: 353 TYLLLYVINFLQDHKKRLNQWILRSF-CLLMLGLALNLFGMHLNKPLYTLSYMCVTSGAS 411

Query: 390 DGCL-GITNSIFIPEMDWDECDAC 412
              L  I   ++   MD +E   C
Sbjct: 412 GFLLSAIYLMVWSTFMDTNEQALC 435


>gi|356516509|ref|XP_003526936.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 416

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 152/355 (42%), Positives = 209/355 (58%), Gaps = 15/355 (4%)

Query: 42  VQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLA 101
           + D Q   L   +  Q ++ R+A+LD FRGL+V LMI VD A   +  I H+PWNG  LA
Sbjct: 1   MADPQPLLLNDSEPTQFQNTRIASLDVFRGLSVFLMIFVDYAASIFPIIAHAPWNGIHLA 60

Query: 102 DFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG 161
           DFVMPFFLFI G+++AL  K+ P    A  K   R L L   GI+LQGGY H   +L++G
Sbjct: 61  DFVMPFFLFIAGISLALVYKRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFG 120

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFV 221
           VD++ IRW GILQRI++ Y+V AL E      R      + L    +Y WQW    I   
Sbjct: 121 VDIQRIRWLGILQRISIGYIVAALCEIWLPAPR-----WKELGFVKSYYWQWFVAVILLA 175

Query: 222 IYIITTYSLYVPNWSFSEHSDH-------GVKKYIVKCGMRGHLGPACNAVGYVDRELWG 274
           +Y    Y LYVP+W F   +         G   Y+V C +RG LGPACN+ G +DR + G
Sbjct: 176 LYSGLLYGLYVPDWQFDVSASTSSLPPIGGGDIYMVNCSVRGDLGPACNSAGMIDRYILG 235

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
           ++HLY  PV+  L+ C +S+   G + + +PSWC APF+PEG+LS+I+A +S  IG+ YG
Sbjct: 236 LDHLYRKPVYRNLKGCNMSA--KGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYG 293

Query: 335 HVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           HVL H + H  RL +W+      L + + L     IP+NK LY+ SY+  T+ A+
Sbjct: 294 HVLAHLQDHKGRLYNWMCFSLSFLALGLFLALI-GIPLNKSLYTVSYMLLTSAAS 347


>gi|115446433|ref|NP_001046996.1| Os02g0526000 [Oryza sativa Japonica Group]
 gi|49388281|dbj|BAD25399.1| unknown protein [Oryza sativa Japonica Group]
 gi|49388287|dbj|BAD25402.1| unknown protein [Oryza sativa Japonica Group]
 gi|113536527|dbj|BAF08910.1| Os02g0526000 [Oryza sativa Japonica Group]
          Length = 376

 Score =  284 bits (727), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 140/314 (44%), Positives = 208/314 (66%), Gaps = 5/314 (1%)

Query: 77  MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR 136
           MI+VDDAG     ++HSPW+G T+ADFVMPFFLF+VG+++ LA K+VP    A KK + R
Sbjct: 1   MIIVDDAGAFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVLR 60

Query: 137 TLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN 196
            LKL   G++LQGG+ H   +L++GVD+  IR  GILQRIA+ Y++ A+ E +  K   +
Sbjct: 61  ALKLFCLGLVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICE-IWLKGDDD 119

Query: 197 VLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK-YIVKCGMR 255
           V     L +   Y++Q +   +   +Y +    +YVP+W +        +K + V+CG+R
Sbjct: 120 V--DCGLDVIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQISGPGSTEKSFSVRCGVR 177

Query: 256 GHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPE 315
           G  GPACNAVG +DR + GI+HLY  PV++R + C+++ P +GPL  DAPSWC+APF+PE
Sbjct: 178 GDTGPACNAVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPFDPE 237

Query: 316 GLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQ 375
           GLLS++ AI++  IG+ +GH++IHF+ H  R+ +W+   F +L +A  + F   I +NK 
Sbjct: 238 GLLSSVMAIVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDFI-GIRMNKP 296

Query: 376 LYSFSYVCFTAGAA 389
           LY+ SY   T+GAA
Sbjct: 297 LYTISYALATSGAA 310


>gi|356534906|ref|XP_003535992.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Glycine max]
          Length = 489

 Score =  284 bits (727), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 158/380 (41%), Positives = 225/380 (59%), Gaps = 23/380 (6%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
           L   ++R+++LD FRGLTV LMILVD+ G A+  ++HSPW G TLADFVMPFFLF+VGV+
Sbjct: 48  LSLPNQRLSSLDVFRGLTVALMILVDNVGRAFPSLNHSPWFGVTLADFVMPFFLFVVGVS 107

Query: 116 IALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
           I L  KKV     A KK+I RTLKL   G++LQGGY H    L+YGVD+  IRW G+LQR
Sbjct: 108 IGLVFKKVSSKPNATKKVISRTLKLFLLGLLLQGGYFHGHGKLTYGVDLSKIRWLGVLQR 167

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW 235
           I++ Y   ++ E        N+L          Y  QW+   +   +Y+   Y LYVPNW
Sbjct: 168 ISIGYFFASISEIWLVNH--NILVDSPAGFVRKYSIQWMFSILLCSVYLCLLYGLYVPNW 225

Query: 236 SFSEHSDHGVKKYI-------VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            F   +               V C +RG L P CN VG++DR + G +H+Y  PV+ R +
Sbjct: 226 KFKHSNLLSSSDSSHLSIIQNVHCEVRGSLEPPCNVVGFIDRLILGEDHMYQRPVYIRTK 285

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
            C+++SP+ GPL  D+P WC APF+PEG+LS++ A ++  +G+ YGH+++H +GH  R+ 
Sbjct: 286 ECSVNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFMGLQYGHIIVHLQGHKQRVL 345

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDE 408
            W    F LL+I  IL     +P++K LY+ SY C TAGA+   L +T   +I +++   
Sbjct: 346 LWSVFSFSLLLIGYILEIL-GMPLSKALYTLSYTCITAGASG--LVLTAIYYIVDIE--- 399

Query: 409 CDACFRAWSTRYPSRICKWM 428
                     R P+ + +WM
Sbjct: 400 --------HLRKPTVLLQWM 411


>gi|32487909|emb|CAE05368.1| OJ000315_02.13 [Oryza sativa Japonica Group]
          Length = 452

 Score =  284 bits (726), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 157/360 (43%), Positives = 216/360 (60%), Gaps = 39/360 (10%)

Query: 57  QQKSKRVATLDAFRG--LTVV--------LMILVDDAGGAYARIDHSPWNGCTLADFVMP 106
           Q++  +V+ +D   G  LT V        LMILVDDAG     I+HSPW+G TLADFVMP
Sbjct: 10  QRQGIQVSVVDVAEGQWLTCVHLFMPEMPLMILVDDAGAFLPAINHSPWDGVTLADFVMP 69

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKH 166
           FFLFIVGVA+ALA K+VP    A +K I R LKL   G++LQGG+ H   +L++G+DM+ 
Sbjct: 70  FFLFIVGVALALAYKRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGIDMEK 129

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIIT 226
           IR  GILQRIA+ Y+V AL E +  K   +V       +    ++Q   G I  + Y+  
Sbjct: 130 IRLMGILQRIAIAYIVTALCE-IWLKGDDDV--DSGFDLLKRNRYQLFIGLIVMITYMGF 186

Query: 227 TYSLYVPNWSFSEHSDHGVKK-YIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWS 285
            Y  YVP+W +        +K + VKC +RG  GP CNAVG +DR++ GI HLY  PV++
Sbjct: 187 LYGTYVPDWEYRISVPGSTEKSFFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRPVYA 246

Query: 286 RLEACTLSSPNSGPLREDAPSWCRAPFEPEGLL------------------------STI 321
           R + C+++SP +GPLR DAPSWC+APF+PEGLL                        S++
Sbjct: 247 RSKQCSINSPQNGPLRPDAPSWCQAPFDPEGLLRLQQYNISFANFAKFSLFFLDSRISSV 306

Query: 322 SAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSY 381
            AI++  IG+ YGHV++HF+ H  R+  W+   F +LI+A  L F   + +NK LY+ SY
Sbjct: 307 MAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSMLILAFSLDFF-GMHMNKPLYTVSY 365


>gi|186530235|ref|NP_001119392.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008204|gb|AED95587.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 359

 Score =  281 bits (718), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 156/335 (46%), Positives = 213/335 (63%), Gaps = 15/335 (4%)

Query: 13  RTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGL 72
           RT+L   E     D  +   +EK    S +Q  +   L   +      +R+ +LD FRGL
Sbjct: 2   RTKLTMYEAIKDNDDNDHQWREKKDIESALQISRSSSLPPDK------ERLVSLDVFRGL 55

Query: 73  TVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKK 132
           TV  MILVDD GG    I+HSPW+G TLADFVMPFFLFIVGV++A A K +     A +K
Sbjct: 56  TVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATRK 115

Query: 133 IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK 192
            + R+LKLL  G+ LQGG+ H  + L+YG+D++ IR  GILQRIA+ Y+VVAL E +  K
Sbjct: 116 ALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCE-IWLK 174

Query: 193 RRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF---SEHSDHGVKKYI 249
              NV     LS+   Y++ W+  F+   IY+   Y LYVP+W +    E     +  ++
Sbjct: 175 GNHNV--SSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGSTLTTFL 232

Query: 250 ---VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPS 306
              VKCG+RGH GP CNAVG +DR   GI HLY  PV++R + C+++ PN+GPL  DAPS
Sbjct: 233 NLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPS 292

Query: 307 WCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK 341
           WC+APF+PEGLLS++ A ++  +G+HYGH++IHFK
Sbjct: 293 WCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFK 327


>gi|449458622|ref|XP_004147046.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
 gi|449489633|ref|XP_004158370.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cucumis sativus]
          Length = 418

 Score =  281 bits (718), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 152/337 (45%), Positives = 202/337 (59%), Gaps = 14/337 (4%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           K+ RV +LD FRGL+V +M+LVD  G     I HSPW G  LADFVMP+FLFI GV++AL
Sbjct: 20  KAPRVVSLDVFRGLSVFMMMLVDYGGSFLPIISHSPWIGLHLADFVMPWFLFIAGVSVAL 79

Query: 119 ALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIAL 178
             K+V     A +    R L L   G+ LQGGY H   +L+YGVD++ IRW GILQRI++
Sbjct: 80  VYKEVESKVAAARNAACRGLYLFLLGVFLQGGYFHGITSLTYGVDLESIRWLGILQRISI 139

Query: 179 VYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS 238
            Y++ AL E   T  R    E +H   F+   W W   F    +Y+  +Y LYVP+W F 
Sbjct: 140 GYLIAALCEIWLT--RCTREEAQHTKSFS---WHWCIIFFLLSLYMGLSYGLYVPDWDFK 194

Query: 239 ------EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                      G   Y V C +RG LGPACN+ G +DR + GI+HLY+ PV+  L+ C +
Sbjct: 195 ISAPSSSLPLSGSYVYKVNCSLRGDLGPACNSAGMIDRYVLGIHHLYTKPVYRNLKECNI 254

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
           SS  SG   E +PSWCRAPFEPEGLLS+++A ++  IG+ YGH+L   + H  R   W  
Sbjct: 255 SS--SGQFPETSPSWCRAPFEPEGLLSSLTATVACIIGLQYGHILARAQDHKTRTNGWFL 312

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           + F +L   I L F   IP+NK LY+ SY+  T+ +A
Sbjct: 313 LSFKILAFGIFLVFI-GIPVNKSLYTVSYMLITSASA 348


>gi|238481501|ref|NP_001154765.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008206|gb|AED95589.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 435

 Score =  278 bits (712), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 151/325 (46%), Positives = 212/325 (65%), Gaps = 11/325 (3%)

Query: 28  ENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAY 87
           +N  N  +  E+ ++  E   ++     L    +R+ +LD FRGLTV  MILVDD GG  
Sbjct: 7   DNDDNDHQWREKKDI--ESALQISRSSSLPPDKERLVSLDVFRGLTVAFMILVDDVGGIL 64

Query: 88  ARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIIL 147
             I+HSPW+G TLADFVMPFFLFIVGV++A A K +     A +K + R+LKLL  G+ L
Sbjct: 65  PSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFL 124

Query: 148 QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFT 207
           QGG+ H  + L+YG+D++ IR  GILQRIA+ Y+VVAL E +  K   NV     LS+  
Sbjct: 125 QGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCE-IWLKGNHNV--SSELSMIK 181

Query: 208 AYQWQWIGGFIAFVIYIITTYSLYVPNWSF---SEHSDHGVKKYI---VKCGMRGHLGPA 261
            Y++ W+  F+   IY+   Y LYVP+W +    E     +  ++   VKCG+RGH GP 
Sbjct: 182 KYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGSTLTTFLNLKVKCGVRGHTGPG 241

Query: 262 CNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTI 321
           CNAVG +DR   GI HLY  PV++R + C+++ PN+GPL  DAPSWC+APF+PEGLLS++
Sbjct: 242 CNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPSWCQAPFDPEGLLSSL 301

Query: 322 SAILSGTIGIHYGHVLIHFKGHSAR 346
            A ++  +G+HYGH++IHFK + ++
Sbjct: 302 MATVTCLVGLHYGHIIIHFKRNGSK 326


>gi|255581844|ref|XP_002531722.1| conserved hypothetical protein [Ricinus communis]
 gi|223528625|gb|EEF30642.1| conserved hypothetical protein [Ricinus communis]
          Length = 397

 Score =  268 bits (686), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 143/318 (44%), Positives = 197/318 (61%), Gaps = 14/318 (4%)

Query: 77  MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR 136
           M+LVD  G  +  I HSPWNG  LADFVMPFFLFI GV++AL  KKV K   A  K + R
Sbjct: 1   MMLVDYGGSIFPIIAHSPWNGLHLADFVMPFFLFIAGVSLALVYKKVTKRIDATWKAMLR 60

Query: 137 TLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN 196
            +KL F G+ LQGGY H  ++L+YGVD++ IRW GILQRI++ Y+V AL E   ++R  +
Sbjct: 61  AVKLFFLGVFLQGGYFHGINSLTYGVDIERIRWFGILQRISIGYIVAALCEIWLSRRTQS 120

Query: 197 VLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF-------SEHSDHGVKKYI 249
               R +  F  Y W W+  F    +Y+   Y LYVP+W F       S    +G   Y+
Sbjct: 121 ---QREIGFFKNYYWHWVVAFSLSAVYLGLLYGLYVPDWQFEMSNAASSALPINGSNVYM 177

Query: 250 VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCR 309
           VKC +RG LGPACN+ G +DR + G +HLY+ PV   L+ C ++   +G + E +PSWC 
Sbjct: 178 VKCSVRGDLGPACNSAGMIDRYVLGFDHLYTKPVHRNLKECNMT---NGQVSESSPSWCH 234

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNA 369
           APF+PEGLLS+++A ++  IG+  GHVL H + H  R++ W      LL++  +L F   
Sbjct: 235 APFDPEGLLSSLTAAITCIIGLQCGHVLAHIQEHKGRIESWSLFSASLLLLGSVLAFI-G 293

Query: 370 IPINKQLYSFSYVCFTAG 387
           IP+NK LY+ SY+  T+ 
Sbjct: 294 IPVNKSLYTISYMLITSA 311


>gi|357467537|ref|XP_003604053.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
 gi|355493101|gb|AES74304.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
           truncatula]
          Length = 421

 Score =  268 bits (685), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 140/342 (40%), Positives = 203/342 (59%), Gaps = 17/342 (4%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + +  RVA++D FRGL+V LMI VD  G  +  I H+PWNG  LADFVMPFFLF+VG+++
Sbjct: 14  ETQFPRVASVDVFRGLSVFLMIFVDYGGSIFPIISHAPWNGLHLADFVMPFFLFLVGISL 73

Query: 117 ALAL--KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           AL    K+      +  K + R+ +L   GI+LQGGY H   + +YGVD++ IR+ G+LQ
Sbjct: 74  ALVYKNKRSRPTQSSTWKPLLRSFQLFILGILLQGGYFHGIHSFTYGVDVQTIRFFGVLQ 133

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RI++ Y+V AL +          L  +H S F  Y   W    I   I+    Y L+VP+
Sbjct: 134 RISIGYIVAALCQICLPT-----LPSKHTSFFKTYYSHWFVAAILLAIHSGLLYGLHVPD 188

Query: 235 WSF----SEHSDHGVKK---YIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287
           W F    S  S   ++    Y V C +RG LGPACN+ G +DR + G++HLY  PV+  L
Sbjct: 189 WQFDASLSTSSLPPIQAGNVYTVNCSVRGDLGPACNSAGMIDRYILGLDHLYKKPVFRNL 248

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL 347
           + C +SS  +G + + +PSWC APF+PEG+LS+I+A +S  IG+ YGH+L + + H  RL
Sbjct: 249 KECNMSS--TGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHILANLEDHKGRL 306

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
             W+      L +   L     IP+NK LY+ SY+  ++ A+
Sbjct: 307 NQWLGFSVSFLALGWFLALI-GIPLNKSLYTVSYMLLSSAAS 347


>gi|395146531|gb|AFN53685.1| putative aquaporin PIP2-8 [Linum usitatissimum]
          Length = 694

 Score =  267 bits (682), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 146/341 (42%), Positives = 203/341 (59%), Gaps = 37/341 (10%)

Query: 51  QLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLF 110
           QL  L ++K+KR+A+LDAFRGL + LM+LVD  G  +  I HS WNG  LADFVMPFFLF
Sbjct: 36  QLLMLYRKKNKRLASLDAFRGLCIFLMMLVDYGGHVFPTIAHSAWNGIHLADFVMPFFLF 95

Query: 111 IVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 170
           IVGV+IAL  KK P    A +K + +++KL   GI+LQ                      
Sbjct: 96  IVGVSIALVYKKAPNRVEATRKALLKSVKLFLVGILLQE--------------------- 134

Query: 171 GILQRIALVYVVVALIET-LTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
              QRI++ Y+V A+ E  L+ +R+ +V       I  +Y W WI       +Y   +Y 
Sbjct: 135 ---QRISIGYIVGAICEIWLSIRRKGDV------GIIKSYYWHWIAALAIVAVYARLSYG 185

Query: 230 LYVPNWSFSEHSD-HGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
           LYVP+W FS   D H V  + VKC ++G +GPACN+ G +DR + G++HLY+ PV+  L+
Sbjct: 186 LYVPDWQFSLPGDQHHV--FTVKCSVKGDVGPACNSAGMIDRYVLGLSHLYAKPVYKNLK 243

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
            C +SS    P  EDAPSWC APF+PEGLLS+++A ++  IG+ +GHVL H + H  RL+
Sbjct: 244 VCNMSSNKQVP--EDAPSWCHAPFDPEGLLSSLTAAVTCIIGLQFGHVLAHIQDHKGRLE 301

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +W       L++ + L      PINK LYS SY+  T+ +A
Sbjct: 302 NWSGFSVFFLVLGLFL-VRLGFPINKPLYSISYMLITSASA 341


>gi|326432441|gb|EGD78011.1| hypothetical protein PTSG_09649 [Salpingoeca sp. ATCC 50818]
          Length = 1087

 Score =  253 bits (647), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 155/397 (39%), Positives = 210/397 (52%), Gaps = 53/397 (13%)

Query: 47  KGELQLQQLLQQKSK-RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVM 105
           +G L  Q   +   K R+++LD FRG TV LM+ VD+ G A+  IDHSPWNG  LADFVM
Sbjct: 621 RGTLDAQPAQRSLPKERLSSLDVFRGFTVALMVFVDETGAAFPPIDHSPWNGVRLADFVM 680

Query: 106 PFFLFIVGVAIALALKK--------VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDA 157
           PFF FIVGV++AL+ KK         P++  A++K   R LKL   G++ QGG     D 
Sbjct: 681 PFFDFIVGVSLALSFKKFDLEDATTTPRVWPALRKATIRFLKLFILGMLTQGGI----DI 736

Query: 158 LSYGVDMKHIRWCGILQRIALVYVVVALIETL--TTKRRPNVLEPRH--------LSIFT 207
           ++Y  D+ HIR  GILQR+A+ Y  VAL+E      K+  N  E           L +  
Sbjct: 737 MNY--DLAHIRIMGILQRVAVCYYAVALMEIFLPRNKKYRNYNETDTVTGWAVDVLHMLW 794

Query: 208 AYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGY 267
            Y+W W      F  +    Y + VP+ +F E           +CG RG L PACNA  Y
Sbjct: 795 RYKWHWFTAACLFATHTGIMYGVNVPD-AFGE-----------ECG-RGVLTPACNAATY 841

Query: 268 VDRELWGINHLY----------SDPVWSRLEACTLSSPNSGPLREDAPSWC-RAPFEPEG 316
           +DR +  + H+Y          +D  + RL  C+  SP      EDAP+WC   PF+PEG
Sbjct: 842 IDRNVLTVEHMYFPANGGDKSGNDVTFQRLPECSTCSPGKCVPPEDAPAWCLHGPFDPEG 901

Query: 317 LLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQL 376
           L+S+++AI++  IGIHYGHVL   +   AR+ HW + G   L+I   LHF+ A  +N  L
Sbjct: 902 LVSSLNAIIATVIGIHYGHVLRRVQSPKARIVHWTAFGVVQLVIGFALHFSGAFVMNTDL 961

Query: 377 YSFSYVCFTAGAADGCLG----ITNSIFIPEMDWDEC 409
           YS SY   TAG     L     I + + + E  W  C
Sbjct: 962 YSISYTLVTAGTGGVLLALFYVIVDRLHVGEWAWSGC 998


>gi|413918232|gb|AFW58164.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
          Length = 423

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 113/269 (42%), Positives = 170/269 (63%), Gaps = 5/269 (1%)

Query: 122 KVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYV 181
           +VP    A +K + R LKL   G++LQGG+ H   +LS+GVD++ IR  G+LQRIA+ Y+
Sbjct: 93  RVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAYL 152

Query: 182 VVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHS 241
           + AL E      R +        +   Y++Q   G +  + Y+   Y  YVP+W +   +
Sbjct: 153 LTALCEIWI---RGDEDVDYGYDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTSA 209

Query: 242 DHGVKKYI-VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPL 300
               +K++ VKCG+RG   P CNAVG +DR++ GI HLY  PV++R + C++ SP +GPL
Sbjct: 210 PGSTEKHLFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQNGPL 269

Query: 301 REDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLII 360
             DAPSWC+APF+PEGLLS++ AI++  IG+ YGHV++HF+ H  R+ +W+   F +L++
Sbjct: 270 PSDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSMLVL 329

Query: 361 AIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           A  + F   + +NK LY+ SY   TAGAA
Sbjct: 330 AFAMDFFG-LHMNKPLYTLSYTLGTAGAA 357


>gi|413937083|gb|AFW71634.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
          Length = 317

 Score =  217 bits (553), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 113/258 (43%), Positives = 159/258 (61%), Gaps = 6/258 (2%)

Query: 34  EKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHS 93
           E G   S+V                + +R+A+LD FRG+TV+LMI+VDDAGG    ++HS
Sbjct: 62  ESGSGASKVAGAPPPSPAPSASPAARQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHS 121

Query: 94  PWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSH 153
           PW+G T+ADF+MPFFLFIVGV++ LA K+VP    A +K + R LKL   G++LQGG+ H
Sbjct: 122 PWDGVTVADFIMPFFLFIVGVSLTLAYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFH 181

Query: 154 APDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQW 213
              +L++GVD+  IR  GILQRIA+ Y++ A+ E +  K   +V       +   Y++Q 
Sbjct: 182 GVHSLTFGVDLTKIRLMGILQRIAIAYLLAAVCE-IWLKGDDDV--DSGYGLLRRYRYQL 238

Query: 214 IGGFIAFVIYIITTYSLYVPNWSFS---EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDR 270
             G +  + Y I  Y +YVP+W +      S    K + VKCG+RG  GPACNAVG VDR
Sbjct: 239 FVGLVLSIAYSILLYGMYVPDWEYQIAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDR 298

Query: 271 ELWGINHLYSDPVWSRLE 288
            + GI+HLY  PV++R +
Sbjct: 299 TVLGIDHLYRRPVYARTK 316


>gi|238481505|ref|NP_001154767.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008208|gb|AED95591.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 340

 Score =  215 bits (547), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 121/278 (43%), Positives = 177/278 (63%), Gaps = 10/278 (3%)

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
           ++   +P    A +K + R+LKLL  G+ LQGG+ H  + L+YG+D++ IR  GILQRIA
Sbjct: 1   MSFAVLPSQFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIA 60

Query: 178 LVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF 237
           + Y+VVAL E +  K   NV     LS+   Y++ W+  F+   IY+   Y LYVP+W +
Sbjct: 61  IAYLVVALCE-IWLKGNHNV--SSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEY 117

Query: 238 S---EHSDHGVKKYI---VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
               E     +  ++   VKCG+RGH GP CNAVG +DR   GI HLY  PV++R + C+
Sbjct: 118 QILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCS 177

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV 351
           ++ PN+GPL  DAPSWC+APF+PEGLLS++ A ++  +G+HYGH++IHFK H  RL  W+
Sbjct: 178 INYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWI 237

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
              F  L++  +      + +NK LY+ SY+C T+GA+
Sbjct: 238 LRSF-CLLMLGLALNLFGMHLNKPLYTLSYMCVTSGAS 274


>gi|395146473|gb|AFN53630.1| putative aquaporin PIP2-8 [Linum usitatissimum]
          Length = 692

 Score =  214 bits (544), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 115/272 (42%), Positives = 168/272 (61%), Gaps = 15/272 (5%)

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           KKVP    A +K   +++KL   GI+LQGG+ H   +L+YGVD++ IR  GILQRI++ Y
Sbjct: 96  KKVPNRVEATRKAFLKSVKLFLVGILLQGGFFHGLHSLTYGVDIERIRLLGILQRISIGY 155

Query: 181 VVVALIET-LTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSE 239
           +V A+ E  L+ +R+ +V       I  +Y   W+      V+Y   +Y LYVP+W F+ 
Sbjct: 156 IVGAICEIWLSVRRKGDV------GIIKSYYSHWVAALAIVVVYARLSYGLYVPDWQFAL 209

Query: 240 HSD-HGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSG 298
             D H V  Y VKC ++G +GPACN+ G +DR + G++HLY+ PV+  L+ C +SS    
Sbjct: 210 PQDQHHV--YTVKCSVKGDVGPACNSAGMMDRYVLGLSHLYAKPVYKNLKICNMSSNKQV 267

Query: 299 PLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFG-L 357
           P  EDAPSWC APF+PEGLLS+++A ++  IG+ +GHVL H + H  RL++W   GF   
Sbjct: 268 P--EDAPSWCHAPFDPEGLLSSLTAAVTCIIGLQFGHVLAHVQDHKGRLENW--SGFSVF 323

Query: 358 LIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            ++  +       PINK LYS SY+  T+ +A
Sbjct: 324 FLLLGLFLVLLGFPINKPLYSISYMLITSASA 355


>gi|186530239|ref|NP_001119393.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008205|gb|AED95588.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 292

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 126/278 (45%), Positives = 172/278 (61%), Gaps = 14/278 (5%)

Query: 13  RTQLVEQEQ-DDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRG 71
           RT+L   E   D  D+++   ++K +E S +Q  +   L   +      +R+ +LD FRG
Sbjct: 2   RTKLTMYEAIKDNDDNDHQWREKKDIE-SALQISRSSSLPPDK------ERLVSLDVFRG 54

Query: 72  LTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK 131
           LTV  MILVDD GG    I+HSPW+G TLADFVMPFFLFIVGV++A A K +     A +
Sbjct: 55  LTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATR 114

Query: 132 KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTT 191
           K + R+LKLL  G+ LQGG+ H  + L+YG+D++ IR  GILQRIA+ Y+VVAL E +  
Sbjct: 115 KALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCE-IWL 173

Query: 192 KRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF---SEHSDHGVKKY 248
           K   NV     LS+   Y++ W+  F+   IY+   Y LYVP+W +    E     +  +
Sbjct: 174 KGNHNV--SSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGSTLTTF 231

Query: 249 IVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
           +VKCG+RGH GP CNAVG +DR   GI HLY  PV++R
Sbjct: 232 LVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYAR 269


>gi|255642425|gb|ACU21476.1| unknown [Glycine max]
          Length = 326

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 111/276 (40%), Positives = 157/276 (56%), Gaps = 15/276 (5%)

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           ++ P    A  K   R L L   GI+LQGGY H   +L++GVD++ IRW GILQRI++ Y
Sbjct: 41  QRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQRIRWLGILQRISIGY 100

Query: 181 VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEH 240
           +V AL E      R      + L    +Y WQW    I   +Y    Y LYVP+W F   
Sbjct: 101 IVAALCEIWLPAPR-----WKELGFVKSYYWQWFVAVILLALYSGLLYGLYVPDWQFDVS 155

Query: 241 SDH-------GVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
           +         G   Y+V C +RG LGPACN+ G +DR + G++HLY  PV+  L+ C +S
Sbjct: 156 ASTSSLPPIGGGDIYMVNCSVRGDLGPACNSAGMIDRYILGLDHLYRKPVYRNLKGCNMS 215

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
           +   G + + +PSWC APF+PEG+LS+I+A +S  IG+ YGHVL H + H  RL +W+  
Sbjct: 216 A--KGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHVLAHLQDHKGRLYNWMCF 273

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
               L + + L     IP+NK LY+ SY+  T+ A+
Sbjct: 274 SLSFLALGLFLALI-GIPLNKSLYTVSYMLLTSAAS 308


>gi|356536971|ref|XP_003537005.1| PREDICTED: uncharacterized protein LOC100781855 [Glycine max]
          Length = 357

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 114/198 (57%), Positives = 138/198 (69%), Gaps = 23/198 (11%)

Query: 1   MADLRIVEEGLGRT----QLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLL 56
           M + + +EEGL  T       E + +  + + NG + E      + +  Q+GE  +QQ++
Sbjct: 1   MDEAKRMEEGLKTTYNDYHKGELKHEIERTNGNGDSIE---HDKDARITQEGE-SVQQIV 56

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDA--GGAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           +Q+               +LM+L DDA  GGAY RIDHSPWNGCTLADFVMPFFLF+VGV
Sbjct: 57  EQEQP-------------LLMVLEDDADAGGAYPRIDHSPWNGCTLADFVMPFFLFVVGV 103

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           AIALALK++PK+  AVK II RTLKLLFWGI+LQGGYSHAPD LSYGVDM+ IRWCGILQ
Sbjct: 104 AIALALKRIPKVKYAVKNIILRTLKLLFWGILLQGGYSHAPDDLSYGVDMRFIRWCGILQ 163

Query: 175 RIALVYVVVALIETLTTK 192
           RIALVY  VALIET TT 
Sbjct: 164 RIALVYCAVALIETYTTN 181



 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 65/103 (63%), Positives = 73/103 (70%), Gaps = 4/103 (3%)

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL 357
           G L+  A  +C          + ISA LSGTIGIHYGHVLIHFKGHS RLK W+ MGF L
Sbjct: 160 GILQRIALVYCAVALIETYTTNCISASLSGTIGIHYGHVLIHFKGHSERLKQWLLMGFLL 219

Query: 358 LIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
           L + ++LHFT AIPINKQLYSFSYVCFTAGAA    GI  S+F
Sbjct: 220 LTLGLMLHFTEAIPINKQLYSFSYVCFTAGAA----GIVFSVF 258


>gi|167522597|ref|XP_001745636.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775985|gb|EDQ89607.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1047

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 137/376 (36%), Positives = 193/376 (51%), Gaps = 55/376 (14%)

Query: 40  SEVQDEQKGELQLQQLLQQKSK-RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGC 98
           S +Q +   +L    +  +  + R++ LD +RGLT+ +MILVD+ G A+  IDH+PWNG 
Sbjct: 583 SNIQSKSNIDLATDPVAPKPPRERLSALDVYRGLTIAVMILVDETGAAFPPIDHAPWNGL 642

Query: 99  TLADFVMPFFLFIVGVAIALALKKVPKING-------AVKKIIFRTLKLLFWGIILQGGY 151
            LAD V+P F FIVGV+IALA K+     G       A KK   R LK LF GI      
Sbjct: 643 HLADTVVPSFDFIVGVSIALAFKRFDLEAGAQGQRWTAFKKATDRFLK-LFGGITFM--- 698

Query: 152 SHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEP---------RH 202
                      D+ +IR  GILQR+A+ Y  VAL+E    +    +            R 
Sbjct: 699 ---------NYDLTNIRIFGILQRVAVCYFAVALMEIFLPRLTGALPADNGTWADWMRRT 749

Query: 203 LSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPAC 262
             +F  Y+W W    +   ++    Y + VP+ +F E           +CG RG L PAC
Sbjct: 750 QHLFWRYRWHWFSAALLLAVHTSILYGVDVPD-AFGE-----------RCG-RGQLTPAC 796

Query: 263 NAVGYVDRELWGINHLY----------SDPVWSRLEACTLSSPNSGPLREDAPSWC-RAP 311
           NA  Y+DR +  + H+Y          +D  + RL  C+  SP       DAP+WC   P
Sbjct: 797 NAATYIDRLILTVPHMYFPENGGDPAHADVTFKRLPECSSCSPGLCVAPADAPAWCLHGP 856

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 371
           F+PEGL+S+++AI++  IG+HYGHVL   K    R+  W S     L++ +ILHF+  IP
Sbjct: 857 FDPEGLVSSLTAIVTTIIGVHYGHVLRQIKSPMERIFQWSSFALLQLLLGLILHFS-GIP 915

Query: 372 INKQLYSFSYVCFTAG 387
           +N  LYS S+V  T G
Sbjct: 916 LNINLYSVSFVLVTGG 931


>gi|238481503|ref|NP_001154766.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008207|gb|AED95590.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 295

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 126/281 (44%), Positives = 172/281 (61%), Gaps = 17/281 (6%)

Query: 13  RTQLVEQEQ-DDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRG 71
           RT+L   E   D  D+++   ++K +E S +Q  +   L   +      +R+ +LD FRG
Sbjct: 2   RTKLTMYEAIKDNDDNDHQWREKKDIE-SALQISRSSSLPPDK------ERLVSLDVFRG 54

Query: 72  LTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK 131
           LTV  MILVDD GG    I+HSPW+G TLADFVMPFFLFIVGV++A A K +     A +
Sbjct: 55  LTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATR 114

Query: 132 KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTT 191
           K + R+LKLL  G+ LQGG+ H  + L+YG+D++ IR  GILQRIA+ Y+VVAL E +  
Sbjct: 115 KALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCE-IWL 173

Query: 192 KRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF---SEHSDHGVKKY 248
           K   NV     LS+   Y++ W+  F+   IY+   Y LYVP+W +    E     +  +
Sbjct: 174 KGNHNV--SSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGSTLTTF 231

Query: 249 I---VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
           +   VKCG+RGH GP CNAVG +DR   GI HLY  PV++R
Sbjct: 232 LNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYAR 272


>gi|413947252|gb|AFW79901.1| hypothetical protein ZEAMMB73_198786 [Zea mays]
          Length = 505

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 81/184 (44%), Positives = 115/184 (62%), Gaps = 5/184 (2%)

Query: 209 YQWQWIGGFIAFVIYIITTYSLYVPNWSFS---EHSDHGVKKYIVKCGMRGHLGPACNAV 265
           Y++Q   G +  + Y I  Y +YVP+W +      S    K + VKCG+RG   PACNAV
Sbjct: 310 YRYQLFVGLVLSIAYSILLYGIYVPDWEYQIAGPGSSSTKKSFFVKCGVRGDTRPACNAV 369

Query: 266 GYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAIL 325
           G VDR + GI+HLY  PV+ R + C++    +GPL  DAPSWC+APF+PEGLLS + AI+
Sbjct: 370 GMVDRTILGIDHLYRRPVYVRTKECSIDYLENGPLPPDAPSWCQAPFDPEGLLSFVMAIV 429

Query: 326 SGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFT 385
           +  IG+ + HV+IHF+ H  R+  W+   F +L +A ++ F   + +NK LY+ SY    
Sbjct: 430 TCLIGLQFRHVIIHFEKHRGRIASWLVPSFSMLALAFVMDFV-GMRMNKPLYTMSYT-LA 487

Query: 386 AGAA 389
           AGAA
Sbjct: 488 AGAA 491


>gi|255635187|gb|ACU17949.1| unknown [Glycine max]
          Length = 217

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 88/196 (44%), Positives = 114/196 (58%), Gaps = 5/196 (2%)

Query: 42  VQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLA 101
           + D Q   L   +  Q ++ R+A+LD FRGL+V LMI VD A   +  I H+PWNG  LA
Sbjct: 1   MADPQPLLLNDSEPTQFQNTRIASLDVFRGLSVFLMIFVDYAASIFPIIAHAPWNGTHLA 60

Query: 102 DFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG 161
           DFVMPFFLFI G+++AL  K+ P    A  K   R L L   GI+LQGGY H   +L++G
Sbjct: 61  DFVMPFFLFIAGISLALVYKRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFG 120

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFV 221
           VD++ IRW GILQRI++ Y+V AL E      R      + L    +Y WQW    I   
Sbjct: 121 VDIQRIRWLGILQRISIGYIVAALCEIWLPAPR-----WKELGFVKSYYWQWFVAVILLA 175

Query: 222 IYIITTYSLYVPNWSF 237
           +Y    Y LYVP+W F
Sbjct: 176 LYSGLLYGLYVPDWQF 191


>gi|413953638|gb|AFW86287.1| hypothetical protein ZEAMMB73_717084 [Zea mays]
          Length = 357

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 90/233 (38%), Positives = 118/233 (50%), Gaps = 40/233 (17%)

Query: 127 NGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI 186
             A KK   R  KL   G+ILQGGY H    L+YGVD+ HIRW G+LQRIA+ Y V A+ 
Sbjct: 5   TAATKKAAIRASKLFILGVILQGGYIHGRHKLTYGVDLDHIRWLGVLQRIAIGYFVAAMS 64

Query: 187 E------TLTTKRRP--------------------------NVLEPRHLSIFTAYQWQWI 214
           E       L     P                          N L    +     Y  +W 
Sbjct: 65  EIWLVNNNLVDSPVPFVKKYFIEWIAIGYFVAAMSEIWLVNNNLVDSPVPFVKKYFIEWF 124

Query: 215 GGFIAFVIYIITTYSLYVPNWSFSEHSDHG--------VKKYIVKCGMRGHLGPACNAVG 266
                 V+Y+   + LYV NW F   + +         ++  +++CG+RG LGP CNAVG
Sbjct: 125 MAIAITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNAVG 184

Query: 267 YVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLS 319
            VDR L G NHLY +PV+ R + C+++SP+ GPL  +AP WC APF+PEGLLS
Sbjct: 185 LVDRVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLS 237


>gi|242007028|ref|XP_002424344.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212507744|gb|EEB11606.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 497

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 115/398 (28%), Positives = 190/398 (47%), Gaps = 50/398 (12%)

Query: 5   RIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVA 64
           R ++  +    +++ E        NG N      ++  +D    + +  +  + K+ R+ 
Sbjct: 53  RQIKSNVTTRFIIDTEFGFESYFSNG-NNSFCTVKNHFEDFGNIDHRNDRYSRIKNSRIK 111

Query: 65  TLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVP 124
           +LDAFRGL +++MI V+  GG Y+   HSPWNG T+ADFV P+F++I+G +  L++    
Sbjct: 112 SLDAFRGLAILIMIFVNYNGGDYSVFKHSPWNGITIADFVFPWFIWIMGASTVLSIDNNF 171

Query: 125 KINGAVKKIIFRTLKLLFW----GIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           +   + K+I FR LK  F+    GI+L  G+  +   L         R CG+LQRI L Y
Sbjct: 172 RRAQSKKEIFFRILKRSFYLIALGIVLNSGHRDSKGFL---------RVCGVLQRIGLTY 222

Query: 181 VVVALIETLTTKRRPNV-LEPRHLSIFTAYQW-QWIGGFIAFVIYIITTYSLYVPNWSFS 238
            ++A +E    K   N    P + S      W QW+   +   I++I T++L+VP     
Sbjct: 223 FIIASLEIFALKSLLNEHFGPWNFSRNIIKIWIQWLVPILLVAIHVIITFTLHVPGCPLG 282

Query: 239 EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSG 298
                G+  +      R   G    A GY+DR +   NH+Y                   
Sbjct: 283 YTGPGGLSNH---SAFRNCTG---GAAGYIDRLIITDNHMYH---------------RGS 321

Query: 299 PLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLL 358
            L+   PS    PF+PEGLL T++++    +G+    +LI+ +   +++K W+   F  +
Sbjct: 322 FLKIFKPS---VPFDPEGLLGTLTSVFCAFLGVQSARILINHENSFSKIKSWI---FWAI 375

Query: 359 IIAIILHF-------TNAIPINKQLYSFSYVCFTAGAA 389
           ++ +I  F       +  IPINK L+S SYV  T+  A
Sbjct: 376 VMGLISGFLCNWSQNSGIIPINKNLWSLSYVLATSSIA 413


>gi|413918234|gb|AFW58166.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
          Length = 202

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 72/127 (56%), Positives = 97/127 (76%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +R+ +LD FRG+TV+LMI+VDDAG     ++HSPW+G T+ADFVMPFFLFIVGVA+ALA 
Sbjct: 53  QRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 112

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           K+VP    A +K + R LKL   G++LQGG+ H   +LS+GVD++ IR  G+LQRIA+ Y
Sbjct: 113 KRVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAY 172

Query: 181 VVVALIE 187
           ++ AL E
Sbjct: 173 LLTALCE 179


>gi|328696746|ref|XP_003240114.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           isoform 2 [Acyrthosiphon pisum]
          Length = 591

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 110/361 (30%), Positives = 180/361 (49%), Gaps = 69/361 (19%)

Query: 51  QLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLF 110
           Q+   ++  S R+ +LD FRG++++LM+ V+  GG Y   +H+PWNG TLADF++P+F +
Sbjct: 193 QIPTPVKNNSYRITSLDTFRGISIILMVFVNLGGGHYWFFEHAPWNGITLADFILPWFCW 252

Query: 111 IVGVAIALALKKVPKINGAVK----KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKH 166
           ++GV+IA++L+   + +   K    ++I R++ LL  G++L               +++ 
Sbjct: 253 VMGVSIAISLRSQLRSSTKRKYVFGRVIRRSIALLIMGLVL---------NSVNNNNLRT 303

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSI--FTAYQWQWIGGFIAFVIYI 224
            R  G+LQR+AL+Y + A +ET+  K +P     R   I        QW    I   I+ 
Sbjct: 304 FRPLGVLQRLALIYFIAATLETIFMKPQPYFTNTRLDVIRDIIESARQWFIVIILVAIHT 363

Query: 225 ITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPA----------CN--AVGYVDREL 272
           + T+ L VP          G  K        G+LGP           C   A GY+DR +
Sbjct: 364 VITFFLPVP----------GCPK--------GYLGPGGLYNSSSNTNCTGGAAGYIDRLV 405

Query: 273 WGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIH 332
           +G NH+Y              SP         P +   PF+PEG+LST++  L   +G+H
Sbjct: 406 FGENHMYP------------GSP--------KPVYQSIPFDPEGILSTLTNTLLVYMGVH 445

Query: 333 YGHVLIHFKGHSARLKHWVSMGFGL-LIIAIILHFTNA---IPINKQLYSFSYVCFTAGA 388
            G +++ ++  + R+K W++    L LI   + +F+     IPINK L+S SY   T  +
Sbjct: 446 AGRIILCYQYTNERIKRWIAWTIVLGLIGGCLCNFSKEDGLIPINKNLFSLSYAFITGSS 505

Query: 389 A 389
           A
Sbjct: 506 A 506


>gi|193664422|ref|XP_001945789.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           isoform 1 [Acyrthosiphon pisum]
          Length = 568

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 110/361 (30%), Positives = 180/361 (49%), Gaps = 69/361 (19%)

Query: 51  QLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLF 110
           Q+   ++  S R+ +LD FRG++++LM+ V+  GG Y   +H+PWNG TLADF++P+F +
Sbjct: 170 QIPTPVKNNSYRITSLDTFRGISIILMVFVNLGGGHYWFFEHAPWNGITLADFILPWFCW 229

Query: 111 IVGVAIALALKKVPKINGAVK----KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKH 166
           ++GV+IA++L+   + +   K    ++I R++ LL  G++L               +++ 
Sbjct: 230 VMGVSIAISLRSQLRSSTKRKYVFGRVIRRSIALLIMGLVL---------NSVNNNNLRT 280

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSI--FTAYQWQWIGGFIAFVIYI 224
            R  G+LQR+AL+Y + A +ET+  K +P     R   I        QW    I   I+ 
Sbjct: 281 FRPLGVLQRLALIYFIAATLETIFMKPQPYFTNTRLDVIRDIIESARQWFIVIILVAIHT 340

Query: 225 ITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPA----------CN--AVGYVDREL 272
           + T+ L VP          G  K        G+LGP           C   A GY+DR +
Sbjct: 341 VITFFLPVP----------GCPK--------GYLGPGGLYNSSSNTNCTGGAAGYIDRLV 382

Query: 273 WGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIH 332
           +G NH+Y              SP         P +   PF+PEG+LST++  L   +G+H
Sbjct: 383 FGENHMYP------------GSP--------KPVYQSIPFDPEGILSTLTNTLLVYMGVH 422

Query: 333 YGHVLIHFKGHSARLKHWVSMGFGL-LIIAIILHFTNA---IPINKQLYSFSYVCFTAGA 388
            G +++ ++  + R+K W++    L LI   + +F+     IPINK L+S SY   T  +
Sbjct: 423 AGRIILCYQYTNERIKRWIAWTIVLGLIGGCLCNFSKEDGLIPINKNLFSLSYAFITGSS 482

Query: 389 A 389
           A
Sbjct: 483 A 483


>gi|167538367|ref|XP_001750848.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770669|gb|EDQ84352.1| predicted protein [Monosiga brevicollis MX1]
          Length = 779

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 114/422 (27%), Positives = 190/422 (45%), Gaps = 89/422 (21%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAG 84
           K+ E G      L+ + V     G   +Q + ++ S+R+ +LD+FRG  + +MI V+  G
Sbjct: 304 KEGEPGETTGLLLQHATVMPSDAGMHAIQDM-KRSSQRLRSLDSFRGFALTIMIFVNFNG 362

Query: 85  GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALA----LKKVPKINGAVKKIIFRTLKL 140
           G Y   +HS WNG T+AD V P+F++I+G ++A+A    LK+       + KI  R L L
Sbjct: 363 GFYWFFNHSAWNGLTVADLVFPWFIWIMGTSMAIAFNSLLKRQTPTTTILYKIFRRMLIL 422

Query: 141 LFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI-----------ETL 189
             +GI + G +           D+++ R  G+LQR A+ Y+VVAL+            ++
Sbjct: 423 FAFGIFIIGNFH----------DLRNGRIPGVLQRFAVSYLVVALVMLYAPKMESWCASV 472

Query: 190 TTKRRPNVLEPR-----------------------------HLSIFTAYQWQWIGGFIAF 220
           +T   P     R                             H    T Y W+W+  F+  
Sbjct: 473 STSDSPTPALVRGIAKPGSGHQLDVAADIAEMKPWVRTFLLHTRDLTPYIWEWVAMFVII 532

Query: 221 VIYIITTYSLYVPNW---------SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRE 271
           +I+   T+ L VP           + +E+      +  V CG          A GY+DR+
Sbjct: 533 IIHTCITFLLPVPGCPTGYIGPGGALAEYGQFAPPEGEV-CG-ESTFCCEGGASGYIDRQ 590

Query: 272 LWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGI 331
           ++G  H+Y               P S P+ E        P++PEGLL ++++I+   +G+
Sbjct: 591 VFGWRHIYDQ-------------PTSQPIYE------TGPYDPEGLLGSLTSIVMCFLGL 631

Query: 332 HYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL----HFTNAIPINKQLYSFSYVCFTAG 387
             G +++H+K H+ R +HW+     L +IA  L         IP++K L+S S++   A 
Sbjct: 632 QSGKIIVHYKSHAQRSRHWLMWALVLGVIATGLCGASQNNGVIPVSKNLWSLSFIILLAS 691

Query: 388 AA 389
            A
Sbjct: 692 FA 693


>gi|384249073|gb|EIE22555.1| hypothetical protein COCSUDRAFT_42235 [Coccomyxa subellipsoidea
           C-169]
          Length = 395

 Score =  147 bits (372), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 111/341 (32%), Positives = 160/341 (46%), Gaps = 62/341 (18%)

Query: 74  VVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGA---- 129
           + LM+ V+ AG     + H+ W+G  LAD VMP FL +VGV++AL+L   P+ +G     
Sbjct: 1   MALMLFVNHAGHEVPWVAHAAWDGVHLADLVMPCFLLLVGVSVALSLG--PRASGPRRPL 58

Query: 130 VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL 189
           ++K++ RT KL   G+++QGG            D+  +R+CG+LQRIAL + +V+L+   
Sbjct: 59  LRKVLARTGKLAGLGLLIQGGVGAGAFP---AWDLSRLRYCGVLQRIALCFALVSLVVLY 115

Query: 190 TTKRRPNVLEPRHLSI-------------FTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
                P    PR  S+             F  Y   WI G   FV +      L  P   
Sbjct: 116 L----PQTPSPRLQSLLDRGDESASLMAPFRFYALWWILGTALFVAFNWMALFLRPPG-- 169

Query: 237 FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPN 296
                          C  R  L   CN   YVD  L G +HLY  P      +C  ++P 
Sbjct: 170 ---------------CLARPALTADCNVAAYVDARLLGRSHLYPWP------SCRRANPP 208

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGT-IGIHYGHVLIHFKGHSARLKHWVSMGF 355
              L            +PEGL +T+S  L+ T +G+ +G VL+  +GH ARL+ W     
Sbjct: 209 CEYL------------DPEGLFATLSGALASTFLGLWFGAVLLTLRGHRARLRSWAYASV 256

Query: 356 GLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGIT 396
            L  + + LH T A+P NK LYS S V  TAG++   LG+ 
Sbjct: 257 LLTELGLALHVTGAVPFNKNLYSASSVLLTAGSSGAFLGLV 297


>gi|391346547|ref|XP_003747534.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Metaseiulus occidentalis]
          Length = 564

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 111/386 (28%), Positives = 179/386 (46%), Gaps = 49/386 (12%)

Query: 18  EQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSK-RVATLDAFRGLTVVL 76
           E+ +     S+ G  +   L   E      G    +  + +  K R+ +LDAFRG  + L
Sbjct: 131 EKRRLRQLRSDTGDAETSSLGPLEASSSTAGSRPPEDGIGKAGKPRIKSLDAFRGFCLFL 190

Query: 77  MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL----KKVPKINGAVKK 132
           MI V+  GG     +H PW+G T AD + P+F++I+GV++A++L    +K   ++    K
Sbjct: 191 MIFVNYGGGGLWLFEHIPWDGLTFADLLFPWFVWIMGVSMAISLRSMRRKCVPLSEIFFK 250

Query: 133 IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK 192
           I+ R++KL   G+IL        +++    D+  +R  G+LQR A+ Y VVA +    ++
Sbjct: 251 ILSRSVKLFLLGLIL--------NSMGKNNDISKLRIPGVLQRFAVSYFVVASMHMFFSR 302

Query: 193 RRPNVLEPRHLSIF-TAYQWQ-WIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIV 250
                   +   I   A  WQ W+       I+++ T+ L VP+           K Y+ 
Sbjct: 303 ATDAAETAKWAKIRDVALYWQEWVMMISLVAIHVLLTFLLDVPDCP---------KGYLG 353

Query: 251 KCGMR---GHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
             G+     H      A GY+DR + G NH+Y  P   ++   +                
Sbjct: 354 PGGLHENGTHFNCTGGAAGYIDRVVLGPNHMYGHPTTEKIYETS---------------- 397

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL-LIIAIILHF 366
              PF+PEG+L  +++I    +G+  G +L+ F     RL  W+  G  L L+  I+  F
Sbjct: 398 --QPFDPEGVLGCLTSIFLTFLGLQAGKILLTFNNPGRRLSRWICWGVLLGLLAGILCGF 455

Query: 367 TNA---IPINKQLYSFSYVCFTAGAA 389
           +     IPINK L+S SYV  TAG A
Sbjct: 456 SKEDGWIPINKNLWSLSYVLCTAGLA 481


>gi|443694948|gb|ELT95966.1| hypothetical protein CAPTEDRAFT_92095 [Capitella teleta]
          Length = 431

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 114/381 (29%), Positives = 183/381 (48%), Gaps = 53/381 (13%)

Query: 26  DSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG 85
           +S   I+  K  + +    E+K E +L    + K +R+ +LDAFRGL ++LMI V+  GG
Sbjct: 2   ESNVSISSAK-TDSTRRNSEEKDEGKLITPKEVKKERLRSLDAFRGLNILLMIFVNYGGG 60

Query: 86  AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL-----KKVPKINGAVKKIIFRTLKL 140
            Y    H+ WNG  + D + P+F+FI+G ++ L +     K+V  + G +  II+R++KL
Sbjct: 61  GYWYFSHAVWNGLYITDLIFPWFIFIMGTSLGLGISSLVKKEVDPVEG-LWGIIWRSVKL 119

Query: 141 LFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI----ETLT--TKRR 194
              GI+     S+         D+++IR  G+LQR+A+VY + A++    E+L    + R
Sbjct: 120 FAVGIMYNTKSSN---------DLENIRMTGVLQRLAMVYFITAIVHYAGESLQCCMRSR 170

Query: 195 PNVLEPRH-LSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCG 253
             V   RH LS    Y  +WI   +   IY   TY   VP          G+ +      
Sbjct: 171 GTVSRWRHILSDLAPYFGEWITMLVIIGIYCYFTYWFAVPGCEAGYVGPGGLHR------ 224

Query: 254 MRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFE 313
              H G    A  Y+D +++ + H+Y  P     +  T+   +S              F+
Sbjct: 225 DGAHAGCTGGAALYIDLKVYTMRHIYQWP-----DIRTIYQTDSA-------------FD 266

Query: 314 PEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW-----VSMGFGLLIIAIILHFTN 368
           PEGLL T+++I    +G+  G +L+  KGH  RL  W     ++ G G L+    L    
Sbjct: 267 PEGLLGTLTSIFLCFLGLQAGKILVCHKGHRERLVRWLIWAIITGGIGTLLCKAQLE-DG 325

Query: 369 AIPINKQLYSFSYVCFTAGAA 389
            +PINK + S S+V  +AG  
Sbjct: 326 WVPINKNIMSISFVLVSAGTG 346


>gi|242022263|ref|XP_002431560.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212516863|gb|EEB18822.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 607

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 112/384 (29%), Positives = 177/384 (46%), Gaps = 67/384 (17%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAG 84
           K + N +N        E+ D    +   Q   +    R+ ++D FRGL VVLMI V+D  
Sbjct: 169 KKTSNKVN-------CEMSDYGGDDRTTQASSKPARHRIKSIDTFRGLAVVLMIFVNDGA 221

Query: 85  GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWG 144
           G Y  ++H+ WNG  +ADFV P+FL+++G+ I ++++   K N +  KI+   +K    G
Sbjct: 222 GHYWFLEHATWNGILVADFVFPWFLWVMGLCIPISIRTQLKRNVSRWKILGHVIKR---G 278

Query: 145 IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRR-PNVLEPRHL 203
           I+L G      + +  G D++ IR  G+LQR ++VY+++A++    T R   N       
Sbjct: 279 ILLFG-LGVLLNTVGIGSDLETIRIPGVLQRFSIVYLIIAILGVCFTPRSISNENRFPGS 337

Query: 204 SIFTAYQ------WQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGH 257
           S    +Q       QWI      ++ I+  Y  +V    FS           V     G+
Sbjct: 338 SFRETFQDIIIIFPQWI-----VILSIVAAYCYFV---FFSP----------VPGCPSGY 379

Query: 258 LGPA----------CNA--VGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAP 305
           LGP           C     GYVD+ L G+ H+Y +P  S++                  
Sbjct: 380 LGPGGIQDGGRFNECTGGMTGYVDKVLLGVEHIYKNPTSSKV------------------ 421

Query: 306 SWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILH 365
            +   PF+PEGLL  + +I     G+  G  L++     A+L  W + G    I+A++L 
Sbjct: 422 -YKSGPFDPEGLLGVMPSIFQAFFGVQAGATLLYHPEWKAKLIRWFTWGILNGILALLLS 480

Query: 366 FTNAIPINKQLYSFSYVCFTAGAA 389
               +PINK L+S SYV  T  +A
Sbjct: 481 LPGIVPINKNLWSLSYVFTTTSSA 504


>gi|350423601|ref|XP_003493532.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Bombus impatiens]
          Length = 565

 Score =  141 bits (355), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 97/351 (27%), Positives = 159/351 (45%), Gaps = 49/351 (13%)

Query: 46  QKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVM 105
            K ++    + Q   +RV  +D  RG + +LMI V+D  G Y  + H+ WNG    D + 
Sbjct: 166 MKSQVDDGAMKQPAKRRVKAIDTVRGASTLLMIFVNDGSGGYRTLGHATWNGLLPGDLLF 225

Query: 106 PFFLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYG 161
           P F++I+GV I +A+    K     ++I++    R++ L   G+ L        + +S G
Sbjct: 226 PCFIWIMGVCIPIAMSSQMKRMTPKRQILYGIVKRSILLFLIGLSL--------NTVSTG 277

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEP---RHLSIFTAYQWQWIGGFI 218
             ++ IR  G+LQR  + Y VVAL+  L   RRP  ++    R +  F     QW    +
Sbjct: 278 GQLETIRIFGVLQRFGITYFVVALLYFLLMSRRPRKIQSPMLREVQDFLLLLPQWCVMLV 337

Query: 219 AFVIYIITTYSLYVPNWS---FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGI 275
             V++ + T+ L VP            H   KY    G          A GY+DR +   
Sbjct: 338 IVVVHCVITFCLNVPGCPTGYLGPGGLHDDAKYFDCVG---------GAAGYIDRVILKE 388

Query: 276 NHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGH 335
            HL+                        A  +   P++PEG+L T++A     +G+H G 
Sbjct: 389 AHLH----------------------HSATVYKSGPYDPEGILGTLTAAFQVFLGLHAGI 426

Query: 336 VLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTA 386
           +++ +K    R+  W++       +  +LHFTN IP+NK+L+S S+V  T 
Sbjct: 427 IMMTYKDWKERVIRWLAWAAFFGCVGCVLHFTNVIPVNKKLWSLSFVFVTT 477


>gi|307178470|gb|EFN67159.1| Heparan-alpha-glucosaminide N-acetyltransferase [Camponotus
           floridanus]
          Length = 512

 Score =  141 bits (355), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 97/348 (27%), Positives = 154/348 (44%), Gaps = 46/348 (13%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           K +RV  +D FRG+  + MI V+D  G+Y  + H+ WNG  L D V P F++I+GV + +
Sbjct: 119 KHRRVKAIDTFRGVCTLFMIFVNDGSGSYTTLGHATWNGMLLGDLVFPCFMWIMGVCVPI 178

Query: 119 ALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIAL 178
           AL    K   +  +I F   K  F   ++      A + L     +++IR  G+LQR  +
Sbjct: 179 ALSAQLKRGLSKLEISFSIFKRSFLLFLI----GIALNTLGTNAQLENIRIFGVLQRFGI 234

Query: 179 VYVVVALIETLTTKRRPNVLEPRHLSIFT-------AYQWQWIGGFIAFVIYIITTYSLY 231
            Y++V+L+    T ++P V +    +  T       +    W       +++   T+ L 
Sbjct: 235 TYLIVSLLYLCFTPQQPKVAQNLSQTWMTHKMQDILSLLPHWCIMLTLVMVHCAVTFCLP 294

Query: 232 VPNWS---FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
           +P            H   KY    G          A GY+DR L  ++H+Y  P    + 
Sbjct: 295 IPGCPTGYLGPGGRHEDGKYFNCTG---------GATGYIDRILLTLSHIYQWPTIDSIY 345

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                                 PF+PEG+L  +++I    +G+H G +L+ +KG   R+ 
Sbjct: 346 G-------------------SGPFDPEGILGCLTSIFQVFLGVHTGVILMMYKGWKERII 386

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYV----CFTAGAADGC 392
            W+        +  I HFTN IPINK L+S S+V    CF      GC
Sbjct: 387 RWLVWAVFYGCLGCIFHFTNIIPINKNLWSLSFVLVSTCFALAFLSGC 434


>gi|307201549|gb|EFN81312.1| Heparan-alpha-glucosaminide N-acetyltransferase [Harpegnathos
           saltator]
          Length = 564

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 173/386 (44%), Gaps = 64/386 (16%)

Query: 30  GINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR 89
           GI+  + L R+  +   KG  +     +   +RV  +DAFRG + + MI V+D  G+Y+ 
Sbjct: 142 GISAGRFLWRTYRRRYGKGGKEEATNKEPTKRRVKAIDAFRGASTLFMIFVNDGSGSYSV 201

Query: 90  IDHSPWNGCTLADFVMPFFLFIVGVAIALAL-----KKVPKINGAVKKIIFRTLKLLFWG 144
           + H+ WNG    D V P F++I+GV + +AL     + +PK+  A   ++ R+  L   G
Sbjct: 202 LGHTTWNGMLPGDLVFPCFMWIMGVCVPIALSAQLRRGIPKLEIAFT-VLKRSFLLFLIG 260

Query: 145 IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI------------ETLTTK 192
           + L        + L     ++ IR  G+LQR  + Y+VV+++            +  +  
Sbjct: 261 VSL--------NTLGTNAQLEKIRVFGVLQRFGVTYLVVSVMYLCLEPSLQLQDQDSSRN 312

Query: 193 RRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK--YIV 250
           R   VL  R + +   Y   W    I  +++   T+ L VPN         G  +  Y +
Sbjct: 313 RVTRVL--RDMQVLLPY---WSFMLILVMVHCGLTFGLAVPNCPTGYLGPGGTHEDGYYM 367

Query: 251 KCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRA 310
            C           A GY+DR +  INH+++ P                     A  +   
Sbjct: 368 NC--------TGGAAGYIDRVVLTINHIFAGP-------------------TIASVYGSG 400

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           PF+PEG+L  ++A     +G+H G +L+ +K    R+  W+S      ++  ILHF N I
Sbjct: 401 PFDPEGILGCLTATFQVYLGVHAGVILMMYKNWKERVVRWLSWAVLYGVLGCILHFCNVI 460

Query: 371 PINKQLYSFSYV----CFTAGAADGC 392
           P+NK L+S S+V     F      GC
Sbjct: 461 PVNKNLWSLSFVFVSTSFALAFLSGC 486


>gi|395507548|ref|XP_003758085.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Sarcophilus harrisii]
          Length = 634

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 121/385 (31%), Positives = 185/385 (48%), Gaps = 61/385 (15%)

Query: 20  EQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQ-LLQQKSKRVATLDAFRGLTVVLMI 78
           ++ + ++++  IN E G       +    +LQ++   L     R+ +LD FRG+ +++M+
Sbjct: 197 KKMNPRETDRLINSELG--SPTRAESYSSDLQVEAWRLTLPVYRLRSLDTFRGIALIIMV 254

Query: 79  LVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KI 133
            V+  GG Y    H  WNG TLAD V P+F+FI+G +IAL+L  + +  G  K     KI
Sbjct: 255 FVNYGGGKYWFFKHESWNGLTLADLVFPWFVFIMGSSIALSLSSMLR-RGCSKWKLLGKI 313

Query: 134 IFRTLKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK 192
           ++R+L L   GI I+   Y   P      +    +R  G+LQR+ L Y+VVA++E L  K
Sbjct: 314 LWRSLLLCVIGIFIVNPNYCLGP------LSWDKLRIPGVLQRLGLTYLVVAVLELLFAK 367

Query: 193 RRP-NVLEPRHLSIFT---AYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKY 248
             P N    R  S F    +Y  QWI   +    ++  T+ L VP          G    
Sbjct: 368 AVPENSAMERSCSSFQDIISYWPQWIFILMLEAAWVCVTFLLPVP----------GCPTG 417

Query: 249 IVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDA 304
            +  G  G  G  P C   A GY+DR L G +H+Y  P           SPN        
Sbjct: 418 YLGPGGIGDFGKYPNCTGGAAGYIDRLLLGEDHIYQHP-----------SPN-------V 459

Query: 305 PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLII 360
               + P++PEGLL TI++I+   +G+  G +L+ +K        R   W +M  G +I 
Sbjct: 460 LYHTKVPYDPEGLLGTINSIVMAFLGVQAGKILLFYKDQPKQIMLRFLLWSAM-LG-IIS 517

Query: 361 AIILHFTNA---IPINKQLYSFSYV 382
            ++  F+     IP+NK L+S SYV
Sbjct: 518 GVLTKFSQNEGFIPVNKNLWSISYV 542


>gi|449500329|ref|XP_004174928.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
           N-acetyltransferase-like [Taeniopygia guttata]
          Length = 789

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 109/344 (31%), Positives = 174/344 (50%), Gaps = 59/344 (17%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +R+ +LD FRGL++V+M+ V+  GG Y    H  WNG T+AD V P+F+FI+G +IALAL
Sbjct: 399 QRLRSLDTFRGLSLVIMVFVNYGGGKYWFFKHVSWNGLTVADLVFPWFVFIMGTSIALAL 458

Query: 121 KKVPKINGA----VKKIIFRTLKLLFWGII-LQGGYSHAPDALSYGVDMKHIRWCGILQR 175
             + +   +    ++KII+R+  L+  GII +   Y   P      +   ++R  G+LQR
Sbjct: 459 GSMLRWGSSKWKVLRKIIWRSFVLILLGIIVVNPNYCLGP------LSWDNLRIPGVLQR 512

Query: 176 IALVYVVVALIETLTTK--RRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           +   Y+VVA +E L T+  RR   L+         Y  QWI   +   I++  T+ L VP
Sbjct: 513 LGFTYLVVAALELLFTRADRRFPALQD-----ILPYWPQWIFILVLETIWLCLTFLLPVP 567

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEA 289
           +           + Y+   G+ G  G  P C   A GY+DR + G  H+Y          
Sbjct: 568 DCP---------RGYLGPGGI-GDFGKYPNCTGGAAGYIDRLILGEKHMYQH-------- 609

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR-LK 348
                P+SG   +        P++PEG+L TI++I+   +G+  G + + +K H  + + 
Sbjct: 610 -----PSSGVTYQST-----MPYDPEGILGTINSIVMAFLGLQAGKITLFYKDHPKQIMS 659

Query: 349 HWVSMGFGLLIIAIILHFTNA----IPINKQLYSFSYV----CF 384
            ++  G  + +I+ IL   +     IP+NK L+S SYV    CF
Sbjct: 660 RFIIWGIVMGVISAILTKCSKEEGFIPVNKNLWSISYVTTMSCF 703


>gi|10177926|dbj|BAB11337.1| unnamed protein product [Arabidopsis thaliana]
          Length = 384

 Score =  137 bits (346), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 55/92 (59%), Positives = 74/92 (80%)

Query: 250 VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCR 309
           VKCG+RGH GP CNAVG +DR   GI HLY  PV++R + C+++ PN+GPL  DAPSWC+
Sbjct: 261 VKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPSWCQ 320

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFK 341
           APF+PEGLLS++ A ++  +G+HYGH++IHFK
Sbjct: 321 APFDPEGLLSSLMATVTCLVGLHYGHIIIHFK 352



 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 76/167 (45%), Positives = 106/167 (63%), Gaps = 8/167 (4%)

Query: 13  RTQLVEQEQ-DDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRG 71
           RT+L   E   D  D+++   ++K +E S +Q  +   L   +      +R+ +LD FRG
Sbjct: 102 RTKLTMYEAIKDNDDNDHQWREKKDIE-SALQISRSSSLPPDK------ERLVSLDVFRG 154

Query: 72  LTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK 131
           LTV  MILVDD GG    I+HSPW+G TLADFVMPFFLFIVGV++A A K +     A +
Sbjct: 155 LTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATR 214

Query: 132 KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIAL 178
           K + R+LKLL  G+ LQGG+ H  + L+YG+D++ IR  GILQ + +
Sbjct: 215 KALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQNLKV 261


>gi|340727662|ref|XP_003402158.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Bombus terrestris]
          Length = 554

 Score =  137 bits (345), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 97/341 (28%), Positives = 157/341 (46%), Gaps = 51/341 (14%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           Q   +RV  +D  RG + +LMI V+D  G Y  + H+ WNG    D + P F++I+GV I
Sbjct: 166 QPAKRRVKAIDTVRGASTLLMIFVNDGSGGYRTLGHATWNGLLPGDLLFPCFIWIMGVCI 225

Query: 117 ALAL----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
            +A+    K++   +  +  I+ R++ L   G+ L        + +S G  ++ IR  G+
Sbjct: 226 PIAMSSQMKRMTLKHQILYGIVKRSILLFLIGLSL--------NTVSTGGQLETIRIFGV 277

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEP---RHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
           LQR  + Y+VVAL+  L   RRP+ ++    R +  F     QW    +  V++   T+ 
Sbjct: 278 LQRFGITYLVVALLYFLLMSRRPSKIQSPMLREVQDFLLLLPQWCVMLVIVVVHCAITFC 337

Query: 230 LYVPNWS---FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHL-YSDPVWS 285
           L VP            H   KY    G          A GY+DR +    HL YS  V+ 
Sbjct: 338 LNVPGCPTGYLGPGGLHDDAKYFDCVG---------GAAGYIDRMILKEAHLHYSATVYK 388

Query: 286 RLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA 345
                                    P++PEG+L T++      +G+H G +++ +K    
Sbjct: 389 S-----------------------GPYDPEGILGTLTTAFQVFLGLHAGIIMMTYKDWKE 425

Query: 346 RLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTA 386
           R+  W++       +  +LHFTN IP+NK+L+S S+V  T 
Sbjct: 426 RVIRWLAWAAFFGCVGCVLHFTNVIPVNKKLWSLSFVFVTT 466


>gi|26330552|dbj|BAC29006.1| unnamed protein product [Mus musculus]
 gi|74213594|dbj|BAE35603.1| unnamed protein product [Mus musculus]
 gi|74225342|dbj|BAE31601.1| unnamed protein product [Mus musculus]
          Length = 624

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 120/395 (30%), Positives = 185/395 (46%), Gaps = 73/395 (18%)

Query: 16  LVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVV 75
           +  +E D   +SE G         ++ Q E +         +  + R+  +D FRGL +V
Sbjct: 191 IASRETDRLINSELGSPSRADPLSADYQPETR---------RSSANRLRCVDTFRGLALV 241

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK---- 131
           LM+ V+  GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K    
Sbjct: 242 LMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQ-RGCSKLKLL 300

Query: 132 -KIIFRTLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL 189
            KI++R+  L+  G II+   Y   P      +    +R  G+LQR+ + Y VVA++E  
Sbjct: 301 GKIVWRSFLLICIGVIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLEFF 354

Query: 190 TTKRRPN--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV 245
             K  P+   LE    S+   T+   QW+       I++  T+ L VP          G 
Sbjct: 355 FWKPVPDSCTLESSCFSLRDITSSWPQWLTILTLESIWLALTFFLPVP----------GC 404

Query: 246 KKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLR 301
               +  G  G LG  P C   A GY+DR L G NHLY  P  + L    ++        
Sbjct: 405 PTGYLGPGGIGDLGKYPHCTGGAAGYIDRLLLGDNHLYQHPSSTVLYHTEVA-------- 456

Query: 302 EDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS----ARLKHWVSMGFGL 357
                     ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +   L
Sbjct: 457 ----------YDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCI---L 503

Query: 358 LIIAIILHFTNA----IPINKQLYSFSYV----CF 384
            +I+I+L   +A    IPINK L+S SYV    CF
Sbjct: 504 GLISIVLTKVSANEGFIPINKNLWSISYVTTLSCF 538


>gi|23272280|gb|AAH24084.1| Hgsnat protein [Mus musculus]
 gi|148700869|gb|EDL32816.1| DNA segment, Chr 8, ERATO Doi 354, expressed [Mus musculus]
          Length = 624

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 120/395 (30%), Positives = 185/395 (46%), Gaps = 73/395 (18%)

Query: 16  LVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVV 75
           +  +E D   +SE G         ++ Q E +         +  + R+  +D FRGL +V
Sbjct: 191 IASRETDRLINSELGSPSRADPLSADYQPETR---------RSSANRLRCVDTFRGLALV 241

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK---- 131
           LM+ V+  GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K    
Sbjct: 242 LMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQ-RGCSKLKLL 300

Query: 132 -KIIFRTLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL 189
            KI++R+  L+  G II+   Y   P      +    +R  G+LQR+ + Y VVA++E  
Sbjct: 301 GKIVWRSFLLICIGVIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLEFF 354

Query: 190 TTKRRPN--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV 245
             K  P+   LE    S+   T+   QW+       I++  T+ L VP          G 
Sbjct: 355 FWKPVPDSCTLESSCFSLRDITSSWPQWLTILTLESIWLALTFFLPVP----------GC 404

Query: 246 KKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLR 301
               +  G  G LG  P C   A GY+DR L G NHLY  P  + L    ++        
Sbjct: 405 PTGYLGPGGIGDLGKYPHCTGGAAGYIDRLLLGDNHLYQHPSSTVLYHTEVA-------- 456

Query: 302 EDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS----ARLKHWVSMGFGL 357
                     ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +   L
Sbjct: 457 ----------YDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCI---L 503

Query: 358 LIIAIILHFTNA----IPINKQLYSFSYV----CF 384
            +I+I+L   +A    IPINK L+S SYV    CF
Sbjct: 504 GLISIVLTKVSANEGFIPINKNLWSISYVTTLSCF 538


>gi|74208071|dbj|BAE29143.1| unnamed protein product [Mus musculus]
          Length = 656

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 120/395 (30%), Positives = 185/395 (46%), Gaps = 73/395 (18%)

Query: 16  LVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVV 75
           +  +E D   +SE G         ++ Q E +         +  + R+  +D FRGL +V
Sbjct: 223 IASRETDRLINSELGSPSRADPLSADYQPETR---------RSSANRLRCVDTFRGLALV 273

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK---- 131
           LM+ V+  GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K    
Sbjct: 274 LMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQ-RGCSKFKLL 332

Query: 132 -KIIFRTLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL 189
            KI++R+  L+  G II+   Y   P      +    +R  G+LQR+ + Y VVA++E  
Sbjct: 333 GKIVWRSFLLICIGVIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLEFF 386

Query: 190 TTKRRPN--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV 245
             K  P+   LE    S+   T+   QW+       I++  T+ L VP          G 
Sbjct: 387 FWKPVPDSCTLESSCFSLRDITSSWPQWLTILTLESIWLALTFFLPVP----------GC 436

Query: 246 KKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLR 301
               +  G  G LG  P C   A GY+DR L G NHLY  P  + L    ++        
Sbjct: 437 PTGYLGPGGIGDLGKYPHCTGGAAGYIDRLLLGDNHLYQHPSSTVLYHTEVA-------- 488

Query: 302 EDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS----ARLKHWVSMGFGL 357
                     ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +   L
Sbjct: 489 ----------YDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCI---L 535

Query: 358 LIIAIILHFTNA----IPINKQLYSFSYV----CF 384
            +I+I+L   +A    IPINK L+S SYV    CF
Sbjct: 536 GLISIVLTKVSANEGFIPINKNLWSISYVTTLSCF 570


>gi|115292433|ref|NP_084160.1| heparan-alpha-glucosaminide N-acetyltransferase [Mus musculus]
 gi|341940800|sp|Q3UDW8.2|HGNAT_MOUSE RecName: Full=Heparan-alpha-glucosaminide N-acetyltransferase;
           AltName: Full=Transmembrane protein 76
          Length = 656

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 120/395 (30%), Positives = 185/395 (46%), Gaps = 73/395 (18%)

Query: 16  LVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVV 75
           +  +E D   +SE G         ++ Q E +         +  + R+  +D FRGL +V
Sbjct: 223 IASRETDRLINSELGSPSRADPLSADYQPETR---------RSSANRLRCVDTFRGLALV 273

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK---- 131
           LM+ V+  GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K    
Sbjct: 274 LMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQ-RGCSKLKLL 332

Query: 132 -KIIFRTLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL 189
            KI++R+  L+  G II+   Y   P      +    +R  G+LQR+ + Y VVA++E  
Sbjct: 333 GKIVWRSFLLICIGVIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLEFF 386

Query: 190 TTKRRPN--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV 245
             K  P+   LE    S+   T+   QW+       I++  T+ L VP          G 
Sbjct: 387 FWKPVPDSCTLESSCFSLRDITSSWPQWLTILTLESIWLALTFFLPVP----------GC 436

Query: 246 KKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLR 301
               +  G  G LG  P C   A GY+DR L G NHLY  P  + L    ++        
Sbjct: 437 PTGYLGPGGIGDLGKYPHCTGGAAGYIDRLLLGDNHLYQHPSSTVLYHTEVA-------- 488

Query: 302 EDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS----ARLKHWVSMGFGL 357
                     ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +   L
Sbjct: 489 ----------YDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCI---L 535

Query: 358 LIIAIILHFTNA----IPINKQLYSFSYV----CF 384
            +I+I+L   +A    IPINK L+S SYV    CF
Sbjct: 536 GLISIVLTKVSANEGFIPINKNLWSISYVTTLSCF 570


>gi|74198170|dbj|BAE35261.1| unnamed protein product [Mus musculus]
          Length = 624

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 120/395 (30%), Positives = 185/395 (46%), Gaps = 73/395 (18%)

Query: 16  LVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVV 75
           +  +E D   +SE G         ++ Q E +         +  + R+  +D FRGL +V
Sbjct: 191 IASRETDRLINSELGSPSRGDPLSADYQPETR---------RSSANRLRCVDTFRGLALV 241

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK---- 131
           LM+ V+  GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K    
Sbjct: 242 LMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQ-RGCSKLKLL 300

Query: 132 -KIIFRTLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL 189
            KI++R+  L+  G II+   Y   P      +    +R  G+LQR+ + Y VVA++E  
Sbjct: 301 GKIVWRSFLLICIGVIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLEFF 354

Query: 190 TTKRRPN--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV 245
             K  P+   LE    S+   T+   QW+       I++  T+ L VP          G 
Sbjct: 355 FWKPVPDSCTLESSCFSLRDITSSWPQWLTILTLESIWLALTFFLPVP----------GC 404

Query: 246 KKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLR 301
               +  G  G LG  P C   A GY+DR L G NHLY  P  + L    ++        
Sbjct: 405 PTGYLGPGGIGDLGKYPHCTGGAAGYIDRLLLGDNHLYQHPSSTVLYHTEVA-------- 456

Query: 302 EDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS----ARLKHWVSMGFGL 357
                     ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +   L
Sbjct: 457 ----------YDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCI---L 503

Query: 358 LIIAIILHFTNA----IPINKQLYSFSYV----CF 384
            +I+I+L   +A    IPINK L+S SYV    CF
Sbjct: 504 GLISIVLTKVSANEGFIPINKNLWSISYVTTLSCF 538


>gi|348529394|ref|XP_003452198.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Oreochromis niloticus]
          Length = 600

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 108/398 (27%), Positives = 192/398 (48%), Gaps = 46/398 (11%)

Query: 5   RIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVA 64
           R++  GL   + +        ++E  IN E G  R  V       +    L    SKR+ 
Sbjct: 151 RMLGTGLDAVRGLVLRLGSSMETERLINSELGPTR--VVPPVTDNILPPPL--TSSKRLR 206

Query: 65  TLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVP 124
           +LD FRG+++V+M+ V+  GG Y    H  WNG T+AD V P+F+FI+G +IAL++  + 
Sbjct: 207 SLDTFRGISLVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPWFVFIMGTSIALSINALL 266

Query: 125 KINGA----VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
           +        ++K ++R+L+L   G+++       P+     +  +++R  G+LQR+A  Y
Sbjct: 267 RAGATRCSLLRKAVWRSLQLFIIGVLVIN-----PNYCQGALAWENLRIPGVLQRLAWSY 321

Query: 181 VVVALIETLTTKRRPNVLEP----RHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           +VVA ++ L  + + +V+           F  Y   W+   +  V+++  T+ L VP+  
Sbjct: 322 LVVACLDLLVARGQLDVITVDAWWSPAIDFLLYWPAWLCVILLEVLWLFLTFLLPVPDCP 381

Query: 237 FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPN 296
                  G+        M  ++     A G++DR L G  H+Y +P  SR+   T     
Sbjct: 382 TGYLGPGGIGD------MGLYVNCTGGAAGFIDRLLLGEKHMYQNPS-SRVIYAT----- 429

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH-SARLKHWVSMGF 355
                       R P++PEG+L +I++IL   +G+  G +++H++    + +  ++  G 
Sbjct: 430 ------------RIPYDPEGVLGSINSILMAFLGLQAGKIILHYRERPKSIMSRFLIWGL 477

Query: 356 GLLIIAIILHFTNA----IPINKQLYSFSYVCFTAGAA 389
            L +I+  L   +     IP+NK L+S SYV   A  A
Sbjct: 478 SLGVISAFLTKCSTDRGFIPVNKNLWSLSYVTTLACFA 515


>gi|195476975|ref|XP_002100049.1| GE16376 [Drosophila yakuba]
 gi|194187573|gb|EDX01157.1| GE16376 [Drosophila yakuba]
          Length = 576

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 105/341 (30%), Positives = 165/341 (48%), Gaps = 51/341 (14%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           + KR+ +LD FRGL++VLMI V+  GG YA I+H+ WNG  LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADVVFPSFLWIMGVCIPL 241

Query: 119 ALKKVPKINGAVK-----KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           ++K      G+ K     +I++R++KL   G+ L        +++S G +++ +R+ G+L
Sbjct: 242 SVKSQLS-RGSSKARICLRILWRSIKLFVIGLCL--------NSMS-GPNLEQLRFMGVL 291

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI-----YIITTY 228
           QR  + Y+VV ++ TL  +R P  + P+ L     +      G +A ++     Y+  TY
Sbjct: 292 QRFGVAYLVVGVLHTLCCRREP--ISPQRLWQRAVHDVCLFSGELAVLLALVATYLGLTY 349

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            L VP          G   Y        H      A GYVD ++ G  H+Y  P    + 
Sbjct: 350 GLRVPGCPRGYLGPGGKHDY------NAHPNCIGGAAGYVDLQVLGNAHIYQHPTAKYVY 403

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
             T                    F+PEG+   I +++   +G   G  L+      +R++
Sbjct: 404 DST-------------------AFDPEGIFGCILSVVQVLLGAFAGVTLLVHPNWQSRIR 444

Query: 349 HWVSMGFGL-LIIAIILHFTN---AIPINKQLYSFSYVCFT 385
            W  +   L LI   +  F+    AIP+NK L+S S+VC T
Sbjct: 445 RWTFLAILLGLIGGALCGFSREGGAIPVNKNLWSLSFVCVT 485


>gi|345482764|ref|XP_001600799.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Nasonia vitripennis]
          Length = 569

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 99/353 (28%), Positives = 160/353 (45%), Gaps = 62/353 (17%)

Query: 49  ELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFF 108
           E +L+   +   +RV +LD  RG++++LMI V++    YA ++H+ WNG  + D V P F
Sbjct: 174 EDELEVGKKTAKRRVRSLDTVRGMSILLMIFVNNGAAGYALLEHATWNGLLVGDLVFPCF 233

Query: 109 LFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDM 164
           ++I+GV I L++        +     + I+ R++ L   G+ L        + L     +
Sbjct: 234 MWIMGVCIPLSISAQLSRGSSRLRLCRAIVKRSVYLFAIGLAL--------NTLGGRNQL 285

Query: 165 KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYI 224
           + IR  G+LQR  L Y+V  ++  L   R  +    R L    A   QWI   +    + 
Sbjct: 286 ERIRIFGVLQRFGLAYLVAGIVYALAA-RPDDKQSKRMLGDVVALIPQWIVALLILAAHC 344

Query: 225 ITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPA----------CN--AVGYVDREL 272
              + L VP                     RG+LGP           C+  A GYVD+ L
Sbjct: 345 AVVFLLPVPGCP------------------RGYLGPGGRHADGKYWNCSGGATGYVDKVL 386

Query: 273 WGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIH 332
            G++H+Y  P           + NS         +   PF+PEG+L ++++I    +GI 
Sbjct: 387 LGVDHIYQLP-----------TANS--------VYGSGPFDPEGVLGSLTSIFQVFLGIQ 427

Query: 333 YGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFT 385
            G +L  +    ARL  W+     L  +   LH+TN +P+NK L+S S+V  T
Sbjct: 428 AGQILRTYGSWKARLVRWLLWAVLLGAVGAALHYTNVVPVNKNLWSVSFVLVT 480


>gi|322790964|gb|EFZ15612.1| hypothetical protein SINV_04659 [Solenopsis invicta]
          Length = 581

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 194/382 (50%), Gaps = 65/382 (17%)

Query: 24  GKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           GK S + I+ +      E+Q+ +     ++    + S R+ ++D FRG+ ++LMI VD+ 
Sbjct: 140 GKLSPDNIHDD----LDELQEAETSTPIVRT--SRSSTRIRSIDTFRGIALLLMIFVDNG 193

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFW 143
           GG Y   +HS WNG T+AD V+P+F +I+G++I ++ +   +++ +  KIIFR L+    
Sbjct: 194 GGKYVFFNHSAWNGLTVADLVLPWFAWIMGLSITISKRSELRVSNSRMKIIFRCLQ---- 249

Query: 144 GIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK--RRPNVLEPR 201
             ++        +++S    +KH+R+ G+LQ +A+ Y V A IET+  K   + +VL+  
Sbjct: 250 RALVLVLLGLMLNSMSME-SLKHLRFPGVLQLLAVSYFVCATIETIFMKAHSQDDVLQFG 308

Query: 202 HLSIF--TAYQW-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHL 258
             SI       W QW+      V +I+ T+ L VPN             Y+   G     
Sbjct: 309 RFSILRDILNNWAQWLIILAIMVTHILITFLLPVPNCP---------TGYLGPGGNYSRY 359

Query: 259 G--PACN--AVGYVDRELWGINHLYS---DPVWSRLEACTLSSPNSGPLREDAPSWCRAP 311
           G  P C   A GY+DR ++G +H+YS   +PV+  +                       P
Sbjct: 360 GKFPNCTGGAAGYIDRLVFG-SHVYSKTQNPVYGTI----------------------LP 396

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV--SMGFGLLIIAIILHF--- 366
            +PEG+++T+S IL   +G+H G +L+ +   + R+  W+  S   G LI  ++ HF   
Sbjct: 397 HDPEGIMNTMSIILVVYMGVHAGKILLLYYQCNGRVIRWLLWSSVTG-LIAGLLCHFDKE 455

Query: 367 TNAIPINKQLYSFSYV----CF 384
           +  IP++K++ S S+V    CF
Sbjct: 456 SGVIPVSKKMMSLSFVLTVSCF 477


>gi|328780782|ref|XP_396570.4| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Apis mellifera]
          Length = 569

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 101/368 (27%), Positives = 166/368 (45%), Gaps = 50/368 (13%)

Query: 30  GINKEKGLERSEVQDEQKGELQLQQLLQQKSKR-VATLDAFRGLTVVLMILVDDAGGAYA 88
           G++  K L     +   K +L     ++Q SKR V  +D  RG + +LMI V+D  G Y 
Sbjct: 153 GLSGLKSLWNVFKKKCVKRQLDDTTAMKQPSKRRVKAIDTVRGASTLLMIFVNDGSGGYR 212

Query: 89  RIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGI--- 145
            + H+ WNG    D + P F++I+GV I +A+       G +K+++ +   ++F+GI   
Sbjct: 213 ILGHATWNGLLPGDLLFPCFIWIMGVCIPIAMA------GQMKRMLPK--HMIFYGIVKR 264

Query: 146 -ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEP---R 201
            IL      + + +S G  ++ IR  G+LQR  + Y +VALI      R+P   +    +
Sbjct: 265 SILMFLIGLSLNTVSTGPQLETIRIFGVLQRFGITYFIVALIYLCLMTRKPKKTQSPMLK 324

Query: 202 HLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS---FSEHSDHGVKKYIVKCGMRGHL 258
            +  F     QW    +   ++   T+ L VP            H   KY    G     
Sbjct: 325 EVQDFLLLLPQWCVMLVIVAVHCFITFCLKVPGCPTGYLGPGGLHDDAKYFDCVG----- 379

Query: 259 GPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLL 318
                A GY+DR +   +HL+                        A  +   P++PEG+L
Sbjct: 380 ----GAAGYIDRMILKESHLH----------------------HSATVYKSGPYDPEGIL 413

Query: 319 STISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYS 378
            T++      +G+H G +++ +K    R+  W++       I  ILHFTN IP+NK+L+S
Sbjct: 414 GTLTTTFQVFLGLHAGIIMMTYKDWKERVIRWLTWAAFFSCIGCILHFTNIIPVNKKLWS 473

Query: 379 FSYVCFTA 386
            S+V  T 
Sbjct: 474 LSFVFVTT 481


>gi|126304129|ref|XP_001381943.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Monodelphis domestica]
          Length = 638

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 117/386 (30%), Positives = 186/386 (48%), Gaps = 63/386 (16%)

Query: 20  EQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQL-LQQKSKRVATLDAFRGLTVVLMI 78
           ++ + ++++  IN E G       +    +LQ++   L     R+ +LD FRG+++++MI
Sbjct: 201 KKMNPRETDRLINSELG--SPTRAESYSSDLQVEAWRLTPPVHRLRSLDTFRGISLIIMI 258

Query: 79  LVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KI 133
            V+  GG Y    H  WNG T+AD V P+F+FI+G +IAL+L  + +  G  K     KI
Sbjct: 259 FVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGSSIALSLSSMLR-RGCSKWKLLGKI 317

Query: 134 IFRTLKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK 192
           ++R+  L   G+ I+   Y   P      +    +R  G+LQR+ L Y+VVA++E L  K
Sbjct: 318 LWRSFLLCVIGVLIMNPNYCLGP------LSWDKLRIPGVLQRLGLTYLVVAVLELLFAK 371

Query: 193 RRP--NVLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKY 248
             P  + +E    S     +Y  QWI   +   +++  T+ L VP          G    
Sbjct: 372 AVPENSTMESLCASFQDIISYWPQWIFILMLEAVWVCVTFLLPVP----------GCPTG 421

Query: 249 IVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDA 304
            +  G  G  G  P C   A GY+DR L G +H+Y  P           SPN        
Sbjct: 422 YLGPGGIGDFGKYPNCTGGAAGYIDRLLLGEDHIYQHP-----------SPN-------V 463

Query: 305 PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLII 360
               +  ++PEGLL TI++I+   +G+  G +L+ +K        R   W +M   L II
Sbjct: 464 LYHTKVAYDPEGLLGTINSIVMAFLGVQAGKILLFYKDQHKQIMLRFLLWSAM---LAII 520

Query: 361 A-IILHFTNA---IPINKQLYSFSYV 382
           + ++  F+     IP+NK L+S SYV
Sbjct: 521 SGVLTKFSQNEGFIPVNKNLWSISYV 546


>gi|443685781|gb|ELT89271.1| hypothetical protein CAPTEDRAFT_227545 [Capitella teleta]
          Length = 605

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 116/409 (28%), Positives = 184/409 (44%), Gaps = 75/409 (18%)

Query: 5   RIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVA 64
           RI+   +   QL+E +  D ++S+  +  E   + +E     K             +R+ 
Sbjct: 165 RIISLLVKEEQLLE-DLGDPEESDPEMQTESATDDAETTAVNK----------THKERLR 213

Query: 65  TLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVP 124
           +LDAFRG+++ +MI V+  GG Y   DHS WNG TLAD V P+F +I+G A+AL+++   
Sbjct: 214 SLDAFRGMSLTIMIFVNYGGGGYWFFDHSYWNGLTLADLVFPWFTWIIGTALALSIQGQM 273

Query: 125 KINGAVK-----KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALV 179
           +  G  K     KII RT  L   GI+L  G    P      VD++ +R  G+LQR+A+ 
Sbjct: 274 R-RGKTKFSIAAKIIRRTCVLFALGIVLGSGGGSEP------VDVQTLRIPGVLQRLAIS 326

Query: 180 YVVVALIETLTTKRRPNVLEPRHLSI---FTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           Y+VVAL+  +  K   +  +P  L +    T +  QW    +    ++  T+ L + +  
Sbjct: 327 YLVVALLHLIFAKANKD-HQPSRLDMVRDITDHWPQWGIVLVMVACHLGLTFLLPISD-- 383

Query: 237 FSEHSDHGVKKYIVKCGMRGHLGPA----------CN--AVGYVDRELWGINHLYSDPVW 284
                       +      G+LGP           C   A   +DR  +   H+Y  P  
Sbjct: 384 ------------VEGTCPTGYLGPGGLHEGGKYENCTGGAAAVIDRWFFSRQHVYQTPTC 431

Query: 285 SRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS 344
             +                       P +PEG+L T+++I    +G+  G +L  FK  S
Sbjct: 432 KEVYKTV------------------EPHDPEGILGTLTSIFLCFLGLQAGVILTTFKQKS 473

Query: 345 ARLKHWVSMGFGL-LIIAIILHFTNA---IPINKQLYSFSYVCFTAGAA 389
            R++ W+  G  L LI  ++  F      IP+NK L+S S+V   A  A
Sbjct: 474 PRMRRWIVWGIILGLIAGLLCGFKQDGGWIPVNKNLWSLSFVLGLASMA 522


>gi|351712254|gb|EHB15173.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial
           [Heterocephalus glaber]
          Length = 537

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 117/381 (30%), Positives = 179/381 (46%), Gaps = 59/381 (15%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQ-QLLQQKSKRVATLDAFRGLTVVLMILVD 81
           + +++++ IN + G       D   G+ Q + Q       R+  LD FRG+ +VLM+ V+
Sbjct: 162 NSQETDHLINSDLGSPSGA--DPLTGDPQPEAQCASASGHRLRCLDTFRGIALVLMVFVN 219

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFR 136
             GG Y    HS WNG T+AD V P+F+FI+G ++ L++  V +  G  K     KI +R
Sbjct: 220 YGGGRYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLSMTSVLQ-RGCSKFKLLGKIAWR 278

Query: 137 TLKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           +  L+  GI I+   Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P
Sbjct: 279 SFLLICIGIVIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPIP 332

Query: 196 N--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
              VLE    S+   T+   QW+   +   I++  T+ L VP          G     + 
Sbjct: 333 ENCVLERSCPSLRDITSSWSQWLLILLLEGIWLGLTFLLPVP----------GCPTGYLG 382

Query: 252 CGMRGHLGPACNAVG----YVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
            G  G LG   N  G    Y+D  L G +HLY  P  + L                    
Sbjct: 383 PGGIGDLGKYANCTGGAARYIDHLLLGSDHLYQHPSSTVLYH------------------ 424

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAII 363
              P++PEG+L TI++I+   +G+  G +L+ +KG +     R   W  +  GL+ +A+ 
Sbjct: 425 TEVPYDPEGILGTINSIVMAFLGVQAGKILLCYKGQTKDILIRFTAWCCV-LGLISVALT 483

Query: 364 LHFTNA--IPINKQLYSFSYV 382
               N   IPINK L+S SYV
Sbjct: 484 KMSENKGFIPINKNLWSISYV 504


>gi|380028317|ref|XP_003697852.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Apis florea]
          Length = 555

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 94/348 (27%), Positives = 158/348 (45%), Gaps = 48/348 (13%)

Query: 46  QKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVM 105
            K +L    + Q   +RV  +D  RG + +LMI V+D  G Y  + H+ WNG    D + 
Sbjct: 161 MKRQLDETAMKQPSKRRVKAIDTVRGASTLLMIFVNDGSGGYRILGHATWNGLLPGDLLF 220

Query: 106 PFFLFIVGVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG 161
           P F++I+GV I +A+    K++   +  +  I+ R++ +   G+ L        + +S G
Sbjct: 221 PCFIWIMGVCIPIAMASQMKRMLPRHVILYGIVKRSILMFLIGLSL--------NTVSTG 272

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFV 221
             ++ IR  G+LQR  + Y++VALI      R+P   + + +  F     QW    +   
Sbjct: 273 PQLETIRVFGVLQRFGITYLIVALIYFCLMARKPK--KTQVMQDFLLLLPQWCVMLVIVA 330

Query: 222 IYIITTYSLYVPNWS---FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHL 278
           ++ + T+ L VP            H   KY    G          A GY+DR +      
Sbjct: 331 VHCVITFCLKVPGCPTGYLGPGGLHDDAKYFDCVG---------GAAGYIDRMI------ 375

Query: 279 YSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI 338
                        L  P+   L   A  +   P++PEG+L T++      +G+H G +++
Sbjct: 376 -------------LKEPH---LHHSATVYKSGPYDPEGILGTLTTTFQVFLGLHAGIIMM 419

Query: 339 HFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTA 386
            +K    R+  W++       I  ILHF+N IP+NK+L+S S+V  T 
Sbjct: 420 TYKDWKERVIRWLAWAAFFSCIGCILHFSNIIPVNKKLWSLSFVFVTT 467


>gi|432907420|ref|XP_004077635.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Oryzias latipes]
          Length = 482

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 106/385 (27%), Positives = 179/385 (46%), Gaps = 57/385 (14%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAG 84
           +D +N ++ +   E + V  + +G          +  R+ +LD FRG  + +M+ V+  G
Sbjct: 60  QDPQNSVDDDGAPETAVVAADSRGT---------RPARLLSLDTFRGFALTVMVFVNYGG 110

Query: 85  GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK----KIIFRTLKL 140
           G Y   +H+PWNG T+AD VMP+F+FI+G ++ LA   + +     +    KI +RT+ L
Sbjct: 111 GGYWFFEHAPWNGLTVADLVMPWFVFIMGTSVVLAFSSMQRRGVGRRQLLGKITWRTVVL 170

Query: 141 LFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEP 200
           +  G      YS     LS+      +R  G+LQR+A  Y V++L++T   ++     E 
Sbjct: 171 MLLGFCFL-NYSPRDGPLSW----SWLRIPGVLQRLAFTYFVLSLLQTFWGRKAIPESEN 225

Query: 201 ---RHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGH 257
                +     +  QW+  F+   +++  T+ + VPN             Y+   G+  H
Sbjct: 226 HWWNPVQDVVLFWPQWLLIFLLETLWLCITFLMPVPN---------CPTGYLGAGGIGDH 276

Query: 258 -LGPACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEP 314
            L P C   A G +DR ++G N +Y  P   +L                       P++P
Sbjct: 277 GLYPNCTGGAAGSIDRWMFGDN-MYRYPTCKKLYR------------------TEQPYDP 317

Query: 315 EGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHW-VSMGFGLLIIAIILHFTNA 369
           EG+L TI++I+ G +G+  G +++ +K  S     R   W V +G    I++        
Sbjct: 318 EGVLGTINSIVMGFLGMQAGKIIVFYKRKSGHILWRYLTWAVILGISAAILSKCTRDGGF 377

Query: 370 IPINKQLYSFSYVCFTAGAADGCLG 394
           IP+NK L+S SYV  T   +   LG
Sbjct: 378 IPVNKNLWSLSYVTCTGALSFLLLG 402


>gi|149057830|gb|EDM09073.1| rCG43316 [Rattus norvegicus]
          Length = 626

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 119/392 (30%), Positives = 183/392 (46%), Gaps = 65/392 (16%)

Query: 16  LVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVV 75
           +  +E D   +SE G         ++ Q E +    L         R+  +D FRG+ ++
Sbjct: 191 IASRETDRLINSELGSPSRADPLGADCQPETRRASALPH-------RLRCVDTFRGVALI 243

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK---- 131
           LM+ V+  GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K    
Sbjct: 244 LMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSMTSLLQ-RGCSKIKLL 302

Query: 132 -KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLT 190
            KI +R+  L+  GII+       P+     +    +R  G+LQR+ + Y VVA++E   
Sbjct: 303 GKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAMLELFF 357

Query: 191 TKRRPN--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVK 246
            K  P+   LE   LS+   T+   QW+   I   I++  T+ L VP          G  
Sbjct: 358 WKPVPDSCTLERSCLSLRDITSSWPQWLIILILESIWLALTFFLPVP----------GCP 407

Query: 247 KYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRE 302
              +  G  G LG  P C   A GY+DR L G +HLY  P  + L    ++         
Sbjct: 408 TGYLGPGGIGDLGKYPHCTGGAAGYIDRLLLGDSHLYQHPSSAVLYHTEVA--------- 458

Query: 303 DAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLL 358
                    ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+
Sbjct: 459 ---------YDPEGVLGTINSIVMAFLGVQAGKILLYYKDQTKAILIRFAAWCCI-LGLI 508

Query: 359 IIAIILHFTNA--IPINKQLYSFSYV----CF 384
            IA+     N   IPINK L+S SYV    CF
Sbjct: 509 SIALTKMSANEGFIPINKNLWSISYVTTLSCF 540


>gi|354472121|ref|XP_003498289.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cricetulus griseus]
          Length = 782

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 113/378 (29%), Positives = 180/378 (47%), Gaps = 57/378 (15%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQ-QLLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           ++++  IN E G       D   G+ Q + +       R+  +D FRG+ ++LM+ V+  
Sbjct: 350 RETDRLINSELG--SPSRADPLSGDCQPETRHTSALPYRLRCVDTFRGIALILMVFVNYG 407

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRTL 138
           GG Y    HS WNG T+AD V P+F+FI+G ++ L++       G  K     KI +R+ 
Sbjct: 408 GGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLSMTSALH-RGCSKFRLLGKITWRSF 466

Query: 139 KLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN-- 196
            L+  GII+       P+     +    +R  G+LQR+ + Y VVA++E + +K  P+  
Sbjct: 467 LLICIGIIVVN-----PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELIFSKPVPDRC 521

Query: 197 VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGM 254
            LE  +LS+   T    QW+   I   I++  T+ L VP          G     +  G 
Sbjct: 522 ALERSYLSLRDITCSWPQWLVVLILESIWLALTFFLPVP----------GCPTGYLGPGG 571

Query: 255 RGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRA 310
            G +G  P C   A GY+D  L G NHLY  P  + L    ++                 
Sbjct: 572 IGDMGKYPHCTGGASGYIDHLLLGDNHLYQHPSSTVLYHTQVA----------------- 614

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILHF 366
            ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ IA+    
Sbjct: 615 -YDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKAILMRFTAWCCI-LGLISIALTKMS 672

Query: 367 TNA--IPINKQLYSFSYV 382
            +   IPINK L+S SYV
Sbjct: 673 ADEGFIPINKNLWSISYV 690


>gi|312381520|gb|EFR27253.1| hypothetical protein AND_06166 [Anopheles darlingi]
          Length = 782

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 113/415 (27%), Positives = 179/415 (43%), Gaps = 81/415 (19%)

Query: 4   LRIVEEGLGRTQL-------------VEQEQDDGKDSENGINKEKGLERSEVQDEQKGEL 50
           +R+V+ GL RT+              V +E+ DG D     +K++G   S          
Sbjct: 335 VRLVQYGLKRTRTAHSSHQPTANSSAVAEERTDGDD-----DKQEGFTASAAPPSVAPA- 388

Query: 51  QLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLF 110
                     KR+ +LD  RG+ ++LMI V+  GG Y  I+H+ WNG  +AD V P+FLF
Sbjct: 389 ------NIARKRLQSLDTLRGIAIMLMIFVNSGGGHYWWIEHATWNGLHVADLVFPWFLF 442

Query: 111 IVGVAIALALKKVPKINGAVKKII----FRTLKLLFWGIILQGGYSHAPDALSYGVDMKH 166
           I+GV I ++L+     N + ++I+     R++ L   G+ L             G +M +
Sbjct: 443 IMGVCIPISLRGQLARNVSKRQIVSSITTRSISLFLIGLCLNS---------MNGPNMAN 493

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLS-----IFTAYQWQWIGGFIA-F 220
           +R  G+LQR  + Y VV+L+  L   R     + R +      I    QW  +G  +  +
Sbjct: 494 LRIFGVLQRFGVAYFVVSLVH-LFCHREQIASQHRFVRANVDIIRLVRQWIIVGLLVVIY 552

Query: 221 VIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNA--VGYVDRELWGINHL 278
           +  I+   +   P   F     H    Y           P C     GY+DR L G++HL
Sbjct: 553 LAVILLIPAPGCPRGYFGPGGKHLFNVY-----------PNCTGGITGYIDRVLLGMSHL 601

Query: 279 YSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI 338
           Y  P    +                   +   PF+PEG  + +  IL   +G+  G  ++
Sbjct: 602 YQHPTARYV-------------------YDGQPFDPEGPFACLPTILQVFLGLQCGSTIL 642

Query: 339 HFKGHSARLKHWVSMGFGL-LIIAIILHFTNA---IPINKQLYSFSYVCFTAGAA 389
            F GH  RL+ +      L L+  ++  F+     +P+NK L+S SYV  TA  A
Sbjct: 643 SFTGHRQRLQRFAVWSVALGLVAGVLCGFSKNDGWLPVNKNLWSLSYVLATASLA 697


>gi|194762450|ref|XP_001963347.1| GF20351 [Drosophila ananassae]
 gi|190629006|gb|EDV44423.1| GF20351 [Drosophila ananassae]
          Length = 576

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 103/339 (30%), Positives = 167/339 (49%), Gaps = 47/339 (13%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           + KR+ +LD FRGL++VLMI V+  GG Y  IDH+ WNG  LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYTWIDHAAWNGLHLADLVFPSFLWIMGVCIPL 241

Query: 119 ALKKVPKINGAVK-----KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           ++K      GA K     +I++R++KL   G+ L        +++S G  ++ +R  G+L
Sbjct: 242 SVKAQLS-RGASKGRICLRILWRSIKLFAIGLCL--------NSMS-GPGLEELRIMGVL 291

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QR  + ++VV ++ TL ++R P  + P+       +      G +A ++ ++ TY     
Sbjct: 292 QRFGVAFLVVGVLHTLCSRRDP--ISPQRSWQRAVHDICLFSGELAVLLALVATYL---- 345

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHL-GPAC--NAVGYVDRELWGINHLYSDPVWSRLEAC 290
             +F        K Y+   G   +   P C   A GYVD ++ G  H+Y  P    +   
Sbjct: 346 GLTFGLRVPGCPKGYLGPGGKFDYASNPNCIGGAAGYVDLKVLGNAHIYQHPTAKYV--- 402

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                           +  A F+PEG+   I +++   +G   G  L+      +R++ W
Sbjct: 403 ----------------YDSAAFDPEGIFGCILSVVQVLLGAFAGVTLLVHPTWQSRIRRW 446

Query: 351 VSMGFGL-LIIAIILHFTN---AIPINKQLYSFSYVCFT 385
           + +   L LI   +  F+    AIP+NK L+S S+VC T
Sbjct: 447 LILAVVLGLIGGALCGFSREGGAIPVNKNLWSLSFVCVT 485


>gi|443731781|gb|ELU16770.1| hypothetical protein CAPTEDRAFT_135912, partial [Capitella teleta]
          Length = 388

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 100/341 (29%), Positives = 160/341 (46%), Gaps = 49/341 (14%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL- 120
           R+ +LD FRG+++V+MI V+  GG Y    HS WNG TLAD V P+F+FI+G ++AL+  
Sbjct: 1   RLKSLDTFRGISLVIMIFVNYRGGGYWFFRHSAWNGLTLADLVFPWFVFIMGTSMALSFR 60

Query: 121 ----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
               + +P+    + K++ R + L   G+++      A       VD++ +R  G+LQR+
Sbjct: 61  GALRRGIPRFK-LILKVLKRAMILFALGVMISNSKGKA-------VDLRTLRVPGVLQRL 112

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           AL Y+V+ ++E    K          +        QW    +   ++   T+ L VP   
Sbjct: 113 ALTYLVLGIMEAALAKSHDPHQWWSSVRDVVGNLGQWAAVLMFVAVHCCLTFLLPVP--- 169

Query: 237 FSEHSDHGVKKYIVKCGMRGHLGPACNAVG----YVDRELWGINHLYSDPVWSRLEACTL 292
                  G  K  +  G   H G   N  G    Y+DR ++G  H+Y  P       C +
Sbjct: 170 -------GCPKGYLGPGGLQHGGAYENCTGGATAYIDRMIFGTEHMYGHP------TCMI 216

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                       P     P +PEG+L T+++I    +G+  G V++ F+G  +R+  W+ 
Sbjct: 217 ------------PYQTTVPLDPEGVLGTLTSIFLCFLGLQAGKVILIFQGWKSRVSRWMC 264

Query: 353 MGFGLLIIAIILHFTNA----IPINKQLYSFSYVCFTAGAA 389
                 ++A  L   +A    IPINK L+S SYV   A  A
Sbjct: 265 WSLVTGLVAGCLCKFSAEDGFIPINKNLWSLSYVMALASMA 305


>gi|332027964|gb|EGI68015.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acromyrmex
           echinatior]
          Length = 569

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 106/378 (28%), Positives = 186/378 (49%), Gaps = 58/378 (15%)

Query: 24  GKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           GK S + ++ +      E+Q+ +   + ++    + S R+ ++D FRG++++LMI V++ 
Sbjct: 148 GKLSPDNVHDD----LDELQEAETANIMIRT--NRSSIRIRSVDTFRGISILLMIFVNNG 201

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFW 143
           GG Y   +HS WNG T+AD V+P+F +I+G++I ++ +   +++ +  KII R L+    
Sbjct: 202 GGQYMFFNHSAWNGLTVADLVLPWFAWIMGLSITISKRSELRVSNSRGKIIVRCLQRTII 261

Query: 144 GIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP--NVLEPR 201
            I+L    +       Y   +  +R+ G+LQ +A+ Y + A IET+  K  P  +VL+  
Sbjct: 262 LILLGLMLNSI-----YAKSLDDLRFPGVLQLLAVSYFICATIETIFMKTHPQDDVLQFG 316

Query: 202 HLSIF--TAYQW-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHL 258
             ++       W QW+        +I+ T+ L VPN         G            H 
Sbjct: 317 RFTVLRDILNNWAQWLIILAIMTTHILITFLLPVPNCPTGYLGPGGYH----------HF 366

Query: 259 GPACN----AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEP 314
           G   N    A GY+DR ++G +H+YS             +P  G +          P +P
Sbjct: 367 GEFANCTGGAAGYIDRLVFG-SHMYSKT----------QNPVYGTI---------LPHDP 406

Query: 315 EGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL-LIIAIILHFTN---AI 370
           EG+++TIS IL   +G+H G +L+ +   +A++  W+   F   LI  I+  F      I
Sbjct: 407 EGIMNTISIILVVYLGVHAGKILLLYYQCNAKVIRWLLWSFVTGLIAGILCDFDKEFGVI 466

Query: 371 PINKQLYSFSYV----CF 384
           P++K++ S S+V    CF
Sbjct: 467 PVSKKMMSLSFVLTVSCF 484


>gi|432099917|gb|ELK28811.1| Heparan-alpha-glucosaminide N-acetyltransferase [Myotis davidii]
          Length = 586

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 112/384 (29%), Positives = 178/384 (46%), Gaps = 61/384 (15%)

Query: 18  EQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLM 77
            +E D   +SE G     GL   + Q E          L     R+  +D FRGL ++LM
Sbjct: 153 SRETDRLINSELGSPSRAGLFGDDAQPEV-------WRLSAVPPRLRCVDTFRGLALILM 205

Query: 78  ILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----K 132
           + V+  GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     K
Sbjct: 206 VFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLGK 264

Query: 133 IIFRTLKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTT 191
           + +R+  L+  GI I+   Y   P      +    +R  G+LQR+ + Y VVA++E L  
Sbjct: 265 VAWRSFLLICIGIVIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFA 318

Query: 192 KRRPN--VLEPR--HLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK 247
           K  P   V E R   L   T+   QW+   +   +++  T+ L VP          G   
Sbjct: 319 KPVPESCVSERRCSCLQDITSSWPQWLVILMLESVWLALTFFLPVP----------GCPT 368

Query: 248 YIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRED 303
             +  G  G LG  P C   A GY+DR L G +H+Y  P  + L   T++          
Sbjct: 369 GYLGPGGIGDLGKYPNCTGGAAGYIDRLLLGDDHIYQHPSSNVLYHTTVA---------- 418

Query: 304 APSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL-----KHWVSMGFGLL 358
                   ++PEG+L TI++I+   +G+  G +L+++K  +  +       W  +G   +
Sbjct: 419 --------YDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILIRFTAWWCFLGLISV 470

Query: 359 IIAIILHFTNAIPINKQLYSFSYV 382
           ++  +      IP+NK L+S SYV
Sbjct: 471 VLTKVSENEGFIPVNKNLWSISYV 494


>gi|121489785|emb|CAK18864.1| hypothetical protein [Phillyrea latifolia]
          Length = 129

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 61/131 (46%), Positives = 86/131 (65%), Gaps = 3/131 (2%)

Query: 255 RGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEP 314
            G LGPACN+ G +DR + GI+HLY+ PV+  L+ C +SS   G + E APSWC APF+P
Sbjct: 1   NGDLGPACNSAGMIDRNVLGIDHLYAKPVYRNLKECNISS--HGQVPETAPSWCHAPFDP 58

Query: 315 EGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINK 374
           EG+LS+++A +S  IG+ YGH+L+  + H  RL +W    F  L + + L F   IP+NK
Sbjct: 59  EGILSSLTAAVSCIIGLQYGHILVRLQDHKERLCNWSIFSFAFLGLGLFLAFV-GIPLNK 117

Query: 375 QLYSFSYVCFT 385
            LY+ SY+  T
Sbjct: 118 SLYTISYLLVT 128


>gi|281209034|gb|EFA83209.1| hypothetical protein PPL_03999 [Polysphondylium pallidum PN500]
          Length = 1154

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 110/377 (29%), Positives = 165/377 (43%), Gaps = 79/377 (20%)

Query: 49  ELQLQQLLQQ--KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMP 106
           E QL  L     K  R+ +LD FRGL++ +MI V+  GG Y   +HS WNG T+AD V P
Sbjct: 210 EYQLHHLESNDPKKDRMKSLDVFRGLSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFP 269

Query: 107 FFLFIVGVAI-----ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG 161
           +F+FI+G A+     AL  + VPK    V K++ R++ L   G+ L  G           
Sbjct: 270 WFVFIMGCAMPMSFNALESRGVPK-KTIVIKLVRRSITLFALGMFLNNGN---------- 318

Query: 162 VDMKHIRWCGILQRIALVYVVVALI------------ETLTTKRRP-----------NVL 198
            D++H R  G+LQR  + Y+V  LI            + L+ +++P              
Sbjct: 319 -DLQHWRILGVLQRFGISYLVTGLIMMFVPVWRYRQLDDLSEEQQPLYGGGSIQDRIRSR 377

Query: 199 EPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHL 258
            PR  +    Y  QW+   +   ++ + T+ L VP          G     +  G  G  
Sbjct: 378 YPRMFADILPYWIQWVVALMLLSVWFLVTFLLPVP----------GCPTGYIGPGGIGSQ 427

Query: 259 GPACNAVG----YVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEP 314
           G   N  G    YVD +++G NH+Y  P    +        N+G             ++P
Sbjct: 428 GQYANCTGGAARYVDLKIFGENHIYQTPTCQTIY-------NTG------------SYDP 468

Query: 315 EGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL----HFTNAI 370
           EG L  I++I    +G+  G  ++ FK  S RL  W   G  L  IA  L         +
Sbjct: 469 EGTLGYITSIFMCFLGVQCGRTILAFKKASCRLIRWSIWGVVLCGIAAGLCGMSQNNGWL 528

Query: 371 PINKQLYSFSYVCFTAG 387
           PINK L++ S+V   +G
Sbjct: 529 PINKNLWTPSFVLLLSG 545


>gi|345781561|ref|XP_539948.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Canis
           lupus familiaris]
          Length = 638

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 118/385 (30%), Positives = 175/385 (45%), Gaps = 63/385 (16%)

Query: 18  EQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLM 77
            +E D   +SE G     G    E Q E                R+ ++D FRGL ++LM
Sbjct: 205 SRETDRLINSELGSPGRAGSIGGEAQQEAWHP-------PSALPRLRSIDTFRGLALILM 257

Query: 78  ILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----K 132
           + V+  GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K     K
Sbjct: 258 VFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSMTSMLQ-RGCSKFRLLGK 316

Query: 133 IIFRTLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTT 191
           I +R+  L   G +I+   Y   P      +    +R  G+LQR+ + Y VVA++E +  
Sbjct: 317 IAWRSFLLFCIGVVIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELIFA 370

Query: 192 KRRPN--VLEPRHLSIF-TAYQW-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK 247
           K  P     E R  S+      W QW+   +   I++  T+ L VP          G   
Sbjct: 371 KPVPESCASERRCFSLRDIILSWPQWLFILLLESIWLGLTFFLPVP----------GCPT 420

Query: 248 YIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRED 303
             +  G  G LG  P C   A GY+DR L G +H+Y  P  + L                
Sbjct: 421 GYLGPGGIGDLGKYPNCTGGAAGYIDRLLLGDDHIYQHPSSAVLYH-------------- 466

Query: 304 APSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLI 359
                + P++PEG+L TIS+I+   +GI  G +L+++K  +     R   W     GL+ 
Sbjct: 467 ----TKVPYDPEGILGTISSIVMAFLGIQAGKILLYYKDQTKDILIRFTAWCCF-LGLIS 521

Query: 360 IAIILHFTNA--IPINKQLYSFSYV 382
           +A+     N   IPINK L+S SYV
Sbjct: 522 VALTKISENEGFIPINKNLWSISYV 546


>gi|332028000|gb|EGI68051.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acromyrmex
           echinatior]
          Length = 557

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 94/346 (27%), Positives = 154/346 (44%), Gaps = 46/346 (13%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +RV  +D FRG + + MI V+D  G+Y  ++H+ W+G  L D V P F++I+GV I +AL
Sbjct: 166 RRVKAIDTFRGASTLFMIFVNDGSGSYTVLEHTIWDGMLLGDIVFPCFMWIMGVCIPIAL 225

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
               K   +  +I +  LK  F  ++   G S   + L     +++IR  G+LQR  + Y
Sbjct: 226 SAQLKRGVSKLQISYSILKRSF--LLFLIGVSL--NTLGTDSQVENIRIFGVLQRFGVTY 281

Query: 181 VVVALIETLTTKRRPNVLEP-------RHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           +VV+L+      ++  +L         R +    +    W    I  +++   T+ L VP
Sbjct: 282 LVVSLVYLCFPSQQSKILRNTSPTWIMRKMQDILSLLPHWFVMLIFVIVHCALTFCLPVP 341

Query: 234 NWSFSEHSDHGVK---KYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
                     G+    KY    G          A GY+D+ +  +NH+Y  P    +   
Sbjct: 342 GCPTGYLGPGGMHEDGKYFNCTG---------GATGYIDKTVLTLNHIYQYPTIKSVYG- 391

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                               PF+PEG+L  ++AI    +G+H G +L+ +K    R+  W
Sbjct: 392 ------------------SGPFDPEGILGCLTAIFQVFLGVHAGTILMLYKDWKDRVMRW 433

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYV----CFTAGAADGC 392
           +        +    HFTN IP+NK L+S S+V     F+     GC
Sbjct: 434 LLWAVFYACLGCAFHFTNTIPVNKNLWSLSFVFVTTSFSLAFLSGC 479


>gi|307209305|gb|EFN86390.1| Heparan-alpha-glucosaminide N-acetyltransferase [Harpegnathos
           saltator]
          Length = 552

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 108/393 (27%), Positives = 201/393 (51%), Gaps = 88/393 (22%)

Query: 24  GKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           GK S + ++ +  L+R +  +     ++  ++    + R+ ++D FRG++++LMI V++ 
Sbjct: 131 GKLSPDSVHDD--LDRLQEAESSNPVIRTSRV----NTRIRSVDTFRGISILLMIFVNNG 184

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK---- 139
           GG Y   +HS WNG T+AD V+P+F +I+G++I ++ +   +++ +  KII R L+    
Sbjct: 185 GGKYVFFNHSVWNGLTVADLVLPWFAWIMGLSITISKRSELRVSNSRGKIILRCLQRAFI 244

Query: 140 LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK--RRPNV 197
           L+  G++L   ++ +         +K +R+ GILQ +A+ Y V A IET+  +   + ++
Sbjct: 245 LILLGLMLNSIHTKS---------LKDLRFPGILQLLAVSYFVCATIETIFMRAHSQDDL 295

Query: 198 LEPRHLSIFTAY--QW-QWIGGFIAFVIYIITTYSL---YVPNWSFSEHSDHGVKKYIVK 251
           L+    ++       W QW     + ++ I TT++L    +P               ++ 
Sbjct: 296 LQFGRFTVLRDILDSWAQW-----SIIVAIATTHTLITFLLP---------------VLD 335

Query: 252 CGMRGHLGPA----------CN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGP 299
           C  +G+LGP           C   A GY+DR ++G +H+Y+             +P  G 
Sbjct: 336 CP-KGYLGPGGYHLFGKNANCTGGAAGYIDRLVFG-SHMYNKT----------HNPVYGT 383

Query: 300 LREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS-MGFGLL 358
           +          P++PEG+++TIS IL   +G+H G +L+ +   +AR+  W+   G   L
Sbjct: 384 I---------LPYDPEGIMNTISVILVVYMGVHAGKILLLYYQCNARIVRWLLWSGVTGL 434

Query: 359 IIAIILHF---TNAIPINKQLYSFSYV----CF 384
           I  I+ HF   +  IP+NK++ S S+V    CF
Sbjct: 435 IAGILCHFDKESGVIPVNKRMMSLSFVLTVSCF 467


>gi|449283383|gb|EMC90042.1| Heparan-alpha-glucosaminide N-acetyltransferase [Columba livia]
          Length = 560

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 112/389 (28%), Positives = 182/389 (46%), Gaps = 56/389 (14%)

Query: 20  EQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMIL 79
           ++ + ++++  IN E G       D    +   Q       +R+ +LD FRGL++++M+ 
Sbjct: 124 KKLNPRETDRLINSEPG--SPNAADPISSDPAPQLWSSAPRQRLRSLDTFRGLSLIIMVF 181

Query: 80  VDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KII 134
           V+  GG Y    H  WNG T+AD V P+F+FI+G +I+L+L  + +  G+ K     KI+
Sbjct: 182 VNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSLSSMLR-QGSSKWKVLGKIL 240

Query: 135 FRTLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK- 192
           +R+  L+  G I++   Y   P      +  +++R  G+LQR+ L Y+VVA +E L T+ 
Sbjct: 241 WRSFLLILLGVIVVNPNYCLGP------LSWENLRIPGVLQRLGLAYLVVAALELLFTRA 294

Query: 193 -RRPNVLEP--RHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYI 249
                 LE     L     Y  QW+   +  V+++  T+ L VP          G  +  
Sbjct: 295 GAESGTLETPCPALRDILPYWPQWVFVLMLEVLWLCLTFLLPVP----------GCPRGY 344

Query: 250 VKCGMRGHLGPACN----AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAP 305
           +  G  G  G   N    A GY+DR L G  H+Y  P  + +   T+             
Sbjct: 345 LGPGGIGDFGNYANCTGGAAGYIDRLLLGDKHIYQHPSSNVIYQTTM------------- 391

Query: 306 SWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH----W-VSMGFGLLII 360
                P++PEG+L TI+ I    +G+  G +++ +K    R+      W + MG    I+
Sbjct: 392 -----PYDPEGILGTINTIFMAFLGLQAGKIILFYKDQHKRIMSRFFIWTIVMGIISAIL 446

Query: 361 AIILHFTNAIPINKQLYSFSYVCFTAGAA 389
                    IPINK L+S SYV  T+  A
Sbjct: 447 TKCSKEEGFIPINKNLWSVSYVTTTSCFA 475


>gi|390367684|ref|XP_789038.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Strongylocentrotus purpuratus]
          Length = 624

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 109/396 (27%), Positives = 175/396 (44%), Gaps = 78/396 (19%)

Query: 28  ENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAY 87
           E  ++ + G   + V +     +Q     + K KR+ +LDAFRG+++V+MI V+  GG Y
Sbjct: 201 ERVVSSDLGTPSNPVAEADSNSIQRPS--RDKPKRLKSLDAFRGMSLVIMIFVNYGGGQY 258

Query: 88  ARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLF- 142
           +  +HS WNG T+AD V P+F++I+GV+I ++   + + +G  +++IF    R   +LF 
Sbjct: 259 SFFNHSIWNGLTVADLVFPWFIWIMGVSITMSFYALVR-HGVSRRVIFTKIIRRFVILFG 317

Query: 143 WGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRH 202
            GIIL G           G+D    R  G+LQRIA  Y+VVA +     K +      RH
Sbjct: 318 LGIILDG-----------GIDFSTFRVPGVLQRIAFSYLVVATVHLFAVKHKDEEYRIRH 366

Query: 203 -----LSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGH 257
                L     Y ++WI       ++I  T+ L VP                      G+
Sbjct: 367 VVYRELRDLLDYWYEWIIMISFLALHICLTFFLPVPGCP------------------TGY 408

Query: 258 LGPACNAVG--------------YVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRED 303
           LGP    VG              Y+D+ +   NH Y      ++   T+           
Sbjct: 409 LGPGGPLVGENESLVNCTGGAANYIDKVILTYNHTYPRGTPRKIYQTTV----------- 457

Query: 304 APSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIA-I 362
                  P +PEG+L T+++I    +G+  G +   F     R+  ++       +IA  
Sbjct: 458 -------PHDPEGILGTLTSIFMTFLGLQAGKIFHLFSYPRDRILRFLGWCVVTGVIAGA 510

Query: 363 ILHFTNA---IPINKQLYSFSYVCFTAGAADGCLGI 395
           +  F+     IP+NK L+S S++  TA  A   L I
Sbjct: 511 LCGFSKEDGIIPVNKNLWSVSFILATASMAFFLLAI 546


>gi|395842491|ref|XP_003794051.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Otolemur garnettii]
          Length = 677

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 106/352 (30%), Positives = 167/352 (47%), Gaps = 56/352 (15%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
           L     R+  +D FRG+++ LM+ V+  GG Y    HS WNG T+AD V P+F+FI+G +
Sbjct: 275 LSTHPPRLRCVDTFRGISLTLMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSS 334

Query: 116 IALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
           + L++  V +   +    ++KI +R+  L+  GII+       P+     +    +R  G
Sbjct: 335 VFLSMTSVLQRGCSKGRLLRKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPG 389

Query: 172 ILQRIALVYVVVALIETLTTKRRP-NVLEPR---HLSIFTAYQWQWIGGFIAFVIYIITT 227
           +LQR+ + Y VVA++E L  K  P N    R    L   T+   QW+       +++  T
Sbjct: 390 VLQRLGVTYFVVAVLELLFAKPVPENCASQRGCFSLGDVTSSWPQWLLILTLESVWLCLT 449

Query: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPV 283
           + L VP          G     +  G  G LG  P C   A GY+D  L G NHLY  P 
Sbjct: 450 FFLPVP----------GCPTGYLGPGGIGDLGKYPNCTGGAAGYIDHLLLGENHLYHHPS 499

Query: 284 WSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH 343
            + L    ++                  ++PEG+L TI++I+   +G+  G +L+++K  
Sbjct: 500 SAVLYHTEVA------------------YDPEGILGTINSIVMAFLGVQAGKILLYYKDQ 541

Query: 344 SA----RLKHWVSMGFGLLIIAIILHFTNA--IPINKQLYSFSYV----CFT 385
           +     R   W  +  GL+ +A+     N   IP+NK L+S SYV    CF 
Sbjct: 542 TKDILMRFAGWCCI-LGLISVALTKVSENEGFIPVNKNLWSISYVTTLSCFA 592


>gi|291409013|ref|XP_002720836.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Oryctolagus cuniculus]
          Length = 613

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 118/379 (31%), Positives = 179/379 (47%), Gaps = 59/379 (15%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQL-LQQKSKRVATLDAFRGLTVVLMILVDDA 83
           ++++  IN E G       D   G+LQ +   L     R+  +D FRG+ +VLM+ V+  
Sbjct: 181 RETDRLINSELG--SPSRADPLSGDLQPETWHLSAAKHRLRCVDTFRGIALVLMVFVNYG 238

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRTL 138
           GG Y    HS WNG T+AD V P+F+FI+G +I L++    +  G  K     KI +R+ 
Sbjct: 239 GGRYWYFRHSSWNGLTVADLVFPWFVFIMGSSIFLSMMSALQ-RGCSKLRLLGKIAWRSF 297

Query: 139 KLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN- 196
            L+  GI I+   Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P  
Sbjct: 298 LLIMIGIVIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEN 351

Query: 197 -VLEPR--HLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCG 253
            VLE     L   T+   QW+   +   I++  ++ L VP          G     +  G
Sbjct: 352 WVLESSCTCLRDVTSSWPQWLLILLLESIWLGLSFFLPVP----------GCPTGYLGPG 401

Query: 254 MRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCR 309
             G  G  P C   A GY+DR L G +HLY  P  + L    ++                
Sbjct: 402 GIGDWGKYPNCTGGAAGYIDRVLLGDDHLYKHPSSTVLYHTEVA---------------- 445

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILH 365
             ++PEG+L TI++I++  +G+  G +L+ +K  +     R   W S   GL+ +A+   
Sbjct: 446 --YDPEGILGTINSIVTAFLGVQAGKILLFYKDQTKSILIRFTAW-SCILGLISVALTKI 502

Query: 366 FTNA--IPINKQLYSFSYV 382
             N   IPINK L+S SYV
Sbjct: 503 SENEGFIPINKNLWSISYV 521


>gi|194226375|ref|XP_001488696.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Equus caballus]
          Length = 663

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 114/381 (29%), Positives = 185/381 (48%), Gaps = 59/381 (15%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKS-KRVATLDAFRGLTVVLMILVD 81
           + +++++ IN E G       D   G+ QL+      + +R+  +D FRG+ +++M+ V+
Sbjct: 229 NSRETDHLINSELG--SPSRADALGGDSQLEAWGPAAAPQRLRCVDTFRGIALIIMVFVN 286

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFR 136
             GG Y    HS WNG T+AD V P+F+FI+G +I L++    +  G  K     KI +R
Sbjct: 287 YGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSMTSTLQ-RGCSKFRLLGKIAWR 345

Query: 137 TLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP- 195
           +  L+  GI++       P+     +    +R  G+LQR+ + Y VVA++E L  K  P 
Sbjct: 346 SFLLISLGIVVVN-----PNYCLGPLSWDKLRIPGVLQRLGVTYFVVAVLELLFAKPVPG 400

Query: 196 -NVLEPRHLS---IFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
               E R  S   I +++  QW+   +   I++  T+ L VP          G     + 
Sbjct: 401 SGASERRCSSLRDILSSWP-QWLFILLLESIWLGLTFFLPVP----------GCPTGYLG 449

Query: 252 CGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
            G  G LG  P C   A GY+DR L G +HLY  P  + L    ++              
Sbjct: 450 PGGIGDLGRYPNCTGGAAGYIDRLLLGEDHLYQHPSSAVLYHTEVA-------------- 495

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAII 363
               ++PEG+L TI++I+   +G+  G +L+++K  +     R   W S   GL+ +A+ 
Sbjct: 496 ----YDPEGILGTINSIVMAFLGVQAGRILLYYKDQTKAILLRFTAW-SCFLGLISVALT 550

Query: 364 LHFTN--AIPINKQLYSFSYV 382
               N  +IPINK L+S SYV
Sbjct: 551 KVSENEGSIPINKNLWSISYV 571


>gi|432845830|ref|XP_004065874.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Oryzias latipes]
          Length = 622

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 100/377 (26%), Positives = 186/377 (49%), Gaps = 52/377 (13%)

Query: 26  DSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG 85
           ++E  IN E G  ++E          +    Q   KR+ +LD FRG+ +V+M+ V+  GG
Sbjct: 194 ETERLINSELGSPQTEAPLTDN----ILPPPQSPRKRLRSLDTFRGIALVIMVFVNYGGG 249

Query: 86  AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKI----NGAVKKIIFRTLKLL 141
            Y    H  WNG T+AD V P+F+F++G +IAL++  + +        ++KI++R+++L 
Sbjct: 250 RYWFFRHESWNGLTVADLVFPWFVFVMGTSIALSINSLLRAGLTRGSLLRKIVWRSIQLF 309

Query: 142 FWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPR 201
             G+ +       P+    G+  +++R  G+LQR+A  Y+VVA ++ +  +   +VL+  
Sbjct: 310 LIGVFIIN-----PNYCQGGLSWENLRIPGVLQRLAFSYLVVASLDLMVARGHLDVLQTD 364

Query: 202 -----HLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRG 256
                 L +   Y   W+   +  V+++  T+ L VP+         G+    +     G
Sbjct: 365 AWWSPFLDVLL-YWPAWVVVLLLEVLWLSLTFLLPVPDCPTGYLGPGGIGDMGLYANCTG 423

Query: 257 HLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEG 316
                  A G++DR L G  H+Y  P  SR+   T                 + P++PEG
Sbjct: 424 ------GAAGFLDRWLLGEKHIYQTPS-SRVLYLT-----------------QIPYDPEG 459

Query: 317 LLSTISAILSGTIGIHYGHVLIHFKG-HSARLKHWVSMGFGLLIIAIILHFTNA----IP 371
           +L +I+++L   +G+  G +++H++   ++ +  ++  G  + +I+ +L   +     IP
Sbjct: 460 VLGSINSVLMAFLGLQAGKIILHYRDLPTSIMSRFLIWGLFMGVISAVLTRCSTDQGFIP 519

Query: 372 INKQLYSFSYV----CF 384
           +NK L+S SYV    CF
Sbjct: 520 VNKNLWSLSYVTTLACF 536


>gi|313242995|emb|CBY39713.1| unnamed protein product [Oikopleura dioica]
          Length = 597

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 118/421 (28%), Positives = 187/421 (44%), Gaps = 82/421 (19%)

Query: 30  GINKEKGLERSEVQ------DEQKGELQLQQLLQQKSK--------RVATLDAFRGLTVV 75
            I + + LE  E +      D+Q  + + Q++ + +          R  +LD  RGL+++
Sbjct: 161 NIRRRRALEPEEAKPENPFADDQTDKQEDQEVQEDEPAPPAPAKKKRYKSLDTLRGLSLI 220

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKK--- 132
           +MI V+  GG Y  ++H  WNG T+AD VMP+FLF+ GV+I +AL+   +I   + K   
Sbjct: 221 IMIFVNYGGGEYWFMEHVAWNGLTVADLVMPWFLFMSGVSIRIALQS--RIKRGISKTEI 278

Query: 133 ---IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL 189
              I+ R++KL+  G+I  GG              ++ R+ G+LQRI   Y VVA+I  L
Sbjct: 279 SYEILVRSVKLIGLGMITIGG----------NESWEYFRFPGVLQRIGFSYFVVAIIHLL 328

Query: 190 TTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI---YIITTYSLYVPNWSFSEHSDHGVK 246
             +  P+     +  +F    + +    I++ I   +I  TY L +P          G+ 
Sbjct: 329 VIE-HPDKEPETNWGLFKEMSFNFKEHLISWSILGAFICLTYLLPIPGCPTGYTGPGGLS 387

Query: 247 KYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPS 306
           +         H      A GY+DR+L G  H+Y+ P     E      PN          
Sbjct: 388 E------NGEHYHCIGGAAGYIDRKLLGEKHIYNWPTAYHDE------PNG--------- 426

Query: 307 WCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV--------------S 352
               PF+PEGLL T+++I    +G+  G     FK   + + H +              +
Sbjct: 427 ---VPFDPEGLLGTLTSIFMVYLGLQAGKCFDIFKTPKSIILHLLGLAAVYGISGMLLAT 483

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDAC 412
           +GFG +  AI       IPINK L+S S+V   +  A   L     I I   DW +   C
Sbjct: 484 IGFGKIAEAI-------IPINKNLWSVSFVFVLSSMAFTLLAFLYFI-IDVRDWWDGAPC 535

Query: 413 F 413
           +
Sbjct: 536 Y 536


>gi|194888520|ref|XP_001976930.1| GG18736 [Drosophila erecta]
 gi|190648579|gb|EDV45857.1| GG18736 [Drosophila erecta]
          Length = 576

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 103/347 (29%), Positives = 165/347 (47%), Gaps = 55/347 (15%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           + KR+ +LD FRGL++VLMI V+  GG YA I+H+ WNG  LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADVVFPSFLWIMGVCIPL 241

Query: 119 ALKKVPKINGAVK-----KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           ++K      G+ K     +I++R++KL   G+ L        +++S G +++ +R+ G+L
Sbjct: 242 SVKSQLS-RGSSKARICLRILWRSIKLFVIGLCL--------NSMS-GPNLEQLRFMGVL 291

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY----- 228
           QR  + Y+VV ++ TL  +R P  + P+       +      G +A ++ ++ TY     
Sbjct: 292 QRFGVAYLVVGVLHTLCCRREP--ISPQRSWQRAVHDVCLFSGELAVLLALVATYLGLTF 349

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPAC--NAVGYVDRELWGINHLYSDPVWSR 286
            L VP          G   Y        +  P C   A GY D ++ G  H+Y  P    
Sbjct: 350 GLRVPGCPRGYLGPGGKHDY--------NAHPHCIGGAAGYADLQVLGNAHIYQHPTAKY 401

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
           +   T                    F+PEG+   I +++   +G   G  L+      +R
Sbjct: 402 VYDST-------------------AFDPEGVFGCILSVVQALLGAFAGVTLLVHPNWQSR 442

Query: 347 LKHWVSMGFGLLIIAIIL----HFTNAIPINKQLYSFSYVCFTAGAA 389
           ++ W+ +   L +I   L        AIP+NK L+S S+VC T   A
Sbjct: 443 MRRWMLLAILLGLIGGALCGFSREGGAIPVNKNLWSLSFVCVTVSLA 489


>gi|301608954|ref|XP_002934053.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Xenopus (Silurana) tropicalis]
          Length = 633

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 176/382 (46%), Gaps = 61/382 (15%)

Query: 29  NGINKEKGLERSEVQDEQKGELQLQQLLQQ----KSKRVATLDAFRGLTVVLMILVDDAG 84
           N I  ++ L  SE+    + ++  Q+   +      +R+ +LD FRGL + +M+ V+  G
Sbjct: 201 NPIETDR-LVNSELGSPNRADISSQETYSRAWNPSVQRLRSLDTFRGLALTIMVFVNYGG 259

Query: 85  GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKL 140
           G Y    H  WNG T+AD V P+F+FI+G +I L+L  +     +    + K+++R+++L
Sbjct: 260 GGYWFFKHQSWNGLTVADLVFPWFVFIMGTSIYLSLNSMLSKGSSRWNLLGKVLWRSVQL 319

Query: 141 LFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLE 199
              G+ ++   Y   P + S       IR  G+LQR++L Y+ V+ +E + +K  P+ L 
Sbjct: 320 FLIGLFVINVNYCRGPLSFS------EIRIMGVLQRLSLTYLAVSALELIFSKPTPDALT 373

Query: 200 PRHLSIF---TAYQW-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMR 255
                         W +WI       +++  T  L VP+         G+          
Sbjct: 374 QSRTCFLLQDVLSHWPKWIVILALEAVWLCLTLLLQVPDCPLGYLGPGGI---------- 423

Query: 256 GHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAP 311
           G  G  P C   A GY+DR + G  H+Y  P  + +   T+                  P
Sbjct: 424 GDFGKFPNCTGGAAGYIDRMILGQGHIYQHPTSNVIYKSTM------------------P 465

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKG-HSARLKHWVSMGFGLLIIAIILHFTNA- 369
           ++PEGLL TI+ ++    G+  G +L+ +K  H   L  + S    + +++ +L   +  
Sbjct: 466 YDPEGLLGTINCVVMAFFGLQAGIILVLYKNQHKYVLVRFFSWAIIMGVLSAVLTKCSTN 525

Query: 370 ---IPINKQLYSFSYV----CF 384
              IP+NK L+S SY+    CF
Sbjct: 526 EGIIPVNKNLWSISYITTLSCF 547


>gi|124007195|sp|Q68CP4.2|HGNAT_HUMAN RecName: Full=Heparan-alpha-glucosaminide N-acetyltransferase;
           AltName: Full=Transmembrane protein 76
          Length = 663

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 112/379 (29%), Positives = 180/379 (47%), Gaps = 59/379 (15%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQ-LLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           ++++  IN E G       D   G++Q     L     R+ ++D FRG+ ++LM+ V+  
Sbjct: 231 RETDRLINSELG--SPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMVFVNYG 288

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRTL 138
           GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R+ 
Sbjct: 289 GGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKIAWRSF 347

Query: 139 KLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN- 196
            L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P  
Sbjct: 348 LLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEH 401

Query: 197 -VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCG 253
              E   LS+   T+   QW+   +   +++  T+ L VP          G     +  G
Sbjct: 402 CASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP----------GCPTGYLGPG 451

Query: 254 MRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCR 309
             G  G  P C   A GY+DR L G +HLY  P  + L    ++                
Sbjct: 452 GIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEVA---------------- 495

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILH 365
             ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ +A+   
Sbjct: 496 --YDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLISVALTKV 552

Query: 366 FTNA--IPINKQLYSFSYV 382
             N   IP+NK L+S SYV
Sbjct: 553 SENEGFIPVNKNLWSLSYV 571


>gi|443685179|gb|ELT88879.1| hypothetical protein CAPTEDRAFT_26311, partial [Capitella teleta]
          Length = 361

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 103/353 (29%), Positives = 166/353 (47%), Gaps = 74/353 (20%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL- 120
           R+ +LD FRG+++V+MI V+  GG Y    HS WNG TLAD V P+F+FI+G ++AL+  
Sbjct: 1   RLKSLDTFRGISLVIMIFVNYRGGGYWFFRHSAWNGLTLADLVFPWFVFIMGTSMALSFR 60

Query: 121 ----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
               + +P+    + K++ R + L   G+++    S++  A     D++ +R  G+LQR+
Sbjct: 61  GALRRGIPRFK-LILKVLKRAMILFALGVMI----SNSKGAF----DLRTLRVPGVLQRL 111

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQW------------QWIGGFIAFVIYI 224
           AL Y+V+ ++E    K      +P        +QW            QW    +   ++ 
Sbjct: 112 ALTYLVLGIMEAALAKSH----DP--------HQWWSLVRDVVGNLGQWAAVLMFVAVHC 159

Query: 225 ITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVG----YVDRELWGINHLYS 280
             T+ L VP          G  K  +  G   H G   N  G    Y+DR ++G  H+Y 
Sbjct: 160 CLTFLLPVP----------GCPKGYLGPGGLQHGGAYENCTGGATAYIDRMIFGTEHMYG 209

Query: 281 DPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHF 340
            P       C +            P     P +PEG+L T+++I    +G+  G V++ F
Sbjct: 210 HP------TCMI------------PYQTTVPLDPEGVLGTLTSIFLCFLGLQAGKVILIF 251

Query: 341 KGHSARLKHWVSMGFGLLIIAIILHFTNA----IPINKQLYSFSYVCFTAGAA 389
           +G  +R+  W+       ++A  L   +A    IPINK L+S SYV   A  A
Sbjct: 252 QGWKSRVSRWMCWSLVTGLVAGCLCKFSAEDGFIPINKNLWSLSYVMALASMA 304


>gi|307178500|gb|EFN67189.1| Heparan-alpha-glucosaminide N-acetyltransferase [Camponotus
           floridanus]
          Length = 466

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 105/384 (27%), Positives = 191/384 (49%), Gaps = 68/384 (17%)

Query: 23  DGKDSENGINKE-KGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVD 81
            GK S N ++ +   L+ +E  +   G         + S R+ ++D FRG+ ++LMI V+
Sbjct: 44  KGKLSPNNVHDDLDRLQEAETSNPIIGT-------NRSSTRIRSVDTFRGIAILLMIFVN 96

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK-- 139
           + GG Y   +HS WNG T+AD V+P+F +I+G++I ++ +   +++ +  KII R L+  
Sbjct: 97  NRGGEYVFFNHSAWNGLTVADLVLPWFAWIMGLSITISKRSELRVSNSRTKIILRCLQRA 156

Query: 140 --LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK--RRP 195
             L+  G++L    S++         ++++R+ G+LQ +A+ Y V A IET+  +   + 
Sbjct: 157 FILILLGLMLNSIRSNS---------LQNLRFPGVLQLLAVSYFVCATIETIFMRMHSQD 207

Query: 196 NVLEPRHLSIF--TAYQW-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKC 252
           ++L+    +        W QW+      V + + T+ L VPN               +  
Sbjct: 208 DLLQFGRFTFLRDILNNWAQWLIILAIVVTHTLITFLLPVPN----------CPTGYLGP 257

Query: 253 GMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWC 308
           G   H G  P C   A GY+DR ++G +H+Y+             +P  G +        
Sbjct: 258 GGYSHFGNFPNCTGGAAGYIDRLVFG-SHMYNKT----------KNPVYGTI-------- 298

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS-MGFGLLIIAIILHFT 367
             P +PEG+++T+S IL   +G+H G +L+ +   +AR+  W+   G   +I  ++ +F 
Sbjct: 299 -LPHDPEGIMNTVSIILVVYLGVHAGKILLLYYQCNARVVRWLLWSGVTGIIAGLLCNFD 357

Query: 368 N---AIPINKQLYSFSYV----CF 384
                IP++K++ S S+V    CF
Sbjct: 358 KEGGVIPVSKKMMSLSFVLTVSCF 381


>gi|150378452|ref|NP_689632.2| heparan-alpha-glucosaminide N-acetyltransferase precursor [Homo
           sapiens]
 gi|332826066|ref|XP_519741.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Pan
           troglodytes]
 gi|194385774|dbj|BAG65262.1| unnamed protein product [Homo sapiens]
 gi|410222096|gb|JAA08267.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
 gi|410256018|gb|JAA15976.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
 gi|410299048|gb|JAA28124.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
          Length = 635

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 112/379 (29%), Positives = 180/379 (47%), Gaps = 59/379 (15%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQ-LLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           ++++  IN E G       D   G++Q     L     R+ ++D FRG+ ++LM+ V+  
Sbjct: 203 RETDRLINSELG--SPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMVFVNYG 260

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRTL 138
           GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R+ 
Sbjct: 261 GGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKIAWRSF 319

Query: 139 KLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN- 196
            L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P  
Sbjct: 320 LLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEH 373

Query: 197 -VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCG 253
              E   LS+   T+   QW+   +   +++  T+ L VP          G     +  G
Sbjct: 374 CASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP----------GCPTGYLGPG 423

Query: 254 MRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCR 309
             G  G  P C   A GY+DR L G +HLY  P  + L    ++                
Sbjct: 424 GIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEVA---------------- 467

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILH 365
             ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ +A+   
Sbjct: 468 --YDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLISVALTKV 524

Query: 366 FTNA--IPINKQLYSFSYV 382
             N   IP+NK L+S SYV
Sbjct: 525 SENEGFIPVNKNLWSLSYV 543


>gi|431902215|gb|ELK08716.1| Heparan-alpha-glucosaminide N-acetyltransferase [Pteropus alecto]
          Length = 585

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 180/381 (47%), Gaps = 59/381 (15%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDD 82
           + ++++  IN E G      Q     +L+  +L      R+  +D FRG+ ++LM+ V+ 
Sbjct: 151 NSRETDRLINSELGSPSRAGQLGDDTQLEAWRL-SAVPLRLRCVDTFRGIALILMVFVNY 209

Query: 83  AGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRT 137
            GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     K+ +R+
Sbjct: 210 GGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKVTWRS 268

Query: 138 LKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN 196
             L+  GI I+   Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P 
Sbjct: 269 FLLICIGIFIVNPNYCLGP------LSWDKLRIPGVLQRLGVTYFVVAVLELLFAKPVPE 322

Query: 197 --VLEPRHLS---IFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
              +E R  S   I +++  QW+   +   I++  T+ L VP          G     + 
Sbjct: 323 SCTVERRCSSLQDIISSWP-QWLFILMLESIWLALTFFLPVP----------GCPTGYLG 371

Query: 252 CGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
            G  G LG  P C   A GY+DR L G +HLY  P  + L                    
Sbjct: 372 PGGIGDLGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSTVLYH------------------ 413

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAII 363
            +  ++PEG+L TI++I+   +G+  G +L+++K  +     R   W     GL+ +A  
Sbjct: 414 TKVAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILIRFTAWCCF-LGLMSVAFT 472

Query: 364 LHFTNA--IPINKQLYSFSYV 382
               N   IP+NK L+S SYV
Sbjct: 473 KISENEGFIPVNKNLWSISYV 493


>gi|410332579|gb|JAA35236.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
          Length = 635

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 112/379 (29%), Positives = 180/379 (47%), Gaps = 59/379 (15%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQ-LLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           ++++  IN E G       D   G++Q     L     R+ ++D FRG+ ++LM+ V+  
Sbjct: 203 RETDRLINSELG--SPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMVFVNYG 260

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRTL 138
           GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R+ 
Sbjct: 261 GGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKIAWRSF 319

Query: 139 KLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN- 196
            L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P  
Sbjct: 320 LLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEH 373

Query: 197 -VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCG 253
              E   LS+   T+   QW+   +   +++  T+ L VP          G     +  G
Sbjct: 374 CASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP----------GCPTGYLGPG 423

Query: 254 MRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCR 309
             G  G  P C   A GY+DR L G +HLY  P  + L    ++                
Sbjct: 424 GIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEVA---------------- 467

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILH 365
             ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ +A+   
Sbjct: 468 --YDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLISVALTKV 524

Query: 366 FTNA--IPINKQLYSFSYV 382
             N   IP+NK L+S SYV
Sbjct: 525 SENEGFIPVNKNLWSLSYV 543


>gi|326918494|ref|XP_003205523.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Meleagris gallopavo]
          Length = 532

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 187/391 (47%), Gaps = 62/391 (15%)

Query: 20  EQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMIL 79
           ++ + ++++  IN E G   +          +L +   Q+  R+ +LD FRGL++++M+ 
Sbjct: 92  KKLNPRETDRLINSELGSPSTTDSISTDPSPRLWRATSQQ--RLRSLDTFRGLSLIIMVF 149

Query: 80  VDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIF 135
           V+  GG Y    H  WNG T+AD V P+F+FI+G +I+L+L  + +   +    + KI++
Sbjct: 150 VNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSLSSMLRWGSSKQKVLGKILW 209

Query: 136 RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           R+  L   G+I+       P+     +  +++R  G+LQR+ L Y+VVA +E L T R  
Sbjct: 210 RSFLLTLLGVIVVN-----PNYCLGALSWENLRIPGVLQRLGLTYLVVAALELLFT-RAV 263

Query: 196 NVLEPRHLSIFTAYQ--------W-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVK 246
           N+    HL    +Y         W QWI      VI++  T+ L VP          G  
Sbjct: 264 NISPSLHLMQEMSYPALQDVLPFWPQWIFILTLEVIWLCLTFLLPVP----------GCP 313

Query: 247 KYIVKCGMRGHLGPACN----AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRE 302
           +  +  G  G  G   N    A GY+DR + G  H+Y  P  + L   T+          
Sbjct: 314 RGYLGPGGIGDFGKYANCTGGAAGYIDRLVLGEKHIYQHPSCNVLYQTTV---------- 363

Query: 303 DAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKG-HSARLKHWVSMGFGLLIIA 361
                   P++PEG+L TI+ IL   +G+  G +++ +K  H   +  ++     + II+
Sbjct: 364 --------PYDPEGILGTINTILMAFLGLQAGKIILSYKDQHKQIMSRFLIWSLVMGIIS 415

Query: 362 IILHFTNA----IPINKQLYSFSYV----CF 384
            IL   +     IPINK L+S SYV    CF
Sbjct: 416 AILTKCSKEEGFIPINKNLWSTSYVTTMSCF 446


>gi|383849627|ref|XP_003700446.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Megachile rotundata]
          Length = 552

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 99/365 (27%), Positives = 160/365 (43%), Gaps = 61/365 (16%)

Query: 33  KEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDH 92
           K+K ++R +V D  +         Q   +RV  +D  RG + +LMI V+D  G Y  + H
Sbjct: 150 KKKCMKR-QVDDTAR---------QPVKRRVKAIDTVRGASTLLMIFVNDGSGGYKTLGH 199

Query: 93  SPWNGCTLADFVMPFFLFIVGVAIALAL-----KKVPKINGAVKKIIFRTLKLLFWGIIL 147
           + WNG    D + P F++I+GV I +AL     + VPK +  +  I+ R++ L   G+ L
Sbjct: 200 ATWNGLLPGDLLFPCFIWIMGVCIPIALGSQLKRMVPK-HVILYGILKRSVLLFLIGVSL 258

Query: 148 QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEP---RHLS 204
                   + +  G  ++ IR  G+LQR  + Y +VA+I      +RP  ++    R + 
Sbjct: 259 --------NTVGTGPQLESIRIFGVLQRFGVTYFIVAVIYLFLISKRPTKVQSPMLRDVQ 310

Query: 205 IFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS---FSEHSDHGVKKYIVKCGMRGHLGPA 261
            F     QW         + I T+ L VP            H   KY    G        
Sbjct: 311 DFLLLLPQWTVMLAIVAAHCIITFCLPVPGCPTGYLGPGGLHDDAKYFDCVG-------- 362

Query: 262 CNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTI 321
             A GY+D+ +    HL+                           +  APF+PEG+L  +
Sbjct: 363 -GAAGYIDKVVLKEQHLH----------------------HSMTVYKSAPFDPEGILGCL 399

Query: 322 SAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSY 381
           ++     +G+H G +++ +K    R+  W++       I   LHFTN IP+NK+L+S S+
Sbjct: 400 TSTFHVFLGLHAGIIMMTYKDWKERVIRWLAWAAFFSCIGCALHFTNVIPVNKKLWSLSF 459

Query: 382 VCFTA 386
           V  T 
Sbjct: 460 VFVTT 464


>gi|195340719|ref|XP_002036960.1| GM12376 [Drosophila sechellia]
 gi|194131076|gb|EDW53119.1| GM12376 [Drosophila sechellia]
          Length = 576

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 104/347 (29%), Positives = 166/347 (47%), Gaps = 55/347 (15%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           + KR+ +LD FRGL++VLMI V+  GG YA I+H+ WNG  LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADIVFPSFLWIMGVCIPL 241

Query: 119 ALKKVPKINGAVK-----KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           ++K      G+ K     +I+ R++KL   G+ L        +++S G +++ +R  G+L
Sbjct: 242 SVKSQLS-RGSSKARICLRILVRSIKLFVIGLCL--------NSMS-GPNLEQLRVMGVL 291

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY----- 228
           QR  + Y+VV ++ TL  +R P  + P+       +      G +A ++ ++ TY     
Sbjct: 292 QRFGVAYLVVGVLHTLCCRREP--ISPQRSWQRAVHDVCLFSGELAVLLALVATYLGLTF 349

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPAC--NAVGYVDRELWGINHLYSDPVWSR 286
            L VP          G   Y        +  P C   A GY D ++ G  H+Y  P    
Sbjct: 350 GLRVPGCPRGYLGPGGKHDY--------NAHPKCIGGAAGYADLQVLGNAHIYQHPTAKY 401

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
           +   T                    F+PEG+   I +++   +G   G  L+    + +R
Sbjct: 402 VYDST-------------------AFDPEGIFGCILSVVQVLLGAFAGVTLLVHPNYQSR 442

Query: 347 LKHWVSMGFGLLIIAIIL-HFTN---AIPINKQLYSFSYVCFTAGAA 389
           ++ W  +   L +I   L  F++   AIP+NK L+S S+VC T   A
Sbjct: 443 IRRWTLLAILLGLIGGALCGFSSEGGAIPVNKNLWSLSFVCVTVSLA 489


>gi|326427923|gb|EGD73493.1| heparan-alpha-glucosaminide N-acetyltransferase [Salpingoeca sp.
           ATCC 50818]
          Length = 788

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 110/407 (27%), Positives = 176/407 (43%), Gaps = 92/407 (22%)

Query: 41  EVQDEQKGELQLQQLLQ-------------QKSK-RVATLDAFRGLTVVLMILVDDAGGA 86
           E +DEQ   L  Q+  +             ++SK R+ +LD+FRG+ + +MI V+  GG 
Sbjct: 327 EQKDEQMLLLNTQKYTRDPLLSSTHAIGNPKRSKTRLQSLDSFRGMALTIMIFVNYGGGD 386

Query: 87  YARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLF 142
           Y   DHS WNG T+AD V P+F++I+G ++A+    + K +  ++ I++    RTL L  
Sbjct: 387 YNFFDHSVWNGLTVADLVFPWFIWIMGTSMAITFNSLFKRHTPLRTILYKVARRTLLLFG 446

Query: 143 WGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK---------- 192
            G+I           ++   D++  R  G+LQR A+ Y+VVAL+     K          
Sbjct: 447 IGVIF----------INVVHDLRFARVPGVLQRFAIAYLVVALVIIFVPKAVSLLRNVDE 496

Query: 193 -----RR--PNVLEP------------RHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
                RR  P V  P            RHL     Y  +WI   +  VI+   T+ L VP
Sbjct: 497 VTPLIRRLTPTVRNPASDLDPGGCGMLRHLPDVAPYVGEWIAIIVLVVIHTCITFLLPVP 556

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPAC---------NAVGYVDRELWGINHLYSDPVW 284
                     G    + + G       +C          A G+VDR L    H+Y  P  
Sbjct: 557 GCPTGYIGPGGA---LAEFGQFAPANGSCVNGTFCCEGGAAGHVDRWLLSWKHIYGSPT- 612

Query: 285 SRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS 344
                                ++    ++PEG+L ++++IL   +G+  G +++H+K   
Sbjct: 613 ------------------SQETYQTGAYDPEGILGSLTSILICYLGLQSGKIIVHYKAAR 654

Query: 345 ARLKHWVSMGFGLLIIAIIL----HFTNAIPINKQLYSFSYVCFTAG 387
           AR   W++ G     IA  L         IP++K L+S S+V   +G
Sbjct: 655 ARSVRWLAWGVLCCAIATGLCGGSKNDGVIPVSKNLWSLSFVLLMSG 701


>gi|297682811|ref|XP_002819101.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Pongo
           abelii]
          Length = 645

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 111/379 (29%), Positives = 180/379 (47%), Gaps = 59/379 (15%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQ-LLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           ++++  IN E G       D   G++Q     L     R+ ++D FRG+ ++LM+ V+  
Sbjct: 213 RETDRLINSELG--SPSRTDPLDGDVQPAMWRLSALPPRLRSVDTFRGIALILMVFVNYG 270

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRTL 138
           GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R+ 
Sbjct: 271 GGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKIAWRSF 329

Query: 139 KLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN- 196
            L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P  
Sbjct: 330 LLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEH 383

Query: 197 -VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCG 253
              E   LS+   T+   QW+   +   +++  T+ L VP          G     +  G
Sbjct: 384 CASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP----------GCPTGYLGPG 433

Query: 254 MRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCR 309
             G  G  P C   A GY+DR L G +HLY  P  + L    ++                
Sbjct: 434 GIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEVA---------------- 477

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILH 365
             ++PEG+L T+++I+   +G+  G +L+++K  +     R   W  +  GL+ +A+   
Sbjct: 478 --YDPEGILGTVNSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLISVALTKV 534

Query: 366 FTNA--IPINKQLYSFSYV 382
             N   IP+NK L+S SYV
Sbjct: 535 SENEGFIPVNKNLWSLSYV 553


>gi|198434539|ref|XP_002120178.1| PREDICTED: similar to heparan-alpha-glucosaminide
           N-acetyltransferase [Ciona intestinalis]
          Length = 624

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 108/386 (27%), Positives = 183/386 (47%), Gaps = 59/386 (15%)

Query: 28  ENGINKEKG-LERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGA 86
           E  I+++ G  E + VQ+          L+++KS+R+ ++D FRGL +V+M+ V+  GG 
Sbjct: 189 ETQIHEDLGNTEPNSVQEANP-----TPLVREKSERIKSIDTFRGLCLVVMVFVNFRGGD 243

Query: 87  YARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL-----KKVPKINGAVKKIIFRTLKLL 141
           Y    HSPW+G T+AD V P+F+FI+GV I L++     K VP    A  K+I RT+ L 
Sbjct: 244 YWFFHHSPWHGLTVADLVFPWFMFIMGVNITLSINSLITKNVPNSKIAY-KLIRRTVLLF 302

Query: 142 FWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIE--------TLTTKR 193
             G+ +          +++       R  G+LQR A+ Y +  +++         + T+ 
Sbjct: 303 GLGMFV----------VNHSTSWAAFRVPGVLQRFAIAYFLPFVLQWAFHLTPIEIETRA 352

Query: 194 RPNVLEPR---HLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVK---K 247
           + N  E +          Y  QW+       +++  T+ L +P          G+    K
Sbjct: 353 KTNEGELKWWHWCKDVVPYWLQWLIVLAMEALWLFLTFLLPIPGCPTGYLGPGGLDNDGK 412

Query: 248 YIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
           YI +  + G       A GY+DR ++G  H+Y  P    +   T +S             
Sbjct: 413 YINETCVGG-------AAGYIDRVIFGEAHIYGHPTCKNVYYPTYTSDQ----------- 454

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFT 367
            R P++PEGLL +I++ +   +G   G + +++K    R   W+   F L +I+IIL   
Sbjct: 455 -RVPYDPEGLLGSINSCIIVILGCQAGKIFLYYKHPLDRAMRWILWCFFLGVISIILCKA 513

Query: 368 NA----IPINKQLYSFSYVCFTAGAA 389
           +A    IP+NK L++ ++V   A  A
Sbjct: 514 SANGGWIPVNKNLWTTTFVTTLACMA 539


>gi|403303686|ref|XP_003942455.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Saimiri boliviensis boliviensis]
          Length = 631

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 181/382 (47%), Gaps = 61/382 (15%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSK--RVATLDAFRGLTVVLMILV 80
             ++++  IN E G       D   G++Q   + +  S   R+ ++D FRG+ ++LM+ V
Sbjct: 197 SSRETDRLINSELG--SPSRTDPLDGDVQ-SAVWRPSSPPLRLRSVDTFRGIALILMVFV 253

Query: 81  DDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIF 135
           +  GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +
Sbjct: 254 NYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGTSIFLSMTSIMQ-RGCSKFRLLGKIAW 312

Query: 136 RTLKLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRR 194
           R+  L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K  
Sbjct: 313 RSFLLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPV 366

Query: 195 PN--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIV 250
           P     E   LS+   T+   QW+       +++  T+ L VP          G     +
Sbjct: 367 PEHCASERSCLSLQDITSSWPQWLLILALEGLWLGLTFLLPVP----------GCPTGYL 416

Query: 251 KCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPS 306
             G  G  G  P C   A GY+DR L G NHLY  P  + L    ++             
Sbjct: 417 GPGGIGDFGKYPNCTGGAAGYIDRLLLGDNHLYQHPSSAVLYHTEVA------------- 463

Query: 307 WCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAI 362
                ++PEG+L TI++IL   +G+  G +L+++K  +     R   W  +  GL+ +A+
Sbjct: 464 -----YDPEGILGTINSILMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLISVAL 517

Query: 363 ILHFTNA--IPINKQLYSFSYV 382
                N   IP+NK L+S SYV
Sbjct: 518 TKVSENEGFIPVNKNLWSLSYV 539


>gi|417411833|gb|JAA52338.1| Putative heparan-alpha-glucosaminide n-acetyltransferase, partial
           [Desmodus rotundus]
          Length = 595

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 112/380 (29%), Positives = 182/380 (47%), Gaps = 61/380 (16%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSK--RVATLDAFRGLTVVLMILVDD 82
           ++++  IN E G   S  +  Q G+    ++ +  +   R+  +D FRGL ++LM+ V+ 
Sbjct: 163 RETDRLINSELG---SPSRAGQLGDDAQSEVWRPAAAPLRLRCVDTFRGLALILMVFVNY 219

Query: 83  AGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRT 137
            GG Y    H+ WNG T+ADFV P+F+FI+G +I L++  V +  G  K     K+ +R+
Sbjct: 220 GGGQYWYFKHASWNGLTVADFVFPWFVFIMGSSIFLSMSSVLQ-RGCSKFRLLGKVAWRS 278

Query: 138 LKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN- 196
             L+  G+I+       P+     +    +R  G+LQR+ + Y VVA++E L  K  P  
Sbjct: 279 FLLICIGVIVVN-----PNYCLGPLSWDKVRLPGVLQRLGVTYFVVAVLELLFAKPVPER 333

Query: 197 -VLEPRHLS---IFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKC 252
              E R  S   I +++  QW+   +   I++  T+ L VP          G     +  
Sbjct: 334 GAWEGRCSSLQDIMSSWP-QWLFILMLESIWLALTFFLPVP----------GCPTGYLGP 382

Query: 253 GMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWC 308
           G  G LG  P C   A GY+DR L G +H+Y  P  + L                     
Sbjct: 383 GGIGDLGKYPNCTGGAAGYIDRLLLGEDHIYQHPSSTVLYH------------------T 424

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIIL 364
           R  ++PEG+L TI++I+   +G+  G +L+++K  +     R   W     GL+ + +  
Sbjct: 425 RVAYDPEGILGTINSIVMAFLGVQAGKILLYYKEQTKDILIRFTAWCCF-LGLISVGLTK 483

Query: 365 HFTNA--IPINKQLYSFSYV 382
              N   IP+NK L+S SYV
Sbjct: 484 ISENEGFIPVNKNLWSISYV 503


>gi|397505549|ref|XP_003823319.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform
           1 [Pan paniscus]
          Length = 585

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 180/381 (47%), Gaps = 59/381 (15%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQL-LQQKSKRVATLDAFRGLTVVLMILVD 81
             ++++  IN E G       D   G++Q     L     R+ ++D FRG+ ++LM+ V+
Sbjct: 151 SSRETDRLINSELG--SPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMVFVN 208

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFR 136
             GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R
Sbjct: 209 YGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKIAWR 267

Query: 137 TLKLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           +  L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P
Sbjct: 268 SFLLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVP 321

Query: 196 N--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
                E   LS+   T+   QW+   +   +++  T+ L VP          G     + 
Sbjct: 322 EHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP----------GCPTGYLG 371

Query: 252 CGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
            G  G  G  P C   A GY+DR L G +HLY  P  + L    ++              
Sbjct: 372 PGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEVA-------------- 417

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAII 363
               ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ +A+ 
Sbjct: 418 ----YDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLISVALT 472

Query: 364 LHFTNA--IPINKQLYSFSYV 382
               N   IP+NK L+S SYV
Sbjct: 473 KVSENEGFIPVNKNLWSLSYV 493


>gi|51491261|emb|CAH18694.1| hypothetical protein [Homo sapiens]
          Length = 459

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 180/381 (47%), Gaps = 59/381 (15%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQL-LQQKSKRVATLDAFRGLTVVLMILVD 81
             ++++  IN E G       D   G++Q     L     R+ ++D FRG+ ++LM+ V+
Sbjct: 25  SSRETDRLINSELG--SPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMVFVN 82

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFR 136
             GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R
Sbjct: 83  YGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKIAWR 141

Query: 137 TLKLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           +  L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P
Sbjct: 142 SFLLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVP 195

Query: 196 N--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
                E   LS+   T+   QW+   +   +++  T+ L VP          G     + 
Sbjct: 196 EHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP----------GCPTGYLG 245

Query: 252 CGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
            G  G  G  P C   A GY+DR L G +HLY  P  + L    ++              
Sbjct: 246 PGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEVA-------------- 291

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAII 363
               ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ +A+ 
Sbjct: 292 ----YDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLISVALT 346

Query: 364 LHFTNA--IPINKQLYSFSYV 382
               N   IP+NK L+S SYV
Sbjct: 347 KVSENEGFIPVNKNLWSLSYV 367


>gi|66826507|ref|XP_646608.1| hypothetical protein DDB_G0270192 [Dictyostelium discoideum AX4]
 gi|60474509|gb|EAL72446.1| hypothetical protein DDB_G0270192 [Dictyostelium discoideum AX4]
          Length = 426

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 107/376 (28%), Positives = 167/376 (44%), Gaps = 75/376 (19%)

Query: 40  SEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYA--RIDHSPWNG 97
           + +   +  + ++ + L+   +R+ +LDA RGLT+  MILVD+  G      ++ + WNG
Sbjct: 23  THINHGKDTKTKIIEYLKPVQRRMGSLDAVRGLTIFGMILVDNQAGNDVIWPLNETEWNG 82

Query: 98  CTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDA 157
            + AD + P F+FI G +IALALK           II RTL L F    L     H    
Sbjct: 83  LSTADLIFPSFIFISGFSIALALKNSKNTTSTWYGIIRRTLLLFFIQCFLNLMGDH---- 138

Query: 158 LSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                +    R  G+LQRIA+ Y        L+    P  L+   L   T          
Sbjct: 139 ----FNFTTFRIMGVLQRIAICY----FFSCLSFLCFPIFLQRLFLLSVT---------- 180

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
              V YI   Y+L VP                 KCG R +L   CNA  Y+D +++G+N 
Sbjct: 181 ---VTYISIMYALNVP-----------------KCG-RANLTQNCNAGAYIDSKVFGLNI 219

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +    +             +GP   D          PEGL+ST+S+ ++  +G+ +G + 
Sbjct: 220 MKESNL-------------NGPYYND----------PEGLISTMSSFITAWMGLEFGRIF 256

Query: 338 IHFK-----GHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGC 392
             F      G++  +  W+ +    ++ AI L  T  +P NK+++SFS+  FT GA+ G 
Sbjct: 257 TRFYKKHDFGNTDIIVRWILLVILFMVPAISLGAT-VMPFNKKIWSFSFALFTVGAS-GS 314

Query: 393 LGITNSIFIPEMDWDE 408
           L +   I I  +DW+ 
Sbjct: 315 LILIAFILIDVIDWES 330


>gi|24639786|ref|NP_572198.1| CG6903 [Drosophila melanogaster]
 gi|7290544|gb|AAF45996.1| CG6903 [Drosophila melanogaster]
 gi|21483396|gb|AAM52673.1| LD22376p [Drosophila melanogaster]
          Length = 576

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 103/347 (29%), Positives = 164/347 (47%), Gaps = 55/347 (15%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           + KR+ +LD FRG+++VLMI V+  GG YA I+H+ WNG  LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGISIVLMIFVNSGGGGYAWIEHAAWNGLHLADVVFPSFLWIMGVCIPL 241

Query: 119 ALKKVPKINGAVK-----KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           ++K      G+ K     +I++R++KL   G+ L        +++S G +++ +R  G+L
Sbjct: 242 SVKSQLS-RGSSKARICLRILWRSIKLFVIGLCL--------NSMS-GPNLEQLRIMGVL 291

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY----- 228
           QR  + Y+VVA++ TL  +R P  + P+       +      G +A ++ ++ TY     
Sbjct: 292 QRFGVAYLVVAILHTLCCRREP--ISPQRSWQRAVHDVCLFSGELAVLLALVATYLGLTF 349

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPAC--NAVGYVDRELWGINHLYSDPVWSR 286
            L VP          G   Y        +  P C   A GY D ++ G  H+Y  P    
Sbjct: 350 GLRVPGCPRGYLGPGGKHDY--------NAHPKCIGGAAGYADLQVLGNAHIYQHPTAKY 401

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
           +   T                    F+PEG+   I +++   +G   G  L+      +R
Sbjct: 402 VYDST-------------------AFDPEGIFGCILSVVQVLLGAFAGVTLLVHPNFQSR 442

Query: 347 LKHWVSMGFGLLIIAIIL----HFTNAIPINKQLYSFSYVCFTAGAA 389
           ++ W  +   L +I   L        AIP+NK L+S S+VC T   A
Sbjct: 443 IRRWTLLAILLGLIGGALCGFSREGGAIPMNKNLWSLSFVCVTVSLA 489


>gi|348577435|ref|XP_003474490.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Cavia porcellus]
          Length = 638

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 112/385 (29%), Positives = 174/385 (45%), Gaps = 59/385 (15%)

Query: 18  EQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLM 77
            QE D   +S+ G      L   E Q E +        L     R+  LD FRG+ ++LM
Sbjct: 201 SQETDRLINSDLGSPSRADLLTGEPQPETR-------CLPAPGYRLRCLDTFRGIALILM 253

Query: 78  ILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----K 132
           + V+  GG Y    HS WNG T+AD V P+F+FI+G ++ L++  V +  G  K     K
Sbjct: 254 VFVNYGGGRYWYFRHSSWNGLTVADLVFPWFVFIMGSSVFLSVTSVLQ-RGCSKLKLLGK 312

Query: 133 IIFRTLKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTT 191
           I +R+  L+  GI I+   Y   P      +    +R  G+LQR+ + Y VVA++E L T
Sbjct: 313 IAWRSFLLICIGIVIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFT 366

Query: 192 KR-RPNVLEPRHLSIF---TAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK 247
           K    N +  R        T    QW+   +   +++  T+ L VP   +     +    
Sbjct: 367 KPVHENCVSDRRFPFLRDITCSWPQWLLILLLESLWLGLTFLLPVPGCPYVSEPGY---- 422

Query: 248 YIVKCGMRGHLGPACN----AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRED 303
             +  G  G LG   N    A GY+D  L G +HLY  P  + L                
Sbjct: 423 --LGPGGIGDLGKYVNCTGGAAGYIDHLLLGSDHLYQHPSSAVLYH-------------- 466

Query: 304 APSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLI 359
                +  ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  G++ 
Sbjct: 467 ----TKVAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILIRFTAWCCV-LGVIS 521

Query: 360 IAIILHFTNA--IPINKQLYSFSYV 382
            A+     N   IP+NK L+S SYV
Sbjct: 522 AALTKMSENEGFIPVNKNLWSISYV 546


>gi|350412149|ref|XP_003489557.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Bombus impatiens]
          Length = 571

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 106/378 (28%), Positives = 187/378 (49%), Gaps = 65/378 (17%)

Query: 24  GKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           GK S N +  +  L R + ++     +++ +     S R+ ++DAFRG+ ++LMI V++ 
Sbjct: 149 GKLSPNNVYDD--LNRLQEENTTHPVIRVTKF----SSRIQSVDAFRGIAILLMIFVNNG 202

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF----RTLK 139
           GG Y   +HS W G ++AD ++P+F +I+G++I ++ +   ++  +  KI      R+  
Sbjct: 203 GGKYVFFNHSAWFGLSVADLILPWFAWIMGMSITISKRAELRLTTSRVKITLCCLRRSAI 262

Query: 140 LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN--V 197
           L+  G++L    S +         +  +R+ GILQ +A+ Y V A++ET+  K      +
Sbjct: 263 LILLGLMLNSIDSKS---------LNDLRFPGILQLLAVSYFVCAILETIFMKPHSQDIL 313

Query: 198 LEPRHLSIFTAY-----QWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKC 252
           L+    +IF        QW  + G +    + + T+ L++PN         G   Y    
Sbjct: 314 LQFGRFAIFRDILDSWPQWLIMAGIMT--THTLITFFLHMPNCPTGYFGPGGKYHY---- 367

Query: 253 GMRG-HLGPACNAVGYVDRELWGINHLYS---DPVWSRLEACTLSSPNSGPLREDAPSWC 308
             RG ++     A GY+DR ++G NH YS   D ++ ++            LR D     
Sbjct: 368 --RGKYMNCTAGAAGYIDRLIFG-NHTYSKIKDSIYGQI------------LRYD----- 407

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG-FGLLIIAIILHFT 367
                PEGL++TISAI    +G+H G +L+ +   +ARL  W     F  +I  I+ +F 
Sbjct: 408 -----PEGLMNTISAIFIVYLGVHAGKILLLYYQGNARLIRWFLWAIFTGIIAGILCNFE 462

Query: 368 N---AIPINKQLYSFSYV 382
           N    IP++K++ + SYV
Sbjct: 463 NEGGVIPVSKRMMTLSYV 480


>gi|332241088|ref|XP_003269721.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
           [Nomascus leucogenys]
          Length = 654

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 111/380 (29%), Positives = 181/380 (47%), Gaps = 57/380 (15%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQ-LLQQKSKRVATLDAFRGLTVVLMILVD 81
             ++++  IN E G   S + D   G++Q     L     R+ ++D FRG+ ++LM+ V+
Sbjct: 220 SSRETDRLINSELG-SPSRI-DSLDGDVQPALWRLSALPPRLRSVDTFRGIALILMVFVN 277

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFR 136
             GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R
Sbjct: 278 YGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMASILQ-RGCSKFRLLGKIAWR 336

Query: 137 TLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN 196
           +  L+  GII+       P+     +    +R  G+LQR+ + Y VVA++E L  K  P 
Sbjct: 337 SFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPE 391

Query: 197 --VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKC 252
               E   LS+   T+   QW+   +   +++  T+ L VP          G+       
Sbjct: 392 HCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPGCPIGYLGPGGI------- 444

Query: 253 GMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWC 308
              G  G  P C   A GY+DR L G +HLY  P  + L    ++               
Sbjct: 445 ---GDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEVA--------------- 486

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIIL 364
              ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ +A+  
Sbjct: 487 ---YDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLISVALTK 542

Query: 365 HFTNA--IPINKQLYSFSYV 382
              N   IP+NK L+S SYV
Sbjct: 543 VSENERFIPVNKNLWSLSYV 562


>gi|363733262|ref|XP_420455.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Gallus
           gallus]
          Length = 581

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 106/378 (28%), Positives = 178/378 (47%), Gaps = 40/378 (10%)

Query: 20  EQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMIL 79
           ++ + ++++  IN E G   +          +L +      +R+ +LD FRGL++++M+ 
Sbjct: 145 KKLNPRETDRLINSELGSPSTTDSPSSDPSPRLWR--ATSRQRLRSLDTFRGLSLIIMVF 202

Query: 80  VDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK 139
           V+  GG Y    H  WNG T+AD V P+F+FI+G +I+L+L    +   + +K++++ L 
Sbjct: 203 VNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSLSSTLRWGSSKQKVLWKILW 262

Query: 140 LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNV-- 197
             F  +IL G     P+     +  +++R  G+LQR+ L Y+VVA +E L T+   +   
Sbjct: 263 RSFL-LILLGVIVVNPNYCLGALSWENLRIPGVLQRLGLTYLVVAALELLFTRTGADSGT 321

Query: 198 --LEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMR 255
             +    L     +  QWI   +  VI++  T+ L VP          G+  +       
Sbjct: 322 LEMSCPALQDILPFWPQWIFILMLEVIWLCLTFLLPVPGCPRGYLGPGGIGDF------G 375

Query: 256 GHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPE 315
            +L     A GY+DR + G  H+Y  P  + L   T+                  P++PE
Sbjct: 376 NYLNCTGGAAGYIDRLVLGEKHIYQHPSCNVLYQTTV------------------PYDPE 417

Query: 316 GLLSTISAILSGTIGIHYGHVLIHFKGHS----ARLKHW-VSMGFGLLIIAIILHFTNAI 370
           G+L TI+ IL   +G+  G +++ +K       +R   W V MG    I+         I
Sbjct: 418 GILGTINTILMAFLGLQAGKIILSYKDQHKQIMSRFFIWSVVMGIISAILTKCSKEEGFI 477

Query: 371 PINKQLYSFSYV----CF 384
           PINK L+S SYV    CF
Sbjct: 478 PINKNLWSTSYVTTMSCF 495


>gi|158294726|ref|XP_315774.3| AGAP005761-PA [Anopheles gambiae str. PEST]
 gi|157015699|gb|EAA10745.3| AGAP005761-PA [Anopheles gambiae str. PEST]
          Length = 581

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 104/389 (26%), Positives = 165/389 (42%), Gaps = 56/389 (14%)

Query: 4   LRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRV 63
           +R+   GL R++    +Q     +  G   E     +E  D +  +           KR+
Sbjct: 147 VRMARYGLRRSRASGFQQPAASHTAAG--DETAHSSNEETDRKPHQTGPGSGQVTPKKRL 204

Query: 64  ATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKV 123
            +LD FRG+ ++LMI V+  GG Y  I+H+ WNG  +AD V P+FLFI+GV + ++L+  
Sbjct: 205 QSLDTFRGIAIMLMIFVNSGGGHYWWIEHATWNGLHVADLVFPWFLFIMGVCVPISLRGQ 264

Query: 124 PKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVV 183
              N        R++KL   G+ L             G  M ++R  G+LQR  + Y+VV
Sbjct: 265 LNRN--------RSVKLFIIGLCLNS---------MNGPSMANLRIFGVLQRFGIAYLVV 307

Query: 184 ALIETLTTKRRPNVLEPRHL----SIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSE 239
           + +  L  +++  V     L          + QW+   +  V+Y++  + +  P    + 
Sbjct: 308 STVHLLCHEQQVQVQSQNRLLRASEDIVRLKKQWLVIGLLTVLYLVVMFFVPAPGCPSAY 367

Query: 240 HSDHGVKKYIVKCGMRGHLGPACNA--VGYVDRELWGINHLYSDPVWSRLEACTLSSPNS 297
               G   Y        +  P C     GY+DR L GI HLY  P    +          
Sbjct: 368 FGPGGKHLY--------NAFPNCTGGITGYIDRALLGIAHLYQHPTARYV---------- 409

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL 357
                    +   PF+PEG    +  IL   +G+  G  ++ +  H  R+  + S    L
Sbjct: 410 ---------YDGMPFDPEGPFGCLPTILQVFLGLQCGCTILAYTEHRQRMVRFASWSLVL 460

Query: 358 -LIIAIILHFTNA---IPINKQLYSFSYV 382
            L    +  FT     IPINK L+S SYV
Sbjct: 461 GLAAGALCGFTKNDGWIPINKNLWSLSYV 489


>gi|426256612|ref|XP_004021932.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Ovis
           aries]
          Length = 641

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 109/376 (28%), Positives = 172/376 (45%), Gaps = 53/376 (14%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDD 82
           + ++++  IN E G   S   D Q    +          R+  +D FRG+ ++LM+ V+ 
Sbjct: 211 NSRETDRLINSELG-SPSRASDPQPEAWRRSA----APLRLRCVDTFRGMALILMVFVNY 265

Query: 83  AGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRT 137
            GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI++R+
Sbjct: 266 GGGKYWYFKHSSWNGLTVADLVFPWFVFIMGASIFLSMASILQ-RGCSKLRLLGKIVWRS 324

Query: 138 LKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN 196
             L+  GI ++   Y   P      +  +  R  G+LQR+   Y VVA++E L  K  P 
Sbjct: 325 FLLICIGIFVVNPNYCLGP------LSWEKARIPGVLQRLGATYFVVAVLELLFAKPVPE 378

Query: 197 V--LEPRHLSIF--TAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKC 252
              LE    S+   TA   QW+   I   +++  T+ L VP          G        
Sbjct: 379 TCALERSCFSLLDITASWPQWLFVLILEGVWLALTFFLPVPG------CPTGYLGPGGIG 432

Query: 253 GMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPF 312
               +      A GYVDR L G  HLY  P  + L    ++                  +
Sbjct: 433 DGGRYRNCTGGAAGYVDRLLLGDQHLYQHPSSAVLYHTEVA------------------Y 474

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILHFTN 368
           +PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ +A+     N
Sbjct: 475 DPEGILGTINSIVMAFLGVQAGKILLYYKDQTRGILIRFAAWGCL-LGLVSVALTKASEN 533

Query: 369 A--IPINKQLYSFSYV 382
              IP+NK L+S SYV
Sbjct: 534 EGFIPVNKNLWSVSYV 549


>gi|410956346|ref|XP_003984803.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Felis
           catus]
          Length = 629

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 178/382 (46%), Gaps = 61/382 (15%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSK-RVATLDAFRGLTVVLMILVD 81
           + ++++  IN E G       D   G+ Q        S  R+  +D FRG+ ++LM+ V+
Sbjct: 195 NSRETDRLINSELG--SPSRADSLGGDTQPGGWCPPASPPRLRCVDTFRGIALILMVFVN 252

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFR 136
             GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R
Sbjct: 253 YGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKLKLMGKIGWR 311

Query: 137 TLKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           +  L+  G+ I+   Y   P      +    +R  G+LQR+ + Y VVA++E +  K  P
Sbjct: 312 SFLLICIGMFIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAMLELIFAKPVP 365

Query: 196 NVLEPRHLSIFT----AYQW-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIV 250
                   S F+     + W QW+   +   I++  T+ L VP          G     +
Sbjct: 366 ESCASER-SCFSLRDIIFSWPQWLFILMLESIWLGLTFFLPVP----------GCPTGYL 414

Query: 251 KCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPS 306
             G  G LG  P C   A GY+DR L G +H+Y  P  + L    ++             
Sbjct: 415 GPGGIGDLGKYPNCTGGAAGYIDRLLLGDDHIYQHPSSAVLYHTEVA------------- 461

Query: 307 WCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAI 362
                ++PEG+L TI++I+   +G+  G +L+++K  +     R   W     GL+ IA+
Sbjct: 462 -----YDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILIRFTAWCCF-LGLISIAL 515

Query: 363 ILHFTNA--IPINKQLYSFSYV 382
                N   IPINK L+S SYV
Sbjct: 516 TKISENEGFIPINKNLWSISYV 537


>gi|417411831|gb|JAA52337.1| Putative heparan-alpha-glucosaminide n-acetyltransferase, partial
           [Desmodus rotundus]
          Length = 595

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 111/380 (29%), Positives = 181/380 (47%), Gaps = 61/380 (16%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSK--RVATLDAFRGLTVVLMILVDD 82
           ++++  IN E G   S  +  Q G+    ++ +  +   R+  +D FRGL ++LM+ V+ 
Sbjct: 163 RETDRLINSELG---SPSRAGQLGDDAQSEVWRPAAAPLRLRCVDTFRGLALILMVFVNY 219

Query: 83  AGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRT 137
            GG Y    H+ WNG T+AD V P+F+FI+G +I L++  V +  G  K     K+ +R+
Sbjct: 220 GGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMSSVLQ-RGCSKFRLLGKVAWRS 278

Query: 138 LKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN- 196
             L+  G+I+       P+     +    +R  G+LQR+ + Y VVA++E L  K  P  
Sbjct: 279 FLLICIGVIVVN-----PNYCLGPLSWDKVRLPGVLQRLGVTYFVVAVLELLFAKPVPER 333

Query: 197 -VLEPRHLS---IFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKC 252
              E R  S   I +++  QW+   +   I++  T+ L VP          G     +  
Sbjct: 334 GAWEGRCSSLQDIMSSWP-QWLFILMLESIWLALTFFLPVP----------GCPTGYLGP 382

Query: 253 GMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWC 308
           G  G LG  P C   A GY+DR L G +H+Y  P  + L                     
Sbjct: 383 GGIGDLGKYPNCTGGAAGYIDRLLLGEDHIYQHPSSTVLYH------------------T 424

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIIL 364
           R  ++PEG+L TI++I+   +G+  G +L+++K  +     R   W     GL+ + +  
Sbjct: 425 RVAYDPEGILGTINSIVMAFLGVQAGKILLYYKEQTKDILIRFTAWCCF-LGLISVGLTK 483

Query: 365 HFTNA--IPINKQLYSFSYV 382
              N   IP+NK L+S SYV
Sbjct: 484 ISENEGFIPVNKNLWSISYV 503


>gi|345481194|ref|XP_001603332.2| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
           N-acetyltransferase-like [Nasonia vitripennis]
          Length = 570

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 97/375 (25%), Positives = 182/375 (48%), Gaps = 57/375 (15%)

Query: 26  DSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG 85
           +S N I++   L+ SE   E          + + + R+  LDAFRG+ V+LMI V++ GG
Sbjct: 157 ESHNDIDR---LQESESTPEMVA-------VSKTAMRLQALDAFRGIAVLLMIFVNNGGG 206

Query: 86  AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLL 141
            Y  ++H+ WNG T+AD V+P+F + +G  I  +++   +++ +  ++I     RT+ L+
Sbjct: 207 EYVFLNHAAWNGLTVADLVLPWFAWAMGFTIVNSVRVHLRVSVSRTRLIIMQLRRTVLLI 266

Query: 142 FWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPR 201
            +G+ +   ++           +  +R+ G+LQ +A+ Y + ++IET     +      R
Sbjct: 267 LFGLFINSQHNST---------LSELRFPGVLQLLAVAYFICSVIETCLASPQRTFQFGR 317

Query: 202 HLSIFTAYQ-W-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLG 259
            + +    + W QW+   +  +++   T+ L+VP          G   Y +     G   
Sbjct: 318 FVFLQDILERWTQWMVVLVIILVHTCITFFLHVPGCPRGYLGPGGYHHYGLNVNCTG--- 374

Query: 260 PACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLS 319
               A GY+DR ++G  H+Y   +          +P  GP           P +PEGL++
Sbjct: 375 ---GAAGYIDRLIFG-QHMYQKTM----------NPVYGP---------TLPHDPEGLMN 411

Query: 320 TISAILSGTIGIHYGHVLIHFKGHSARLKHWVS--MGFGLLIIAIILHFTNA---IPINK 374
           TISA+L   +G+  G + + +   ++R+  W +  +  GLL   I+ +F+     +P+NK
Sbjct: 412 TISAVLIVFMGVQAGRIFVTYYQANSRIIRWFAWFVATGLL-AGILCNFSQEKGWVPVNK 470

Query: 375 QLYSFSYVCFTAGAA 389
            + S S+V  T+  A
Sbjct: 471 NMMSLSFVLCTSSFA 485


>gi|91079154|ref|XP_966977.1| PREDICTED: similar to CG6903 CG6903-PA [Tribolium castaneum]
          Length = 533

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 100/352 (28%), Positives = 159/352 (45%), Gaps = 65/352 (18%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +  KR+ +LD FRG+++V+MI V+   G Y  +DH+ WNG  LAD V P+F++I+G  + 
Sbjct: 175 EGKKRLKSLDTFRGISIVIMIFVNYGSGGYPVLDHATWNGLHLADLVFPWFMWIMGACMP 234

Query: 118 LALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           ++L    K   + K I      R++KL   G+ L  G             ++ +R  G+L
Sbjct: 235 ISLTSSFKKQISNKDIFLNVLKRSIKLFCLGVFLNAGPY-----------LECMRIFGVL 283

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QR  + Y+VV  I     KR  +  + +    FT     W G  +  +I+ +    L++ 
Sbjct: 284 QRFGICYLVVTTICLFLMKREFSESKHKIGKFFTDILVLWKGWIVVLIIFFVHCMFLFLL 343

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPA----------CN--AVGYVDRELWGINHLYSD 281
                  +D G          RG+LGP           C   A GY+D  + G NH Y  
Sbjct: 344 -------ADEGCP--------RGYLGPGGLHENGKHFNCTGGATGYIDAVILG-NHRYQK 387

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK 341
           P    +   T                    F+PEG+L  +++I+   IG+  G  L+ +K
Sbjct: 388 PTSKEIYLGT------------------QAFDPEGILGCLTSIVHVFIGVQAGITLLVYK 429

Query: 342 GHSARLKHWVSMG-FGLLIIAIILHFTNA---IPINKQLYSFSYVCFTAGAA 389
            HSARL  W+S      ++   +  F+     IP+NK L+S S+V  T+  A
Sbjct: 430 EHSARLIRWLSWSVLAGIVGGALCGFSKEDGLIPVNKNLWSISFVLVTSCFA 481


>gi|355697915|gb|EHH28463.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial [Macaca
           mulatta]
          Length = 596

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 110/380 (28%), Positives = 177/380 (46%), Gaps = 57/380 (15%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQL-LQQKSKRVATLDAFRGLTVVLMILVD 81
             ++++  IN E G       D   G++Q     L     R+ ++D FRG+ ++LM+ V+
Sbjct: 162 SSRETDRLINSELG--SPSRTDPLDGDVQPAVWHLSVPPPRLRSVDTFRGIALILMVFVN 219

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFR 136
             GG Y    H+ WNG T+AD V P+F+FI+G +I L++  V +  G  K     KI +R
Sbjct: 220 YGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSVLQ-RGCSKFRLLGKIAWR 278

Query: 137 TLKLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           +  L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P
Sbjct: 279 SFLLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVP 332

Query: 196 N--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
                E   LS+   T+   QW+       +++  T+ L VP          G     + 
Sbjct: 333 EHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP----------GCPTGYLG 382

Query: 252 CGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
            G  G  G  P C   A GY+DR L G +HLY  P  + L    ++              
Sbjct: 383 PGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSTVLYHTEVA-------------- 428

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVS-MGFGLLIIAI 362
               ++PEG+L TI++I+   +G+  G +L+++K  +     R   W   +G   +++  
Sbjct: 429 ----YDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILIRFTAWCCILGLISVVLTK 484

Query: 363 ILHFTNAIPINKQLYSFSYV 382
           +      IP+NK L+S SYV
Sbjct: 485 VSENEGFIPVNKNLWSLSYV 504


>gi|301765942|ref|XP_002918389.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Ailuropoda melanoleuca]
          Length = 851

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 112/383 (29%), Positives = 177/383 (46%), Gaps = 63/383 (16%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQL-LQQKSKRVATLDAFRGLTVVLMILVD 81
           + ++++  IN E G           G+ QL+       S R+  +D FRG+ ++LM+ V+
Sbjct: 417 NSRETDRLINSELG--SPSRAGSIGGDAQLEAWHPPAASPRLRCVDTFRGIALILMVFVN 474

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFR 136
             GG Y    HS WNG T+AD V P+F+FI+G ++ L++  V +  G  K     KI +R
Sbjct: 475 YGGGRYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLSMTSVLQ-RGCSKFKLLGKIAWR 533

Query: 137 TLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           +  L+  G +I+   Y   P      +    +R  G+LQR+ + Y VVA++E +  K  P
Sbjct: 534 SFLLICIGVVIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELIFAKPVP 587

Query: 196 NVLEPRHLS------IFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYI 249
                          IF+  QW +I   +   I++  T+ L VP          G     
Sbjct: 588 ESCASERGCFSLRDIIFSWPQWLFI--LMLESIWLGLTFFLPVP----------GCPTGY 635

Query: 250 VKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAP 305
           +  G  G  G  P C   A GY+DR L G +H+Y  P  + L    ++            
Sbjct: 636 LGPGGIGDWGKYPNCTGGAAGYIDRLLLGDDHIYQHPSSAVLYHTEVA------------ 683

Query: 306 SWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIA 361
                 ++PEG+L TI++I+   +GI  G +L+++K  +     R   W     GL+ +A
Sbjct: 684 ------YDPEGILGTINSIVMAFLGIQAGKILLYYKDQTKDILIRFTAWCCF-LGLISVA 736

Query: 362 IILHFTNA--IPINKQLYSFSYV 382
           +     N   IPINK L+S SYV
Sbjct: 737 LTKVSENEGFIPINKNLWSISYV 759


>gi|321474731|gb|EFX85695.1| hypothetical protein DAPPUDRAFT_309035 [Daphnia pulex]
          Length = 588

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 106/365 (29%), Positives = 163/365 (44%), Gaps = 72/365 (19%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           ++KS R+ +LD FRG+T+VLMI V+D  G Y    H+ WNG  LAD + P+F++I+GV +
Sbjct: 189 KKKSSRLKSLDTFRGITIVLMIFVNDGAGQYFIFQHATWNGLQLADVIFPWFMWIMGVCM 248

Query: 117 ALAL-----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
            ++L     +K  K+      I+ R+  L F GI+         ++L   VD+  +R  G
Sbjct: 249 PISLRSSLRRKESKLT-IFAGILRRSCLLFFLGIM--------NNSLGGPVDLGRLRVPG 299

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEP----RHLSIFTAYQW-QWIGGFIAFVIYIIT 226
           +LQR A+ Y+ V     L T    +   P    R L       W QWI   +    +   
Sbjct: 300 VLQRFAITYLAVGTAGLLLTPADLSAPHPSSKARKLFQDIVVLWPQWILFLLLVAAHCFI 359

Query: 227 TYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPA-----------C--NAVGYVDRELW 273
           T+ L V                   C + G+LGPA           C   A GY+DR + 
Sbjct: 360 TFFLPVEE----------------GCPV-GYLGPAGLHLDNAYPGHCIGGAAGYIDRLML 402

Query: 274 GINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHY 333
            + H+++ P        T+    SG            P++PEG+L ++       +G   
Sbjct: 403 SVQHIFNKPT-------TIGVYGSG------------PYDPEGILGSMLCTFQVFLGAQA 443

Query: 334 GHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNA----IPINKQLYSFSYVCFTAGAA 389
           G  L+ F G  +RL  W++      +I  +L   +     IP+NK L+S S+V  T G A
Sbjct: 444 GMTLLIFSGWKSRLIRWLAWSVLTGLIGALLCLASQNDGWIPVNKNLWSLSFVLVTTGLA 503

Query: 390 DGCLG 394
              LG
Sbjct: 504 FFLLG 508


>gi|380789677|gb|AFE66714.1| heparan-alpha-glucosaminide N-acetyltransferase precursor [Macaca
           mulatta]
 gi|383410547|gb|AFH28487.1| heparan-alpha-glucosaminide N-acetyltransferase [Macaca mulatta]
 gi|384945386|gb|AFI36298.1| heparan-alpha-glucosaminide N-acetyltransferase [Macaca mulatta]
          Length = 635

 Score =  124 bits (311), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 109/380 (28%), Positives = 177/380 (46%), Gaps = 57/380 (15%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQL-LQQKSKRVATLDAFRGLTVVLMILVD 81
             ++++  IN E G       D   G++Q     L     R+ ++D FRG+ ++LM+ V+
Sbjct: 201 SSRETDRLINSELG--SPSRTDPLDGDVQPAVWHLSVPPPRLRSVDTFRGIALILMVFVN 258

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFR 136
             GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R
Sbjct: 259 YGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKIAWR 317

Query: 137 TLKLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           +  L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P
Sbjct: 318 SFLLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVP 371

Query: 196 N--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
                E   LS+   T+   QW+       +++  T+ L VP          G     + 
Sbjct: 372 EHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP----------GCPTGYLG 421

Query: 252 CGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
            G  G  G  P C   A GY+DR L G +HLY  P  + L    ++              
Sbjct: 422 PGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSTVLYHTEVA-------------- 467

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVS-MGFGLLIIAI 362
               ++PEG+L TI++I+   +G+  G +L+++K  +     R   W   +G   +++  
Sbjct: 468 ----YDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILIRFTAWCCILGLISVVLTK 523

Query: 363 ILHFTNAIPINKQLYSFSYV 382
           +      IP+NK L+S SYV
Sbjct: 524 VSENEGFIPVNKNLWSLSYV 543


>gi|355779672|gb|EHH64148.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial [Macaca
           fascicularis]
          Length = 596

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 109/380 (28%), Positives = 177/380 (46%), Gaps = 57/380 (15%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQL-LQQKSKRVATLDAFRGLTVVLMILVD 81
             ++++  IN E G       D   G++Q     L     R+ ++D FRG+ ++LM+ V+
Sbjct: 162 SSRETDRLINSELG--SPSRTDPLDGDVQPAVWHLSVPPPRLRSVDTFRGIALILMVFVN 219

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFR 136
             GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R
Sbjct: 220 YGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKIAWR 278

Query: 137 TLKLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           +  L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P
Sbjct: 279 SFLLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVP 332

Query: 196 N--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
                E   LS+   T+   QW+       +++  T+ L VP          G     + 
Sbjct: 333 EHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP----------GCPTGYLG 382

Query: 252 CGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
            G  G  G  P C   A GY+DR L G +HLY  P  + L    ++              
Sbjct: 383 PGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSTVLYHTEVA-------------- 428

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVS-MGFGLLIIAI 362
               ++PEG+L TI++I+   +G+  G +L+++K  +     R   W   +G   +++  
Sbjct: 429 ----YDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILIRFTAWCCILGLISVVLTK 484

Query: 363 ILHFTNAIPINKQLYSFSYV 382
           +      IP+NK L+S SYV
Sbjct: 485 VSENEGFIPVNKNLWSLSYV 504


>gi|340727561|ref|XP_003402110.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Bombus terrestris]
          Length = 571

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 185/375 (49%), Gaps = 59/375 (15%)

Query: 24  GKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           GK S N +  +  L R + ++     +++ +     S R+ ++DAFRG+T++LMI V++ 
Sbjct: 149 GKLSPNNVYDD--LNRLQEENTTHPVIRVTKF----SSRIQSVDAFRGITILLMIFVNNG 202

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF----RTLK 139
           GG Y   +HS W G ++AD ++P+F +I+G++I ++ +   ++  +  KI      R+  
Sbjct: 203 GGKYVFFNHSAWFGLSVADLILPWFAWIMGMSITISKRAELRLTTSRVKITLCCLRRSAI 262

Query: 140 LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN--V 197
           L+  G++L    S +         +  +R+ G+LQ +++ Y V A++ET+  K      +
Sbjct: 263 LILLGLMLNSIDSKS---------LNDLRFPGVLQLLSVSYFVCAILETIFMKPHSQDIL 313

Query: 198 LEPRHLSIFTAY-----QWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKC 252
           L+    +IF        QW  + G +    + + T+ L++PN         G   Y    
Sbjct: 314 LQFGRFAIFRDILDSWPQWLIMAGIMT--THTLITFFLHIPNCPTGYFGPGGKYHY---- 367

Query: 253 GMRG-HLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAP 311
             RG ++     A GY+DR ++G NH YS            +S     LR D        
Sbjct: 368 --RGKYMNCTAGAAGYIDRLIFG-NHTYSR---------ITNSIYGQILRYD-------- 407

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG-FGLLIIAIILHFTN-- 368
             PEGL++TISAI    +G+H G +L+ +   +ARL  W     F  +I  I+ +F +  
Sbjct: 408 --PEGLMNTISAIFIVYLGVHAGKILLLYYQGNARLIRWFLWAVFTGIIAGILCNFESEG 465

Query: 369 -AIPINKQLYSFSYV 382
             IP++K++ + SYV
Sbjct: 466 GVIPVSKRMMTLSYV 480


>gi|91078976|ref|XP_974454.1| PREDICTED: similar to heparan-alpha-glucosaminide
           N-acetyltransferase [Tribolium castaneum]
 gi|270003686|gb|EFA00134.1| hypothetical protein TcasGA2_TC002950 [Tribolium castaneum]
          Length = 566

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 99/351 (28%), Positives = 169/351 (48%), Gaps = 60/351 (17%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
           +++   R+ ++D FRG  +++MI V+  GG Y    HS WNG T+AD V P+FL+++GV+
Sbjct: 175 IRKHPHRIKSIDVFRGFCIMIMIFVNYGGGKYWFFSHSVWNGLTVADLVFPWFLWLMGVS 234

Query: 116 IALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
            A++L+   +     ++++   ++  F  I+L    +   +  + G     +R+ G+LQR
Sbjct: 235 FAVSLQAKLRRAVPRRQLVIGVMRRSFILILLGIIINSNQNLQTIG----SLRFPGVLQR 290

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQW-QWIGGFIAFVIYIITTYSLYVPN 234
           I + Y +V ++E + TKR    +E        A  W QW+   +  VI+   T+   VP 
Sbjct: 291 IGVCYFIVGMLEIIFTKRSE--VESVSCIYDVAVAWPQWLCVTVLVVIHTCVTFLGDVPG 348

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPA----------CNA--VGYVDRELWGINHLYSDP 282
                            CG RG+LGP           C     GY+DR+++G  H++ +P
Sbjct: 349 -----------------CG-RGYLGPGGLDDNGRFYNCTGGVAGYIDRQVFG-EHMHKNP 389

Query: 283 VWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKG 342
           V  +L    +                   F+PEG+L T++++L+   G+  G  L  ++ 
Sbjct: 390 VCKKLYEIDVY------------------FDPEGILGTLTSVLTVYFGVQAGRTLNTYQN 431

Query: 343 HSARLKHWVSMG--FGLLIIAIILHFTN--AIPINKQLYSFSYVCFTAGAA 389
             A++  WV  G   GLL  A+     N   IP+NKQL+S S+    +G A
Sbjct: 432 VKAKVIRWVVWGSLAGLLGGALCEFKQNDGLIPLNKQLWSLSFALVLSGMA 482


>gi|270004236|gb|EFA00684.1| hypothetical protein TcasGA2_TC003561 [Tribolium castaneum]
          Length = 569

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 100/352 (28%), Positives = 159/352 (45%), Gaps = 65/352 (18%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +  KR+ +LD FRG+++V+MI V+   G Y  +DH+ WNG  LAD V P+F++I+G  + 
Sbjct: 175 EGKKRLKSLDTFRGISIVIMIFVNYGSGGYPVLDHATWNGLHLADLVFPWFMWIMGACMP 234

Query: 118 LALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           ++L    K   + K I      R++KL   G+ L  G             ++ +R  G+L
Sbjct: 235 ISLTSSFKKQISNKDIFLNVLKRSIKLFCLGVFLNAGPY-----------LECMRIFGVL 283

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QR  + Y+VV  I     KR  +  + +    FT     W G  +  +I+ +    L++ 
Sbjct: 284 QRFGICYLVVTTICLFLMKREFSESKHKIGKFFTDILVLWKGWIVVLIIFFVHCMFLFLL 343

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPA----------CN--AVGYVDRELWGINHLYSD 281
                  +D G          RG+LGP           C   A GY+D  + G NH Y  
Sbjct: 344 -------ADEGCP--------RGYLGPGGLHENGKHFNCTGGATGYIDAVILG-NHRYQK 387

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK 341
           P    +   T                    F+PEG+L  +++I+   IG+  G  L+ +K
Sbjct: 388 PTSKEIYLGT------------------QAFDPEGILGCLTSIVHVFIGVQAGITLLVYK 429

Query: 342 GHSARLKHWVSMG-FGLLIIAIILHFTNA---IPINKQLYSFSYVCFTAGAA 389
            HSARL  W+S      ++   +  F+     IP+NK L+S S+V  T+  A
Sbjct: 430 EHSARLIRWLSWSVLAGIVGGALCGFSKEDGLIPVNKNLWSISFVLVTSCFA 481


>gi|297491309|ref|XP_002698775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Bos
           taurus]
 gi|296472360|tpg|DAA14475.1| TPA: Heparan-alpha-glucosaminide N-acetyltransferase-like [Bos
           taurus]
          Length = 723

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 107/376 (28%), Positives = 168/376 (44%), Gaps = 53/376 (14%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDD 82
           + ++++  IN E G   S   D Q    +          R+  +D FRG+ ++LM+ V+ 
Sbjct: 293 NSRETDRLINSELG-SPSRASDPQPEAWRRSA----APLRLRCVDTFRGMALILMVFVNY 347

Query: 83  AGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRT 137
            GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R+
Sbjct: 348 GGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQ-RGCSKFRLLGKIAWRS 406

Query: 138 LKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN 196
             L+  GI ++   Y   P      +  +  R  G+LQR+   Y VVA++E L  K  P 
Sbjct: 407 FLLICIGIFVVNPKYCLGP------LSWEKARIPGVLQRLGATYFVVAVLELLFAKPVPE 460

Query: 197 VLEPRH----LSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKC 252
                     L   TA   QW+   I   +++  T+ L VP          G        
Sbjct: 461 TCASERSCFSLLDITASWPQWLFVLILEGVWLALTFFLPVPG------CPTGYLGPGGIG 514

Query: 253 GMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPF 312
               +      A GYVDR L G  HLY  P  + L    ++                  +
Sbjct: 515 DGGRYRNCTGGAAGYVDRLLLGDQHLYQHPSSAVLYHTEVA------------------Y 556

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILHFTN 368
           +PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ +A+     N
Sbjct: 557 DPEGILGTINSIVMAFLGVQAGKILLYYKDQTRGILIRFAAWGCL-LGLVSVALTKASEN 615

Query: 369 A--IPINKQLYSFSYV 382
              IP+NK L+S SYV
Sbjct: 616 EGFIPVNKNLWSISYV 631


>gi|281351504|gb|EFB27088.1| hypothetical protein PANDA_006846 [Ailuropoda melanoleuca]
          Length = 557

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 111/382 (29%), Positives = 177/382 (46%), Gaps = 61/382 (15%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQL-LQQKSKRVATLDAFRGLTVVLMILVD 81
           + ++++  IN E G           G+ QL+       S R+  +D FRG+ ++LM+ V+
Sbjct: 123 NSRETDRLINSELG--SPSRAGSIGGDAQLEAWHPPAASPRLRCVDTFRGIALILMVFVN 180

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFR 136
             GG Y    HS WNG T+AD V P+F+FI+G ++ L++  V +  G  K     KI +R
Sbjct: 181 YGGGRYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLSMTSVLQ-RGCSKFKLLGKIAWR 239

Query: 137 TLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           +  L+  G +I+   Y   P      +    +R  G+LQR+ + Y VVA++E +  K  P
Sbjct: 240 SFLLICIGVVIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELIFAKPVP 293

Query: 196 NVLEPRHLSIFT----AYQW-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIV 250
                     F+     + W QW+   +   I++  T+ L VP          G     +
Sbjct: 294 ESCASER-GCFSLRDIIFSWPQWLFILMLESIWLGLTFFLPVP----------GCPTGYL 342

Query: 251 KCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPS 306
             G  G  G  P C   A GY+DR L G +H+Y  P  + L    ++             
Sbjct: 343 GPGGIGDWGKYPNCTGGAAGYIDRLLLGDDHIYQHPSSAVLYHTEVA------------- 389

Query: 307 WCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAI 362
                ++PEG+L TI++I+   +GI  G +L+++K  +     R   W     GL+ +A+
Sbjct: 390 -----YDPEGILGTINSIVMAFLGIQAGKILLYYKDQTKDILIRFTAWCCF-LGLISVAL 443

Query: 363 ILHFTNA--IPINKQLYSFSYV 382
                N   IPINK L+S SYV
Sbjct: 444 TKVSENEGFIPINKNLWSISYV 465


>gi|194679266|ref|XP_588978.4| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Bos
           taurus]
          Length = 734

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 107/376 (28%), Positives = 168/376 (44%), Gaps = 53/376 (14%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDD 82
           + ++++  IN E G   S   D Q    +          R+  +D FRG+ ++LM+ V+ 
Sbjct: 304 NSRETDRLINSELG-SPSRASDPQPEAWRRSA----APLRLRCVDTFRGMALILMVFVNY 358

Query: 83  AGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRT 137
            GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R+
Sbjct: 359 GGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQ-RGCSKFRLLGKIAWRS 417

Query: 138 LKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN 196
             L+  GI ++   Y   P      +  +  R  G+LQR+   Y VVA++E L  K  P 
Sbjct: 418 FLLICIGIFVVNPKYCLGP------LSWEKARIPGVLQRLGATYFVVAVLELLFAKPVPE 471

Query: 197 VLEPRH----LSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKC 252
                     L   TA   QW+   I   +++  T+ L VP          G        
Sbjct: 472 TCASERSCFSLLDITASWPQWLFVLILEGVWLALTFFLPVPG------CPTGYLGPGGIG 525

Query: 253 GMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPF 312
               +      A GYVDR L G  HLY  P  + L    ++                  +
Sbjct: 526 DGGRYRNCTGGAAGYVDRLLLGDQHLYQHPSSAVLYHTEVA------------------Y 567

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILHFTN 368
           +PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ +A+     N
Sbjct: 568 DPEGILGTINSIVMAFLGVQAGKILLYYKDQTRGILIRFAAWGCL-LGLVSVALTKASEN 626

Query: 369 A--IPINKQLYSFSYV 382
              IP+NK L+S SYV
Sbjct: 627 EGFIPVNKNLWSISYV 642


>gi|340371415|ref|XP_003384241.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Amphimedon queenslandica]
          Length = 743

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 111/401 (27%), Positives = 183/401 (45%), Gaps = 84/401 (20%)

Query: 18  EQEQDDGKDSENGINKEKGLERSEVQDEQKGE-----LQLQQLLQQKSKRVATLDAFRGL 72
           E  +D G   +         E + + D+QK +     L    L  +K +R+ +LD FRG+
Sbjct: 310 EISKDLGDPDKVNYGALNNGETANLLDDQKEKATTDLLNEDPLSTRKKERLRSLDTFRGM 369

Query: 73  TVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK- 131
           ++++MI V+  GG Y   +HS WNG T+AD V P+F++I+GV+I  + K   K +  ++ 
Sbjct: 370 SLIIMIFVNYGGGGYWFFNHSIWNGITVADLVFPWFVWIMGVSIVYSFKGRKKDSFKLRL 429

Query: 132 -KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLT 190
            +++ R++ LL  G+ L  GY            + H R  G+LQR A+ Y VVA+ E L 
Sbjct: 430 YQVVRRSVILLGLGLFLNNGYR-----------LSHWRIPGVLQRFAIAYFVVAMTELLA 478

Query: 191 T------KRRPNVLEPRHLSIFTAYQW-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDH 243
                  K + +V+  R L+    + W QW+       +++I T+SL  P          
Sbjct: 479 PMVYNKYKLKWDVISVRDLT----HNWVQWLVIVFLESLWLIITFSLKAPG--------- 525

Query: 244 GVKKYIVKCGMRGHLGPACNA------------VGYVDRELWGINHLYSDPVWSRLEACT 291
                   C  RG+LGP   A             GY+D  +   NH+Y  P    +    
Sbjct: 526 --------CP-RGYLGPGGRADGGKYSNCTGGIAGYIDSWILTDNHIYGHPTCKAI---- 572

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV 351
                          +    ++PEG+L +I++I+    G+  G +LIH K   +R+  +V
Sbjct: 573 ---------------YHTGSYDPEGILGSINSIVMCFFGVQAGRILIHHKQFGSRIVRFV 617

Query: 352 SM-----GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
                  G G ++    L+    IP+NK L+S S++   AG
Sbjct: 618 VWGLLMGGLGTILCEATLN-KGVIPLNKNLWSLSFILVIAG 657


>gi|118378164|ref|XP_001022258.1| hypothetical protein TTHERM_00500990 [Tetrahymena thermophila]
 gi|89304025|gb|EAS02013.1| hypothetical protein TTHERM_00500990 [Tetrahymena thermophila
           SB210]
          Length = 827

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 109/406 (26%), Positives = 171/406 (42%), Gaps = 105/406 (25%)

Query: 14  TQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKS------------- 60
            ++ +QE +  K ++  IN ++ L+      EQ+  ++ + LLQ++              
Sbjct: 395 NKMRQQESESFKQAKVSINSDQNLQNQYENLEQEQGIKQKLLLQEEQPPQQIVIQKDIQQ 454

Query: 61  ------KRVATLDAFRGLTVVLMILVDDAGGAYA--RIDHSPWNGCTLADFVMPFFLFIV 112
                 +R+  LD +RGLT+V MILVD+ G +     +D + WNG + AD V P FLFI 
Sbjct: 455 PAAAPKQRLECLDIYRGLTMVGMILVDNMGNSSVIWPLDETEWNGLSTADCVFPSFLFIS 514

Query: 113 GVAIALALKKVPKINGAVKKIIFRTL----KLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           G+AI LA+K     NG  K+  FR L    KL   G+ L    ++           +  R
Sbjct: 515 GMAITLAIKH----NGNKKQQFFRILERFVKLFVIGVALNAACANYK---------QQFR 561

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQRIA+ Y V +                        +  Q++   +  +IYI   Y
Sbjct: 562 IMGVLQRIAICYFVTS----------------TSYLFLQNFAVQFVLNGVFLLIYIYFMY 605

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
              VP+                 CG   ++ P CN   Y+D +++ +N++          
Sbjct: 606 FFDVPD----------------GCGA-NNVTPTCNFGRYLDMQIFTLNYM---------- 638

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                                 P +PEGL +T+ A+++  IG+ YG  L  FK    RL 
Sbjct: 639 --------------------MKPSDPEGLFTTLGALVTTFIGLCYGLALQEFKSQKKRLS 678

Query: 349 H-WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCL 393
             W  M   L+ I  I  F    PINK+++S S+V F  G+  G  
Sbjct: 679 CIWFVMSLVLVFIGGICCFLT--PINKKVWSPSFV-FIVGSMSGAF 721


>gi|195447210|ref|XP_002071113.1| GK25317 [Drosophila willistoni]
 gi|194167198|gb|EDW82099.1| GK25317 [Drosophila willistoni]
          Length = 537

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 104/352 (29%), Positives = 160/352 (45%), Gaps = 65/352 (18%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           + KR+ +LD FRGL +VLMI V+  GG Y  IDH  WNG  LAD V P FL+I+GV I L
Sbjct: 186 QRKRLRSLDTFRGLAIVLMIFVNSGGGGYDSIDHVAWNGLHLADLVFPCFLWIMGVCIPL 245

Query: 119 ALKK-----VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           ++K        KI     +II+R+ KL   G+ L             G  ++ +R  G+L
Sbjct: 246 SIKSQLGRGTSKIQ-ICGRIIWRSFKLFAIGVCLNS---------INGPKLEQLRVMGVL 295

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QR  + ++VV L+ T+ ++R  + + P+     + Y      G  A ++ +I TY     
Sbjct: 296 QRFGVAFLVVGLLHTVCSRR--DHISPQQAWQRSIYDICIFSGEFAVLLALIATY----- 348

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGP------ACN------AVGYVDRELWGINHLYSD 281
                    +G+K   V    RG+LGP      A N      A GY+D+++ G  H+Y  
Sbjct: 349 -----LGLTYGLK---VPGCPRGYLGPGGKSNNAANPHCIGGAAGYIDQQVLGNAHIYQY 400

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK 341
           P    +   T                    F+PEGL   + +++   +G   G  L+   
Sbjct: 401 PTAKYVYDAT-------------------AFDPEGLFGCLLSVVHVLLGAFAGVTLLVHP 441

Query: 342 GHSARLKHW-VSMGFGLLIIAIILHFTNA---IPINKQLYSFSYVCFTAGAA 389
              +R+K W +      L+   +  F+     IP+NK L+S S+V  T   A
Sbjct: 442 TWQSRMKRWTILALLLGLMGGALCGFSKENGLIPVNKNLWSLSFVFVTVTVA 493


>gi|195399031|ref|XP_002058124.1| GJ15666 [Drosophila virilis]
 gi|194150548|gb|EDW66232.1| GJ15666 [Drosophila virilis]
          Length = 572

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 100/352 (28%), Positives = 157/352 (44%), Gaps = 73/352 (20%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           + KR+ +LD FRGL++VLMI V+  GG Y+ I+H+ WNG  LAD V P FL+I+GV I L
Sbjct: 180 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYSWIEHAAWNGLHLADLVFPSFLWIMGVCIPL 239

Query: 119 ALKKVPKINGAVKK------IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           ++K   ++   + K      I++R+ KL   G+ L           + G  ++ +R  G+
Sbjct: 240 SIKS--QLGRGISKSRICGRIVWRSCKLFAIGLCLNS---------TNGPQLEQLRLMGV 288

Query: 173 LQRIALVYVVVALIETLTTKR---RPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
           LQR  + ++VV L+ T+ ++R    P     R +     +  +          Y+  T+ 
Sbjct: 289 LQRFGIAFLVVGLLHTVCSRRDQLSPQRAWQRAIYDICLFSGELAVLLALIAAYLGLTFG 348

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGP------ACN------AVGYVDRELWGINH 277
           L VP                     RG+LGP      A N      A GY+DR++ G  H
Sbjct: 349 LPVPGCP------------------RGYLGPGGKHNNAANPNCIGGAAGYIDRQVLGNAH 390

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y  P    +   T                    F+PEG+   + +++   +G   G  L
Sbjct: 391 IYQHPTAKYVYDAT-------------------AFDPEGIFGCLLSVVQTLLGAFAGVTL 431

Query: 338 IHFKGHSARLKHW-VSMGFGLLIIAIILHFTN---AIPINKQLYSFSYVCFT 385
           +      ARLK W +      L+   +  FT     IP+NK L+S S+V  T
Sbjct: 432 LVHATWQARLKRWLLGATLLGLLGGALCGFTREQGVIPVNKNLWSLSFVFVT 483


>gi|225012704|ref|ZP_03703139.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
 gi|225003237|gb|EEG41212.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
          Length = 366

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 103/339 (30%), Positives = 148/339 (43%), Gaps = 82/339 (24%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           ++ K++R+  LD  RGLT++LMI+V+D G     YA   H+ WNG T  D+V P FLFIV
Sbjct: 1   METKNQRLLALDILRGLTIILMIVVNDPGSWSEVYAPFLHAEWNGLTPTDYVFPTFLFIV 60

Query: 113 GVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           GV+I L+L K  +         KK+++R LK+   GI L          L    + + IR
Sbjct: 61  GVSIVLSLSKQLEAGKTRSQIAKKVLWRALKIYLVGIFLW---------LWPSFNFEGIR 111

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
           W G+L RIALV++   LI   TTK+                  QW  G    + Y I   
Sbjct: 112 WVGVLPRIALVFLACGLIFLYTTKKT-----------------QWYLGIGILLGYWIMMA 154

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            + VP   F + S                  P  N   Y+D      + L    +W    
Sbjct: 155 YVPVPGIGFPDLSV-----------------PEKNWAHYLD------SFLIPGRLW---- 187

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                               +  ++PEG LST+ AI+SG IG+  G+VL+  +    RL 
Sbjct: 188 --------------------KYTWDPEGFLSTLPAIVSGLIGMWAGYVLMKKEELKTRLN 227

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
               +GF LL +   + +    P NK L+S S+  F  G
Sbjct: 228 QLFFIGFILLFLGDAMQWV--FPFNKNLWSSSFTLFMGG 264


>gi|345513910|ref|ZP_08793425.1| hypothetical protein BSEG_01940 [Bacteroides dorei 5_1_36/D4]
 gi|229435722|gb|EEO45799.1| hypothetical protein BSEG_01940 [Bacteroides dorei 5_1_36/D4]
          Length = 372

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 157/358 (43%), Gaps = 86/358 (24%)

Query: 46  QKGELQLQQLLQQK--SKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLA 101
           +K EL  +   Q     KR+ +LDA RG+TV  MILV++AGG  +YA + HS WNG T  
Sbjct: 2   KKEELNTETAQQAPPIKKRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPC 61

Query: 102 DFVMPFFLFIVGVAIALALKKVPKINGA---VKKIIFRTLKLLF--WGIILQGGYSHAPD 156
           D V PFFLFI+G++  ++L K    N +   V KI+ RT  +L   W I   G + H  +
Sbjct: 62  DLVFPFFLFIMGISTYISLNKF-NFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCE 117

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                +   H+R  G+LQRIAL Y V++        +                       
Sbjct: 118 G--DFLPFVHLRIPGVLQRIALCYCVISFTALFMNHK----------------------- 152

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
                         ++P  +F     + V    + C   G+     N +  +DR+L+G  
Sbjct: 153 --------------FIPALTFILLVSYTV----ILCMGNGYACDESNILSIIDRQLFGEA 194

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           HLY                             ++P +PEG +ST+SAI    IG  YG  
Sbjct: 195 HLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSYGKW 226

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
           +I       ++      GF L+ I  +L   +A+P+NK+++S ++V  T GAA   L 
Sbjct: 227 IIQSHQTENKVLRLFLTGFILISIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSLA 282


>gi|237711644|ref|ZP_04542125.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
 gi|423230937|ref|ZP_17217341.1| hypothetical protein HMPREF1063_03161 [Bacteroides dorei
           CL02T00C15]
 gi|423244648|ref|ZP_17225723.1| hypothetical protein HMPREF1064_01929 [Bacteroides dorei
           CL02T12C06]
 gi|229454339|gb|EEO60060.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
 gi|392630057|gb|EIY24059.1| hypothetical protein HMPREF1063_03161 [Bacteroides dorei
           CL02T00C15]
 gi|392641497|gb|EIY35273.1| hypothetical protein HMPREF1064_01929 [Bacteroides dorei
           CL02T12C06]
          Length = 372

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 157/358 (43%), Gaps = 86/358 (24%)

Query: 46  QKGELQLQQLLQQK--SKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLA 101
           +K EL  +   Q     KR+ +LDA RG+TV  MILV++AGG  +YA + HS WNG T  
Sbjct: 2   KKEELNTETAQQAPPIKKRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPC 61

Query: 102 DFVMPFFLFIVGVAIALALKKVPKINGA---VKKIIFRTLKLLF--WGIILQGGYSHAPD 156
           D V PFFLFI+G++  ++L K    N +   V KI+ RT  +L   W I   G + H  +
Sbjct: 62  DLVFPFFLFIMGISTYISLNKF-NFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCE 117

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                +   H+R  G+LQRIAL Y V++        +                       
Sbjct: 118 G--DFLPFVHLRIPGVLQRIALCYCVISFTALFMNHK----------------------- 152

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
                         ++P  +F     + V    + C   G+     N +  +DR+L+G  
Sbjct: 153 --------------FIPALTFILLVSYTV----ILCMGNGYACDESNILSIIDRQLFGEA 194

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           HLY                             ++P +PEG +ST+SAI    IG  YG  
Sbjct: 195 HLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSYGKW 226

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
           +I       ++      GF L+ I  +L   +A+P+NK+++S ++V  T GAA   L 
Sbjct: 227 IIQSHQTENKVLRLFLTGFILISIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSLA 282


>gi|374263976|ref|ZP_09622521.1| hypothetical protein LDG_8987 [Legionella drancourtii LLAP12]
 gi|363535543|gb|EHL28992.1| hypothetical protein LDG_8987 [Legionella drancourtii LLAP12]
          Length = 372

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 107/340 (31%), Positives = 155/340 (45%), Gaps = 78/340 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +++ SKR+ +LD FRGLT+ LM+LV+  G    Y  +DH+ WNGCTLAD V P FLFIVG
Sbjct: 1   MKEDSKRILSLDVFRGLTMALMVLVNSQGSRSIYPILDHAAWNGCTLADLVFPAFLFIVG 60

Query: 114 VAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMK--HIRWCG 171
           V   ++L +    N A +  I+++   +    IL   +    +A  +  D+   ++R  G
Sbjct: 61  VTTVVSLNRQVTTNEAARLDIYKS---ILKRSILLFLFGLFLNAFPFHFDLSFANLRIYG 117

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           ILQRIA+ Y + ALI   TT +   +L      I   Y W WI               + 
Sbjct: 118 ILQRIAICYFICALIYLNTTVKTQIIL---FWGILLGY-WYWIT-------------QIP 160

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
           VP +S                   G L  A N V YVD+ ++   HL+ +          
Sbjct: 161 VPGFS------------------GGQLSLANNWVAYVDKMIFSPVHLHKN---------- 192

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIH--FKGHSARLKH 349
                               F+PEGL+STISA+ +   G+  GH L+    K     L  
Sbjct: 193 --------------------FDPEGLISTISAVATTLAGLITGHFLLMQLSKKKKCLLMF 232

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            V M F +L  A   +F    PINK L++ S+V +T G +
Sbjct: 233 LVGMAFLVLGWAWGYYF----PINKNLWTSSFVLWTGGVS 268


>gi|345517324|ref|ZP_08796801.1| hypothetical protein BSFG_00542 [Bacteroides sp. 4_3_47FAA]
 gi|345457717|gb|EET14395.2| hypothetical protein BSFG_00542 [Bacteroides sp. 4_3_47FAA]
          Length = 365

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 105/365 (28%), Positives = 160/365 (43%), Gaps = 90/365 (24%)

Query: 37  LERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSP 94
           +++ E+  E       QQ L  K KR+ +LDA RG+TV  MILV++AGG  +YA + HS 
Sbjct: 1   MKKKELDTETA-----QQALPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSA 54

Query: 95  WNGCTLADFVMPFFLFIVGVAIALALKKVPKINGA---VKKIIFRTLKLLF--WGIILQG 149
           WNG T  D V PFFLFI+G++  ++L K    N +   V KI+ RT  +L   W I   G
Sbjct: 55  WNGLTPCDLVFPFFLFIMGISTYISLNKF-NFNVSLQVVTKILKRTFLILCIGWAI---G 110

Query: 150 GYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAY 209
            + H  +     +   H+R  G+LQRIAL Y V++        +                
Sbjct: 111 WFDHVCEG--DFLPFVHLRIPGVLQRIALCYCVISFTALFMNHK---------------- 152

Query: 210 QWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVD 269
                                ++P  +F     + V    + C   G+     N +  +D
Sbjct: 153 ---------------------FIPTLTFILLVSYTV----ILCMGNGYTCDESNILSIID 187

Query: 270 RELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTI 329
           R+L+G  HLY                             ++P +PEG +ST+SAI    I
Sbjct: 188 RQLFGEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCI 219

Query: 330 GIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           G   G  +I       ++      GF L+ I  +L   +A+P+NK+++S ++V  T GAA
Sbjct: 220 GFSCGKWIIQSHQTENKVLRLFLTGFILMSIGYLL--ADALPLNKRIWSPTFVLVTCGAA 277

Query: 390 DGCLG 394
              L 
Sbjct: 278 SMSLA 282


>gi|294777712|ref|ZP_06743163.1| putative membrane protein [Bacteroides vulgatus PC510]
 gi|319640295|ref|ZP_07995020.1| hypothetical protein HMPREF9011_00617 [Bacteroides sp. 3_1_40A]
 gi|294448780|gb|EFG17329.1| putative membrane protein [Bacteroides vulgatus PC510]
 gi|317388070|gb|EFV68924.1| hypothetical protein HMPREF9011_00617 [Bacteroides sp. 3_1_40A]
          Length = 372

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 105/365 (28%), Positives = 160/365 (43%), Gaps = 90/365 (24%)

Query: 37  LERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSP 94
           +++ E+  E       QQ L  K KR+ +LDA RG+TV  MILV++AGG  +YA + HS 
Sbjct: 1   MKKKELDTETA-----QQALPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSA 54

Query: 95  WNGCTLADFVMPFFLFIVGVAIALALKKVPKINGA---VKKIIFRTLKLLF--WGIILQG 149
           WNG T  D V PFFLFI+G++  ++L K    N +   V KI+ RT  +L   W I   G
Sbjct: 55  WNGLTPCDLVFPFFLFIMGISTYISLNKF-NFNVSLQVVTKILKRTFLILCIGWAI---G 110

Query: 150 GYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAY 209
            + H  +     +   H+R  G+LQRIAL Y V++        +                
Sbjct: 111 WFDHVCEG--DFLPFVHLRIPGVLQRIALCYCVISFTALFMNHK---------------- 152

Query: 210 QWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVD 269
                                ++P  +F     + V    + C   G+     N +  +D
Sbjct: 153 ---------------------FIPTLTFILLVSYTV----ILCMGNGYTCDESNILSIID 187

Query: 270 RELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTI 329
           R+L+G  HLY                             ++P +PEG +ST+SAI    I
Sbjct: 188 RQLFGEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCI 219

Query: 330 GIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           G   G  +I       ++      GF L+ I  +L   +A+P+NK+++S ++V  T GAA
Sbjct: 220 GFSCGKWIIQSHQTENKVLRLFLTGFILMSIGYLL--ADALPLNKRIWSPTFVLVTCGAA 277

Query: 390 DGCLG 394
              L 
Sbjct: 278 SMSLA 282


>gi|196012186|ref|XP_002115956.1| hypothetical protein TRIADDRAFT_59909 [Trichoplax adhaerens]
 gi|190581732|gb|EDV21808.1| hypothetical protein TRIADDRAFT_59909 [Trichoplax adhaerens]
          Length = 580

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 94/345 (27%), Positives = 151/345 (43%), Gaps = 66/345 (19%)

Query: 53  QQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIV 112
           ++ +   ++R+  +DAFRGL + +MI V+  GG Y     +PWNG T AD ++P+F+FIV
Sbjct: 239 EESIHPLAQRIYAVDAFRGLCITIMIFVNSGGGGYWYFRSTPWNGLTFADLILPWFIFIV 298

Query: 113 GVAIALAL--------KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDM 164
           G+ IAL+          ++P  + AV K++ R++ L   G+ L             GV++
Sbjct: 299 GICIALSFYNHRYITASRLPP-SSAVLKVLSRSVILFLIGLFLND-----------GVNL 346

Query: 165 KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYI 224
              R  G LQ++A+ Y+VV+L      K    +   R +         WIG      IY+
Sbjct: 347 STWRIPGNLQKVAISYIVVSLSVLYLAKPPDTITNLRAIREIVCIWKIWIGMIGLLSIYL 406

Query: 225 ITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVW 284
              ++L VP          G+           H      A GY+DR ++G NHL ++P  
Sbjct: 407 SLIFALPVPGCPTGYFGPGGLSD------DANHYNCTGGATGYIDRFIFG-NHLDANPSC 459

Query: 285 SRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS 344
             L    +                  PF+ EG LST+++IL+  +G+     +    G  
Sbjct: 460 KVLYRTHM------------------PFDSEGCLSTLTSILTCFMGLQVATGVALCGGKQ 501

Query: 345 ARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
                  +MGF              IP+N+ L+S SY+    G A
Sbjct: 502 -------NMGF--------------IPLNRNLWSLSYITLLGGLA 525


>gi|441501363|ref|ZP_20983482.1| N-acetylglucosamine related transporter, NagX [Fulvivirga
           imtechensis AK7]
 gi|441434899|gb|ELR68324.1| N-acetylglucosamine related transporter, NagX [Fulvivirga
           imtechensis AK7]
          Length = 368

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 105/340 (30%), Positives = 160/340 (47%), Gaps = 81/340 (23%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVG 113
             K+KR+ +LD FRG+T+  MI+V++ G     Y  + H+ W+GCTL D V PFFLFIVG
Sbjct: 1   MNKNKRLLSLDVFRGITIAAMIVVNNPGSWAAVYPPLLHAGWHGCTLTDLVFPFFLFIVG 60

Query: 114 VAIALALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGVDMKHIRW 169
           VA+ L+L +  +  G  K+IIF  LK    L   G+ L        +A  Y  D+ H+R 
Sbjct: 61  VAVCLSLSRAVEDKGRHKQIIFTVLKRSVILFLIGLFL--------NAFPY-FDLYHLRI 111

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRIA+V+ + A +  L T  +  V               +IG  I  V +++    
Sbjct: 112 PGVLQRIAVVFFICAFLY-LKTGWKVQV---------------YIGSAILMVYWLLF--- 152

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           L +P                +     G L    N   +VD +L                 
Sbjct: 153 LIIP----------------IPGAATGSLESGANLAAWVDSQL----------------- 179

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                  +G + E   +W     +PEG+LST+ AI++G IG+  G  L+  +    ++ +
Sbjct: 180 ------LTGHMWEVTKTW-----DPEGVLSTLPAIVTGIIGVLVGQWLMADRTEKEKVIY 228

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            + +   LLI+A  L +    PINK L++ SYV +TAG A
Sbjct: 229 -LFVAANLLIVA-GLFWDLFFPINKSLWTSSYVLYTAGIA 266


>gi|195133238|ref|XP_002011046.1| GI16326 [Drosophila mojavensis]
 gi|193907021|gb|EDW05888.1| GI16326 [Drosophila mojavensis]
          Length = 570

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 99/345 (28%), Positives = 158/345 (45%), Gaps = 59/345 (17%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           + +R+ +LD FRGL++VLMI V+  GG Y+ I+H+ WNG  LAD V P FL+I+GV I L
Sbjct: 178 QRQRMRSLDTFRGLSIVLMIFVNSGGGGYSWIEHAAWNGLHLADIVFPTFLWIMGVCIPL 237

Query: 119 ALK-------KVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
           ++K         P+I     +I++R  KL   G+ L           + G  ++ +R  G
Sbjct: 238 SIKAQLGRGISKPRI---CLRIVWRACKLFAIGLCLNS---------TNGPQLEQLRLMG 285

Query: 172 ILQRIALVYVVVALIETLTTKR---RPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
           +LQR  + Y+V  ++ T+ ++R    P     R +     +  +          Y+  T+
Sbjct: 286 VLQRFGIAYLVAGVLHTICSRRDYLSPQRAWQRAIYDICLFSGELAVLLALIAAYLGLTF 345

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLG-PAC--NAVGYVDRELWGINHLYSDPVWS 285
            L VP            + Y+   G   +   P C   A GYVDR + G  H+Y  P   
Sbjct: 346 GLRVPGCP---------RGYLGPGGKHNNAADPNCIGGAAGYVDRLILGNAHIYQHP--- 393

Query: 286 RLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA 345
                      +     DA +     F+PEG+   + +I+   +G   G  L+      A
Sbjct: 394 -----------TAKFVYDASA-----FDPEGVFGCLLSIVQAMLGCFAGVTLLVHVTWQA 437

Query: 346 RLKHWVSMGFGLLII--AIILHFTNA---IPINKQLYSFSYVCFT 385
           RL+ W+ +G  LL +    +  F+     IP+NK L+S S+V  T
Sbjct: 438 RLRRWL-LGATLLGVLGGALCGFSKEHGIIPVNKNLWSLSFVFVT 481


>gi|320168011|gb|EFW44910.1| hgsnat protein [Capsaspora owczarzaki ATCC 30864]
          Length = 800

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 100/343 (29%), Positives = 159/343 (46%), Gaps = 57/343 (16%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           ++  RV +LD FRG+ + +M+ V+  GG Y   DHS WNG T+AD V P+F+F++GV+++
Sbjct: 415 EERVRVRSLDTFRGIALSIMLFVNYGGGGYWFFDHSTWNGLTVADLVFPWFIFMMGVSMS 474

Query: 118 LALKKVPKINGAVK-----KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           L+ +K+ +  GA +     K+I R++ L   G+ L          +   +     R  G+
Sbjct: 475 LSFEKLRR-RGAPRGALFLKVIRRSMTLFALGLFL----------VCRQIIFATWRMPGV 523

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQR A+ Y+ VA I             P      T++  QW+   I   I+   T+   V
Sbjct: 524 LQRFAVSYLFVAAIVMFVPIF---ATLPGPFRDLTSHWLQWVVIGIFITIHTCITFLYDV 580

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACN----AVGYVDRELWGINHLYSDPVWSRLE 288
           P          G     +  G  G  G   N    A GY+D +++G  H+Y  P      
Sbjct: 581 P----------GCGTGYIGPGGIGDFGQYMNCTGGAAGYIDSQVFG-RHIYQAPTAQAY- 628

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                             +    ++PEGLL  +++++   +G   G +L+ F  HSARL+
Sbjct: 629 ------------------YLTGAYDPEGLLGCLTSVVITFLGYQAGRILVTFSTHSARLR 670

Query: 349 HWVSMGFGLLIIAIILHFTNA----IPINKQLYSFSYVCFTAG 387
            W + G GL ++ +IL         IPINK L+S S+V   AG
Sbjct: 671 RWAAWGVGLGLLGLILCKGTQNDGWIPINKNLWSLSFVLIMAG 713


>gi|328869407|gb|EGG17785.1| transmembrane protein [Dictyostelium fasciculatum]
          Length = 651

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 105/404 (25%), Positives = 164/404 (40%), Gaps = 98/404 (24%)

Query: 49  ELQLQQLL--QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMP 106
           E QLQ L   Q K  R+ +LD FRGL++ +MI V+  GG Y   +HS WNG T+AD V P
Sbjct: 211 EFQLQNLESNQPKKDRLKSLDVFRGLSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFP 270

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIF-----RTLKLLFWGIILQGGYSHAPDALSYG 161
           +F+FI+G+A+ L+   + +I G  K+ IF     R++ L   G+ +  G           
Sbjct: 271 WFIFIMGIAMPLSFNAM-EIRGVPKRTIFIKLVRRSVILFSLGLFINNGN---------- 319

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVL---------------------EP 200
            ++ H R  G+LQR  + Y V   I       RPN                       EP
Sbjct: 320 -NLGHWRILGVLQRFGVSYFVTGCIMMFVPLYRPNGGGGGNSHHQYNRFDGTGNDREREP 378

Query: 201 RH---------------------LSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSE 239
                                  L+ F  +  QW+   +   ++ + T+ L VP      
Sbjct: 379 SESDPLFQSSSIQEKFKAHSASMLADFIPFWLQWLFALLILAVWFLVTFLLPVP------ 432

Query: 240 HSDHGVKKYIVKCGMRGHLGPACNAVG----YVDRELWGINHLYSDPVWSRLEACTLSSP 295
               G     +  G  G  G   N  G     VD  ++  NH++  P       C     
Sbjct: 433 ----GCPTGYLGPGGLGDQGQHVNCTGGAAKIVDLHIFSNNHIFQTP------TC----- 477

Query: 296 NSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF 355
                    P +    ++PEG L  ++++    +G+H G  ++ +K + +RL  W  +  
Sbjct: 478 --------QPIYNTGAYDPEGTLGYLTSVFMCFLGVHAGRTIMTYKSNRSRLIRWTILSI 529

Query: 356 GLLIIAIILHFTNA----IPINKQLYSFSYVCFTAGAADGCLGI 395
            L  IA  L   +     IPINK L++ S++   +G     L I
Sbjct: 530 LLCGIAAGLCGVSQNGGWIPINKNLWTPSFIFLLSGFGFFVLAI 573


>gi|434403337|ref|YP_007146222.1| hypothetical protein Cylst_1247 [Cylindrospermum stagnale PCC 7417]
 gi|428257592|gb|AFZ23542.1| hypothetical protein Cylst_1247 [Cylindrospermum stagnale PCC 7417]
          Length = 375

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 104/338 (30%), Positives = 155/338 (45%), Gaps = 78/338 (23%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+T+  MILV+ AG A   Y  +DH+ WNGCT  D V PFFLFIVGVA+  
Sbjct: 2   RLTSLDVFRGMTIAAMILVNMAGVADEIYPLLDHAKWNGCTPTDLVFPFFLFIVGVAMTF 61

Query: 119 ALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           +L K    N   K +  R L+    L   G++L G ++       +  D+ +IR+ G+LQ
Sbjct: 62  SLSKYTAANKPTKAVYLRILRRAAILFALGLLLNGFWNKG----VWTFDLSNIRFMGVLQ 117

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RI+L Y++ +L   L   R+                 QWI   +  V Y +T   +YVP 
Sbjct: 118 RISLTYLLASL-AVLQLPRKG----------------QWILAVVLLVGYWLTM--MYVP- 157

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
                  D+G           G L    N   ++DR +    HLY    ++ L       
Sbjct: 158 -----VPDYGA----------GVLTREGNFGAFIDRLIIPKAHLYKGDGFNLLG------ 196

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
                             +PEGL STI A+++   G   G  +      S  +K   S+G
Sbjct: 197 ------------------DPEGLFSTIPAVVNVLAGYFAGEWI-----RSQPVKSRTSLG 233

Query: 355 FGLLIIAIIL---HFTNAIPINKQLYSFSYVCFTAGAA 389
             L+ +A ++    +    PINK++++ SYV FT+G A
Sbjct: 234 LVLVGVACLVIGWGWGWIFPINKKIWTSSYVVFTSGWA 271


>gi|195041852|ref|XP_001991329.1| GH12115 [Drosophila grimshawi]
 gi|193901087|gb|EDV99953.1| GH12115 [Drosophila grimshawi]
          Length = 573

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 98/340 (28%), Positives = 155/340 (45%), Gaps = 49/340 (14%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           + KR+ +LD FRGL +VLMI V+  GG Y+ I+H+ WNG  LAD V P FL+I+GV I L
Sbjct: 181 QRKRLRSLDTFRGLCIVLMIFVNSGGGGYSWIEHAAWNGLHLADIVFPSFLWIMGVCIPL 240

Query: 119 ALK-KVPKINGAVK---KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           ++K ++ +     +   +I++R  KL   G+ L             G  ++ +R  G+LQ
Sbjct: 241 SIKAQLARGTSKTRICLRIVWRACKLFAIGLCLNS---------VNGPQLEQLRLMGVLQ 291

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY-----S 229
           R  + Y++VA++ T+ ++R  + L P+       Y      G  A ++ +I TY      
Sbjct: 292 RFGIAYLLVAILHTVCSRR--DQLSPQRAWQRAIYDICLFSGEFAVLLALIATYLGLTFG 349

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           L VP          G      K     H      A GY+D  + G  H+Y  P       
Sbjct: 350 LRVPGCPVGYLGPGG------KHNNAAHPNCIGGAAGYIDLLVLGNAHIYQHP------- 396

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                  +     DA +     F+PEG+   + +++   +G   G  L+       RLK 
Sbjct: 397 -------TAKYVYDATA-----FDPEGIFGCLLSVVQTLLGAFAGVTLLVHSTWQGRLKR 444

Query: 350 WVSMG-FGLLIIAIILHFTNA---IPINKQLYSFSYVCFT 385
           W+ +     L+   +  F+     IPINK L+S S+V  T
Sbjct: 445 WLLVATVLGLLGGALCGFSKENGIIPINKNLWSLSFVFVT 484


>gi|423241434|ref|ZP_17222547.1| hypothetical protein HMPREF1065_03170 [Bacteroides dorei
           CL03T12C01]
 gi|392641327|gb|EIY35104.1| hypothetical protein HMPREF1065_03170 [Bacteroides dorei
           CL03T12C01]
          Length = 372

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 103/358 (28%), Positives = 156/358 (43%), Gaps = 86/358 (24%)

Query: 46  QKGELQLQQLLQQK--SKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLA 101
           +K EL  +   Q     KR+ +LDA RG+TV  MILV++AGG  +YA + HS WNG T  
Sbjct: 2   KKEELNTETAQQAPPIKKRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPC 61

Query: 102 DFVMPFFLFIVGVAIALALKKVPKINGA---VKKIIFRTLKLLF--WGIILQGGYSHAPD 156
           D V PFFLFI+G++  ++L K    N +   V KI+ RT  +L   W I   G + H  +
Sbjct: 62  DLVFPFFLFIMGISTYISLNKF-NFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCE 117

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                +   H+R  G+LQRIAL Y V++        +                       
Sbjct: 118 G--DFLPFVHLRIPGVLQRIALCYCVISFTALFMNHK----------------------- 152

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
                         ++P  +F     + V    + C   G+     N +  +DR+L+G  
Sbjct: 153 --------------FIPALTFILLVSYTV----ILCMGNGYACDESNILSIIDRQLFGEA 194

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           HLY                             ++P +PEG +ST+SAI    IG   G  
Sbjct: 195 HLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSCGKW 226

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
           +I       ++      GF L+ I  +L   +A+P+NK+++S ++V  T GAA   L 
Sbjct: 227 IIQSHQTENKVLRLFLTGFILISIGYLL--ADALPLNKRIWSPTFVLVTCGAASMSLA 282


>gi|344281343|ref|XP_003412439.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Loxodonta africana]
          Length = 782

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 173/391 (44%), Gaps = 69/391 (17%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQ-LLQQKSKRVATLDAFRGLTVVLMILVD 81
           + ++++  IN E G       D   G++QL+         R+  +D FRGL +++M+ V+
Sbjct: 338 NSRETDRLINSELG--SPSRADPLGGDIQLEAWRPSAPPSRLRCVDTFRGLALIIMVFVN 395

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIV----------GVAIALALKKVPKINGAVK 131
             GG Y    H+ WNG T+AD V P FL I+           + I L++  + +  G  K
Sbjct: 396 YGGGKYWYFKHASWNGLTVADLVFPCFLEILFGEDLLCTRDPLEIFLSMTSILQ-RGCSK 454

Query: 132 -----KIIFRTLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVAL 185
                KI +R+  L+  G +I+   Y   P      +    +R  G+LQR+ + Y VVA+
Sbjct: 455 FKLLGKIAWRSFLLICIGVVIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAV 508

Query: 186 IETLTTKRRP-NVLEPR---HLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHS 241
           +E L  K  P N    R    L   TA   QW+       I++  T+ L VP        
Sbjct: 509 LELLFAKPVPENCASERSCFSLRDLTASWPQWLFILTLESIWLTLTFFLPVP-------- 560

Query: 242 DHGVKKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNS 297
             G     +  G  G  G  P C   A GY+DR L G  HLY  P  + L    ++    
Sbjct: 561 --GCPTGYLGPGGIGDWGKYPNCTGGAAGYMDRVLLGDEHLYQHPSSAVLYHTEMA---- 614

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSM 353
                         ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +
Sbjct: 615 --------------YDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDIVIRFTAWCCI 660

Query: 354 GFGLLIIAIILHFTNA--IPINKQLYSFSYV 382
             GL   A+     N   IP+NK L+S SYV
Sbjct: 661 -LGLTSAALTKVSENEGFIPVNKNLWSISYV 690


>gi|212695334|ref|ZP_03303462.1| hypothetical protein BACDOR_04881, partial [Bacteroides dorei DSM
           17855]
 gi|212662113|gb|EEB22687.1| hypothetical protein BACDOR_04881, partial [Bacteroides dorei DSM
           17855]
          Length = 284

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 157/358 (43%), Gaps = 86/358 (24%)

Query: 46  QKGELQLQQLLQQK--SKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLA 101
           +K EL  +   Q     KR+ +LDA RG+TV  MILV++AGG  +YA + HS WNG T  
Sbjct: 2   KKEELNTETAQQAPPIKKRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPC 61

Query: 102 DFVMPFFLFIVGVAIALALKKVPKIN---GAVKKIIFRTLKLLF--WGIILQGGYSHAPD 156
           D V PFFLFI+G++  ++L K    N     V KI+ RT  +L   W I   G + H  +
Sbjct: 62  DLVFPFFLFIMGISTYISLNKF-NFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCE 117

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                +   H+R  G+LQRIAL Y V++                   ++F  +++     
Sbjct: 118 G--DFLPFVHLRIPGVLQRIALCYCVISFT-----------------ALFMNHKFIPALT 158

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
           FI  V Y                         ++ C   G+     N +  +DR+L+G  
Sbjct: 159 FILLVSYT------------------------VILCMGNGYACDESNILSIIDRQLFGEA 194

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           HLY                             ++P +PEG +ST+SAI    IG   G  
Sbjct: 195 HLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSCGKW 226

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
           +I       ++      GF L+ I  +L   + +P+NK+++S ++V  T GAA   L 
Sbjct: 227 IIQSHQTENKVLRLFLTGFILISIGYLL--ADVLPLNKRIWSPTFVLVTCGAASMSLA 282


>gi|327275365|ref|XP_003222444.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Anolis carolinensis]
          Length = 632

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 102/340 (30%), Positives = 159/340 (46%), Gaps = 54/340 (15%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 121
           R+ +LD FRGL +++M+ V+  GG Y    H  WNG T+AD V P+F+FI+G +I+L+L 
Sbjct: 236 RLRSLDTFRGLALIIMVFVNYGGGKYWFFKHQSWNGLTVADLVFPWFVFIMGTSISLSLS 295

Query: 122 KVPKINGAVK-----KIIFRTLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
            + +  G  K     KI++R+L L   G II+   Y   P      +  +++R  G+LQR
Sbjct: 296 SMLR-RGCSKWKLLGKILWRSLLLFLIGVIIVNPNYCLGP------LSWENLRIPGVLQR 348

Query: 176 IALVYVVVALIETLTTKRRPN----VLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           ++  Y VVA++E L  K  P+     +    L     Y  QW+       +++  T+ L 
Sbjct: 349 LSCTYFVVAVLELLFAKPVPDNSTLEIPCPALRDILPYWPQWLFMMALETVWLCLTFLLN 408

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRL 287
           VP          G     +  G  G  G  P C   A  Y+D  L G  H+Y  P  + L
Sbjct: 409 VP----------GCPNGYLGPGGIGDFGNYPNCTGGAAAYIDHVLLGEKHIYQHPSSNVL 458

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS--- 344
              T++                  F+PEG+L TI++++   +G+  G +L+ +K      
Sbjct: 459 YQTTVA------------------FDPEGILGTINSVIMAFLGLQAGKILLFYKDQHRQI 500

Query: 345 -ARLKHW-VSMGFGLLIIAIILHFTNAIPINKQLYSFSYV 382
             R   W V MG    ++         IP+NK L+S SYV
Sbjct: 501 MLRFCIWSVVMGVISAVLTECSKDEGFIPVNKNLWSISYV 540


>gi|116789271|gb|ABK25182.1| unknown [Picea sitchensis]
          Length = 124

 Score =  114 bits (285), Expect = 1e-22,   Method: Composition-based stats.
 Identities = 54/89 (60%), Positives = 65/89 (73%)

Query: 38  ERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNG 97
           E S V   Q G+    +   +   RVATLD FRGLT+ +MILVDDAGG + +I+HSPWNG
Sbjct: 16  ESSNVIVIQDGQTIPAKPTNETKTRVATLDVFRGLTIAVMILVDDAGGKWPQINHSPWNG 75

Query: 98  CTLADFVMPFFLFIVGVAIALALKKVPKI 126
           CTLADFVMPFFLFIVGVA+AL  K V ++
Sbjct: 76  CTLADFVMPFFLFIVGVAVALTFKVVQQL 104


>gi|298491757|ref|YP_003721934.1| hypothetical protein Aazo_3034 ['Nostoc azollae' 0708]
 gi|298233675|gb|ADI64811.1| Protein of unknown function DUF2261, transmembrane ['Nostoc
           azollae' 0708]
          Length = 376

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 100/331 (30%), Positives = 149/331 (45%), Gaps = 63/331 (19%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+T+  MILV+  G A   Y+ +DH+ WNGCT  D V PFFLFIVGVA+  
Sbjct: 2   RLTSLDVFRGITIAGMILVNMVGVADHKYSLLDHAEWNGCTPTDLVFPFFLFIVGVAMTF 61

Query: 119 ALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIAL 178
           +L K    N   K +  R L+      +L    +   +   +  D+  IR+ G+LQRI+L
Sbjct: 62  SLSKYTADNKPTKAVYLRILRRAAILFLLGLLLNGFWNKGVWTFDLSSIRFMGVLQRISL 121

Query: 179 VYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS 238
            Y+  +LI      +                  QW+   +  + Y +T   +YVP     
Sbjct: 122 SYLFASLIVLKVPGKN-----------------QWVLAGVLLIGYWLTM--MYVP----- 157

Query: 239 EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSG 298
              D+G           G L    N  G++DR +    HLY    ++ L           
Sbjct: 158 -VPDYGA----------GVLTREGNFGGFIDRLIIPKAHLYKGDGFNYLG---------- 196

Query: 299 PLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLL 358
                         +PEGL STI AI+S  +G ++  + I  + H         + FGL 
Sbjct: 197 --------------DPEGLYSTIPAIVSVLVG-YFAGIRIKERKHLNSQTSMDFVLFGLC 241

Query: 359 IIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            + + + +  A PINK+L++ SYV FT G A
Sbjct: 242 CLVVSIIWDVAFPINKKLWTSSYVVFTTGWA 272


>gi|260816362|ref|XP_002602940.1| hypothetical protein BRAFLDRAFT_251788 [Branchiostoma floridae]
 gi|229288254|gb|EEN58952.1| hypothetical protein BRAFLDRAFT_251788 [Branchiostoma floridae]
          Length = 512

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 104/388 (26%), Positives = 178/388 (45%), Gaps = 76/388 (19%)

Query: 25  KDSENGINKEKGLER--SEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDD 82
           ++++  +N + G     S   ++Q       Q   Q  +R+ +LD FRGL++ +M+ V+ 
Sbjct: 81  RETDRLVNSDLGAPSPSSTATEDQTVANSSTQPASQGIRRLRSLDTFRGLSLAVMVFVNY 140

Query: 83  AGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRT 137
            GG Y    H+ WNG T+AD V P+F+FI+G +IAL+ +++ K  G  +     K+I RT
Sbjct: 141 GGGGYWFFKHARWNGLTVADLVFPWFVFIMGTSIALSFRRLLK-KGVSRLSLLWKVIQRT 199

Query: 138 LKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN 196
           + L   G+ I+     H             +R  G+LQR+AL Y +VAL+E+        
Sbjct: 200 VILFLLGLFIINTKKGHN--------SWSTLRIPGVLQRLALTYFIVALMESW------- 244

Query: 197 VLEPR-HLSIF------------TAYQW-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSD 242
             +PR +LS++                W QW+   +   ++++  + L VPN        
Sbjct: 245 --KPRGYLSLYLLQTSRIAPIRDIVNSWGQWLFMIVVVTLHLVLMFWLQVPNCPIGYLGP 302

Query: 243 HGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRE 302
            G+           H      A GY+DR ++  +H+Y  P           +P +    E
Sbjct: 303 GGLSDI-------AHYNCTGGAAGYIDRAVFTDDHIYQHP-----------TPITVYETE 344

Query: 303 DAPSWCRAPFEPEGLLSTISAILSGTIGIH----YGHVLIHFKGHSARLKHWVSMGFGLL 358
                   PFEPEGLL T+++ L   +G+     Y ++++++   + R     ++ F L 
Sbjct: 345 -------VPFEPEGLLGTLTSALLCFLGLQVKNMYMYIIMNYCVFTNRSP---AICFCLG 394

Query: 359 IIAIIL----HFTNAIPINKQLYSFSYV 382
            I  +L         IP+NK L+S S+V
Sbjct: 395 AIGGLLCNGRQNEGWIPLNKNLWSLSFV 422


>gi|427385206|ref|ZP_18881711.1| hypothetical protein HMPREF9447_02744 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727374|gb|EKU90234.1| hypothetical protein HMPREF9447_02744 [Bacteroides oleiciplenus YIT
           12058]
          Length = 368

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 97/339 (28%), Positives = 149/339 (43%), Gaps = 82/339 (24%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            Q  KR+ +LD  RG+TVV MILV+++GG  +Y  + HS WNG TL D V PFFLFI+G+
Sbjct: 11  SQPKKRLLSLDVLRGITVVGMILVNNSGGKLSYESLQHSAWNGLTLCDLVFPFFLFIMGI 70

Query: 115 AIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG--VDMKHIRWC 170
           + ++AL K         V+KI+ RTL +L  G ++     H  D +  G      H+R  
Sbjct: 71  STSIALSKFHFQASGSVVRKILKRTLIILCIGWVI-----HWFDFICDGDFSPFAHLRLT 125

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+L RIAL Y V + +      +                   +IG  I  ++        
Sbjct: 126 GVLPRIALCYCVASFVALYVNHK-------------------YIGWLIGILL-------- 158

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
               ++F            + C   G+   + N +  +DR + G +HLY           
Sbjct: 159 --AGYTF------------LLCIGNGYAPDSTNLLAIIDRNVLGADHLYH---------- 194

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                             ++P +PEGL ST SAI    IG   G +++  K    +    
Sbjct: 195 ------------------KSPIDPEGLTSTFSAIAHTLIGFCCGKLILAKKNLEQKTLKL 236

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
             +GF L+     L    A+P+NK+++S ++V  T G A
Sbjct: 237 FVVGFILMACGFCL--MEALPLNKRIWSPTFVLVTCGLA 273


>gi|426359530|ref|XP_004047024.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Gorilla
           gorilla gorilla]
          Length = 635

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 175/379 (46%), Gaps = 59/379 (15%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQ-LLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           ++++  IN E G       D   G++Q     L     R+ ++D FRG+ ++LM+ V+  
Sbjct: 203 RETDRLINSELG--SPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMVFVNYG 260

Query: 84  GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRTL 138
           GG Y    H+ WNG      ++  F+FI+G +I L++  + +  G  K     KI +R+ 
Sbjct: 261 GGKYWYFKHASWNGAEGCIEMIEMFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKIAWRSF 319

Query: 139 KLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN- 196
            L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P  
Sbjct: 320 LLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEH 373

Query: 197 -VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCG 253
              E   LS+   T+   QW+   +   +++  T+ L VP          G     +  G
Sbjct: 374 CASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP----------GCPTGYLGPG 423

Query: 254 MRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCR 309
             G  G  P C   A GY+DR L G +HLY  P  + L    ++                
Sbjct: 424 GIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEVA---------------- 467

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILH 365
             ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ +A+   
Sbjct: 468 --YDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLISVALTKV 524

Query: 366 FTNA--IPINKQLYSFSYV 382
             N   IP+NK L+S SYV
Sbjct: 525 SENEGFIPVNKNLWSLSYV 543


>gi|323447301|gb|EGB03229.1| hypothetical protein AURANDRAFT_68196 [Aureococcus anophagefferens]
          Length = 399

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 62/151 (41%), Positives = 86/151 (56%), Gaps = 8/151 (5%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           ++  RV +LD  RG  V+LMI VDDAG AYA +DHSPW+G T+AD VMPFF+F+VGV+ A
Sbjct: 10  RRPPRVRSLDVVRGFAVLLMIFVDDAGSAYAVLDHSPWDGLTIADVVMPFFIFMVGVSAA 69

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSY--GVDMKHIRWCGILQR 175
           LAL     +   +++          W + +       PD  +Y  G D+  +RWCGILQR
Sbjct: 70  LALGGKRTLAPVLRR------GATLWVVGVAVQGGGLPDPTTYAWGYDLGTVRWCGILQR 123

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIF 206
           IA  YVV + +   + + R   L      +F
Sbjct: 124 IAACYVVASALVLCSPRARRGGLATADDGVF 154


>gi|156401294|ref|XP_001639226.1| predicted protein [Nematostella vectensis]
 gi|156226353|gb|EDO47163.1| predicted protein [Nematostella vectensis]
          Length = 387

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 153/345 (44%), Gaps = 60/345 (17%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 121
           R+ +LD FRG+++ +MI V+  GG Y    HS WNG T+AD V P+F++I+GV++ L+ +
Sbjct: 1   RLKSLDTFRGISLTVMIFVNFGGGGYYFFAHSIWNGLTVADLVFPWFMWIMGVSMVLSFR 60

Query: 122 KVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
            + +   +  +II     RTL L   G+                 ++ + R  G+LQR A
Sbjct: 61  VLRRKQISTYRIIIKITKRTLLLFALGLFTSN-------------NLTNYRIPGVLQRFA 107

Query: 178 LVYVVVALIETLTTKRRPNVLEPR-----HLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
             Y VVA+I+ L      +  +PR      +    +   QW+  F   +IY++ TY+  +
Sbjct: 108 ACYFVVAVIQVLAGPSVEDS-QPRGSWWDGIRDVVSLWAQWLLMFAFLIIYVVVTYATEL 166

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVG----YVDRELWGINHLYSDPVWSRLE 288
                     HG  +     G       A N  G    +VD  L G  H+Y    +  + 
Sbjct: 167 ----------HGCPRGYTGPGGISDNSSAFNCTGGMASHVDSWLLG-KHVYQRGTFKDMY 215

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
             T++                   +PEG++ T+++I    +G+  GH L  F  H  RL 
Sbjct: 216 RTTVAH------------------DPEGVMGTLTSIFIVFLGVQAGHTLFTFSHHRQRLV 257

Query: 349 HWVSMGFGLLIIAIIL----HFTNAIPINKQLYSFSYVCFTAGAA 389
            W      L +IAI L         IPINK L+S S+V  T   A
Sbjct: 258 RWFVWAVLLGVIAIGLSGGTQNDGVIPINKNLWSISFVLATGSMA 302


>gi|265753064|ref|ZP_06088633.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|263236250|gb|EEZ21745.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
          Length = 372

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 102/358 (28%), Positives = 155/358 (43%), Gaps = 86/358 (24%)

Query: 46  QKGELQLQQLLQQK--SKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLA 101
           +K EL  +   Q     KR+ +LDA RG+TV  MILV++AGG  +YA + HS WNG T  
Sbjct: 2   KKEELNTETAQQAPPIKKRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPC 61

Query: 102 DFVMPFFLFIVGVAIALALKKVPKINGA---VKKIIFRTLKLLF--WGIILQGGYSHAPD 156
           D V PFFLFI+G++  ++L K    N +   V KI+ RT  +L   W I   G + H  +
Sbjct: 62  DLVFPFFLFIMGISTYISLNKF-NFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCE 117

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                +   H+R  G+LQRIAL Y V++        +                       
Sbjct: 118 G--DFLPFVHLRIPGVLQRIALCYCVISFTALFMNHK----------------------- 152

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
                         ++P  +F     + V    + C   G+     N +  +DR+L+G  
Sbjct: 153 --------------FIPALTFILLVSYTV----ILCMGNGYACDESNILSIIDRQLFGEA 194

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           HLY                             ++P +PEG +ST+SAI    IG   G  
Sbjct: 195 HLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSCGKW 226

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
           +I       ++      GF L+ I  +L   + +P+NK+++S ++V  T GAA   L 
Sbjct: 227 IIQSHQTENKVLRLFLTGFILISIGYLL--ADVLPLNKRIWSPTFVLVTCGAASMSLA 282


>gi|392968994|ref|ZP_10334410.1| Protein of unknown function DUF2261, transmembrane [Fibrisoma limi
           BUZ 3]
 gi|387843356|emb|CCH56464.1| Protein of unknown function DUF2261, transmembrane [Fibrisoma limi
           BUZ 3]
          Length = 390

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 106/335 (31%), Positives = 142/335 (42%), Gaps = 77/335 (22%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
             +R+ +LDAFRGLTV  MILV++AG     YA + H+ WNG T  D + PFFLFIVGV+
Sbjct: 18  NPQRLLSLDAFRGLTVAGMILVNNAGDWQYVYAPLKHAAWNGWTPTDLIFPFFLFIVGVS 77

Query: 116 IALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
           I  AL    +    + KI+ R+  L   G+ L         A     D+  +R  G+LQR
Sbjct: 78  ITFALAGGQEHTNVIGKILKRSFTLFMLGLFL---------AFFPKFDITTVRIPGVLQR 128

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW 235
           IALVY+  +LI   TT R+   L    L                 V Y +    + VP  
Sbjct: 129 IALVYLACSLIYLRTTTRQQTWLLAALL-----------------VGYWLVMTVVPVPGV 171

Query: 236 SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLY-SDPVWSRLEACTLSS 294
            ++                  +L P  N   ++DR +   +HLY S  VW          
Sbjct: 172 GYA------------------NLEPTTNLAAWLDRTVLTTDHLYRSTKVW---------- 203

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
                             +PEGLLSTI AI +G  G+  G  L       A    W+   
Sbjct: 204 ------------------DPEGLLSTIPAIGTGLAGVLVGTWLRRRDVADADKIAWL-FT 244

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            G L     L +    PINK L++ SYV   AG A
Sbjct: 245 VGCLTTLGGLIWDAWFPINKALWTSSYVLLAAGLA 279


>gi|423226736|ref|ZP_17213201.1| hypothetical protein HMPREF1062_05387 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392627009|gb|EIY21050.1| hypothetical protein HMPREF1062_05387 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 368

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 153/361 (42%), Gaps = 96/361 (26%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + KR+ +LD  RG+TVV MILV+++GG  +Y  + HS WNG TL D V PFFLFI+GV+ 
Sbjct: 13  QKKRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCDLVFPFFLFIMGVST 72

Query: 117 ALALKK--VPKINGAVKKIIFRTLKLLFWG-------IILQGGYSHAPDALSYGVDMKHI 167
            +AL K         V+K++ RTL +L  G        I  G +              H+
Sbjct: 73  YIALSKFHFQASGSVVRKVLKRTLVILCIGWAIHWFHFICDGDF----------FPFAHL 122

Query: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227
           R  G+L RIAL Y VV+ +      +                   WI G       +I  
Sbjct: 123 RLTGVLPRIALCYCVVSFVALYVNHKYIG----------------WIIG------CLIAG 160

Query: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287
           Y+                   ++ C   G+     N +  +DR + G +HLY        
Sbjct: 161 YA-------------------VLLCIGNGYAPDDTNLLAIIDRNVLGADHLYH------- 194

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL 347
                                ++P +PEGL ST+SAI    IG   G +++  +    + 
Sbjct: 195 ---------------------KSPIDPEGLTSTLSAIAHTLIGFCCGKIILAKEALEQKT 233

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWD 407
                 GF L+    +L  T A+P+NK+++S ++V  T G A     +  S+ I  +D  
Sbjct: 234 LKLFVAGFILMACGFVL--TEALPLNKRIWSPTFVLVTCGLA----AMLQSVLIYFIDMK 287

Query: 408 E 408
           E
Sbjct: 288 E 288


>gi|189465173|ref|ZP_03013958.1| hypothetical protein BACINT_01518 [Bacteroides intestinalis DSM
           17393]
 gi|189437447|gb|EDV06432.1| hypothetical protein BACINT_01518 [Bacteroides intestinalis DSM
           17393]
          Length = 369

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 100/362 (27%), Positives = 163/362 (45%), Gaps = 98/362 (27%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + KR+ +LD  RG+TVV MILV+++GG  +Y  + HS WNG TL D V PFFLFI+G++ 
Sbjct: 14  QKKRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCDLVFPFFLFIMGIST 73

Query: 117 ALALKKVP-KINGAV-KKIIFRTLKLLFWG-------IILQGGYSHAPDALSYGVDMKHI 167
            +AL K   + +G V +KI+ RTL +L  G        I +G +            + H+
Sbjct: 74  YIALNKFHFQASGPVIRKILKRTLVILCIGWAIHWFHFICEGDF----------FPLAHL 123

Query: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227
           R  G+L RIAL Y  V+ +           ++P+++         W+ GF+         
Sbjct: 124 RLTGVLPRIALCYCAVSFVAL--------YVKPKYIG--------WMIGFL--------- 158

Query: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGM-RGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
                            +  Y V  G+  G+   + N +  +DR + G +HLY       
Sbjct: 159 -----------------IIGYAVLLGIGNGYTLDSTNILAIIDRNVLGADHLYH------ 195

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
                                 ++P +PEGL ST++AI    IG   G +++  +    +
Sbjct: 196 ----------------------KSPIDPEGLTSTLAAIAHTLIGFCCGRIILAKEALEQK 233

Query: 347 LKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
                  GF L+    +L  T A+P+NK+++S ++V  T G A     +  ++ I  +D 
Sbjct: 234 TLKLFVAGFILMACGFVL--TEALPLNKRIWSPTFVLVTCGLA----AMLQAVLIYFIDM 287

Query: 407 DE 408
            E
Sbjct: 288 KE 289


>gi|383859754|ref|XP_003705357.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Megachile rotundata]
          Length = 572

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 91/334 (27%), Positives = 164/334 (49%), Gaps = 41/334 (12%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           + S R+ ++D FRG+ ++LMI V++ GG Y   +HS W G T+AD V+P+F +I+G+ I 
Sbjct: 178 RASTRIRSVDTFRGIAILLMIFVNNGGGKYVFFNHSAWYGLTVADLVLPWFAWIMGLTIT 237

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
           ++ +   ++  +  KI+   ++     I+L    +   +          +R+ G+LQ + 
Sbjct: 238 ISKRAELRVTVSRVKIMLHCIRRSLVLILLGLMLNSIKNN-----SFSDLRFPGVLQLLG 292

Query: 178 LVYVVVALIETLTTK--RRPNVLEPRHLSIFTAY--QW-QWIGGFIAFVIYIITTYSLYV 232
           + Y V +++ET+  K   +  +L+    + F      W QW+   +    + + T+ L V
Sbjct: 293 VSYFVCSMLETIFMKPHSQDTLLQFGRFASFRDILDSWPQWLVMAVIMTTHTLITFLLPV 352

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRG-HLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
           PN         G  +Y      RG ++     A GY+DR ++G NH+Y  P         
Sbjct: 353 PNCPKGYFGPGGQYEY------RGKYMNCTAGAAGYIDRLIFG-NHMYPKPK-------- 397

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV 351
             S     LR D          PEGL++TISAI    +G+H G +L+ +  +++R+  W+
Sbjct: 398 -ESIYGDILRYD----------PEGLMNTISAIFIVYLGVHAGKILLLYYQYNSRVIRWI 446

Query: 352 SMGFGLLIIA-IILHFTN---AIPINKQLYSFSY 381
                  IIA  + HF+     IP++K++ + S+
Sbjct: 447 LWAVLTGIIAGNLCHFSTQGGVIPVSKRMMTLSF 480


>gi|224537467|ref|ZP_03678006.1| hypothetical protein BACCELL_02346 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520905|gb|EEF90010.1| hypothetical protein BACCELL_02346 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 368

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 153/361 (42%), Gaps = 96/361 (26%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + KR+ +LD  RG+TVV MILV+++GG  +Y  + HS WNG TL D V PFFLFI+G++ 
Sbjct: 13  QKKRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCDLVFPFFLFIMGIST 72

Query: 117 ALALKK--VPKINGAVKKIIFRTLKLLFWG-------IILQGGYSHAPDALSYGVDMKHI 167
            +AL K         V+KI+ RTL +L  G        I  G +              H+
Sbjct: 73  YIALGKFHFQASGSVVRKILKRTLVILCIGWAIHWFHFICDGDF----------FPFAHL 122

Query: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227
           R  G+L RIAL Y VV+ +      +                   WI G       +I  
Sbjct: 123 RLTGVLPRIALCYCVVSFVALYVNHKYIG----------------WIIG------CLIAG 160

Query: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287
           Y+                   ++ C   G+     N +  +DR + G +HLY        
Sbjct: 161 YA-------------------VLLCIGNGYAPDDTNLLAIIDRNILGADHLYH------- 194

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL 347
                                ++P +PEGL ST+SAI    IG   G +++  +    + 
Sbjct: 195 ---------------------KSPIDPEGLTSTLSAIAHTLIGFCCGKIILAKEALEQKT 233

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWD 407
                 GF L+    +L  T A+P+NK+++S ++V  T G A     +  S+ I  +D  
Sbjct: 234 LKLFVAGFILMACGFVL--TEALPLNKRIWSPTFVLVTCGLA----AMLQSVLIYFIDMK 287

Query: 408 E 408
           E
Sbjct: 288 E 288


>gi|410926267|ref|XP_003976600.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Takifugu rubripes]
          Length = 497

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 86/275 (31%), Positives = 135/275 (49%), Gaps = 46/275 (16%)

Query: 26  DSENGINKEKGLE-RSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAG 84
           D+E  IN E G   R+ V   +   L         SKR+ +LD FRG+++V+M+ V+  G
Sbjct: 248 DTERLINSELGSPGRNTVPVTENILLPPPT----SSKRLQSLDTFRGISLVIMVFVNYGG 303

Query: 85  GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKL 140
           G Y    H  WNG T+AD V P+F+FI+G +IAL++  + +        + K+ +R+L+L
Sbjct: 304 GRYWFFRHESWNGLTVADLVFPWFMFIMGTSIALSVHALLRTGSTRLSLLGKVAWRSLQL 363

Query: 141 LFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLE 199
              G+ I+   Y   P  LS+G     +R  G+LQR+AL Y+VVA ++ L  +       
Sbjct: 364 FLIGLFIINPNYCQGP--LSWGT----LRIPGVLQRLALAYLVVACLDLLVAR------- 410

Query: 200 PRHLSIFTA--------YQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
             HL I+T         Y   W+   +   +++  T+ L VP+         G+      
Sbjct: 411 -AHLEIYTTVSSTDVLLYWPAWVCVLLLESVWLFLTFLLPVPDCPTGYLGPGGI------ 463

Query: 252 CGMRGHLG--PACN--AVGYVDRELWGINHLYSDP 282
               G +G  P C   A G++DR L G  H+Y +P
Sbjct: 464 ----GDMGLFPNCTGGAAGFIDRWLLGEKHIYQNP 494


>gi|414077874|ref|YP_006997192.1| hypothetical protein ANA_C12665 [Anabaena sp. 90]
 gi|413971290|gb|AFW95379.1| hypothetical protein ANA_C12665 [Anabaena sp. 90]
          Length = 376

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 101/335 (30%), Positives = 151/335 (45%), Gaps = 71/335 (21%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+++LD FRG+T+  MIL + AG A   Y  + H+ W+GCT  D + P FLFIVGVA+  
Sbjct: 2   RLSSLDVFRGITIAAMILANMAGVADDVYRPLSHAQWHGCTPTDLIFPCFLFIVGVAMTF 61

Query: 119 ALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           +L K    N   K +  R L+    L   G++L G ++       +  D+  IR  G+LQ
Sbjct: 62  SLAKYTAQNKPTKAVYLRILRRTAILFILGLVLNGFWNQG----VWTFDLSSIRLMGVLQ 117

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RIAL Y+  +LI  L   R+               QW   GG +  + Y +T   + VP 
Sbjct: 118 RIALTYLFASLI-VLKLPRKS--------------QWLVAGGLL--IAYWLTMMYIPVP- 159

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
                  D+G           G L    N   ++DR +    HLY    ++ L       
Sbjct: 160 -------DYGA----------GVLTREGNFGAFIDRLIIPKAHLYKGDGFNFLG------ 196

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
                             +PEGL STI AI+S   G   G  +   K  +++    + + 
Sbjct: 197 ------------------DPEGLFSTIPAIVSVLAGYFTGQWIKDKKQATSQTSMDLVL- 237

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           FGL  + I + +  + PINK++++ SYV FT G A
Sbjct: 238 FGLCCLVIAIIWDVSFPINKKIWTSSYVLFTTGWA 272


>gi|410900570|ref|XP_003963769.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Takifugu rubripes]
          Length = 581

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 96/364 (26%), Positives = 158/364 (43%), Gaps = 96/364 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           +Q+ KR+ +LD FRG  + +M+ V+  GG Y    H+PWNG T+AD VMP+F+FI+G ++
Sbjct: 194 KQRPKRLLSLDTFRGFALTVMVFVNYGGGGYWFFQHAPWNGLTVADLVMPWFVFIIGTSV 253

Query: 117 ALALKKVP----KINGAVKKIIFRTLKLLFWG----------------IILQGGYSHAPD 156
            LA + +     +    ++KI +RT  LL  G                ++ Q  +S A  
Sbjct: 254 VLAFRSMQRRRVRRLQLLRKITWRTGVLLMLGFCFLNYSPRDGPCSVLVLAQDSWSPAAS 313

Query: 157 ALSYGVDMKHIRWCGILQRIALV---YVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQW 213
            L     +   RW   +Q + +    ++++ L+ETL                        
Sbjct: 314 GLHLLHSITPHRWWSSVQDVVVYWPQWLIIILLETL------------------------ 349

Query: 214 IGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGH-LGPACN--AVGYVDR 270
                    ++  T+ + VP+             Y+   G+  H L P C   A GY+DR
Sbjct: 350 ---------WLCVTFLMPVPD---------CPTGYLGAGGIGDHGLYPNCTGGAAGYIDR 391

Query: 271 ELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIG 330
            ++G N +Y  P    +   T                   PF+PEG+L T+++I+ G +G
Sbjct: 392 WMFGDN-MYRYPTCKEMYQTT------------------QPFDPEGVLGTVNSIVMGFLG 432

Query: 331 IHYGHVLIHFKGHS----ARLKHW-VSMGFGLLIIAIILHFTNAIPINKQLYSFSYV--- 382
           +  G +L+ ++G       R   W V +G    I+         IP+NK L+S SYV   
Sbjct: 433 MQAGKILLFYRGADVHILCRFLLWAVILGISAAILTKCTRDEGFIPVNKNLWSLSYVTCM 492

Query: 383 -CFT 385
            CF+
Sbjct: 493 GCFS 496


>gi|270339962|ref|ZP_06203500.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
 gi|270333113|gb|EFA43899.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
          Length = 389

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 167/361 (46%), Gaps = 77/361 (21%)

Query: 42  VQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGA--YARIDHSPWNGCT 99
           +  E    + +  L+ Q+  R+ +LD  RGLTV+LMI V++  G   ++ + HS WNG T
Sbjct: 1   MHRENGVSISILTLMPQQ--RLISLDVLRGLTVMLMIFVNNGAGTQIFSPLRHSRWNGMT 58

Query: 100 LADFVMPFFLFIVGVAIALALKKVPKINGA--VKKIIFRTLKLLFWGIILQGGYSHAPDA 157
           L D V PFFLF+VGV+  L+L+K      A  ++KI  RT  L   G+ +   +  A + 
Sbjct: 59  LCDLVFPFFLFMVGVSTYLSLRKSNFAWSAKTLRKIARRTALLFLIGLTIN-WFDMACNG 117

Query: 158 LSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
               +D+ H+R  G++QRIAL Y   A +  L++K       P+ L +       W+   
Sbjct: 118 --SPLDLAHLRIMGVMQRIALCYGATAFVAILSSK------VPQRLHLI-----PWL--- 161

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
              +  ++  YSL +      ++S                   A N +  VD  + G +H
Sbjct: 162 ---IAVLLIAYSLLLIIGGGYDYSS------------------ATNLLAIVDTHILGYDH 200

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           LY                             R+P +PEGLLST+ AI    IG     + 
Sbjct: 201 LYH----------------------------RSPVDPEGLLSTLPAIAHTLIGFWVARLT 232

Query: 338 IHFKG-HSAR--LKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
           I  +G HS +  ++ ++  G  L++  IIL  T  +P+NK+++S SYV  T G A    G
Sbjct: 233 IGKQGSHSTQNTVRMFLLAGAMLVVSGIIL--TLLLPLNKRIWSPSYVLTTCGLASFTQG 290

Query: 395 I 395
           +
Sbjct: 291 L 291


>gi|395501613|ref|XP_003755186.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Sarcophilus harrisii]
          Length = 425

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 101/368 (27%), Positives = 168/368 (45%), Gaps = 54/368 (14%)

Query: 65  TLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVP 124
           +L +  GL++ LM+ V+  GG Y   +H+PWNG T+AD VMP+F+FI+G ++ L    + 
Sbjct: 32  SLVSKHGLSLTLMVFVNYGGGGYWFFEHAPWNGLTVADLVMPWFVFILGTSVGLTFHNMQ 91

Query: 125 KIN----GAVKKIIFRTLKLLFWGII-LQGGYSHAPDALSYGVDMKHIRWCGILQRIALV 179
           K        ++K+ +RT  L+  G++ L  G +  P + S+       R  G+LQR+   
Sbjct: 92  KRGVKNIQLLRKVAWRTGVLIIIGVLFLNYGPADGPLSWSWA------RLPGVLQRLGFT 145

Query: 180 YVVVALIE-----TLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           Y  VAL++     T T   + N+       +   ++ +W       ++++  T+ L VP 
Sbjct: 146 YFAVALMQIAFGVTDTQIYQVNLWWAPFRDVILYWK-EWFIIISLEILWLCLTFLLPVP- 203

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVG----YVDRELWGINHLYSDPVWSRLEAC 290
                    G  +  +  G  G  G   N  G    Y+D+ + G NHLY  P    L   
Sbjct: 204 ---------GCPRGYLGPGGIGDEGKYFNCTGGAAAYIDKWILGENHLYQFPSCKELYKT 254

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS-ARLKH 349
           T                   PF+PEG+L TI++IL    G+  G +++ ++    + L+ 
Sbjct: 255 T------------------QPFDPEGILGTINSILMAFFGLQAGKIILMYRNKPRSILRR 296

Query: 350 WVSMGFGLLIIAIILHFTNA----IPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMD 405
           ++     L II+ IL         IP+NK L+S S+V  T+  +   LG+   I      
Sbjct: 297 FLIWSILLGIISAILTMGTQNEGFIPVNKNLWSLSFVTTTSCFSFILLGLLYYITDVNGW 356

Query: 406 WDECDACF 413
           W+ C   F
Sbjct: 357 WNGCPFIF 364


>gi|330792857|ref|XP_003284503.1| hypothetical protein DICPUDRAFT_18260 [Dictyostelium purpureum]
 gi|325085533|gb|EGC38938.1| hypothetical protein DICPUDRAFT_18260 [Dictyostelium purpureum]
          Length = 373

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 103/353 (29%), Positives = 150/353 (42%), Gaps = 83/353 (23%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYA--RIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           KR+++LD  RG+T+  MILVD+ GG      +  + WNG + AD + P FLFI G ++AL
Sbjct: 2   KRMSSLDVARGITIFGMILVDNQGGPDVIWPLKETEWNGLSTADLIFPSFLFICGFSVAL 61

Query: 119 ALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIAL 178
           ALK           II RT  L F    L        + +++       R  G+LQRIAL
Sbjct: 62  ALKTAKNTRSTWYNIIRRTFLLFFIQCFL--------NLMAHHFVFSSFRVMGVLQRIAL 113

Query: 179 VYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS 238
            Y     +  ++    P  L+   L   T             V YI   Y+L VP     
Sbjct: 114 CY----FLSCVSFLCFPVFLQRLFLLGTT-------------VTYISVMYALPVPG---- 152

Query: 239 EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSG 298
                        CG +G L P CNA  Y+D +++G N ++               PN  
Sbjct: 153 -------------CG-KGVLTPTCNAGAYLDFKVFGPNMIH---------------PN-- 181

Query: 299 PLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHF-----KGHSARLKHWVSM 353
                         +PEGLLST+SA ++  +G+ +G V   +       +   +  W+ M
Sbjct: 182 --------------DPEGLLSTLSAFITTWMGLEFGRVFTTYYRKYDYSNVDLIVRWIVM 227

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
              L I+  I      +P NK ++SFS+  FT G   GCL     I I  ++W
Sbjct: 228 -IALFIVPAIGLGATVMPFNKLIWSFSFALFTVGTG-GCLITVAYILIDVIEW 278


>gi|347738959|ref|ZP_08870332.1| hypothetical protein AZA_89781 [Azospirillum amazonense Y2]
 gi|346917867|gb|EGY00078.1| hypothetical protein AZA_89781 [Azospirillum amazonense Y2]
          Length = 400

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 104/338 (30%), Positives = 158/338 (46%), Gaps = 72/338 (21%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           + S R+ +LD  RGL V  MILV   G    AY  + H+PW+G TLAD V P FLF VG+
Sbjct: 18  RTSPRLPSLDVLRGLAVAGMILVVSPGDWSKAYTPLKHAPWDGWTLADMVFPTFLFSVGL 77

Query: 115 AIALALKKVPKINGAVK-KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           AIAL+  K+ +   A   KI  R L L+  G++L        +AL Y  D+ H+R  GIL
Sbjct: 78  AIALSFTKIAQNRRAAGVKIARRALALIVLGLVL--------NALPY-FDLAHLRLPGIL 128

Query: 174 QRIALVYVVVALIETLTTKR----RPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
           QRIAL YV+  L+  +T +      P V + R L I  A     + G+ A + ++     
Sbjct: 129 QRIALCYVLATLLCLVTARTGADGSPTVNQ-RALLIAMAVV---LLGYCAVLAWV----- 179

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
             VP                      GHL P  +   ++DR ++ + HL+          
Sbjct: 180 -PVPGIG------------------AGHLDPGGSLAAWIDRGVFTVPHLW---------- 210

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                    P   DA       ++PEGLLST+ A ++  +G+  G  L   K   +RL  
Sbjct: 211 ---------PYGTDAAGAVV--YDPEGLLSTLPATVNVLVGVLAGTAL---KASRSRLNL 256

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            V+    ++++   L     + INK++++ S+  F++G
Sbjct: 257 LVA---AVMLMMAGLALDPVLVINKRIWTSSFALFSSG 291


>gi|328790778|ref|XP_623715.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Apis mellifera]
          Length = 572

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 105/392 (26%), Positives = 181/392 (46%), Gaps = 76/392 (19%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDD 82
            GK S N I  +  L+R +   E +    L     + S R+ ++D FRG+ ++LMI V++
Sbjct: 148 KGKLSPNNIYDD--LDRLQ---EAENATNLVVRTTKFSTRIHSVDTFRGIAILLMIFVNN 202

Query: 83  AGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK--- 139
            GG Y   +HS W G ++AD V+P+F +I+G+ I ++ +   ++  +  KI    L+   
Sbjct: 203 GGGKYIFFNHSAWFGLSIADLVLPWFAWIMGLMITVSKRTELRLTTSRIKITLYCLRRSA 262

Query: 140 -LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVL 198
            L+F G++L    S +         +  +R+ G+LQ + + Y V A++ET+  K  P+  
Sbjct: 263 ILIFLGLMLNSKDSES---------LHDLRFPGVLQLLGVSYFVCAILETIFMK--PHSQ 311

Query: 199 EPRH-LSIFTAY--------QWQWIGGFIAFVIYIITTYSLYV--------PNWSFSEHS 241
           +  H    F  +        QW  + G       I+TT++L          P   F    
Sbjct: 312 DILHQFGRFAMFRDILESWPQWLIMAG-------IVTTHTLITFLLPISNCPKGYFGPGG 364

Query: 242 DHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLR 301
           ++  +   + C           A GY+DR ++G NH Y+          T +      LR
Sbjct: 365 EYHFRGKYINC--------TAGAAGYIDRLIFG-NHTYNH---------TENFLYGQILR 406

Query: 302 EDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW-VSMGFGLLII 360
            D          PEGL++TISAI    +G+H G +L+ +   ++R+  W +   F  +I 
Sbjct: 407 YD----------PEGLMNTISAIFIVYLGVHAGKILLLYYQCNSRVIRWFLWTVFTGIIA 456

Query: 361 AIILHFTNA---IPINKQLYSFSYVCFTAGAA 389
            I+ +F      IPI+K++ + SYV   +  A
Sbjct: 457 GILCNFETQGGIIPISKRMMTLSYVLICSSFA 488


>gi|299140549|ref|ZP_07033687.1| membrane protein [Prevotella oris C735]
 gi|298577515|gb|EFI49383.1| membrane protein [Prevotella oris C735]
          Length = 359

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 96/337 (28%), Positives = 147/337 (43%), Gaps = 79/337 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGA--YARIDHSPWNGCTLADFVMPFFLFIVGVA 115
            + KR+ +LD  RG TV LMILV++  G   YA + HS WNG T  D V PFFLFI+G++
Sbjct: 1   MEKKRLLSLDVLRGATVCLMILVNNGAGKHIYATLQHSKWNGMTPCDLVFPFFLFIMGIS 60

Query: 116 IALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ-GGYSHAPDALSYGVDMKHIRWCGI 172
             L+L+K           KI+ RT+ L F G+ +     + + +AL    D+ H+R   +
Sbjct: 61  TYLSLEKTNFTWSRQVAFKIVKRTVLLFFIGLFINWFDMAISGNAL----DLSHLRIWAV 116

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QRIA+ Y  V++       R                        I  ++ ++  YSL +
Sbjct: 117 MQRIAICYFAVSIFALCCNHRHT----------------------IPAIVILLAAYSLLL 154

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
             W      D                    N +  +D  L+GI HLY +           
Sbjct: 155 I-WGNGYAYD-----------------SQQNILAQIDIRLFGIEHLYHN----------- 185

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                            +P +PEG  S++SAI    IG + G  +   K    ++  ++ 
Sbjct: 186 -----------------SPVDPEGTGSSLSAIAHTLIGFYCGKRMSDAKSTEEKVLRFLI 228

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            G  L+II  I+ F   +P+NK+++S SYVC T G A
Sbjct: 229 TGGFLVIIGYIVSF--GLPLNKRIWSPSYVCMTCGLA 263


>gi|75909960|ref|YP_324256.1| hypothetical protein Ava_3756 [Anabaena variabilis ATCC 29413]
 gi|75703685|gb|ABA23361.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
          Length = 375

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 108/336 (32%), Positives = 154/336 (45%), Gaps = 74/336 (22%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+T+  MILV+ AG A   Y  + H+ W+GCT  D V PFFLFIVGVA++ 
Sbjct: 2   RLTSLDVFRGITIAGMILVNMAGVADDVYPPLAHAEWHGCTPTDLVFPFFLFIVGVAMSF 61

Query: 119 ALKKVPKIN---GAVKKIIFRTLKLLF-WGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           +L K  + N    AV   IFR   +LF  G++L G ++       +  D+ +IR  G+LQ
Sbjct: 62  SLSKYTQENKPTSAVYWRIFRRAAILFVLGLLLNGFWNKG----IWTFDLSNIRIMGVLQ 117

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RI+L Y+  +L   L   R+                 QWI   +  V Y +T   +YVP 
Sbjct: 118 RISLSYLFASL-AVLNLPRKG----------------QWILAGVLLVGYWLTM--MYVP- 157

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
                  D+G           G L    N   YVDR +    HLY+   +  L       
Sbjct: 158 -----VPDYGA----------GVLTREGNFGAYVDRLIIPQAHLYAGDGFKNLG------ 196

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
                             +PEGL STI AI+S   G   G      +  S + +  V + 
Sbjct: 197 ------------------DPEGLFSTIPAIVSVLAGYFTGE---WIRKQSVQTRTSVGLA 235

Query: 355 -FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            FG+  + +   +    PINK+L++ SYV FT+G A
Sbjct: 236 LFGIGCLIVGWGWGWVFPINKKLWTSSYVVFTSGWA 271


>gi|397505551|ref|XP_003823320.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform
           2 [Pan paniscus]
          Length = 622

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 112/418 (26%), Positives = 180/418 (43%), Gaps = 96/418 (22%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQL-LQQKSKRVATLDAFRGLTVVLMILVD 81
             ++++  IN E G       D   G++Q     L     R+ ++D FRG+ ++LM+ V+
Sbjct: 151 SSRETDRLINSELG--SPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMVFVN 208

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFR 136
             GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     KI +R
Sbjct: 209 YGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKIAWR 267

Query: 137 TLKLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRP 195
           +  L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P
Sbjct: 268 SFLLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVP 321

Query: 196 N--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
                E   LS+   T+   QW+   +   +++  T+ L VP          G     + 
Sbjct: 322 EHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP----------GCPTGYLG 371

Query: 252 CGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSW 307
            G  G  G  P C   A GY+DR L G +HLY  P  + L    ++              
Sbjct: 372 PGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEVA-------------- 417

Query: 308 CRAPFEPEGLLSTISAILSGTIGIH----------------------------------- 332
               ++PEG+L TI++I+   +G+                                    
Sbjct: 418 ----YDPEGILGTINSIVMAFLGVQCCPDWVTKQACLTEPLSPLWRILFGPCLEVRATEP 473

Query: 333 --YGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILHFTNA--IPINKQLYSFSYV 382
              G +L+++K  +     R   W  +  GL+ +A+     N   IP+NK L+S SYV
Sbjct: 474 AQAGKILLYYKARTKDILIRFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYV 530


>gi|385809567|ref|YP_005845963.1| heparan-alpha-glucosaminide N-acetyltransferase [Ignavibacterium
           album JCM 16511]
 gi|383801615|gb|AFH48695.1| Heparan-alpha-glucosaminide N-acetyltransferase [Ignavibacterium
           album JCM 16511]
          Length = 378

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 103/348 (29%), Positives = 161/348 (46%), Gaps = 80/348 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVG 113
            + ++R+ +LD FRG+T++ MILV++ G     Y ++ H+ W+GCT  D + PFFLFIVG
Sbjct: 1   MKPTERLVSLDVFRGITIMGMILVNNPGTWSAVYPQLLHAEWHGCTFTDLIFPFFLFIVG 60

Query: 114 VAIALALKKVPKINGAVK----KIIFRTLKLLFWGIILQGGYSHAPDALSYG--VDMKHI 167
           VA++ +L K     G++K     II RT+ L   GIIL G     P  L +G     + +
Sbjct: 61  VAVSYSLTKRKAQGGSMKSLYLNIIRRTVILFLLGIILNG----FPFGLLFGHQFSWETL 116

Query: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227
           R  G+LQRIA+VY V A +   T+               T +Q+        F   I+  
Sbjct: 117 RIPGVLQRIAIVYFVAAFLFLTTS---------------TKFQYW-------FTAAILIL 154

Query: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGM-RGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
           Y+               V  +I   G+   +  P  N   ++D+ + G +H++S    ++
Sbjct: 155 YA--------------AVMSFIPVPGIGYANFEPGKNLSAWIDQMILG-SHMWSG---TK 196

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
           L                        ++PEG+LSTI AI S  +GI  G+ L        +
Sbjct: 197 L------------------------WDPEGILSTIPAIGSAMLGIFTGNWLRSENDQKEK 232

Query: 347 LKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
           + +       L++   I  F    P+NK L++ SYV +TAG A   LG
Sbjct: 233 VVYLFIWANVLMLAGWIWSF--WFPLNKNLWTSSYVLWTAGLALHFLG 278


>gi|321463338|gb|EFX74354.1| hypothetical protein DAPPUDRAFT_129175 [Daphnia pulex]
          Length = 409

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 99/349 (28%), Positives = 162/349 (46%), Gaps = 48/349 (13%)

Query: 70  RGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGA 129
           R LT+V MI+V+  GG Y   +HSPWNG T+AD + P F++I+G +  L+L    +   +
Sbjct: 29  RRLTIVFMIIVNYGGGGYWFFEHSPWNGITIADVIFPCFVWILGASCVLSLNSQLRRALS 88

Query: 130 VKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVAL 185
            ++I++    R++ +L  G++L        ++LS   ++K  R  G+LQR++ VY++VAL
Sbjct: 89  KQRILYSTVRRSVAMLVIGLVL--------NSLSNN-NIKTFRIPGVLQRMSFVYLIVAL 139

Query: 186 IETL----TTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHS 241
           IE         +R     P    + +  QW  +  F++    ++ T+ L VP+       
Sbjct: 140 IELTGFDPEDNQRYAWFAPIRDIVCSWRQWIIVTVFVS--TQLLITFLLPVPDCPLGYTG 197

Query: 242 DHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLR 301
             G++K  +     G       A   VD  L+G +H+Y  P    +   TL+        
Sbjct: 198 AGGLEKNGLYRNCTG------GAARLVDVSLFGNDHIYQRPTPRAIYDATLA-------- 243

Query: 302 EDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL-LII 360
                     F+PEG L  ++ +L   +G     VL+ F  +  R+  W+       L  
Sbjct: 244 ----------FDPEGALGGLTCVLCAYLGAEAAKVLLVFPANKQRIVRWMLWALVTGLSG 293

Query: 361 AIILHFT---NAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
            I+  F      IPINK L+S SYV  T+  A   L I   +FI  + W
Sbjct: 294 GILCDFKIDDGPIPINKNLWSLSYVLVTSSIAFILLTIL-YVFIDVLSW 341


>gi|440684188|ref|YP_007158983.1| hypothetical protein Anacy_4727 [Anabaena cylindrica PCC 7122]
 gi|428681307|gb|AFZ60073.1| hypothetical protein Anacy_4727 [Anabaena cylindrica PCC 7122]
          Length = 376

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 100/335 (29%), Positives = 149/335 (44%), Gaps = 71/335 (21%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+T+  MILV+  G A   Y+ +DH+ WNGCT  D V PFFLFIVGVA+  
Sbjct: 2   RLTSLDVFRGITIAGMILVNMVGVADNKYSLLDHAEWNGCTPTDLVFPFFLFIVGVAMTF 61

Query: 119 ALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           +L K    N   K +  R L+    L   G++L G ++       +  D+  IR  G+LQ
Sbjct: 62  SLSKYTADNKPTKAVYLRILRRAAILFILGLLLNGFWNKG----VWTFDLSSIRLMGVLQ 117

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RI+L Y+  +LI  L   R+                 Q I   +  + Y +T   + VP 
Sbjct: 118 RISLSYLFASLI-VLKLPRKS----------------QLILAGVLLIGYWLTMMYIPVPE 160

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
           +     +  G                  N   ++DR                     L  
Sbjct: 161 YGAGVLTREG------------------NFGAFIDR---------------------LII 181

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
           P +   + D  ++     +PEGL STI AI+S   G   G  +   K  +++    + + 
Sbjct: 182 PKAQLYKGDGFNFMG---DPEGLYSTIPAIVSVLAGYFTGEWIKDKKQANSQTSMDLVL- 237

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           FGL  + I + +  A PINK+L++ SYV FT G A
Sbjct: 238 FGLCCLVIGIIWDVAFPINKKLWTSSYVVFTTGWA 272


>gi|427728534|ref|YP_007074771.1| hypothetical protein Nos7524_1293 [Nostoc sp. PCC 7524]
 gi|427364453|gb|AFY47174.1| hypothetical protein Nos7524_1293 [Nostoc sp. PCC 7524]
          Length = 380

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 104/346 (30%), Positives = 152/346 (43%), Gaps = 74/346 (21%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFF 108
           LQ L +    R+ +LD FRG+T+  MILV+ AG A   Y  + H+ W+GCT  D V PFF
Sbjct: 6   LQSLDKTLPMRLTSLDVFRGITIAAMILVNMAGVADDVYLPLTHADWHGCTPTDLVFPFF 65

Query: 109 LFIVGVAIALALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGVDM 164
           LFIVGVA+  +L K  + N     I +R L+    L   G+ L G ++       +  D 
Sbjct: 66  LFIVGVAMTFSLSKYTQDNKPTSAIYWRILRRAAILFILGLFLNGFWNQG----VWTFDF 121

Query: 165 KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYI 224
             IR  G+LQRI+L Y++ +LI  L   R+                 QW+   +  + Y 
Sbjct: 122 TSIRMMGVLQRISLSYLLASLI-VLKLPRKG----------------QWLLAGVLLIGYW 164

Query: 225 ITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVW 284
           +    + VP        D+G           G L    N   YVDR +    HLY    +
Sbjct: 165 LAMMYIPVP--------DYGA----------GVLTREGNFGAYVDRLIIPKAHLYKGDGF 206

Query: 285 SRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS 344
           + +                         +PEGL STI AI+S   G   G  +   +   
Sbjct: 207 NFMG------------------------DPEGLFSTIPAIVSVLAGYFTGEWI---RKQP 239

Query: 345 ARLKHWVSMG-FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            + +  V +  FG+  + I   +    PINK+L++ SYV FT+G A
Sbjct: 240 VQTRTSVGLALFGIGCLMIGWAWGWVFPINKKLWTSSYVVFTSGWA 285


>gi|336416828|ref|ZP_08597160.1| hypothetical protein HMPREF1017_04268 [Bacteroides ovatus
           3_8_47FAA]
 gi|335937266|gb|EGM99170.1| hypothetical protein HMPREF1017_04268 [Bacteroides ovatus
           3_8_47FAA]
          Length = 371

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 103/340 (30%), Positives = 149/340 (43%), Gaps = 75/340 (22%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            KS+R+ +LD  RG+T+V MILV++ G     YA + H+ WNG T  D V PFF+FI+GV
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPGTWESIYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60

Query: 115 AIALALKKVPK--INGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD--MKHIRWC 170
           +++ AL +       G + K++ RT+ L   G+ L          +  GV+    HIR  
Sbjct: 61  SMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLSWF-----SLVCTGVEQPFSHIRIL 115

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQR+AL Y   +L+  +   RRP  L              WI G       I+  YS+
Sbjct: 116 GVLQRLALAYFFGSLL--IVGVRRPANLA-------------WISGI------ILAGYSI 154

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
            +      E S+                    N +   DR L+G  HLY           
Sbjct: 155 LLALGHGFELSEQ-------------------NIIAVTDRTLFGEAHLY----------- 184

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                     RE  P   R  F+PEGLLST+  I    IG   G++L        RL   
Sbjct: 185 ----------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQI 234

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
             +G  LL    +L +    P+NK+++S ++V  T G A 
Sbjct: 235 SILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFAS 272


>gi|160883406|ref|ZP_02064409.1| hypothetical protein BACOVA_01375 [Bacteroides ovatus ATCC 8483]
 gi|156111126|gb|EDO12871.1| hypothetical protein BACOVA_01375 [Bacteroides ovatus ATCC 8483]
          Length = 371

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 103/340 (30%), Positives = 149/340 (43%), Gaps = 75/340 (22%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            KS+R+ +LD  RG+T+V MILV++ G     YA + H+ WNG T  D V PFF+FI+GV
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPGTWESIYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60

Query: 115 AIALALKKVPK--INGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD--MKHIRWC 170
           +++ AL +       G + K++ RT+ L   G+ L          +  GV+    HIR  
Sbjct: 61  SMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLSWF-----SLVCTGVEQPFSHIRIL 115

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQR+AL Y   +L+  +   RRP  L              WI G       I+  YS+
Sbjct: 116 GVLQRLALAYFFGSLL--IVGVRRPANLA-------------WISGI------ILAGYSI 154

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
            +      E S+                    N +   DR L+G  HLY           
Sbjct: 155 LLALGHGFELSEQ-------------------NIIAVTDRTLFGEAHLY----------- 184

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                     RE  P   R  F+PEGLLST+  I    IG   G++L        RL   
Sbjct: 185 ----------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQI 234

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
             +G  LL    +L +    P+NK+++S ++V  T G A 
Sbjct: 235 SILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFAS 272


>gi|423289836|ref|ZP_17268686.1| hypothetical protein HMPREF1069_03729 [Bacteroides ovatus
           CL02T12C04]
 gi|392666578|gb|EIY60091.1| hypothetical protein HMPREF1069_03729 [Bacteroides ovatus
           CL02T12C04]
          Length = 371

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 103/340 (30%), Positives = 149/340 (43%), Gaps = 75/340 (22%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            KS+R+ +LD  RG+T+V MILV++ G     YA + H+ WNG T  D V PFF+FI+GV
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPGTWESIYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60

Query: 115 AIALALKKVPK--INGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD--MKHIRWC 170
           +++ AL +       G + K++ RT+ L   G+ L          +  GV+    HIR  
Sbjct: 61  SMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLSWF-----SLVCTGVEQPFSHIRIL 115

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQR+AL Y   +L+  +   RRP  L              WI G       I+  YS+
Sbjct: 116 GVLQRLALAYFFGSLL--IVGVRRPANLA-------------WISGI------ILAGYSI 154

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
            +      E S+                    N +   DR L+G  HLY           
Sbjct: 155 LLALGHGFELSEQ-------------------NIIAVTDRTLFGEAHLY----------- 184

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                     RE  P   R  F+PEGLLST+  I    IG   G++L        RL   
Sbjct: 185 ----------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQI 234

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
             +G  LL    +L +    P+NK+++S ++V  T G A 
Sbjct: 235 SILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFAS 272


>gi|428319268|ref|YP_007117150.1| Heparan-alpha-glucosaminide N-acetyltransferase [Oscillatoria
           nigro-viridis PCC 7112]
 gi|428242948|gb|AFZ08734.1| Heparan-alpha-glucosaminide N-acetyltransferase [Oscillatoria
           nigro-viridis PCC 7112]
          Length = 406

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 105/366 (28%), Positives = 150/366 (40%), Gaps = 106/366 (28%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+ +  MILV++ G     Y  +DH+ WNGCT  D V PFFLFIVG A++ 
Sbjct: 2   RLKSLDVFRGIAIASMILVNNPGSWEQVYPPLDHAEWNGCTPTDLVFPFFLFIVGCAMSF 61

Query: 119 ALKK----VPKINGAVKKIIFRTLKL------------------LFWGI----------- 145
           +L K     PK      KII +  KL                  ++W I           
Sbjct: 62  SLSKYIQNYPKTGIETSKIIQKNEKLESDKNPFPSSFFLLPASNIYWRIARRAAILFILG 121

Query: 146 ILQGGYSHAPDALSYGVDMKH---IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRH 202
           +L    S A D L     +++   IR  G+LQRI L Y + A+     + R   +L    
Sbjct: 122 LLLNTSSIALDVLLNSAPVENFGKIRIMGVLQRIGLAYFIGAIAILNLSPRNQKLLAAAV 181

Query: 203 LSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPAC 262
           L          +G + A  ++ +  Y+                          G L P  
Sbjct: 182 L----------LGYWGALTVFAVGGYT-------------------------AGELTPEG 206

Query: 263 NAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTIS 322
           N  GYVDR + G  HLY                               PF+PEGLLST+ 
Sbjct: 207 NLGGYVDRLILGSQHLYKG----------------------------GPFDPEGLLSTLP 238

Query: 323 AILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-PINKQLYSFSY 381
           A+++  IG   G  L   +    + +  +++    L   +I H    + PINKQL++ SY
Sbjct: 239 AVVTVLIGYFTGEWL---RVQPIKTRTSINLAICGLSCVVIGHLWGFLFPINKQLWTSSY 295

Query: 382 VCFTAG 387
           V FTAG
Sbjct: 296 VVFTAG 301


>gi|186683151|ref|YP_001866347.1| hypothetical protein Npun_R2871 [Nostoc punctiforme PCC 73102]
 gi|186465603|gb|ACC81404.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
          Length = 375

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 105/340 (30%), Positives = 151/340 (44%), Gaps = 82/340 (24%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+T+  MILV+ AG A   Y  + H+ W+GCT  D V PFFLFIVGVA+  
Sbjct: 2   RLTSLDVFRGITIAAMILVNMAGVADNIYPPLAHADWHGCTPTDLVFPFFLFIVGVAMTF 61

Query: 119 ALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           +L K  + N     I +R L+    L   G++L G ++       +  D+  IR  G+LQ
Sbjct: 62  SLSKYTEDNKPTSAIYWRILRRAAILFALGLLLNGFWNQG----VWTFDLSSIRIMGVLQ 117

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RI++ Y++ +LI  L   R+                 QWI   +  + Y +    L VP 
Sbjct: 118 RISITYLLASLI-VLNLPRKG----------------QWILAAVILIGYWLMMMYLPVP- 159

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
                  D+G           G L    N   Y+DR +    HLY    +  +       
Sbjct: 160 -------DYGA----------GVLTREGNLGAYIDRMIIPKAHLYKGDGFKFMG------ 196

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
                             +PEGL STI AI+S   G   G  +      S  ++   S+G
Sbjct: 197 ------------------DPEGLFSTIPAIVSVLAGYFTGQWI-----RSQPVQSRTSIG 233

Query: 355 FGL-----LIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            GL     LII     +T   PINK+L++ SYV F++G A
Sbjct: 234 LGLFGVGCLIIGWAWGWT--FPINKKLWTSSYVVFSSGWA 271


>gi|383111974|ref|ZP_09932776.1| hypothetical protein BSGG_3641 [Bacteroides sp. D2]
 gi|423296747|ref|ZP_17274817.1| hypothetical protein HMPREF1070_03482 [Bacteroides ovatus
           CL03T12C18]
 gi|313696106|gb|EFS32941.1| hypothetical protein BSGG_3641 [Bacteroides sp. D2]
 gi|392669124|gb|EIY62615.1| hypothetical protein HMPREF1070_03482 [Bacteroides ovatus
           CL03T12C18]
          Length = 371

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 103/340 (30%), Positives = 149/340 (43%), Gaps = 75/340 (22%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            KS+R+ +LD  RG+T+V MILV++ G     YA + H+ WNG T  D V PFF+FI+GV
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPGTWESIYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60

Query: 115 AIALALKKVPK--INGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD--MKHIRWC 170
           +++ AL +       G + K++ RT+ L   G+ L          +  GV+    HIR  
Sbjct: 61  SMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLSWF-----SLVCTGVEQPFSHIRIL 115

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQR+AL Y   +L+  +   RRP  L              WI G       I+  YS+
Sbjct: 116 GVLQRLALAYFFGSLL--IVGVRRPANLA-------------WISGI------ILAGYSI 154

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
            +      E S+                    N +   DR L+G  HLY           
Sbjct: 155 LLALGHGFELSEQ-------------------NIIAVTDRTLFGEAHLY----------- 184

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                     RE  P   R  F+PEGLLST+  I    IG   G++L        RL   
Sbjct: 185 ----------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQI 234

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
             +G  LL    +L +    P+NK+++S ++V  T G A 
Sbjct: 235 SILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFAS 272


>gi|383115204|ref|ZP_09935962.1| hypothetical protein BSGG_2914 [Bacteroides sp. D2]
 gi|313695379|gb|EFS32214.1| hypothetical protein BSGG_2914 [Bacteroides sp. D2]
          Length = 361

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 90/335 (26%), Positives = 154/335 (45%), Gaps = 75/335 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVG 113
              +KR+ +LD  RG+TV  MILV++ G     +A   H+ W+G + AD V P F+F++G
Sbjct: 1   MNPNKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMG 60

Query: 114 VAIALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           ++  ++L K   +   A+ KII R+L L+F G++++  +  A D+ +Y  D+  +R  G+
Sbjct: 61  ISTYISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGV 118

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QR+ + Y + AL+      +R                      F+   I ++  Y ++ 
Sbjct: 119 MQRLGICYGITALLAVTIPHKR----------------------FMPLAIILLVVYFIF- 155

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                 +   +G +K             A N VG +D  + G NH+Y             
Sbjct: 156 ------QLFGNGFEK------------SADNIVGMIDSAILGSNHMYLQG---------- 187

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                           R   +PEG+LSTI A+    IG   G ++I  K +  R+ +   
Sbjct: 188 ----------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNERRMLNLFL 231

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +G  LL +  +L +  A P+NK+L+S S+V  T G
Sbjct: 232 IGTTLLFVGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|125981811|ref|XP_001354909.1| GA19944 [Drosophila pseudoobscura pseudoobscura]
 gi|54643221|gb|EAL31965.1| GA19944 [Drosophila pseudoobscura pseudoobscura]
          Length = 574

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 94/342 (27%), Positives = 163/342 (47%), Gaps = 53/342 (15%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           + KR+ +LD FRGL++VLMI V+  GG YA I+H+ WNG  LAD V P FL+I+GV I L
Sbjct: 181 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADLVFPSFLWIMGVCIPL 240

Query: 119 ALK-KVPKINGAVK---KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           ++K ++ + N   +   +I++R++KL   G+ L        +++S G  ++ +R  G+LQ
Sbjct: 241 SVKAQLSRGNSKARICLRILWRSIKLFAIGLCL--------NSMS-GPSLEQLRLMGVLQ 291

Query: 175 RIALVYVVVALIETLTTKR---RPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           R  + ++VV ++ TL ++R   +P     R +     +  +          Y+  T+ L 
Sbjct: 292 RFGIAFLVVGILHTLCSRREQVQPQRAWHRAIYDVCLFSGELAVLLALIAAYLGLTFGLP 351

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLG--PAC--NAVGYVDRELWGINHLYSDPVWSRL 287
           VP          G  +  +  G +  L   P C   A GYVD ++ G  H+Y  P    +
Sbjct: 352 VP----------GCPRGYLGPGGKHDLAAHPNCIGGAAGYVDLQVLGNAHIYQHPTAKYV 401

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG-HVLIHFKGHSAR 346
                              +  + F+PEG+   + +++   +G   G  +L+H    +  
Sbjct: 402 -------------------YDSSAFDPEGVFGCLLSVVQVLLGAFAGLTLLVHTTWQTRL 442

Query: 347 LKHWVSMGFGLLIIAIILHFTN---AIPINKQLYSFSYVCFT 385
            +  +      L+   +  F+     IP+NK L+S S+V  T
Sbjct: 443 RRWLLLSLLLGLVGGALCGFSREGGVIPVNKNLWSLSFVFVT 484


>gi|17229379|ref|NP_485927.1| hypothetical protein all1887 [Nostoc sp. PCC 7120]
 gi|17130977|dbj|BAB73586.1| all1887 [Nostoc sp. PCC 7120]
          Length = 375

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 108/341 (31%), Positives = 155/341 (45%), Gaps = 84/341 (24%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+T+  MILV+ AG A   Y  + H+ W+GCT  D V PFFLFIVGVA++ 
Sbjct: 2   RLTSLDVFRGITIAGMILVNMAGVADDVYPPLAHAEWHGCTPTDLVFPFFLFIVGVAMSF 61

Query: 119 ALKKVPKINGAVKKI---IFRTLKLLF-WGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           +L K  + N     +   IFR   +LF  G++L G ++       +  D+ +IR  G+LQ
Sbjct: 62  SLSKYTQENKPTSVVYWRIFRRAAILFVLGLLLNGFWNKG----IWTFDLSNIRIMGVLQ 117

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RI+L Y    L  +LT    P              + QWI   +  V Y +T   +YVP 
Sbjct: 118 RISLSY----LFASLTVLNLPR-------------KGQWILAGVLLVGYWLTM--MYVP- 157

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
                  D+G           G L    N   Y+DR +   +HLY+   +  L       
Sbjct: 158 -----VPDYGA----------GVLTREGNFGAYIDRLIIPKSHLYAGDGFKNLG------ 196

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR---LKHWV 351
                             +PEGL STI AI+S   G        +F G   R   ++   
Sbjct: 197 ------------------DPEGLFSTIPAIVSVLAG--------YFTGEWIRKQPVQTRT 230

Query: 352 SMGFGLLIIAIIL---HFTNAIPINKQLYSFSYVCFTAGAA 389
           S+G  L  I  ++    +    PINK+L++ SYV FT+G A
Sbjct: 231 SLGLALFGIGCLIVGWGWGWVFPINKKLWTSSYVVFTSGWA 271


>gi|126272886|ref|XP_001369919.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Monodelphis domestica]
          Length = 389

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 163/369 (44%), Gaps = 70/369 (18%)

Query: 72  LTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL----KKVPKIN 127
           L++ LMI V+  GG Y   +H+PWNG T+AD VMP+F+FI+G ++ LA     +K  K  
Sbjct: 3   LSLTLMIFVNYGGGGYWFFEHAPWNGLTIADLVMPWFVFILGTSVGLAFHVMQRKGVKKF 62

Query: 128 GAVKKIIFRTLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI 186
              +K+ +RT  L+  G + L  G    P + S+       R  G+LQR+   Y +VAL+
Sbjct: 63  KLFRKVAWRTGVLIAIGALFLNYGPVDGPLSWSWA------RLPGVLQRLGFTYFIVALM 116

Query: 187 ETL-----TTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHS 241
           +         K +  V       I   +Q +WI       I++  T+ L VP        
Sbjct: 117 QIAFGVADMQKYQVGVWWAPFRDIVLYWQ-EWIIIIGLECIWLCLTFLLPVPG------- 168

Query: 242 DHGVKKYIVKCGMRGHLGPA----------CN--AVGYVDRELWGINHLYSDPVWSRLEA 289
                     C  RG+LGP           C   A  Y+D+ + G NHLY  P    L  
Sbjct: 169 ----------CP-RGYLGPGGIGDEGKYFNCTGGAAAYIDKWILGENHLYRFPSCKELYK 217

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS-ARLK 348
            T                   PF+PEG+L TI++I+    G+  G +++ ++    + LK
Sbjct: 218 TT------------------QPFDPEGILGTINSIIMAFFGLQAGKIILMYRSKPRSILK 259

Query: 349 HWVSMGFGLLIIAIILHFTNA----IPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEM 404
            ++     L II+ IL         IP+NK L+S S+V  T+  +   LG+   I   + 
Sbjct: 260 RFLIWSVLLGIISAILTLGTQNEGFIPVNKNLWSLSFVTTTSCFSFFLLGLLYYIIDMKG 319

Query: 405 DWDECDACF 413
            W  C   +
Sbjct: 320 WWSGCPLIY 328


>gi|160884063|ref|ZP_02065066.1| hypothetical protein BACOVA_02039 [Bacteroides ovatus ATCC 8483]
 gi|423291476|ref|ZP_17270324.1| hypothetical protein HMPREF1069_05367 [Bacteroides ovatus
           CL02T12C04]
 gi|156110405|gb|EDO12150.1| hypothetical protein BACOVA_02039 [Bacteroides ovatus ATCC 8483]
 gi|392663476|gb|EIY57026.1| hypothetical protein HMPREF1069_05367 [Bacteroides ovatus
           CL02T12C04]
          Length = 361

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 90/335 (26%), Positives = 154/335 (45%), Gaps = 75/335 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVG 113
              +KR+ +LD  RG+TV  MILV++ G     +A   H+ W+G + AD V P F+F++G
Sbjct: 1   MNPNKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMG 60

Query: 114 VAIALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           ++  ++L K   +   A+ KII R+L L+F G++++  +  A D+ +Y  D+  +R  G+
Sbjct: 61  ISTYISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGV 118

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QR+ + Y + AL+      +R                      F+   I ++  Y ++ 
Sbjct: 119 MQRLGICYGITALLAVTIPHKR----------------------FMPLAIILLAVYFIF- 155

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                 +   +G +K             A N VG +D  + G NH+Y             
Sbjct: 156 ------QLFGNGFEK------------SADNIVGMIDSAILGSNHMYLQG---------- 187

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                           R   +PEG+LSTI A+    IG   G ++I  K +  R+ +   
Sbjct: 188 ----------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFL 231

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +G  LL +  +L +  A P+NK+L+S S+V  T G
Sbjct: 232 IGTTLLFVGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|224496100|ref|NP_001139059.1| uncharacterized protein LOC565246 precursor [Danio rerio]
          Length = 582

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 98/358 (27%), Positives = 159/358 (44%), Gaps = 56/358 (15%)

Query: 36  GLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPW 95
           G + S   +E   E Q+ +  + KS R+ +LD FRG ++ +M+ V+  GG Y    H+PW
Sbjct: 178 GSQNSMEMEEPNTEQQIDES-KPKSSRLKSLDTFRGFSLTVMVFVNYGGGGYWFFQHAPW 236

Query: 96  NGCTLADFVMPFFLFIVGVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGIILQGGY 151
           NG T+AD VMP+F+FI+G ++ LA     +K   +   ++K+ +RT+ L+  G      Y
Sbjct: 237 NGLTVADLVMPWFVFIIGTSVMLAFTSMHRKGVSLLQLLRKVTWRTVVLMLIGFCFM-NY 295

Query: 152 SHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQW 211
           S     L    D +     G+            L  + T     N ++   L     Y  
Sbjct: 296 SPRDGILVLAADTRSSPASGL-----------HLFRSGTDHNWWNPIQDVIL-----YWP 339

Query: 212 QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACN--AVGYVD 269
           +W+   +   +++  T+ L VPN         GV            L P C   A  ++D
Sbjct: 340 EWLFIVLLETLWLCLTFLLPVPNCPTGYLGAGGVGD--------AGLYPNCTGGAAAHID 391

Query: 270 RELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTI 329
           +  +G N ++  P    L   T                   PF+PEG+L TI++I+ G +
Sbjct: 392 KWFFGDN-MFWYPTCKVLYRTT------------------EPFDPEGVLGTINSIVMGFL 432

Query: 330 GIHYGHVLIHF----KGHSARLKHW-VSMGFGLLIIAIILHFTNAIPINKQLYSFSYV 382
           G+  G +L+ F    KG  AR   W + +G    I++        IP+NK L+S S+V
Sbjct: 433 GMQAGKILLFFRQMNKGILARFLVWALILGISAAILSKCTRDEGFIPVNKNLWSLSFV 490


>gi|170027692|ref|XP_001841731.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167862301|gb|EDS25684.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 558

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 85/278 (30%), Positives = 127/278 (45%), Gaps = 48/278 (17%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           +  R+ +LD FRG+ ++LMI V+  GG Y  I+H+ WNG  +AD V P+FLFI+GV I +
Sbjct: 288 QKTRLRSLDTFRGIAIMLMIFVNSGGGDYWWIEHATWNGLHVADLVFPWFLFIMGVCIPI 347

Query: 119 AL-----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           +L     + VP+    +K +  R+LKL   G+ L             G  +  +R  G+L
Sbjct: 348 SLRSQLGRNVPRYE-ILKNVAVRSLKLFLIGLCLNS---------INGPTVADLRLFGVL 397

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEP---RHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           QR  + Y VV+ I     +    +  P    H  I   ++   I G I FV Y++  + +
Sbjct: 398 QRFGVAYFVVSAIHLYCYQENDQLQHPLARSHADILRLWKHWVIVGTIVFV-YLLVIFFV 456

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNA--VGYVDRELWGINHLYSDPVWSRLE 288
            VPN   S +   G K  ++       L P C     GY+DR++ GI HLY  P      
Sbjct: 457 PVPNCP-SGYFGPGGKHLML-------LYPNCTGGITGYIDRQVLGIRHLYQHP------ 502

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILS 326
                   +     DA      PF+PEG    +  I  
Sbjct: 503 --------TARYMYDA-----MPFDPEGPFGCLPTIFQ 527


>gi|365877201|ref|ZP_09416706.1| hypothetical protein EAAG1_13068 [Elizabethkingia anophelis Ag1]
 gi|442587874|ref|ZP_21006688.1| hypothetical protein D505_08600 [Elizabethkingia anophelis R26]
 gi|365755061|gb|EHM96995.1| hypothetical protein EAAG1_13068 [Elizabethkingia anophelis Ag1]
 gi|442562373|gb|ELR79594.1| hypothetical protein D505_08600 [Elizabethkingia anophelis R26]
          Length = 400

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 106/359 (29%), Positives = 152/359 (42%), Gaps = 86/359 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            KS R  +LD FRG TV LMILV++ G     Y  ++H+ W+GCT  D V PFFLF VG 
Sbjct: 1   MKSARYYSLDVFRGATVALMILVNNPGTWSAIYPPLEHAKWHGCTPTDLVFPFFLFAVGN 60

Query: 115 AIALALKKVPKINGAV--KKIIFRTLKLLFWGIILQ--GGYSHAPDALSY----GVDMKH 166
           A+   + K  + N +V  KK+I RTL +   G+ L     +    D+LS+      D   
Sbjct: 61  AMTFVIPKFQQHNSSVFWKKVIKRTLLIFGIGLFLNWCPFFQWDHDSLSFISWESSDENG 120

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIIT 226
           +R  G+LQRIA+ Y   ++I     ++   VL              WI G +  + +++T
Sbjct: 121 VRIMGVLQRIAIAYFFASVIAYYFKEKM--VL--------------WISGALLVIYWLLT 164

Query: 227 TYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
            +               G   Y     + G +G        +D  + GI H Y       
Sbjct: 165 LFL-------------GGTDPY----SLEGFIGVP------IDHSILGIAHEYKGE---- 197

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL-----IHFK 341
                                   PF+PEGL STI AI     G   G+ +     I + 
Sbjct: 198 ----------------------GVPFDPEGLFSTIPAISQVLFGYLIGNYIQKKGNIQWF 235

Query: 342 GHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAADGCLGI 395
           G S +     SM  GL I+ II  F + +     P NK+++S SY   T G A   LG+
Sbjct: 236 GKSLKENSIYSMLSGLFILGIIALFISYVWQLDFPYNKKIWSSSYTLLTTGLAIMVLGV 294


>gi|348507459|ref|XP_003441273.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Oreochromis niloticus]
          Length = 460

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 96/351 (27%), Positives = 154/351 (43%), Gaps = 52/351 (14%)

Query: 43  QDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLAD 102
           ++E          ++ K  R+ +LD FRG  + +M+ V+  GG Y    H+PWNG T+AD
Sbjct: 59  KEESHASETAHGTVKAKPTRLLSLDTFRGFALTVMVFVNYGGGGYWFFQHAPWNGLTVAD 118

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHAPDAL 158
            VMP+F+F++G ++ LA   + +   +    ++KI +RT+ LL  G      YS      
Sbjct: 119 LVMPWFVFVIGTSVVLAFSSMQRRGVSRLQLLRKITWRTVVLLLLGFCFL-NYSPRDGPC 177

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFI 218
           S  V  +  R               + +  L +    N   P  +     Y  QW+   +
Sbjct: 178 SVLVLAEDPR-----------SAAASGLHLLCSVSPYNWWNP--VQDILLYWPQWLIIIL 224

Query: 219 AFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACN--AVGYVDRELWGIN 276
              +++  T+ + VPN         G+       G  G L P C   A GY+DR ++G N
Sbjct: 225 LETLWLCLTFLMPVPNCPTGYLGAGGI-------GDNG-LYPNCTGGAAGYIDRWMFGDN 276

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
            +Y  P    +   T                   PF+PEG+L TI++I+ G +G+  G +
Sbjct: 277 -MYRYPTCKEMYRTT------------------QPFDPEGVLGTINSIVIGFLGMQAGKI 317

Query: 337 LIHFKGHS----ARLKHW-VSMGFGLLIIAIILHFTNAIPINKQLYSFSYV 382
           LI +K  +     R   W V +G    I++        IP+NK L+S SYV
Sbjct: 318 LIFYKKMNIHILCRFLVWTVILGISAAILSKCTRDGGFIPVNKNLWSLSYV 368


>gi|427716050|ref|YP_007064044.1| hypothetical protein Cal7507_0722 [Calothrix sp. PCC 7507]
 gi|427348486|gb|AFY31210.1| hypothetical protein Cal7507_0722 [Calothrix sp. PCC 7507]
          Length = 375

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 102/334 (30%), Positives = 146/334 (43%), Gaps = 70/334 (20%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+T+  MILV+  G A   Y  +DH+ WNGCT  D V PFFLFIVGVA+  
Sbjct: 2   RLTSLDVFRGITIAAMILVNMVGVADDKYPLLDHAEWNGCTPTDLVFPFFLFIVGVAMTF 61

Query: 119 ALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIAL 178
           +L K  + N     + +R L+     + L    +   +   +  D+  IR  G+LQRI+L
Sbjct: 62  SLSKYTEGNKPNSSVYWRILRRAAILLALGLLLNGFWNKGVWTFDLSSIRLMGVLQRISL 121

Query: 179 VYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS 238
            Y +VA +  L   R+                 QWI   +  + Y +T   L VP     
Sbjct: 122 SY-LVASVTVLNLPRKG----------------QWILAAVLLIGYWLTMMYLPVPG---- 160

Query: 239 EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSG 298
               HG           G L    N   Y+DR +    HLY    ++ +           
Sbjct: 161 ----HGA----------GVLTREGNLGAYIDRLIIPKAHLYKGDKFNFMG---------- 196

Query: 299 PLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM---GF 355
                         +PEGL STI AI+S   G   G   +  +    R +  + +   G 
Sbjct: 197 --------------DPEGLFSTIPAIVSVLAGYFAG---LWIRSQPVRSRTSIGLALFGI 239

Query: 356 GLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           G LII     +T   PINK+L++ SYV F++G A
Sbjct: 240 GCLIIGWAWGWT--FPINKKLWTSSYVVFSSGWA 271


>gi|157112232|ref|XP_001657450.1| hypothetical protein AaeL_AAEL000933 [Aedes aegypti]
 gi|108883723|gb|EAT47948.1| AAEL000933-PA [Aedes aegypti]
          Length = 569

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 102/353 (28%), Positives = 159/353 (45%), Gaps = 62/353 (17%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
           ++ +  R+ +LD FRG+ ++LMI V+  GG Y  I+H+ WNG  +AD V P+FLFI+GV 
Sbjct: 175 VEPRKTRLQSLDTFRGIAIMLMIFVNSGGGHYWWIEHATWNGLHVADLVFPWFLFIMGVC 234

Query: 116 IALALKKVPKINGAVKKII----FRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
           I ++L+     N   K I+     R+ KL   G+ L             G  + ++R  G
Sbjct: 235 IPISLRSQVSRNIPRKTILANVAVRSFKLFCIGLCLNS---------INGPQVANLRLFG 285

Query: 172 ILQRIALVYVVVALI------ETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYII 225
           +LQR  + Y VV+ I      E++  + R   L   ++ I   ++   I G I F IY++
Sbjct: 286 VLQRFGVAYFVVSAIHLYCYSESIEFQGR---LARLNVDILRLWKHWIIMGAIVF-IYLL 341

Query: 226 TTYSLYVPNWS---FSEHSDHGVKKYIVKCGMRGHLGPACNA--VGYVDRELWGINHLYS 280
             + +  P      F     H +  Y           P C     GY+DR + G NHLY 
Sbjct: 342 IMFLVAAPGCPSGYFGPGGKHLMAMY-----------PNCTGGITGYLDRIILGNNHLYQ 390

Query: 281 DPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHF 340
            P              +     DA +     F+PEG    +  IL   +G+  G +++  
Sbjct: 391 HP--------------TARYVYDAQA-----FDPEGPFGCLPTILQVFLGLQCGVLILTH 431

Query: 341 KGHSARLKHWVSMG-FGLLIIAIILHFTNA---IPINKQLYSFSYVCFTAGAA 389
               AR++   + G    L+  I+  F+     IP+NK L+S SYV  TA  A
Sbjct: 432 TEVMARIRRMAAWGTVLGLLGGILCGFSKNDGWIPVNKNLWSLSYVLVTASLA 484


>gi|281423205|ref|ZP_06254118.1| putative membrane protein [Prevotella oris F0302]
 gi|281402541|gb|EFB33372.1| putative membrane protein [Prevotella oris F0302]
          Length = 359

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 97/337 (28%), Positives = 146/337 (43%), Gaps = 79/337 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGA--YARIDHSPWNGCTLADFVMPFFLFIVGVA 115
            + KR+ +LD  RG TV LMILV++  G   YA + HS WNG T  D V PFFLFI+G++
Sbjct: 1   MEKKRLLSLDVLRGATVCLMILVNNGAGKHIYATLQHSKWNGMTPCDLVFPFFLFIMGIS 60

Query: 116 IALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ-GGYSHAPDALSYGVDMKHIRWCGI 172
             L+LKK           KI+ RT+ L   G+ +     + + +AL    D  H+R   +
Sbjct: 61  TYLSLKKTNFTWSRQVAFKIVKRTVLLFLIGLFINWFDMAISGNAL----DFSHLRIWAV 116

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QRIA+ Y  V+ I  L    R  +  P  + +  AY    I G                
Sbjct: 117 MQRIAICYFAVS-IFALCCNHRHTI--PAIVILLAAYNLLLIWG---------------- 157

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
             +++                         N +  +D  L+GI HLY +           
Sbjct: 158 NGYAYDSQQ---------------------NILAQIDIRLFGIEHLYHN----------- 185

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                            +P +PEG  S++SAI    IG + G  +   K    ++  ++ 
Sbjct: 186 -----------------SPVDPEGTGSSLSAIAHTLIGFYCGKRMSDAKSTEEKVLRFLI 228

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            G  L+II  I+ F   +P+NK+++S SYVC T G A
Sbjct: 229 TGGFLVIIGYIVSF--GLPLNKRIWSPSYVCMTCGLA 263


>gi|345320430|ref|XP_001516736.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase, partial
           [Ornithorhynchus anatinus]
          Length = 448

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 96/327 (29%), Positives = 151/327 (46%), Gaps = 51/327 (15%)

Query: 20  EQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMIL 79
           ++ + ++++  IN E G       D    +LQ +       +R+ +LD FRG ++++M+ 
Sbjct: 159 KKMNPRETDRLINSELG--SPTRADSYNSDLQAEVWRSSSPQRLRSLDTFRGFSLIIMVF 216

Query: 80  VDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KII 134
           V+  GG Y    H  WNG T+AD V P+F+FI+G +I+L+L  + +  G  K     KI+
Sbjct: 217 VNYGGGKYWFFKHEGWNGLTVADLVFPWFVFIMGSSISLSLSSMLR-RGYSKWRLLWKIL 275

Query: 135 FRTLKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
           +R+  L   G+ I+   Y   P      +    +R  G+LQR+   Y+VVA +E L  K 
Sbjct: 276 WRSFLLFLIGVLIVNPNYCLGP------LSWDKLRIPGVLQRLGFTYLVVATLELLFAKA 329

Query: 194 RP--NVLEP--RHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYI 249
            P  N LE     L    +Y  QWI   +    ++  T+ L VP          G     
Sbjct: 330 VPESNSLERTCSFLQEIISYWPQWIFILMLETAWLCLTFLLPVP----------GCPTGY 379

Query: 250 VKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAP 305
           +  G  G  G  P C   A GY+D  L G NH+Y  P           SPN         
Sbjct: 380 LGPGGIGDFGKYPNCTGGAAGYIDHLLLGENHIYQHP-----------SPN-------VL 421

Query: 306 SWCRAPFEPEGLLSTISAILSGTIGIH 332
              +  ++PEG+L TI++I+   +G+ 
Sbjct: 422 YHTKVAYDPEGILGTINSIVMAFLGVQ 448


>gi|270160204|ref|ZP_06188860.1| membrane protein, putative [Legionella longbeachae D-4968]
 gi|289165026|ref|YP_003455164.1| hypothetical protein LLO_1691 [Legionella longbeachae NSW150]
 gi|269988543|gb|EEZ94798.1| membrane protein, putative [Legionella longbeachae D-4968]
 gi|288858199|emb|CBJ12067.1| putative membrane protein [Legionella longbeachae NSW150]
          Length = 372

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 102/336 (30%), Positives = 151/336 (44%), Gaps = 79/336 (23%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           +++R+ +LD FRGLT+ LM+LV+  G   +Y  + HS WNGCTLAD V P FLFIVG+  
Sbjct: 7   QNERILSLDVFRGLTMALMVLVNSLGTRISYPILLHSEWNGCTLADLVFPSFLFIVGMTT 66

Query: 117 ALALKKVPKINGAVK---KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
            ++LK+  K     +    I  RT+ L   GI L        +     VD+  IR  GIL
Sbjct: 67  VISLKRHIKEESKTEIYYSIFKRTIILFLLGIFL--------NVFPKNVDISSIRIYGIL 118

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRIAL Y++ A I   TT R    +    L I   Y W ++  F             ++P
Sbjct: 119 QRIALCYLICAFIYLHTTIRAQIFI---FLGILLGY-WYFLACF-------------HLP 161

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
                              GM   L    N VGY+D+ L+   HL               
Sbjct: 162 -----------------VSGM-NQLTITRNWVGYIDQLLFSPKHL--------------- 188

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
                             F+PEGLLSTI +I +   G+  G++L+       +    V+ 
Sbjct: 189 --------------LFRNFDPEGLLSTIPSIATTLSGLIAGNLLLAQIQKQKKCILMVAS 234

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           G   L++A + ++  + PINK L++ S+V + +G +
Sbjct: 235 GLVFLLLAWLWNY--SFPINKNLWTSSFVLWCSGFS 268


>gi|297172331|gb|ADI23307.1| uncharacterized conserved protein [uncultured nuHF2 cluster
           bacterium HF0770_19K18]
          Length = 373

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 96/343 (27%), Positives = 150/343 (43%), Gaps = 77/343 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           ++ KS R+ +LDAFRGLT+  MI+V+  G     Y  + H+ W+GCT  D V PFFLFIV
Sbjct: 3   MKNKSDRLLSLDAFRGLTIAFMIIVNTPGNWSYVYGPLRHAEWHGCTPTDLVFPFFLFIV 62

Query: 113 GVAI--ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 170
           GVA+  + A       +  +KKI +RT+ +  +G++L     +A   +    D   +R  
Sbjct: 63  GVAMRFSFAQHNYQPSSDLLKKIFWRTVTIFSFGLLL-----NAYPFIRQNWDWSSLRIM 117

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQRI L Y + A+                 LS++ + +  WI   I  + Y +     
Sbjct: 118 GVLQRIGLAYGLAAI-----------------LSLYLSEKKLWISCGIILIGYWLI---- 156

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
                             ++  G     G + N    +D  + G NHL+           
Sbjct: 157 ------------------LLLFGGSDPFGLSSNIARTIDIAILGENHLWRG--------- 189

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                               PF+PEGLLSTI AI++  IG   G ++       + ++  
Sbjct: 190 -----------------TGIPFDPEGLLSTIPAIVTVLIGFSIGQLIQENSNRISLVQTI 232

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCL 393
           +  G G+  +  +  F    PINKQL++ +YV +T G A   L
Sbjct: 233 LIRGAGIAAVGWLWGFI--FPINKQLWTSTYVLYTGGLASFFL 273


>gi|410631381|ref|ZP_11342056.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
           BSs20135]
 gi|410148827|dbj|GAC18923.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
           BSs20135]
          Length = 330

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/370 (27%), Positives = 167/370 (45%), Gaps = 74/370 (20%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           QQ   R+  LD FRG+T+  MILV++ G     Y  + H+ W+G TL D + PFF+FIVG
Sbjct: 17  QQPGNRLLALDVFRGITITAMILVNNPGSWQHIYGPMRHAQWHGWTLTDLIFPFFIFIVG 76

Query: 114 VAIALALKKV----PKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMK--HI 167
           V+I L+ +++       +  +K+ + RT KL+  G  L   Y          V+ +  +I
Sbjct: 77  VSIQLSGQRMLASGTSRSSIIKQALLRTFKLVLLGWFLALFYYDFGAEHYNWVEQRLLNI 136

Query: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFV--IYII 225
           R+ G+LQRIA+VY +  L+    +KR                     G  ++FV  + + 
Sbjct: 137 RFMGVLQRIAVVYFICVLMWLFLSKR---------------------GLLVSFVGILLLY 175

Query: 226 TTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWS 285
                 +P +  + +S  G+ ++              N   ++D  L+   HLY      
Sbjct: 176 WLALAAIPYYDNAGNSYSGLLEF------------GNNLSAWLDSWLFAAPHLYYSS--- 220

Query: 286 RLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA 345
                  ++P +              F+PEG+LST+ A+ SG  G+  G++L      + 
Sbjct: 221 -------ATPFA--------------FDPEGILSTLPAVASGISGMLVGYLLTQ-SSLNT 258

Query: 346 RLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMD 405
           R K  V +  G + + +   +    PINK L++ SYV  T+  A  CL + + IFI +  
Sbjct: 259 RNKTIVLLVLGSIGVLLGELWHGYFPINKALWTSSYVLLTSAYA--CLVLASLIFILDSK 316

Query: 406 WD---ECDAC 412
            +   EC  C
Sbjct: 317 KNHLLECSLC 326


>gi|387793162|ref|YP_006258227.1| hypothetical protein Solca_4061 [Solitalea canadensis DSM 3403]
 gi|379655995|gb|AFD09051.1| hypothetical protein Solca_4061 [Solitalea canadensis DSM 3403]
          Length = 393

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 100/361 (27%), Positives = 161/361 (44%), Gaps = 84/361 (23%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFF 108
           +Q    +   R+ +LD FRG TV  MILV++ G     YA ++H+ WNGCT  D + PFF
Sbjct: 1   MQASASEPKPRLLSLDVFRGATVAAMILVNNPGSWSNIYAPLEHAKWNGCTPTDLIFPFF 60

Query: 109 LFIVGVAIALAL---KKVPKI-NGAVKKIIFRTLKLLFWGIIL---------QGGYSHAP 155
           LFIVG++IA AL   K  P+  + A+K I  R+LKL   G+IL         + G     
Sbjct: 61  LFIVGISIAYALSGKKSRPEEHSAAIKSITIRSLKLFGLGLILALFPIVYFDKFGEVDVW 120

Query: 156 DALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIG 215
           D +   +    +R  G+LQRI +V+ +  +I     K +P  +              W  
Sbjct: 121 DQIV--MRFSGVRIMGVLQRIGIVFFIAGII---FIKAKPKTIA-------------WTA 162

Query: 216 GFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGI 275
           G +  + Y++ T+ + VP   ++                  +L P  N   ++DR +   
Sbjct: 163 GSLLVIYYLLMTF-VPVPGVGYA------------------NLEPETNLGAWIDRLILTT 203

Query: 276 NHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGH 335
           +HL+                      + + +W     +PEGLL TI A+ +G +G   G 
Sbjct: 204 DHLW----------------------KQSKTW-----DPEGLLGTIPAVATGLLGTLCGT 236

Query: 336 VLIHFKGHSARLK-HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
            +     ++ ++   +V   FG   IA  L +    PINK L++ S+V +TAG       
Sbjct: 237 WMKKPMDNALKITWLFVFAAFG---IAAGLVWDLFFPINKSLWTSSFVLYTAGLGSAFFA 293

Query: 395 I 395
           +
Sbjct: 294 V 294


>gi|299144716|ref|ZP_07037784.1| putative membrane protein [Bacteroides sp. 3_1_23]
 gi|298515207|gb|EFI39088.1| putative membrane protein [Bacteroides sp. 3_1_23]
          Length = 361

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 90/335 (26%), Positives = 153/335 (45%), Gaps = 75/335 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVG 113
              +KR+ +LD  RG+TV  MILV++ G     +A   H+ W+G + AD V P F+F++G
Sbjct: 1   MNPNKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMG 60

Query: 114 VAIALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           ++  ++L K   +   A+ KII R+L L+F G++++  +  A D+ +Y  D+  +R  G+
Sbjct: 61  ISTYISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGV 118

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QR+ + Y + AL+      +R                      F+   I ++  Y ++ 
Sbjct: 119 MQRLGICYGITALLAVTIPHKR----------------------FMPLAIILLIVYFIF- 155

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                 +   +G +K             A N VG +D  + G NH+Y             
Sbjct: 156 ------QLFGNGFEK------------SADNIVGMIDSAILGSNHMYLQG---------- 187

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                           R   +PEG+LSTI A+    IG   G ++I  K +  R+ +   
Sbjct: 188 ----------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFL 231

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +G  LL    +L +  A P+NK+L+S S+V  T G
Sbjct: 232 IGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|373460170|ref|ZP_09551926.1| hypothetical protein HMPREF9944_00190 [Prevotella maculosa OT 289]
 gi|371956555|gb|EHO74341.1| hypothetical protein HMPREF9944_00190 [Prevotella maculosa OT 289]
          Length = 359

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 93/344 (27%), Positives = 150/344 (43%), Gaps = 81/344 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGA--YARIDHSPWNGCTLADFVMPFFLFIVGVA 115
            + KR+ +LD  RG+TV LMILV++  G   Y+ + HS WNG T  D V PFFLFI+G++
Sbjct: 1   MEKKRLLSLDVLRGMTVCLMILVNNGAGEHIYSTLQHSKWNGMTPCDLVFPFFLFIMGIS 60

Query: 116 IALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG--VDMKHIRWCG 171
             L+LK+           KI  RT+ L   G+ +        D L  G  +D +H+R  G
Sbjct: 61  TFLSLKQTNFAWNRQTACKIAKRTVLLFAIGLFINWF-----DLLLQGRALDFEHLRIWG 115

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           ++QRIA+ Y  V++       +R     P   ++  AY    + G               
Sbjct: 116 VMQRIAICYGAVSVFALSINHKRT---LPLIATLLIAYAMFLMLG--------------- 157

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
              +++                         N +  +D  L+G  HLY            
Sbjct: 158 -NGYAYDSQQ---------------------NLIAQIDIHLFGQAHLYH----------- 184

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV 351
                            ++P +PEGL S++ AI    IG + G ++   +    ++  ++
Sbjct: 185 -----------------KSPVDPEGLASSLPAIAHTLIGFYCGRLMAMARTTEEKVLKFM 227

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
            +G  L++I  +  F   +P+NK+++S SYVC T G A  CLG+
Sbjct: 228 LVGGVLVLIGYLASF--GLPLNKRIWSPSYVCITCGLAATCLGL 269


>gi|293371912|ref|ZP_06618316.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|292633158|gb|EFF51735.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
          Length = 361

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 91/335 (27%), Positives = 153/335 (45%), Gaps = 75/335 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVG 113
              +KR+ +LD  RG+TV  MILV++ G     +A   H+ W+G + AD V P F+F++G
Sbjct: 1   MNPNKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMG 60

Query: 114 VAIALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           ++  ++L K   +   A+ KII R+L L+F G++++  +  A D+ +Y  D+  +R  G+
Sbjct: 61  ISTYISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGV 118

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QR+ + Y + AL+      +R                      F+   I ++  Y ++ 
Sbjct: 119 MQRLGICYGITALLAVTIPHKR----------------------FMPLAIILLIVYFIF- 155

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                 +   +G +K             A N VG VD  + G NH+Y             
Sbjct: 156 ------QLFGNGFEK------------SADNIVGIVDSAILGSNHMYLQG---------- 187

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                           R   +PEG+LSTI A+    IG   G ++I  K +  R+ +   
Sbjct: 188 ----------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFL 231

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +G  LL    +L +  A P+NK+L+S S+V  T G
Sbjct: 232 IGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|423214205|ref|ZP_17200733.1| hypothetical protein HMPREF1074_02265 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693150|gb|EIY86385.1| hypothetical protein HMPREF1074_02265 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 361

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 154/335 (45%), Gaps = 75/335 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVG 113
              +KR+ +LD  RG+TV  MILV++ G     +A   H+ W+G + AD V P F+F++G
Sbjct: 1   MNPNKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMG 60

Query: 114 VAIALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           ++  ++L K   +   A+ KII R+L L+F G++++  +  + D+ +Y  D+  +R  G+
Sbjct: 61  ISTYISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGV 118

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QR+ + Y + AL+      +R                      F+   I ++  Y ++ 
Sbjct: 119 MQRLGICYGITALLAVTIPHKR----------------------FMPLAIILLAVYFIF- 155

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                 +   +G +K             A N VG +D  + G NH+Y             
Sbjct: 156 ------QLFGNGFEK------------SADNIVGMIDSAILGSNHMYLQG---------- 187

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                           R   +PEG+LSTI A+    IG   G ++I+ K +  R+ +   
Sbjct: 188 ----------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIINIKDNDRRMLNLFL 231

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +G  LL    +L +  A P+NK+L+S S+V  T G
Sbjct: 232 IGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|282898832|ref|ZP_06306819.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
 gi|281196359|gb|EFA71269.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
          Length = 375

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/333 (29%), Positives = 147/333 (44%), Gaps = 72/333 (21%)

Query: 62  RVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRGLT+ +MI+ + AG     Y  + H+PWNGCT  D V PFFLFIVGVA++ 
Sbjct: 2   RLISLDVFRGLTIAMMIIANMAGVVPDVYPFLSHAPWNGCTPTDLVFPFFLFIVGVAMSF 61

Query: 119 ALKKV---PKINGAVKKIIFRTLKLLF-WGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           +L K     K++  V   + R   +LF  G++L G ++    +     D++ +R  G+LQ
Sbjct: 62  SLSKYSLESKLDNLVYFNLCRRAVILFTLGLLLNGFWNQGVGSF----DLQSLRVMGVLQ 117

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RI L Y+  +LI     ++                  QW    I  + Y +T   + VP 
Sbjct: 118 RIGLAYLFASLIVLKLPEKT-----------------QWALAGILLIFYWLTMMYIPVP- 159

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
                  D+G           G L    N   ++DR +    HLY+              
Sbjct: 160 -------DYGA----------GMLTREGNFGAFIDRLIIAKPHLYAG------------- 189

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
            +    R D          PEGL STI AI++   G   G  +     +S      V  G
Sbjct: 190 -DGFNFRGD----------PEGLFSTIPAIVNVLFGYFAGQWIRKSTINSHTSMDLVLWG 238

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
              L++ +I  +    PINK+L++ SYV F+ G
Sbjct: 239 LCSLVVGMI--WDGLFPINKKLWTSSYVLFSTG 269


>gi|428306334|ref|YP_007143159.1| heparan-alpha-glucosaminide N-acetyltransferase [Crinalium
           epipsammum PCC 9333]
 gi|428247869|gb|AFZ13649.1| Heparan-alpha-glucosaminide N-acetyltransferase [Crinalium
           epipsammum PCC 9333]
          Length = 371

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 96/336 (28%), Positives = 147/336 (43%), Gaps = 78/336 (23%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+ +  MILV+ AG A   Y  + H+ WNG T AD V PFFLFI+GVA+A 
Sbjct: 2   RLTSLDVFRGMAIAGMILVNKAGVADQVYPALAHADWNGWTFADLVFPFFLFIIGVAMAF 61

Query: 119 ALKKVPK-INGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           +  K  +  N   K++  R L+    L   G++L G +++         D   IR  G+L
Sbjct: 62  SFAKYTEGDNKPTKQLYLRILRRSAILFILGLLLNGFWNY---------DFSTIRVMGVL 112

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRI++ Y++ +L   LT  ++                 QW    +  + Y +    + VP
Sbjct: 113 QRISVAYLLASL-AVLTLPKKG----------------QWALAAVLLIGYWLIMSFVPVP 155

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
            +     +  G                  N   Y+DR + G  HLY    ++ L      
Sbjct: 156 GYGAGVLTREG------------------NFGAYIDRLIIGAAHLYKGDNYNSLG----- 192

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
                              +PEGL S++ A++S  IG   G  L      S    + +  
Sbjct: 193 -------------------DPEGLFSSLPAVVSVLIGYFTGEWLRKQPERSRTSINMLIA 233

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           G   L++  + +F    PINK+L++ SYV FTAG A
Sbjct: 234 GLSCLVVGEVWNFW--FPINKKLWTSSYVLFTAGIA 267


>gi|282896863|ref|ZP_06304869.1| conserved hypothetical protein [Raphidiopsis brookii D9]
 gi|281198272|gb|EFA73162.1| conserved hypothetical protein [Raphidiopsis brookii D9]
          Length = 375

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/333 (29%), Positives = 147/333 (44%), Gaps = 72/333 (21%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRGLT+ +MI+ + AG A   Y  + H+ WNGCT  D V PFFLFIVGVA++ 
Sbjct: 2   RLISLDVFRGLTIAMMIIANMAGVAPDVYPFLSHALWNGCTPTDLVYPFFLFIVGVAMSF 61

Query: 119 ALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           +L K    +   K + F    R + L   G++L G ++    +     D++ +R  G+LQ
Sbjct: 62  SLSKYSLESKLDKFVYFNLCRRAVILFTLGLLLNGFWNQGVGSF----DLQSLRVMGVLQ 117

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RI+L Y+V +LI     ++                  QW    I  + Y +T   + VP 
Sbjct: 118 RISLAYLVASLIVLKFPEKT-----------------QWALAGILLIFYWLTMMYIPVP- 159

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
                  D+G           G L    N   ++DR +    HLY+              
Sbjct: 160 -------DYGA----------GMLTREGNFGAFIDRLIIAKPHLYAG------------- 189

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
            +    R D          PEGL STI AI++   G   G  +     +S      V  G
Sbjct: 190 -DGFNFRGD----------PEGLFSTIPAIVNVLFGYFAGQWMRKSTINSHTSMDLVLWG 238

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
              L++ +I  +    PINK+L++ SYV F+ G
Sbjct: 239 LCSLVVGMI--WDGLFPINKKLWTSSYVLFSTG 269


>gi|305665862|ref|YP_003862149.1| hypothetical protein FB2170_06240 [Maribacter sp. HTCC2170]
 gi|88710633|gb|EAR02865.1| hypothetical protein FB2170_06240 [Maribacter sp. HTCC2170]
          Length = 362

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 100/360 (27%), Positives = 158/360 (43%), Gaps = 84/360 (23%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
            RV ++D FRGLT+VLMILV+  G     Y    H+ W+G T  D V PFFLFIVG +I+
Sbjct: 3   NRVISVDIFRGLTIVLMILVNTPGTWSSVYTPFLHAEWHGYTPTDLVFPFFLFIVGTSIS 62

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
            A +K        KKI  R+LKL+  G+ L G ++    +  +  D   IR+ G+LQRI 
Sbjct: 63  FAYQKKKASTQTYKKIAVRSLKLIGLGLFL-GAFTL---SFPFIKDFADIRFPGVLQRIG 118

Query: 178 LVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL---YVPN 234
           +V++  A+                   +F  + W+ + G     I ++  Y L   YVP 
Sbjct: 119 VVFLFTAV-------------------LFVNFNWKTLLG---ICIVLLVGYWLLMGYVPV 156

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
                  D                    N   Y+D +++G ++   D             
Sbjct: 157 EGIESTFDRA----------------PNNLANYLDVKIFGTHNYKPD------------- 187

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
                            ++PEG LST+ +I S   G+  G +L   K +    K  V +G
Sbjct: 188 -----------------YDPEGFLSTLPSIASALTGVFTGLILTSKKDN----KTMVLVG 226

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFR 414
            G++++A+   +    PINK L+S S+V  T+G A+  L +    +I ++   E  + F+
Sbjct: 227 LGVVMLALGYLWHTVFPINKALWSSSFVLVTSGWANIFLALVY--YISDVRQIEFGSIFK 284


>gi|423293378|ref|ZP_17271505.1| hypothetical protein HMPREF1070_00170 [Bacteroides ovatus
           CL03T12C18]
 gi|392678321|gb|EIY71729.1| hypothetical protein HMPREF1070_00170 [Bacteroides ovatus
           CL03T12C18]
          Length = 361

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 88/335 (26%), Positives = 153/335 (45%), Gaps = 75/335 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVG 113
              +KR+ +LD  RG+TV  MILV++ G     +A   H+ W+G + AD V P F+F++G
Sbjct: 1   MNPNKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMG 60

Query: 114 VAIALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           ++  ++L K   +   A+ KII R+L L+F G++++  +  + D+ +Y  D+  +R  G+
Sbjct: 61  ISTYISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGV 118

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QR+ + Y + AL+      +R                      F+   I ++  Y ++ 
Sbjct: 119 MQRLGICYGITALLAVTIPHKR----------------------FMPLAIILLVVYFIF- 155

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                 +   +G +K               N VG +D  + G NH+Y             
Sbjct: 156 ------QLFGNGFEK------------SVDNIVGMIDSAILGANHMYLQG---------- 187

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                           R   +PEG+LSTI A+    IG   G ++I  K +  R+ +   
Sbjct: 188 ----------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFL 231

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +G  LL +  +L +  A P+NK+L+S S+V  T G
Sbjct: 232 IGTTLLFVGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|298480127|ref|ZP_06998326.1| membrane protein [Bacteroides sp. D22]
 gi|336404355|ref|ZP_08585053.1| hypothetical protein HMPREF0127_02366 [Bacteroides sp. 1_1_30]
 gi|295085510|emb|CBK67033.1| Uncharacterized conserved protein [Bacteroides xylanisolvens XB1A]
 gi|298273936|gb|EFI15498.1| membrane protein [Bacteroides sp. D22]
 gi|335943683|gb|EGN05522.1| hypothetical protein HMPREF0127_02366 [Bacteroides sp. 1_1_30]
          Length = 361

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 153/335 (45%), Gaps = 75/335 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVG 113
              +KR+ +LD  RG+TV  MILV++ G     +A   H+ W+G + AD V P F+F++G
Sbjct: 1   MNPNKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMG 60

Query: 114 VAIALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           ++  ++L K   +   A+ KII R+L L+F G++++  +  + D+ +Y  D+  +R  G+
Sbjct: 61  ISTYISLCKYDFQCRPAITKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGV 118

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QR+ + Y + AL+      +R                      F+   I ++  Y ++ 
Sbjct: 119 MQRLGICYGITALLAVAIPHKR----------------------FMPLAIILLIVYFIF- 155

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                 +   +G +K             A N VG +D  + G NH+Y             
Sbjct: 156 ------QLFGNGFEK------------SADNIVGMIDSAILGANHMYLQG---------- 187

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                           R   +PEG+LSTI A+    IG   G ++I  K +  R+ +   
Sbjct: 188 ----------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFL 231

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +G  LL    +L +  A P+NK+L+S S+V  T G
Sbjct: 232 IGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|237719042|ref|ZP_04549523.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229451820|gb|EEO57611.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
          Length = 361

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 88/335 (26%), Positives = 153/335 (45%), Gaps = 75/335 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVG 113
              +KR+ +LD  RG+TV  MILV++ G     +A   H+ W+G + AD V P F+F++G
Sbjct: 1   MNPNKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMG 60

Query: 114 VAIALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           ++  ++L K   +   A+ KII R+L L+F G++++  +  + D+ +Y  D+  +R  G+
Sbjct: 61  ISTYISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGV 118

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QR+ + Y + AL+      +R                      F+   I ++  Y ++ 
Sbjct: 119 MQRLGICYGITALLAVTIPHKR----------------------FMPLAIILLVVYFIF- 155

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                 +   +G +K               N VG +D  + G NH+Y             
Sbjct: 156 ------QLFGNGFEK------------SVDNIVGMIDSAILGANHMYLQG---------- 187

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                           R   +PEG+LSTI A+    IG   G ++I  K +  R+ +   
Sbjct: 188 ----------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFL 231

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +G  LL +  +L +  A P+NK+L+S S+V  T G
Sbjct: 232 IGTTLLFVGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|119583586|gb|EAW63182.1| hCG1993224, isoform CRA_a [Homo sapiens]
          Length = 382

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 98/325 (30%), Positives = 153/325 (47%), Gaps = 54/325 (16%)

Query: 77  MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK----- 131
           M+ V+  GG Y    H+ WNG T+AD V P+F+FI+G +I L++  + +  G  K     
Sbjct: 1   MVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLG 59

Query: 132 KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTT 191
           KI +R+  L+  GII+       P+     +    +R  G+LQR+ + Y VVA++E L  
Sbjct: 60  KIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFA 114

Query: 192 KRRPN--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK 247
           K  P     E   LS+   T+   QW+   +   +++  T+ L VP          G   
Sbjct: 115 KPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP----------GCPT 164

Query: 248 YIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRED 303
             +  G  G  G  P C   A GY+DR L G +HLY               P+S  L   
Sbjct: 165 GYLGPGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQH-------------PSSAVLYHT 211

Query: 304 APSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLI 359
                   ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ 
Sbjct: 212 -----EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLIS 265

Query: 360 IAIILHFTNA--IPINKQLYSFSYV 382
           +A+     N   IP+NK L+S SYV
Sbjct: 266 VALTKVSENEGFIPVNKNLWSLSYV 290


>gi|224537871|ref|ZP_03678410.1| hypothetical protein BACCELL_02758 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|423227284|ref|ZP_17213748.1| hypothetical protein HMPREF1062_05934 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|224520557|gb|EEF89662.1| hypothetical protein BACCELL_02758 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|392624424|gb|EIY18516.1| hypothetical protein HMPREF1062_05934 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 373

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 100/340 (29%), Positives = 146/340 (42%), Gaps = 75/340 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           +   + R+  LD  RG+T+  MILV++ G     YA + H+ W+G T  D V PFF+FI+
Sbjct: 1   MSAPNNRLLALDVIRGITIAGMILVNNPGSWQSVYAPLQHARWHGLTPTDLVYPFFMFIM 60

Query: 113 GVAIALALKKVPKINGAVK-KIIFRTLKLLFWGIILQGGYSHAPDALSYGV-DMKHIRWC 170
           GVAI  +L+K  K+N  V  KII RT+ L   GI L          L YG    +H+R  
Sbjct: 61  GVAIHFSLRKFDKLNTTVSLKIIRRTVALFAVGIALD-----CFSKLCYGTFSWEHLRIL 115

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G++QR+AL Y   A +  L  K+               Y     GG +   + ++  ++ 
Sbjct: 116 GVMQRLALAYAGGAFLAILIPKK---------------YYLATAGGILLLYLVLLQCFNG 160

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
           YV       HS                   A N +  +D +L G  HL            
Sbjct: 161 YV-------HS-------------------ADNLIAVIDVKLLGAGHL------------ 182

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                    ++E A     A FEPEGLLSTI        G   G ++     +  R++  
Sbjct: 183 ---------IKETAEGGSFA-FEPEGLLSTIPCWAHVLFGTFVGSLITGIAENKERIRQI 232

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
              G  LL    +L + +  PINK+L++ SY   T+G A 
Sbjct: 233 ALFGTVLLFAGFLLQYLD--PINKKLWTASYTLITSGTAS 270


>gi|328870644|gb|EGG19017.1| hypothetical protein DFA_02260 [Dictyostelium fasciculatum]
          Length = 759

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 97/342 (28%), Positives = 136/342 (39%), Gaps = 88/342 (25%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYA--RIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + KRV +LD  RGLT+  MILVD+ GG      +  + WNG + AD + P FLFI G ++
Sbjct: 48  QRKRVLSLDTVRGLTIFGMILVDNQGGPQVIWPLLETEWNGLSTADLIFPSFLFICGFSV 107

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
           ALALK           II RTL L F    L        + +++       R  G+LQRI
Sbjct: 108 ALALKSAKNDIKTWYNIIRRTLLLFFIQAFL--------NLMAHKFVFDSFRVMGVLQRI 159

Query: 177 ALVYVVVA---LIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           ++ Y       L+  L  +R           IF               IY+   Y L VP
Sbjct: 160 SICYFACCCSFLLLPLVGQR-----------IFLV---------ACAAIYLSVMYGLDVP 199

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
                             CG RG L P+CNA  Y+D  + G N ++              
Sbjct: 200 G-----------------CG-RGVLTPSCNAGSYIDNSVLGANMIH-------------- 227

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHF-----KGHSARLK 348
                            P +PEGLLST SA ++  +G+  G +   F       H   L 
Sbjct: 228 -----------------PNDPEGLLSTFSAFITTWMGLELGRIFTRFYRKHDYAHLNILI 270

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
            W+ +     +  I L  T  +P+NK ++SFS+   T     
Sbjct: 271 RWIGIAVVFGVTGIALGVTK-MPVNKLIWSFSFALITVACGS 311


>gi|262406057|ref|ZP_06082607.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294648122|ref|ZP_06725665.1| conserved domain protein [Bacteroides ovatus SD CC 2a]
 gi|294806856|ref|ZP_06765681.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b]
 gi|345510562|ref|ZP_08790129.1| hypothetical protein BSAG_00775 [Bacteroides sp. D1]
 gi|229443274|gb|EEO49065.1| hypothetical protein BSAG_00775 [Bacteroides sp. D1]
 gi|262356932|gb|EEZ06022.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292636506|gb|EFF54981.1| conserved domain protein [Bacteroides ovatus SD CC 2a]
 gi|294445885|gb|EFG14527.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 361

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 91/343 (26%), Positives = 155/343 (45%), Gaps = 75/343 (21%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVG 113
              +KR+ +LD  RG+TV  MILV++ G     +A   H+ W+G + AD V P F+F++G
Sbjct: 1   MNPNKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMG 60

Query: 114 VAIALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           ++  ++L K   +   A+ KII R+L L+F G++++  +  A D+ +Y  D+  +R  G+
Sbjct: 61  ISTYISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGV 118

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QR+ + Y + AL+      ++                      F+   I ++  Y ++ 
Sbjct: 119 MQRLGICYGITALLAVTIPHKK----------------------FMPLAIILLVVYFIF- 155

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                 +   +G +K               N VG VD  + G NH+Y             
Sbjct: 156 ------QLFGNGFEK------------SVDNIVGIVDSAILGSNHMYLQG---------- 187

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                           R   +PEG+LSTI A+    IG   G ++I  K +  R+ +   
Sbjct: 188 ----------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFL 231

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           +G  LL    +L +  A P+NK+L+S S+V  T G A   L +
Sbjct: 232 IGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCGIATLSLAL 272


>gi|338210835|ref|YP_004654884.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336304650|gb|AEI47752.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 363

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 100/332 (30%), Positives = 145/332 (43%), Gaps = 79/332 (23%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
            R+ +LD FRG+TV  MILV++ G     YA + H+ WNGCT  D + PFFLFIVGV++A
Sbjct: 3   NRLLSLDVFRGMTVAAMILVNNPGDWDHVYAPLLHAHWNGCTPTDLIFPFFLFIVGVSVA 62

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
            A+ K P    ++ KII R+  +LF   +    Y           D  ++R  G+LQRIA
Sbjct: 63  FAMGKNPP---SLLKIIKRS-AILFGLGLFLNLYPK--------FDFANVRIPGVLQRIA 110

Query: 178 LVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF 237
           LVY+V +LI   TT++                              +ITT  L +  W  
Sbjct: 111 LVYLVCSLIFIKTTRKT----------------------------QVITTVLLLIAYWLL 142

Query: 238 SEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNS 297
                     Y        +L P  N   +VDR L    HL+                  
Sbjct: 143 MTLVPVPGVGY-------ANLEPTTNLGAWVDRGLLTTAHLW------------------ 177

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL 357
               + A  W     +PEG+ STI AI +G +G+  G  L   K  + ++      G  L
Sbjct: 178 ----KSAKVW-----DPEGMFSTIPAIGTGLLGVLTGQWLRSEKPVAEKMAWLFLSGNAL 228

Query: 358 LIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           ++  +I  +    PINK L++ S+V +  G A
Sbjct: 229 ILGGLI--WNEFFPINKSLWTSSFVLYAGGWA 258


>gi|113475212|ref|YP_721273.1| hypothetical protein Tery_1515 [Trichodesmium erythraeum IMS101]
 gi|110166260|gb|ABG50800.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 366

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 100/342 (29%), Positives = 149/342 (43%), Gaps = 83/342 (24%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+T+  MI+V++ G     Y  + H+ W+GCT  D + PFFLFI+GVA+  
Sbjct: 2   RLKSLDVFRGITIASMIIVNNPGSWNHVYPPLLHAKWHGCTPTDLIFPFFLFIMGVAMTF 61

Query: 119 ALKKVPKINGAVKKI---IFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
           +L K    N  +  I   IFR   ++F   +L  G+ +         ++  IR  G+LQR
Sbjct: 62  SLSKYTDKNQPIPHIYQRIFRRCLIIFLFGLLLNGFPN--------YNLATIRVMGVLQR 113

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW 235
           I+LVY++ A+     ++++   L    L          IG +IA  +  +  Y L     
Sbjct: 114 ISLVYLLAAIAILNLSRKQLYGLATTLL----------IGYWIAMQLIPVPGYGL----- 158

Query: 236 SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSP 295
                               G+L P  N   Y+DR +    HL                 
Sbjct: 159 --------------------GNLSPEGNFAAYIDRLILTQQHL----------------- 181

Query: 296 NSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF 355
                      W    ++PEGL ST+ AI++  IG   G  L H   +S    + V  G 
Sbjct: 182 -----------WAGKQYDPEGLFSTLPAIVTVLIGYLTGEWLKHQSTNSRTTLNMVISGL 230

Query: 356 GLLIIAII--LHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
             L++  +  L F    PINK L++ SYV FTAG A   L I
Sbjct: 231 SCLVVGYLWGLFF----PINKALWTSSYVIFTAGWALLFLAI 268


>gi|149280688|ref|ZP_01886799.1| hypothetical protein PBAL39_24475 [Pedobacter sp. BAL39]
 gi|149228553|gb|EDM33961.1| hypothetical protein PBAL39_24475 [Pedobacter sp. BAL39]
          Length = 385

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 102/343 (29%), Positives = 150/343 (43%), Gaps = 67/343 (19%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           ++  R+ +LD FRG TV  MILV+   D G  YA ++H+ W+GCT  D V PFFLFIVGV
Sbjct: 9   EQPVRLLSLDFFRGATVAAMILVNNPGDWGHIYAPLEHADWHGCTPTDLVFPFFLFIVGV 68

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD-MKHIRWCGIL 173
           +IA A+        +  K I + LK       L    S  P+  S  V+  + +R  G+L
Sbjct: 69  SIAYAMGSKKTDPSSHGKTILKALKRTLILFGLGLFLSLFPNVFSNPVEAFQQVRIPGVL 128

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRIA+V+ + ++I   +++R       R + I  A  W  I  FI             VP
Sbjct: 129 QRIAVVFFICSIIFLKSSERTIF----RTMVIILAAYWA-IMTFIP------------VP 171

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
              F                   +L    N   ++DR ++   HL+              
Sbjct: 172 GTGFP------------------NLEKETNLGAWIDRGVFTEAHLW-------------- 199

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
                   + + +W     +PEGLLST+ AI +G  GI  G  L       A    W+  
Sbjct: 200 --------KSSKTW-----DPEGLLSTLPAIATGLFGILVGSYLKRKDIEPATKIAWL-F 245

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGIT 396
             G    A+ L +    PINKQL++ S+V +T G A   L ++
Sbjct: 246 STGAAATALGLLWDLQFPINKQLWTSSFVLYTGGLATTILSLS 288


>gi|260788586|ref|XP_002589330.1| hypothetical protein BRAFLDRAFT_217958 [Branchiostoma floridae]
 gi|229274507|gb|EEN45341.1| hypothetical protein BRAFLDRAFT_217958 [Branchiostoma floridae]
          Length = 382

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 96/338 (28%), Positives = 155/338 (45%), Gaps = 50/338 (14%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
           +R+ +LD FRG+ + +M  V+  GG Y  +DHS WNG T+AD V P+F++I+G + AL+ 
Sbjct: 1   RRLKSLDTFRGMCLCIMAFVNYGGGGYWFLDHSVWNGITVADLVFPWFMWIMGTSTALSF 60

Query: 121 -----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
                K  PK+     KI+ RT+ L   G+ +     +APD      D   IR  G+LQR
Sbjct: 61  RGLQRKATPKLT-IFGKIVRRTITLFLLGLFI----VNAPD------DWATIRIPGVLQR 109

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW 235
            A+ Y  V+ +  L        +E         Y  QW+       ++   T+ + VP  
Sbjct: 110 FAVSYFAVSTMMLLH-------METEWYRDLVPYWKQWLFVLCLLAVHTCLTFLMPVPGC 162

Query: 236 SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSP 295
                   G+           H      A GY+D  L   +H+Y D            +P
Sbjct: 163 PTGYLGAGGLSDL-------DHTNCTGGAAGYIDNWLLTQDHIYGD-----------ETP 204

Query: 296 NSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF 355
               L +         ++PEG+L ++++I    +G+  G +L+ ++ H +RL  W+  G 
Sbjct: 205 KVRILYQ-----ILVNYDPEGVLGSLTSIFMTFLGLQAGKILLSYEDHGSRLVRWLLWGI 259

Query: 356 GLLIIAIIL----HFTNAIPINKQLYSFSYVCFTAGAA 389
           GL ++AI+L         +PINK L+S ++V   A  A
Sbjct: 260 GLGLLAILLCEGRQNGGWVPINKNLWSVTFVLSLASMA 297


>gi|398341237|ref|ZP_10525940.1| hypothetical protein LkirsB1_18883 [Leptospira kirschneri serovar
           Bim str. 1051]
          Length = 383

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 166/366 (45%), Gaps = 89/366 (24%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q   RV +LD FRG+TV  MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG+
Sbjct: 7   QNKNRVLSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGI 66

Query: 115 AIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIR 168
           +I L++    KI+ +  KI F    R++ L+  G+ L   G +S +            +R
Sbjct: 67  SIQLSVYSKNKIHKS--KIWFGICIRSITLILIGLFLNFFGEWSFS-----------ELR 113

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQRI  VY +VA +  +  KR                           +I I    
Sbjct: 114 IPGVLQRIGFVYWIVASLHLILPKR---------------------------MILISWIP 146

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            L V  W   +    G  + IV      +L P  +   ++DR ++G NHL+         
Sbjct: 147 ILLVHTWVLIQIPAPG--ESIV------YLEPGKDIGAWIDRNVFGENHLW--------- 189

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                        + + +W     +PEGL S IS+I +  +G+  G +L   K +  + +
Sbjct: 190 -------------KFSKTW-----DPEGLFSGISSIATSLLGVFCGSILSS-KTNEIKKQ 230

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA---DGCLGITNSIFIPEMD 405
                GFG+L++ + L +   +P+NK L++ SYV +TAG A    GC  + N   +    
Sbjct: 231 ILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFLSIGCFELLN-FLLRIKK 289

Query: 406 WDECDA 411
           WD+  +
Sbjct: 290 WDQLQS 295


>gi|427710153|ref|YP_007052530.1| heparan-alpha-glucosaminide N-acetyltransferase [Nostoc sp. PCC
           7107]
 gi|427362658|gb|AFY45380.1| Heparan-alpha-glucosaminide N-acetyltransferase [Nostoc sp. PCC
           7107]
          Length = 387

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 102/341 (29%), Positives = 144/341 (42%), Gaps = 74/341 (21%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
              S R+ +LD FRG+ +  MILV++ G     Y  +DH+ W+GCT  D V PFFLFIVG
Sbjct: 13  STPSTRLVSLDVFRGIAIASMILVNNPGSWDYIYPPLDHAEWHGCTPTDLVFPFFLFIVG 72

Query: 114 VAIALALKKVPKINGAVKKIIFRTLK---LLFWGIILQGGYSHAPDALSYGV--DMKHIR 168
           VA+  +  K    N     +  R L+   +LF   +    ++   D L  G+  +   +R
Sbjct: 73  VAMPFSFAKYTPENRPTATVYQRILRRGLILFALGLFLALFTLTLDWLIKGITPNFSTLR 132

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQRI+L YV+ AL   L   RR                  WI   +  + Y +   
Sbjct: 133 IMGVLQRISLAYVIAAL-AVLNLSRRG----------------LWILAAVILIGYWLAMQ 175

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            + VP +                    G+L P  N  GY+DR + G  H+Y         
Sbjct: 176 FIPVPGFG------------------AGNLTPEGNLGGYIDRIILG-KHIYRS------- 209

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                                  F+PEGL ST+ A+++  +G   G  L      S    
Sbjct: 210 ---------------------GSFDPEGLFSTLPAVVTVFLGYFTGDWLRTQPIKSRTSI 248

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           + V  G   LII  +  F    PINKQL++ SYV FTAG A
Sbjct: 249 NLVIFGLITLIIGQLWGFI--FPINKQLWTSSYVLFTAGWA 287


>gi|336412607|ref|ZP_08592960.1| hypothetical protein HMPREF1017_00068 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942653|gb|EGN04495.1| hypothetical protein HMPREF1017_00068 [Bacteroides ovatus
           3_8_47FAA]
          Length = 361

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 152/335 (45%), Gaps = 75/335 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVG 113
              +KR+ +LD  RG+TV  MILV++ G     +A   H+ W+G + AD V P F+F++G
Sbjct: 1   MNPNKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMG 60

Query: 114 VAIALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           ++  ++L K   +   A+ KII R+L L+F G++++  +  + D+ +Y  D+  +R  G+
Sbjct: 61  ISTYISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGV 118

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QR+ + Y + AL+      +R                      F+   I ++  Y ++ 
Sbjct: 119 MQRLGICYGITALLAVTIPHKR----------------------FMPLAIILLIVYFIF- 155

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                 +   +G +K               N VG VD  + G NH+Y             
Sbjct: 156 ------QLFGNGFEK------------SVDNIVGIVDSAILGSNHMYLQG---------- 187

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                           R   +PEG+LSTI A+    IG   G ++I  K +  R+ +   
Sbjct: 188 ----------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFL 231

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +G  LL    +L +  A P+NK+L+S S+V  T G
Sbjct: 232 IGTTLLFAGYLLSY--ACPLNKRLWSPSFVLLTCG 264


>gi|423215264|ref|ZP_17201791.1| hypothetical protein HMPREF1074_03323 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392691832|gb|EIY85072.1| hypothetical protein HMPREF1074_03323 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 371

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 103/357 (28%), Positives = 151/357 (42%), Gaps = 85/357 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            KS+R+ +LD  RG+T+V MILV++ G     YA + H+ WNG T  D V PFF+FI+GV
Sbjct: 1   MKSERLLSLDVLRGITIVGMILVNNPGTWESVYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60

Query: 115 AIALALKKVPK--INGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD--MKHIRWC 170
           +++ AL +         + K++ RT+ L   G+ L          +  GV+     IR  
Sbjct: 61  SMSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLSWF-----SLVCAGVEQPFSQIRIL 115

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQR+AL Y   +L+  + + RRP  L              WI   I            
Sbjct: 116 GVLQRLALAYFFGSLL--IMSVRRPANLA-------------WISAII------------ 148

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGM-RGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
                         +  YIV   +  G      N +   DR L+G  HLY          
Sbjct: 149 --------------LIGYIVLLALGNGFELSEQNIIAVTDRTLFGETHLY---------- 184

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                      RE  P   R  F+PEGLLST+  I    IG   G++L        RL  
Sbjct: 185 -----------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQ 233

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
              +G  LL    +L +    P+NK+++S ++V  T G A        S+F+  + W
Sbjct: 234 ISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFA--------SLFLVFLTW 280


>gi|421109691|ref|ZP_15570204.1| putative membrane protein [Leptospira kirschneri str. H2]
 gi|410005185|gb|EKO58983.1| putative membrane protein [Leptospira kirschneri str. H2]
          Length = 383

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 107/370 (28%), Positives = 168/370 (45%), Gaps = 89/370 (24%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q   RV +LD FRG+TV  MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG+
Sbjct: 7   QNKNRVLSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGI 66

Query: 115 AIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIR 168
           +I L++    KI  +  KI F    R++ L+  G+ L   G +S +            +R
Sbjct: 67  SIQLSVYSKNKIYKS--KIWFGICIRSITLILIGLFLNFFGEWSFS-----------ELR 113

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQRI  VY +VA +  +  KR                           +I I    
Sbjct: 114 IPGVLQRIGFVYWIVASLHLILPKR---------------------------MILISWIP 146

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            L V  W   +    G  + IV      +L P  +   ++DR ++G NHL+         
Sbjct: 147 ILLVHTWVLIQIPPPG--ESIV------YLEPGKDIGAWIDRNVFGENHLW--------- 189

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                        + + +W     +PEGL S IS+I +  +G+  G +L   K +  + +
Sbjct: 190 -------------KFSKTW-----DPEGLFSGISSIATSLLGVFCGSILSS-KTNEIKKQ 230

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA---DGCLGITNSIF-IPEM 404
                GFG+L++ + L +   +P+NK L++ SYV +TAG A    GC  + N +  I + 
Sbjct: 231 ILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFLSIGCFELLNFLLRIKKW 290

Query: 405 DWDECDACFR 414
           D  + +  F+
Sbjct: 291 DRLQSETIFQ 300


>gi|418694540|ref|ZP_13255577.1| putative membrane protein [Leptospira kirschneri str. H1]
 gi|409957715|gb|EKO16619.1| putative membrane protein [Leptospira kirschneri str. H1]
          Length = 383

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 107/370 (28%), Positives = 168/370 (45%), Gaps = 89/370 (24%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q   RV +LD FRG+TV  MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG+
Sbjct: 7   QNKNRVLSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGI 66

Query: 115 AIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIR 168
           +I L++    KI  +  KI F    R++ L+  G+ L   G +S +            +R
Sbjct: 67  SIQLSVYSKNKIYKS--KIWFGICIRSITLILIGLFLNFFGEWSFS-----------ELR 113

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQRI  VY +VA +  +  KR                           +I I    
Sbjct: 114 IPGVLQRIGFVYWIVASLHLILPKR---------------------------MILISWIP 146

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            L V  W   +    G  + IV      +L P  +   ++DR ++G NHL+         
Sbjct: 147 ILLVHTWVLIQIPPPG--ESIV------YLEPGKDIGAWIDRNVFGENHLW--------- 189

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                        + + +W     +PEGL S IS+I +  +G+  G +L   K +  + +
Sbjct: 190 -------------KFSKTW-----DPEGLFSGISSIATSLLGVFCGSILSS-KTNEIKKQ 230

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA---DGCLGITNSIF-IPEM 404
                GFG+L++ + L +   +P+NK L++ SYV +TAG A    GC  + N +  I + 
Sbjct: 231 ILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFLSIGCFELLNFLLRIKKW 290

Query: 405 DWDECDACFR 414
           D  + +  F+
Sbjct: 291 DRLQSETIFQ 300


>gi|405978397|gb|EKC42791.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial
           [Crassostrea gigas]
          Length = 549

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 76/270 (28%), Positives = 123/270 (45%), Gaps = 31/270 (11%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAG 84
           + ++NG +     E S    ++    Q Q     K +R+ +LD FRGL++++M+ V+  G
Sbjct: 178 RPTQNGFSDIS--EDSGTAHDRNNSPQ-QYSTHNKRERLKSLDTFRGLSLMIMVFVNYGG 234

Query: 85  GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK----L 140
           G Y   DH PWNG T+AD V P+F+FI+G A+  + + + K      +++++ L+    L
Sbjct: 235 GGYWFFDHPPWNGITVADLVFPWFIFIMGTAMNYSFRGMMKRGTPRYRMLYKVLRRAILL 294

Query: 141 LFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK-RRPNVLE 199
            F GI+L   +          V++K IR  G+LQR +L Y+V+ L E   ++   P   +
Sbjct: 295 FFIGIVLNTNWG--------PVNLKTIRIPGVLQRFSLTYLVLGLFEVCFSRYDTPEKYQ 346

Query: 200 PR---HLSIFTAYQWQWIGGFIAFVIYIITTYSLYV---PNWSFSEHSDHGVKKYIVKCG 253
            R    L     +  QW         Y+  T+ L +   P         H   KY   C 
Sbjct: 347 GRCWSSLRDILLFLPQWFLALGILAAYVCLTFLLPIGPCPTGYIGPGGLHDSSKY-YNC- 404

Query: 254 MRGHLGPACNAVGYVDRELWGINHLYSDPV 283
                     A  Y+D  + G NH+Y  P 
Sbjct: 405 -------TAGAAAYIDIMVLGKNHIYGKPT 427


>gi|418709516|ref|ZP_13270303.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|421125723|ref|ZP_15585968.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|421135286|ref|ZP_15595410.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|410020544|gb|EKO87345.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|410436829|gb|EKP85940.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|410770179|gb|EKR45405.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|456966468|gb|EMG08068.1| putative membrane protein [Leptospira interrogans serovar
           Grippotyphosa str. LT2186]
          Length = 381

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 153/345 (44%), Gaps = 81/345 (23%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +    R+ +LD FRG+TV  MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG
Sbjct: 4   KLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFAVG 63

Query: 114 VAIALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRW 169
           ++I  ++    KI  +     I  R++ L+  G+ L   G +S +            +R 
Sbjct: 64  ISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFS-----------ELRI 112

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRI  VY VVA +  +  KR              A    WI   I           
Sbjct: 113 PGVLQRIGFVYWVVASLYLILPKR--------------AILISWIPILI----------- 147

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
             V  W   +    G  + IV      +L P  +   ++DR ++G NHL+          
Sbjct: 148 --VHTWILIQLPPPG--ESIV------YLEPGKDIGAWIDRNVFGENHLW---------- 187

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                       + + +W     +PEG  S IS+I +  +G+  G +L   K +  + + 
Sbjct: 188 ------------KFSKTW-----DPEGFFSGISSITTSLLGVFCGSILSS-KTNETKKQI 229

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
               GFG+L + + L +   +P+NK L++ SYV +TAG A   +G
Sbjct: 230 LSIFGFGILFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFLSIG 274


>gi|189502592|ref|YP_001958309.1| hypothetical protein Aasi_1258 [Candidatus Amoebophilus asiaticus
           5a2]
 gi|189498033|gb|ACE06580.1| hypothetical protein Aasi_1258 [Candidatus Amoebophilus asiaticus
           5a2]
          Length = 380

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 102/350 (29%), Positives = 160/350 (45%), Gaps = 79/350 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           +Q   +R+ +LD FRGLTV  MIL ++ G     YA + H+ W+GCT  D + PFFLFIV
Sbjct: 1   MQHLKQRLVSLDFFRGLTVAGMILANNPGSWGHIYAPLKHAEWHGCTPTDLIFPFFLFIV 60

Query: 113 GVAIALAL---KKVPKINGA-VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD-MKHI 167
           GV+IA A+   K++P+ +   + K + R L L   GI L    +  P   +  ++  K +
Sbjct: 61  GVSIAFAIGSKKELPETHSQLILKSVRRMLTLFCLGIFL----ALYPKIFTSPIEAFKTV 116

Query: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227
           R  G+LQR A+VY +  +I     K  P  +    L +  +Y             +I+ T
Sbjct: 117 RIPGVLQRTAIVYFISTII---FLKFTPRTILKIMLGLLVSY-------------WILMT 160

Query: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287
           + + VP   ++                  +L    N   ++DR L    HL+        
Sbjct: 161 F-VPVPGIGYA------------------NLEQETNLAAWIDRNLLTEPHLW-------- 193

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS--A 345
                         +   +W     +PEG+L TISAI +G  GI  G +L+  K  S   
Sbjct: 194 --------------KAVRTW-----DPEGILGTISAIATGLSGILAG-ILLQRKDQSDTE 233

Query: 346 RLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           ++    S G   +I  +I +F    PINK L++ S+V +T+G A   L +
Sbjct: 234 KIARLFSSGTLAVITGLIWNFI--FPINKSLWTSSFVLYTSGLAYIILAL 281


>gi|119512372|ref|ZP_01631456.1| hypothetical protein N9414_19342 [Nodularia spumigena CCY9414]
 gi|119462961|gb|EAW43914.1| hypothetical protein N9414_19342 [Nodularia spumigena CCY9414]
          Length = 369

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 101/338 (29%), Positives = 152/338 (44%), Gaps = 84/338 (24%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+++LD FRG+T+  MILV+ AG A   Y  + H+ W+GCT  D V PFFLFIVGVA++ 
Sbjct: 2   RLSSLDVFRGITIAAMILVNMAGVAGEVYPPLAHADWHGCTPTDLVFPFFLFIVGVAMSF 61

Query: 119 ALKKVPKINGAVKKIIFRTLKLLF-WGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
           +L K  +  G  +  IFR   +LF  G++L G ++       +  D+  IR  G+LQRI+
Sbjct: 62  SLSKYTE-KGYSR--IFRRAAILFALGLLLNGFWNQG----IWTFDLSKIRIMGVLQRIS 114

Query: 178 LVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF 237
           L Y++ +L   L   R+                 QWI   +  + Y +T   + VP +  
Sbjct: 115 LAYLLASL-AVLNLPRKG----------------QWILAGVLLIGYWLTMMYVPVPEYGA 157

Query: 238 SEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNS 297
              +  G                  N   Y+DR +    HLY+   +  L          
Sbjct: 158 GVLTREG------------------NFGAYIDRLIIPQVHLYAGDGYQNLG--------- 190

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR---LKHWVSMG 354
                          +PEGL STI A+++   G        +F G   R   +K   S+G
Sbjct: 191 ---------------DPEGLFSTIPAVVNVLAG--------YFTGQWIRNQPVKTRTSIG 227

Query: 355 FGLLIIAIIL---HFTNAIPINKQLYSFSYVCFTAGAA 389
            GL  I  ++    +    PINK+L++ SYV F+ G A
Sbjct: 228 LGLFGIGCLIIGWAWGWIFPINKKLWTSSYVVFSNGWA 265


>gi|428308802|ref|YP_007119779.1| hypothetical protein Mic7113_0454 [Microcoleus sp. PCC 7113]
 gi|428250414|gb|AFZ16373.1| hypothetical protein Mic7113_0454 [Microcoleus sp. PCC 7113]
          Length = 381

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 102/341 (29%), Positives = 143/341 (41%), Gaps = 79/341 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGA-----YARIDHSPWNGCTLADFVMPFFLFIV 112
           + SKR+ +LD FRG+T+  MILV+  G A     Y  + H+ WNG T  D V PFFLFIV
Sbjct: 7   KPSKRLTSLDVFRGITIAGMILVNMIGVAGDKNVYPPLLHADWNGFTPTDLVFPFFLFIV 66

Query: 113 GVAIALALKKVPKINGAVK----KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           G A+A +  K    N        +II R+L L   GI+L G + +         +   IR
Sbjct: 67  GAAMAFSFSKYKHGNKPTPTVYWRIIRRSLILFALGILLNGFWEY---------NWSSIR 117

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQRI+L Y++ +LI  L   R+                 QW       + Y     
Sbjct: 118 IMGVLQRISLTYLIASLI-VLNVPRKG----------------QWAIAAFLLIGYWFAMS 160

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            + VP        D+G+          G+L    N   Y DR +    HLY    ++ + 
Sbjct: 161 LIPVP--------DYGM----------GNLTREGNFGAYFDRLIIPTAHLYKGDDFNGMG 202

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                                   +PEGL ST+ A++S   G   G  L      S    
Sbjct: 203 ------------------------DPEGLFSTLPAVVSVLFGYLTGDWLRQQPIKSTTSM 238

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
             V +G   L+I  +  F    PINK+L++ SYV FT G A
Sbjct: 239 DLVLLGLSCLVIGQVWDF--WFPINKKLWTSSYVLFTTGWA 277


>gi|298479647|ref|ZP_06997847.1| membrane protein [Bacteroides sp. D22]
 gi|336403243|ref|ZP_08583960.1| hypothetical protein HMPREF0127_01273 [Bacteroides sp. 1_1_30]
 gi|295084924|emb|CBK66447.1| Uncharacterized conserved protein [Bacteroides xylanisolvens XB1A]
 gi|298274037|gb|EFI15598.1| membrane protein [Bacteroides sp. D22]
 gi|335946636|gb|EGN08437.1| hypothetical protein HMPREF0127_01273 [Bacteroides sp. 1_1_30]
          Length = 371

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 103/357 (28%), Positives = 151/357 (42%), Gaps = 85/357 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            KS+R+ +LD  RG+T+V MILV++ G     YA + H+ WNG T  D V PFF+FI+GV
Sbjct: 1   MKSERLLSLDVLRGITIVGMILVNNPGTWESVYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60

Query: 115 AIALALKKVPK--INGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD--MKHIRWC 170
           +++ AL +         + K++ RT+ L   G+ L          +  GV+     IR  
Sbjct: 61  SMSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLSWF-----SLVCAGVEQPFSQIRIL 115

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQR+AL Y   +L+  + + RRP  L              WI   I            
Sbjct: 116 GVLQRLALAYFFGSLL--IMSVRRPANLA-------------WISAII------------ 148

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGM-RGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
                         +  YIV   +  G      N +   DR L+G  HLY          
Sbjct: 149 --------------LIGYIVLLALGNGFELSEQNIIAVTDRTLFGETHLY---------- 184

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                      RE  P   R  F+PEGLLST+  I    IG   G++L        RL  
Sbjct: 185 -----------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQ 233

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
              +G  LL    +L +    P+NK+++S ++V  T G A        S+F+  + W
Sbjct: 234 ISILGIVLLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFA--------SLFLVLLTW 280


>gi|330805524|ref|XP_003290731.1| hypothetical protein DICPUDRAFT_49381 [Dictyostelium purpureum]
 gi|325079117|gb|EGC32733.1| hypothetical protein DICPUDRAFT_49381 [Dictyostelium purpureum]
          Length = 644

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 96/389 (24%), Positives = 165/389 (42%), Gaps = 90/389 (23%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
            +K  R+ +LD FRG ++ +MI V+  GG Y   +HS WNG T+AD V P+F+FI+G+A+
Sbjct: 202 NKKKDRLKSLDVFRGFSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWFVFIMGIAM 261

Query: 117 ALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
            L+   + +        V+K++ R+  L   G+ +             GV+++H R  G+
Sbjct: 262 PLSFNAMERRGTTKLVIVQKLVRRSAILFALGLFINN-----------GVNLQHWRILGV 310

Query: 173 LQRIALVYVVVALIE---------------TLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
           LQR A+ Y++V LI                 L   ++ +V+ P   +I  A Q Q I   
Sbjct: 311 LQRFAISYLIVGLIMLFVPLWRFRPSPSDINLNIDQQQHVIAPLDFNIDPAQQ-QSINNN 369

Query: 218 IAFVIYIITTYSLYV-PNWSFSEHSDHGVKKYIVK----------------------CGM 254
                   ++  +Y   N SF  +    +  Y ++                      C  
Sbjct: 370 NNNNNNNNSSDKIYSGKNNSFINNYLADLAPYWIQWLVALLLLAGWFLLMFLVPVPGCP- 428

Query: 255 RGHLGPA----------CN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRE 302
           +G+LGP           C   A   +D +++  NH++  P    +               
Sbjct: 429 KGYLGPGGIGDNSLYPNCTGGAARLIDMKIFTNNHIFQSPTCQDI--------------- 473

Query: 303 DAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAI 362
               +  + ++PEG +  +++I    IG+  G +++ +K + +RL  W+        IA 
Sbjct: 474 ----YKTSSYDPEGTVGYLTSIFICFIGVQAGRIILIYKSNRSRLIRWMVWSAVCCGIAA 529

Query: 363 IL----HFTNAIPINKQLYSFSYVCFTAG 387
            L         IPINK L+S S+V   AG
Sbjct: 530 GLCGLSQNDGVIPINKNLWSPSFVFLMAG 558


>gi|393785792|ref|ZP_10373938.1| hypothetical protein HMPREF1068_00218 [Bacteroides nordii
           CL02T12C05]
 gi|392661411|gb|EIY54997.1| hypothetical protein HMPREF1068_00218 [Bacteroides nordii
           CL02T12C05]
          Length = 361

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 90/332 (27%), Positives = 146/332 (43%), Gaps = 75/332 (22%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + R+ +LD  RG TV  MILV++AG     YA + H+ W+G T AD V P F+F++G++ 
Sbjct: 4   NNRLLSLDVLRGFTVAGMILVNNAGACGYGYAPLRHAKWDGFTPADLVFPMFMFLMGIST 63

Query: 117 ALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
            ++L+K   +   A+ KII R L L+  GI ++   + +   +    D +H+R  G++QR
Sbjct: 64  YISLRKYDFQWRLAIGKIIKRALLLILIGIAMKWIINSSETGIW--TDWEHMRLLGVMQR 121

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW 235
           + + Y   A++      +R     P  L +   Y          F++ +I          
Sbjct: 122 LGICYGATAIMALFIPHKR---FFPVALLLLAGY----------FILQLIGN-------- 160

Query: 236 SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSP 295
            F +  D                    N +  VD  + G NH+Y                
Sbjct: 161 GFEKSPD--------------------NIIAIVDSTVLGTNHMYLQG------------- 187

Query: 296 NSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF 355
                        R   EPEG+LSTI AI    IG   G ++I+ K +  R++    +G 
Sbjct: 188 -------------RQFVEPEGILSTIPAIAQVMIGFVCGRMIINQKDNKERMQKLFFLGT 234

Query: 356 GLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            +L    +  F+ A P+NK+L+S S+V  T G
Sbjct: 235 LMLFAGFL--FSYACPLNKRLWSPSFVLLTCG 264


>gi|417770421|ref|ZP_12418329.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Pomona]
 gi|418680131|ref|ZP_13241383.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Kennewicki LC82-25]
 gi|421117858|ref|ZP_15578212.1| putative membrane protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|400328139|gb|EJO80376.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Kennewicki LC82-25]
 gi|409947562|gb|EKN97558.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Pomona]
 gi|410010535|gb|EKO68672.1| putative membrane protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|455668600|gb|EMF33807.1| putative membrane protein [Leptospira interrogans serovar Pomona
           str. Fox 32256]
          Length = 381

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 152/345 (44%), Gaps = 81/345 (23%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +    R+ +LD FRG+TV  MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG
Sbjct: 4   KLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFAVG 63

Query: 114 VAIALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRW 169
           ++I  ++    KI  +     I  R++ L+  G+ L   G +S +            +R 
Sbjct: 64  ISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFS-----------ELRI 112

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRI  VY VVA +  +  KR              A    WI   I           
Sbjct: 113 PGVLQRIGFVYWVVASLYLILPKR--------------AILISWIPILI----------- 147

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
             V  W   +    G  + IV      +L P  +   ++DR ++G NHL+          
Sbjct: 148 --VHTWILIQLPPPG--ESIV------YLEPGKDIGAWIDRNVFGENHLW---------- 187

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                       + + +W     +PEG  S IS+I +  +G+  G +L   K +  + + 
Sbjct: 188 ------------KFSKTW-----DPEGFFSGISSITTSLLGVFCGSILSS-KTNETKKQI 229

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
               GFG L + + L +   +P+NK L++ SYV +TAG A   +G
Sbjct: 230 LSIFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFLSIG 274


>gi|262407085|ref|ZP_06083634.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294648023|ref|ZP_06725570.1| putative membrane protein [Bacteroides ovatus SD CC 2a]
 gi|294809833|ref|ZP_06768513.1| putative membrane protein [Bacteroides xylanisolvens SD CC 1b]
 gi|345512215|ref|ZP_08791750.1| hypothetical protein BSAG_03359 [Bacteroides sp. D1]
 gi|229445856|gb|EEO51647.1| hypothetical protein BSAG_03359 [Bacteroides sp. D1]
 gi|262355788|gb|EEZ04879.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292636642|gb|EFF55113.1| putative membrane protein [Bacteroides ovatus SD CC 2a]
 gi|294442971|gb|EFG11758.1| putative membrane protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 371

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 103/357 (28%), Positives = 151/357 (42%), Gaps = 85/357 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            KS+R+ +LD  RG+T+V MILV++ G     YA + H+ WNG T  D V PFF+FI+GV
Sbjct: 1   MKSERLLSLDVLRGITIVGMILVNNPGTWESVYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60

Query: 115 AIALALKKVPK--INGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD--MKHIRWC 170
           +++ AL +         + K++ RT+ L   G+ L          +  GV+     IR  
Sbjct: 61  SMSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLSWF-----SLVCAGVEQPFSQIRIL 115

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQR+AL Y   +L+  + + RRP  L              WI   I            
Sbjct: 116 GVLQRLALAYFFGSLL--IMSVRRPANLA-------------WISAII------------ 148

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGM-RGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
                         +  YIV   +  G      N +   DR L+G  HLY          
Sbjct: 149 --------------LIGYIVLLALGNGFELSEQNIIAVTDRTLFGETHLY---------- 184

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                      RE  P   R  F+PEGLLST+  I    IG   G++L        RL  
Sbjct: 185 -----------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQ 233

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
              +G  LL    +L +    P+NK+++S ++V  T G A        S+F+  + W
Sbjct: 234 ISILGIVLLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFA--------SLFLVFLTW 280


>gi|393783262|ref|ZP_10371437.1| hypothetical protein HMPREF1071_02305 [Bacteroides salyersiae
           CL02T12C01]
 gi|392669541|gb|EIY63029.1| hypothetical protein HMPREF1071_02305 [Bacteroides salyersiae
           CL02T12C01]
          Length = 365

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 91/345 (26%), Positives = 150/345 (43%), Gaps = 79/345 (22%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ +LD  RG+TV  MILV++AG     YA + H+ W+G T AD V P F+F++G++  
Sbjct: 9   QRLLSLDVLRGITVAGMILVNNAGACGYGYAPLRHAKWDGFTPADLVFPMFMFLMGISTY 68

Query: 118 LALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGV--DMKHIRWCGILQ 174
           ++L+K   +    + KII R   L+  GI ++        +   G+  D +H+R  G++Q
Sbjct: 69  ISLRKYNFQWQLTIGKIIKRAFLLILIGIAMK----WLIHSFETGIWNDWEHMRILGVMQ 124

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           R+ + Y + A++      +R                      F+   + ++  Y      
Sbjct: 125 RLGICYGITAVMALFIPHKR----------------------FLPIALLLLIGY------ 156

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
                        +I++    G      N +  VD  + G +H+Y               
Sbjct: 157 -------------FILQLAGNGFEKSPDNIMAIVDSTVLGTSHMYLQG------------ 191

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
                         R   EPEG+LSTI A+    IG   GH+LI+ K +  R++    MG
Sbjct: 192 --------------RQFVEPEGILSTIPAVAQVMIGFVCGHMLINRKDNQERMQQLFFMG 237

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSI 399
             LL    +L +  A P+NK+L+S S+V  T G A   L +   I
Sbjct: 238 TLLLFAGFLLSY--ACPLNKRLWSPSFVLVTCGIAALALAVLIEI 280


>gi|296222155|ref|XP_002757067.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase, partial
           [Callithrix jacchus]
          Length = 516

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 86/285 (30%), Positives = 136/285 (47%), Gaps = 49/285 (17%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 121
           R+ ++D FRG+ ++LM+ V+  GG Y    H+ WNG T+AD V P+F+FI+G ++ L++ 
Sbjct: 267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGTSVFLSMT 326

Query: 122 KVPKINGAVK-----KIIFRTLKLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQR 175
            + +  G  K     KI +R+  L+  GII+    Y   P      +    +R  G+LQR
Sbjct: 327 SILQ-RGCSKFRLLGKIAWRSFLLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQR 379

Query: 176 IALVYVVVALIETLTTKRRPN--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLY 231
           + + Y VVA++E L  K  P     E   LS+   T+   QW+       +++  T+ L 
Sbjct: 380 LGVTYFVVAVLELLFAKPVPEHCTSERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLP 439

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRL 287
           VP          G     +  G  G  G  P C   A GY+DR L G +HLY  P  + L
Sbjct: 440 VP----------GCPTGYLGPGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVL 489

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIH 332
               ++                  ++PEG+L TI++IL   +G+ 
Sbjct: 490 YHTEVA------------------YDPEGILGTINSILMAFLGVQ 516


>gi|284041413|ref|YP_003391343.1| heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
           DSM 74]
 gi|283820706|gb|ADB42544.1| Heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
           DSM 74]
          Length = 364

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 94/346 (27%), Positives = 150/346 (43%), Gaps = 85/346 (24%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            ++  R+ +LDA RG T+  MI+ +  G     Y  + HS WNG +  D + PFFLFIVG
Sbjct: 1   MRQPHRLISLDAMRGFTIAAMIVANFPGSEEFVYFTLRHSRWNGLSFTDLIAPFFLFIVG 60

Query: 114 VAIALALKKVPKINGA-----VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           V+I LA  +  + +G+     ++KI+ R+LK+   G+ L    +  PD      D   +R
Sbjct: 61  VSIVLAYAR-KRADGSPKGPLIQKIVLRSLKIFAVGMFL----NLLPD-----FDFATLR 110

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
           W G L RIA+V++V AL+   T+ R                Q  WI        ++  T 
Sbjct: 111 WTGTLHRIAIVFLVCALLFLTTSWR----------------QQAWIATLTLLAYWLAMT- 153

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
              +P        D G          R  L P  N   ++DR                  
Sbjct: 154 --QIPT------PDVG----------RVVLEPGQNLAAWLDRRYL--------------- 180

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                     P R    +W     +PEG+LST  +I++G +G+  G +++     +A++ 
Sbjct: 181 ----------PGRMWQGTW-----DPEGILSTFPSIVTGILGMLAGRLMVSPASQTAKVS 225

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
           +   M  G+   A+   +    P+N+ L++ S+V  TAG A   LG
Sbjct: 226 Y--LMTAGVFTAALGYFWGLTFPVNENLWTSSFVLVTAGFAALVLG 269


>gi|281209662|gb|EFA83830.1| hypothetical protein PPL_02898 [Polysphondylium pallidum PN500]
          Length = 409

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 96/334 (28%), Positives = 139/334 (41%), Gaps = 82/334 (24%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYA--RIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           KR+ +LD  RGLT+  MILVD+ GG      +  + WNG + AD + P FLFI G +I+L
Sbjct: 37  KRMLSLDTARGLTIFGMILVDNQGGPEVIWPLKETDWNGISTADLIFPSFLFICGFSISL 96

Query: 119 ALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIAL 178
           ALK           II RT+ LLF       G     + +++       R  G+LQRI+L
Sbjct: 97  ALKNAKNDRPTWINIIRRTI-LLF-------GIQLFLNLMAHKFVFSTFRVMGVLQRISL 148

Query: 179 VYVVVALIETLTTKRRPNVLEPRHLSIFTAYQW-QWIGGFIAFVIYIITTYSLYVPNWSF 237
            Y        L  K                  W Q +   I+  IY+   Y+  VP    
Sbjct: 149 CYCFSCCSFMLLPK------------------WAQRVALVISATIYLCLMYAYPVPG--- 187

Query: 238 SEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNS 297
                         CG RG++  +CNA GY+D  +   N ++                  
Sbjct: 188 --------------CG-RGNITRSCNAAGYIDNLILRKNMIH------------------ 214

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL-IHFK---GHSARLKHWVSM 353
                        P +PEG +ST SA ++  +G+  G +L  H +   G    L  W+S+
Sbjct: 215 -------------PTDPEGFISTFSAFITTWMGVELGRILTTHARSADGWKDILIRWLSI 261

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           G    +I + L  TN I  NK ++SFS+   T  
Sbjct: 262 GMVCAMIGLFLDATNVIQFNKIIWSFSFAMLTVA 295


>gi|24213473|ref|NP_710954.1| hypothetical protein LA_0773 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45658672|ref|YP_002758.1| hypothetical protein LIC12842 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|386073105|ref|YP_005987422.1| hypothetical protein LIF_A0631 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|417764272|ref|ZP_12412242.1| putative membrane protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
 gi|417786789|ref|ZP_12434477.1| putative membrane protein [Leptospira interrogans str. C10069]
 gi|418669621|ref|ZP_13231000.1| putative membrane protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|418701883|ref|ZP_13262801.1| putative membrane protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|418702896|ref|ZP_13263788.1| putative membrane protein [Leptospira interrogans serovar
           Hebdomadis str. R499]
 gi|418717763|ref|ZP_13277304.1| putative membrane protein [Leptospira interrogans str. UI 08452]
 gi|418729566|ref|ZP_13288113.1| putative membrane protein [Leptospira interrogans str. UI 12758]
 gi|421083731|ref|ZP_15544602.1| putative membrane protein [Leptospira santarosai str. HAI1594]
 gi|421102101|ref|ZP_15562711.1| putative membrane protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|421121310|ref|ZP_15581607.1| putative membrane protein [Leptospira interrogans str. Brem 329]
 gi|24194245|gb|AAN47972.1|AE011263_12 conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45601916|gb|AAS71395.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|353456894|gb|AER01439.1| conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|400353508|gb|EJP05677.1| putative membrane protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
 gi|409950064|gb|EKO04595.1| putative membrane protein [Leptospira interrogans str. C10069]
 gi|410345744|gb|EKO96814.1| putative membrane protein [Leptospira interrogans str. Brem 329]
 gi|410368246|gb|EKP23624.1| putative membrane protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410433648|gb|EKP77988.1| putative membrane protein [Leptospira santarosai str. HAI1594]
 gi|410754552|gb|EKR16202.1| putative membrane protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|410759015|gb|EKR25234.1| putative membrane protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|410767440|gb|EKR38115.1| putative membrane protein [Leptospira interrogans serovar
           Hebdomadis str. R499]
 gi|410775744|gb|EKR55735.1| putative membrane protein [Leptospira interrogans str. UI 12758]
 gi|410786933|gb|EKR80669.1| putative membrane protein [Leptospira interrogans str. UI 08452]
 gi|456824782|gb|EMF73208.1| putative membrane protein [Leptospira interrogans serovar Canicola
           str. LT1962]
          Length = 381

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 152/345 (44%), Gaps = 81/345 (23%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +    R+ +LD FRG+TV  MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG
Sbjct: 4   KLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFAVG 63

Query: 114 VAIALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRW 169
           ++I  ++    KI  +     I  R++ L+  G+ L   G +S +            +R 
Sbjct: 64  ISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFS-----------ELRI 112

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRI  VY VVA +  +  KR              A    WI               
Sbjct: 113 PGVLQRIGFVYWVVASLYLILPKR--------------AILISWIP-------------I 145

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           L V  W   +    G  + IV      +L P  +   ++DR ++G NHL+          
Sbjct: 146 LIVHTWILIQLPPPG--ESIV------YLEPGKDIGAWIDRNVFGENHLW---------- 187

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                       + + +W     +PEG  S IS+I +  +G+  G +L   K +  + + 
Sbjct: 188 ------------KFSKTW-----DPEGFFSGISSITTSLLGVFCGSILSS-KTNETKKQI 229

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
               GFG L + + L +   +P+NK L++ SYV +TAG A   +G
Sbjct: 230 LSIFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFLSIG 274


>gi|418690664|ref|ZP_13251772.1| putative membrane protein [Leptospira interrogans str. FPW2026]
 gi|418722429|ref|ZP_13281595.1| putative membrane protein [Leptospira interrogans str. UI 12621]
 gi|400360164|gb|EJP16144.1| putative membrane protein [Leptospira interrogans str. FPW2026]
 gi|409963797|gb|EKO27519.1| putative membrane protein [Leptospira interrogans str. UI 12621]
 gi|455790461|gb|EMF42326.1| putative membrane protein [Leptospira interrogans serovar Lora str.
           TE 1992]
          Length = 381

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 152/345 (44%), Gaps = 81/345 (23%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +    R+ +LD FRG+TV  MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG
Sbjct: 4   KLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFAVG 63

Query: 114 VAIALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRW 169
           ++I  ++    KI  +     I  R++ L+  G+ L   G +S +            +R 
Sbjct: 64  ISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFS-----------ELRI 112

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRI  VY VVA +  +  KR              A    WI               
Sbjct: 113 PGVLQRIGFVYWVVASLYLILPKR--------------AILISWIP-------------I 145

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           L V  W   +    G  + IV      +L P  +   ++DR ++G NHL+          
Sbjct: 146 LIVHTWILIQLPPPG--ESIV------YLEPGKDIGAWIDRNVFGENHLW---------- 187

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                       + + +W     +PEG  S IS+I +  +G+  G +L   K +  + + 
Sbjct: 188 ------------KFSKTW-----DPEGFFSGISSITTSLLGVFCGSILSS-KTNETKKQI 229

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
               GFG L + + L +   +P+NK L++ SYV +TAG A   +G
Sbjct: 230 LSIFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFLSIG 274


>gi|417760159|ref|ZP_12408187.1| putative membrane protein [Leptospira interrogans str. 2002000624]
 gi|417775681|ref|ZP_12423532.1| putative membrane protein [Leptospira interrogans str. 2002000621]
 gi|418673844|ref|ZP_13235155.1| putative membrane protein [Leptospira interrogans str. 2002000623]
 gi|409944118|gb|EKN89707.1| putative membrane protein [Leptospira interrogans str. 2002000624]
 gi|410574555|gb|EKQ37586.1| putative membrane protein [Leptospira interrogans str. 2002000621]
 gi|410579122|gb|EKQ46972.1| putative membrane protein [Leptospira interrogans str. 2002000623]
          Length = 381

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 152/345 (44%), Gaps = 81/345 (23%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +    R+ +LD FRG+TV  MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG
Sbjct: 4   KLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFAVG 63

Query: 114 VAIALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRW 169
           ++I  ++    KI  +     I  R++ L+  G+ L   G +S +            +R 
Sbjct: 64  ISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFS-----------ELRI 112

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRI  VY VVA +  +  KR              A    WI               
Sbjct: 113 PGVLQRIGFVYWVVASLYLILPKR--------------AILISWIP-------------I 145

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           L V  W   +    G  + IV      +L P  +   ++DR ++G NHL+          
Sbjct: 146 LIVHTWILIQLPPPG--ESIV------YLEPGKDIGAWIDRNVFGENHLW---------- 187

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                       + + +W     +PEG  S IS+I +  +G+  G +L   K +  + + 
Sbjct: 188 ------------KFSKTW-----DPEGFFSGISSITTSLLGVFCGSILSS-KTNETKKQI 229

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
               GFG L + + L +   +P+NK L++ SYV +TAG A   +G
Sbjct: 230 LSIFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAGLAFLSIG 274


>gi|223935576|ref|ZP_03627492.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223895584|gb|EEF62029.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 410

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 104/347 (29%), Positives = 154/347 (44%), Gaps = 85/347 (24%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           S R+ +LD FRG T+  M+LV++ G     Y ++DH+ WNG T  D + PFFL+IVGVAI
Sbjct: 22  SARLMSLDVFRGATIASMMLVNNPGSWDSIYRQLDHAEWNGWTFTDLIFPFFLWIVGVAI 81

Query: 117 ALALKKVPKINGAVKK------------IIFRT-LKLLFWGIILQGGYSHAPDALSYGVD 163
            L+ +K  +++G   +            IIF   L L F+  ++ G Y        +  +
Sbjct: 82  PLSTQK--RLDGGASRTNLWLHVVRRAAIIFGLGLFLAFFSFLINGSYGRLGGFGPWFNE 139

Query: 164 M-KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
           +   IR  G+LQRIA+ Y++ + I  LTTK R  +               W+ G +A   
Sbjct: 140 ICGTIRIPGVLQRIAVCYLIASTIY-LTTKLRGQI--------------AWLIGLLA--A 182

Query: 223 YIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDP 282
           Y +    + VP         HG           G L P  N   YVD  + G +      
Sbjct: 183 YWVLMKCVPVPG--------HGA----------GVLTPEGNFSAYVDGNVLGRH------ 218

Query: 283 VWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKG 342
                                  +W  AP++PEG++STI AI +   GI  G +L+  K 
Sbjct: 219 -----------------------TWHGAPWDPEGVISTIPAIATCLFGILTGQLLL-IKR 254

Query: 343 HSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
              +   WV +  G+L+I         +PINK L++ SY  F AG A
Sbjct: 255 SVEQKTTWVFVS-GILLILAGAVMNIWLPINKNLWTSSYSVFMAGMA 300


>gi|334121382|ref|ZP_08495452.1| hypothetical protein MicvaDRAFT_2176 [Microcoleus vaginatus FGP-2]
 gi|333455096|gb|EGK83757.1| hypothetical protein MicvaDRAFT_2176 [Microcoleus vaginatus FGP-2]
          Length = 406

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 147/368 (39%), Gaps = 106/368 (28%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R  +LD FRG+ +  MILV++ G     Y  +DH+ W+GCT  D + PFFLFIVG A++ 
Sbjct: 2   RFKSLDVFRGIAIASMILVNNPGSWEQVYPPLDHAEWHGCTPTDLIFPFFLFIVGCAMSF 61

Query: 119 ALKK----VPKINGAVKKI-----------------------------IFRTLKLLFWGI 145
           +L K     P+      KI                             I R   +LF   
Sbjct: 62  SLSKYTQNYPQTGIETSKITQTKEKSESAKNPLPSSLFLLPYSNIYWRIARRAAILFILG 121

Query: 146 ILQGGYSHAPDALSYGVDMKH---IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRH 202
           +L    S A D L     +++   IR  G+LQRI L Y + A+     + R   +L    
Sbjct: 122 LLLNTSSIALDVLLNSAPVENFGKIRIMGVLQRIGLAYFISAIAIINLSPRNQKLLAVAV 181

Query: 203 LSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPAC 262
           L          +G + A  ++ +  Y+                          G L P  
Sbjct: 182 L----------LGYWAALTVFAVGGYT-------------------------AGELTPEG 206

Query: 263 NAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTIS 322
           N  GYVDR + G  HLY                               PF+PEGLLST+ 
Sbjct: 207 NLGGYVDRLILGSQHLYKG----------------------------GPFDPEGLLSTLP 238

Query: 323 AILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF-GLLIIAIILHFTNAIPINKQLYSFSY 381
           A+++  IG   G  L   +    + +  +++   GL  + I   +    PINKQL++ SY
Sbjct: 239 AVVTVLIGYFTGEWL---RVQPIKTRTSINLAICGLSCVVIGRLWGFLFPINKQLWTSSY 295

Query: 382 VCFTAGAA 389
           V FTAG A
Sbjct: 296 VVFTAGWA 303


>gi|449664780|ref|XP_002169793.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Hydra magnipapillata]
          Length = 369

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 98/346 (28%), Positives = 152/346 (43%), Gaps = 60/346 (17%)

Query: 77  MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL----KKVPKINGAVKK 132
           MI V+  GG Y    HS WNG T+AD + P+F+FI+G +I +++    KK+ K    V K
Sbjct: 1   MIFVNYGGGGYYFFSHSSWNGLTVADLLFPWFIFIMGSSIYISMHSLRKKLSKRKMTV-K 59

Query: 133 IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI----ET 188
           II+R+ KL                 L+ G D+ + R  G+LQR A+ Y VVAL+    ++
Sbjct: 60  IIYRSFKL-----------LLLGLFLNNGFDLANWRLPGVLQRFAISYFVVALVFLWFDS 108

Query: 189 LTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKY 248
              +   N  +     ++  +Q            +I+    L +        +  G  K 
Sbjct: 109 PNEESETNSWKNMFRDVWFPFQ------------HIVMLLLLTIYLLIIYLLNVPGCPKG 156

Query: 249 IVKCGMRGHLGP--ACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDA 304
               G  G  G    C   A GYVDR ++G+NH+Y +P    L  C              
Sbjct: 157 YFGPGGDGDHGAYEKCTGGASGYVDRTVFGLNHIYKNPTCKSLYNC-------------- 202

Query: 305 PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV--SMGFGLLIIAI 362
                  ++PEGLL TI +IL   +G+     L+ +K  +  +  W   S+  G L + +
Sbjct: 203 -----FTYDPEGLLGTIPSILLTYLGLQAARTLLFYKSKNGHIIRWFIWSVLLGALAVGL 257

Query: 363 ILHFTN--AIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
                N  AIPINK L+S SY+ F  G+    L +   + +  + W
Sbjct: 258 CGGTLNDGAIPINKNLWSLSYI-FAMGSTAYLLLLICYVLVDVLKW 302


>gi|332709783|ref|ZP_08429740.1| hypothetical protein LYNGBM3L_44860 [Moorea producens 3L]
 gi|332351381|gb|EGJ30964.1| hypothetical protein LYNGBM3L_44860 [Moorea producens 3L]
          Length = 366

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 100/333 (30%), Positives = 141/333 (42%), Gaps = 81/333 (24%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+ +  MILV++ G     Y  + H+ W+G T  D V P FLFI GVA+A 
Sbjct: 2   RLTSLDVFRGIAMASMILVNNPGSWSYVYPPLLHAKWHGFTPTDLVFPAFLFIAGVAMAF 61

Query: 119 ALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           +L K    N +V +  +R  +    L   G++L G  ++  D          IR  G+LQ
Sbjct: 62  SLVKYTNNNQSVSQGYWRIGRRCAILFALGLLLNGFPTYNWDT---------IRIMGVLQ 112

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RI+L Y + A +  L  +RR                  W+   I  + Y      + VP 
Sbjct: 113 RISLAYFLSA-VAVLNLRRRG----------------LWVLTGIVLLGYWAAMSLVPVP- 154

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
                  D+G           G+L P  N   Y+DR + G NHLY               
Sbjct: 155 -------DYGA----------GNLTPEGNFAAYIDRMVLGTNHLYK-------------- 183

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
                         +A F+PEGL ST  A+++   G   G  L H    S      V  G
Sbjct: 184 --------------QAQFDPEGLFSTFPAVVTVLAGYFVGDWLRHQPIPSRTSLGLVLFG 229

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            G L +  +  F    PINKQL++ SYV F+AG
Sbjct: 230 VGCLGLGWVWDFW--FPINKQLWTSSYVVFSAG 260


>gi|333031144|ref|ZP_08459205.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
 gi|332741741|gb|EGJ72223.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
          Length = 363

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 90/356 (25%), Positives = 154/356 (43%), Gaps = 83/356 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGV 114
           + SKR+ +LD  RG+TV  MILV++ G     Y  + H+ W+G   AD V P F+F++G+
Sbjct: 6   EDSKRLLSLDVLRGITVAGMILVNNTGSCGYNYTALRHASWDGLNFADLVFPMFMFMMGI 65

Query: 115 AIALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           +  ++L+K       A  KI  RT  L+  G+ ++         +  G+++  +R  G++
Sbjct: 66  STYISLRKYENNKKTAFYKIFKRTSLLIIIGLFMEC----IITWIEVGLNLSTLRLMGVM 121

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QR+ L Y + AL+                                          SLYVP
Sbjct: 122 QRLGLCYGITALL------------------------------------------SLYVP 139

Query: 234 NWSFSEHSDHGVKKY-IVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
           +    + +   +  Y I++    G    A N +G VDR + G+NH+Y             
Sbjct: 140 HKYLLKIALSVLLGYFIIQIVGSGFDKSAENVIGVVDRSVLGVNHIYLQG---------- 189

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                           +   +PEG+LST+ AI    IG   G  ++  + H  ++     
Sbjct: 190 ----------------KQFVDPEGVLSTLPAIAQVMIGFFCGRKILEKREHKQQMLILYR 233

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDE 408
           +G   L +  +  F+   PINK+L+S ++V  T+G A  C+ +  S+ I  +D  +
Sbjct: 234 LGSLFLFVGFV--FSYVCPINKRLWSPTFVLVTSGVA--CMAL--SLLIDTLDIKQ 283


>gi|410941669|ref|ZP_11373463.1| putative membrane protein [Leptospira noguchii str. 2006001870]
 gi|410783218|gb|EKR72215.1| putative membrane protein [Leptospira noguchii str. 2006001870]
          Length = 381

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 101/347 (29%), Positives = 155/347 (44%), Gaps = 85/347 (24%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +    R+ +LD FRG+TV  MILV++ G     Y  + H+ WNGCT  D V PFFLF+VG
Sbjct: 4   KSNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYTPLKHAKWNGCTPTDLVFPFFLFVVG 63

Query: 114 VAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHI 167
            +I  +L    KI   + KI F    R++ L+  G+ L   G +S +            +
Sbjct: 64  TSIPFSLYSKNKI--YISKIWFGICIRSITLILIGLFLNFFGEWSFS-----------KL 110

Query: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227
           R  GILQRI  VY VVA +  +  KR                           +I I   
Sbjct: 111 RIPGILQRIGFVYWVVASLYLMLPKR---------------------------IILISWI 143

Query: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287
             L V  W   +    G  + IV      +L P  +   ++DR ++G NHL+        
Sbjct: 144 PILIVHTWVLIQIPPPG--ESIV------YLEPGKDIGAWIDRNVFGENHLW-------- 187

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL 347
                         + + +W     +PEG  S IS+I +  +G+  G +L   K +  + 
Sbjct: 188 --------------KFSKTW-----DPEGFFSGISSIATTLLGVFCGSILSS-KTNETKK 227

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
           +     GFG+L++ + L +   +P+NK L++ SYV +TAG A   +G
Sbjct: 228 QILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFLSIG 274


>gi|196002389|ref|XP_002111062.1| hypothetical protein TRIADDRAFT_54611 [Trichoplax adhaerens]
 gi|190587013|gb|EDV27066.1| hypothetical protein TRIADDRAFT_54611 [Trichoplax adhaerens]
          Length = 431

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 93/330 (28%), Positives = 150/330 (45%), Gaps = 65/330 (19%)

Query: 47  KGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMP 106
           K  L  Q  +    +R+ +LD +RGL  ++M   D  GG Y    HS WNG T+ D V P
Sbjct: 147 KRNLYQQPHIDNLDRRIRSLDLYRGLCAIVMAFGDSGGGQYRFFKHSIWNGLTIVDVVFP 206

Query: 107 FFLFIVGVAIALALKK------VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSY 160
            F+FI G +++++L K       PK+   V  I  R+  L F G+++ G     P  +S 
Sbjct: 207 GFIFISGFSLSISLVKRLYKMQTPKLILIVTTIR-RSFYLFFLGLLING-----PCQIS- 259

Query: 161 GVDMKHIRWCGILQRIALVYVVVA-----LIETLTTKRRPNVLEPRHLSIFTAYQWQWIG 215
                + R  G+LQRI++ ++VV+     L  T+ +  +  VL+ + L       W  + 
Sbjct: 260 -----NWRLLGVLQRISVTFLVVSCLAVWLYPTIKSFTKDQVLQEKVLR----KMWPIMV 310

Query: 216 GFIAFVIYIITTYSLYVPNWSFSE-----HSDHGVKKYIVKCGMRGHLGPACNAVGYVDR 270
             +    Y+  T +  VP+           SD G K Y    G+           G++DR
Sbjct: 311 LIVGLHTYVTLTAA--VPDCPVGYSGPGGKSDDG-KYYNCTGGI----------AGFIDR 357

Query: 271 ELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIG 330
            ++G NHLYS P       C L   +S             PF+PEG+L T+++I    +G
Sbjct: 358 FVFGSNHLYSRP------TCKLLYQSS------------QPFDPEGVLGTLTSIFLCFLG 399

Query: 331 IHYGHVLIHFKGHSARLKHWVSMGFGLLII 360
           +  G +   F  +   ++ W+   FGLL++
Sbjct: 400 LQMGILHNIFSNNLRIMRTWIL--FGLLLV 427


>gi|256425421|ref|YP_003126074.1| hypothetical protein Cpin_6469 [Chitinophaga pinensis DSM 2588]
 gi|256040329|gb|ACU63873.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
          Length = 358

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 100/343 (29%), Positives = 155/343 (45%), Gaps = 79/343 (23%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ +LD FRGLTV  MILV++ G     Y  ++HS WNGCT  D V PFFLF+VGV++ 
Sbjct: 3   QRLLSLDFFRGLTVAAMILVNNPGSWSYVYPPLEHSKWNGCTPTDLVFPFFLFMVGVSVT 62

Query: 118 LAL-KKVPKING--AVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
            AL  +   ++G  ++   I R   +LF    +   +   P       D  ++R  G+LQ
Sbjct: 63  FALSSRKADVSGHTSLIIHIIRRAAILF---AIGLAFRLIPS-----FDFHNLRILGVLQ 114

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAF-VIYIITTYSLYVP 233
           RI++V++V++L+  L T  +P +               W+   I+F VIY +    + VP
Sbjct: 115 RISIVFLVISLLY-LKTGTKPRI---------------WLC--ISFLVIYWLLMTVVPVP 156

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
            +                     +L    N   ++DR + G  HL+              
Sbjct: 157 GYG------------------PANLEAETNLAAWIDRTVLGEQHLW-------------- 184

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
                   + A +W     +PEGLLST+ AI +G +GI  G  L       A    W+  
Sbjct: 185 --------KQARTW-----DPEGLLSTLPAISTGLLGIMTGDWLRRKDVADADKVSWL-F 230

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGIT 396
             G L +   L +    PINK L++ S+V +T G A   L ++
Sbjct: 231 AAGFLSVIAGLIWDGFFPINKSLWTSSFVLYTGGLAAMGLALS 273


>gi|427709244|ref|YP_007051621.1| hypothetical protein Nos7107_3914 [Nostoc sp. PCC 7107]
 gi|427361749|gb|AFY44471.1| hypothetical protein Nos7107_3914 [Nostoc sp. PCC 7107]
          Length = 375

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 98/335 (29%), Positives = 141/335 (42%), Gaps = 72/335 (21%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+T+  MILV+ AG A   Y  + H+ W+GCT  D V PFFLFIVGVA+  
Sbjct: 2   RLTSLDVFRGITIAAMILVNMAGVADDVYPLLAHADWHGCTPTDLVFPFFLFIVGVAMTF 61

Query: 119 ALKKVPKINGAVKKI---IFRTLKLLF-WGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           +L K    N     +   IFR   +LF  G++L   ++          D   IR  G+LQ
Sbjct: 62  SLSKYTADNKPTSTVYLRIFRRAAILFALGLLLNVFWNKGVGTF----DFSSIRIMGVLQ 117

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RI+L Y++ +L   L   R+                 QWI   +  + Y +T   + VP 
Sbjct: 118 RISLSYLLASL-AVLNLPRKG----------------QWILAAVLLIGYWLTMMYVPVPE 160

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
           +     +  G                  N   Y DR +    HLY+   +  +       
Sbjct: 161 YGAGVLTREG------------------NFGAYFDRLIIPQTHLYAGDGFKSMG------ 196

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
                             +PEGL STI A++S   G   G  +      +      +  G
Sbjct: 197 ------------------DPEGLFSTIPAVVSVLAGYFTGQWIRKQPVQTRTSVGLILFG 238

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            G LII     +    PINK+L++ SYV FT+G A
Sbjct: 239 MGCLIIG--WAWGWVFPINKKLWTSSYVIFTSGWA 271


>gi|386819709|ref|ZP_10106925.1| hypothetical protein JoomaDRAFT_1633 [Joostella marina DSM 19592]
 gi|386424815|gb|EIJ38645.1| hypothetical protein JoomaDRAFT_1633 [Joostella marina DSM 19592]
          Length = 366

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 104/354 (29%), Positives = 153/354 (43%), Gaps = 80/354 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           ++Q  +R  +LD FRGLT+ LMI+V+  G     Y  + H+PWNG TL D V P FLF+V
Sbjct: 1   MKQLKERYLSLDVFRGLTLFLMIIVNTPGSWSFIYKPLHHAPWNGFTLTDLVFPTFLFVV 60

Query: 113 GVAIALALKKVPKINGA--VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 170
           G A++ +LKK  +I     +KK+  RT  +   G +L         AL     +   R  
Sbjct: 61  GNAMSFSLKKFEEIGNTAFLKKVFKRTFLIFLIGFLLYWFPFFKDGALK---PISETRIF 117

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQRIAL Y   ALI                      + W+  G  I  VI ++     
Sbjct: 118 GVLQRIALCYCFAALI---------------------LHYWKPKGALIFSVIALV----- 151

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
                            Y +   + G L    NA    D  L G +H+Y           
Sbjct: 152 ----------------GYHIILLLFGDLTMQGNAAIKADLWLIGSSHMYKG--------- 186

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                      E  P      F+PEG+LST+ AI++   G ++  V +  KG +      
Sbjct: 187 -----------EGFP------FDPEGVLSTLPAIVNVIAG-YFAGVFLQQKGKTYEAIAK 228

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEM 404
           ++M  G+LI  + L +   +PINK+L++ SYV  T G     L I   +F+ E+
Sbjct: 229 LTMVGGVLIF-LALGWDLLLPINKKLWTSSYVLLTVGIDVMVLSIL--VFVIEI 279


>gi|237720190|ref|ZP_04550671.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|293371122|ref|ZP_06617659.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|229450742|gb|EEO56533.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|292633780|gb|EFF52332.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
          Length = 371

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 102/344 (29%), Positives = 145/344 (42%), Gaps = 83/344 (24%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            KS+R+ +LD  RG+T+V MILV++ G     YA + H+ WNG T  D V PFF+FI+GV
Sbjct: 1   MKSERLLSLDILRGITIVGMILVNNPGTWESIYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60

Query: 115 AIALALKKVPK--INGAVKKIIFRTLKLLF------WGIILQGGYSHAPDALSYGVDMKH 166
           +++ AL +       G + K++ RTL L        W  ++  G               H
Sbjct: 61  SMSFALSRFDHHFSRGFIIKLVRRTLILFLLGLFLSWFSLVCTGVEQ---------PFSH 111

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIIT 226
           IR  G+LQR+AL Y   +L+  +   RRP  L              WI G       I+ 
Sbjct: 112 IRILGVLQRLALAYFFGSLL--IVGVRRPANLA-------------WISGI------ILA 150

Query: 227 TYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
            YS  +      E S+                    N +   DR L+G  HLY       
Sbjct: 151 GYSTLLALGHGFELSEQ-------------------NIIAVTDRTLFGEAHLY------- 184

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
                         RE  P   R  F+PEGLLST+  I    IG   G++L        R
Sbjct: 185 --------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHR 230

Query: 347 LKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
           L     +G  LL    +L +    P+NK+++S ++V  T G A 
Sbjct: 231 LLQISILGIALLFAGWLLSY--GCPLNKKVWSPTFVLVTCGFAS 272


>gi|226225918|ref|YP_002760024.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27]
 gi|226089109|dbj|BAH37554.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27]
          Length = 401

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 99/346 (28%), Positives = 157/346 (45%), Gaps = 78/346 (22%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
           K++R+ +LD FRG+TV  M+LV++ G     Y  + H+PW+G T  D + PFFLFIVG+ 
Sbjct: 9   KAERLLSLDVFRGMTVAGMLLVNNPGTWSAIYPPLQHAPWHGWTPTDLIFPFFLFIVGIT 68

Query: 116 IALALK----KVPKINGAVKKIIFRTLKLLFWGIILQG-GYSHAPDAL---SYG---VD- 163
             L+L+    +       +++I+ R   +  +G++L G  +   P  L   S+G   +D 
Sbjct: 69  TELSLRARRARGDDEQAILRQILKRGALIFLFGLLLAGFPFFTWPPGLPGASFGERVIDR 128

Query: 164 MKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIY 223
            +H R  G+LQRI + Y+  AL+   TT R+  V+                   +A +++
Sbjct: 129 FEHWRIMGVLQRIGVAYLCGALLTWRTTVRQQGVI-------------------LAALLF 169

Query: 224 IITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPV 283
                   VP        D GV    V         P      ++DR + G+NHL+S   
Sbjct: 170 GYWALMTLVPV------PDTGVAGRFVLD------KPDQLLSAWLDRTVLGVNHLWS--- 214

Query: 284 WSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--HFK 341
                               A +W     +PEGLLSTI AI +   G   G  +      
Sbjct: 215 -------------------GAKTW-----DPEGLLSTIPAIGTMICGTFAGRWIARQELT 250

Query: 342 GHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            H  RL    ++G   +++ ++ H+    PINK +++ SYV FTAG
Sbjct: 251 LHE-RLVALFAVGALAMMVGLMWHWV--FPINKSIWTSSYVVFTAG 293


>gi|390344818|ref|XP_795043.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Strongylocentrotus purpuratus]
          Length = 680

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 98/373 (26%), Positives = 156/373 (41%), Gaps = 49/373 (13%)

Query: 31  INKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARI 90
           IN    +  +  QD+Q           + S R+ ++D FRGL +  ++L     G +   
Sbjct: 259 INHNGSILSNGSQDDQTPLTFPASDKPKSSLRLRSVDTFRGLAITHLVLGASGDGHFWYS 318

Query: 91  DHSPWNGCTLADFVMPFFLFIVGVAIALA----LKKVPKINGAVKKIIFRTLKLLFWGII 146
           +H+ W G T+ADF+ P+F+FI+G +I L+    L K        KKI+FR++ L   G+ 
Sbjct: 319 NHARWYGITVADFMFPWFVFIMGTSIHLSFNILLSKGLSYCAIFKKIVFRSISLFIMGVC 378

Query: 147 LQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVA---LIETLTTKRRPNVLEPRHL 203
           +Q   SH         D++++R  G+LQR  + Y +VA   L+      RR       ++
Sbjct: 379 IQ---SHN--------DLRNLRIPGVLQRFGITYFIVASSYLLSRRLQARRAEKTGKCYM 427

Query: 204 SI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPA 261
                T Y    +      V+++  T+ L VP          G        G  G L   
Sbjct: 428 MFRDITDYLELPLAA-CCLVVHLCLTFLLPVPGCPLGYQGPGG-----PLVGENGELTNC 481

Query: 262 C-NAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLST 320
              A GY+DR  +   HL        +   T        +R D          PEG+L T
Sbjct: 482 TGGASGYIDRTFFTEAHL--------ILVNTCDDVYRTIVRSD----------PEGILGT 523

Query: 321 ISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNA----IPINKQL 376
            ++I     G+  G +L  F     RL   +  G  L+  + +L   +     IP+NK L
Sbjct: 524 FTSIALCVFGLQSGKILHLFTTVRGRLVRLLLWGLALISCSAVLCKCSMADGWIPLNKNL 583

Query: 377 YSFSYVCFTAGAA 389
           +S S++  T G A
Sbjct: 584 WSVSFIALTGGTA 596


>gi|410664067|ref|YP_006916438.1| hypothetical protein M5M_07585 [Simiduia agarivorans SA1 = DSM
           21679]
 gi|409026424|gb|AFU98708.1| hypothetical protein M5M_07585 [Simiduia agarivorans SA1 = DSM
           21679]
          Length = 390

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 98/341 (28%), Positives = 159/341 (46%), Gaps = 62/341 (18%)

Query: 55  LLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFI 111
           L  + + R+ ++D  RGL +  M+LV++ G     YA + H+ W+G T  D + P FL++
Sbjct: 8   LAARPAGRLMSVDVLRGLAIAAMVLVNNPGSWSHVYAPLAHAEWHGWTPTDVIFPLFLYV 67

Query: 112 VGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGY-SHAPDALSYGVD-MKHIRW 169
           VG++I LA +K        +    R  KL   G+ L   Y   A    S+  D +  +R 
Sbjct: 68  VGLSIVLA-QKGETFALPGRSTWLRAAKLFGLGLFLALFYFPFAKPEFSWWRDQLLDVRI 126

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRIALVY+    +  L  KR                  QW+  ++A ++++   Y+
Sbjct: 127 LGVLQRIALVYLACCYLAWLCQKR------------------QWL-LWLATLVFMWLAYA 167

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           L +       ++D   + Y      RG L    +   ++D+ L G  HLY          
Sbjct: 168 LMLS----IPYADDTGEIY------RGQLVFGNHFSAWLDQLLLGREHLYYQT------- 210

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
              + P +              F+PEGLL+T+ AI SG +G+  G + +   GHS+RL+ 
Sbjct: 211 ---AQPFA--------------FDPEGLLTTLPAISSGLLGVLAG-LQLKAAGHSSRLEI 252

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
           W + G  +L+   +LH     PINK L++ S+V  TAG + 
Sbjct: 253 WFAGGVLMLVAGQLLH--PVCPINKALWTPSFVLVTAGVSQ 291


>gi|116621919|ref|YP_824075.1| hypothetical protein Acid_2804 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116225081|gb|ABJ83790.1| conserved hypothetical protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 367

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 97/340 (28%), Positives = 154/340 (45%), Gaps = 83/340 (24%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           +++   R+ +LDAFRG T+ LM+LV++AG    +Y +++HSPW+G T+ D V P FL+IV
Sbjct: 6   VEKLPLRLVSLDAFRGATIALMVLVNNAGSGLDSYRQLEHSPWHGWTITDTVFPSFLWIV 65

Query: 113 GVAIALALKK-----VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHI 167
           GVAI L+L K     VP+ +  + +I+ R   L  +G+ +   + H         D+   
Sbjct: 66  GVAITLSLGKRVAEGVPR-SHLLPQILRRAAILFVFGLFVY-AFPH--------FDLGTQ 115

Query: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227
           R  G+LQRIA+ Y+  ++I   +  R                Q  WI G +A   Y +  
Sbjct: 116 RILGVLQRIAICYLAASVIFLYSGVRG---------------QILWILGLLA--AYWMMM 158

Query: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287
             + VP +                    G L    N   Y+D    G ++ +S   W   
Sbjct: 159 TLIPVPGYG------------------PGRLDVEGNFAHYIDHLALGRHNYHSTRTW--- 197

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL 347
                                    +PEGL+ST+ AI +   G+  GH+L   +  + R 
Sbjct: 198 -------------------------DPEGLVSTLPAIATALFGVLAGHILRCRRTLAER- 231

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
             W+    G L++A  L  T  +PINK+L++ S+  F AG
Sbjct: 232 TSWMFTA-GSLLLAAGLICTAWLPINKKLWTDSFCLFMAG 270


>gi|410613391|ref|ZP_11324450.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
           psychrophila 170]
 gi|410167053|dbj|GAC38339.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
           psychrophila 170]
          Length = 400

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 102/345 (29%), Positives = 163/345 (47%), Gaps = 67/345 (19%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + R+  LD FRG+T+  MILV++ G     Y+ + H+ W+G TL D + PFF+FIVGV+I
Sbjct: 18  ANRLLALDVFRGMTITAMILVNNPGSWQYIYSPLAHAKWHGWTLTDLIFPFFIFIVGVSI 77

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYS------HAPDALSYGVDMKHIRWC 170
           +L+ ++  +       II   L  +F  ++L    +       A D   +   +  +R+ 
Sbjct: 78  SLSGQRQKEQGLGHGHIIHHALLRMFKLLLLGCFLALFYYNFSAADYDWFTQRLMQMRFM 137

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQRIALVY+   L+    ++ +  +     L+I  AY   W+   +AF+ Y       
Sbjct: 138 GVLQRIALVYMACVLLWLFLSRLQLVICM---LAILVAY---WLA--MAFIPY------- 182

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
                    H D G  +Y+      G L  A N   ++D  L+   HLY           
Sbjct: 183 ---------HDDLG-NQYV------GLLEYANNLSAWLDNYLFAKTHLYYS--------- 217

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK-H 349
                ++ P            F+PEG+LST+ AI SG  G+  G  L  F  HS R K  
Sbjct: 218 -----SAQPF----------AFDPEGVLSTLPAIASGLSGVLAGQWL-SFSHHSMRHKAK 261

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
           W+++  G++ + +   + + +PINK L++ SYV  T+G A   L 
Sbjct: 262 WLAIC-GVVALLLGQMWAHWLPINKALWTSSYVLLTSGYAALILA 305


>gi|255534024|ref|YP_003094396.1| hypothetical protein Phep_4143 [Pedobacter heparinus DSM 2366]
 gi|255347008|gb|ACU06334.1| conserved hypothetical protein [Pedobacter heparinus DSM 2366]
          Length = 384

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 102/349 (29%), Positives = 156/349 (44%), Gaps = 73/349 (20%)

Query: 54  QLLQQKSKRVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLF 110
           Q    K  R+ +LD FRG TV  MILV+   D G  YA ++H+ WNGCT  D + PFFLF
Sbjct: 4   QTALDKPVRLLSLDFFRGATVAAMILVNNPGDWGHIYAPLEHAEWNGCTPTDLIFPFFLF 63

Query: 111 IVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD-MKH 166
           IVGV+IA A+   K  P  +G   K I + LK       L    S  P   +  ++  + 
Sbjct: 64  IVGVSIAYAMGGKKADPSSHG---KTIVKALKRASILFGLGLFLSLFPKVFTAPLEAFQQ 120

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIIT 226
           +R  G+LQRIA+V+++ A+I    T++  N+ +   L+I   Y             + + 
Sbjct: 121 VRIPGVLQRIAVVFLISAIIFLKNTEK--NIFK-ILLAILAVY-------------WALM 164

Query: 227 TYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
           T+ + VP   ++                  +L    N   ++DR +    HL+       
Sbjct: 165 TF-IPVPGVGYA------------------NLEKETNLGAWLDRSILTEAHLW------- 198

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
                          + A +W     +PEG+LST+ AI +G  GI  G  L      +A 
Sbjct: 199 ---------------KSAKTW-----DPEGILSTLPAIATGLFGILVGVYLKRKDVDAAT 238

Query: 347 LKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
              W+    G   +A+ L +    PINK L++ S+V +T G A   L +
Sbjct: 239 KISWL-FCTGCAAVALGLLWDLQFPINKSLWTSSFVLYTGGLATMILSL 286


>gi|156401292|ref|XP_001639225.1| predicted protein [Nematostella vectensis]
 gi|156226352|gb|EDO47162.1| predicted protein [Nematostella vectensis]
          Length = 376

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 91/346 (26%), Positives = 160/346 (46%), Gaps = 55/346 (15%)

Query: 77  MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR 136
           MI V+  GG Y    H+ WNG  +AD V P+F++I+GV+I L+ K + +      KI  +
Sbjct: 1   MIFVNFGGGGYYFFGHAAWNGLLVADLVFPWFIWIMGVSITLSFKSLKRRKVKKWKICLK 60

Query: 137 TLK--LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL--TTK 192
            ++  L+ +G+   G ++   +      D++  R  G+LQR A  Y+V+AL++     ++
Sbjct: 61  VIRRSLILFGL---GLFTSNFN------DLETYRIPGVLQRFAACYIVIALMQLFLGPSE 111

Query: 193 RRPNVLEPRHL----SIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKY 248
            +  VL P+       + + ++ QW+   +   IY+  TY++ +         D   + Y
Sbjct: 112 EQTQVLYPKWWDPIRDVVSIWK-QWLAMLLLLAIYVTVTYAVKL---------DGCPRGY 161

Query: 249 IVKCGMRGHLGPACNAVG----YVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDA 304
               G+      A N  G    Y+DR+ +G  H+Y  P   +L    L            
Sbjct: 162 TGPGGIGRGYPEAFNCTGGVANYIDRKFFG-KHIYQWPTVKQLYKTKL------------ 208

Query: 305 PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL 364
                 P EPEG L T+++I    +G+  G +L  ++  + R+  W++ G  L +I + L
Sbjct: 209 ------PHEPEGFLGTLTSIFLVFLGVQAGRILHTYRKSTERITRWLAWGVFLGLIGVGL 262

Query: 365 ----HFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
                    +PINK L+S S++  T G++   L     IF   + W
Sbjct: 263 CKASENEGVVPINKNLWSVSFILVT-GSSSFFLLTFCYIFTDSLGW 307


>gi|354568330|ref|ZP_08987495.1| hypothetical protein FJSC11DRAFT_3703 [Fischerella sp. JSC-11]
 gi|353540693|gb|EHC10166.1| hypothetical protein FJSC11DRAFT_3703 [Fischerella sp. JSC-11]
          Length = 384

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 105/344 (30%), Positives = 152/344 (44%), Gaps = 81/344 (23%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA----YARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           R+ +LD FRG+T+  MILV+ A  A    Y  + H+ W+GCT  D V PFFLFIVGVA++
Sbjct: 2   RLTSLDVFRGITIAGMILVNTASIAEPNVYPPLLHAEWHGCTPTDLVFPFFLFIVGVAMS 61

Query: 118 LALKKV--PKINGAVKK---------IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKH 166
            +  K    K++G  +K         II R   L   G++L G ++       +  D   
Sbjct: 62  FSFSKYTDSKLHGEKEKVFVSLPYWRIIRRAAILFVLGLLLNGFWNQG----VWTFDFNS 117

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIIT 226
           IR  G+LQRI+L Y++ +L+         N+  PR        + QWI   +  + Y + 
Sbjct: 118 IRVMGVLQRISLTYLLASLVVF-------NI--PR--------KGQWILAGVLLIGYWLA 160

Query: 227 TYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
              +YVP       S +G           G L    N   Y+DR +    HLY    ++ 
Sbjct: 161 M--MYVP------VSGYGA----------GVLTRDGNLGAYIDRLIIPKAHLYKGDNYNF 202

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
           +                         +PEGL STI AI+S   G   G  +   K  +++
Sbjct: 203 MG------------------------DPEGLFSTIPAIVSVLAGYFAGQWIRSQKQINSK 238

Query: 347 LK-HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
                V  G   L+I  I     A PINK+L++ SYV FT G A
Sbjct: 239 TSMDLVLFGLSCLVIGGIWDL--AFPINKKLWTSSYVVFTTGWA 280


>gi|333378336|ref|ZP_08470067.1| hypothetical protein HMPREF9456_01662 [Dysgonomonas mossii DSM
           22836]
 gi|332883312|gb|EGK03595.1| hypothetical protein HMPREF9456_01662 [Dysgonomonas mossii DSM
           22836]
          Length = 387

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 101/403 (25%), Positives = 166/403 (41%), Gaps = 136/403 (33%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           + S R+ +LD  RG+T+  MILV++ G     YA + H+ WNG T  D + PFF+FI+G+
Sbjct: 4   KASSRLLSLDVLRGITIAGMILVNNPGSWGHIYAPLRHAEWNGLTPTDLIFPFFMFIMGI 63

Query: 115 AIALALKK------VPKINGAVKKIIFRTLKLLFWGIIL-----QGGYSHAPDALSYG-- 161
           +  ++L+K      VP +    +KI+ RT  +   G+ L       G  HA  A + G  
Sbjct: 64  STFISLRKFNFEFSVPTL----RKILKRTFVIFLIGLGLSWLGVSFGTYHALAADNLGFL 119

Query: 162 -------VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWI 214
                   + +H+R  G++QR+AL Y + +LI                 +IF  ++    
Sbjct: 120 ERLGRSVTNFEHLRILGVMQRLALTYGITSLI-----------------AIFIKHK---- 158

Query: 215 GGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHL----------GPACNA 264
                           Y+P              YI+  G+ G+               N 
Sbjct: 159 ----------------YIP--------------YIIVVGLVGYFLLLLFGNGFATEGYNI 188

Query: 265 VGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAI 324
           +   D+ + G+NH+Y++                               +PEG+LSTI A+
Sbjct: 189 LAVTDQSILGLNHMYTE----------------------------FGLDPEGILSTIPAV 220

Query: 325 LSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSF 379
               IG + G +L+  K +  R+ H       L II  IL F+  +     PINK+++S 
Sbjct: 221 CHVLIGFYCGKILMETKDNQQRMLH-------LFIIGAILTFSGFLLSYGCPINKKIWSP 273

Query: 380 SYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAWSTRYPS 422
           ++   T G     LG T   F+  + W      ++ WS  + S
Sbjct: 274 TFELTTCG-----LGAT---FLSLLIWIIDVKGYKKWSVFFES 308


>gi|404485250|ref|ZP_11020448.1| hypothetical protein HMPREF9448_00860 [Barnesiella intestinihominis
           YIT 11860]
 gi|404338685|gb|EJZ65130.1| hypothetical protein HMPREF9448_00860 [Barnesiella intestinihominis
           YIT 11860]
          Length = 390

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 97/356 (27%), Positives = 163/356 (45%), Gaps = 90/356 (25%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           +KR+ +LD  RG+T+  MI+V++ G     YA + H+ W G T  D V PFF+FI+G++ 
Sbjct: 6   NKRLLSLDILRGITIAGMIMVNNPGSWGYIYAPLGHAEWIGLTPTDLVFPFFMFIMGIST 65

Query: 117 ALALKKVP-KINGAVK-KIIFRT-------LKLLFWGIILQGGYSHAPDALSY----GVD 163
            ++L+K   +++GAV  KII RT       L + ++G+ ++  +    ++LS+    G  
Sbjct: 66  YMSLRKFDFRLSGAVAWKIIRRTIVIFAIGLAIAWFGLTMRTYHQLGEESLSFFERLGRS 125

Query: 164 M---KHIRWCGILQRIALVYVVVALIETLTT-KRRPNVLEPRHLSIFTAYQWQWIGGFIA 219
           M    HIR  G++ R+A+ Y V A I  +   K  P+++                   IA
Sbjct: 126 MWNFDHIRILGVMPRLAICYGVAAFIALIVKHKYIPHIVSVT---------------LIA 170

Query: 220 FVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLY 279
           + + +IT        + FSE                       N +  VDR + G NH+Y
Sbjct: 171 YFVILITG-----KGFEFSED----------------------NIISVVDRAILGSNHMY 203

Query: 280 SDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIH 339
            D              N   L            +PEGLLSTI +I    +GI  G +++ 
Sbjct: 204 HD--------------NGLAL------------DPEGLLSTIPSICHVLVGIFCGGLIMR 237

Query: 340 FKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
            K ++ R+++    G  L    ++L +    PI+K+++S ++V  T G A   L +
Sbjct: 238 TKDNAVRMQNLFIAGTILTFAGLLLEY--GCPISKKIWSPTFVLTTCGLAASSLAL 291


>gi|395803976|ref|ZP_10483217.1| hypothetical protein FF52_18915 [Flavobacterium sp. F52]
 gi|395433620|gb|EJF99572.1| hypothetical protein FF52_18915 [Flavobacterium sp. F52]
          Length = 423

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 100/362 (27%), Positives = 159/362 (43%), Gaps = 71/362 (19%)

Query: 61  KRVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ +LD FRGLT++LM +V+   D G  Y  + H+ WNGCT  D V PFF+FI+GVA+ 
Sbjct: 4   ERLISLDVFRGLTILLMTIVNNPGDWGNVYPPLLHAHWNGCTPTDLVFPFFIFIMGVAVP 63

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
           LA+ +         KI+ R+L++   GI     ++       +G+D   +    +L R+ 
Sbjct: 64  LAMPEKKYDETTFNKILIRSLRMFCLGIF----FNFFGKIQLFGLDGIPL----LLVRLI 115

Query: 178 LVYVV-VALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           + + V  AL+   + K +         +IF          F    IYII  Y        
Sbjct: 116 ITFAVGYALMGNFSNKLK---------NIF---------AFSILAIYIILAYG------G 151

Query: 237 FSEHSD---HGVKKYI-----------VKCGMRGHLGPACNAV-GYVDRELWGINHLYSD 281
           F  ++D    GV + I           +K   +  L      + GY     W I  L   
Sbjct: 152 FENYADVRLPGVLQRIAIVYFVVSLLYLKTSRKTQLFTGIVLLFGY-----WAIMTLVPV 206

Query: 282 P--------VWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHY 333
           P          + L A   S    G +  +  +W     +PEG+LSTI +I++G IG+  
Sbjct: 207 PGIGEANLERGTNLAAWVDSVLLKGHMYHETNTW-----DPEGILSTIPSIVNGIIGLFI 261

Query: 334 GHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCL 393
           G +L+       + +    +G  L+   +I       PINK +++ SYV +T G A  CL
Sbjct: 262 GQILLLNITKIQKAQRMGMIGTSLIFFGLIWDL--VFPINKSIWTSSYVLYTTGLATVCL 319

Query: 394 GI 395
            +
Sbjct: 320 TV 321


>gi|384417772|ref|YP_005627132.1| membrane protein [Xanthomonas oryzae pv. oryzicola BLS256]
 gi|353460685|gb|AEQ94964.1| membrane protein, putative [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 388

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 103/346 (29%), Positives = 149/346 (43%), Gaps = 80/346 (23%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
           K +R  +LD FRGLT+ LMILV+ AG    AYA++ H+ W G TLAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 116 IALAL-KKVPKIN--GAVKK---IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRW 169
           ++ AL    P +   G V K   +I     L++W       +   PD       +  +R 
Sbjct: 78  MSFALATNTPHLQFLGRVSKRAALIVLCGVLMYWFPF----FHLQPDGGWAFTTVDQLRL 133

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRI L Y+  AL   L     P  + P  +++   Y W                  
Sbjct: 134 TGVLQRIGLCYLAAAL---LVRYLPPRSIAPACVALLLGY-WA----------------L 173

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           LYV     +E S  G                  NA   +D  L+G  HLY          
Sbjct: 174 LYVFGQPGAELSKTG------------------NAGTCLDLWLYGREHLY---------- 205

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                      R+D        F+PEGLL T+SA ++   G   G  L      +A  + 
Sbjct: 206 -----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFLQRHGKTTASTRS 248

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
            +  G G++++A++  +  A P++K+L+S S+V  T G     LG+
Sbjct: 249 LLLAGVGMVLLALL--WAPAWPLSKKLWSGSFVACTVGLDLLALGV 292


>gi|255013110|ref|ZP_05285236.1| putative transmembrane protein [Bacteroides sp. 2_1_7]
 gi|256838332|ref|ZP_05543842.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|410102572|ref|ZP_11297498.1| hypothetical protein HMPREF0999_01270 [Parabacteroides sp. D25]
 gi|423333958|ref|ZP_17311739.1| hypothetical protein HMPREF1075_03390 [Parabacteroides distasonis
           CL03T12C09]
 gi|256739251|gb|EEU52575.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|409226793|gb|EKN19699.1| hypothetical protein HMPREF1075_03390 [Parabacteroides distasonis
           CL03T12C09]
 gi|409238644|gb|EKN31435.1| hypothetical protein HMPREF0999_01270 [Parabacteroides sp. D25]
          Length = 368

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 95/356 (26%), Positives = 160/356 (44%), Gaps = 79/356 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           + Q+S R+ +LD  RG+T+  MILV++ G     Y  ++H+ WNG T  D V PFF+FI+
Sbjct: 1   MAQQSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIM 60

Query: 113 GVAIALALKKVP-KING-AVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 170
           GV++  +L+K   K++  +V K++ RT+ +   G+ L   + H     +   D +++R  
Sbjct: 61  GVSMFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN-LFGHV--CYNGFTDFQNLRIL 117

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G++QR+AL Y   +LI                L+I   Y  Q   G + F   ++     
Sbjct: 118 GVMQRLALAYGFGSLIG---------------LAINHKYILQVAAGILIFYWALLG---- 158

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
           +  +   SE S                       +  VDR L+G +H+Y D         
Sbjct: 159 FTHSMEMSEDS----------------------IIAIVDRTLFGTSHMYHD--------- 187

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                       D     R  F+PEGLLS I +I    +G + G V+   K ++  +   
Sbjct: 188 ------------DMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVIQDCKKNNELIIRN 235

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
           + + FG +I+      +   PINK+++S ++V  T G A        S+F+  + W
Sbjct: 236 IFI-FGTIILFAGFLLSYGCPINKKIWSSTFVLVTCGFA--------SLFLALLIW 282


>gi|150009610|ref|YP_001304353.1| transmembrane protein [Parabacteroides distasonis ATCC 8503]
 gi|262383102|ref|ZP_06076239.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|298374002|ref|ZP_06983960.1| membrane protein [Bacteroides sp. 3_1_19]
 gi|149938034|gb|ABR44731.1| putative transmembrane protein [Parabacteroides distasonis ATCC
           8503]
 gi|262295980|gb|EEY83911.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|298268370|gb|EFI10025.1| membrane protein [Bacteroides sp. 3_1_19]
          Length = 368

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 95/356 (26%), Positives = 160/356 (44%), Gaps = 79/356 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           + Q+S R+ +LD  RG+T+  MILV++ G     Y  ++H+ WNG T  D V PFF+FI+
Sbjct: 1   MAQQSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIM 60

Query: 113 GVAIALALKKVP-KING-AVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 170
           GV++  +L+K   K++  +V K++ RT+ +   G+ L   + H     +   D +++R  
Sbjct: 61  GVSMFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN-LFGHV--CYNGFTDFQNLRIL 117

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G++QR+AL Y   +LI                L+I   Y  Q   G + F   ++     
Sbjct: 118 GVMQRLALAYGFGSLIG---------------LAINHKYILQVAAGILIFYWALLG---- 158

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
           +  +   SE S                       +  VDR L+G +H+Y D         
Sbjct: 159 FTHSMEMSEDS----------------------IIAIVDRTLFGTSHMYHD--------- 187

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                       D     R  F+PEGLLS I +I    +G + G V+   K ++  +   
Sbjct: 188 ------------DMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVIQDCKKNNELIIRN 235

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
           + + FG +I+      +   PINK+++S ++V  T G A        S+F+  + W
Sbjct: 236 IFI-FGTIILFAGFLLSYGCPINKKIWSSTFVLVTCGFA--------SLFLALLIW 282


>gi|294667090|ref|ZP_06732315.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292603100|gb|EFF46526.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 388

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 98/336 (29%), Positives = 146/336 (43%), Gaps = 78/336 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            K +R  +LD FRGLT+ LMILV+ AG    AYA++ H+ W G TLAD V P FLF VG 
Sbjct: 17  SKRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGS 76

Query: 115 AIALAL-KKVPKIN--GAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRW 169
           A++ AL    P +   G V K   R   +L  G+++     +   PD       +  +R 
Sbjct: 77  AMSFALATNTPHLQFLGRVSK---RAALILLCGVLMYWFPFFHLQPDGGWSFTTVDQLRL 133

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRI L Y+  AL   L     P  + P  L++   Y W  +             Y+
Sbjct: 134 TGVLQRIGLCYLAAAL---LVRYLPPRGIAPVCLALLLGY-WALL-------------YA 176

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
              P    S+  + G +                     +D  L+G +HLY          
Sbjct: 177 FGQPGAELSKTGNAGTR---------------------LDLWLYGRDHLY---------- 205

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                      R+D        F+PEGLL T+SA ++   G   G  L       A  + 
Sbjct: 206 -----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFLQRHGKTVASTRS 248

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFT 385
            +  G GL+++A++  +  A P++K+L+S S+V  T
Sbjct: 249 LLLAGAGLVVLALL--WAPAWPLSKKLWSGSFVACT 282


>gi|294674520|ref|YP_003575136.1| hypothetical protein PRU_1851 [Prevotella ruminicola 23]
 gi|294472648|gb|ADE82037.1| putative membrane protein [Prevotella ruminicola 23]
          Length = 357

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 92/341 (26%), Positives = 155/341 (45%), Gaps = 86/341 (25%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDA-GGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
            +SKR+ +LD  RG+TV  MILV++  G ++  + HS WNG T  D V PFFLFI+G++ 
Sbjct: 1   MESKRLLSLDILRGITVAGMILVNNGWGESFEMLRHSKWNGMTPCDLVFPFFLFIMGISC 60

Query: 117 ALAL-----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHA--PDALSYGVDMKHIRW 169
            L+L     K  P++   +++I+ RT+ L   G+ +   + HA   D L +G    H+R 
Sbjct: 61  YLSLVKSEFKPTPQV---IRRIVKRTVLLFAIGLFIN-WFDHAIEGDLLCFG----HLRI 112

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
             ++QRIAL Y +V+L       +               Y    IGG +A     I T  
Sbjct: 113 WAVMQRIALCYGIVSLFALFCNHK---------------YTLSVIGGLLA-----IYTAI 152

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           L + N  ++E ++                    N +   D +L+G +H+Y          
Sbjct: 153 LILGN-GYAEDAN-------------------VNVLAQADLKLFGYDHIYH--------- 183

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                              ++P +PEGL+ TIS++    +G + G ++   +    ++  
Sbjct: 184 -------------------KSPVDPEGLMGTISSVAHVLLGFYCGMLIRKRETVEQKVIA 224

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
              +G   +I   +L +   +P+NK+++S SYV  T G A 
Sbjct: 225 LFVVGAVCVIGGYLLSY--GLPLNKRIWSPSYVLMTCGLAS 263


>gi|317478517|ref|ZP_07937676.1| hypothetical protein HMPREF1007_00792 [Bacteroides sp. 4_1_36]
 gi|316905331|gb|EFV27126.1| hypothetical protein HMPREF1007_00792 [Bacteroides sp. 4_1_36]
          Length = 394

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 100/364 (27%), Positives = 153/364 (42%), Gaps = 96/364 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            + SKR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G
Sbjct: 4   TKVSKRILALDILRGVTIAGMIMVNNPGSWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMG 63

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSH-------APDALSYGVDM 164
           ++  ++LKK      + A  KI+ RT+ +   G+ + G +S        APD LS+G  +
Sbjct: 64  ISTYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKL 122

Query: 165 -------KHIRWCGILQRIALVYVVVALIE-TLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                    IR  G++QR+AL Y   ++I  T+  K  P ++      + T Y       
Sbjct: 123 WASVWTFDRIRILGVMQRLALCYGAASIIALTMKHKHIPYLIA----GLLTGY------- 171

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
           FI                              ++ CG  G      N +  VDR +    
Sbjct: 172 FI------------------------------LLMCG-NGFAYNETNILSVVDRAILTPA 200

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEGLLSTI AI    +G   G +
Sbjct: 201 HMYKD----------------------------NGIDPEGLLSTIPAIAHVLLGFCVGRL 232

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAADG 391
           ++       R     S    LL++ +IL F+  +     PINK+++S +YV  T G A  
Sbjct: 233 MLDGNKSEDRASFLNSQLITLLLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASS 292

Query: 392 CLGI 395
            L +
Sbjct: 293 FLAL 296


>gi|260061394|ref|YP_003194474.1| hypothetical protein RB2501_07335 [Robiginitalea biformata
           HTCC2501]
 gi|88785526|gb|EAR16695.1| hypothetical protein RB2501_07335 [Robiginitalea biformata
           HTCC2501]
          Length = 382

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 80/147 (54%), Gaps = 7/147 (4%)

Query: 43  QDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCT 99
            ++Q+     ++  +   +R+ ++D FRGLT+ LMILV+  G     YA   H+ W+G T
Sbjct: 5   NNDQRERTNPEKQTKAMKERIVSVDIFRGLTIALMILVNTPGTWEAVYAPFRHAEWHGYT 64

Query: 100 LADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS 159
             D V PFFLFIVG +I  A +         +KII RTLKL+  GI L G ++  P    
Sbjct: 65  PTDLVFPFFLFIVGTSIVFAYRNKQPDAATHRKIIVRTLKLILLGIFL-GAFTVEPPFFE 123

Query: 160 YGVDMKHIRWCGILQRIALVYVVVALI 186
                  IR+ G+LQRI +V+   AL+
Sbjct: 124 ---PFSEIRFPGVLQRIGVVFFAAALL 147



 Score = 41.2 bits (95), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 48/84 (57%), Gaps = 4/84 (4%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 371
           ++PEGLLST+ AI S  +GI  G VL+  + +  +   W+ +    L+ A  + +    P
Sbjct: 208 YDPEGLLSTLPAIASALLGIFTGRVLVSDRANKTQ---WMLLAGAALLAAGSI-WGLVFP 263

Query: 372 INKQLYSFSYVCFTAGAADGCLGI 395
           +NK L++ S+V  TAG A+  L +
Sbjct: 264 VNKALWTSSFVLVTAGWANLLLAL 287


>gi|294627662|ref|ZP_06706244.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292598014|gb|EFF42169.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 388

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 98/336 (29%), Positives = 146/336 (43%), Gaps = 78/336 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            K +R  +LD FRGLT+ LMILV+ AG    AYA++ H+ W G TLAD V P FLF VG 
Sbjct: 17  SKRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGS 76

Query: 115 AIALAL-KKVPKIN--GAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRW 169
           A++ AL    P +   G V K   R   +L  G+++     +   PD       +  +R 
Sbjct: 77  AMSFALATNTPHLQFLGRVSK---RAALILLCGVLMYWFPFFHLQPDGGWSFTTVDQLRL 133

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRI L Y+  AL   L     P  + P  L++   Y W  +             Y+
Sbjct: 134 TGVLQRIGLCYLAAAL---LVRYLPPRGIAPVCLALLLGY-WALL-------------YA 176

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
              P    S+  + G +                     +D  L+G +HLY          
Sbjct: 177 FGQPGAELSKTGNAGTR---------------------LDLWLYGRDHLY---------- 205

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                      R+D        F+PEGLL T+SA ++   G   G  L       A  + 
Sbjct: 206 -----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFLQRQGKTVASTRS 248

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFT 385
            +  G GL+++A++  +  A P++K+L+S S+V  T
Sbjct: 249 LLLAGAGLVVLALL--WAPAWPLSKKLWSGSFVACT 282


>gi|428300562|ref|YP_007138868.1| hypothetical protein Cal6303_3980 [Calothrix sp. PCC 6303]
 gi|428237106|gb|AFZ02896.1| hypothetical protein Cal6303_3980 [Calothrix sp. PCC 6303]
          Length = 402

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 111/361 (30%), Positives = 158/361 (43%), Gaps = 97/361 (26%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGA----YARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           R+ +LD FRG T+  MILV+    A    Y  + H+ W+GCTLAD V PFFLFIVGVA++
Sbjct: 2   RLTSLDVFRGATIAGMILVNMVSLAEPNVYPALLHADWHGCTLADLVFPFFLFIVGVAMS 61

Query: 118 LALKK----VPKIN------GAVK-----------------KIIFRTLKLLF-WGIILQG 149
            +  K    +PK+       GA++                 K IFR   +LF  G+ L  
Sbjct: 62  FSFAKYTDVIPKVEKEKDAIGALQQFLAKESSAAGGAKPPYKKIFRRGAILFALGLFLNL 121

Query: 150 GYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAY 209
            ++     L Y  D   +R  G+LQRIAL Y+  +LI     K+                
Sbjct: 122 FWNS--KNLPY-FDFSTLRIMGVLQRIALTYLFASLIVLKLPKKA--------------- 163

Query: 210 QWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVD 269
             QWI   +  V Y +    +YVP        ++G  +   + G         N   Y+D
Sbjct: 164 --QWIVAGVLLVGYWLLM--MYVP------IPEYGAGEIGTRTG---------NFAAYID 204

Query: 270 RELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTI 329
           R +    HLY    ++          N G              +PEGL STI AI+S  +
Sbjct: 205 RFIIPKAHLYKGDGFN----------NFG--------------DPEGLFSTIPAIVS-VL 239

Query: 330 GIHYGHVLIHFKGHSARLKHWVSM-GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGA 388
           G ++    I  +    R    +++ G G LIIA +  F    PINK+L++ SYV FT G 
Sbjct: 240 GGYFSGQWIRSQPVKTRTSLGLALAGIGCLIIAWL--FALDFPINKKLWTSSYVIFTTGW 297

Query: 389 A 389
           A
Sbjct: 298 A 298


>gi|421090259|ref|ZP_15551055.1| putative membrane protein [Leptospira kirschneri str. 200802841]
 gi|421129053|ref|ZP_15589263.1| putative membrane protein [Leptospira kirschneri str. 2008720114]
 gi|410000994|gb|EKO51618.1| putative membrane protein [Leptospira kirschneri str. 200802841]
 gi|410359757|gb|EKP06816.1| putative membrane protein [Leptospira kirschneri str. 2008720114]
          Length = 369

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 102/358 (28%), Positives = 162/358 (45%), Gaps = 89/358 (24%)

Query: 66  LDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK 122
           +D FRG+TV  MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG++I L++  
Sbjct: 1   MDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGISIQLSVYS 60

Query: 123 VPKINGAVKKIIF----RTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGILQRI 176
             KI+ +  KI F    R++ L+  G+ L   G +S +            +R  G+LQRI
Sbjct: 61  KNKIHKS--KIWFGICIRSITLILIGLFLNFFGEWSFS-----------ELRIPGVLQRI 107

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
             VY +VA +  +  KR                           +I I     L V  W 
Sbjct: 108 GFVYWIVASLHLILPKR---------------------------MILISWIPILLVHTWV 140

Query: 237 FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPN 296
             +    G  + IV      +L P  +   ++DR ++G NHL+                 
Sbjct: 141 LIQIPAPG--ESIV------YLEPGKDIGAWIDRNVFGENHLW----------------- 175

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFG 356
                + + +W     +PEGL S IS+I +  +G+  G +L   K +  + +     GFG
Sbjct: 176 -----KFSKTW-----DPEGLFSGISSIATSLLGVFCGSILSS-KTNEIKKQILSIFGFG 224

Query: 357 LLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA---DGCLGITNSIFIPEMDWDECDA 411
           +L++ + L +   +P+NK L++ SYV +TAG A    GC  + N   +    WD+  +
Sbjct: 225 ILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFLSIGCFELLN-FLLRIKKWDQLQS 281


>gi|300865789|ref|ZP_07110543.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300336202|emb|CBN55698.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 376

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 102/344 (29%), Positives = 147/344 (42%), Gaps = 76/344 (22%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+ +  MILV++ G     Y  +DH+ WNGCT  D + PFFLF VG A++ 
Sbjct: 2   RLTSLDVFRGIAIASMILVNNPGSWDYVYPPLDHAEWNGCTPTDLIFPFFLFAVGAAMSF 61

Query: 119 ALKKVPKINGAVKKIIFRTLK---LLFWGIILQGGYSHAPDALSYGVDMKH---IRWCGI 172
           +L K  + N  +  + +R L+   LLF   +L   +S   D L  G  +++   IR  G+
Sbjct: 62  SLSKYTEENPPISTVYWRILRRATLLFLLGLLLNSFSIFLDVLLNGSPIENFGKIRILGV 121

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQRI+L Y + A I  L        L  R+L I  A                    +L +
Sbjct: 122 LQRISLAYFLAA-IAILN-------LSSRNLRILAA--------------------TLLL 153

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGH-LGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
             W        G    I   G   + L P  N   Y+DR + G  HLY            
Sbjct: 154 GYW--------GALTLIPVPGYGANLLTPEGNLGAYIDRLILGTQHLYR----------- 194

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV 351
                            +  ++PE LL T+ AI +   G ++    +  +    R   W 
Sbjct: 195 -----------------QGQYDPESLLGTLPAIATVLAG-YFTTQWLRVQPIKTR-TTWN 235

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
            + FGL  + I   +    PINKQL++ SYV  TAG A   L I
Sbjct: 236 LVIFGLASLTIGQLWGFWFPINKQLWTSSYVLLTAGWAILLLAI 279


>gi|374580713|ref|ZP_09653807.1| hypothetical protein DesyoDRAFT_2145 [Desulfosporosinus youngiae
           DSM 17734]
 gi|374416795|gb|EHQ89230.1| hypothetical protein DesyoDRAFT_2145 [Desulfosporosinus youngiae
           DSM 17734]
          Length = 375

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 93/340 (27%), Positives = 154/340 (45%), Gaps = 80/340 (23%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAY---ARIDHSPWNGCTLADFVMPFFLFIV 112
           +++   R   +D FRGLT+ LM++  + G      A++ H+ WNG T+ DFV PFF+F +
Sbjct: 1   MEKGKLRFDCIDIFRGLTISLMLICSNPGNITNIPAQLRHADWNGATIGDFVFPFFIFSM 60

Query: 113 GVAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           G+ + +A+ +  +   +  +II     R++ +   G+IL G  +          D+  IR
Sbjct: 61  GIVVPIAINRRLEKGISQMRIIINVLNRSIVMFLLGLILNGFPTF---------DLAIIR 111

Query: 169 WCGILQRIALVYVVVALIETL-TTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227
             G+LQRIA+VY   ALI  L  +  + ++++              IG     ++ +I  
Sbjct: 112 VPGVLQRIAIVYFCSALIYLLFKSIVKKDLVQ--------------IG-----ILTLIAV 152

Query: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287
             L +  W        G++      G++G L      V Y+D +     HLY+       
Sbjct: 153 LLLAIYYWLLKGLQVPGIE------GLKGGL------VSYIDLKYLK-GHLYT------- 192

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL 347
                            P+     F+PEG+LSTI A+ SG IG+  G + +       ++
Sbjct: 193 -----------------PT-----FDPEGILSTIPALSSGIIGVVVGMIFLRRDSRFVKM 230

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
             +V  G  L+I A    F    P NKQL+S S+V  T+G
Sbjct: 231 TIFVCSGILLIIFA--EWFNAYFPYNKQLWSSSFVLLTSG 268


>gi|418737426|ref|ZP_13293823.1| putative membrane protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
 gi|410746620|gb|EKQ99526.1| putative membrane protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
          Length = 383

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 153/363 (42%), Gaps = 83/363 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            Q   R+ +LD FRG+TVV MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG
Sbjct: 6   TQNKDRILSLDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVG 65

Query: 114 VAIALALKKVPKIN--GAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRW 169
            +I ++L     IN       I  R + L+  G+ L   G ++ +            +R 
Sbjct: 66  TSIPISLYSKNGINRIRVWIGICIRGISLILLGLFLNFFGEWTFS-----------ELRI 114

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRI  VY VVA +  +   ++                       + F+I I     
Sbjct: 115 PGVLQRIGFVYWVVATLFLIFPGKKV----------------------LVFLIPI----- 147

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           L V  W  +  +  G     ++ G         +   ++DR ++G  HL+          
Sbjct: 148 LLVHTWILTHIAPPGESMVSLEQGK--------DIGAWIDRTIFGEKHLW---------- 189

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                       + + +W     +PEG LS I++I +   G+  G +L   +G     + 
Sbjct: 190 ------------KFSKTW-----DPEGFLSGIASIATSLFGVICGFILFRREGRGKN-RV 231

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSI--FIPEMDWD 407
               G G L   + L +  ++P+NK L++ SY  +T G +  C+G    +   I    W+
Sbjct: 232 LSIFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGLSFLCIGFFEYLDSLILLKKWN 291

Query: 408 ECD 410
           + D
Sbjct: 292 DLD 294


>gi|431796483|ref|YP_007223387.1| hypothetical protein Echvi_1106 [Echinicola vietnamensis DSM 17526]
 gi|430787248|gb|AGA77377.1| hypothetical protein Echvi_1106 [Echinicola vietnamensis DSM 17526]
          Length = 369

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 97/337 (28%), Positives = 149/337 (44%), Gaps = 83/337 (24%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+  LD FRG+T+  MILV++ G     YA + H+ W+GCT  D + PFFLFIVGVAI 
Sbjct: 4   QRILALDVFRGITIFAMILVNNPGSWSHVYAPLLHAKWHGCTPTDLIFPFFLFIVGVAIE 63

Query: 118 LAL-----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           L+L     K  PK    ++K + R LK          G      A+ Y  D+ H+R+ G+
Sbjct: 64  LSLGGQLKKGTPK-GFLLRKSLIRALK--------LIGLGLLLTAIPY-FDLAHLRFPGV 113

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQRI L    V  I T+          P+ L   +          I  + Y +    + V
Sbjct: 114 LQRIGL----VYFISTVMYLYW----SPKALVFSSG---------ILLIGYWLCMTFIPV 156

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
           P          G+           +L P  N   ++D+++       +  +WS+ +    
Sbjct: 157 P----------GIGP--------ANLEPGTNLAAWIDQQV------LTGHMWSQTKT--- 189

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                              ++PEGL ST+ AI++  +G+  G +L     H ARL  W  
Sbjct: 190 -------------------WDPEGLFSTLPAIVTCLLGVACGKILTGNSSHKARLTKWGI 230

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            G  L+     L ++   P+NK L++ S+V +TAG A
Sbjct: 231 AGVTLVFGG--LAWSLFFPLNKALWTSSFVLYTAGWA 265


>gi|71278983|ref|YP_267171.1| hypothetical protein CPS_0413 [Colwellia psychrerythraea 34H]
 gi|71144723|gb|AAZ25196.1| putative membrane protein [Colwellia psychrerythraea 34H]
          Length = 358

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 100/342 (29%), Positives = 147/342 (42%), Gaps = 93/342 (27%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R   LDAFRG+T+ LMILV+  G     YA + H+ W+G T  D V PFFLFI+G A+  
Sbjct: 3   RYLALDAFRGITIALMILVNTPGTWSHVYAPLLHAEWDGATPTDLVFPFFLFIIGSAMFF 62

Query: 119 ALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
           + KK          +KII R   + F G +L        + + + V+ +  R  GILQRI
Sbjct: 63  SFKKSNFSASPEQFRKIIKRGFIMFFIGFML--------NVIPFTVNAEDWRIMGILQRI 114

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
            + Y V A +  LT  R                     G FIA  + ++  ++L      
Sbjct: 115 GIAYTVAACL-VLTLNRT--------------------GVFIASAVILLAYWAL------ 147

Query: 237 FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPN 296
                       ++  G  G L    N +  +D  ++G NH+Y+                
Sbjct: 148 ------------LLSMG-EGALTIEGNIIRQLDLAVFGANHMYT---------------- 178

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR-----LKHWV 351
              +R  A       FEPEGLLSTI AI++  +G      L   +   +      L   +
Sbjct: 179 ---MRGVA-------FEPEGLLSTIPAIVNMLLGFELTRYLTSIEDKRSSVIKLTLIGGL 228

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCL 393
           ++GFG L       +   +PINK L++ SYV ++ G A  CL
Sbjct: 229 AVGFGAL-------WGLVLPINKSLWTPSYVIYSTGFA--CL 261


>gi|284040246|ref|YP_003390176.1| hypothetical protein Slin_5410 [Spirosoma linguale DSM 74]
 gi|283819539|gb|ADB41377.1| Protein of unknown function DUF2261, transmembrane [Spirosoma
           linguale DSM 74]
          Length = 385

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 98/333 (29%), Positives = 147/333 (44%), Gaps = 76/333 (22%)

Query: 60  SKRVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + R+ +LD FRGLTV  MILV+   D G  YA ++H+PW+G T  D + PFFLFIVGV+I
Sbjct: 20  TSRLLSLDFFRGLTVAAMILVNNPGDWGHIYAPLEHAPWHGWTPTDLIFPFFLFIVGVSI 79

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
             AL+      G V KI+ R++ L     +L    +  P       D+  +R  G+LQRI
Sbjct: 80  TFALEGGKSKKGVVGKIVKRSVTL----FLLGLFLNFFPK-----FDITLVRIPGVLQRI 130

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           A+VY+V +LI   T  R       + L I            I  + Y +    + VP   
Sbjct: 131 AVVYLVCSLIFLKTNSR-------QQLYILV----------IVLIGYWLLMTVVPVPGVG 173

Query: 237 FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPN 296
           ++                  +L PA N   + D  +    H+Y                 
Sbjct: 174 YA------------------NLEPATNLAAWFDYTILTPAHVY----------------- 198

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFG 356
                + A +W     +PEG+LST+ A+ +G IG+  G  L   +  + ++  W+    G
Sbjct: 199 -----KPAKTW-----DPEGVLSTLPAVGTGLIGMLVGTWLRSSRPVADKVA-WL-FATG 246

Query: 357 LLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
            L     + +    PINK L++ SYV    G A
Sbjct: 247 CLATLGGVMWDGFFPINKALWTSSYVLLAGGLA 279


>gi|375149929|ref|YP_005012370.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361063975|gb|AEW02967.1| Protein of unknown function DUF2261, transmembrane [Niastella
           koreensis GR20-10]
          Length = 392

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 97/365 (26%), Positives = 146/365 (40%), Gaps = 93/365 (25%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           KR  +LD FRG TV  MILV++ G     YA ++H+PW+GCT  D V PFFLF VG A+A
Sbjct: 3   KRFYSLDVFRGATVAFMILVNNPGSWSNLYAPLEHAPWHGCTPTDLVFPFFLFAVGNALA 62

Query: 118 LALKKVPK--INGAVKKIIFRTLKLLFWGIILQGG--YSHAPDALSYGV--------DMK 165
             + ++ +      +KK+I R+  +   G  L          D L++          ++ 
Sbjct: 63  FVMPRLQEAGTTAFLKKVITRSFLIFLIGFFLNWSPFIRWDNDHLTFKAWEYAGANGNLI 122

Query: 166 HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYII 225
            IR  G+LQRIAL Y   +LI      R                     G F       +
Sbjct: 123 GIRILGVLQRIALCYFFASLIIYFFKIR---------------------GAF-------V 154

Query: 226 TTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWS 285
           + + L +  W       +    Y     + G+ G        VD+ + G +H+Y      
Sbjct: 155 SAFVLLLGYWVLCMFFGNAADPY----SLNGYFGLG------VDKAILGESHMYHGE--- 201

Query: 286 RLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA 345
                                     F+PEG+ ST++AI+    G   G  +     +  
Sbjct: 202 -----------------------GVAFDPEGITSTLTAIVQVIFGYFVGFYIQQKGKNFE 238

Query: 346 RLKHWVSMGFGLLIIAIILHFTN-----AIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
            L H       L +   IL FT        PINK++++ SYV +T G A   L ++  IF
Sbjct: 239 MLSH-------LFVAGCILIFTGYAWDMMFPINKKIWTSSYVLYTTGLA--ILVLSLCIF 289

Query: 401 IPEMD 405
           + E  
Sbjct: 290 LIEFK 294


>gi|301307595|ref|ZP_07213552.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|423337400|ref|ZP_17315144.1| hypothetical protein HMPREF1059_01069 [Parabacteroides distasonis
           CL09T03C24]
 gi|300834269|gb|EFK64882.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|409237229|gb|EKN30029.1| hypothetical protein HMPREF1059_01069 [Parabacteroides distasonis
           CL09T03C24]
          Length = 368

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 94/356 (26%), Positives = 160/356 (44%), Gaps = 79/356 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           + Q+S R+ +LD  RG+T+  MILV++ G     Y  ++H+ WNG T  D V PFF+FI+
Sbjct: 1   MAQQSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIM 60

Query: 113 GVAIALALKKVP-KING-AVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 170
           GV++  +L+K   K++  +V K++ RT+ +   G+ L   + H     +   D +++R  
Sbjct: 61  GVSMFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN-LFGHV--CYNGFTDFQNLRIL 117

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G++QR+AL Y   +LI                L+I   Y  Q   G + F   ++     
Sbjct: 118 GVMQRLALAYGFGSLIG---------------LAINHKYILQVAAGILIFYWALLG---- 158

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
           +  +   SE S                       +  VD+ L+G +H+Y D         
Sbjct: 159 FTHSMEMSEDS----------------------IIAIVDKALFGTSHMYHD--------- 187

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                       D     R  F+PEGLLS I +I    +G + G V+   K ++  +   
Sbjct: 188 ------------DMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVIQDCKKNNELIIRN 235

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
           + + FG +I+      +   PINK+++S ++V  T G A        S+F+  + W
Sbjct: 236 IFI-FGTIILFAGFLLSYGCPINKKIWSSTFVLVTCGFA--------SLFLALLIW 282


>gi|405957484|gb|EKC23691.1| Heparan-alpha-glucosaminide N-acetyltransferase [Crassostrea gigas]
          Length = 1901

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 81/302 (26%), Positives = 137/302 (45%), Gaps = 55/302 (18%)

Query: 108  FLFIVGVAIALA----LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
            F++I+G A+A +    L++V   +    KI  R+  L   G+++  G          G D
Sbjct: 1552 FIWIMGTAMAYSFTGMLRRVTPKHKIFWKIFKRSCILFVLGLLVNTG----------GCD 1601

Query: 164  ---MKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQW--------- 211
               + H+R  G+LQR A  Y+VVA I     K          +S++T + +         
Sbjct: 1602 PTRLTHLRIPGVLQRFAGTYLVVASIHMFFAKTV-------DVSMYTYWGFIRDIVDFWL 1654

Query: 212  QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRE 271
            +WI   +   ++II T++L VP          G+ + +       +      A GY+DR+
Sbjct: 1655 EWILHIVFVTVHIIITFTLDVPGCGKGYIGPGGLHEAVNSTEASVYQNCTGGAAGYIDRQ 1714

Query: 272  LWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGI 331
            ++G +H+Y  P    +   T+                  P++PEGLL T++++    +G+
Sbjct: 1715 VFGDDHIYQSPTCKPIYKTTV------------------PYDPEGLLGTLNSVFMCYLGL 1756

Query: 332  HYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL----HFTNAIPINKQLYSFSYVCFTAG 387
              G +L+ FK  SAR+K ++  G  L +IA  L         IP+NK L+S S+V   AG
Sbjct: 1757 QAGKILMTFKEPSARVKRFLIWGLFLCLIAGALCGFKKDGGTIPLNKNLWSLSFVLCMAG 1816

Query: 388  AA 389
             A
Sbjct: 1817 FA 1818


>gi|404404862|ref|ZP_10996446.1| transmembrane protein [Alistipes sp. JC136]
          Length = 382

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 100/359 (27%), Positives = 155/359 (43%), Gaps = 87/359 (24%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDA---GGAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            KS+R+ +LD  RG+T+  MILV++    G AYA + H+ W+G T  D + PFF+FI+GV
Sbjct: 1   MKSERLLSLDVMRGMTIAAMILVNNPAVWGKAYAPLQHAFWHGMTPTDLIYPFFVFIMGV 60

Query: 115 AIALALKKVPKING--AVKKIIFRTLKLLFWGIILQG----GYSHA--------PDALSY 160
           +   +L K  +  G  A  +I+ R+  +   G++LQ     GY  A         DA  +
Sbjct: 61  SAFFSLSKRYEGAGREAFSRILRRSAVIFGVGLLLQEISYFGYGTANFLSGQTSADATWF 120

Query: 161 GV--DMKHIRWCGILQRIALVYVV--VALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                 +  R  G+LQ +ALVY+    AL+           L  RHL +         GG
Sbjct: 121 ETVFPFRTFRIMGVLQGLALVYLFGSAALL----------CLRFRHLIVAA-------GG 163

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
            +   + ++ T +                          G+   A N +  VDR + G +
Sbjct: 164 LLILYLVLLQTGN--------------------------GYSLSADNIIAVVDRAVLGES 197

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           HLY                     RE  P   R  FEPEGLLST+  I    +G   G +
Sbjct: 198 HLY---------------------REWLPDGSRLAFEPEGLLSTLPRIAQFLLGCAAGRI 236

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           L+  +    R     + G  L    ++L + +  P+NK+++S S+   T+G A   LG+
Sbjct: 237 LLANEDAPMRFGRLFAFGTALFFTGLLLQYGD--PLNKKIWSSSFALATSGFASLLLGL 293


>gi|160887858|ref|ZP_02068861.1| hypothetical protein BACUNI_00261 [Bacteroides uniformis ATCC 8492]
 gi|156862688|gb|EDO56119.1| hypothetical protein BACUNI_00261 [Bacteroides uniformis ATCC 8492]
          Length = 394

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 99/364 (27%), Positives = 152/364 (41%), Gaps = 96/364 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            + SKR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G
Sbjct: 4   TKVSKRILALDILRGVTIAGMIMVNNPGSWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMG 63

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSH-------APDALSYGVDM 164
           ++  ++LKK      + A  KI+ RT+ +   G+ + G +S        APD LS+G  +
Sbjct: 64  ISTYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKL 122

Query: 165 -------KHIRWCGILQRIALVYVVVALIE-TLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                    IR  G++QR+AL Y   ++I  T+  K  P ++      + T Y       
Sbjct: 123 WASVWTFDRIRILGVMQRLALCYGAASIIALTMKHKHIPYLIA----GLLTGY------- 171

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
           FI                              ++ CG  G      N +  VDR +    
Sbjct: 172 FI------------------------------LLMCG-NGFAYNETNILSVVDRAILTPA 200

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEGLLSTI AI    +G   G +
Sbjct: 201 HMYKD----------------------------NGIDPEGLLSTIPAIAHVLLGFCVGRL 232

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAADG 391
           ++       R     S    L ++ +IL F+  +     PINK+++S +YV  T G A  
Sbjct: 233 MLDGNKSEDRASFLNSQLITLFLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASS 292

Query: 392 CLGI 395
            L +
Sbjct: 293 FLAL 296


>gi|270295536|ref|ZP_06201737.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270274783|gb|EFA20644.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 394

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 99/364 (27%), Positives = 152/364 (41%), Gaps = 96/364 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            + SKR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G
Sbjct: 4   TKVSKRILALDILRGVTIAGMIMVNNPGSWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMG 63

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSH-------APDALSYGVDM 164
           ++  ++LKK      + A  KI+ RT+ +   G+ + G +S        APD LS+G  +
Sbjct: 64  ISTYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKL 122

Query: 165 -------KHIRWCGILQRIALVYVVVALIE-TLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                    IR  G++QR+AL Y   ++I  T+  K  P ++      + T Y       
Sbjct: 123 WASVWTFDRIRILGVMQRLALCYGAASIIALTMKHKHIPYLIA----GLLTGY------- 171

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
           FI                              ++ CG  G      N +  VDR +    
Sbjct: 172 FI------------------------------LLMCG-NGFAYNETNILSVVDRAILTPA 200

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEGLLSTI AI    +G   G +
Sbjct: 201 HMYKD----------------------------NGIDPEGLLSTIPAIAHVLLGFCVGRL 232

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAADG 391
           ++       R     S    L ++ +IL F+  +     PINK+++S +YV  T G A  
Sbjct: 233 MLDGNKSEDRASFLNSQLITLFLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASS 292

Query: 392 CLGI 395
            L +
Sbjct: 293 FLAL 296


>gi|325299496|ref|YP_004259413.1| hypothetical protein [Bacteroides salanitronis DSM 18170]
 gi|324319049|gb|ADY36940.1| hypothetical protein Bacsa_2393 [Bacteroides salanitronis DSM
           18170]
          Length = 356

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 95/336 (28%), Positives = 143/336 (42%), Gaps = 82/336 (24%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYA--RIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
            +R+ +LD  RG+TV  MI+V++AGG Y+   + HS WNG T  D V PFFLFI+G++  
Sbjct: 2   KQRLLSLDVLRGITVFGMIVVNNAGGEYSYDSLRHSVWNGLTPCDLVFPFFLFIMGISTY 61

Query: 118 LALKK--VPKINGAVKKIIFRTLKLLF--WGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           +AL+K         ++KI+ RTL +    WG I    ++   D L +     HIR  G+L
Sbjct: 62  IALRKFQFQPSPAVLRKIVRRTLLIFLIGWG-IYWFEFACEGDFLPFA----HIRILGVL 116

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
            RIAL Y +V+L   L    RP  L              WI G       I+      + 
Sbjct: 117 PRIALCYGIVSL---LALYVRPKGLA-------------WIAG-------ILLLGYALLL 153

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
            W                    G+   + N +   D ++ G  HLY              
Sbjct: 154 QWG------------------NGYAMDSTNILAIWDTKVLGYEHLYH------------- 182

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
                          ++P +PEGL ST++AI    IG   G +++  +    ++      
Sbjct: 183 ---------------KSPVDPEGLTSTLAAIAHTIIGFLCGRLIMETQDLGQKIVKLFVA 227

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           GF L     +L     +P+NK+++S SY   T G A
Sbjct: 228 GFLLAAAGFLL--VEWLPLNKRIWSPSYALATCGMA 261


>gi|158337501|ref|YP_001518676.1| hypothetical protein AM1_4380 [Acaryochloris marina MBIC11017]
 gi|158307742|gb|ABW29359.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 383

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 97/337 (28%), Positives = 151/337 (44%), Gaps = 83/337 (24%)

Query: 66  LDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK 122
           LD FRG+ +  M+LV+ +G    AY ++ H+ W+G TLAD V PFFLF++G ++A ++ +
Sbjct: 13  LDVFRGIAIAGMLLVNKSGLVKDAYPQLQHADWHGWTLADLVFPFFLFVLGASMAFSMAR 72

Query: 123 ------VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
                  PK      KI+ R++ L   G+ L G +S+         ++  +R  GILQRI
Sbjct: 73  HTASLTQPK-RRVYLKILRRSVVLFGLGLFLNGFWSY---------NLSTLRVMGILQRI 122

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           +L Y+V AL+  L   R+               QW   G        ++  Y L +   S
Sbjct: 123 SLTYLVSALV-ILKLPRKS--------------QWGMTG-------LLLVGYWLAL---S 157

Query: 237 FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPN 296
           F    + G           G+L    N   YVDR + G +HLY    ++ +         
Sbjct: 158 FIPVPEFGA----------GNLTRTGNFGAYVDRLIIGSSHLYVGDQFNSMG-------- 199

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK----HWVS 352
                           +PEGL ST+ AI +  +G   G   I  +G   ++K        
Sbjct: 200 ----------------DPEGLFSTLPAIATVLLGYFAGD-WIRKRGSGLKIKTSRQSLAL 242

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
             +GL+   + L ++   PINK+L++ SYV FT G A
Sbjct: 243 ASYGLISTGLGLLWSIWFPINKKLWTSSYVLFTVGIA 279


>gi|421093382|ref|ZP_15554106.1| putative membrane protein [Leptospira borgpetersenii str.
           200801926]
 gi|410363365|gb|EKP14394.1| putative membrane protein [Leptospira borgpetersenii str.
           200801926]
          Length = 383

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 91/345 (26%), Positives = 147/345 (42%), Gaps = 81/345 (23%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            Q   R+ +LD FRG+TVV MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG
Sbjct: 6   TQNKDRILSLDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVG 65

Query: 114 VAIALALKKVPKIN--GAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRW 169
            +I ++L     IN       I  R + L+  G+ L   G ++ +            +R 
Sbjct: 66  ASIPISLYSKNGINRIRVWIGICIRGISLILLGLFLNFFGEWTFS-----------ELRI 114

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRI  VY VVA +  +   ++                       + F+I I     
Sbjct: 115 PGVLQRIGFVYWVVATLFLIFPGKKV----------------------LVFLIPI----- 147

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           L V  W  +  +  G     ++ G         +   ++DR ++G  HL+          
Sbjct: 148 LLVHTWILTHIAPPGESMVSLEQGK--------DIGAWIDRRIFGEKHLW---------- 189

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                       + + +W     +PEG LS I++I +   G+  G +L   +G     + 
Sbjct: 190 ------------KFSKTW-----DPEGFLSGIASIATSLFGVICGFILFRREGRGKN-RV 231

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
               G G L   + L +  ++P+NK L++ SY  +T G +  C+G
Sbjct: 232 LSIFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGLSFLCIG 276


>gi|160885454|ref|ZP_02066457.1| hypothetical protein BACOVA_03454 [Bacteroides ovatus ATCC 8483]
 gi|423290374|ref|ZP_17269223.1| hypothetical protein HMPREF1069_04266 [Bacteroides ovatus
           CL02T12C04]
 gi|156109076|gb|EDO10821.1| hypothetical protein BACOVA_03454 [Bacteroides ovatus ATCC 8483]
 gi|392665761|gb|EIY59284.1| hypothetical protein HMPREF1069_04266 [Bacteroides ovatus
           CL02T12C04]
          Length = 371

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 92/336 (27%), Positives = 139/336 (41%), Gaps = 86/336 (25%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
            +KR+  LD  RG+T+  MILV++ G    AYA + H+ WNG T  D V PFF+FI+G++
Sbjct: 5   SNKRLLALDVMRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64

Query: 116 IALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ----GGYSHAPDALSYGVDMKHIRW 169
             ++LKK        A  KII RT+ +   GI L       Y+H P      +  + IR 
Sbjct: 65  TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALNWFALLCYTHNP------LPFEQIRI 118

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G++QR+AL Y   ALI  L   +                       +I ++I ++    
Sbjct: 119 LGVMQRLALCYGASALIALLLKHK-----------------------YIPYLIVVLL--- 152

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
                          V  +I+     G      N +  VDR + G  H+Y D        
Sbjct: 153 ---------------VGYFIILITGNGFAYNETNILSIVDRSILGDAHMYQD-------- 189

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                                  +PEGLLSTI +I    IG   G +L+  K    +L+ 
Sbjct: 190 --------------------NHIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEKLER 229

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFT 385
              +G  L     +  F+   P NK+++S ++V  T
Sbjct: 230 LFLIGTILTFAGFL--FSYGCPFNKKIWSPTFVLVT 263


>gi|418719584|ref|ZP_13278783.1| putative membrane protein [Leptospira borgpetersenii str. UI 09149]
 gi|410743627|gb|EKQ92369.1| putative membrane protein [Leptospira borgpetersenii str. UI 09149]
          Length = 383

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 91/345 (26%), Positives = 147/345 (42%), Gaps = 81/345 (23%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            Q   R+ +LD FRG+TVV MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG
Sbjct: 6   TQNKDRILSLDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVG 65

Query: 114 VAIALALKKVPKIN--GAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRW 169
            +I ++L     IN       I  R + L+  G+ L   G ++ +            +R 
Sbjct: 66  ASIPISLYSKNGINRIRVWIGICIRGISLILLGLFLNFFGEWTFS-----------ELRI 114

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRI  VY VVA +  +   ++                       + F+I I     
Sbjct: 115 PGVLQRIGFVYWVVATLFLIFPGKKV----------------------LVFLIPI----- 147

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           L V  W  +  +  G     ++ G         +   ++DR ++G  HL+          
Sbjct: 148 LLVHTWILTHIAPPGESMVSLEQGK--------DIGAWIDRTIFGEKHLW---------- 189

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                       + + +W     +PEG LS I++I +   G+  G +L   +G     + 
Sbjct: 190 ------------KFSKTW-----DPEGFLSGIASIATSLFGVICGFILFRREGRGKN-RV 231

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
               G G L   + L +  ++P+NK L++ SY  +T G +  C+G
Sbjct: 232 LSIFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGLSFLCIG 276


>gi|345856403|ref|ZP_08808889.1| putative membrane protein [Desulfosporosinus sp. OT]
 gi|344330527|gb|EGW41819.1| putative membrane protein [Desulfosporosinus sp. OT]
          Length = 381

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 91/360 (25%), Positives = 151/360 (41%), Gaps = 95/360 (26%)

Query: 39  RSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPW 95
            + +  E+KG          K  R+  +D FRG+ V +M++V + G     Y ++ H+ W
Sbjct: 2   ENSIAPEEKG----------KFGRLNCIDVFRGIAVAIMLIVTNPGNPLRNYPQLRHAAW 51

Query: 96  NGCTLADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGY 151
           NG T+AD   PFF+ I+G+ I  A+ K  K   +       I+ R++ L   GI+L G  
Sbjct: 52  NGYTVADLAFPFFMLIMGMVIPYAVDKRIKEGKSNLSIFNHILIRSIGLFCIGILLNGFP 111

Query: 152 SHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQW 211
            +         D+  IR  G+LQRIA+ Y+   +IE +        ++  +L I      
Sbjct: 112 VY---------DLSIIRIPGVLQRIAIAYLCTGIIELIVKA----TVKKSYLQIIVES-- 156

Query: 212 QWIGGFIAFVIYIITTYSLYVPNWSFSEHSD--HGVKKYIVKCGMRGHLGPACNAVGYVD 269
                  +  + II+ YS+ +  +SF ++ +    +  Y +K                  
Sbjct: 157 -------SLALSIISVYSVLLIKYSFPDYKNLVQTIDLYFLK------------------ 191

Query: 270 RELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTI 329
                  HLY+                        P W     +PEG+L+T S+I +   
Sbjct: 192 ------GHLYT------------------------PDW-----DPEGILTTFSSIATAIF 216

Query: 330 GIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           G   G++L + + + AR K      +G++ I +        P NK L+S SYV  TAG A
Sbjct: 217 GSIAGNILFN-RDNKARKKFITIFIYGVVTIIVASIIQRWFPYNKNLWSSSYVLITAGIA 275


>gi|336415339|ref|ZP_08595679.1| hypothetical protein HMPREF1017_02787 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940935|gb|EGN02797.1| hypothetical protein HMPREF1017_02787 [Bacteroides ovatus
           3_8_47FAA]
          Length = 371

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 92/336 (27%), Positives = 139/336 (41%), Gaps = 86/336 (25%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
            +KR+  LD  RG+T+  MILV++ G    AYA + H+ WNG T  D V PFF+FI+G++
Sbjct: 5   SNKRLLALDVIRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64

Query: 116 IALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ----GGYSHAPDALSYGVDMKHIRW 169
             ++LKK        A  KII RT+ +   GI L       Y+H P      +  + IR 
Sbjct: 65  TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALNWFALLCYTHNP------LPFEQIRI 118

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G++QR+AL Y   ALI  L   +                       +I ++I ++    
Sbjct: 119 LGVMQRLALCYGASALIALLLKHK-----------------------YIPYLIVVLL--- 152

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
                          V  +I+     G      N +  VDR + G  H+Y D        
Sbjct: 153 ---------------VGYFIILITGNGFAYNETNILSIVDRSILGDAHMYQD-------- 189

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                                  +PEGLLSTI +I    IG   G +L+  K    +L+ 
Sbjct: 190 --------------------NHIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEKLER 229

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFT 385
              +G  L     +  F+   P NK+++S ++V  T
Sbjct: 230 LFLIGTILTFAGFL--FSYGCPFNKKIWSPTFVLVT 263


>gi|452822119|gb|EME29141.1| heparan-alpha-glucosaminide N-acetyltransferase isoform 1
           [Galdieria sulphuraria]
          Length = 356

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 88/330 (26%), Positives = 140/330 (42%), Gaps = 81/330 (24%)

Query: 77  MILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPK-------I 126
           MIL  D G     Y+   H  W    +AD + PFFLF+VG +I  A +KVP+        
Sbjct: 1   MILGKDQGNFDKMYSVFKHESWFSWHMADLIFPFFLFMVGSSIYFAFRKVPREVENSEEK 60

Query: 127 NGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI 186
           + A++ +  RT+KL   G++L    S        G   + +RW GILQRIA+ Y  VA +
Sbjct: 61  DKALRSVTSRTIKLFLVGVLLNVPLS--------GFRWETLRWMGILQRIAICYGCVAFL 112

Query: 187 ETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVK 246
                 R   V++   +S+              F+++    Y L VPN            
Sbjct: 113 FLFVNSR---VIQYALVSVL-------------FLLHTSLLYGLIVPN------------ 144

Query: 247 KYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPS 306
                C +   L  AC+A  Y+D  + G  HLY       LE                  
Sbjct: 145 -----CLISERLTRACSAQSYLDTMILGGKHLY-----FHLE------------------ 176

Query: 307 WCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL-- 364
                ++PEG+LST+ A ++   G+    +    +  + R+     +G   + I I+L  
Sbjct: 177 -----YDPEGILSTLMATINTFAGLEAARLTSSLRYVNQRILWCFLIGSSFVGIEILLVD 231

Query: 365 HFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
            F +++PI+K L++ S++  T G +  CL 
Sbjct: 232 CFPDSVPISKPLWTASFLFLTVGCSFWCLS 261


>gi|373953861|ref|ZP_09613821.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
 gi|373890461|gb|EHQ26358.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
          Length = 379

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 94/344 (27%), Positives = 151/344 (43%), Gaps = 81/344 (23%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           ++R+ +LD FRGLTV  MILV++ G     Y+ ++HS WNGCT  D + PFFLFIVGV+I
Sbjct: 13  NQRLLSLDVFRGLTVACMILVNNPGDWAHIYSPLEHSAWNGCTPTDLIFPFFLFIVGVSI 72

Query: 117 ALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
             ++   K  P  +G +   I +   +LF   +    Y           +   +R  G+L
Sbjct: 73  VYSMGTKKTDPAQHGKLVLTILKRSLILFCLALFLSLYPK--------FNFHTLRIPGVL 124

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRIA+V+ +  +I  L T+R+                 Q I  ++  ++Y +    + VP
Sbjct: 125 QRIAVVFGICGIIF-LKTERKT----------------QLILFWLFLIVYYLLMTLVPVP 167

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
              ++                  +L P  N   ++DR + G  HL+ + V          
Sbjct: 168 GVGYA------------------NLQPETNLGAWIDRTVIGNVHLWKESV---------- 199

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV-S 352
                       +W     +PEG+L T+ A  +G  GI  G  L       +    W+  
Sbjct: 200 ------------TW-----DPEGILGTMPATSTGLFGILVGTWLKRKDVDESTKVAWLFC 242

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGIT 396
            G G +I+ ++       PINK L++ S+V +  G A   LG+T
Sbjct: 243 TGIGAVILGLLWDL--FFPINKALWTSSFVLYAGGLAT--LGLT 282


>gi|383110831|ref|ZP_09931649.1| hypothetical protein BSGG_1941 [Bacteroides sp. D2]
 gi|313694406|gb|EFS31241.1| hypothetical protein BSGG_1941 [Bacteroides sp. D2]
          Length = 371

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 92/336 (27%), Positives = 139/336 (41%), Gaps = 86/336 (25%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
            +KR+  LD  RG+T+  MILV++ G    AYA + H+ WNG T  D V PFF+FI+G++
Sbjct: 5   SNKRLLALDVMRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64

Query: 116 IALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ----GGYSHAPDALSYGVDMKHIRW 169
             ++LKK        A  KII RT+ +   GI L       Y+H P      +  + IR 
Sbjct: 65  TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALNWFALLCYTHNP------LPFEQIRI 118

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G++QR+AL Y   ALI  L   +                       +I ++I ++    
Sbjct: 119 LGVMQRLALCYGASALIALLLKHK-----------------------YIPYLIVVLL--- 152

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
                          V  +I+     G      N +  VDR + G  H+Y D        
Sbjct: 153 ---------------VGYFIILITGNGFAYNETNILSIVDRSILGDAHMYQD-------- 189

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                                  +PEGLLSTI +I    IG   G +L+  K    +L+ 
Sbjct: 190 --------------------NHIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEKLER 229

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFT 385
              +G  L     +  F+   P NK+++S ++V  T
Sbjct: 230 LFLIGTILTFAGFL--FSYGCPFNKKIWSPTFVLAT 263


>gi|418676277|ref|ZP_13237561.1| putative membrane protein [Leptospira kirschneri serovar
           Grippotyphosa str. RM52]
 gi|418684894|ref|ZP_13246077.1| putative membrane protein [Leptospira kirschneri serovar
           Grippotyphosa str. Moskva]
 gi|418743212|ref|ZP_13299580.1| putative membrane protein [Leptospira kirschneri serovar Valbuzzi
           str. 200702274]
 gi|400323423|gb|EJO71273.1| putative membrane protein [Leptospira kirschneri serovar
           Grippotyphosa str. RM52]
 gi|410740642|gb|EKQ85357.1| putative membrane protein [Leptospira kirschneri serovar
           Grippotyphosa str. Moskva]
 gi|410749503|gb|EKR06488.1| putative membrane protein [Leptospira kirschneri serovar Valbuzzi
           str. 200702274]
          Length = 369

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 101/358 (28%), Positives = 161/358 (44%), Gaps = 89/358 (24%)

Query: 66  LDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK 122
           +D FRG+TV  MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG++I L++  
Sbjct: 1   MDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGISIQLSVYS 60

Query: 123 VPKINGAVKKIIF----RTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGILQRI 176
             KI+ +  KI F    R++ L+  G+ L   G +S +            +R  G+LQRI
Sbjct: 61  KNKIHKS--KIWFGICIRSITLILIGLFLNFFGEWSFS-----------ELRIPGVLQRI 107

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
             VY +VA +  +  KR                           +I I     L V  W 
Sbjct: 108 GFVYWIVASLHLILPKR---------------------------MILISWIPILLVHTWV 140

Query: 237 FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPN 296
             +    G  + IV      +L P  +   ++DR ++G N L+                 
Sbjct: 141 LIQIPAPG--ESIV------YLEPGKDIGAWIDRNVFGENRLW----------------- 175

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFG 356
                + + +W     +PEGL S IS+I +  +G+  G +L   K +  + +     GFG
Sbjct: 176 -----KFSKTW-----DPEGLFSGISSIATSLLGVFCGSILSS-KTNEIKKQILSIFGFG 224

Query: 357 LLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA---DGCLGITNSIFIPEMDWDECDA 411
           +L++ + L +   +P+NK L++ SYV +TAG A    GC  + N   +    WD+  +
Sbjct: 225 ILLVLVGLLWDQNLPMNKSLWTGSYVIYTAGLAFLSIGCFELLN-FLLRIKKWDQLQS 281


>gi|333030942|ref|ZP_08459003.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
 gi|332741539|gb|EGJ72021.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
          Length = 385

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 95/348 (27%), Positives = 151/348 (43%), Gaps = 79/348 (22%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
             SKR+ +LD  RG T++ MILV++ G     Y+ + H+ W+G T  D + PFF+F++G+
Sbjct: 1   MSSKRLLSLDILRGGTIIGMILVNNPGSWEYIYSPLRHAEWHGLTPTDLIFPFFIFVMGI 60

Query: 115 AIALALKKVP--KINGAV--KKIIFRTLKLLFWGIILQGGYSHAP---DALSYG------ 161
           +++L+  K    + N  +  +K+I R+ KL   G+ L           + L Y       
Sbjct: 61  SMSLSFSKFKNEEYNKTLFWEKVIKRSAKLFLLGLFLSWFSLLLEGINNRLEYESISEIL 120

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFV 221
                IR  G++QR+AL Y+V ++   L  K        +HL I +             V
Sbjct: 121 FPFGQIRILGVMQRLALSYLVGSVFVMLIPK-------AKHLVITS-------------V 160

Query: 222 IYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSD 281
           I +I  + L      FS  SD                    N +  VD  L+G NH+Y  
Sbjct: 161 ILLIAYFILLSLGNGFSFSSD--------------------NIIAIVDNSLFGENHVY-- 198

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK 341
                               E  P   R  F+PEGLLSTI  I+   +G   G V+   K
Sbjct: 199 -------------------LEWLPDGERLRFDPEGLLSTIPCIVQVIMGYLCGEVIRKKK 239

Query: 342 GHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
               ++     +G  LL I ++L +    P+NK+++S ++   T+G A
Sbjct: 240 DLLNKMMDLAIIGIVLLFIGLLLSY--GCPLNKKIWSPTFELVTSGFA 285


>gi|375253854|ref|YP_005013021.1| hypothetical protein BFO_0041 [Tannerella forsythia ATCC 43037]
 gi|363406758|gb|AEW20444.1| putative membrane protein [Tannerella forsythia ATCC 43037]
          Length = 390

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 148/363 (40%), Gaps = 90/363 (24%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFF 108
           + +L    S+R+  LD  RG+T+  MI+V++ G     YA + H+ W+G T  D V PFF
Sbjct: 1   MSRLPDTSSRRLLALDILRGITIAGMIMVNNPGSWSFVYAPLGHAAWHGLTPTDLVFPFF 60

Query: 109 LFIVGVAIALALKKVPKI--NGAVKKIIFRT-------LKLLFWGIILQGGYSHAPDALS 159
           +FI+G++  ++LKK        A++KII RT       L L + G+  +  +  A   LS
Sbjct: 61  MFIMGISTYISLKKYDFTFSYSAMRKIIRRTAVIFAIGLGLAWLGLTCRTWHGLADGGLS 120

Query: 160 YGV-------DMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQ 212
           +G        +  H+R  G++QR+AL Y   ALI       R   L    L         
Sbjct: 121 FGARLWQSVSNFGHLRILGVMQRLALSYGATALIALAIRHHRIPYLIVALL--------- 171

Query: 213 WIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDREL 272
             GG+        T   L     +++E                       N +  VDR +
Sbjct: 172 --GGY--------TVLLLAGNGLAYNE----------------------TNILSIVDRAV 199

Query: 273 WGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIH 332
            G+NH Y D                               EPEGLLST+ AI    IG  
Sbjct: 200 LGVNHTYKD----------------------------MGIEPEGLLSTLPAIAHVLIGFC 231

Query: 333 YGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGC 392
            G  ++       ++     +G  +     +L +    PINK+++S ++V  T G A G 
Sbjct: 232 CGRAMLGVTEVRDKMLRLFLIGTAMAFAGWLLSY--GCPINKKIWSPTFVLITCGMASGL 289

Query: 393 LGI 395
           L +
Sbjct: 290 LAL 292


>gi|297300348|ref|XP_001115683.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Macaca mulatta]
          Length = 547

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 96/336 (28%), Positives = 152/336 (45%), Gaps = 61/336 (18%)

Query: 25  KDSENGINKEKGLERSEVQDEQKGELQLQQL-LQQKSKRVATLDAFRGLTVVLMILVDDA 83
           ++++  IN E G       D   G++Q     L     R+ ++D FRG+ ++LM+ V+  
Sbjct: 231 RETDRLINSELG--SPSRTDPLDGDVQPAVWHLSVPPPRLRSVDTFRGIALILMVFVNYG 288

Query: 84  GGAYARIDHSPWNG-----CTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KI 133
           GG Y    H+ WNG     C +  F M  F+FI+G +I L++  + +  G  K     KI
Sbjct: 289 GGKYWYFKHASWNGTLPMQCGICIFAM-MFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKI 346

Query: 134 IFRTLKLLFWGIIL-QGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK 192
            +R+  L+  GII+    Y   P      +    +R  G+LQR+ + Y VVA++E L  K
Sbjct: 347 AWRSFLLICIGIIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAK 400

Query: 193 RRPN--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKY 248
             P    LE   LS+   T+   QW+       +++  T+ L VP          G    
Sbjct: 401 PVPEHCALERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP----------GCPTG 450

Query: 249 IVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDA 304
            +  G  G  G  P C   A GY+DR L G +HLY  P  + L    ++           
Sbjct: 451 YLGPGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSTVLYHTEVA----------- 499

Query: 305 PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHF 340
                  ++PEG+L TI++I+   +G+    V +HF
Sbjct: 500 -------YDPEGILGTINSIVMAFLGV---QVFVHF 525


>gi|333378010|ref|ZP_08469743.1| hypothetical protein HMPREF9456_01338 [Dysgonomonas mossii DSM
           22836]
 gi|332884030|gb|EGK04310.1| hypothetical protein HMPREF9456_01338 [Dysgonomonas mossii DSM
           22836]
          Length = 389

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 95/361 (26%), Positives = 154/361 (42%), Gaps = 94/361 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           Q+ S R+ +LD  RG+T+  MI+V++ G     YA + H+ W+G T  D V PFF+FI+G
Sbjct: 3   QKPSSRLLSLDILRGITIAGMIMVNNPGSWSYVYAPLGHAAWHGLTPTDLVFPFFMFIMG 62

Query: 114 VAIALALKKVP-KING-AVKKIIFRTLKLLFWGIILQ-----------------GGYSHA 154
           ++  ++L+K   + N   + KI+ RT+ +   G+ L                  G +   
Sbjct: 63  ISTYISLRKFNFEFNKPTLFKILKRTVVIFLIGLGLGWLSLSFRTFNSLSGEDIGFFERF 122

Query: 155 PDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWI 214
             A++   + +HIR  G++QR+AL Y   ALI              +H  I        I
Sbjct: 123 ITAIT---NFEHIRILGVMQRLALTYGATALIAIFV----------KHKYIPYIIVVTLI 169

Query: 215 GGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWG 274
           G F+           L+   + FSE                       N +  +DR + G
Sbjct: 170 GYFLLL---------LFGNGFDFSED----------------------NIISVLDRAILG 198

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
            +H+Y D               SG              +PEGLLSTI AI    IG   G
Sbjct: 199 ADHMYKD---------------SG-----------LAIDPEGLLSTIPAICHVLIGFCCG 232

Query: 335 HVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
            +L+  K ++ R++    +G  +  +  +L +    PINK+++S ++V  T G A   L 
Sbjct: 233 EILLTTKDNNERIQRLFIIGAIMTFLGFLLSY--GCPINKKIWSPTFVLATCGLASTMLA 290

Query: 395 I 395
           +
Sbjct: 291 L 291


>gi|237710367|ref|ZP_04540848.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
 gi|229455829|gb|EEO61550.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
          Length = 366

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 90/371 (24%), Positives = 164/371 (44%), Gaps = 80/371 (21%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFF 108
           +  +     KR+  LD  RG+T+  MILV++ G     YA ++H+ +NG T  D V PFF
Sbjct: 1   MTHMTANTPKRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHASFNGLTPTDLVFPFF 60

Query: 109 LFIVGVAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKH 166
           +FI+G++  ++L+K      +  ++KI+ RT+ +   G++L      A    ++ ++ + 
Sbjct: 61  MFIMGISTYISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEE 117

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIIT 226
            R+ G++QR+A+ Y V +L+  +T K +                      F A ++  + 
Sbjct: 118 WRYLGVMQRLAIGYGVTSLVA-ITVKHKY---------------------FPAIILVTLA 155

Query: 227 TYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
            Y L +        S+                    N V   D    G +H+Y +     
Sbjct: 156 AYFLLLATGDGFNQSE-------------------TNVVARFDAWALGTSHMYHE----- 191

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
                     SG             F+PEGLLST+ A+    +G + G +L+  K ++ +
Sbjct: 192 ----------SG-----------MAFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEK 230

Query: 347 LKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMD- 405
           ++    +G  L     +L +    PINK+++S ++V  T G A   L +   I+I +M  
Sbjct: 231 IQRLFLIGTILTFAGFLLSY--GCPINKKVWSPTFVIITCGLASSFLALL--IWIIDMKG 286

Query: 406 WDECDACFRAW 416
           +    A FR++
Sbjct: 287 YQNWCAFFRSF 297


>gi|54297581|ref|YP_123950.1| hypothetical protein lpp1632 [Legionella pneumophila str. Paris]
 gi|53751366|emb|CAH12784.1| hypothetical protein lpp1632 [Legionella pneumophila str. Paris]
          Length = 372

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 90/342 (26%), Positives = 145/342 (42%), Gaps = 91/342 (26%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           K +R+ +LD FRG+T+VLMI+V+       Y   +H  WNGCTLAD V PFFLFIVG+  
Sbjct: 7   KPQRLLSLDVFRGMTIVLMIIVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLTS 66

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS-----YGVDMKHIRWCG 171
            ++LK   +          R  K   +  I++         L      + ++   IR  G
Sbjct: 67  VISLKNQME----------RKAKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSIRIYG 116

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           ILQRIA+ Y++ A I   T+ +                  Q+    +  + Y I    + 
Sbjct: 117 ILQRIAVCYLISAFIYLNTSIKT-----------------QFFIFLVLLLGYWIIMTQVP 159

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
           VP +  ++ +  G                  + V Y D+  +  +HLY            
Sbjct: 160 VPGYGANQLTKDG------------------SWVSYFDQLFFSASHLYEK---------- 191

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIH----FKGHSARL 347
                               ++PEG +ST ++I +   G+  G +LI+    FK      
Sbjct: 192 -------------------TYDPEGFVSTFTSIATTLSGVLAGSLLINPCNQFK------ 226

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           K ++  G G+L + +   +  + PINK L++ SYV +T+G A
Sbjct: 227 KFYLLAGVGMLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLA 268


>gi|398343267|ref|ZP_10527970.1| hypothetical protein LinasL1_09415 [Leptospira inadai serovar Lyme
           str. 10]
          Length = 399

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 101/360 (28%), Positives = 151/360 (41%), Gaps = 91/360 (25%)

Query: 40  SEVQDEQKGELQLQQLLQQK--SKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSP 94
           S V+   +     Q+LL     SKR+ ++DA RG TV  MILV++ G     Y+ + H+ 
Sbjct: 10  SLVKIRNEIHFVKQELLNDSFASKRLLSIDALRGFTVAGMILVNNPGSWSAIYSPLRHAK 69

Query: 95  WNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQ--GGYS 152
           W GCT  D V PFFLF VGV+I  +        G   KI+ R   L+F G+ L   G +S
Sbjct: 70  WFGCTPTDLVFPFFLFSVGVSIPFS---TIGNGGTFFKILKRASILIFIGLFLHWFGEWS 126

Query: 153 HAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQ 212
                      M  +R  G+LQRI LVY + A+        R      R           
Sbjct: 127 -----------MDRLRIPGVLQRIGLVYFISAI------AYRSASFRTR----------- 158

Query: 213 WIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDREL 272
                I   I I+  Y + +    F+     G             L P  +   ++DR +
Sbjct: 159 -----IMICISILFGYWILL---EFAPPPGAGSPS----------LSPGKDWGAWLDRIV 200

Query: 273 WGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIH 332
           +G NHL+                      + + +W     +PEGLL ++SA+ +  +GI 
Sbjct: 201 FGENHLW----------------------KSSKTW-----DPEGLLGSLSAVATTFLGIF 233

Query: 333 YGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAG 387
           +G VL   K  S    +     F  ++ AI+L     I     P+NK L++ S+V +T G
Sbjct: 234 FGEVL---KKDSDTKGNIQKTAFTFILAAIVLMLVGWIWHQFFPMNKSLWTSSFVLWTGG 290


>gi|433652541|ref|YP_007296395.1| hypothetical protein Prede_1583 [Prevotella dentalis DSM 3688]
 gi|433303074|gb|AGB28889.1| hypothetical protein Prede_1583 [Prevotella dentalis DSM 3688]
          Length = 394

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 57/135 (42%), Positives = 81/135 (60%), Gaps = 7/135 (5%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA--YARIDHSPWNGCTLADFVMPFFLFIVGV 114
           +  S+R+ +LD  RGLTV+LMI V++  G   +A++ HS WNG TL D V PFFLFI+GV
Sbjct: 8   RMSSQRLISLDVLRGLTVMLMIFVNNGAGEQIFAQLQHSRWNGMTLCDLVFPFFLFIMGV 67

Query: 115 AIALALKKVPKINGAV--KKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           +  L+L+K   +  A   +KI  RTL L   G+ +   +  A     +  D+ H+R  G+
Sbjct: 68  STYLSLRKTQFVWSARLGRKIARRTLLLFVIGLAIN-WFDMACSGRPF--DLSHLRIMGV 124

Query: 173 LQRIALVYVVVALIE 187
           +QRIAL Y   ALI 
Sbjct: 125 MQRIALCYGATALIA 139



 Score = 42.4 bits (98), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 44/83 (53%), Gaps = 4/83 (4%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLI----HFKGHSARLKHWVSMGFGLLIIAIIL 364
           ++P +PEGL+ST++A+     G +  H  +       G +AR         G ++  + L
Sbjct: 198 KSPVDPEGLVSTLAAVAHTLAGFYVAHWALGPGRDGAGPAARRSMMRFAAAGTVLAVVGL 257

Query: 365 HFTNAIPINKQLYSFSYVCFTAG 387
             +  +P+NK+++S SYVC + G
Sbjct: 258 ALSPLLPLNKRVWSPSYVCLSCG 280


>gi|225010297|ref|ZP_03700769.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C]
 gi|225005776|gb|EEG43726.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C]
          Length = 363

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/160 (38%), Positives = 96/160 (60%), Gaps = 13/160 (8%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           +KRV ++D FRG T++LMILV+  G     YA   H+ W+G TL D V PFF+FIVGV+I
Sbjct: 4   NKRVPSVDIFRGATLLLMILVNTPGTWSAVYAPFLHASWHGYTLTDLVFPFFIFIVGVSI 63

Query: 117 ALALKKVPKINGAV-KKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
           +L+ K   K+NG V  K+  R+LKL+  G+ L G ++    +  +   +++IR+ G+LQR
Sbjct: 64  SLSYKD-KKLNGPVFFKLTKRSLKLIGLGLFL-GAFT---ISFPFIKAVENIRFPGVLQR 118

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIG 215
           I LV+   ++I    +KR   ++    + I     W W+G
Sbjct: 119 IGLVFFFASIIYLWGSKRSTALI----IGIILLAYWLWMG 154


>gi|404487027|ref|ZP_11022214.1| hypothetical protein HMPREF9448_02670 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335523|gb|EJZ61992.1| hypothetical protein HMPREF9448_02670 [Barnesiella intestinihominis
           YIT 11860]
          Length = 364

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 95/337 (28%), Positives = 153/337 (45%), Gaps = 77/337 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           L + SKR+ +LD  RG+TV  MILV++AG    AYA + H+ W+G T AD V P F+FI+
Sbjct: 3   LTRTSKRLVSLDVLRGITVCGMILVNNAGACGYAYAPLKHAKWDGFTPADLVFPAFMFIM 62

Query: 113 GVAIALALKKVP-KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
           GV+I L+L K       ++ +I+ RT+ +   G+ L+  +  A  A      ++++R  G
Sbjct: 63  GVSIYLSLNKSNFDWRISIARILRRTVLIFMSGVALK--WILAFIATGEYNTLENLRIMG 120

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           +LQR+ + Y +VAL+  +T + R   L P  +++                          
Sbjct: 121 VLQRLGICYGIVALLA-VTVRHR---LFPTIIAVLL------------------------ 152

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
                        V  Y+++    G    A N V  VD  + G +H+Y            
Sbjct: 153 -------------VGYYLLQLFGNGFEKCAGNIVSMVDYAVLGKSHMYLGG--------- 190

Query: 292 LSSPNSGPLREDAPSWCRAPF-EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                             A F +PEG+LSTI AI    IG   G V++  K   +++   
Sbjct: 191 ------------------AQFVDPEGVLSTIPAIAQVMIGFLCGKVIVGEKEIRSQIVKL 232

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
              G  + +I  +  ++ A P+NK+L+S S+V  T G
Sbjct: 233 AVWGTSMFVIGYL--WSYAAPLNKRLWSPSFVLVTCG 267


>gi|410096828|ref|ZP_11291813.1| hypothetical protein HMPREF1076_00991 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225445|gb|EKN18364.1| hypothetical protein HMPREF1076_00991 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 367

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 93/371 (25%), Positives = 163/371 (43%), Gaps = 79/371 (21%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
             +S R+ +LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G
Sbjct: 1   MAQSGRLLSLDVMRGITIAGMIMVNNPGSWGYVYAPLRHASWNGLTPTDLVFPFFMFIMG 60

Query: 114 VAIALALKKVP-KIN-GAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
           V++  +L+K   K++  +V K++ RT+ +   G  L        +  S   + +++R  G
Sbjct: 61  VSMFFSLRKYDFKLSRESVTKVLKRTVLIFLVGFALNLFGHLCYNGFS---NFENLRILG 117

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           ++QR+AL Y + +LI                LS+   Y  Q   G + F   ++      
Sbjct: 118 VMQRLALAYGIGSLIG---------------LSVKHKYILQTAAGILLFYWILLAA---- 158

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
             + + SE+                      N +  VDR L+G  H+Y D +        
Sbjct: 159 TGSQTLSEN----------------------NIIAIVDRALFGNTHMYHDYL-------- 188

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV 351
                            R  F+PEGLLS + +I    +G + G +++  K ++  +   +
Sbjct: 189 -------------ADGTRIAFDPEGLLSCLGSIGHVLLGFYVGKMILDCKKNNELIIRNL 235

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDA 411
            + FG +I+ +    +   PINK+L+S ++V  T G          S+F+  + W     
Sbjct: 236 FI-FGTIILFLGFLLSYGCPINKKLWSSTFVLTTCGFG--------SLFLALLIWIIDIN 286

Query: 412 CFRAWSTRYPS 422
             + WS  + S
Sbjct: 287 GKKKWSLFFES 297


>gi|423304873|ref|ZP_17282872.1| hypothetical protein HMPREF1072_01812 [Bacteroides uniformis
           CL03T00C23]
 gi|423310012|ref|ZP_17287996.1| hypothetical protein HMPREF1073_02746 [Bacteroides uniformis
           CL03T12C37]
 gi|392682836|gb|EIY76175.1| hypothetical protein HMPREF1072_01812 [Bacteroides uniformis
           CL03T00C23]
 gi|392683302|gb|EIY76639.1| hypothetical protein HMPREF1073_02746 [Bacteroides uniformis
           CL03T12C37]
          Length = 394

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 98/364 (26%), Positives = 151/364 (41%), Gaps = 96/364 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            + SKR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G
Sbjct: 4   TKVSKRILALDILRGVTIAGMIMVNNPGSWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMG 63

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSH-------APDALSYGVDM 164
           ++  ++LKK      + A  KI+ RT+ +   G+ + G +S        APD LS+G  +
Sbjct: 64  ISTYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKL 122

Query: 165 -------KHIRWCGILQRIALVYVVVALIE-TLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                    IR  G++QR+AL Y   ++I  T+  K  P ++      + T Y       
Sbjct: 123 WASVWTFDRIRILGVMQRLALCYGAASIIALTMKHKHIPYLIA----GLLTGY------- 171

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
           FI                              ++ CG  G      N +  VD  +    
Sbjct: 172 FI------------------------------LLMCG-NGFAYNETNILSVVDHAILTPA 200

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEGLLSTI AI    +G   G +
Sbjct: 201 HMYKD----------------------------NGIDPEGLLSTIPAIAHVLLGFCVGRL 232

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAADG 391
           ++       R     S    L ++ +IL F+  +     PINK+++S +YV  T G A  
Sbjct: 233 MLDGNKSEDRASFLNSQLITLFLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASS 292

Query: 392 CLGI 395
            L +
Sbjct: 293 FLAL 296


>gi|340347656|ref|ZP_08670761.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Prevotella
           dentalis DSM 3688]
 gi|339608850|gb|EGQ13733.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Prevotella
           dentalis DSM 3688]
          Length = 386

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 57/135 (42%), Positives = 80/135 (59%), Gaps = 7/135 (5%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGA--YARIDHSPWNGCTLADFVMPFFLFIVGVA 115
             S+R+ +LD  RGLTV+LMI V++  G   +A++ HS WNG TL D V PFFLFI+GV+
Sbjct: 1   MSSQRLISLDVLRGLTVMLMIFVNNGAGEQIFAQLQHSRWNGMTLCDLVFPFFLFIMGVS 60

Query: 116 IALALKKVPKINGAV--KKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
             L+L+K   +  A   +KI  RTL L   G+ +   +  A     +  D+ H+R  G++
Sbjct: 61  TYLSLRKTQFVWSARLGRKIARRTLLLFVIGLAIN-WFDMACSGRPF--DLSHLRIMGVM 117

Query: 174 QRIALVYVVVALIET 188
           QRIAL Y   ALI  
Sbjct: 118 QRIALCYGATALIAV 132



 Score = 42.4 bits (98), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 44/83 (53%), Gaps = 4/83 (4%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLI----HFKGHSARLKHWVSMGFGLLIIAIIL 364
           ++P +PEGL+ST++A+     G +  H  +       G +AR         G ++  + L
Sbjct: 190 KSPVDPEGLVSTLAAVAHTLAGFYVAHWALGPGRDGAGPAARRSMMRFAAAGTVLAVVGL 249

Query: 365 HFTNAIPINKQLYSFSYVCFTAG 387
             +  +P+NK+++S SYVC + G
Sbjct: 250 ALSPLLPLNKRVWSPSYVCLSCG 272


>gi|444731031|gb|ELW71398.1| Heparan-alpha-glucosaminide N-acetyltransferase [Tupaia chinensis]
          Length = 732

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 102/385 (26%), Positives = 161/385 (41%), Gaps = 57/385 (14%)

Query: 23  DGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSK-RVATLDAFRGLTVVLMILVD 81
           + ++++  IN E G       D   GE + +      S  R+  +D FRG+ ++LM+ V+
Sbjct: 228 NSRETDRLINSELG--SPSRADPLGGETRPEPWRPAASPHRLRCVDTFRGIALILMVFVN 285

Query: 82  DAGGAYARIDHSPWNGCTLADFVMPFFL-------FIVGVAIALALKKVPKINGAVK--- 131
             GG Y    H+ WN  +     +P  L       F+V V   L  K V    G      
Sbjct: 286 YGGGKYWYFKHASWN-VSWDKVRIPGVLQRLGVTFFVVAVLELLFAKPVCIYYGVFNFSV 344

Query: 132 KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTT 191
             I+    +    I  +      P  L   +    +R  G+LQR+ + + VVA++E L  
Sbjct: 345 NDIYAAAGMFKIQIARENCVEEFPVNLYRDLSWDKVRIPGVLQRLGVTFFVVAVLELLFA 404

Query: 192 KRRP-NVLEPR---HLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKK 247
           K  P N    R    L   T+   QW+   +   I++  T+ L VP          G  K
Sbjct: 405 KPVPENCASERSCLSLRDVTSSWPQWLVILMLESIWLGLTFFLPVP----------GCPK 454

Query: 248 YIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRED 303
             +  G  G  G  P C   A GY+D  L G +HLY  P  + L    ++          
Sbjct: 455 GYLGPGGIGDFGKYPNCTGGAAGYIDHLLLGADHLYKHPSSTVLYHTEVA---------- 504

Query: 304 APSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLI 359
                   ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ 
Sbjct: 505 --------YDPEGILGTINSIVMAFLGVQAGKILLYYKDRTKDILIRFTAWCCI-LGLIS 555

Query: 360 IAIILHFTNA--IPINKQLYSFSYV 382
           I +     N   IP+NK L+S SYV
Sbjct: 556 IGLTKISENEGFIPVNKNLWSISYV 580


>gi|389793498|ref|ZP_10196662.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter fulvus Jip2]
 gi|388434056|gb|EIL91012.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter fulvus Jip2]
          Length = 354

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 56/143 (39%), Positives = 83/143 (58%), Gaps = 18/143 (12%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           SKR+ +LDA RG TV  M+LV+D G     YA ++H+PW+GCT  D V PFFLF+VGV+ 
Sbjct: 2   SKRLPSLDALRGCTVAAMLLVNDPGDWGHIYAPLEHAPWHGCTPTDLVFPFFLFVVGVSS 61

Query: 117 ALALKKVPKI------NGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 170
           ALAL  +P++         VK  ++R L++L  G+ +           ++ +   H+R+ 
Sbjct: 62  ALAL--LPRLEQGVAPGALVKAALWRALRILALGVAIN-------LLAAWLLPHAHLRFP 112

Query: 171 GILQRIALVYVVVALIETLTTKR 193
           G+LQRI + +  VAL    T  R
Sbjct: 113 GVLQRIGICFAAVALFAVHTRPR 135


>gi|294895713|ref|XP_002775269.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239881343|gb|EER07085.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 323

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 86/330 (26%), Positives = 127/330 (38%), Gaps = 96/330 (29%)

Query: 69  FRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKING 128
            RG+ + +M++VD  G A   I H+PWNG  LAD VMP F+FI                 
Sbjct: 1   MRGVVMSIMLIVDVCGKAVPSIGHAPWNGLHLADIVMPGFIFI----------------- 43

Query: 129 AVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIET 188
                                      D L+ G+D+   R  GILQRIA+ Y    L+  
Sbjct: 44  ---------------------------DTLTVGLDLYTFRAPGILQRIAVCYAAAVLLAK 76

Query: 189 LTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKY 248
           L +   PN      L   +      + G +  VI           NW+            
Sbjct: 77  LVSDLSPNDTVKGALKNNSRVL---VVGLLCIVI-----------NWAIMLLGPQP---- 118

Query: 249 IVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWC 308
             K   RG L P CN    +DR ++G  H+Y +P+W                        
Sbjct: 119 --KGCPRGSLTPQCNVASNIDRMVFGPEHMY-NPLW------------------------ 151

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTN 368
               +PEGLLST+ ++ +  +G+  G  +     H+  L+    +G GLL+    +    
Sbjct: 152 ----DPEGLLSTLPSLATVALGLACGKFIQSRPSHTELLRL---VGCGLLLDLCGMGLGI 204

Query: 369 AIPINKQLYSFSYVCFTAGAADGCLGITNS 398
            IP++K L++ SY   T G     LGI +S
Sbjct: 205 VIPVSKVLWTPSYCLLTGGICVAFLGIVSS 234


>gi|300770061|ref|ZP_07079940.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300762537|gb|EFK59354.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 404

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 100/362 (27%), Positives = 156/362 (43%), Gaps = 93/362 (25%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
            +R  +LD FRG TV LMI+V++ G     +A + H+ W+GCT  D V PFFLF VG A+
Sbjct: 2   KQRYYSLDVFRGATVALMIMVNNPGSWGHMFAPLKHAEWHGCTPTDLVFPFFLFAVGNAM 61

Query: 117 ALALKKVPKINGAV--KKIIFRTLKLLFWGIILQ--GGYSHAPDAL-----SYGVD-MKH 166
           +  + ++ +   AV  +K++ RT+ +   G+ +        A D L     SY  D M+ 
Sbjct: 62  SFVIPRLQEAGPAVFWQKVLKRTVLIFLIGLFINWWPFVQWAQDTLVFKQWSYADDSMRG 121

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIIT 226
           +R  G+LQRIAL Y   ++I      R   ++              WI  FI  V + + 
Sbjct: 122 VRILGVLQRIALAYCFASIIAYYF--REKAII--------------WISTFILVVYWAVC 165

Query: 227 TYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
            + L  P   +S               ++G  G A       D ++ G+ H+Y       
Sbjct: 166 AF-LGTPGDPYS---------------LQGWFGTA------YDIQILGVAHVYKGE---- 199

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAI-------LSGTIGIHYGHVLIH 339
                                   PF+PEGL+ST+ AI       L+GT     G V   
Sbjct: 200 ----------------------GVPFDPEGLMSTLPAIVQVVLGYLAGTYIKKQGQVDWL 237

Query: 340 FKG-------HSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGC 392
           +K        H   L      GF L+++A +  ++   PINK++++ SYV +T G     
Sbjct: 238 WKKVPASQEPHFKLLSGLFVTGFILVVLAWV--WSLGFPINKKIWTSSYVLYTTGLGVMT 295

Query: 393 LG 394
           +G
Sbjct: 296 IG 297


>gi|227538516|ref|ZP_03968565.1| transmembrane protein [Sphingobacterium spiritivorum ATCC 33300]
 gi|227241435|gb|EEI91450.1| transmembrane protein [Sphingobacterium spiritivorum ATCC 33300]
          Length = 404

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 101/362 (27%), Positives = 156/362 (43%), Gaps = 93/362 (25%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
            +R  +LD FRG TV LMI+V++ G     +A + H+ W+GCT  D V PFFLF VG A+
Sbjct: 2   KQRYYSLDVFRGATVALMIMVNNPGSWGHMFAPLKHAEWHGCTPTDLVFPFFLFAVGNAM 61

Query: 117 ALALKKVPKINGAV--KKIIFRTLKLLFWGIILQ--GGYSHAPDAL-----SYGVD-MKH 166
           +  + ++ +   AV  +K++ RT+ +   G+ +        A D L     SY  D M+ 
Sbjct: 62  SFVIPRLQEAGPAVFWQKVLKRTVLIFLIGLFINWWPFVQWAQDTLVFKQWSYADDPMRG 121

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIIT 226
           +R  G+LQRIAL Y   ++I              R  +I       WI  FI  V + + 
Sbjct: 122 VRILGVLQRIALAYCFASIIAYYF----------REKAII------WISTFILVVYWAVC 165

Query: 227 TYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
            + L  P   +S               ++G  G A       D ++ G+ H+Y       
Sbjct: 166 AF-LGTPGDPYS---------------LQGWFGTA------YDIQILGVAHVYKGE---- 199

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAI-------LSGTIGIHYGHVLIH 339
                                   PF+PEGL+ST+ AI       L+GT     G V   
Sbjct: 200 ----------------------GVPFDPEGLMSTLPAIVQVVLGYLAGTYIKKQGEVDWL 237

Query: 340 FKG-------HSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGC 392
           +K        H   L      GF L+++A +  ++   PINK++++ SYV +T G     
Sbjct: 238 WKKVPASQEPHFKLLSGLFVTGFILVVLAWV--WSLGFPINKKIWTSSYVLYTTGLGIMT 295

Query: 393 LG 394
           +G
Sbjct: 296 IG 297


>gi|452822118|gb|EME29140.1| heparan-alpha-glucosaminide N-acetyltransferase isoform 2
           [Galdieria sulphuraria]
          Length = 351

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 83/317 (26%), Positives = 135/317 (42%), Gaps = 78/317 (24%)

Query: 87  YARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPK-------INGAVKKIIFRTLK 139
           Y+   H  W    +AD + PFFLF+VG +I  A +KVP+        + A++ +  RT+K
Sbjct: 9   YSVFKHESWFSWHMADLIFPFFLFMVGSSIYFAFRKVPREVENSEEKDKALRSVTSRTIK 68

Query: 140 LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLE 199
           L   G++L    S        G   + +RW GILQRIA+ Y  VA +      R   V++
Sbjct: 69  LFLVGVLLNVPLS--------GFRWETLRWMGILQRIAICYGCVAFLFLFVNSR---VIQ 117

Query: 200 PRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLG 259
              +S+              F+++    Y L VPN                 C +   L 
Sbjct: 118 YALVSVL-------------FLLHTSLLYGLIVPN-----------------CLISERLT 147

Query: 260 PACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLS 319
            AC+A  Y+D  + G  HLY       LE                       ++PEG+LS
Sbjct: 148 RACSAQSYLDTMILGGKHLY-----FHLE-----------------------YDPEGILS 179

Query: 320 TISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL--HFTNAIPINKQLY 377
           T+ A ++   G+    +    +  + R+     +G   + I I+L   F +++PI+K L+
Sbjct: 180 TLMATINTFAGLEAARLTSSLRYVNQRILWCFLIGSSFVGIEILLVDCFPDSVPISKPLW 239

Query: 378 SFSYVCFTAGAADGCLG 394
           + S++  T G +  CL 
Sbjct: 240 TASFLFLTVGCSFWCLS 256


>gi|332983392|ref|YP_004464833.1| heparan-alpha-glucosaminide N-acetyltransferase [Mahella
           australiensis 50-1 BON]
 gi|332701070|gb|AEE98011.1| Heparan-alpha-glucosaminide N-acetyltransferase [Mahella
           australiensis 50-1 BON]
          Length = 368

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 95/341 (27%), Positives = 148/341 (43%), Gaps = 93/341 (27%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGA-YAR--IDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           KR+ ++DA RG+ +  MI +++ G + Y    + H+PWNG TLAD   P F+F++G+ I 
Sbjct: 4   KRIQSIDALRGICITAMIFMNNPGNSKYTSPLLLHAPWNGITLADLFFPCFIFVMGMVIP 63

Query: 118 LALKKVPKINGAVKKIIFRTLK---LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           ++  K         ++I   LK   +LF    L G + +A        DM+H+R  G+LQ
Sbjct: 64  VSFGKRMAKGQTKGQLIAHLLKRSAMLF----LIGLFLNAFPCF----DMQHVRILGVLQ 115

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RIALVY    LI   ++           +S+F       IG       Y +    + VP 
Sbjct: 116 RIALVYFFSGLIFLFSST----------MSMFIISAAILIG-------YYLLLRFVPVPG 158

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
           +        G                  N + Y+D +L    HLY+              
Sbjct: 159 YGAGVFERTG------------------NLIQYIDLKLLK-GHLYT-------------- 185

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
                     P W     +PEGLLST+ AI S  +GI  G +L+  K ++ +L       
Sbjct: 186 ----------PDW-----DPEGLLSTLPAIASSLLGILTGCLLVSDKKNTNKLY------ 224

Query: 355 FGLLIIAIILHFTNAI------PINKQLYSFSYVCFTAGAA 389
             ++++   L F ++I      P+NK L+S S+V FT G A
Sbjct: 225 --IMLVCSALAFISSIITQKWFPLNKNLWSSSFVLFTTGFA 263


>gi|66808259|ref|XP_637852.1| transmembrane protein [Dictyostelium discoideum AX4]
 gi|60466271|gb|EAL64333.1| transmembrane protein [Dictyostelium discoideum AX4]
          Length = 675

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 53/130 (40%), Positives = 79/130 (60%), Gaps = 7/130 (5%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
            +K  R+ +LD FRG ++ +MI V+  GG Y   +HS WNG T+AD V P+F+FI+G+A+
Sbjct: 201 NRKKDRLRSLDVFRGFSITIMIFVNYGGGGYWFFNHSLWNGLTVADLVFPWFVFIMGIAM 260

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
            L+   + K  G  K+IIF+  KLL   IIL          ++ GVD++  R  G+LQR 
Sbjct: 261 PLSFHAMEK-RGTPKRIIFQ--KLLRRSIILFA----LGLFINNGVDLQQWRILGVLQRF 313

Query: 177 ALVYVVVALI 186
           ++ Y+VV  I
Sbjct: 314 SISYLVVGSI 323



 Score = 38.5 bits (88), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 22/80 (27%), Positives = 42/80 (52%), Gaps = 4/80 (5%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAI----ILHFT 367
           ++PEG +  +++I    IG+  G +++ +K + +RL  W+     L  IA     +    
Sbjct: 510 YDPEGTVGYLTSIFLCFIGVQAGRIILTYKSNRSRLIRWMVWSVVLCGIAAGLCGLTQNQ 569

Query: 368 NAIPINKQLYSFSYVCFTAG 387
             +P+NK L+S S++   AG
Sbjct: 570 GWLPVNKNLWSPSFILLMAG 589


>gi|290999917|ref|XP_002682526.1| hypothetical protein NAEGRDRAFT_78070 [Naegleria gruberi]
 gi|284096153|gb|EFC49782.1| hypothetical protein NAEGRDRAFT_78070 [Naegleria gruberi]
          Length = 425

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 153/365 (41%), Gaps = 74/365 (20%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG-AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           + +R+  LD  RG+T++LMI+V++    A+  +DH+ W G T  D V PFFLF++G A A
Sbjct: 62  RKERMVALDIMRGMTIMLMIIVNNQPARAFIPLDHAEWFGFTPTDCVFPFFLFVMGYAAA 121

Query: 118 LALKK---------VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPD-ALSYGVDMKH- 166
           +   +          P       +  FR L      ++ +            Y + M+  
Sbjct: 122 IVYSREWPSDVYLYPPSHVKLSIQSYFRELCGKKQDLMDENEKKEEESIKFMYLIPMRKS 181

Query: 167 -----IRWCGILQRIALVYV------VVALIETLTTKRRPNVLEPRHLSIF------TAY 209
                 +W  + +R  L+++      V+A +   T  R   V +   +  F         
Sbjct: 182 LYEFVSKWVKLFRRPILMFLIGFSFSVLAHLFNFTHVRVMGVFQRIAICYFIVSLILVMV 241

Query: 210 QWQWIGGFIAFV---IYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHL-GPACNAV 265
            W ++   I  +   IYI  T+ LYVP     E    G       CG RG L  P C A 
Sbjct: 242 PWTFVQILIVVLFQAIYITVTFGLYVP----MEGEGDG-------CGTRGELYEPRCTAE 290

Query: 266 GYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAIL 325
           GY+DR +   +H+Y                             +  ++PEG LS++SA+ 
Sbjct: 291 GYIDRLILSRDHIY----------------------------LQDSYDPEGFLSSLSAVT 322

Query: 326 SGTIGI-HYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCF 384
           +  +GI  +       K    RL +W  MG  +++ A+ + +   +PI K+L++ S+   
Sbjct: 323 NAFVGILAFKVARAAGKDAHKRLNYWFIMGSLMILAALAIDYA-GLPIGKKLWTTSFALI 381

Query: 385 TAGAA 389
           T+G A
Sbjct: 382 TSGIA 386


>gi|242062186|ref|XP_002452382.1| hypothetical protein SORBIDRAFT_04g024716 [Sorghum bicolor]
 gi|241932213|gb|EES05358.1| hypothetical protein SORBIDRAFT_04g024716 [Sorghum bicolor]
          Length = 108

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 39/86 (45%), Positives = 63/86 (73%), Gaps = 1/86 (1%)

Query: 304 APSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAII 363
           APSWC+APF+PEGLLS++ AI++  IG+ +GH++IHF+ H  R+ +W+ + F +L +A +
Sbjct: 13  APSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHRGRITNWLILSFSMLALAFL 72

Query: 364 LHFTNAIPINKQLYSFSYVCFTAGAA 389
           + F+  + ++K LY+ SY   TAG A
Sbjct: 73  MDFSG-MRMSKPLYTMSYTLATAGTA 97


>gi|389809458|ref|ZP_10205319.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter thiooxydans LCS2]
 gi|388441722|gb|EIL97972.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter thiooxydans LCS2]
          Length = 353

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 54/140 (38%), Positives = 80/140 (57%), Gaps = 14/140 (10%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           KR+A+LDA RG TV  M+LV+D G     Y  + H+ WNGCT  D V PFFLF+VGV++A
Sbjct: 2   KRLASLDALRGCTVAAMLLVNDPGDWSHVYWPLAHAAWNGCTPTDLVFPFFLFVVGVSVA 61

Query: 118 LA----LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           LA    L++    +   +  ++R L++L  G+ +           ++ +   H+R+ G+L
Sbjct: 62  LAILPRLEQGASPSALTRAAMWRALRILALGVAIN-------LLAAWWLPQAHLRFPGVL 114

Query: 174 QRIALVYVVVALIETLTTKR 193
           QRIAL +  VAL    T  R
Sbjct: 115 QRIALCFAGVALFAVYTKPR 134


>gi|418746616|ref|ZP_13302939.1| PF07786 family protein [Leptospira santarosai str. CBC379]
 gi|410792596|gb|EKR90528.1| PF07786 family protein [Leptospira santarosai str. CBC379]
          Length = 375

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 87/339 (25%), Positives = 147/339 (43%), Gaps = 86/339 (25%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q   R+ +LD FRG+TV+ MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG 
Sbjct: 7   QNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGT 66

Query: 115 AIALALKKVPKINGA--VKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWC 170
           +I ++L     IN +     I  R+  L+  G+ L   G +S A            +R  
Sbjct: 67  SIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFA-----------ELRIP 115

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQRI  VY VVA +  +   ++                           I + +   L
Sbjct: 116 GVLQRIGFVYWVVASLCLVFPGKK---------------------------ILVFSVAIL 148

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
            +  W  ++ +  G  + +V       +G       ++DR ++G  HL+           
Sbjct: 149 LIHTWILTQIALPG--ESVVSLEQGKDIG------AWIDRTIFGEKHLW----------- 189

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                        + +W     +PEG LS ++++++   G+  G +L        R +  
Sbjct: 190 -----------RFSKTW-----DPEGFLSGVASVVTTLFGVLCGFILF------LRERRN 227

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
             +G G+L   + L +  ++P+NK L++ SY  +TAG +
Sbjct: 228 KILGLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLS 266


>gi|375146803|ref|YP_005009244.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361060849|gb|AEV99840.1| Protein of unknown function DUF2261, transmembrane [Niastella
           koreensis GR20-10]
          Length = 376

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 102/370 (27%), Positives = 147/370 (39%), Gaps = 102/370 (27%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFIVG 113
            Q+ KR   LD FRG+T+  MI+V+   D    YA + H+ W+G T  D V P FLF VG
Sbjct: 1   MQQQKRFLALDVFRGMTICFMIIVNTSPDGSHTYAPLLHAQWHGFTPTDLVFPSFLFAVG 60

Query: 114 VAIALALKKVPKING--AVKKIIFRTLKLLFWGIIL-----------QGGYSHAPDALSY 160
            A++  + +    +    + KI+ RTL +   G ++            G Y+  P     
Sbjct: 61  NAMSFVMPRWENASTGFVLGKILKRTLLIFILGYLMYWFPFVRMDKVTGVYAFYP----- 115

Query: 161 GVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAF 220
               +  R  G+LQRIAL Y   +L+                              F+ F
Sbjct: 116 ---FEKTRVFGVLQRIALAYCFASLMLY----------------------------FLKF 144

Query: 221 VIYIITTYSLYVPNWS----FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
              II T ++ +  W     F + +D               L  A NAV  +D  L G +
Sbjct: 145 RATIIITAAILLLYWPVLYFFGDSAD--------------PLSLAGNAVLKLDLWLLGPD 190

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           HLY                               PF+PEG LST  AI +   G   G  
Sbjct: 191 HLYHGE--------------------------GVPFDPEGFLSTFPAIANVVGGYWVGRF 224

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTN-AIPINKQLYSFSYVCFTAGAADGCLGI 395
           L    G    L   +  GF LL++A   HF N + PINK+L++ S+V  T G    CL I
Sbjct: 225 LQQKGGTYEALTKLMLAGFALLVLA---HFWNYSFPINKKLWTSSFVLHTVGI--DCLII 279

Query: 396 TNSIFIPEMD 405
              ++I ++ 
Sbjct: 280 AAIVYIADIQ 289


>gi|383934719|ref|ZP_09988159.1| heparan-alpha-glucosaminide N-acetyltransferase [Rheinheimera
           nanhaiensis E407-8]
 gi|383704254|dbj|GAB58250.1| heparan-alpha-glucosaminide N-acetyltransferase [Rheinheimera
           nanhaiensis E407-8]
          Length = 397

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 91/354 (25%), Positives = 161/354 (45%), Gaps = 68/354 (19%)

Query: 55  LLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFI 111
           L +  + R+  LD  RGLT+  MILV++ G     Y+ + H+ W+G T+ D + PFF+ I
Sbjct: 13  LAKLNTNRMLALDVLRGLTITAMILVNNPGSWNYVYSPLLHAQWHGWTITDLIFPFFIVI 72

Query: 112 VGVAIALALKKVPKINGA--VKKIIFRTLKLLFWGIILQGGYSHAPDA-LSYGVD-MKHI 167
           VG+++ L+L++    N    +++ + R+ KL   G++L   Y +  D   +Y  D +  +
Sbjct: 73  VGMSLQLSLRQHSLNNKGPLIRQALLRSGKLFGLGLLLALFYYNFRDPEFNYVEDRLLTV 132

Query: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227
           RW G+LQRI LVY+   LI     +R   +     ++++ A  W                
Sbjct: 133 RWLGVLQRIGLVYLATVLIVLYFGQRGRLLWLLGLMAVYLAGLW---------------- 176

Query: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287
              ++P   + +   H           RG L    + V ++D+ + G NH+Y        
Sbjct: 177 ---WLP---YQDAQGH---------EFRGLLLFGNSFVAWLDQLVLGANHVYY------- 214

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL 347
                        R   P      F+PEGL ST+ AI S   G+     L   +  + ++
Sbjct: 215 -------------RSATPF----AFDPEGLWSTLPAIASCLTGVLMAQWLQSERSLAQKI 257

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG----AADGCLGITN 397
           +  +  G   + +A + HF+  +P+NK L++ S+V  ++G    A   CL + +
Sbjct: 258 RGLLLCGVVAVWLAELWHFS--LPVNKSLWTPSFVLLSSGYCAIALAACLWLCD 309


>gi|421110364|ref|ZP_15570862.1| PF07786 family protein [Leptospira santarosai str. JET]
 gi|410804289|gb|EKS10409.1| PF07786 family protein [Leptospira santarosai str. JET]
          Length = 375

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 87/339 (25%), Positives = 147/339 (43%), Gaps = 86/339 (25%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q   R+ +LD FRG+TV+ MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG 
Sbjct: 7   QNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGT 66

Query: 115 AIALALKKVPKINGA--VKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWC 170
           +I ++L     IN +     I  R+  L+  G+ L   G +S A            +R  
Sbjct: 67  SIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFA-----------ELRIP 115

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQRI  VY VVA +  +   ++                           I + +   L
Sbjct: 116 GVLQRIGFVYWVVASLCLVFPGKK---------------------------ILVFSVPIL 148

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
            +  W  ++ +  G  + +V       +G       ++DR ++G  HL+           
Sbjct: 149 LIHTWILTQIALPG--ESVVSLEQGKDIG------AWIDRTIFGEKHLW----------- 189

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                        + +W     +PEG LS ++++++   G+  G +L        R +  
Sbjct: 190 -----------RFSKTW-----DPEGFLSGVASVVTTLFGVLCGFILF------LRERRN 227

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
             +G G+L   + L +  ++P+NK L++ SY  +TAG +
Sbjct: 228 KILGLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLS 266


>gi|90022681|ref|YP_528508.1| hypothetical protein Sde_3039 [Saccharophagus degradans 2-40]
 gi|89952281|gb|ABD82296.1| conserved hypothetical protein [Saccharophagus degradans 2-40]
          Length = 363

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 91/337 (27%), Positives = 143/337 (42%), Gaps = 84/337 (24%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
             ++R   LD  RG T+ +MILV+   D G  YA + H+ W+G T+ DFV PFFLFI+G 
Sbjct: 1   MATQRYLALDVMRGATLAMMILVNTPGDWGFVYAPLLHADWHGVTITDFVFPFFLFIIGS 60

Query: 115 AIALALKKVPKINGAV--KKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           A+    +   ++  A+  KKII RT  L   G++L         A  +   +  +R  G+
Sbjct: 61  ALFFTSRSSGQLAPAIKAKKIIKRTALLFTIGLLLH--------AFPFTTALSELRILGV 112

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQRIAL Y + A I  L T +R        L I  AY          ++++I+T  S ++
Sbjct: 113 LQRIALAYGIAAFIVWLPTTQRLMA----ALGILVAY----------WLVFILTDSSYHL 158

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                                       A N V ++D  + G  HL+             
Sbjct: 159 ----------------------------ADNIVRHIDITILGAEHLWQGK---------- 180

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKG---HSARLKH 349
                              F+PEGLLST+ A ++   G     +L+       ++A  + 
Sbjct: 181 ----------------GLAFDPEGLLSTLPAAVNILAGFEATRLLVSQPAGEPNNATSRQ 224

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTA 386
           +    + +  I I L +   +PINK L++ S+V  T+
Sbjct: 225 FKLALYAMCSITIALIWHRWMPINKSLWTSSFVLLTS 261


>gi|301625227|ref|XP_002941812.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Xenopus (Silurana) tropicalis]
          Length = 370

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/152 (35%), Positives = 85/152 (55%), Gaps = 13/152 (8%)

Query: 41  EVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTL 100
           E  ++  GE    Q    +S+R+ +LD FRG ++ +M+ V+  GG Y   +H+PWNG T+
Sbjct: 188 ETSEDNCGE----QSKVPESRRLYSLDTFRGFSLTIMVFVNYGGGGYWFFEHAPWNGLTV 243

Query: 101 ADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHAPD 156
           AD VMP+F+FI+G ++ALA   + K   +    + K+ +RT  L   G+     Y  A  
Sbjct: 244 ADLVMPWFVFIIGTSVALAFNAMLKRGLSRCQLLYKLTWRTCILFAIGVFFL-NYGPADG 302

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALIET 188
            LS+    +  R  G+LQR+   Y V+AL+ T
Sbjct: 303 PLSW----RWARIPGVLQRLGFTYFVIALLHT 330


>gi|374385780|ref|ZP_09643283.1| hypothetical protein HMPREF9449_01669 [Odoribacter laneus YIT
           12061]
 gi|373225482|gb|EHP47816.1| hypothetical protein HMPREF9449_01669 [Odoribacter laneus YIT
           12061]
          Length = 382

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 94/355 (26%), Positives = 149/355 (41%), Gaps = 87/355 (24%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVG 113
            +   R+  LD FRG+T+  MILV+D G     YA + H+ WNG T  D V PFF+FI+G
Sbjct: 1   MKTENRLLALDVFRGITIAGMILVNDPGSWSAVYAPLCHASWNGLTPTDLVFPFFMFIMG 60

Query: 114 VAIALALKKVPKI--NGAVKKIIFRTLKLLFWGIIL--------------QGGYSHAPDA 157
           +++  +L++   +   GAV KI  R + +   G+ +               G ++     
Sbjct: 61  ISMYFSLRRYNSLFSRGAVAKIFRRAVLIFLIGLGINWFALWFGTFMSMGNGEFTFWERF 120

Query: 158 LSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                 +  IR  G+LQR+AL Y+  A++           + PR+  +FTA         
Sbjct: 121 TQNIFPVADIRILGVLQRLALAYLGGAILCL--------GIRPRY-QLFTA--------- 162

Query: 218 IAFVIYIITTYSLYVPNWSF--SEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGI 275
              V+ ++  + + V    F  SEH                      N +  VDR + G+
Sbjct: 163 ---VMILVGYFVILVVGEGFIRSEH----------------------NILSVVDRAVLGV 197

Query: 276 NHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGH 335
            HLY                        A S     F+PEGLLST+        G+  G 
Sbjct: 198 RHLYG---------------------GGASSGAGMAFDPEGLLSTLPCFAHVLFGVCMGR 236

Query: 336 VLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
           +L   K +  R++     G  LL    +  ++ A P+NK+++S +YV  + GAA 
Sbjct: 237 MLGEVKENEIRIRQLFIFGTILLFAGYL--WSYACPVNKRIWSPTYVLISCGAAS 289


>gi|195565141|ref|XP_002106164.1| GD16714 [Drosophila simulans]
 gi|194203536|gb|EDX17112.1| GD16714 [Drosophila simulans]
          Length = 318

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/143 (37%), Positives = 83/143 (58%), Gaps = 15/143 (10%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           + KR+ +LD FRGL++VLMI V+  GG YA I+H+ WNG  LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADIVFPSFLWIMGVCIPL 241

Query: 119 ALKKVPKINGAVK-----KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           ++K      G+ K     +I+ R++KL   G+ L        +++S G +++ +R  G+L
Sbjct: 242 SVKSQLS-RGSSKARICLRILVRSIKLFVIGLCL--------NSMS-GPNLEQLRVMGVL 291

Query: 174 QRIALVYVVVALIETLTTKRRPN 196
           QR  + Y+       +   RRP 
Sbjct: 292 QRFGVAYLGAHQSAKILAARRPR 314


>gi|409990365|ref|ZP_11273749.1| hypothetical protein APPUASWS_05524 [Arthrospira platensis str.
           Paraca]
 gi|291567406|dbj|BAI89678.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409938771|gb|EKN80051.1| hypothetical protein APPUASWS_05524 [Arthrospira platensis str.
           Paraca]
          Length = 378

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 100/343 (29%), Positives = 145/343 (42%), Gaps = 84/343 (24%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           L     R+ +LD FRG+ +  MILV++ G     Y  + H+ W+GCT  D V P FL IV
Sbjct: 3   LINTKIRLISLDVFRGIAIAAMILVNNPGSWGYMYPVLQHAQWHGCTPTDVVFPSFLLIV 62

Query: 113 GVAIALALKKVP---KING-----AVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDM 164
           GVAIA +L K     ++ G     +V   I R   LLF   ++  G+ +         D+
Sbjct: 63  GVAIAFSLSKFSPEHRLGGDGVPPSVYSRIGRRCLLLFLLGLILNGFPN--------YDL 114

Query: 165 KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYI 224
            +IR  G+LQRIA+ Y + A I  L   RR   L    +SIFT      IG ++A  +  
Sbjct: 115 ANIRIMGVLQRIAIAYGLSA-IAILNLSRRQLWL----ISIFTL-----IGYWLAMTMIP 164

Query: 225 ITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVW 284
           +  YS                          G+L P  N   ++D+ + G +HL      
Sbjct: 165 VPGYS-------------------------PGNLSPEGNLGAFIDQTILGSHHL------ 193

Query: 285 SRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS 344
                                 W   P++PEGL ST  A ++  IG   G  L     +S
Sbjct: 194 ----------------------WRGGPYDPEGLFSTAPATVTVIIGYLTGEWLKSQPRNS 231

Query: 345 ARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
             + + V      L++  +       PINK L++ SYV  TAG
Sbjct: 232 LTVINLVMFALSSLVVGYLWGIW--FPINKALWTSSYVLVTAG 272


>gi|374312698|ref|YP_005059128.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358754708|gb|AEU38098.1| protein of unknown function DUF1624 [Granulicella mallensis
           MP5ACTX8]
          Length = 383

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 88/346 (25%), Positives = 150/346 (43%), Gaps = 79/346 (22%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           + S R+ ++D  RGLTV  MILV++ G    AY  ++HS WNG T  D V P FLFI+G+
Sbjct: 13  RPSSRLLSIDVLRGLTVAFMILVNNNGNNDLAYRALNHSLWNGFTPTDLVFPTFLFIMGI 72

Query: 115 AIALALK----KVPKINGAVKKIIFRTLKLLFWGIILQG-GYSHAPDALSYGVDMKHIRW 169
           ++ L+      K       +  I  R   L+F+G+++ G  Y H          +  +R 
Sbjct: 73  SMVLSFSAHRAKATSNTVMLTSIGRRFALLIFFGLVVNGFPYFH----------LDTLRI 122

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRIA+ Y++ AL++ +T +     + PR +  F A    W+      ++  +    
Sbjct: 123 YGVLQRIAVCYLLAALLQLVTDR-----IAPRVVLFFAAVIGYWV------LLRFVPVPG 171

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
             +P   F    DH +                 N V ++DR      H++   ++ +   
Sbjct: 172 HGIPGRDFPL-LDHDI-----------------NLVAWLDR------HIFPHRLFEKTR- 206

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                                  +PEGLLS I A  S  +G+  G  +   +    +L  
Sbjct: 207 -----------------------DPEGLLSDIPAFASTILGLLAGAWIKQARPAGQKLMG 243

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
               G  L +  ++  ++ + PINK+L++ SYV +  G +   L +
Sbjct: 244 LFGAGIALAVAGLL--WSQSFPINKKLWTSSYVLYAGGLSILLLAV 287


>gi|359728547|ref|ZP_09267243.1| hypothetical protein Lwei2_17159 [Leptospira weilii str.
           2006001855]
          Length = 383

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 91/358 (25%), Positives = 151/358 (42%), Gaps = 90/358 (25%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q   R+ +LD FRG+TV+ MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG 
Sbjct: 7   QNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAKWNGCTPTDLVFPFFLFAVGG 66

Query: 115 AIALALKKVPKIN------GAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           +I ++L     I+      G  K+ +   L  L      +  ++              +R
Sbjct: 67  SIPISLYSKNGIDRSRIWVGICKRSVNLILLGLLLNFFGEWSFA-------------ELR 113

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQRI  VY VVA +  +   ++                          +I++I   
Sbjct: 114 IPGVLQRIGFVYWVVASLYLIFPGKK-------------------------VLIFLIPV- 147

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            L V  W  +  +  G     ++ G         +   ++DR + G  HL+         
Sbjct: 148 -LLVHTWILTHIAPPGEAMVSLEQGK--------DIGAWIDRVIIGEKHLW--------- 189

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                        + + +W     +PEGLLS +++I +   G+  G +L   +G   R +
Sbjct: 190 -------------KFSKTW-----DPEGLLSGVASIATSLFGVLCGFILFLREG-GGRSR 230

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
              + G G L   + L +  ++P+NK L++ SY  +TAG A  C+G     F   ++W
Sbjct: 231 VLSTFGLGFLFTFVGLLWDQSLPMNKSLWTGSYAAYTAGLAFFCIG-----FFEYLNW 283


>gi|352080530|ref|ZP_08951469.1| putative transmembrane protein [Rhodanobacter sp. 2APBS1]
 gi|351683811|gb|EHA66887.1| putative transmembrane protein [Rhodanobacter sp. 2APBS1]
          Length = 353

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 90/337 (26%), Positives = 143/337 (42%), Gaps = 87/337 (25%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           KR+A+LDA RG TV  M+LV+D G     Y  ++H+ W+GCT  D V PFFLF+VGV++A
Sbjct: 2   KRLASLDALRGCTVAAMLLVNDPGDWGHVYWPLEHAAWHGCTPTDLVFPFFLFVVGVSVA 61

Query: 118 LA----LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           LA    L++    +   +   +R L++L  G+ +           ++ +   H+R+ G+L
Sbjct: 62  LAILPRLEQGAAPSALTRAATWRALRILALGVAIN-------LLAAWLLPQAHLRFPGVL 114

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRIAL +  VAL    T  R                QW  I              +L + 
Sbjct: 115 QRIALCFAGVALFAIHTKPRT---------------QWWAIA-------------ALLIG 146

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
            W             +++ G  G L P  N    VD  ++G      DP   R       
Sbjct: 147 YWG------------LLRLG--GSLEPWTNLASRVDSAVFGRFVYLIDPASGRGH----- 187

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
                              +PEGLLST+ ++    +G+  G  L        + +  +  
Sbjct: 188 -------------------DPEGLLSTLPSLAGTLLGLRMGCWL-----RREQFRTLLLA 223

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
           G   L++  +  ++  +P NK L++ S+V +T G A 
Sbjct: 224 GIACLLLGAL--WSPWLPFNKNLWTPSFVLWTTGWAT 258


>gi|343082900|ref|YP_004772195.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342351434|gb|AEL23964.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 365

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/134 (40%), Positives = 77/134 (57%), Gaps = 18/134 (13%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           ++    R+ +LDA RG T+  MILV+  G     Y  + H+ W+G T+ DF+ PFF+F+V
Sbjct: 1   MKPTITRLISLDALRGFTIAAMILVNYPGSWSHVYPPLLHAEWHGMTMTDFIFPFFIFMV 60

Query: 113 GVAIALALKK-----VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHI 167
           GV++A A  K     VPK  G  KKII R +KL   GI L    +  PD      D  H+
Sbjct: 61  GVSVAFAYSKRLDEGVPK-AGMYKKIIIRAIKLFVLGIFL----NLIPD-----FDFSHM 110

Query: 168 RWCGILQRIALVYV 181
           R  G+LQRI++V++
Sbjct: 111 RIAGVLQRISIVFL 124


>gi|345516841|ref|ZP_08796327.1| hypothetical protein BSEG_03945 [Bacteroides dorei 5_1_36/D4]
 gi|229437727|gb|EEO47804.1| hypothetical protein BSEG_03945 [Bacteroides dorei 5_1_36/D4]
          Length = 363

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 89/368 (24%), Positives = 161/368 (43%), Gaps = 80/368 (21%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
              KR+  LD  RG+T+  MILV++ G     YA ++H+ +NG T  D V PFF+FI+G+
Sbjct: 4   NTPKRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHAAFNGLTPTDLVFPFFMFIMGI 63

Query: 115 AIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           +  ++L+K      +  ++KI+ RT+ +   G++L      A    ++ ++ +  R+ G+
Sbjct: 64  STYISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGV 120

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QR+A+ Y V +L+  +T K +                      F A ++  +  Y L +
Sbjct: 121 MQRLAIGYGVTSLVA-ITVKHKY---------------------FPAIILVTLAAYFLLL 158

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                   S+                    N V   D    G +H+Y +           
Sbjct: 159 ATGDGFNQSE-------------------TNVVARFDAWALGTSHMYHE----------- 188

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                G +  D          PEGLLST+ A+    +G + G +L+  K ++ +++    
Sbjct: 189 -----GGMAFD----------PEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRLFL 233

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMD-WDECDA 411
           +G  L     +L +    PINK+++S ++V  T G A   L +   I+I +M  +    A
Sbjct: 234 IGTILTFAGFLLSY--GCPINKKVWSPTFVIITCGLASSFLALL--IWIIDMKGYQNWCA 289

Query: 412 CFRAWSTR 419
            FR++   
Sbjct: 290 FFRSFGVN 297


>gi|212693969|ref|ZP_03302097.1| hypothetical protein BACDOR_03493 [Bacteroides dorei DSM 17855]
 gi|265751179|ref|ZP_06087242.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|423228550|ref|ZP_17214956.1| hypothetical protein HMPREF1063_00776 [Bacteroides dorei
           CL02T00C15]
 gi|423243815|ref|ZP_17224891.1| hypothetical protein HMPREF1064_01097 [Bacteroides dorei
           CL02T12C06]
 gi|212663501|gb|EEB24075.1| hypothetical protein BACDOR_03493 [Bacteroides dorei DSM 17855]
 gi|263238075|gb|EEZ23525.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|392635957|gb|EIY29845.1| hypothetical protein HMPREF1063_00776 [Bacteroides dorei
           CL02T00C15]
 gi|392644181|gb|EIY37924.1| hypothetical protein HMPREF1064_01097 [Bacteroides dorei
           CL02T12C06]
          Length = 363

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 89/365 (24%), Positives = 161/365 (44%), Gaps = 80/365 (21%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           KR+  LD  RG+T+  MILV++ G     YA ++H+ +NG T  D V PFF+FI+G++  
Sbjct: 7   KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66

Query: 118 LALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
           ++L+K      +  ++KI+ RT+ +   G++L      A    ++ ++ +  R+ G++QR
Sbjct: 67  ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 123

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW 235
           +A+ Y V +L+  +T K +                      F A ++  +  Y L +   
Sbjct: 124 LAIGYGVTSLVA-ITVKHKY---------------------FPAIILVTLAAYFLLLATG 161

Query: 236 SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSP 295
                S+                    N V   D    G +H+Y +              
Sbjct: 162 DGFNQSE-------------------TNVVARFDAWALGTSHMYHE-------------- 188

Query: 296 NSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF 355
             G +  D          PEGLLST+ A+    +G + G +L+  K ++ +++    +G 
Sbjct: 189 --GGMAFD----------PEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRLFLIGT 236

Query: 356 GLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMD-WDECDACFR 414
            L     +L +    PINK+++S ++V  T G A   L +   I+I +M  +    A FR
Sbjct: 237 ILTFAGFLLSY--GCPINKKVWSPTFVIITCGLASSFLALL--IWIIDMKGYQNWCAFFR 292

Query: 415 AWSTR 419
           ++   
Sbjct: 293 SFGVN 297


>gi|422005552|ref|ZP_16352731.1| hypothetical protein LSS_18678 [Leptospira santarosai serovar
           Shermani str. LT 821]
 gi|417255773|gb|EKT85231.1| hypothetical protein LSS_18678 [Leptospira santarosai serovar
           Shermani str. LT 821]
          Length = 375

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 88/340 (25%), Positives = 149/340 (43%), Gaps = 86/340 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            Q   R+ +LD FRG+TV+ MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG
Sbjct: 6   TQNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVG 65

Query: 114 VAIALALKKVPKINGA--VKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRW 169
            +I ++L     IN +     I  R+  L+  G+ L   G +S A            +R 
Sbjct: 66  TSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFA-----------ELRI 114

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRI  VY VVA +  +   ++                       + F++ I     
Sbjct: 115 PGVLQRIGFVYWVVASLCLVFPGKK----------------------ILVFLVPI----- 147

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           L +  W  ++ +  G  + +V       +G       ++DR ++G  HL+          
Sbjct: 148 LLIHTWILTQIALPG--ESVVSLEQGKDIG------AWIDRTIFGEKHLW---------- 189

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                         + +W     +PEG LS ++++++   G+  G +L        R + 
Sbjct: 190 ------------RFSKTW-----DPEGFLSGVASVVTTLFGVLCGFILF------LRERR 226

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
              +G G+L   + L +  ++P+NK L++ SY  +TAG +
Sbjct: 227 NKILGLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLS 266


>gi|319641808|ref|ZP_07996487.1| transmembrane protein [Bacteroides sp. 3_1_40A]
 gi|345518546|ref|ZP_08797993.1| hypothetical protein BSFG_02387 [Bacteroides sp. 4_3_47FAA]
 gi|254835931|gb|EET16240.1| hypothetical protein BSFG_02387 [Bacteroides sp. 4_3_47FAA]
 gi|317386564|gb|EFV67464.1| transmembrane protein [Bacteroides sp. 3_1_40A]
          Length = 363

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 148/344 (43%), Gaps = 79/344 (22%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
              KR+  LD  RG+T+  MILV++ G     Y  ++H+ +NG T  D V PFF+FI+G+
Sbjct: 4   NTPKRLLALDILRGITIAGMILVNNPGSWGHVYTPLEHAAFNGLTPTDLVFPFFMFIMGI 63

Query: 115 AIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           +  ++L+K      +  ++KI+ RT+ +   G+ L      A    ++ ++ + +R+ G+
Sbjct: 64  STYISLRKYNFTYSHATLRKIVKRTVVIFCIGLFLN---LLAKSVFTHHLNFEELRYLGV 120

Query: 173 LQRIALVYVVVALIE-TLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           +QR+A+ Y V +L+  T+  K  P +                          I+ T ++Y
Sbjct: 121 MQRLAIGYGVTSLVAITVKHKYFPAI--------------------------ILVTLAVY 154

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
                           +++     G    A N V   D    G +H+Y D          
Sbjct: 155 ----------------FLLLAMGDGFNLSATNIVARFDVWALGTSHMYHDG--------- 189

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV 351
                               F+PEGLLST+ A+    +G + G +L   K +  +++   
Sbjct: 190 -----------------GMAFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQRLF 232

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
            +G  L     +L +    PINK+++S ++V  T G A   L +
Sbjct: 233 LVGTILTFAGFLLSY--GCPINKKVWSPTFVITTCGLASSFLAL 274


>gi|402878146|ref|XP_003902762.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Papio
           anubis]
          Length = 708

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 87/305 (28%), Positives = 140/305 (45%), Gaps = 54/305 (17%)

Query: 97  GCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRTLKLLFWGIIL-QGG 150
           G T+AD V P+F+FI+G +I L++  + +  G  K     KI +R+  L+  GII+    
Sbjct: 347 GLTVADLVFPWFVFIMGSSIFLSMTSILQ-RGCSKFRLLGKIAWRSFLLICIGIIIVNPN 405

Query: 151 YSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN--VLEPRHLSI--F 206
           Y   P      +    +R  G+LQR+ + Y VVA++E L  K  P     E   LS+   
Sbjct: 406 YCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDI 459

Query: 207 TAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLG--PACN- 263
           T+   QW+       +++  T+ L VP          G     +  G  G  G  P C  
Sbjct: 460 TSSWPQWLLILALEGLWLGLTFLLPVP----------GCPTGYLGPGGIGDFGKYPNCTG 509

Query: 264 -AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTIS 322
            A GY+DR L G +HLY  P  + L    ++                  ++PEG+L TI+
Sbjct: 510 GAAGYIDRLLLGDDHLYQHPSSTVLYHTEVA------------------YDPEGILGTIN 551

Query: 323 AILSGTIGIHYGHVLIHFKGHSA----RLKHWVS-MGFGLLIIAIILHFTNAIPINKQLY 377
           +I+   +G+  G +L+++K  +     R   W   +G   +++  +      IP+NK L+
Sbjct: 552 SIVMAFLGVQAGKILLYYKAQTKDILIRFTAWCCILGLISVVLTKVSENEGFIPVNKNLW 611

Query: 378 SFSYV 382
           S SYV
Sbjct: 612 SLSYV 616


>gi|325286182|ref|YP_004261972.1| heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
           lytica DSM 7489]
 gi|324321636|gb|ADY29101.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
           lytica DSM 7489]
          Length = 361

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 60/158 (37%), Positives = 85/158 (53%), Gaps = 11/158 (6%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
            RV ++D FRG+T+VLMILV++ G     YA   H+ W+G T  D V PFFLFIVG++I 
Sbjct: 3   NRVVSVDIFRGITIVLMILVNNPGTWSSVYAPFLHADWHGYTPTDLVFPFFLFIVGISIV 62

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
            A           +KI+ R+LKL+  G+ L G ++    +  +  D   IR+ G+LQRI 
Sbjct: 63  YAYHTKEVTGKTYRKIVIRSLKLIGLGLFL-GAFTL---SFPFFKDFNDIRFPGVLQRIG 118

Query: 178 LVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIG 215
           LV+   A+   L  K     L     +I   Y W W+G
Sbjct: 119 LVFFFTAI---LFIKLNWKALVAVCAAILIMY-WLWMG 152


>gi|417780880|ref|ZP_12428636.1| PF07786 family protein [Leptospira weilii str. 2006001853]
 gi|410778851|gb|EKR63473.1| PF07786 family protein [Leptospira weilii str. 2006001853]
          Length = 383

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 151/358 (42%), Gaps = 90/358 (25%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q   R+ +LD FRG+TV+ MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG 
Sbjct: 7   QNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAKWNGCTPTDLVFPFFLFAVGG 66

Query: 115 AIALALKKVPKIN------GAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           +I ++L     I+      G  K+ +   L  L      +  ++              +R
Sbjct: 67  SIPISLYSKNGIDRSRIWVGICKRSVNLILLGLLLNFFGEWSFA-------------ELR 113

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQRI  VY VVA +  +   ++                          +I++I   
Sbjct: 114 IPGVLQRIGFVYWVVASLYLIFPGKK-------------------------VLIFLIPV- 147

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            L V  W  +  +  G     ++ G         +   ++DR + G  HL+         
Sbjct: 148 -LLVHTWILTHIAPPGEAMVSLEQGK--------DIGAWIDRVIIGEKHLW--------- 189

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                        + + +W     +PEGLLS +++I +   G+  G +L   +G   R +
Sbjct: 190 -------------KFSKTW-----DPEGLLSGVASIATSLFGVLCGFILFLREG-GGRSR 230

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDW 406
              + G G L   + L +  ++P+NK L++ SY  +TAG +  C+G     F   ++W
Sbjct: 231 VLSTFGLGFLFTFVGLLWDQSLPMNKSLWTGSYAAYTAGLSFFCIG-----FFEYLNW 283


>gi|150004749|ref|YP_001299493.1| transmembrane protein [Bacteroides vulgatus ATCC 8482]
 gi|294775179|ref|ZP_06740705.1| putative membrane protein [Bacteroides vulgatus PC510]
 gi|149933173|gb|ABR39871.1| putative transmembrane protein [Bacteroides vulgatus ATCC 8482]
 gi|294450991|gb|EFG19465.1| putative membrane protein [Bacteroides vulgatus PC510]
          Length = 363

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 148/344 (43%), Gaps = 79/344 (22%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
              KR+  LD  RG+T+  MILV++ G     Y  ++H+ +NG T  D V PFF+FI+G+
Sbjct: 4   NTPKRLLALDILRGITIAGMILVNNPGSWGYVYTPLEHAAFNGLTPTDLVFPFFMFIMGI 63

Query: 115 AIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           +  ++L+K      +  ++KI+ RT+ +   G+ L      A    ++ ++ + +R+ G+
Sbjct: 64  STYISLRKYNFTYSHATLRKIVKRTVVIFCIGLFLN---LLAKSVFTHHLNFEELRYLGV 120

Query: 173 LQRIALVYVVVALIE-TLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           +QR+A+ Y V +L+  T+  K  P +                          I+ T ++Y
Sbjct: 121 MQRLAIGYGVTSLVAITVKHKYFPAI--------------------------ILVTLAVY 154

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
                           +++     G    A N V   D    G +H+Y D          
Sbjct: 155 ----------------FLLLAMGDGFNLSATNIVARFDVWALGTSHMYHDG--------- 189

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV 351
                               F+PEGLLST+ A+    +G + G +L   K +  +++   
Sbjct: 190 -----------------GMAFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQRLF 232

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
            +G  L     +L +    PINK+++S ++V  T G A   L +
Sbjct: 233 LVGTILTFAGFLLSY--GCPINKKVWSPTFVITTCGLASSFLAL 274


>gi|311747386|ref|ZP_07721171.1| membrane protein [Algoriphagus sp. PR1]
 gi|126579104|gb|EAZ83268.1| membrane protein [Algoriphagus sp. PR1]
          Length = 381

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 91/334 (27%), Positives = 141/334 (42%), Gaps = 73/334 (21%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
            R   LD  RGLT+  MI+V+ AG     YA + H+ W+G T  D V P FLF+VG A++
Sbjct: 12  NRYLALDVLRGLTIAFMIVVNSAGDWSNLYAPLAHAKWHGFTPTDLVFPTFLFVVGNAMS 71

Query: 118 LALKKVPKI--NGAVKKIIFRTLKLLFWGIILQG--GYSHAPDALSYGVDMKHIRWCGIL 173
            ++KK+ ++  +   KK+  RTL +   G +L     Y  +       +++  +R  G+L
Sbjct: 72  FSMKKLQEMPTSAFFKKVGKRTLLIFLIGWLLNAFPFYDISETGNFSLINITEVRLFGVL 131

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRIAL Y   A+I  +   R                   WI   IA + Y    Y     
Sbjct: 132 QRIALCYFFAAIILYIGGVRL-----------------GWIFSGIALLTYWGIMY----- 169

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
              F + SD          G+ G      NA   +D  L G++ +Y              
Sbjct: 170 --VFGDSSD--------PYGLTG------NAAIKLDLSLIGVDRMYGGE----------- 202

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
                            PF+PEGLLST+ +I++   G   G ++  +      +K  +  
Sbjct: 203 ---------------GIPFDPEGLLSTLPSIVNVIAGYIIGKMVQKYGNTLESIKKLLIG 247

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
              L+++A I       PINK++++ SYV  T G
Sbjct: 248 AVVLIVLAYIWDI--VFPINKKIWTSSYVLLTVG 279


>gi|320106288|ref|YP_004181878.1| hypothetical protein AciPR4_1053 [Terriglobus saanensis SP1PR4]
 gi|319924809|gb|ADV81884.1| hypothetical protein AciPR4_1053 [Terriglobus saanensis SP1PR4]
          Length = 394

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 92/333 (27%), Positives = 147/333 (44%), Gaps = 75/333 (22%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           ++R+ ++D  RGLTV  MILV+D G    AYA +DH+PWNG T  D V P FLF+VG +I
Sbjct: 9   TQRILSVDVLRGLTVAFMILVNDPGDGHVAYAPLDHAPWNGWTPTDMVFPTFLFLVGCSI 68

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
             ++    K   +  +I  + ++   + + +       P    YG     +R  G+L RI
Sbjct: 69  VFSITSRLKRGDSKSRIALQVIRRTIYLLAINYAIRLIPQ-FHYG----RMRLFGVLPRI 123

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL--YVPN 234
           A+ Y++ AL+     + R                  WI   +A V  ++  ++L  +VP 
Sbjct: 124 AICYLIAALLFLWLQRAR------------------WIA--VAVVTLLVGYWALMRFVP- 162

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPAC----NAVGYVDRELWGINHLYSDPVWSRLEAC 290
                         I   GM  H  P      N   Y+DR   G N              
Sbjct: 163 --------------IPGAGMPVHDFPLMDQFNNLPSYIDR---GFNDF------------ 193

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK-H 349
           T    ++G L E          +PEG+LST+ A+ +  +G+  G +L   + ++  +K  
Sbjct: 194 TQRFLHTGSLYEKTR-------DPEGILSTLPAVGTALLGVLNGKLL---RSNTPPVKVS 243

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYV 382
            + +G G+L IA+   +    P NK L++ SYV
Sbjct: 244 AILLGGGVLSIALGYLWNPWFPFNKNLWTSSYV 276


>gi|393786264|ref|ZP_10374400.1| hypothetical protein HMPREF1068_00680 [Bacteroides nordii
           CL02T12C05]
 gi|392659893|gb|EIY53510.1| hypothetical protein HMPREF1068_00680 [Bacteroides nordii
           CL02T12C05]
          Length = 387

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 97/385 (25%), Positives = 157/385 (40%), Gaps = 98/385 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +  SKR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G
Sbjct: 3   KTTSKRLLALDILRGITIAGMIMVNNPGSWSYVYAPLGHAKWNGLTPTDLVFPFFMFIMG 62

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGI-------------ILQGGYSHAPDAL 158
           ++  ++L+K      + A  KI+ RT+ +   G+              L G        L
Sbjct: 63  ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLSGEEISFLSRL 122

Query: 159 SYGV-DMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
              V    HIR  G++QR+AL Y   A+I  LT K +           +  Y        
Sbjct: 123 GQSVWTFDHIRILGVMQRLALCYGATAIIA-LTMKHK-----------YIPY-------- 162

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
              ++ ++  Y + +   +  E++D                    N +  VDR + G  H
Sbjct: 163 --LIVTLLAGYFILLITGNGFEYND-------------------TNILSVVDRAVLGEAH 201

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y D                               +PEGLLSTI AI    IG   G +L
Sbjct: 202 MYKD----------------------------NGIDPEGLLSTIPAIAHVLIGFCVGKLL 233

Query: 338 IHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITN 397
           +  K  + +L+    +G  L  +  +L +    PINK+++S ++   T G          
Sbjct: 234 MEVKDINEKLERLFLIGTILTFLGFLLSY--GCPINKKIWSPTFAIVTCGLG-------- 283

Query: 398 SIFIPEMDWDECDACFRAWSTRYPS 422
           S F+  + W      +++WS  + S
Sbjct: 284 SSFLALLIWIIDVKGYKSWSRFFES 308


>gi|319952891|ref|YP_004164158.1| heparan-alpha-glucosaminide n-acetyltransferase [Cellulophaga
           algicola DSM 14237]
 gi|319421551|gb|ADV48660.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
           algicola DSM 14237]
          Length = 363

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 95/341 (27%), Positives = 149/341 (43%), Gaps = 82/341 (24%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+  +D FRG+T+VLMILV++ G     YA   H+ W+G T  D V PFFLFIVG +I 
Sbjct: 3   ERIVAVDIFRGMTIVLMILVNNPGTWAAVYAPFLHADWHGYTPTDLVFPFFLFIVGTSIV 62

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
            A           KKI+ R+LKL+  G++L       P    +  D   IR+ G+LQRI 
Sbjct: 63  FAYSTKKPTADTYKKIVSRSLKLIGLGLLLGAFTLVFP----FVKDFSEIRFPGVLQRIG 118

Query: 178 LVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL---YVPN 234
           +V+ + ++                   +F  + W+ +   I   ++I+  Y L   ++P 
Sbjct: 119 VVFFITSI-------------------LFLNFNWKQL---IGVTVFILIGYWLAMGFIPV 156

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
              +   D                    N   YVD  + G +H++ D             
Sbjct: 157 NGIASTFDRA----------------PNNLANYVDLNILG-SHMWKD------------- 186

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
                            ++PEGLLSTI AI S  +G+  G +L+  + +   +   + + 
Sbjct: 187 ----------------DYDPEGLLSTIPAIASCLLGVFTGKILLSKQQNKTTIVLGLGLI 230

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
             ++  A   HF    PINK L+S S+V  TAG A+  L +
Sbjct: 231 LLIVGHA--WHFI--FPINKALWSSSFVLVTAGWANLILAL 267


>gi|333382729|ref|ZP_08474395.1| hypothetical protein HMPREF9455_02561 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332828330|gb|EGK01039.1| hypothetical protein HMPREF9455_02561 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 389

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 152/357 (42%), Gaps = 98/357 (27%)

Query: 55  LLQQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFI 111
           + Q+ S R+ +LD  RG+T+  MI+V+++G     YA + H  W+G T  D V PFF+FI
Sbjct: 1   MTQKPSGRLLSLDVLRGITIAGMIMVNNSGSGEYTYAPLKHVAWDGLTPTDLVFPFFMFI 60

Query: 112 VGVAIALALKK------VPKINGAVKK---IIFRTLKLLFWGIILQGGYSHAPDALSY-- 160
           +G++  ++L+K       P +   +K+   I    L L + G+  +  +   PD L +  
Sbjct: 61  MGISTYISLRKFNFEFNTPTLLKILKRTIVIFLIGLGLSWLGLSFRTYHMLEPDNLGFWE 120

Query: 161 -----GVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIG 215
                  D  H+R  G++QR+AL Y   ++I  +T K +                     
Sbjct: 121 RFFRAITDFGHLRTLGVMQRLALTYGAASII-AITVKHK--------------------- 158

Query: 216 GFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGI 275
               ++ YII T  L            +G +K               + V  VD+ + GI
Sbjct: 159 ----YIPYIIGTTLLAY---FLLLAFGNGFEK------------SEDSIVSIVDQAILGI 199

Query: 276 NHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGH 335
           NH+Y D               SG              +PEGLLSTI AI    IG   G 
Sbjct: 200 NHMYKD---------------SG-----------LAIDPEGLLSTIPAIAHVLIGFCCGA 233

Query: 336 VLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAG 387
           ++++ K +  R+         L I+  IL F   +     PINK++++ ++V  T G
Sbjct: 234 LIMNTKDNDKRISQ-------LFIVGTILTFAGFLLSYGCPINKKIWTPTFVLATCG 283


>gi|409199197|ref|ZP_11227860.1| hypothetical protein MsalJ2_19286 [Marinilabilia salmonicolor JCM
           21150]
          Length = 369

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 96/337 (28%), Positives = 147/337 (43%), Gaps = 86/337 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            ++S+R   LD  RG+T+ LMI V+  G     YA + H+ W+GCT  D V PFFLF+ G
Sbjct: 1   MKQSQRYLALDVLRGMTIALMITVNTPGSWQYIYAPLRHASWHGCTPTDLVFPFFLFVAG 60

Query: 114 VAIALALKKV-PKING-AVKKIIFRTLKLLFWGIILQG--GYSHAPDALSYGVDMKHIRW 169
           V++  +  K    +N  ++K++  RTL +   G+ L     +SH         D   +R 
Sbjct: 61  VSMFFSFGKYGGALNSESLKRLGRRTLLIFVIGLFLNSFPQWSH---------DFSTLRI 111

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G+LQRIAL Y + +LI  L+  R+                  +IGG I  + + I  + 
Sbjct: 112 MGVLQRIALAYGIGSLI-VLSAPRK---------------YIPFIGGGILLIYWGILAWF 155

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
                         G + Y ++           NAV   D+ + G  HLY+         
Sbjct: 156 -------------GGAEPYSLE----------GNAVIPFDKAILGEQHLYTG-------- 184

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                                PF+PEGLLST+ AI++  +G   G ++     ++ R K 
Sbjct: 185 ------------------FGIPFDPEGLLSTVPAIVTVLLGYLTGVII----KNTERAKV 222

Query: 350 WVSMGFGLLIIAIILHFTNAI-PINKQLYSFSYVCFT 385
              +    LI  +I  F   I PINK L++ SYV +T
Sbjct: 223 PGRLALYGLITTVIGRFWGVIFPINKPLWTSSYVLYT 259


>gi|423239671|ref|ZP_17220787.1| hypothetical protein HMPREF1065_01410 [Bacteroides dorei
           CL03T12C01]
 gi|392645711|gb|EIY39434.1| hypothetical protein HMPREF1065_01410 [Bacteroides dorei
           CL03T12C01]
          Length = 363

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 89/368 (24%), Positives = 160/368 (43%), Gaps = 80/368 (21%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
              KR+  LD  RG+T+  MILV++ G     YA ++H  +NG T  D V PFF+FI+G+
Sbjct: 4   NTPKRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHVAFNGLTPTDLVFPFFMFIMGI 63

Query: 115 AIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           +  ++L+K      +  ++KI+ RT+ +   G++L      A    ++ ++ +  R+ G+
Sbjct: 64  STYISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGV 120

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           +QR+A+ Y V +L+  +T K +                      F A ++  +  Y L +
Sbjct: 121 MQRLAIGYGVTSLVA-ITVKHKY---------------------FPAIILVTLAAYFLLL 158

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                   S+                    N V   D    G +H+Y +           
Sbjct: 159 ATGDGFNQSE-------------------TNVVARFDAWALGTSHMYHE----------- 188

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                G +  D          PEGLLST+ A+    +G + G +L+  K ++ +++    
Sbjct: 189 -----GGMAFD----------PEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRLFL 233

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMD-WDECDA 411
           +G  L     +L +    PINK+++S ++V  T G A   L +   I+I +M  +    A
Sbjct: 234 IGTILTFAGFLLSY--GCPINKKVWSPTFVIITCGLASSFLALL--IWIIDMKGYQNWCA 289

Query: 412 CFRAWSTR 419
            FR++   
Sbjct: 290 FFRSFGVN 297


>gi|255656770|ref|ZP_05402179.1| hypothetical protein CdifQCD-2_14006 [Clostridium difficile
           QCD-23m63]
          Length = 481

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 93/348 (26%), Positives = 142/348 (40%), Gaps = 91/348 (26%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + RV ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +GV I
Sbjct: 113 NSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTI 172

Query: 117 ALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
            +++    K N +   II     R++ L+ +G  L   Y   PD       +  +R  G+
Sbjct: 173 PISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNPD-------LNTVRILGV 223

Query: 173 LQRIALVY----VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
           LQR+ LVY    +V  L++ L       +L                  F+   ++II  Y
Sbjct: 224 LQRMGLVYFVTSLVYLLLKKLNVGSTATILT-----------------FLCISVFIIVGY 266

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            +    + F                + G L         VD   +   HLY         
Sbjct: 267 YILAKPYGFE---------------LEGSLAQL------VDLHFFK-GHLY--------- 295

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARL 347
                               +  FEP+G L++I AI SG +G   G VL+    G   + 
Sbjct: 296 --------------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKF 335

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
              + M   LLI A I  F    P NK+L+S S+V   AG+    L I
Sbjct: 336 FKILVMSIILLIGAFI--FNQYFPFNKRLWSSSFVLLMAGSYGVLLSI 381


>gi|393782159|ref|ZP_10370348.1| hypothetical protein HMPREF1071_01216 [Bacteroides salyersiae
           CL02T12C01]
 gi|392674193|gb|EIY67642.1| hypothetical protein HMPREF1071_01216 [Bacteroides salyersiae
           CL02T12C01]
          Length = 387

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 101/385 (26%), Positives = 159/385 (41%), Gaps = 98/385 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +  SKR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G
Sbjct: 3   KTTSKRLLALDILRGITIAGMIMVNNPGSWSYVYAPLGHAQWNGLTPTDLVFPFFMFIMG 62

Query: 114 VAIALALKK--VPKINGAVKKIIFRT-------LKLLFWGIILQGGYSHAPDALSYGVDM 164
           ++  ++L+K      + A  KI+ RT       L L ++ +  +   S + + +S+   +
Sbjct: 63  ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGLAWFSMFCRTWNSLSAEEISFFSRL 122

Query: 165 -------KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                   HIR  G++QR+AL Y   A++  LT K        +H+    A     IG F
Sbjct: 123 GQSIWTFDHIRILGVMQRLALCYGATAIVA-LTMKH-------KHIPYLIAT--LLIGYF 172

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
           I           L V    F  +S                     N +  VDR + G  H
Sbjct: 173 I-----------LLVTGNGFEYNST--------------------NILSVVDRAVLGEAH 201

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y D                               +PEGLLSTI AI    IG   G +L
Sbjct: 202 MYKD----------------------------NGIDPEGLLSTIPAIAHVLIGFCVGKLL 233

Query: 338 IHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITN 397
           +  K  + +L     +G  L  +  +L +    PINK+++S ++   T G          
Sbjct: 234 MEVKDINEKLGRLFLIGTILTFLGFLLSY--GCPINKKIWSPTFAIVTCGLG-------- 283

Query: 398 SIFIPEMDWDECDACFRAWSTRYPS 422
           S F+  + W      +++WS  + S
Sbjct: 284 SSFLALLIWIIDVKGYKSWSRFFES 308


>gi|427384458|ref|ZP_18880963.1| hypothetical protein HMPREF9447_01996 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727719|gb|EKU90578.1| hypothetical protein HMPREF9447_01996 [Bacteroides oleiciplenus YIT
           12058]
          Length = 395

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 101/390 (25%), Positives = 159/390 (40%), Gaps = 102/390 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            + +KR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G
Sbjct: 5   MKTNKRILALDILRGVTIAGMIMVNNPGTWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMG 64

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYS-------HAPDALSYGVDM 164
           ++  ++LKK      + A  KI+ RT+ +   G+ + G +S       +  + + +G  +
Sbjct: 65  ISTYISLKKYNFEFSHAAAMKILKRTIIIFLIGLAI-GWFSKFCYYWTNPSEGIGFGAQL 123

Query: 165 -------KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                    IR  G++QR+AL Y   A+I  LT K        RH+    A     IG F
Sbjct: 124 WESVWTFDRIRILGVMQRLALCYGATAII-ALTMKH-------RHIPYLIAT--LLIGYF 173

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
           I  +                              CG  G      N +  VDR +    H
Sbjct: 174 ILLI------------------------------CG-NGFAYNETNVLSIVDRAILTPAH 202

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y D                               +PEGLLSTI +I    +G   G ++
Sbjct: 203 MYKD----------------------------NGIDPEGLLSTIPSIAHVLLGFCVGRMM 234

Query: 338 IHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAADGC 392
           +      +R +   S    L +I  IL F   +     PINK+++S ++V  T G A   
Sbjct: 235 LDGNKAQSREELLNSHLIKLFLIGTILTFAGFLLSYGCPINKKIWSPTFVLTTCGLA--- 291

Query: 393 LGITNSIFIPEMDWDECDACFRAWSTRYPS 422
                S F+  + W      ++ WST + +
Sbjct: 292 -----SSFLALLIWIIDVKGYKKWSTFFEA 316


>gi|397171248|ref|ZP_10494657.1| hypothetical protein AEST_24230 [Alishewanella aestuarii B11]
 gi|396087147|gb|EJI84748.1| hypothetical protein AEST_24230 [Alishewanella aestuarii B11]
          Length = 394

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 91/344 (26%), Positives = 149/344 (43%), Gaps = 68/344 (19%)

Query: 53  QQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFL 109
           Q L QQ + R+  LDA RGL ++ MILV++ G     Y  + H+ W+G T  D + P FL
Sbjct: 8   QILTQQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFPAFL 67

Query: 110 FIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSY-GVDMK 165
            +VG+AI  +L   + +PK     +  I      L    ++   Y+      SY    + 
Sbjct: 68  VMVGMAIPYSLAGRQLLPKAELIRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQKLL 127

Query: 166 HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYII 225
            +RW G+LQRI +VY    L+  L +  R  +L              W+ G +  + +++
Sbjct: 128 TVRWSGVLQRIGIVY-FCTLVIVLYSGTRGRIL--------------WLSG-LCLLYFLL 171

Query: 226 TTYSLYVPNW--SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPV 283
             +  Y  N+  +F    +HG                  N   ++D +L G NH+Y    
Sbjct: 172 MQFVPYRDNYGHTFVGLWEHG-----------------NNLAAWLDHQLLGPNHVY---- 210

Query: 284 WSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH 343
                            R   P      F+PEG+LST+ AI S   G+    +L      
Sbjct: 211 ----------------FRSATPF----AFDPEGILSTLPAIASCLSGVLMAQLLQSQAEL 250

Query: 344 SARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           + +L+  +  G   + IA + H   A+PINK L++ ++V  ++G
Sbjct: 251 AFKLRVLLLSGLAAVWIAELAH--PALPINKMLWTPTFVLLSSG 292


>gi|398348299|ref|ZP_10533002.1| hypothetical protein Lbro5_13954 [Leptospira broomii str. 5399]
          Length = 399

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 98/347 (28%), Positives = 149/347 (42%), Gaps = 91/347 (26%)

Query: 53  QQLLQQKS--KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPF 107
           Q+L+   S  KR+ ++DA RG TV  MILV++ G     Y  + H+ W GCT  D V PF
Sbjct: 23  QELINDSSVRKRLLSIDALRGFTVAGMILVNNPGSWSAIYWPLKHAKWFGCTPTDLVFPF 82

Query: 108 FLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMK 165
           FLF VGV+I  +        G   KI+ R   L+  G+ L   G +S           + 
Sbjct: 83  FLFSVGVSIPFS---SIGNGGTFFKILKRASILILIGLFLHWFGEWS-----------ID 128

Query: 166 HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYII 225
            +R  G+LQRI LVY + A+        R +    R L   +          I F  +I+
Sbjct: 129 QLRIPGVLQRIGLVYFISAI------AYRSSNFHARILICLS----------ILFGYWIL 172

Query: 226 TTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWS 285
                +VP       SD               L P  +   ++DR ++G NHL+      
Sbjct: 173 LE---FVP----PPGSD------------SVSLSPGKDWGAWLDRIVFGENHLW------ 207

Query: 286 RLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA 345
                           + + +W     +PEGLLS++SA+ +  +G  +G VL   K  S 
Sbjct: 208 ----------------KSSKTW-----DPEGLLSSLSAVATTFLGFFFGEVL---KKDSD 243

Query: 346 RLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAG 387
             K+     F  ++ AI++     I     P+NK L++ S+V +T G
Sbjct: 244 TKKNIQKTAFNFILAAIVIMVAGWIWHQFFPMNKSLWTSSFVLWTGG 290


>gi|404256028|ref|ZP_10959996.1| hypothetical protein SPAM266_22806 [Sphingomonas sp. PAMC 26621]
          Length = 392

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 98/337 (29%), Positives = 151/337 (44%), Gaps = 66/337 (19%)

Query: 65  TLDAFRGLTVVLMILV---DDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 121
            LD  RGL V  MILV    D G AY ++ H+ WNG TLAD V P FLF VG+A+ L+  
Sbjct: 2   ALDVLRGLAVAGMILVVSPGDWGQAYVQLQHANWNGATLADMVFPTFLFSVGIALGLSFP 61

Query: 122 KVPKINGAVK----KIIFRTLKLLFWGIILQGGYS---HAPDALSYGVDMKHIRWCGILQ 174
           +  +  G  +    +++ RT  L+  G++++  Y     A      G  + HIR  GILQ
Sbjct: 62  RRLETAGDRRLFWTRLLRRTALLILLGLLVEATYVWTIAAGAPYPGGPGLAHIRIPGILQ 121

Query: 175 RIALVYVVVALIETLTTKRRPN-VLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL--- 230
           RI L Y +  ++   T +R P+ ++    L+I            +  ++ I+  Y L   
Sbjct: 122 RIGLCYGLAGILLLATNRRDPDGMIRINPLAI------------VGCIVVILIGYWLLLI 169

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
           +VP   F                  G L PA N  G+VDR L+   HL+       L + 
Sbjct: 170 FVPVPGFGA----------------GVLTPAGNLPGFVDRTLFTEPHLWP------LGSA 207

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
           T + P              A ++PEGLLST+ A  +   GI     L   + +  R   +
Sbjct: 208 TAARP--------------ATYDPEGLLSTLPATANVLFGILSAWAL---RRYPDRALGY 250

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +++   LL  A  L     + INK++++ S+   + G
Sbjct: 251 IAVVAALLFSA-GLALDPLLVINKRIWTSSFAVLSGG 286


>gi|375110537|ref|ZP_09756759.1| hypothetical protein AJE_11264 [Alishewanella jeotgali KCTC 22429]
 gi|374569481|gb|EHR40642.1| hypothetical protein AJE_11264 [Alishewanella jeotgali KCTC 22429]
          Length = 394

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 91/344 (26%), Positives = 149/344 (43%), Gaps = 68/344 (19%)

Query: 53  QQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFL 109
           Q L +Q + R+  LDA RGL ++ MILV++ G     Y  + H+ W+G T  D + P FL
Sbjct: 8   QILAKQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFPAFL 67

Query: 110 FIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSY-GVDMK 165
            +VG+AI  +L   + +PK     +  I      L    ++   Y+      SY    + 
Sbjct: 68  VMVGMAIPYSLAGRQLLPKAEQIRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQKLL 127

Query: 166 HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYII 225
            +RW G+LQRI +VY    LI  L +  R  +L              W+ G +  + +++
Sbjct: 128 TVRWSGVLQRIGIVYFCTLLI-VLYSGTRGRIL--------------WLSG-LCLLYFLL 171

Query: 226 TTYSLYVPNW--SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPV 283
             +  Y  N+  +F    +HG                  N   ++D +L G NH+Y    
Sbjct: 172 MQFVPYRDNYGHTFVGLWEHG-----------------NNLAAWLDHQLLGPNHVY---- 210

Query: 284 WSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH 343
                            R   P      F+PEG+LST+ AI S   G+    +L      
Sbjct: 211 ----------------FRSATPF----AFDPEGILSTLPAIASCLSGVLMAQLLQSKAEL 250

Query: 344 SARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           + +L+  +  G   + IA + H   A+PINK L++ ++V  ++G
Sbjct: 251 AFKLRVLLLSGLAAVWIAELAH--PALPINKMLWTPTFVLLSSG 292


>gi|434387287|ref|YP_007097898.1| hypothetical protein Cha6605_3369 [Chamaesiphon minutus PCC 6605]
 gi|428018277|gb|AFY94371.1| hypothetical protein Cha6605_3369 [Chamaesiphon minutus PCC 6605]
          Length = 377

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 101/337 (29%), Positives = 150/337 (44%), Gaps = 74/337 (21%)

Query: 62  RVATLDAFRGLTVVLMILVDDAG-----GAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           R+ +LD FRGLT+  MILV+ A        YA +DH+PW+G T+AD V PFFL+I+GV++
Sbjct: 2   RLTSLDVFRGLTMATMILVNMASLPNDDRKYAWLDHAPWHGYTIADLVFPFFLYIIGVSM 61

Query: 117 ALALKK-----VPKINGAVKKIIFRTLKLLFWGIILQG-GYSHAPDALSYGVDMKHIRWC 170
           A +L K     VP       +I+ R+  L   G+IL    +++      +  ++  +R  
Sbjct: 62  AFSLAKYTSGDVPLSKQVYWQILRRSAILFGLGLILNNLVWNYNLTEPKFFANLDKLRIM 121

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQRI + +   A I  L        L  R L I T       GG       I+  Y L
Sbjct: 122 GVLQRIGIAF-FFASIAVLA-------LAQRCLWILT-------GG-------ILVGYWL 159

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
            +      ++SD    ++       G+ G       YVDR +    HL+           
Sbjct: 160 ILAFIPALDNSDGVFSQF-------GNFG------AYVDRLIITPAHLHK---------- 196

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
              S N                +PEGL ST+ AI S   G   G  L   +  + R    
Sbjct: 197 --GSKNLS--------------DPEGLFSTLPAISSILFGYLTGTWLKR-QPVATRTSAS 239

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           + M +GL  + I L + +  PINK+L++ S+V FT G
Sbjct: 240 LLM-YGLAAVVIGLVWNSFFPINKKLWTSSFVLFTTG 275


>gi|209523049|ref|ZP_03271606.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|376001698|ref|ZP_09779557.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
           8005]
 gi|423062475|ref|ZP_17051265.1| hypothetical protein SPLC1_S033650 [Arthrospira platensis C1]
 gi|209496636|gb|EDZ96934.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|375329927|emb|CCE15310.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
           8005]
 gi|406716383|gb|EKD11534.1| hypothetical protein SPLC1_S033650 [Arthrospira platensis C1]
          Length = 378

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 96/342 (28%), Positives = 143/342 (41%), Gaps = 84/342 (24%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +    R+ +LD FRG+ +  MILV++ G     Y  + H+ W+GCT  D V P FL I+G
Sbjct: 4   KNTKMRLISLDVFRGIAIAAMILVNNPGSWGYMYPVLQHAEWDGCTPTDVVFPSFLLIMG 63

Query: 114 VAIALALKKVP--------KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMK 165
           VAIA +L K          K+  +V   I R   LLF   +   G+ H         D+ 
Sbjct: 64  VAIAFSLSKFAREHRLPGEKVPPSVYSRIGRRCLLLFLLGLFLNGFPH--------YDLA 115

Query: 166 HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYII 225
           +IR  G+LQRIA+ Y + A I  L   RR   L    +SI T      IG ++A  I  +
Sbjct: 116 NIRIMGVLQRIAIAYGLTA-IAILNLSRRQLWL----ISILTL-----IGYWVAMTIIPV 165

Query: 226 TTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWS 285
            +Y                           G+L P  N   ++D+ + G +HL       
Sbjct: 166 PSYG-------------------------PGNLSPEGNLGAFIDQTILGSHHL------- 193

Query: 286 RLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA 345
                                W   P++PEGL ST  A ++  +G   G  L     +S 
Sbjct: 194 ---------------------WRGGPYDPEGLFSTAPATVTVILGYLTGEWLKSQPRNSF 232

Query: 346 RLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            + + V      L++  +  +    PINK L++ S+V  TAG
Sbjct: 233 TVINLVMFALSSLVVGYL--WGVWFPINKALWTSSFVLVTAG 272


>gi|148359197|ref|YP_001250404.1| hypothetical protein LPC_1091 [Legionella pneumophila str. Corby]
 gi|296107241|ref|YP_003618941.1| hypothetical protein lpa_02399 [Legionella pneumophila 2300/99
           Alcoy]
 gi|148280970|gb|ABQ55058.1| conserved hypothetical protein [Legionella pneumophila str. Corby]
 gi|295649142|gb|ADG24989.1| hypothetical protein lpa_02399 [Legionella pneumophila 2300/99
           Alcoy]
          Length = 372

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 92/342 (26%), Positives = 145/342 (42%), Gaps = 91/342 (26%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           K +R+ +LD FRG+T+VLMI+V+       Y   +H  WNGCTLAD V PFFLFIVG+  
Sbjct: 7   KPQRLLSLDVFRGMTIVLMIIVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLTS 66

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS-----YGVDMKHIRWCG 171
            ++LK   +          R  K   +  I++         L      + ++   IR  G
Sbjct: 67  VISLKNQME----------RKAKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSIRIYG 116

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           ILQRIA+ Y++ A I   T+ +                  Q+    +  + Y I    + 
Sbjct: 117 ILQRIAVCYLISAFIYLNTSIKT-----------------QFFIFLVLLLGYWIIMTQVP 159

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
           VP +  ++ +  G                  + V Y D+  +  +HLY            
Sbjct: 160 VPGYGANQLTKDG------------------SWVSYFDQLFFSASHLY------------ 189

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIH----FKGHSARL 347
                               ++PEG LST ++I +   G+  G +LI+    FK      
Sbjct: 190 -----------------EKTYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFK------ 226

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           K ++  G GLL + +   +  + PINK L++ SYV +T+G A
Sbjct: 227 KFYLLAGVGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLA 268


>gi|116327439|ref|YP_797159.1| hypothetical protein LBL_0655 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. L550]
 gi|116120183|gb|ABJ78226.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
           Hardjo-bovis str. L550]
          Length = 369

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 89/336 (26%), Positives = 143/336 (42%), Gaps = 81/336 (24%)

Query: 66  LDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK 122
           +D FRG+TVV MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG +I ++L  
Sbjct: 1   MDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVGASIPISLYS 60

Query: 123 VPKINGAV--KKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGILQRIAL 178
              IN       I  R + L+  G+ L   G ++ +            +R  G+LQRI  
Sbjct: 61  KNGINRIRIWIGICIRGISLILLGLFLNFFGEWTFS-----------ELRIPGVLQRIGF 109

Query: 179 VYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS 238
           VY VVA +  +   ++                       + F+I I     L V  W  +
Sbjct: 110 VYWVVATLFLVFPGKKV----------------------LVFLIPI-----LLVHTWILT 142

Query: 239 EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSG 298
             +  G     ++ G         +   ++DR ++G  HL+                   
Sbjct: 143 HIAPPGESMVSLEQGK--------DIGAWIDRTIFGEKHLW------------------- 175

Query: 299 PLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLL 358
              + + +W     +PEG LS I++I +   G+  G +L   +G     +     G G L
Sbjct: 176 ---KFSKTW-----DPEGFLSGIASIATSLFGVICGFILFRREGRGKN-RVLSIFGLGFL 226

Query: 359 IIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
              + L +  ++P+NK L++ SY  +T G A  C+G
Sbjct: 227 FTFVGLLWDRSLPMNKSLWTGSYAVYTTGFAFLCIG 262


>gi|397664114|ref|YP_006505652.1| Putative heparan-alpha-glucosaminide N-acetyltransferase
           [Legionella pneumophila subsp. pneumophila]
 gi|395127525|emb|CCD05722.1| Putative heparan-alpha-glucosaminide N-acetyltransferase
           [Legionella pneumophila subsp. pneumophila]
          Length = 372

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 92/342 (26%), Positives = 144/342 (42%), Gaps = 91/342 (26%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           K +R+ +LD FRG+T+VLMI V+       Y   +H  WNGCTLAD V PFFLFIVG+  
Sbjct: 7   KPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLTS 66

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS-----YGVDMKHIRWCG 171
            ++LK   +          R  K   +  I++         L      + ++   IR  G
Sbjct: 67  VISLKNQME----------RKAKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSIRIYG 116

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           ILQRIA+ Y++ A I   T+ +                  Q+    +  + Y I    + 
Sbjct: 117 ILQRIAVCYLISAFIYLNTSIKT-----------------QFFIFLVLLLGYWIIMTQVP 159

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
           VP +  ++ +  G                  + V Y D+  +  +HLY            
Sbjct: 160 VPGYGANQLTKDG------------------SWVSYFDQLFFSASHLY------------ 189

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIH----FKGHSARL 347
                               ++PEG LST ++I +   G+  G +LI+    FK      
Sbjct: 190 -----------------EKTYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFK------ 226

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           K ++  G GLL + +   +  + PINK L++ SYV +T+G A
Sbjct: 227 KFYLLAGGGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLA 268


>gi|153808903|ref|ZP_01961571.1| hypothetical protein BACCAC_03204 [Bacteroides caccae ATCC 43185]
 gi|423220258|ref|ZP_17206753.1| hypothetical protein HMPREF1061_03526 [Bacteroides caccae
           CL03T12C61]
 gi|149128236|gb|EDM19455.1| hypothetical protein BACCAC_03204 [Bacteroides caccae ATCC 43185]
 gi|392623335|gb|EIY17438.1| hypothetical protein HMPREF1061_03526 [Bacteroides caccae
           CL03T12C61]
          Length = 371

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 94/373 (25%), Positives = 145/373 (38%), Gaps = 94/373 (25%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
            +KR+  LD  RG+T+  MILV++ G     Y  + H+ WNG T  D + PFF+FI+G++
Sbjct: 5   SNKRLLALDVMRGITIAGMILVNNPGSWGYVYFPLKHAQWNGLTPTDLIFPFFMFIMGIS 64

Query: 116 IALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ----GGYSHAPDALSYGVDMKHIRW 169
             ++L+K        A  KII RT+ +   GI +       Y H P      +    IR 
Sbjct: 65  TYISLRKYNFTFSTPAALKIIKRTIVIFLIGIAINWFALLCYYHDP------LPFAQIRV 118

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            G++QR+AL Y   ALI  L   +                          ++ Y+I    
Sbjct: 119 LGVMQRLALCYGASALIALLIKHK--------------------------YIPYLIVALL 152

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
                          V  +I+     G      N +  VDR + G  H+Y D        
Sbjct: 153 ---------------VGYFILLITGNGFAYNETNILSIVDRSILGDAHMYQD-------- 189

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKH 349
                                  +PEGLLSTI +I    IG   G +L+  K    +L+ 
Sbjct: 190 --------------------NHIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIREKLER 229

Query: 350 WVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDEC 409
              +G  L     +L +    P NK+++S ++V  T G          S F+  + W   
Sbjct: 230 LFLIGTILTFAGFLLSY--GCPFNKKIWSPTFVLVTCGLG--------SSFLALLVWIID 279

Query: 410 DACFRAWSTRYPS 422
              ++ WS  + S
Sbjct: 280 IKGYKKWSRFFES 292


>gi|397667386|ref|YP_006508923.1| putative Heparan-alpha-glucosaminide N-acetyltransferase
           [Legionella pneumophila subsp. pneumophila]
 gi|395130797|emb|CCD09044.1| putative Heparan-alpha-glucosaminide N-acetyltransferase
           [Legionella pneumophila subsp. pneumophila]
          Length = 372

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 92/342 (26%), Positives = 144/342 (42%), Gaps = 91/342 (26%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           K +R+ +LD FRG+T+VLMI V+       Y   +H  WNGCTLAD V PFFLFIVG+  
Sbjct: 7   KPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLTS 66

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS-----YGVDMKHIRWCG 171
            ++LK   +          R  K   +  I++         L      + ++   IR  G
Sbjct: 67  VISLKNQME----------RKAKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSIRIYG 116

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           ILQRIA+ Y++ A I   T+ +                  Q+    +  + Y I    + 
Sbjct: 117 ILQRIAVCYLISAFIYLNTSIKT-----------------QFFIFLVLLLGYWIIMTQVP 159

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
           VP +  ++ +  G                  + V Y D+  +  +HLY            
Sbjct: 160 VPGYGANQLTKDG------------------SWVSYFDQLFFSASHLY------------ 189

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIH----FKGHSARL 347
                               ++PEG LST ++I +   G+  G +LI+    FK      
Sbjct: 190 -----------------EKTYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFK------ 226

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           K ++  G GLL + +   +  + PINK L++ SYV +T+G A
Sbjct: 227 KFYLLAGVGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLA 268


>gi|52841889|ref|YP_095688.1| hypothetical protein lpg1661 [Legionella pneumophila subsp.
           pneumophila str. Philadelphia 1]
 gi|378777523|ref|YP_005185961.1| hypothetical protein lp12_1599 [Legionella pneumophila subsp.
           pneumophila ATCC 43290]
 gi|52629000|gb|AAU27741.1| hypothetical protein lpg1661 [Legionella pneumophila subsp.
           pneumophila str. Philadelphia 1]
 gi|364508338|gb|AEW51862.1| hypothetical protein lp12_1599 [Legionella pneumophila subsp.
           pneumophila ATCC 43290]
          Length = 372

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 92/342 (26%), Positives = 143/342 (41%), Gaps = 91/342 (26%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           K +R+ +LD FRG+T+VLMI V+       Y   +H  WNGCTLAD V PFFLFIVG+  
Sbjct: 7   KPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLTS 66

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS-----YGVDMKHIRWCG 171
            ++LK   +          R  K   +  I++         L      + ++   IR  G
Sbjct: 67  VISLKNQME----------RKAKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSIRIYG 116

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           ILQRIA+ Y++ A I   T+ +                  Q+    +  + Y I    + 
Sbjct: 117 ILQRIAVCYLISAFIYLNTSIKT-----------------QFFIFLVLLLGYWIIMTQVP 159

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
           VP +  ++ +  G                  + V Y D+  +  +HLY            
Sbjct: 160 VPGYGANQLTKDG------------------SWVSYFDQLFFSASHLY------------ 189

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI----HFKGHSARL 347
                               ++PEG LST ++I +   G+  G +LI     FK      
Sbjct: 190 -----------------EKTYDPEGFLSTFTSIATTLSGVLAGSLLISPCNQFK------ 226

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           K ++  G GLL + +   +  + PINK L++ SYV +T+G A
Sbjct: 227 KFYLLAGIGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLA 268


>gi|393725858|ref|ZP_10345785.1| hypothetical protein SPAM2_19574 [Sphingomonas sp. PAMC 26605]
          Length = 400

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 98/348 (28%), Positives = 162/348 (46%), Gaps = 70/348 (20%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           + Q+  R+  LD  RGL V  MILV   G    AYA++ H+ W+G TLAD V P FLF V
Sbjct: 1   MTQRLPRLEALDVLRGLAVAGMILVVSPGDWSMAYAQLQHAAWHGATLADMVFPTFLFSV 60

Query: 113 GVAIALALKKVPKINGAVK----KIIFRTLKLLFWGIILQGGY-----SHAPDALSYGVD 163
           G+A+ L+  ++       +    ++I R++ L+  G++++  Y     + AP     G  
Sbjct: 61  GMALGLSFPRLMADTAQRRLFWMRLIRRSITLVVLGLVVEATYVWTISAGAPYPGHGG-- 118

Query: 164 MKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIY 223
           + ++R  GILQRI L Y++   +  +T++    + + R   I  A Q          V++
Sbjct: 119 LSYVRIPGILQRIGLCYLLGGALIVVTSR---TIADGR---IAIAPQ---------RVLF 163

Query: 224 IITTYSLYVPNWS---FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYS 280
            I   ++ +  W+   F      GV          G L P  +   +VDR L+ + HL+ 
Sbjct: 164 CIA--AILIGYWALLRFVPVPGFGV----------GLLTPDGSLPAFVDRTLFTVPHLWP 211

Query: 281 DPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI-H 339
                 L + T   P              A ++PEGLLST+ A    T  + +G +    
Sbjct: 212 ------LGSATGQGP--------------ATYDPEGLLSTLPA----TANLLFGALAAWA 247

Query: 340 FKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           ++ +S R    V++   +LIIA  L       INK+L++ S+  F++G
Sbjct: 248 WRQNSDRATLHVAIAGTMLIIA-GLALDPVFEINKRLWTSSFALFSSG 294


>gi|423223641|ref|ZP_17210110.1| hypothetical protein HMPREF1062_02296 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638266|gb|EIY32113.1| hypothetical protein HMPREF1062_02296 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 395

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 151/363 (41%), Gaps = 94/363 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            + +KR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G
Sbjct: 5   TKTNKRILALDILRGVTIAGMIMVNNPGTWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMG 64

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSH-------APDALSYGVDM 164
           ++  ++LKK      + A  KI+ RT+ +   G+ + G +S        + + LS+G  +
Sbjct: 65  ISTYISLKKYNFKFSHAAALKILKRTIIIFLIGLAI-GWFSRFCYYWAGSHEGLSFGEQL 123

Query: 165 -------KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                    IR  G++QR+AL Y   A+I  LT K        RH+    A     +G F
Sbjct: 124 WASVWTFDRIRILGVMQRLALCYGATAIIA-LTMKH-------RHIPYLIAT--LLVGYF 173

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
           I                              ++ CG  G      N +  VDR +    H
Sbjct: 174 I------------------------------LLMCG-NGFAYNETNILSIVDRAILTPAH 202

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y D                               +PEGLLSTI +I    +G   G ++
Sbjct: 203 MYKD----------------------------NGIDPEGLLSTIPSIAHVLLGFCVGRMM 234

Query: 338 IHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAADGC 392
           +      +R     S    L ++  IL FT  +     PINK+++S ++V  T G A   
Sbjct: 235 LDSNKAESREALLNSHLIKLFLVGAILTFTGFLLSYGCPINKKIWSPTFVLTTCGLASSF 294

Query: 393 LGI 395
           L +
Sbjct: 295 LAL 297


>gi|189464405|ref|ZP_03013190.1| hypothetical protein BACINT_00746 [Bacteroides intestinalis DSM
           17393]
 gi|189438195|gb|EDV07180.1| hypothetical protein BACINT_00746 [Bacteroides intestinalis DSM
           17393]
          Length = 395

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 150/363 (41%), Gaps = 94/363 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            + +KR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G
Sbjct: 5   TKTNKRILALDILRGVTIAGMIMVNNPGTWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMG 64

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSH-------APDALSYGVDM 164
           ++  ++LKK      + A  KI+ RT+ +   G+ + G +S        + + LS+G  +
Sbjct: 65  ISTYISLKKYNFEFSHAAAMKILKRTIIIFLIGLAI-GWFSRFCYYWAGSHEGLSFGEQL 123

Query: 165 -------KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                    IR  G++QR+AL Y   A+I  LT K        RH+    A     +G F
Sbjct: 124 WASVWTFDRIRILGVMQRLALCYGATAIIA-LTMKH-------RHIPYLIAT--LLVGYF 173

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
           I                              ++ CG  G +    N +  VDR +    H
Sbjct: 174 I------------------------------LLMCG-NGFVYNETNILSIVDRAILTPAH 202

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y D                               +PEGLLSTI +I    +G   G ++
Sbjct: 203 MYKD----------------------------NGIDPEGLLSTIPSIAHVLLGFCVGRMM 234

Query: 338 IHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAADGC 392
           +       R     S    L +I  IL F   +     PINK+++S ++V  T G A   
Sbjct: 235 LDSNKTENRETLLNSHLIKLFLIGTILTFAGFLLSYGCPINKKIWSPTFVLTTCGLASSF 294

Query: 393 LGI 395
           L +
Sbjct: 295 LAL 297


>gi|431798742|ref|YP_007225646.1| hypothetical protein Echvi_3416 [Echinicola vietnamensis DSM 17526]
 gi|430789507|gb|AGA79636.1| hypothetical protein Echvi_3416 [Echinicola vietnamensis DSM 17526]
          Length = 363

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 52/136 (38%), Positives = 77/136 (56%), Gaps = 14/136 (10%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           +  K+KR+ +LD  RG+T+  MILV+  G     +  + H+ WNG T  DF+ PFFLFIV
Sbjct: 1   MANKNKRLISLDVLRGMTIAAMILVNFPGSWEHVFPPLHHAQWNGITPTDFIFPFFLFIV 60

Query: 113 GVAIALALKKVPKINGAV--KKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 170
           GV+I +A     +++  +  KK+ FR  K+   G++L       P+      D   IR  
Sbjct: 61  GVSIVMAYAGKMEMDKTIVYKKLFFRGAKIFALGVLL----GMIPE-----FDFSAIRVA 111

Query: 171 GILQRIALVYVVVALI 186
           G+LQRIALV+V   L+
Sbjct: 112 GVLQRIALVFVACTLM 127



 Score = 42.0 bits (97), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 45/87 (51%), Gaps = 12/87 (13%)

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNA--- 369
           +PEG+ ST+ AI +G +G+  G +L       ++LK  V     L++I ++L        
Sbjct: 189 DPEGVFSTLPAIATGILGMLAGQLL------KSQLKE-VEKANNLMVIGLVLTLWGLFWA 241

Query: 370 --IPINKQLYSFSYVCFTAGAADGCLG 394
              PINK L++ S+V  T G A   LG
Sbjct: 242 WFFPINKNLWTSSFVLVTGGTAFSFLG 268


>gi|332664355|ref|YP_004447143.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332333169|gb|AEE50270.1| Protein of unknown function DUF2261, transmembrane
           [Haliscomenobacter hydrossis DSM 1100]
          Length = 438

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 102/358 (28%), Positives = 158/358 (44%), Gaps = 40/358 (11%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFIVG 113
              S R+ +LD  RGLT+  MILV+   D G  Y  + H+ W+GCT  D+V PFFLF+VG
Sbjct: 1   MTTSNRLLSLDVMRGLTIAGMILVNNPGDWGNVYGPLLHADWHGCTPTDWVFPFFLFMVG 60

Query: 114 VAIALALKKVP----KINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRW 169
           VAI LAL K       +    +KII R+L ++  G+ L    +       Y  D     +
Sbjct: 61  VAIPLALGKRKDEGEDLRKIYRKIISRSLIIIGLGLFLTAHPTF------YFTDKTSPWY 114

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIY--IITT 227
              L  +A   V V   E L  K+            F    WQ    +++++ +  I   
Sbjct: 115 VVHLIIMATAMVAVFTREVLNQKQ------------FQTETWQQRRKWVSYLAWSAIACM 162

Query: 228 YSLYVPNWSFSEHSDHGVKKYI----VKCG---MRGHLGPACNAVGYVDRELWGINHLYS 280
             L +  + FS     GV + I    + CG   ++            +    WG+  L  
Sbjct: 163 VVLGIFYYDFSHMRFPGVLQRIGLVYLACGFLFLKASPRMQLLTGVGLLLLYWGLMTLVP 222

Query: 281 DP--VWSRLEACT-LSSPNSGPLREDAPSWCRA-PFEPEGLLSTISAILSGTIGIHYGHV 336
            P  +   LEA T L +     +      W     ++PEGLLSTI AI +G  G+  G  
Sbjct: 223 VPGGIAPNLEAETNLGAWLDRAIFSTNHLWAAVKTWDPEGLLSTIPAIGTGIAGMLAGEW 282

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
           +   K    ++   +++  G L+ A+ L +    P+NK++++ SYV + AG +   LG
Sbjct: 283 VRSEKSDYEKVSGLLAV--GALLFALGLIWNEFFPLNKKIWTSSYVVYMAGISLLFLG 338


>gi|410028220|ref|ZP_11278056.1| hypothetical protein MaAK2_03415 [Marinilabilia sp. AK2]
          Length = 382

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 86/340 (25%), Positives = 142/340 (41%), Gaps = 73/340 (21%)

Query: 53  QQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFL 109
           Q L     +R   LD  RGLT+ LM++V+  G     YA   H+ W+G T+ D + P FL
Sbjct: 5   QNLGVPLKERYLALDVLRGLTIALMVVVNTPGSWSHMYAPFMHADWHGFTITDLIFPTFL 64

Query: 110 FIVGVAIALALKKVPKINGA--VKKIIFRTLKLLFWGIILQG--GYSHAPDALSYGVDMK 165
           F+VG A++ ++K++  +  +  +KK+  RTL +   G +L     +++  +     ++  
Sbjct: 65  FVVGNAMSFSMKRMESMGQSLFLKKVFKRTLLIFLIGWLLNAFPFFNYNAETGYSMINWS 124

Query: 166 HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYII 225
            +R  G+LQRIAL Y++ ALI     K+                   +I  FIA + Y  
Sbjct: 125 EVRLLGVLQRIALCYMLAALILYYFGKKG-----------------AFIYSFIALLGY-- 165

Query: 226 TTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWS 285
                    W+       G   Y +            NA   +D  L G  HLY      
Sbjct: 166 ---------WAIMYFFGDGEDPYSL----------IGNAALKLDLWLIGAKHLYMGE--- 203

Query: 286 RLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA 345
                                    PF+PEG+LST  ++++   G   G  +     ++ 
Sbjct: 204 -----------------------GIPFDPEGVLSTFPSVVNVIAGFLVGKFIQESGNNTG 240

Query: 346 RLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFT 385
            +K  V  G  LL+  ++  +    PINK++++  YV  T
Sbjct: 241 TVKKLVIWGIILLVACLV--WDMVFPINKKIWTSPYVLLT 278


>gi|307610361|emb|CBW99930.1| hypothetical protein LPW_16871 [Legionella pneumophila 130b]
          Length = 372

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/342 (26%), Positives = 144/342 (42%), Gaps = 91/342 (26%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           K +R+ +LD FRG+T+VLMI V+       Y   +H  WNGCTLAD V PFFLFIVG+  
Sbjct: 7   KPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLTS 66

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS-----YGVDMKHIRWCG 171
            ++LK   +          R  K   +  I++         L      + ++   IR  G
Sbjct: 67  VISLKNQME----------RKEKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSIRIYG 116

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           ILQRIA+ Y++ A I   T+ +                  Q+    +  + Y I    + 
Sbjct: 117 ILQRIAVCYLISAFIYLNTSIKT-----------------QFFIFLVLLLGYWIIMTQVP 159

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
           VP +  ++ +  G                  + V Y D+  +  +HLY            
Sbjct: 160 VPGYGANQLTKDG------------------SWVSYFDQLFFSASHLY------------ 189

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIH----FKGHSARL 347
                               ++PEG LST ++I +   G+  G +LI+    FK      
Sbjct: 190 -----------------EKTYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFK------ 226

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           K ++  G GLL + +   +  + PINK L++ SYV +T+G A
Sbjct: 227 KFYLLAGVGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLA 268


>gi|423312333|ref|ZP_17290270.1| hypothetical protein HMPREF1058_00882 [Bacteroides vulgatus
           CL09T03C04]
 gi|392688817|gb|EIY82101.1| hypothetical protein HMPREF1058_00882 [Bacteroides vulgatus
           CL09T03C04]
          Length = 363

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 148/344 (43%), Gaps = 79/344 (22%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
              KR+  LD  RG+T+  MILV++ G     Y  ++H+ +NG T  D V PFF+FI+G+
Sbjct: 4   NTPKRLLALDILRGITIAGMILVNNPGSWGYVYTPLEHAAFNGLTPTDLVFPFFMFIMGI 63

Query: 115 AIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           +  ++L+K      +  ++KI+ RT+ +   G+ L      A    ++ ++ + +R+ G+
Sbjct: 64  STYISLRKYNFTYSHAILRKIVKRTVVIFCIGLFLN---LLAKSVFTHHLNFEELRYLGV 120

Query: 173 LQRIALVYVVVALIE-TLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           +QR+A+ Y V +L+  T+  K  P +                          I+ T ++Y
Sbjct: 121 MQRLAIGYGVTSLVAITVKHKYFPAI--------------------------ILVTLAVY 154

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
               +  +  +  V                 N V   D    G +H+Y D          
Sbjct: 155 FLLLAMGDGFNLSVT----------------NIVARFDVWALGTSHMYHD---------- 188

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV 351
                 G +  D          PEGLLST+ A+    +G + G +L   K +  +++   
Sbjct: 189 ------GGMAFD----------PEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQRLF 232

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
            +G  L     +L +    PINK+++S ++V  T G A   L +
Sbjct: 233 LVGTILTFAGFLLSY--GCPINKKVWSPTFVITTCGLASSFLAL 274


>gi|340618131|ref|YP_004736584.1| hypothetical protein zobellia_2146 [Zobellia galactanivorans]
 gi|339732928|emb|CAZ96303.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
          Length = 367

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 93/335 (27%), Positives = 136/335 (40%), Gaps = 79/335 (23%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           KR   LD FRGLT+ LMI+V+  G     ++ + H+ W+G T  D V P FLF VG A A
Sbjct: 2   KRFKALDVFRGLTICLMIIVNTPGDWDMTFSPLLHAKWHGFTPTDLVFPSFLFAVGNAFA 61

Query: 118 LALKKVPK--INGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGIL 173
               K     ++   KKI  RTL +   G  +     +S         V     R  G+L
Sbjct: 62  FVKTKWADKPLSDIFKKIAKRTLIIFLLGYTMYWIPFFSWTETGDLAAVPFSETRILGVL 121

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRIAL Y + A++    T R+                             II +  + + 
Sbjct: 122 QRIALCYFIGAIMIYFLTNRQ----------------------------LIIASAVILLG 153

Query: 234 NWS-FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
            W   S   D+ ++                N V  +DR L G +HLY             
Sbjct: 154 YWGLLSAFGDYTLEG---------------NFVRTIDRMLLGDSHLYM------------ 186

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
               +G            PF+PEGLLST+ +I +   G   G  +I       +L   + 
Sbjct: 187 ---GNG-----------IPFDPEGLLSTLPSICNVLGGYLVGKYIIDKGIDYEKLAKMLL 232

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +G GLL++A +   T   P+NK+L++ S+V  T G
Sbjct: 233 VGAGLLVVAYLWDLT--FPVNKKLWTSSFVVLTVG 265


>gi|336316712|ref|ZP_08571601.1| hypothetical protein Rhein_3024 [Rheinheimera sp. A13L]
 gi|335878877|gb|EGM76787.1| hypothetical protein Rhein_3024 [Rheinheimera sp. A13L]
          Length = 363

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 54/139 (38%), Positives = 77/139 (55%), Gaps = 12/139 (8%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
             S R   LDA RGL + LMILV+  G     Y  + H+PW+G T AD V P FLF+VG 
Sbjct: 1   MSSPRFYALDALRGLAIALMILVNTPGSWQHVYTPLLHAPWDGFTFADIVFPTFLFVVGA 60

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           A+  +LK       ++ ++  R LKL+  G++L        + + + VD+  +R  G+LQ
Sbjct: 61  AMFYSLKTAVLSRQSLWRVSSRALKLIGIGVLL--------NYVPFTVDLAELRLPGVLQ 112

Query: 175 RIALVYVVVALIETLTTKR 193
           RI L Y + AL+  LT KR
Sbjct: 113 RIGLAYWLAALL-ILTVKR 130


>gi|389799428|ref|ZP_10202419.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter sp. 116-2]
 gi|388442725|gb|EIL98904.1| protein involved in N-acetyl-D-glucosamine utilization
           [Rhodanobacter sp. 116-2]
          Length = 353

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 89/337 (26%), Positives = 142/337 (42%), Gaps = 87/337 (25%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           KR+A+LDA RG TV  M+LV+D G     Y  ++H+ W+GCT  D V  FFLF+VGV++A
Sbjct: 2   KRLASLDALRGCTVAAMLLVNDPGDWGHVYWPLEHAAWHGCTPTDLVFSFFLFVVGVSVA 61

Query: 118 LA----LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           LA    L++    +   +   +R L++L  G+ +           ++ +   H+R+ G+L
Sbjct: 62  LAILPRLEQGAAPSALTRAATWRALRILALGVAIN-------LLAAWLLPQAHLRFPGVL 114

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRIAL +  VAL    T  R                QW  I              +L + 
Sbjct: 115 QRIALCFAGVALFAIHTKPRT---------------QWWAIA-------------ALLIG 146

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
            W             +++ G  G L P  N    VD  ++G      DP   R       
Sbjct: 147 YWG------------LLRLG--GSLEPWTNLASRVDSAVFGRFVYLIDPASGRGH----- 187

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
                              +PEGLLST+ ++    +G+  G  L        + +  +  
Sbjct: 188 -------------------DPEGLLSTLPSLAGTLLGLRMGCWL-----RREQFRTLLLA 223

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
           G   L++  +  ++  +P NK L++ S+V +T G A 
Sbjct: 224 GIACLLLGAL--WSPWLPFNKNLWTPSFVLWTTGWAT 258


>gi|333029673|ref|ZP_08457734.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
 gi|332740270|gb|EGJ70752.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
          Length = 387

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 93/361 (25%), Positives = 159/361 (44%), Gaps = 92/361 (25%)

Query: 55  LLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFI 111
           +++++++R+  LD  RG+T+  MILV++ G     YA + H+ WNG T  D V PFF+FI
Sbjct: 1   MIKKENQRLLALDILRGITIAGMILVNNPGSWGSIYAPLGHAEWNGLTPTDLVFPFFMFI 60

Query: 112 VGVAIALALK--KVPKINGAVKKIIFRT-------LKLLFWGIILQGGYSHAP------D 156
           +G++   +L+  K      A  KI+ RT       L + ++ + L+   S A       +
Sbjct: 61  MGISTYFSLRKYKFEFSKEAALKILKRTIIIFAIGLGIAWFSLFLRTWNSLASADISIFE 120

Query: 157 ALSYGVDM-KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIG 215
            LS  + + +++R  G++ R+AL Y   A+I  LT K +              Y    I 
Sbjct: 121 RLSQSIFVFENLRILGVMPRLALTYCATAII-ALTIKHK--------------YIPTLIV 165

Query: 216 GFIAFVIYIITTYSLYVPN-WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWG 274
           G     I I+ T+ L++ N + ++E                       N +  VD+ + G
Sbjct: 166 G-----ILIVYTFILFLGNGFEYNE----------------------TNILSIVDKAILG 198

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
            NH+Y D                               +PEGL+STI AI    +G   G
Sbjct: 199 ENHMYKD----------------------------NGIDPEGLVSTIPAIAHVLLGFFVG 230

Query: 335 HVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
            +    K   ++++    MG  L  + ++L +    PINK+++S ++V  T G A   L 
Sbjct: 231 KIFTEKKDIHSKVEFLFIMGSILTFVGLLLSY--GCPINKKIWSPTFVLTTCGLASTLLA 288

Query: 395 I 395
           +
Sbjct: 289 L 289


>gi|393763917|ref|ZP_10352530.1| hypothetical protein AGRI_13026 [Alishewanella agri BL06]
 gi|392605231|gb|EIW88129.1| hypothetical protein AGRI_13026 [Alishewanella agri BL06]
          Length = 394

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 91/352 (25%), Positives = 150/352 (42%), Gaps = 68/352 (19%)

Query: 53  QQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFL 109
           Q L +Q + R+  LDA RGL ++ MILV++ G     Y  + H+ W+G T  D + P FL
Sbjct: 8   QILAKQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFPAFL 67

Query: 110 FIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSY-GVDMK 165
            +VG+AI  +L   + +PK     +  I      L    ++   Y+      SY    + 
Sbjct: 68  VMVGMAIPYSLAGRQMLPKAELLRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQKLL 127

Query: 166 HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYII 225
            +RW G+LQRI +VY    LI  L +  R  VL              W+ G +  + +++
Sbjct: 128 TVRWSGVLQRIGIVYFCTLLI-VLYSGTRGRVL--------------WLSG-LCLLYFLL 171

Query: 226 TTYSLYVPNW--SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPV 283
             +  Y  N+  +F    +HG                  N   ++D  + G NH++    
Sbjct: 172 MQFVPYRDNYGHTFVGLWEHG-----------------NNLAAWLDHHVLGPNHVF---- 210

Query: 284 WSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH 343
                            R   P      F+PEG+LST+ AI S   G+    +L      
Sbjct: 211 ----------------FRSATPF----AFDPEGILSTLPAIASCLSGVLMAQLLQSKAEL 250

Query: 344 SARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           + +L+     G   + +A + H   A+PINK L++ ++V  ++G     LG+
Sbjct: 251 AFKLRVLFLAGLAAIWVAELAH--PALPINKMLWTPTFVLLSSGFTALILGL 300


>gi|436833933|ref|YP_007319149.1| Protein of unknown function DUF2261,transmembrane [Fibrella
           aestuarina BUZ 2]
 gi|384065346|emb|CCG98556.1| Protein of unknown function DUF2261,transmembrane [Fibrella
           aestuarina BUZ 2]
          Length = 361

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 97/331 (29%), Positives = 134/331 (40%), Gaps = 84/331 (25%)

Query: 77  MILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK-----VPKING 128
           MILV++AG    +YA + H+PWNG T  D + PFFLFIVGV+I  AL K     +     
Sbjct: 1   MILVNNAGDWAHSYAPLKHAPWNGWTPTDLIFPFFLFIVGVSITFALSKRQTSLLEDEKT 60

Query: 129 AVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIET 188
              KII R + L   G  L          L    D  ++R  G+LQRI +VY V AL+  
Sbjct: 61  QRLKIIRRGVTLFALGFFLN---------LFPRFDFANVRIMGVLQRIGIVYTVCALVFL 111

Query: 189 LTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKY 248
            T+ R+               Q   I   +  + Y +    + VP   ++          
Sbjct: 112 RTSPRQ---------------QVNLI--LLILIGYFLLMTMVPVPGIGYA---------- 144

Query: 249 IVKCGMRGHLGPACNAVGYVDRELWGINHLY-SDPVWSRLEACTLSSPNSGPLREDAPSW 307
                   +L P  N   ++DR +    H Y S  VW                       
Sbjct: 145 --------NLEPETNLAAWIDRTILTPAHCYRSSKVW----------------------- 173

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSAR--LKHWVSMGFGLLIIAIIL 364
                +PEGLLST+ AI +G +G+  G  L   + G + R   K       GLL+  +  
Sbjct: 174 -----DPEGLLSTVPAIATGLLGLLAGRWLRSTRYGTTVRESQKALFLFLAGLLMAFVGT 228

Query: 365 HFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
            F    PINK L++ SYV    G A   L I
Sbjct: 229 LFDTVFPINKALWTSSYVLLAGGLAMCGLAI 259


>gi|329851309|ref|ZP_08266066.1| hypothetical protein ABI_41500 [Asticcacaulis biprosthecum C19]
 gi|328840155|gb|EGF89727.1| hypothetical protein ABI_41500 [Asticcacaulis biprosthecum C19]
          Length = 384

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 96/346 (27%), Positives = 146/346 (42%), Gaps = 70/346 (20%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
             +  R   LD  RGL+++ M+L  + G     Y  + H+ W G T  D V P FLF +G
Sbjct: 5   MTQGNRWLALDILRGLSIIFMLLNLNPGSWSEQYGWVLHAKWEGATFIDMVAPVFLFCIG 64

Query: 114 VAIALALKK---VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 170
           VAI L+L++     + NG + K I     +L    +L G + +A  A     D  H+R  
Sbjct: 65  VAIPLSLRRRIEAGESNGQLAKHILNRAGIL----VLLGLFLNAYPAF----DWAHMRIP 116

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIA--FVIYIITTY 228
           G+LQRI + Y  VAL    T +R               ++     G+IA  FV+   T  
Sbjct: 117 GVLQRIGVCYGAVALFVLFTARRE------------GGFRLNAKAGWIAWTFVLLSWTAL 164

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            ++VP   F      G  ++           P  +   YVDR +   +H++  P W    
Sbjct: 165 LMFVPVPGF------GAPRF----------DPVGSWPAYVDRLVLTTDHMF--PWW---- 202

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                 P  G          +  F+P+GLLST     +   G   GH      G +A + 
Sbjct: 203 ------PVDG----------KVVFDPDGLLSTWPVCANVLFGALVGHA--RLTGITAPIL 244

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
             +  G  L+  A+ LH T  IPI K +++ ++  FT G +   LG
Sbjct: 245 KMLVAGGLLMAAAVGLHTT--IPIIKHIWTATFALFTIGFSLVSLG 288


>gi|344203119|ref|YP_004788262.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
 gi|343955041|gb|AEM70840.1| hypothetical protein Murru_1800 [Muricauda ruestringensis DSM
           13258]
          Length = 375

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 92/343 (26%), Positives = 137/343 (39%), Gaps = 79/343 (23%)

Query: 51  QLQQLLQQKSK-RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMP 106
           QL Q    K K R  +LD FRGL V LMI+V+  G     ++ + H+ WNG TL D V P
Sbjct: 4   QLNQPNMSKLKNRYLSLDVFRGLDVALMIIVNSPGNGSTTFSPLLHADWNGFTLTDLVFP 63

Query: 107 FFLFIVGVAIALALKKVPKIN--GAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDM 164
            FLF+VG +++ ++KK   +      KK++ RT  +   G ++   Y    D        
Sbjct: 64  TFLFVVGNSMSFSMKKYESMGKPAFFKKVLKRTAIIFLLGFLMY-WYPFFDDGQLK--PF 120

Query: 165 KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYI 224
              R  G+LQRIAL Y+  ++I      +    L    L                     
Sbjct: 121 SETRVFGVLQRIALCYMFASIILHFVKTKTAIWLSALFL--------------------- 159

Query: 225 ITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVW 284
                               V  +++  G  G L    NAV  +D  L G NH+Y     
Sbjct: 160 --------------------VGYHLILIGF-GDLTLTGNAVLKLDEWLIGANHMYHGE-- 196

Query: 285 SRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS 344
                                      F+PEGLLST+ AI++  IG   G  + +   + 
Sbjct: 197 ------------------------GIAFDPEGLLSTLPAIVNVIIGYLAGRFIQNNGQNF 232

Query: 345 ARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
             +   +  GF L+     L +   +PINK+L++ S+V  T G
Sbjct: 233 ETVAKLMMFGFALVFAG--LAWDLVLPINKKLWTSSFVLLTCG 273


>gi|54294550|ref|YP_126965.1| hypothetical protein lpl1626 [Legionella pneumophila str. Lens]
 gi|53754382|emb|CAH15866.1| hypothetical protein lpl1626 [Legionella pneumophila str. Lens]
          Length = 372

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 92/342 (26%), Positives = 143/342 (41%), Gaps = 91/342 (26%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           K +R+ +LD FRG+T+VLMI V+       Y   +H  WNGCTLAD V PFFLFIVG+  
Sbjct: 7   KPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLTS 66

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS-----YGVDMKHIRWCG 171
            ++LK   +          R  K   +  I++         L      + ++   IR  G
Sbjct: 67  VISLKNQME----------RKEKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSIRIYG 116

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           ILQRIA+ Y++ A I   T+ +                  Q+    +  + Y I    + 
Sbjct: 117 ILQRIAVCYLISAFIYLNTSIKT-----------------QFFIFLVLLLGYWIIMTQVP 159

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
           VP +  ++ +  G                  + V Y D+  +   HLY            
Sbjct: 160 VPGYGANQLTKDG------------------SWVSYFDQLFFSAPHLY------------ 189

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIH----FKGHSARL 347
                               ++PEG LST ++I +   G+  G +LI+    FK      
Sbjct: 190 -----------------EKTYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFK------ 226

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           K ++  G GLL + +   +  + PINK L++ SYV +T+G A
Sbjct: 227 KFYLLAGVGLLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLA 268


>gi|372268395|ref|ZP_09504443.1| hypothetical protein AlS89_10850 [Alteromonas sp. S89]
          Length = 395

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 95/355 (26%), Positives = 156/355 (43%), Gaps = 88/355 (24%)

Query: 55  LLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFI 111
           L Q    R+ ++D  RG+ +  M+LV++ G     YA + H+ W+G T  D + P FLF+
Sbjct: 9   LAQVPHGRLMSVDLLRGIAIAAMVLVNNPGSWSFVYAPMAHAQWHGWTPTDVIFPLFLFV 68

Query: 112 VGVAIALALKK---VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD--MKH 166
           VGV++ L+  K    P +  A      R LKL   G+ L   + +  DA    ++  ++ 
Sbjct: 69  VGVSMVLSTGKRGDFPPVGWAQWS---RALKLFALGLFLAIFFYNFRDASYNWIEDRLEG 125

Query: 167 IRWCGILQRIALVYVVVA-LIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYII 225
           IRW G+LQRIALVY++   L+  L  K                      G  +A ++  +
Sbjct: 126 IRWMGVLQRIALVYILCCYLVRWLPAK----------------------GLLVAAILCSV 163

Query: 226 TTYSLYVPNWSFSEHSDHGVKKYIVKCG--MRGHLGPACNAVGYVDRELWGINHLYSDPV 283
             ++L +            V  Y    G   +G L    +   ++D+ L G  H+Y    
Sbjct: 164 VPWTLML------------VVPYQSASGEVFQGQLAFGNHFAAWLDQWLLGSAHVYY--- 208

Query: 284 WSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH 343
                            R+  P      F+PEG+L+T SA  +  +G     VL      
Sbjct: 209 -----------------RDAQPF----AFDPEGVLTTFSAASTCLLG-----VLAALAWK 242

Query: 344 SA--------RL-KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           SA        RL ++W+  G  ++++  ++H  + +PINK L+S S+V  TAG +
Sbjct: 243 SADSNGEAQLRLCRNWLVAGTLMVLVGQLMH--SIVPINKALWSPSFVLVTAGVS 295


>gi|218131911|ref|ZP_03460715.1| hypothetical protein BACEGG_03534 [Bacteroides eggerthii DSM 20697]
 gi|217986214|gb|EEC52553.1| hypothetical protein BACEGG_03534 [Bacteroides eggerthii DSM 20697]
          Length = 396

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 101/391 (25%), Positives = 162/391 (41%), Gaps = 105/391 (26%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           + +KR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G+
Sbjct: 6   KTNKRILALDILRGVTIAGMIMVNNPGTWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGI 65

Query: 115 AIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSH-------APDALSYGVDM- 164
           +  ++LKK      + A  KI+ RT+ +   G+ + G +S          + +S+G  + 
Sbjct: 66  STYISLKKYNFEFSHAAGIKILKRTILIFLIGMAI-GWFSKFCYYWTSPTEGISFGTQLW 124

Query: 165 ------KHIRWCGILQRIALVYVVVALIE-TLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                   IR  G++QR+AL Y   A+I  T+  K  P ++     ++ T Y       F
Sbjct: 125 ESVWTFDRIRILGVMQRLALCYGATAIIALTMKHKNIPYLIA----TLLTGY-------F 173

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
           I                              ++ CG  G      N +  VDR +    H
Sbjct: 174 I------------------------------LLLCG-NGFAYNDTNILSIVDRTILTPAH 202

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y D                               +PEGLLSTI AI    +G   G ++
Sbjct: 203 MYKD----------------------------NGIDPEGLLSTIPAIAHVLLGFCVGRMM 234

Query: 338 IH-FKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAADG 391
           +   K +  R     S    LL++  IL F+  +     PINK+++S ++V  T G A  
Sbjct: 235 LEGGKANEDRESMLNSHLIKLLLVGTILTFSGFLLSYGCPINKKIWSPTFVLTTCGLA-- 292

Query: 392 CLGITNSIFIPEMDWDECDACFRAWSTRYPS 422
                 S F+  + W      ++ WS  + S
Sbjct: 293 ------SSFLALLIWIIDVKGYKKWSLFFES 317


>gi|29349027|ref|NP_812530.1| hypothetical protein BT_3619 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|298386734|ref|ZP_06996289.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|383124379|ref|ZP_09945043.1| hypothetical protein BSIG_3594 [Bacteroides sp. 1_1_6]
 gi|29340934|gb|AAO78724.1| putative transmembrane protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251839125|gb|EES67209.1| hypothetical protein BSIG_3594 [Bacteroides sp. 1_1_6]
 gi|298260408|gb|EFI03277.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 372

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 99/381 (25%), Positives = 153/381 (40%), Gaps = 104/381 (27%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           +   +KR+  LD  RG+T+  MILV+  G    AYA + H+ W G T  D V PFF+FI+
Sbjct: 3   VTTSNKRLLALDVMRGITIAGMILVNTPGSWQHAYAPLKHAEWIGLTPTDLVFPFFMFIM 62

Query: 113 GVAIALALKK------VPKINGAVKKIIFRTLKLLFWGIILQ----GGYSHAPDALSYGV 162
           G++  ++L+K      VP    A  KI+ RT+ +   GI +       + H P       
Sbjct: 63  GISTYISLRKYNFTFSVP----AGLKILKRTVIIFLIGIGISWLSILCFQHDP------F 112

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTT-KRRPNVLEPRHLSIFTAYQWQWIGGFIAFV 221
            +  IR  G++QR+AL Y V A++  L   K  P ++    +S F               
Sbjct: 113 PIDQIRILGVMQRLALGYGVTAIVALLMKHKYIPYLIAVLLISYFA-------------- 158

Query: 222 IYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSD 281
                                      I+  G  G++    N +  VDR + G  H+Y  
Sbjct: 159 ---------------------------ILALG-NGYVYDETNILSIVDRAVLGQAHIYGG 190

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK 341
            +                             +PEGLLSTISAI    IG   G +L+  K
Sbjct: 191 QI----------------------------LDPEGLLSTISAIAHVLIGFCAGKLLMEVK 222

Query: 342 GHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFI 401
               +L+    +G  L     +L + +  PI K+++S S+V  T G          S F+
Sbjct: 223 DIHEKLERLFLIGTILTFAGFLLSYGS--PICKKVWSPSFVLVTCG--------LGSSFL 272

Query: 402 PEMDWDECDACFRAWSTRYPS 422
             + W      ++ WS  + S
Sbjct: 273 ALLVWIIDIKGYKNWSRFFES 293


>gi|223940501|ref|ZP_03632350.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223890825|gb|EEF57337.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 410

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 89/363 (24%), Positives = 150/363 (41%), Gaps = 92/363 (25%)

Query: 58  QKSKRVATLDAFRGLTVVLMI----LVDDAGGAY---------ARIDHSPWNGCTLADFV 104
             +KR+ +LDA RG  +  ++    LV     A+         +++DH  W G    D +
Sbjct: 19  STTKRLLSLDALRGFDMFWIVGGEELVHALYNAWPNGPLGIINSQMDHKVWQGVAFYDLI 78

Query: 105 MPFFLFIVGVAIALALKKVPKING---AVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG 161
            P F+FIVGV++  +L K  ++NG   A+K++ FR+L L  +G+++ GG S   D     
Sbjct: 79  FPLFVFIVGVSLVFSLTKAIEVNGKAAALKRVFFRSLLLYVFGLLIYGGISKGIDG---- 134

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFV 221
                IRW G+LQRIA+ Y   +L+      R                     G  +A  
Sbjct: 135 -----IRWMGVLQRIAICYFSTSLVFCFFKLR---------------------GMIVAAA 168

Query: 222 IYIITTYSL--YVP-------NWSFSEHSDHGVKKYIVKCG------MRGHLGPACNAVG 266
             ++T ++L  +VP       + S  E + H     + +        + G   P  N   
Sbjct: 169 ALLLTYWALMTFVPFPDVRPASASPQEITKHNGFTNVAQLNLSSTTMLHGQFIPGVNLAN 228

Query: 267 YVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILS 326
           YVD++                                   W    ++PEGLLST+ AI++
Sbjct: 229 YVDQK-----------------------------YLPGYKW-DGTYDPEGLLSTLPAIVT 258

Query: 327 GTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTA 386
             +G+  G +L+       + K  +  G G+  +A+   +    P+ K+L++ SYV    
Sbjct: 259 CLLGVFAG-LLLRNPNVPDQKKVLLLAGAGIAGVALGFLWGLEFPVIKKLWTSSYVLVAG 317

Query: 387 GAA 389
           G A
Sbjct: 318 GYA 320


>gi|372223192|ref|ZP_09501613.1| hypothetical protein MzeaS_12804 [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 364

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 53/138 (38%), Positives = 78/138 (56%), Gaps = 8/138 (5%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           +KR+  +D FRG+T+ LM+LV+  G     Y+   H+ W+G T  D V PFFLFIVG +I
Sbjct: 4   NKRIVAVDIFRGMTISLMVLVNTPGTWSSVYSPFLHAQWHGYTPTDLVFPFFLFIVGTSI 63

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
             A K         KKI  R LKL+  G+ L G ++    +  +  D ++IR+ G+LQRI
Sbjct: 64  VFAYKNKKPSLKTYKKIGVRALKLIILGLFL-GAFTL---SFPFFKDFENIRFPGVLQRI 119

Query: 177 ALVYVVV-ALIETLTTKR 193
            +V+ +  AL   L  K+
Sbjct: 120 GVVFFITSALFLNLNWKK 137



 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTN 368
           +  ++PEG LST+ AI S  +GI  G +L+       + K    +      + II H  +
Sbjct: 187 KPDYDPEGFLSTLPAIASALLGIFTGEILV-----GKQNKKTPLLFGLGGCLLIIGHLWD 241

Query: 369 AI-PINKQLYSFSYVCFTAGAADGCLGI 395
            + PINK L++ S+V  TAG A+  L I
Sbjct: 242 IVFPINKALWTSSFVLVTAGWANVVLAI 269


>gi|374373358|ref|ZP_09631018.1| membrane protein [Niabella soli DSM 19437]
 gi|373234331|gb|EHP54124.1| membrane protein [Niabella soli DSM 19437]
          Length = 375

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 149/361 (41%), Gaps = 90/361 (24%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAG-GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           K  R   LD FRG+T+  MI+V+  G   +  + H+ WNG TL D V P FLF VG AIA
Sbjct: 8   KPGRFLALDIFRGMTICFMIIVNTGGPNPFPELRHAQWNGFTLTDLVFPSFLFAVGNAIA 67

Query: 118 LALKKVPKING--AVKKIIFRTLKLLFWGIILQGGYSHAPDALS--YGVDMKHIRWCGIL 173
            +  K  + +    + KII RT  L   G ++        DA +      +   R  G+L
Sbjct: 68  FSKSKWDQQSNKEVLTKIIKRTCLLFLIGYLMYWLPFVKIDAQNNIRPFPIGETRIFGVL 127

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRIAL Y + ALI    + R                              IIT+  L + 
Sbjct: 128 QRIALCYGIGALIIRFASART----------------------------IIITSICLLLG 159

Query: 234 NW----SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLY-SDPVWSRLE 288
            W    +F +++ +G                   A   +D  L+  +HLY  DP      
Sbjct: 160 YWFIMMAFGDYTVNGA------------------AETKLDILLFTRDHLYIKDP------ 195

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                               RA F+PEG LST  AI++  IG   G   IH + +    +
Sbjct: 196 -------------------ARA-FDPEGFLSTFPAIVNVLIGYLAG---IHIRKNPKSYE 232

Query: 349 HWVSMGF-GLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWD 407
               +   G L++A+   +  A P+NK+L++ S+VC T G    CL I   ++   +D+ 
Sbjct: 233 MLARLAVAGFLLVALGYLWNLAFPVNKKLWTSSFVCLTTGL--DCLIIATILYF--IDFK 288

Query: 408 E 408
           E
Sbjct: 289 E 289


>gi|325922207|ref|ZP_08183994.1| hypothetical protein XGA_3017 [Xanthomonas gardneri ATCC 19865]
 gi|325547326|gb|EGD18393.1| hypothetical protein XGA_3017 [Xanthomonas gardneri ATCC 19865]
          Length = 390

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 103/364 (28%), Positives = 152/364 (41%), Gaps = 84/364 (23%)

Query: 43  QDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCT 99
           Q      +     L  K +R  +LD FRGLT+ LMILV+ AG    AYA++ H+ W G T
Sbjct: 4   QTPAAAAITASPTLTPKRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFT 63

Query: 100 LADFVMPFFLFIVGVAIALALKK----VPKIN--GAVKKIIFRTLKLLFWGII--LQGGY 151
           LAD V P FLF VG A++ AL       P +   G    +IF    L++W     LQ G 
Sbjct: 64  LADLVFPSFLFAVGSAMSFALAADTPHRPFLGRVGKRAALIFLCGVLMYWFPFFHLQPGG 123

Query: 152 SHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQW 211
             A  A      +  +R  G+LQRI L Y++ AL   L     P  + P  +++   Y W
Sbjct: 124 GWAFTA------IDQLRLTGVLQRIGLCYLLAAL---LVRYLPPRGIAPACVALLLGY-W 173

Query: 212 QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRE 271
                    V+Y+        P    S+  + G +                     +D  
Sbjct: 174 A--------VLYVFGQ-----PGAELSKTGNAGTR---------------------LDLW 199

Query: 272 LWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGI 331
           L+G  HLY                     R+D        F+PEGLL T+ A ++   G 
Sbjct: 200 LYGRAHLY---------------------RKD------NGFDPEGLLGTLPATVNVLAGY 232

Query: 332 HYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADG 391
             G  L   +G +A       +  G+ ++ + L +  A P++K+L+S S+V  T G    
Sbjct: 233 LTGRFL-QRRGKTAAATR-TLLLAGVGLVLLALLWNPAWPLSKKLWSGSFVACTVGLDLL 290

Query: 392 CLGI 395
            LG+
Sbjct: 291 ALGV 294


>gi|374309722|ref|YP_005056152.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358751732|gb|AEU35122.1| hypothetical protein AciX8_0773 [Granulicella mallensis MP5ACTX8]
          Length = 377

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 83/321 (25%), Positives = 143/321 (44%), Gaps = 62/321 (19%)

Query: 77  MILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKI 133
           MILV D G     Y ++ H+ WNG T  D + P FL I+GVA+  +     +     ++I
Sbjct: 6   MILVTDPGTYSAVYPQLMHAQWNGATATDMIFPSFLVIIGVAMTFSFASRIERGADRRQI 65

Query: 134 IF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL 189
           ++    R++ L+F G+++ G   +         ++  IR  GILQRIAL Y   +L+   
Sbjct: 66  LWHVLTRSVLLIFLGLLVNGFPEY---------NLHTIRIPGILQRIALCYFAGSLLYLA 116

Query: 190 TTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYI 249
            + ++    E + L   T      IG  +A          L V  W         +K Y 
Sbjct: 117 VSGKKDANTESQRLRRGTV-----IGAVLA---------GLLVLYWVL-------LKGYP 155

Query: 250 VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCR 309
           V     G L    N   Y DR+++G+ HL++  +          +P  G           
Sbjct: 156 VPGFGSGRLDSLGNVAAYFDRKIFGVQHLWAYGL----------TPGYG----------- 194

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF-GLLIIAIILHFTN 368
             F+PEGLLST+ A+ +   G+  G  L   + + AR +  + +   G+ ++ + L  + 
Sbjct: 195 VTFDPEGLLSTLPALATLLFGVLAGEWL---RTNQARGRKALVLAVAGVALVLVGLALSP 251

Query: 369 AIPINKQLYSFSYVCFTAGAA 389
            +P+NK++ + ++  F+ G A
Sbjct: 252 LLPLNKKILTSTFAIFSGGVA 272


>gi|163786877|ref|ZP_02181325.1| hypothetical protein FBALC1_16867 [Flavobacteriales bacterium
           ALC-1]
 gi|159878737|gb|EDP72793.1| hypothetical protein FBALC1_16867 [Flavobacteriales bacterium
           ALC-1]
          Length = 361

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 51/130 (39%), Positives = 74/130 (56%), Gaps = 7/130 (5%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           S R+ ++D  RGLT++ MILV+  G     Y  + H+ W+G T  D + PFFLFIVG++I
Sbjct: 2   SARIESVDILRGLTILAMILVNTPGTWGHVYTPLRHAEWHGLTPTDLIFPFFLFIVGISI 61

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
             A K  P      KKII R+LKL+  G+ L     H P    +  D +  R  G+LQRI
Sbjct: 62  YFAYKNKPNTKLTYKKIIIRSLKLIGLGLFLNLFLPHFP----FFNDFETHRIPGVLQRI 117

Query: 177 ALVYVVVALI 186
            LV++  +++
Sbjct: 118 GLVFLFSSIL 127


>gi|319900285|ref|YP_004160013.1| hypothetical protein Bache_0400 [Bacteroides helcogenes P 36-108]
 gi|319415316|gb|ADV42427.1| putative transmembrane protein [Bacteroides helcogenes P 36-108]
          Length = 396

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 101/390 (25%), Positives = 159/390 (40%), Gaps = 102/390 (26%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           + SKR+  LD  RG+T+  MI+V++ G     YA ++H+ W G T  D V PFF+FI+G+
Sbjct: 5   KTSKRILALDILRGITIAGMIMVNNPGNWGHIYAPLEHAEWIGLTPTDLVFPFFMFIMGI 64

Query: 115 AIALALKK--VPKINGAVKKIIFRTLKLLFWGII------LQGGYSHAPDALSYGVDM-- 164
           +  ++LKK        A  KI+ RT  +   G+       L   ++ AP  LS+G ++  
Sbjct: 65  STYISLKKYDFEFSRSAALKILKRTAIIFLIGLAIGWFARLCYYWAAAPGELSFGENLWA 124

Query: 165 -----KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIA 219
                  +R  G++QR+AL Y   ++I  LT K        RH+    A      G  I+
Sbjct: 125 SVWTFDRMRILGVMQRLALCYGATSII-ALTMKH-------RHIPYLIA------GLLIS 170

Query: 220 FVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLY 279
           + I                          ++ CG  G      N +  VDR +    H+Y
Sbjct: 171 YFI--------------------------LLMCG-NGFAYNETNILSVVDRAVLTPAHMY 203

Query: 280 SDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI- 338
            D                               +PEGLLSTI +I    +G   G +++ 
Sbjct: 204 KD----------------------------NGIDPEGLLSTIPSIAHVLLGFCVGRMMLG 235

Query: 339 -HFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAADGC 392
                 + R+    S    LL+   IL F   +     PI+K+++S ++V  T G A   
Sbjct: 236 DDRNAKADRMAVLDSHLIKLLLTGAILTFAGFLLSYGCPISKKIWSPTFVLVTCGMA--- 292

Query: 393 LGITNSIFIPEMDWDECDACFRAWSTRYPS 422
                S F+  + W      +R WS  + S
Sbjct: 293 -----SSFLALLIWIVDVKGYRKWSVFFES 317


>gi|16124796|ref|NP_419360.1| hypothetical protein CC_0541 [Caulobacter crescentus CB15]
 gi|221233512|ref|YP_002515948.1| hypothetical protein CCNA_00575 [Caulobacter crescentus NA1000]
 gi|13421730|gb|AAK22528.1| hypothetical protein CC_0541 [Caulobacter crescentus CB15]
 gi|220962684|gb|ACL94040.1| hypothetical protein CCNA_00575 [Caulobacter crescentus NA1000]
          Length = 372

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 88/336 (26%), Positives = 135/336 (40%), Gaps = 71/336 (21%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIV 112
           + Q + R  +LD FRGLTV LMI+V+ AG    AY+++ H+PW G T AD V P FLF V
Sbjct: 1   MSQPAARFLSLDVFRGLTVFLMIVVNTAGPGAKAYSQLVHAPWFGFTAADAVFPSFLFAV 60

Query: 113 GVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQ-GGYSHAPDALSYGVDMKHIRWCG 171
           G ++A A  K   +N    K++ R   +   G ++    +    D     +     R  G
Sbjct: 61  GCSMAFAFSKPIPLNDFTVKVLRRAALIFLLGFLMYWFPFVRKVDGDWALIPFSDTRVMG 120

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           +LQRIAL Y++ A             L PR +    A         +  + Y     +  
Sbjct: 121 VLQRIALCYLLAAFAV--------RWLSPRLIVALCA---------VLLLGYWAILMAFG 163

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
            P    S+  + G +                     +D  L G NHLY            
Sbjct: 164 DPAAPLSKLGNAGTR---------------------LDLLLIGQNHLY------------ 190

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV 351
                    R+D        F+PEGLL T+ + ++   G      L    G S  +    
Sbjct: 191 ---------RKD------GGFDPEGLLGTLPSTVNVLAGYLAARFLKENPGSSQAMGRMA 235

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
             G  L++  ++  ++   PI K+L++ S+V  T G
Sbjct: 236 IAGLVLILAGLV--WSPLFPIAKKLWTSSFVLLTVG 269


>gi|399028182|ref|ZP_10729485.1| hypothetical protein PMI10_01306 [Flavobacterium sp. CF136]
 gi|398074259|gb|EJL65410.1| hypothetical protein PMI10_01306 [Flavobacterium sp. CF136]
          Length = 423

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 98/386 (25%), Positives = 160/386 (41%), Gaps = 70/386 (18%)

Query: 60  SKRVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
            +R+ +LD FRGLT++LM +V+   D G  +  + H+ WNGCT  D V PFF+FI+GVA+
Sbjct: 3   KERLISLDVFRGLTILLMTIVNNPGDWGHVFPPLLHAKWNGCTPTDLVFPFFIFIMGVAV 62

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQG-------GYSHAPDAL----------- 158
            LA+      +    KI+ R+L++L  GI           G    P  +           
Sbjct: 63  PLAMPDKIYDDTTFNKILVRSLRMLCLGIFFNFFEKIQLFGLEGIPLLIGRLIITIAVGY 122

Query: 159 ----SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWI 214
               ++   +K+I    IL  I  +++  + IE     R P VL+     I   Y     
Sbjct: 123 VLMGNFSSKLKNIFAFSIL--IIYLFLAYSEIEAYQDVRLPGVLQ----RIAVVY----- 171

Query: 215 GGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGM-RGHLGPACNAVGYVDRELW 273
             F+  ++Y+ T+    +    F       V   I   G+   +L    N   ++D  L 
Sbjct: 172 --FVVSLLYLKTSQKTQIITGVFLLLGYWAVMNLIPVPGIGEANLEKGTNLASWLDSILL 229

Query: 274 GINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHY 333
                                   G +  +  +W     +PEG+LSTI +I++G IG+  
Sbjct: 230 -----------------------KGHMYHETKTW-----DPEGILSTIPSIVNGIIGLLI 261

Query: 334 GHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCL 393
           G +L          K    +G  L+I  ++ +     PINK +++ SYV +T G A   L
Sbjct: 262 GQLLFLKIAKIEIAKKIALIGIALIITGLLWNI--VFPINKSIWTSSYVLYTTGLAATTL 319

Query: 394 GITNSIFIPEMDWDECDACFRAWSTR 419
            +   I I   ++ +    F  W   
Sbjct: 320 SVLYFI-IDIAEYKKGFKLFLIWGVN 344


>gi|338212268|ref|YP_004656323.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336306089|gb|AEI49191.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 365

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 86/347 (24%), Positives = 144/347 (41%), Gaps = 85/347 (24%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           +  ++ R+ +LDA RG T+  M++V+  G     +  + H+ WNG +  D V P FLF+V
Sbjct: 1   MSPQTNRLVSLDALRGFTIAAMLMVNFPGSEEYVFFTLRHTKWNGLSFTDLVAPIFLFVV 60

Query: 113 GVAIALALKKVPKING-----AVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHI 167
           GV+I  A  K  K +G       +KII R+LK+   G+ L          L    D   I
Sbjct: 61  GVSIVFAYSK-RKWDGRPTGELYRKIIIRSLKIFAVGMFLN---------LMPTFDFSDI 110

Query: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227
           RW G L RIA V++  A++   T  +                Q  W+G  I    ++  T
Sbjct: 111 RWTGTLHRIAFVFLGCAVLYLNTNWK----------------QQAWVGAVILVAYWLALT 154

Query: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287
               +P          G+ K +++        P  N V + D +                
Sbjct: 155 ---LIPT--------PGIGKVMLE--------PGVNLVAWFDTQFL-------------- 181

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL 347
                      P +    +W     +PE +LST  +I+SG  G+  G +L      + ++
Sbjct: 182 -----------PGKMWQGTW-----DPESILSTFPSIVSGITGMLAGQLLQSTFTPNEKV 225

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
            + ++   G+   A+   +    P+N+ L++ S+V  T+G A   LG
Sbjct: 226 NYLMTA--GVFSAALGYFWGLGFPVNENLWTSSFVLVTSGFACLLLG 270


>gi|381169858|ref|ZP_09879020.1| conserved hypothetical protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380689628|emb|CCG35507.1| conserved hypothetical protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 388

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 85/285 (29%), Positives = 119/285 (41%), Gaps = 70/285 (24%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            K +R  +LD FRGLT+ LMILV+ AG    AYA++ H+ W G TLAD V P FLF VG 
Sbjct: 17  SKRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGS 76

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGI 172
           A++ AL         + ++  R   +L  G+++     +   PD       +  +R  G+
Sbjct: 77  AMSFALATNAPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGV 136

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQRI L Y+  AL   L     P  + P  L++   Y W +                LYV
Sbjct: 137 LQRIGLCYLAAAL---LVRYLPPRGIAPVCLALLLGY-WAF----------------LYV 176

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                +E S  G                  NA   +D  L+G +HLY             
Sbjct: 177 FGQPGAELSKTG------------------NAGTRLDLWLYGRDHLY------------- 205

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
                   R+D        F+PEGLL T+SA ++   G   G  L
Sbjct: 206 --------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236


>gi|359460787|ref|ZP_09249350.1| hypothetical protein ACCM5_18818 [Acaryochloris sp. CCMEE 5410]
          Length = 383

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 93/343 (27%), Positives = 145/343 (42%), Gaps = 83/343 (24%)

Query: 66  LDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK 122
           LD FRG+ +  M+LV+ +G    AY ++ H+ W+G TLAD V PFFL ++G ++A ++ +
Sbjct: 13  LDVFRGIAIAGMLLVNKSGLVKEAYPQLLHADWHGWTLADLVFPFFLVVLGASMAFSMAR 72

Query: 123 ------VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
                  PK      KI+ R+  L   G+ L G +S          +   +R  GILQRI
Sbjct: 73  HTASLTQPK-RAVYLKILRRSAVLFGLGLFLNGFWSF---------NFSTLRVMGILQRI 122

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           +L Y+  A +  L   R+               QW   G  +  V Y +    + VP + 
Sbjct: 123 SLTYLASAFV-ILKLPRKS--------------QWGLTG--LLLVGYWLALSFIPVPEFG 165

Query: 237 FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPN 296
                              G+L    N   Y+DR + G +HLY    ++ +         
Sbjct: 166 ------------------PGNLTRTGNFGAYIDRLIIGSSHLYVGDQFNSMG-------- 199

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK----HWVS 352
                           +PEGL ST+ AI +  +G   G   I  +G   ++K        
Sbjct: 200 ----------------DPEGLFSTLPAIATVLLGYFAGD-WIRKRGSGLKIKTSRQSLAL 242

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
             +GL    + L ++   PINK+L++ SYV FT G A   L +
Sbjct: 243 ASYGLFSTGLGLLWSIWFPINKKLWTSSYVLFTVGIALILLAV 285


>gi|119491291|ref|ZP_01623345.1| hypothetical protein L8106_21879 [Lyngbya sp. PCC 8106]
 gi|119453455|gb|EAW34617.1| hypothetical protein L8106_21879 [Lyngbya sp. PCC 8106]
          Length = 371

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 98/338 (28%), Positives = 143/338 (42%), Gaps = 82/338 (24%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LD FRG+ +  MILV++ G     Y  + H+ W+G T  D V P FLFIVGVA+  
Sbjct: 2   RLTSLDVFRGIAIASMILVNNPGSWNHVYPLLKHAEWHGYTPTDLVFPSFLFIVGVAMTF 61

Query: 119 ALKKVPKINGAVK-----KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
           ++ K    N  ++     KI  R L+      +L    +  P+      D+ +IR  G+L
Sbjct: 62  SMSKYLPENRNLEENISPKIYLRILRRCLILFLLGLLLNGYPN-----YDLANIRIMGVL 116

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI-YIITTYSLYV 232
           QRI+L Y + A I  L   R+                 Q  G  I  +I Y +    + V
Sbjct: 117 QRISLAYGLSA-ITILHLSRK-----------------QIWGLSIGLLIGYAVVMQLIPV 158

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
           PN         GV           +L P  N   Y+DR + G +HL              
Sbjct: 159 PN--------SGVV----------NLTPEGNFAAYLDRLILGEHHLLGG----------- 189

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                              ++PEGLLST+ A+++  IG   G+ L     +S    + V 
Sbjct: 190 -----------------GKYDPEGLLSTLPAVVTVLIGYLTGNWLKKQPINSQTSLNLVI 232

Query: 353 MGFGLLIIAIILHFTNAI-PINKQLYSFSYVCFTAGAA 389
           +G   +I+    H    I PINK L++ SYV  TAG A
Sbjct: 233 IGLCNIIVG---HLWGLIFPINKSLWTSSYVLVTAGWA 267


>gi|418515336|ref|ZP_13081517.1| hypothetical protein MOU_00795 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|418520970|ref|ZP_13087016.1| hypothetical protein WS7_08123 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410702946|gb|EKQ61443.1| hypothetical protein WS7_08123 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410708055|gb|EKQ66504.1| hypothetical protein MOU_00795 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
          Length = 388

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 81/285 (28%), Positives = 118/285 (41%), Gaps = 70/285 (24%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            K +R  +LD FRGLT+ LMILV+ AG    AYA++ H+ W G TLAD V P FLF VG 
Sbjct: 17  SKRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGS 76

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGI 172
           A++ AL         + ++  R   +L  G+++     +   PD       +  +R  G+
Sbjct: 77  AMSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGV 136

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQRI L Y+  AL   L     P  + P  L++   Y W  +             Y+   
Sbjct: 137 LQRIGLCYLAAAL---LVRYLPPRGIAPVCLALLLGY-WALL-------------YAFGQ 179

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
           P    S+  + G +                     +D  L+G +HLY             
Sbjct: 180 PGAELSKTGNAGTR---------------------LDLWLYGRDHLY------------- 205

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
                   R+D        F+PEGLL T+SA ++   G   G  L
Sbjct: 206 --------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236


>gi|317474486|ref|ZP_07933760.1| hypothetical protein HMPREF1016_00739 [Bacteroides eggerthii
           1_2_48FAA]
 gi|316909167|gb|EFV30847.1| hypothetical protein HMPREF1016_00739 [Bacteroides eggerthii
           1_2_48FAA]
          Length = 396

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 100/391 (25%), Positives = 161/391 (41%), Gaps = 105/391 (26%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           + +KR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G+
Sbjct: 6   KTNKRILALDILRGVTIAGMIMVNNPGTWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGI 65

Query: 115 AIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSH-------APDALSYGVDM- 164
           +  ++LKK      + A  KI+ RT+ +   G+ + G +S          + +S+G  + 
Sbjct: 66  STYISLKKYNFEFSHAAGIKILKRTILIFLIGMAI-GWFSKFCYYWTSPTEGISFGTQLW 124

Query: 165 ------KHIRWCGILQRIALVYVVVALIE-TLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                   IR  G++QR+AL Y   A+I  T+  K  P ++     ++ T Y       F
Sbjct: 125 ESVWTFDRIRILGVMQRLALCYGATAIIALTMKHKNIPYLIA----TLLTGY-------F 173

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
           I                              ++ CG  G      N +  VDR +    H
Sbjct: 174 I------------------------------LLLCG-NGFAYNDTNILSIVDRTILTPAH 202

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y D                               +PEGLLSTI AI    +G   G ++
Sbjct: 203 MYKD----------------------------NGIDPEGLLSTIPAIAHVLLGFCVGRMM 234

Query: 338 IH-FKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAADG 391
           +   K +  R     S    L ++  IL F+  +     PINK+++S ++V  T G A  
Sbjct: 235 LEGGKANEDRESMLNSHLIKLFLVGTILTFSGFLLSYGCPINKKIWSPTFVLTTCGLA-- 292

Query: 392 CLGITNSIFIPEMDWDECDACFRAWSTRYPS 422
                 S F+  + W      ++ WS  + S
Sbjct: 293 ------SSFLALLIWIIDVKGYKKWSLFFES 317


>gi|418752318|ref|ZP_13308585.1| putative membrane protein [Leptospira santarosai str. MOR084]
 gi|409967313|gb|EKO35143.1| putative membrane protein [Leptospira santarosai str. MOR084]
          Length = 363

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 85/332 (25%), Positives = 145/332 (43%), Gaps = 86/332 (25%)

Query: 65  TLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 121
           +LD FRG+TV+ MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG +I ++L 
Sbjct: 2   SLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGTSIPISLY 61

Query: 122 KVPKINGA--VKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGILQRIA 177
               IN +     I  R+  L+  G+ L   G +S A            +R  G+LQRI 
Sbjct: 62  SKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFA-----------ELRIPGVLQRIG 110

Query: 178 LVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF 237
            VY VVA +  + + ++                           I + +   L +  W  
Sbjct: 111 FVYWVVASLCLVFSGKK---------------------------ILVFSVPILLIHTWIL 143

Query: 238 SEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNS 297
           ++ +  G  + +V       +G       ++DR ++G  HL+                  
Sbjct: 144 TQIALPG--ESVVSLEQGKDIG------AWIDRTIFGEKHLW------------------ 177

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL 357
                 + +W     +PEG LS ++++++   G+  G +L        R +    +G G+
Sbjct: 178 ----RFSKTW-----DPEGFLSGVASVVTTLFGVLCGFILF------LRERRNKILGLGI 222

Query: 358 LIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           L   + L +  ++P+NK L++ SY  +TAG +
Sbjct: 223 LFSFVGLLWDLSLPMNKSLWTGSYSVYTAGLS 254


>gi|380693406|ref|ZP_09858265.1| hypothetical protein BfaeM_05407 [Bacteroides faecis MAJ27]
          Length = 371

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 151/378 (39%), Gaps = 102/378 (26%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
             +KR+  LD  RG+T+  MILV+  G     YA + H+ W G T  D V PFF+FI+G+
Sbjct: 4   TSNKRLLALDVMRGITIAGMILVNTPGSWQHTYAPLKHAEWIGLTPTDLVFPFFMFIMGI 63

Query: 115 AIALALKK------VPKINGAVKKIIFRTLKLLFWGIILQ----GGYSHAPDALSYGVDM 164
           +  ++L+K      +P    A  KI+ RT+ +   GI +       + H P        +
Sbjct: 64  STYISLRKYDFTFSIP----AGLKILKRTVIIFLIGIGISWLSILCFQHDP------FPI 113

Query: 165 KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYI 224
             IR  G++QR+AL Y + AL   L   +                          ++ Y+
Sbjct: 114 DQIRILGVMQRLALGYGITALAALLIKHK--------------------------YIPYL 147

Query: 225 ITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVW 284
           IT                  +  +++     G++    N +  VDR + G  H+Y     
Sbjct: 148 ITVLL---------------IGYFMILAVGNGYVYDETNVLSIVDRAVLGQAHIYGG--- 189

Query: 285 SRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS 344
                                    A  +PEGLLSTISA+    IG   G +L+  K   
Sbjct: 190 -------------------------AILDPEGLLSTISAVAHVMIGFCAGKLLMEVKDIH 224

Query: 345 ARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEM 404
            +L+    +G  L     +L + +  PI K+++S S+V  T G          S F+  +
Sbjct: 225 EKLERLFLIGTILTFAGFLLSYGS--PICKKIWSPSFVLITCG--------MGSSFLALL 274

Query: 405 DWDECDACFRAWSTRYPS 422
            W      ++ WS  + S
Sbjct: 275 VWIIDIKGYKGWSRFFES 292


>gi|388456506|ref|ZP_10138801.1| hypothetical protein FdumT_08017 [Fluoribacter dumoffii Tex-KL]
          Length = 372

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 145/339 (42%), Gaps = 73/339 (21%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           ++KR+ +LD FRGLT+ LM+LV+  G    Y  + H  WNGC+LAD V P FLFIVG+  
Sbjct: 7   ETKRILSLDVFRGLTMALMVLVNSLGSRENYKILMHVEWNGCSLADLVFPAFLFIVGITT 66

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
            ++L++   +N   K  ++R+   +    +L   +    +     +D+  IR  GILQRI
Sbjct: 67  VISLQR--HLNDESKAQLYRS---ILTRTLLLMFFGLFLNIFPKQIDLSTIRIYGILQRI 121

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           A  Y++ ++                 L + T+++ Q I      + Y      + VP   
Sbjct: 122 AWCYLICSI-----------------LYLHTSFRTQIIILVGILIGYWFFLTQIPVPGPG 164

Query: 237 FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPN 296
           F +                  L  A N V Y++  L+   HL                 N
Sbjct: 165 FDQ------------------LSMAKNWVSYIETHLFSPRHLLFK--------------N 192

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFG 356
             P               EG LSTI AI +   G+  G  L+     S   K +  +  G
Sbjct: 193 FDP---------------EGFLSTIPAIATTLSGLLVGQYLL--TPGSKLKKSFTLITIG 235

Query: 357 LLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           +L + +   + +  PINK L++ ++V +++G +    G+
Sbjct: 236 ILCLFVAGFWNHYFPINKNLWTSTFVLWSSGFSLIVFGL 274


>gi|224025514|ref|ZP_03643880.1| hypothetical protein BACCOPRO_02254, partial [Bacteroides
           coprophilus DSM 18228]
 gi|224018750|gb|EEF76748.1| hypothetical protein BACCOPRO_02254 [Bacteroides coprophilus DSM
           18228]
          Length = 298

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 86/341 (25%), Positives = 149/341 (43%), Gaps = 89/341 (26%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVD-DAGGAYARIDHSPWNGCTLADFVMPFFLF 110
           +  L Q+  KR+ ++D FRG+T+  MILV+  AGG++  + H P  G  +AD V P F+F
Sbjct: 8   INTLNQKNMKRLLSIDIFRGITIFFMILVNTQAGGSFDFLIHIPGYGWRIADLVYPSFIF 67

Query: 111 IVGVAIALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           I+G ++ L+++K  +       K I  RT+ +   GII         + + +  ++  +R
Sbjct: 68  IMGASMYLSMRKYVEAPPTDLYKHIFRRTVLIFLMGIIF--------NWIPFDQNLLDVR 119

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQRIA+VY++ +L+  +  +  P +L               I G I    Y++T  
Sbjct: 120 ILGVLQRIAIVYLICSLL-VIKVRSIPTLLS--------------ISGLILGFYYLLT-- 162

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
                                    M+G+       V  VD  + G  H+Y+        
Sbjct: 163 -------------------------MKGY-----EIVDAVDLAIIGTKHMYT-------- 184

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                           P+      +PEGLLS+I ++++  IG     +L        RL 
Sbjct: 185 ----------------PT-----HDPEGLLSSIPSVVNAIIGYVSARILTE-NELKERLI 222

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
              ++   ++ +A ILH+T  +PI K  +S S+   T+G +
Sbjct: 223 KMSTIAISMIALAYILHWT-ILPIYKTYWSSSFGLLTSGIS 262


>gi|390989491|ref|ZP_10259788.1| conserved hypothetical protein [Xanthomonas axonopodis pv. punicae
           str. LMG 859]
 gi|372555760|emb|CCF66763.1| conserved hypothetical protein [Xanthomonas axonopodis pv. punicae
           str. LMG 859]
          Length = 388

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 80/285 (28%), Positives = 119/285 (41%), Gaps = 70/285 (24%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            K +R  +LD FRGLT+ LMILV+ AG    AYA++ H+ W G TLAD V P FLF VG 
Sbjct: 17  SKRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGS 76

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGI 172
           A++ AL         + ++  R   +L  G+++     +   PD       +  +R  G+
Sbjct: 77  AMSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGV 136

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQRI L Y+  AL+     +R    + P  L++   Y W  +             Y+   
Sbjct: 137 LQRIGLCYLAAALLVRYLPQRG---IAPVCLALLLGY-WALL-------------YAFGQ 179

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
           P    S+  + G +                     +D  L+G +HLY             
Sbjct: 180 PGAELSKTGNAGTR---------------------LDLWLYGRDHLY------------- 205

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
                   R+D        F+PEGLL T+SA ++   G   G  L
Sbjct: 206 --------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236


>gi|198277541|ref|ZP_03210072.1| hypothetical protein BACPLE_03763 [Bacteroides plebeius DSM 17135]
 gi|198270039|gb|EDY94309.1| hypothetical protein BACPLE_03763 [Bacteroides plebeius DSM 17135]
          Length = 338

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 84/319 (26%), Positives = 142/319 (44%), Gaps = 86/319 (26%)

Query: 77  MILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVP--KINGAVKK 132
           MILV++AGG  +YA + HS WNG T  D V PFFLF+VG++  ++L+K      +  ++K
Sbjct: 1   MILVNNAGGPVSYAPLRHSVWNGLTPCDLVFPFFLFMVGISTYISLRKFNFGPTSEVIRK 60

Query: 133 IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK 192
           I+ RT  ++  G+ +   + +A +   + +D   +R  G+LQRI L Y +V+L       
Sbjct: 61  IVRRTFLIILIGLAID-WFGYACNGNFFPID--TLRIPGVLQRIGLCYGIVSL------- 110

Query: 193 RRPNVLEPRHLSIFTAYQW-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
                     + I+  +++  WI G                           G+      
Sbjct: 111 ----------MVIYINHKYLPWICG---------------------------GLLLAYSI 133

Query: 252 CGMRGHLGPAC---NAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWC 308
             + G+ G AC   N +  +DR ++G  HLY                             
Sbjct: 134 LLLTGN-GYACDDTNLLAVIDRGIFGEAHLYK---------------------------- 164

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTN 368
           ++P +PEGL  T+SA+    IG   G +L+     + R+   ++    +LII  +L    
Sbjct: 165 KSPIDPEGLAGTLSAVAHTLIGFMCGRLLLEKISVNKRIVKLITAAVMMLIIGYVLSI-- 222

Query: 369 AIPINKQLYSFSYVCFTAG 387
            +PINK+++S ++V  T G
Sbjct: 223 WMPINKRVWSTTFVLVTCG 241


>gi|340786861|ref|YP_004752326.1| protein involved in N-acetyl-D-glucosamine utilization [Collimonas
           fungivorans Ter331]
 gi|340552128|gb|AEK61503.1| protein involved in N-Acetyl-D-glucosamine utilization [Collimonas
           fungivorans Ter331]
          Length = 373

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 53/155 (34%), Positives = 88/155 (56%), Gaps = 20/155 (12%)

Query: 49  ELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVM 105
            L++  + +  ++R+A++DA RG TV  M+LV+D G     YA ++HS W+GCT  D V 
Sbjct: 11  ALRITTVCRSMTRRLASVDALRGCTVAAMLLVNDPGDWSHVYAPLEHSAWHGCTPTDLVF 70

Query: 106 PFFLFIVGVAIALALKKVPKI-NGA-----VKKIIFRTLKLLFWGIILQGGYSHAPDALS 159
           PFFLF+VGV+ AL ++  P++  GA      +  + R L+++  G+++        + L+
Sbjct: 71  PFFLFVVGVSTALGIE--PRLAQGANPSTLARAALIRALRIVALGLLI--------NLLA 120

Query: 160 YGV-DMKHIRWCGILQRIALVYVVVALIETLTTKR 193
           + +    H+R  G+LQRI L +   AL    T  R
Sbjct: 121 WFIMPGVHLRLPGVLQRIGLCFAATALCSIYTRPR 155


>gi|410448043|ref|ZP_11302130.1| putative membrane protein [Leptospira sp. Fiocruz LV3954]
 gi|410018124|gb|EKO80169.1| putative membrane protein [Leptospira sp. Fiocruz LV3954]
 gi|456875246|gb|EMF90470.1| putative membrane protein [Leptospira santarosai str. ST188]
          Length = 363

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 146/332 (43%), Gaps = 86/332 (25%)

Query: 65  TLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 121
           +LD FRG+TV+ MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG +I ++L 
Sbjct: 2   SLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGTSIPISLY 61

Query: 122 KVPKINGA--VKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGILQRIA 177
               IN +     I  R+  L+  G+ L   G +S A            +R  G+LQRI 
Sbjct: 62  SKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFA-----------ELRIPGVLQRIG 110

Query: 178 LVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF 237
            VY VVA +  +   ++                       + F++ I     L +  W  
Sbjct: 111 FVYWVVASLCLVFPGKK----------------------ILVFLVPI-----LLIHTWIL 143

Query: 238 SEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNS 297
           ++ +  G  + +V       +G       ++DR ++G  HL+                  
Sbjct: 144 TQIALPG--ESVVSLEQGKDIG------AWIDRTIFGEKHLW------------------ 177

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL 357
                 + +W     +PEG LS ++++++   G+  G +L        R +    +G G+
Sbjct: 178 ----RFSKTW-----DPEGFLSGVASVVTTLFGVLCGFILF------LRERRNKILGLGI 222

Query: 358 LIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           L   + L +  ++P+NK L++ SY  +TAG +
Sbjct: 223 LFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLS 254


>gi|440747989|ref|ZP_20927244.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
 gi|436483731|gb|ELP39771.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
          Length = 382

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/140 (35%), Positives = 80/140 (57%), Gaps = 7/140 (5%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R   LD  RGLT+ LMI+V+  G     ++ + H+ W+G T+ D V P FLF+VG A++
Sbjct: 13  ERYLALDVLRGLTIALMIVVNTPGDWSNVFSPLLHADWHGFTITDLVFPTFLFVVGNAMS 72

Query: 118 LALKKVPKIN-GAVKKIIFRTLKLLF---WGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
            ++KK+ K++ GA  K +F+   L+F   WG+     +      +   +D   +R  G+L
Sbjct: 73  FSMKKMEKMSQGAFLKKVFKRAALIFLIGWGLNAFPFFETNETGVVSMIDWSAVRLLGVL 132

Query: 174 QRIALVYVVVALIETLTTKR 193
           QRIAL Y++ +L+     KR
Sbjct: 133 QRIALCYLIASLVLYYIGKR 152



 Score = 38.9 bits (89), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 2/84 (2%)

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           PF+PEG+LST  ++++   G   G   I   G++ R    + +  GL+ +   + +  A 
Sbjct: 206 PFDPEGVLSTWPSVVNVIAGFLAGK-FIQQIGNNKRTVKALLLA-GLIAVGFSVIWELAF 263

Query: 371 PINKQLYSFSYVCFTAGAADGCLG 394
           PINK++++ SYV  T G     LG
Sbjct: 264 PINKKIWTSSYVLLTVGLDLIVLG 287


>gi|255038072|ref|YP_003088693.1| hypothetical protein Dfer_4326 [Dyadobacter fermentans DSM 18053]
 gi|254950828|gb|ACT95528.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 368

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/135 (38%), Positives = 76/135 (56%), Gaps = 18/135 (13%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           S R+ +LDA RG T+  MI+V+  G     +  + HS WNG T  D + P FLFIVGV+I
Sbjct: 8   SSRLLSLDAMRGFTIAAMIMVNFPGHEDYVFPTLRHSKWNGLTFTDLIAPTFLFIVGVSI 67

Query: 117 ALA-----LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
            LA     L   PK +G  +KI+ R+LK+   G+ L    +  PD      +   +R+ G
Sbjct: 68  TLAYSKKRLSNAPK-SGLYRKIVIRSLKIFAVGMFL----NMLPD-----FNFSDLRYTG 117

Query: 172 ILQRIALVYVVVALI 186
            L RIA+V++V A++
Sbjct: 118 TLHRIAIVFLVCAIL 132



 Score = 38.5 bits (88), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 44/83 (53%), Gaps = 4/83 (4%)

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-P 371
           +PEG+LST  AI +   GI  G +++     + +    ++ GF     A + +F N I P
Sbjct: 194 DPEGILSTFPAIATTITGILAGRLMLLPFSPNEKSNFLLTAGFA---TAALGYFWNLIFP 250

Query: 372 INKQLYSFSYVCFTAGAADGCLG 394
           +N+ L++ S+V  T+G A    G
Sbjct: 251 VNENLWTSSFVLVTSGFASMLFG 273


>gi|424670170|ref|ZP_18107195.1| hypothetical protein A1OC_03788 [Stenotrophomonas maltophilia
           Ab55555]
 gi|401070628|gb|EJP79142.1| hypothetical protein A1OC_03788 [Stenotrophomonas maltophilia
           Ab55555]
          Length = 355

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 84/158 (53%), Gaps = 21/158 (13%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ ++DA RG+TV  M+LV++ G     +A + HS W+GCT  D V PFFLF+VGV++A
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 66

Query: 118 LALKKVPKINGAVKK------IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
            ++   P+   A  +      ++ R L++L  G +L          + + +D  H R  G
Sbjct: 67  FSVA--PRAQDAAARPALARGVLERALRILMAGALLH-------LLIWWALDTHHFRIWG 117

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAY 209
           +LQRIA   V  AL+  L    RP V     +++   Y
Sbjct: 118 VLQRIA---VCAALVGVLAVYARPRVQVGALIALLVGY 152


>gi|410638830|ref|ZP_11349383.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
           lipolytica E3]
 gi|410141358|dbj|GAC16588.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
           lipolytica E3]
          Length = 365

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 86/343 (25%), Positives = 142/343 (41%), Gaps = 85/343 (24%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
            R  +LD FRG+T+  M+LV++ G     Y  + H+ W+G T  D + PFFLFIVG A+ 
Sbjct: 2   NRQISLDIFRGITLAAMLLVNNPGSWSFVYPPLLHAKWHGLTPTDLIFPFFLFIVGAAMF 61

Query: 118 LAL-KKVPKINGAV----KKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
            ++ + +PK N A+    +KI  RT+ L   G +L        +   +  D ++ R  G+
Sbjct: 62  HSMGRYLPKANQALQVPWQKIAKRTIVLFAIGFLL--------NIFPFTGDPQNWRIMGV 113

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQRIA+ Y + A++  +  +++                       IA  I ++  Y    
Sbjct: 114 LQRIAICYGIAAILICVLHQKQ----------------------LIAACITLLIGY---- 147

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                          +++   +    G   N V  +D E+ G  HLY             
Sbjct: 148 ---------------WLMLNLVENPYGLETNLVRLIDIEVLGSAHLYQG----------- 181

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                              F+PEGLLS I A+++   G     +L + K    R+K  + 
Sbjct: 182 ---------------FGVAFDPEGLLSCIPAVVTVLAGFFTSKMLANAKTEQQRMKTLLL 226

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
                 ++ I   F+   P+NK L++ +YV  T G A   L +
Sbjct: 227 WSLVTFVVCIAWQFS--FPVNKSLWTSTYVLATNGFAWLALAV 267


>gi|329956032|ref|ZP_08296803.1| hypothetical protein HMPREF9445_01662 [Bacteroides clarus YIT
           12056]
 gi|328524791|gb|EGF51845.1| hypothetical protein HMPREF9445_01662 [Bacteroides clarus YIT
           12056]
          Length = 396

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 102/392 (26%), Positives = 163/392 (41%), Gaps = 107/392 (27%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           + +KR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G+
Sbjct: 6   KTNKRILALDILRGVTIAGMIMVNNPGTWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGI 65

Query: 115 AIALALKKVP-KINGAV-KKIIFRTLKLLFWGIILQGGYSH-------APDALSYGVDM- 164
           +  ++LKK   + + AV  KI+ RT+ +   G+ + G +S          + +S+G  + 
Sbjct: 66  STYISLKKYNFEFSRAVGMKILKRTILIFLIGMAI-GWFSKFCYYWTSPTEGISFGAQLW 124

Query: 165 ------KHIRWCGILQRIALVYVVVALIETLTTKRR--PNVLEPRHLSIFTAYQWQWIGG 216
                   IR  G++QR+AL Y   A+I  LT K R  P ++     ++ T Y       
Sbjct: 125 ESVWTFDRIRILGVMQRLALCYGATAIIA-LTVKHRNIPYLIA----TLLTGY------- 172

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
           FI  V                              CG  G      N +  +DR +    
Sbjct: 173 FILLV------------------------------CG-NGFAYNDTNILSVIDRTILTPA 201

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEGLLSTI +I    +G   G +
Sbjct: 202 HMYKD----------------------------NGIDPEGLLSTIPSIAHVLLGFCVGRM 233

Query: 337 LI-HFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAAD 390
           ++ + K +  R     S    L +   IL F+  +     PINK+++S ++V  T G A 
Sbjct: 234 MLENGKANEDRESMLNSHLIKLFLAGTILTFSGFLLSYGCPINKKIWSPTFVLTTCGLA- 292

Query: 391 GCLGITNSIFIPEMDWDECDACFRAWSTRYPS 422
                  S F+  + W      ++ WS  + S
Sbjct: 293 -------SSFLALLIWIIDVKGYKKWSLFFES 317


>gi|456734835|gb|EMF59605.1| N-acetylglucosamine transporter NagX [Stenotrophomonas maltophilia
           EPM1]
          Length = 355

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 84/158 (53%), Gaps = 21/158 (13%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ ++DA RG+TV  M+LV++ G     +A + HS W+GCT  D V PFFLF+VGV++A
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 66

Query: 118 LALKKVPKINGAVKK------IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
            ++   P+   A  +      ++ R L++L  G +L          + + +D  H R  G
Sbjct: 67  FSVA--PRAQDAAARPALARGVLERALRILVAGALLH-------LLIWWALDTHHFRIWG 117

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAY 209
           +LQRIA   V  AL+  L    RP V     +++   Y
Sbjct: 118 VLQRIA---VCAALVGVLAVYARPRVQVGALIALLVGY 152


>gi|406662851|ref|ZP_11070935.1| hypothetical protein B879_02963 [Cecembia lonarensis LW9]
 gi|405553158|gb|EKB48438.1| hypothetical protein B879_02963 [Cecembia lonarensis LW9]
          Length = 382

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 81/148 (54%), Gaps = 7/148 (4%)

Query: 53  QQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFL 109
           Q L     +R   LD  RGLT+ LM++V+  G     YA   H+ W+G T+ D + P FL
Sbjct: 5   QNLGVPFKERYLALDVLRGLTIALMVVVNTPGSWSHMYAPFMHADWHGFTITDLIFPTFL 64

Query: 110 FIVGVAIALALKKVPKINGAV--KKIIFRTLKLLFWGIILQG--GYSHAPDALSYGVDMK 165
           F+VG A++ ++KK+  +   V  KK+  RTL +   G +L      ++ P++    ++  
Sbjct: 65  FVVGNAMSFSMKKLESMGQQVFLKKVFKRTLLIFLIGWLLNAFPFVNYNPESGYSMINWS 124

Query: 166 HIRWCGILQRIALVYVVVALIETLTTKR 193
            +R  G+LQRIAL Y++ ALI     KR
Sbjct: 125 EVRLLGVLQRIALCYMLAALILYYLGKR 152



 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           PF+PEG+LST  ++++   G   G  +     ++  +K  V  G  LL+  +I  +  A 
Sbjct: 206 PFDPEGILSTFPSVVNVIAGFLVGKFIQDSGNNTGTVKKMVIWGIILLVACLI--WDMAF 263

Query: 371 PINKQLYSFSYVCFT 385
           PINK++++  YV  T
Sbjct: 264 PINKKIWTSPYVLLT 278


>gi|322436289|ref|YP_004218501.1| hypothetical protein AciX9_2697 [Granulicella tundricola MP5ACTX9]
 gi|321164016|gb|ADW69721.1| hypothetical protein AciX9_2697 [Granulicella tundricola MP5ACTX9]
          Length = 389

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 89/341 (26%), Positives = 141/341 (41%), Gaps = 84/341 (24%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q+  R+ ++D  RGLT+  MILV+DAG    AYA + H+ WNG T  D V P FLF+VG+
Sbjct: 10  QRPSRLLSIDLLRGLTIGFMILVNDAGSERDAYAPLQHAWWNGFTPTDLVFPTFLFLVGI 69

Query: 115 AIALAL-----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHI-- 167
              L+L     + VP++              LFW ++ +    +    L+      H+  
Sbjct: 70  TTVLSLGSRMDRNVPRMT-------------LFWSVLRRAVLIYVVGILASTFPFTHLAG 116

Query: 168 -RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIIT 226
            R+ G+L RIAL Y++V  +  ++                      W    +     +I 
Sbjct: 117 MRFVGVLPRIALCYLIVGSLLLISK--------------------SWKDKVVILAACLIG 156

Query: 227 TYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
            ++L      F     +GV  + V       L    N   ++DR ++   HLY     +R
Sbjct: 157 YWALL----RFVPVPGYGVPTHDVPL-----LDRDGNLAAWLDRWMFAPQHLYER---TR 204

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR 346
                                     +PEGLLSTI A+ +  +G+     L     H+ R
Sbjct: 205 --------------------------DPEGLLSTIPAVGTALLGLL--TGLFLRSQHALR 236

Query: 347 LKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            K     G   + I + L +   +PINK++++ SYV F  G
Sbjct: 237 TKIMGIAGAATVSILLGLLWNITLPINKKMWTSSYVLFAGG 277


>gi|190575857|ref|YP_001973702.1| transmembrane protein [Stenotrophomonas maltophilia K279a]
 gi|190013779|emb|CAQ47415.1| putative transmembrane protein [Stenotrophomonas maltophilia K279a]
          Length = 355

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 52/146 (35%), Positives = 80/146 (54%), Gaps = 21/146 (14%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ ++DA RG+TV  M+LV++ G     +A + HS W+GCT  D V PFFLF+VGV++A
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 66

Query: 118 LALKKVPKINGAVKK------IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
            ++   P+   A  +      ++ R L++L  G +L          + + +D  H R  G
Sbjct: 67  FSVA--PRAQDAAARPALARGVLERALRILVAGALLH-------LLIWWALDTHHFRIWG 117

Query: 172 ILQRIALVYVVVALIETLTTKRRPNV 197
           +LQRIA   V  AL+  L    RP V
Sbjct: 118 VLQRIA---VCAALVGVLAVYARPRV 140


>gi|374311063|ref|YP_005057493.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358753073|gb|AEU36463.1| protein of unknown function DUF1624 [Granulicella mallensis
           MP5ACTX8]
          Length = 435

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 99/377 (26%), Positives = 166/377 (44%), Gaps = 96/377 (25%)

Query: 45  EQKGELQLQQLLQQ----------KSKRVATLDAFRGLTVVLMILVDDAGG---AYARID 91
           E  G++ + Q + Q          K  RV ++D  RG+T+ LMILV+D G     + ++D
Sbjct: 22  EMIGQMAITQTVSQTVSQTERTVSKPGRVLSVDVLRGITIALMILVNDPGDWDHIFGQLD 81

Query: 92  HSPWNGCTLADFVMPFFLFIVGVAIALALK-KVPKIN--GAVKKIIF-RTLKL--LFWGI 145
           H+ WNG TL D V P FLF++G +I  +L+ ++ + N  G +   IF R  K+  L+W  
Sbjct: 82  HAAWNGWTLTDMVFPAFLFLMGASIIFSLQARIARGNCKGTLAGHIFARAGKILALYW-- 139

Query: 146 ILQGGYSHAPDALSYGVDMK-HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLS 204
                       L++   M   IRW G+L RIAL Y++ +L+  L   RR  VL      
Sbjct: 140 -----------VLAFFPRMHWTIRWFGVLPRIALCYLLASLV--LLATRRVRVL------ 180

Query: 205 IFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNA 264
                        IA V +++  Y + +              +++   G    LG     
Sbjct: 181 -------------IAIVAFLLVGYWVLL--------------RWVPVPG----LGTPMRD 209

Query: 265 VGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAI 324
           + ++D+     N   +  +   + + +L   ++G L        R   +PEGLLST+ A+
Sbjct: 210 IPFMDQ-----NANLASWIDRGVSSWSLRWLHTGTL-------YRKTRDPEGLLSTLPAV 257

Query: 325 LSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILH-----FTNAIPINKQLYSF 379
            +  +G   G  +I+ +     ++    M  GL  + +        ++   PINK L++ 
Sbjct: 258 ATTLLGALAGMWMINGQRVVNGMRR---MRIGLAAMGVAGVAAGVLWSRWFPINKNLWTS 314

Query: 380 SYVCFTAG----AADGC 392
           S+V   AG    A  GC
Sbjct: 315 SFVLLMAGWTALALAGC 331


>gi|146302702|ref|YP_001197293.1| hypothetical protein Fjoh_4975 [Flavobacterium johnsoniae UW101]
 gi|146157120|gb|ABQ07974.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
          Length = 423

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 60/181 (33%), Positives = 92/181 (50%), Gaps = 48/181 (26%)

Query: 61  KRVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ +LD FRGLT++LM +V+   D G  Y  + H+ W+GCT  D V PFF+FI+GVA+ 
Sbjct: 4   ERLISLDVFRGLTILLMTIVNNPGDWGNVYPPLLHAEWHGCTPTDLVFPFFIFIMGVAVP 63

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGII-----------LQG--------------GY- 151
           LA+      +    KI+ R+L++L  GI            L+G              GY 
Sbjct: 64  LAMPDKFYDSTTFNKILVRSLRMLCLGIFFNFFGKIQLFGLEGIPLLIGRLAITIAVGYA 123

Query: 152 ------SHAPDALSY------------GVDMKH-IRWCGILQRIALVYVVVALIETLTTK 192
                 S   + L++            G++  H +R  G+LQRIA+VY VV+L+   T++
Sbjct: 124 LMGSFSSKVKNILAFSILFIYLFLAYSGIEAYHDVRLPGVLQRIAIVYFVVSLLYLKTSQ 183

Query: 193 R 193
           R
Sbjct: 184 R 184



 Score = 46.6 bits (109), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 48/84 (57%), Gaps = 4/84 (4%)

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF-GLLIIAIILHFTNAIP 371
           +PEG+LST+ +I++G IG+  G VL   +  + ++     MG  G ++I   L +    P
Sbjct: 241 DPEGILSTLPSIVNGIIGLLIGQVL---QRDTTKILKAQKMGIAGTILIFFGLMWDLVFP 297

Query: 372 INKQLYSFSYVCFTAGAADGCLGI 395
           INK L++ SYV +T G A   L I
Sbjct: 298 INKSLWTSSYVLYTTGLATVFLTI 321


>gi|329964617|ref|ZP_08301671.1| hypothetical protein HMPREF9446_03278 [Bacteroides fluxus YIT
           12057]
 gi|328525017|gb|EGF52069.1| hypothetical protein HMPREF9446_03278 [Bacteroides fluxus YIT
           12057]
          Length = 396

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 99/392 (25%), Positives = 158/392 (40%), Gaps = 104/392 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           Q+ SKR+  LD  RG+T+  MI+V++ G     YA + H+ W G T  D V PFF+FI+G
Sbjct: 4   QKTSKRILALDILRGITIAGMIMVNNPGSWAHIYAPLAHAQWIGLTPTDLVFPFFMFIMG 63

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQ------GGYSHAPDALSYGVDM- 164
           ++  ++LKK      + A  KI+ RT+ +   G+ +         +S APD L +G ++ 
Sbjct: 64  ISTYISLKKYNFEFSHAAALKILKRTVIIFLIGMAIGWFSRFCYYWSSAPDNLGFGENLW 123

Query: 165 ------KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFI 218
                   +R  G++QR+AL Y   ++I  LT K +                        
Sbjct: 124 ASVWTFDRMRILGVMQRLALCYGATSIIA-LTMKHKN----------------------- 159

Query: 219 AFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHL 278
             + Y+I T                 +  +I+  G  G      N +  VDR +    H+
Sbjct: 160 --IPYLIATLL---------------IGYFILLLGGNGFAYNETNILSIVDRAVLTPAHM 202

Query: 279 YSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI 338
           Y D                               +PEG+LSTI AI    +G   G +++
Sbjct: 203 YKD----------------------------NGIDPEGILSTIPAIAHVLLGFCVGRMML 234

Query: 339 HFKGHSA---RLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAAD 390
                SA   R     S    L +   IL F   +     PI+K+++S ++V  T G A 
Sbjct: 235 G-DSQSAKGDRTNVLDSHLIKLFLAGTILTFAGFLLSYGCPISKKIWSPTFVLVTCGLA- 292

Query: 391 GCLGITNSIFIPEMDWDECDACFRAWSTRYPS 422
                  S F+  + W      ++ WS  + S
Sbjct: 293 -------SSFLALLIWIIDVKGYKRWSMFFES 317


>gi|295132874|ref|YP_003583550.1| hypothetical protein ZPR_1009 [Zunongwangia profunda SM-A87]
 gi|294980889|gb|ADF51354.1| membrane protein [Zunongwangia profunda SM-A87]
          Length = 376

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 97/345 (28%), Positives = 145/345 (42%), Gaps = 90/345 (26%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG-GA--YARIDHSPWNGCTLADFVMPFFLFIVGV 114
           + S+R  +LD FRGLT+ LMILV+  G GA  Y  + H+ W G TLAD V P FLF VG 
Sbjct: 4   KTSERFLSLDVFRGLTIALMILVNTPGTGADLYPYLVHAQWFGFTLADLVFPSFLFAVGN 63

Query: 115 AIALALKKVPKINGAV--KKIIFRTLKLLFWGIIL---------QGGYSHAPDALSYGVD 163
           A++ +++K  +   A   KK++ RT  +   G ++         +G    +P        
Sbjct: 64  AMSFSMRKFQEAAPADFWKKVLKRTAIIFLLGFLMYWFPFFRMNEGHLELSP-------- 115

Query: 164 MKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIY 223
               R  G+LQRIAL Y   A++              R+ S+ T        GFI   I 
Sbjct: 116 FSETRIMGVLQRIALCYFFGAVLV-------------RYFSVKTI-------GFICAAIL 155

Query: 224 IITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGH-LGPACNAVGYVDRELWGINHLYSDP 282
           +     LY                     G  GH L  A NA    D  + G  H+Y   
Sbjct: 156 LAYWGILY-------------------GFGEPGHELEMATNAAAKFDYAILGEGHIY--- 193

Query: 283 VWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKG 342
                             ++DA      PF+PEG+LST+ +I++   G +   V I  KG
Sbjct: 194 ------------------KKDA-----IPFDPEGILSTLPSIVNVLAG-YLAGVFIRRKG 229

Query: 343 HSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            S      + +  G L+ A+   +    P++K+L++  +   T G
Sbjct: 230 KSYETIAKLMLA-GFLVFALAEWWALIFPLSKKLWTSPFAMLTIG 273


>gi|345880604|ref|ZP_08832150.1| hypothetical protein HMPREF9431_00814 [Prevotella oulorum F0390]
 gi|343922516|gb|EGV33216.1| hypothetical protein HMPREF9431_00814 [Prevotella oulorum F0390]
          Length = 383

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 90/363 (24%), Positives = 147/363 (40%), Gaps = 97/363 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            QK+ R+  +D  RG+T+  MILV++ GG   Y  ++H+ W G T  D V PFF+FI+G+
Sbjct: 3   TQKTSRIEAVDILRGITIAGMILVNNPGGQPVYTPLEHAEWFGLTPTDLVFPFFMFIMGI 62

Query: 115 AIALALKKV------PKINGAVKKIIFRTLKLLFWGIIL-------QGGYSHAPDALSYG 161
              L+L+K       P      KKII R + L   GI +       +G ++    AL + 
Sbjct: 63  TTYLSLRKYDFEWSWP----CAKKIIKRGMLLYVIGIAISWLMMFCRGLFNEDYAALPFF 118

Query: 162 VDM-------KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWI 214
             +        HIR  G+  R+A  YV  +++  L+ K R                    
Sbjct: 119 SHVFAAANVFDHIRLVGVFPRLAFCYVFASVVA-LSVKHR-------------------- 157

Query: 215 GGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWG 274
             FI ++I  +                   +  + V C   G    A N    +D  + G
Sbjct: 158 --FIPWLIAAVF------------------IGYFAVLCLGNGFAHDASNICNVIDEAILG 197

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
             HLY                     + D P       +PEGLLS++ A+    IG   G
Sbjct: 198 RQHLY---------------------KWDIP-------DPEGLLSSLPALGHVLIGFCVG 229

Query: 335 HVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
            V++     + +++     G  L I+  +L +    PI+K+L++ ++   T G A   L 
Sbjct: 230 RVVMSATSLNDKIEKLFIYGAVLTILGFLLSY--GCPISKKLWTPTFALVTCGLASTTLA 287

Query: 395 ITN 397
           + +
Sbjct: 288 LLS 290


>gi|346224087|ref|ZP_08845229.1| hypothetical protein AtheD1_02868 [Anaerophaga thermohalophila DSM
           12881]
          Length = 369

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 53/143 (37%), Positives = 77/143 (53%), Gaps = 13/143 (9%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            +KS+R   LD  RG+T+ LMI V+  G     YA + HS W+GCT  D V PFFLF+VG
Sbjct: 1   MKKSERYLALDVLRGMTIALMITVNTPGSWQYVYAPLRHSSWHGCTPTDLVFPFFLFVVG 60

Query: 114 VAIALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
           V++  +  K        +  ++  RTL +   G+ L    +  P  ++   D   +R  G
Sbjct: 61  VSMFFSFAKYGNTLNKASFNRLGRRTLLIFAIGLFL----NSFPQWMT---DYSSLRIMG 113

Query: 172 ILQRIALVYVVVALIETLTTKRR 194
           +LQRIAL Y   +LI  L+ KR+
Sbjct: 114 VLQRIALAYGFASLI-VLSMKRK 135



 Score = 45.4 bits (106), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 48/79 (60%), Gaps = 3/79 (3%)

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           PF+PEGLLSTI A+++  +G   G V I      A+L   +++ +G+L+  I   +    
Sbjct: 188 PFDPEGLLSTIPAVVTVLLGYLTG-VFIS-NTEKAKLPARLAL-YGVLVTIIGRLWGFVF 244

Query: 371 PINKQLYSFSYVCFTAGAA 389
           PINK L++ SYV +TAG A
Sbjct: 245 PINKPLWTSSYVLYTAGLA 263


>gi|289667572|ref|ZP_06488647.1| hypothetical protein XcampmN_03447, partial [Xanthomonas campestris
           pv. musacearum NCPPB 4381]
          Length = 298

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 79/284 (27%), Positives = 117/284 (41%), Gaps = 70/284 (24%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
           K +R  +LD FRGLT+ LMILV+ AG    AYA++ H+ W G TLAD V P FLF VG A
Sbjct: 22  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 81

Query: 116 IALALKKVPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGIL 173
           ++ AL         + ++  R + ++  G+++     +   PD       +  +R  G+L
Sbjct: 82  MSFALATNTPPLQFLGRVSKRAVLIVLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 141

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRI L Y+  AL   L     P  + P  +++   Y W  +             Y    P
Sbjct: 142 QRIGLCYLAAAL---LVRYLPPRGIAPACVALLLGY-WALL-------------YVFGQP 184

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
               S+  + G +                     +D  L+G  HLY              
Sbjct: 185 GVELSKTGNAGTR---------------------LDLWLYGRAHLY-------------- 209

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
                  R+D        F+PEGLL T+SA ++   G   G  L
Sbjct: 210 -------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 240


>gi|21241481|ref|NP_641063.1| hypothetical protein XAC0710 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21106824|gb|AAM35599.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 388

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 80/285 (28%), Positives = 117/285 (41%), Gaps = 70/285 (24%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            K +R  +LD FRGLT+ LMILV+ AG    AYA++ H+ W G TLAD V P FLF VG 
Sbjct: 17  SKRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGS 76

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGI 172
           A++ AL         + ++  R   +L  G+++     +   PD       +  +R   +
Sbjct: 77  AMSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTCV 136

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQRI L Y+  AL   L     P  + P  L++   Y W  +             Y+   
Sbjct: 137 LQRIGLCYLAAAL---LVRYLPPRGIAPVCLALLLGY-WALL-------------YAFGQ 179

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
           P    S+  + G +                     +D  L+G +HLY             
Sbjct: 180 PGAELSKTGNAGTR---------------------LDLWLYGRDHLY------------- 205

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
                   R+D        F+PEGLL T+SA ++   G   G  L
Sbjct: 206 --------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236


>gi|410631394|ref|ZP_11342069.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
           BSs20135]
 gi|410148840|dbj|GAC18936.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
           BSs20135]
          Length = 366

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 85/350 (24%), Positives = 145/350 (41%), Gaps = 84/350 (24%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVG 113
           Q    R+  +D  RGL + LM+LV++ G     YA   H+ W+G T  D V PFFLF++G
Sbjct: 3   QSIQTRIEAIDVLRGLALALMLLVNNPGSWSAVYAPFLHADWHGLTPTDLVFPFFLFVMG 62

Query: 114 VAIALALKKVPKINGAVKKIIF-RTLKLLFWGIILQ-GGYSHAPDALSYGVDMKHIRWCG 171
            ++A +L+   + +G     IF R+  L+F G +LQ   +  APD           R  G
Sbjct: 63  ASMACSLRGQIQASGLPWLSIFKRSFLLVFIGFLLQIIPFDQAPDTW---------RIMG 113

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           +LQRI L +++VA +  +  +R                 W  +   +  ++Y +   S  
Sbjct: 114 VLQRIGLCFLLVASMLAIIKER-----------------WLLLSAVVTLIVYWLLLLS-- 154

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
                 +  + + ++                N+V + D  + G  H++            
Sbjct: 155 ------AGQAPYSLEN---------------NSVRHFDMAILGSAHMWQGK--------- 184

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV 351
                              PF+PEGLLSTI A ++   G      ++  K    ++   +
Sbjct: 185 -----------------GLPFDPEGLLSTIGAAMTVLSGYLICVNVLQQKNQKQQILQLM 227

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFI 401
            +G  LL +  +    +  PINK L++ SYV  ++  A  CL +   I +
Sbjct: 228 IVGAILLALGFVWSVWH--PINKSLWTSSYVLVSSAFA--CLSLAVIILL 273


>gi|295690502|ref|YP_003594195.1| hypothetical protein Cseg_3137 [Caulobacter segnis ATCC 21756]
 gi|295432405|gb|ADG11577.1| Protein of unknown function DUF2261, transmembrane [Caulobacter
           segnis ATCC 21756]
          Length = 372

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 90/341 (26%), Positives = 140/341 (41%), Gaps = 81/341 (23%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIV 112
           + + + R  +LD FRGLTV LMI+V+ AG    AY ++ H+PW G T AD V P FLF V
Sbjct: 1   MSKPAARFLSLDVFRGLTVCLMIVVNTAGPGAKAYTQLVHAPWFGFTAADAVFPSFLFAV 60

Query: 113 GVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQ-GGYSHAPDALSYGVDMKHIRWCG 171
           G ++A A  +    N  + K++ R   +   G ++    +    D     +     R  G
Sbjct: 61  GCSMAFAFSRPIPTNEFLAKVLRRAALIFLLGFLMYWFPFVKKIDGHWALIPFADTRVMG 120

Query: 172 ILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLY 231
           +LQRIAL Y++ A             L PR +   +A         +  + Y     +L 
Sbjct: 121 VLQRIALCYMLAAFAV--------RWLSPRLIVALSA---------VLLLGYWAILMTLG 163

Query: 232 VPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
            P    S+  + G                      ++D  L G NHLY            
Sbjct: 164 DPAAPLSKLGNAGT---------------------HLDLFLIGQNHLY------------ 190

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHF-----KGHSAR 346
                    R+D        F+PEGLL T    L  T+ +  G++   F        SA 
Sbjct: 191 ---------RKD------GGFDPEGLLGT----LPSTVNVLAGYLAARFLKENPGSQSAM 231

Query: 347 LKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            +  ++   G+++I   L ++   PI K+L++ S+V  T G
Sbjct: 232 ARMAIA---GVVLILAGLAWSPLFPIAKKLWTGSFVLLTVG 269


>gi|399088486|ref|ZP_10753550.1| hypothetical protein PMI01_04686 [Caulobacter sp. AP07]
 gi|398030770|gb|EJL24173.1| hypothetical protein PMI01_04686 [Caulobacter sp. AP07]
          Length = 398

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 91/342 (26%), Positives = 143/342 (41%), Gaps = 65/342 (19%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+  LD  RGL V  MILV   G    AYA + H+ W G TLAD V P FLF VGVAI L
Sbjct: 10  RIVALDVLRGLAVAGMILVTSPGAWAHAYAPLKHAAWQGWTLADLVFPTFLFCVGVAIGL 69

Query: 119 ALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIAL 178
           ++ ++    GA   +  +  +     I+L    +  P       D+ H+R  G+LQRI L
Sbjct: 70  SVPRLRIGEGASAALWIKVARRTALLILLGLVLNALPR-----FDLAHLRIPGVLQRIGL 124

Query: 179 VYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFS 238
            Y + + I  L  +   +               Q        V+ ++     Y    +F+
Sbjct: 125 CYALASAICILPARAEAD--------------GQLRLNVGGVVLAVVGLLVGYWALLTFT 170

Query: 239 EHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSG 298
                GV ++  +  +     PA     ++DR ++ I HL+    W   E   ++     
Sbjct: 171 PVPGFGVDRWDSQGAL-----PA-----FIDRAVFTIPHLWP---WGTTEGVGVT----- 212

Query: 299 PLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS-------ARLKHWV 351
                        ++PEGLLST  A    T+ +  G V   F   +        R+   +
Sbjct: 213 -------------YDPEGLLSTFPA----TVNVLLGAVAAAFLARTGDGRQGRGRVLAAL 255

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCL 393
                 LI+A  L     +P+NK+L++ S+  F++GA+   L
Sbjct: 256 LALGAALIVA-GLALDPIVPVNKRLWTPSFALFSSGASLAAL 296


>gi|194367192|ref|YP_002029802.1| hypothetical protein Smal_3420 [Stenotrophomonas maltophilia
           R551-3]
 gi|194349996|gb|ACF53119.1| conserved hypothetical protein [Stenotrophomonas maltophilia
           R551-3]
          Length = 355

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/144 (35%), Positives = 78/144 (54%), Gaps = 17/144 (11%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ ++DA RG+TV  M+LV++ G     +A + HS W+GCT  D V PFFLF+VGV++A
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 66

Query: 118 LALKK----VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
            ++      V       + ++ R L++L  G +L          + + +D  H R  G+L
Sbjct: 67  FSVAPRALDVALRPALARGVLERALRILVAGALLH-------LLIWWALDTHHFRIWGVL 119

Query: 174 QRIALVYVVVALIETLTTKRRPNV 197
           QRIA   V  AL+  L    RP V
Sbjct: 120 QRIA---VCAALVGVLAVYARPRV 140


>gi|294463099|gb|ADE77087.1| unknown [Picea sitchensis]
          Length = 218

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 56/78 (71%), Gaps = 1/78 (1%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 371
           F+PEGLLS+I A+++  IG+H+GH+L+HFKGHS R+   +    GL+   I L     +P
Sbjct: 44  FDPEGLLSSIMAVVTCFIGLHFGHILVHFKGHSERVLQCIIPSLGLIFFGIALGIF-GMP 102

Query: 372 INKQLYSFSYVCFTAGAA 389
           +NK LY+F+Y+C T+GAA
Sbjct: 103 LNKPLYTFNYMCVTSGAA 120


>gi|313147781|ref|ZP_07809974.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|423280992|ref|ZP_17259903.1| hypothetical protein HMPREF1203_04120 [Bacteroides fragilis HMW
           610]
 gi|313136548|gb|EFR53908.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|404583442|gb|EKA88121.1| hypothetical protein HMPREF1203_04120 [Bacteroides fragilis HMW
           610]
          Length = 387

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 86/359 (23%), Positives = 147/359 (40%), Gaps = 92/359 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           Q  +KR+  LD  RG+T+  MI+V++ G     YA + H+ W G T  D V PFF+FI+G
Sbjct: 3   QTANKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMG 62

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMK---- 165
           ++  ++L+K      + A  KI+ RT+ +   G+ +     + H  ++LS G D+     
Sbjct: 63  ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCHTWNSLS-GEDIPFFSR 121

Query: 166 ---------HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                    HIR  G++QR+AL Y   A+I  +   +                       
Sbjct: 122 LGESVWTFGHIRILGVMQRLALCYGATAIIALIMKHK----------------------- 158

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
              ++ Y+I                   +  +I+     G    + N +  VDR + G  
Sbjct: 159 ---YIPYLIAALL---------------IGYFIILITGNGFEYNSTNILAVVDRAVLGEA 200

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEG+LSTI +I    IG   G +
Sbjct: 201 HMYKD----------------------------NGIDPEGVLSTIPSIAHVLIGFCVGKL 232

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           L+  K  + +++    +G  L     +L +    PI+K+++S ++   T G A   L +
Sbjct: 233 LMEVKDINEKIERLFLVGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLAL 289


>gi|424666001|ref|ZP_18103037.1| hypothetical protein HMPREF1205_01876 [Bacteroides fragilis HMW
           616]
 gi|404574254|gb|EKA79005.1| hypothetical protein HMPREF1205_01876 [Bacteroides fragilis HMW
           616]
          Length = 387

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 86/359 (23%), Positives = 147/359 (40%), Gaps = 92/359 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           Q  +KR+  LD  RG+T+  MI+V++ G     YA + H+ W G T  D V PFF+FI+G
Sbjct: 3   QTANKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMG 62

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMK---- 165
           ++  ++L+K      + A  KI+ RT+ +   G+ +     + H  ++LS G D+     
Sbjct: 63  ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCHTWNSLS-GEDIPFFSR 121

Query: 166 ---------HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                    HIR  G++QR+AL Y   A+I  +   +                       
Sbjct: 122 LGESVWTFGHIRILGVMQRLALCYGATAIIALIMKHK----------------------- 158

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
              ++ Y+I                   +  +I+     G    + N +  VDR + G  
Sbjct: 159 ---YIPYLIAALL---------------IGYFIILITGNGFEYNSTNILAVVDRAVLGEA 200

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEG+LSTI +I    IG   G +
Sbjct: 201 HMYKD----------------------------NGIDPEGVLSTIPSIAHVLIGFCVGKL 232

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           L+  K  + +++    +G  L     +L +    PI+K+++S ++   T G A   L +
Sbjct: 233 LMEVKDINEKIERLFLVGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLAL 289


>gi|289663929|ref|ZP_06485510.1| hypothetical protein XcampvN_12875 [Xanthomonas campestris pv.
           vasculorum NCPPB 702]
          Length = 392

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 79/284 (27%), Positives = 119/284 (41%), Gaps = 70/284 (24%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
           K +R  +LD FRGLT+ LMILV+ AG    AYA++ H+ W G TLAD V P FLF VG A
Sbjct: 22  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 81

Query: 116 IALALKKVPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGIL 173
           ++ AL         + ++  R + ++  G+++     +   PD       +  +R  G+L
Sbjct: 82  MSFALATNTPPLQFLGRVSKRAVLIVLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 141

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRI L Y+  AL   L     P  + P  +++   Y W         ++Y+        P
Sbjct: 142 QRIGLCYLAAAL---LVRYLPPRGIAPACVALLLGY-WA--------LLYVFGQ-----P 184

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
               S+  + G +                     +D  L+G  HLY              
Sbjct: 185 GVELSKTGNAGTR---------------------LDLWLYGRAHLY-------------- 209

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
                  R+D        F+PEGLL T+SA ++   G   G  L
Sbjct: 210 -------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 240


>gi|388258355|ref|ZP_10135531.1| hypothetical protein O59_002752 [Cellvibrio sp. BR]
 gi|387937867|gb|EIK44422.1| hypothetical protein O59_002752 [Cellvibrio sp. BR]
          Length = 362

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 88/341 (25%), Positives = 137/341 (40%), Gaps = 96/341 (28%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R   LD  RGLT+ LMILV+  G     Y  + H+ W+G T  DFV PFF+FIVG ++ 
Sbjct: 4   QRFQALDVMRGLTLALMILVNTPGSWSFVYGPLLHADWHGATATDFVFPFFMFIVGSSMY 63

Query: 118 LALKKVPKINGAVK--KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
            A++ + ++  A +  KI+ R + L   G++L         A  +  ++++ R  G+LQR
Sbjct: 64  FAMRGLRQLAPAAQAQKILRRVVLLFVIGVLL--------SAYPFTNNIENWRVMGVLQR 115

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW 235
           IA+ Y                                   GF AF+I             
Sbjct: 116 IAIAY-----------------------------------GFAAFIIL------------ 128

Query: 236 SFSEHSDHGVKKYIVKCGMRGHLGPAC-------NAVGYVDRELWGINHLYSDPVWSRLE 288
            +   +   V   I+  G  G L  A        N V   D  + G NHL+         
Sbjct: 129 -YFGFTGRVVMSAILLLGYWGLLNIAADPYSLEHNLVRQFDLAVLGANHLWQGK------ 181

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                                  F+PEG+LST+ +I++  IG     VL+  +  +  L 
Sbjct: 182 --------------------GLAFDPEGILSTVPSIVNVIIGFEATRVLLASEDKAKALS 221

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
                   L+ +A++ +     PINK L++ S+V  T G A
Sbjct: 222 QLFVAALLLIGLALVWNLL--FPINKSLWTSSFVVLTCGVA 260


>gi|192359631|ref|YP_001981658.1| hypothetical protein CJA_1162 [Cellvibrio japonicus Ueda107]
 gi|190685796|gb|ACE83474.1| putative membrane protein [Cellvibrio japonicus Ueda107]
          Length = 399

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 52/131 (39%), Positives = 74/131 (56%), Gaps = 13/131 (9%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R   LD  RGLT+ LMILV+  G     Y  + H+ W+G T  DFV PFFLFIVG A+ 
Sbjct: 37  QRFLALDVMRGLTLALMILVNTPGSWSHVYGPLLHADWHGVTPTDFVFPFFLFIVGSAMY 96

Query: 118 LALKKVPKINGA--VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
            +++ + +++ +  ++K+  R L L   GI+L         A  +  D+   R  G+LQR
Sbjct: 97  FSVRGLAQLSLSQQLRKVGRRVLLLFVMGILLA--------AYPFTADVHDWRIMGVLQR 148

Query: 176 IALVYVVVALI 186
           IAL Y V ALI
Sbjct: 149 IALAYGVAALI 159


>gi|167764222|ref|ZP_02436349.1| hypothetical protein BACSTE_02607 [Bacteroides stercoris ATCC
           43183]
 gi|167698338|gb|EDS14917.1| hypothetical protein BACSTE_02607 [Bacteroides stercoris ATCC
           43183]
          Length = 396

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 102/392 (26%), Positives = 157/392 (40%), Gaps = 107/392 (27%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           + +KR+  LD  RG+T+  MI+V++ G     YA + H+ WNG T  D V PFF+FI+G+
Sbjct: 6   KTNKRILALDILRGVTIAGMIMVNNPGTWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGI 65

Query: 115 AIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSH-------APDALSYGVDM- 164
           +  ++LKK        A  KI+ RT+ +   G+ + G +S          + + +G  + 
Sbjct: 66  STYISLKKYNFEFSRAAGMKILKRTILIFLIGMGI-GWFSRFCYYWTSPTEGIGFGAQLW 124

Query: 165 ------KHIRWCGILQRIALVYVVVALIETLTTKRR--PNVLEPRHLSIFTAYQWQWIGG 216
                   IR  G++QR+AL Y   A+I  LT K R  P ++     ++ T Y       
Sbjct: 125 EAAWTFDRIRILGVMQRLALCYGATAIIA-LTMKHRNIPYLIA----TLLTGY------- 172

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
           FI  V                              CG  G      N +  VDR +    
Sbjct: 173 FILLV------------------------------CG-NGFAYNDTNILSVVDRAILTPA 201

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEGLLSTI AI    +G   G +
Sbjct: 202 HMYKD----------------------------NGIDPEGLLSTIPAIAHVLLGFCVGRM 233

Query: 337 LIH-FKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAGAAD 390
           ++   K   +R     S    L +   IL F   +     PINK+++S ++V  T G A 
Sbjct: 234 MLEGGKADESRESMLNSHLIKLFLAGTILTFAGFLLSYGCPINKKIWSPTFVLATCGLA- 292

Query: 391 GCLGITNSIFIPEMDWDECDACFRAWSTRYPS 422
                  S F+  + W      ++ WS  + S
Sbjct: 293 -------SSFLALLIWIIDVKGYKKWSLFFES 317


>gi|428210738|ref|YP_007083882.1| hypothetical protein Oscil6304_0209 [Oscillatoria acuminata PCC
           6304]
 gi|427999119|gb|AFY79962.1| hypothetical protein Oscil6304_0209 [Oscillatoria acuminata PCC
           6304]
          Length = 398

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 95/352 (26%), Positives = 141/352 (40%), Gaps = 85/352 (24%)

Query: 51  QLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPF 107
            +Q LL   S R+ +LD FRG+ +  MILV++ G     Y  + H+PW+G T  D + P 
Sbjct: 11  SVQNLL--NSMRLTSLDVFRGMAIASMILVNNPGSWQQVYPPLLHAPWHGFTPTDLIFPA 68

Query: 108 FLFIVGVAIALALKKV------PKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS-Y 160
           FLFI GVA+A +  K       P       KI+ R L L   G+ L G        L   
Sbjct: 69  FLFISGVAMAFSFAKYTNSPNSPPAASVYFKILRRALILFGLGLFLNGSTLVLKTLLQGQ 128

Query: 161 GVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAF 220
            +D   +R  G+LQRI+L Y+  A      ++RR                     GF+  
Sbjct: 129 PLDFGTLRIMGVLQRISLAYLFGATAILNLSRRR--------------------LGFLCL 168

Query: 221 VI---YIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
            I   Y      + VP +   + S  G                A   V Y+DR +    H
Sbjct: 169 AILLGYWFALTQIPVPGYGPGDLSAKG----------------AGTLVAYLDRLILTPPH 212

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +  D                              FEPEGLLST+ ++++  +G   G  L
Sbjct: 213 ILGD----------------------------GSFEPEGLLSTLPSVVTLLLGFFIGDWL 244

Query: 338 IHFKGHSARLKHWVSMGFGLLIIAIILH--FTNAIPINKQLYSFSYVCFTAG 387
              +      +  + M  G+ ++ I+    +    PINKQL++ SYV  +AG
Sbjct: 245 ---QKQPVTSRTSLQMA-GVAVVTIVTGSLWGLVFPINKQLWTSSYVVLSAG 292


>gi|404450663|ref|ZP_11015643.1| hypothetical protein A33Q_15100 [Indibacter alkaliphilus LW1]
 gi|403763718|gb|EJZ24662.1| hypothetical protein A33Q_15100 [Indibacter alkaliphilus LW1]
          Length = 381

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 77/133 (57%), Gaps = 8/133 (6%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R   LD  RGLT+ LMI+V+  G     Y    H+PW+G T+ D V P FLF+VG A++
Sbjct: 13  ERYLALDVLRGLTIALMIVVNTPGSWSHMYGPFMHAPWHGFTITDLVFPTFLFVVGNAMS 72

Query: 118 LALKKVPKINGA--VKKIIFRTLKLLF--WGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
            ++KK+ K+     ++K++ R+  +    WG+     +    + L+  ++   +R  G+L
Sbjct: 73  FSMKKLEKMGQGLFLRKVLKRSFLIFIIGWGLNAFPFFDQTENGLAM-INWGEVRLLGVL 131

Query: 174 QRIALVYVVVALI 186
           QRIAL Y++ +L+
Sbjct: 132 QRIALCYLIASLV 144



 Score = 46.2 bits (108), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 43/72 (59%), Gaps = 2/72 (2%)

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           PF+PEGLLST  A+++   G   G  +     ++  +K+ +  G  L++IA+ L +    
Sbjct: 205 PFDPEGLLSTFPAVVNVIAGYFAGKFIQQMGNNTKTVKYLLVAG--LILIAVCLAWDPFF 262

Query: 371 PINKQLYSFSYV 382
           PINK+L++ SYV
Sbjct: 263 PINKKLWTSSYV 274


>gi|53711719|ref|YP_097711.1| hypothetical protein BF0428 [Bacteroides fragilis YCH46]
 gi|52214584|dbj|BAD47177.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
          Length = 387

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 87/359 (24%), Positives = 147/359 (40%), Gaps = 92/359 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           Q  +KR+  LD  RG+T+  MI+V++ G     YA + H+ W G T  D V PFF+FI+G
Sbjct: 3   QTVNKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMG 62

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMK---- 165
           ++  ++L+K      + A  KI+ RT+ +   G+ +     +    ++LS G D+     
Sbjct: 63  ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSR 121

Query: 166 ---------HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                    HIR  G++QR+AL Y   A+I  +   +                       
Sbjct: 122 LYESVWTFGHIRILGVMQRLALCYGATAIIALIMKHK----------------------- 158

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
           +I ++I I+                   +  +I+     G    + N +  VDR + G  
Sbjct: 159 YIPYLIAILL------------------IGYFIILINGNGFEYNSSNILSIVDRTVLGEA 200

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEGLLSTI +I    IG   G +
Sbjct: 201 HMYKD----------------------------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           L+  K    +++    +G  L     +L +    PI+K+++S ++   T G A   L +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIITCGLASSFLAL 289


>gi|256420508|ref|YP_003121161.1| hypothetical protein Cpin_1463 [Chitinophaga pinensis DSM 2588]
 gi|256035416|gb|ACU58960.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
          Length = 374

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/132 (38%), Positives = 74/132 (56%), Gaps = 7/132 (5%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG--GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           +R   LD FRGLTV  MI+V+  G   +Y  ++H+ WNGCT  D V P FLF VG A++ 
Sbjct: 7   QRFLPLDVFRGLTVCFMIIVNTPGWDTSYYILNHAQWNGCTPTDMVFPSFLFAVGNAMSF 66

Query: 119 ALKKVPKI-NGAV-KKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGILQ 174
           +++K  ++ N AV  KI  RTL +   G ++       H    L + + +   R  G+LQ
Sbjct: 67  SMRKFQQLENTAVLSKIFRRTLLIFLLGFLMYWLPFVRHTESGLEF-IPLSDTRILGVLQ 125

Query: 175 RIALVYVVVALI 186
           RIAL Y   +L+
Sbjct: 126 RIALCYCFASLL 137



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 52/77 (67%), Gaps = 4/77 (5%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR-LKHWVSMGFGLLIIAIILHFTNAI 370
           F+PEGLLST+ AI++   G +Y  + I  +G + + L+  + MG  L+++A++  ++ A 
Sbjct: 199 FDPEGLLSTLPAIVNVIAG-YYTGLFIQQEGKTGKGLRKLLQMGALLILVALV--WSMAF 255

Query: 371 PINKQLYSFSYVCFTAG 387
           PINK+L++ SYV +T G
Sbjct: 256 PINKKLWTSSYVLYTVG 272


>gi|60679957|ref|YP_210101.1| hypothetical protein BF0369 [Bacteroides fragilis NCTC 9343]
 gi|336407897|ref|ZP_08588393.1| hypothetical protein HMPREF1018_00408 [Bacteroides sp. 2_1_56FAA]
 gi|423248371|ref|ZP_17229387.1| hypothetical protein HMPREF1066_00397 [Bacteroides fragilis
           CL03T00C08]
 gi|423253319|ref|ZP_17234250.1| hypothetical protein HMPREF1067_00894 [Bacteroides fragilis
           CL03T12C07]
 gi|423269643|ref|ZP_17248615.1| hypothetical protein HMPREF1079_01697 [Bacteroides fragilis
           CL05T00C42]
 gi|423272798|ref|ZP_17251745.1| hypothetical protein HMPREF1080_00398 [Bacteroides fragilis
           CL05T12C13]
 gi|60491391|emb|CAH06139.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
 gi|335944976|gb|EGN06793.1| hypothetical protein HMPREF1018_00408 [Bacteroides sp. 2_1_56FAA]
 gi|392657219|gb|EIY50856.1| hypothetical protein HMPREF1067_00894 [Bacteroides fragilis
           CL03T12C07]
 gi|392659584|gb|EIY53202.1| hypothetical protein HMPREF1066_00397 [Bacteroides fragilis
           CL03T00C08]
 gi|392700489|gb|EIY93651.1| hypothetical protein HMPREF1079_01697 [Bacteroides fragilis
           CL05T00C42]
 gi|392708362|gb|EIZ01469.1| hypothetical protein HMPREF1080_00398 [Bacteroides fragilis
           CL05T12C13]
          Length = 387

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 87/359 (24%), Positives = 147/359 (40%), Gaps = 92/359 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           Q  +KR+  LD  RG+T+  MI+V++ G     YA + H+ W G T  D V PFF+FI+G
Sbjct: 3   QTVNKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMG 62

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMK---- 165
           ++  ++L+K      + A  KI+ RT+ +   G+ +     +    ++LS G D+     
Sbjct: 63  ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSR 121

Query: 166 ---------HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                    HIR  G++QR+AL Y   A+I  +   +                       
Sbjct: 122 LYESVWTFGHIRILGVMQRLALCYGATAIIALIMKHK----------------------- 158

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
           +I ++I I+                   +  +I+     G    + N +  VDR + G  
Sbjct: 159 YIPYLIAILL------------------IGYFIILINGNGFEYNSSNILSIVDRTVLGEA 200

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEGLLSTI +I    IG   G +
Sbjct: 201 HMYKD----------------------------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           L+  K    +++    +G  L     +L +    PI+K+++S ++   T G A   L +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLAL 289


>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 1105

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/131 (36%), Positives = 72/131 (54%), Gaps = 5/131 (3%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R  +LD FRGLT+ LMILV+  G    A+ ++ H+PW G T AD V P FLF VG A++
Sbjct: 737 ERFLSLDVFRGLTIFLMILVNTPGAGADAFVQLRHTPWFGFTAADLVFPSFLFAVGNAMS 796

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGILQR 175
            AL +   +   +++I  R+  +   G ++       H  D     + +   R  G+LQR
Sbjct: 797 FALDRGQPLGAFLRRIGKRSALIFLLGFLMYWFPFVHHGADGSWSFIAIDQTRVPGVLQR 856

Query: 176 IALVYVVVALI 186
           IAL Y + AL+
Sbjct: 857 IALCYALGALL 867


>gi|320105553|ref|YP_004181143.1| hypothetical protein AciPR4_0312 [Terriglobus saanensis SP1PR4]
 gi|319924074|gb|ADV81149.1| hypothetical protein AciPR4_0312 [Terriglobus saanensis SP1PR4]
          Length = 412

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 89/339 (26%), Positives = 140/339 (41%), Gaps = 77/339 (22%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDD--AGGAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
            K  R+ +LD  RG+T+  MILV++    GA+  + H+ WNG T  D V P FL +VG++
Sbjct: 26  HKPARLLSLDVLRGVTIGFMILVNNQTGEGAFFPLQHAKWNGFTPTDLVFPTFLLLVGLS 85

Query: 116 IALALKKVPKINGAVKKIIF-----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 170
             L+  +     G  K  IF     R+  L  +G+I+    ++AP        ++ +R  
Sbjct: 86  TVLS-TEARLARGVAKSTIFLHTLQRSAVLFLFGLIV----NNAP-----FFHLQTLRVY 135

Query: 171 GILQRIALVYVVVALIETLTT--KRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
           G+L RIA+ Y +V  +  L    K+R  +L     +    Y W  +              
Sbjct: 136 GVLPRIAVCYFIVGSLYLLVRDLKQRAFILAAAAAACLVGY-WALMR------------- 181

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
             ++P   F      G   + +         P  N V Y+DR ++  +HLY     +R  
Sbjct: 182 --FIPIPGF------GTPTHEIPIN-----DPDGNLVAYIDRHIFSASHLYEK---TR-- 223

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                                   +PEGLLSTI A+ +   GI  G  L   +    + K
Sbjct: 224 ------------------------DPEGLLSTIPAVATALFGILAGIWLRTSRSTMQKAK 259

Query: 349 HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
                G   LI+    H   A PINK+L++ S+  +  G
Sbjct: 260 GIEYAGISFLILGGAWHL--AFPINKKLWTSSFSLWAGG 296


>gi|84625357|ref|YP_452729.1| hypothetical protein XOO_3700 [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188575197|ref|YP_001912126.1| hypothetical protein PXO_04319 [Xanthomonas oryzae pv. oryzae
           PXO99A]
 gi|84369297|dbj|BAE70455.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188519649|gb|ACD57594.1| membrane protein, putative [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 388

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 59/160 (36%), Positives = 82/160 (51%), Gaps = 16/160 (10%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
           K +R  +LD FRGLT+ LMILV+ AG    AYA++ H+ W G TLAD V P FLF VG A
Sbjct: 18  KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77

Query: 116 IALAL-KKVPKIN--GAVKK---IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRW 169
           ++ AL   +P +   G V K   +I     L++W       +   PD       +  +R 
Sbjct: 78  MSFALATNMPHLQFLGRVSKRAALIALCGVLMYWFPF----FHLQPDGGWAFTTVDQVRL 133

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAY 209
            G+LQRI L Y+  AL   L     P  + P  +++   Y
Sbjct: 134 TGVLQRIGLCYLAAAL---LVRYLPPRGIAPACVALLLGY 170


>gi|265765098|ref|ZP_06093373.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
 gi|263254482|gb|EEZ25916.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
          Length = 387

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 87/359 (24%), Positives = 147/359 (40%), Gaps = 92/359 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           Q  +KR+  LD  RG+T+  MI+V++ G     YA + H+ W G T  D V PFF+FI+G
Sbjct: 3   QTVNKRLLALDVLRGITIAGMIMVNNPGSWSYIYAPLGHAAWIGLTPTDLVFPFFMFIMG 62

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMK---- 165
           ++  ++L+K      + A  KI+ RT+ +   G+ +     +    ++LS G D+     
Sbjct: 63  ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSR 121

Query: 166 ---------HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                    HIR  G++QR+AL Y   A+I  +   +                       
Sbjct: 122 LYESVWTFGHIRILGVMQRLALCYGATAIIALIMKHK----------------------- 158

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
           +I ++I I+                   +  +I+     G    + N +  VDR + G  
Sbjct: 159 YIPYLIAILL------------------IGYFIILINGNGFEYNSSNILSIVDRTVLGEA 200

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEGLLSTI +I    IG   G +
Sbjct: 201 HMYKD----------------------------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           L+  K    +++    +G  L     +L +    PI+K+++S ++   T G A   L +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLAL 289


>gi|390956359|ref|YP_006420116.1| hypothetical protein Terro_0436 [Terriglobus roseus DSM 18391]
 gi|390411277|gb|AFL86781.1| hypothetical protein Terro_0436 [Terriglobus roseus DSM 18391]
          Length = 404

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 85/334 (25%), Positives = 146/334 (43%), Gaps = 85/334 (25%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDD--AGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           K++R+ ++D  RGLT+  MILV++    GA+  + H+ WNG TL D V P FLF+VG+++
Sbjct: 27  KTERLLSVDVLRGLTIAFMILVNNQPGPGAFFELQHAQWNGFTLTDLVFPTFLFLVGLSL 86

Query: 117 ALALKKVPKINGAVKKIIF-----RTLKLLFWGIILQG-GYSHAPDALSYGVDMKHIRWC 170
            L+        GA +K +F     R+  L  +GI++    + H          +  IR+ 
Sbjct: 87  VLS-TAARLAKGASRKTLFLHTLRRSAVLALFGIVVNTFPFQH----------LDRIRFY 135

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQR AL Y+VV+ +  L    +         +I  A            V+Y +    +
Sbjct: 136 GVLQRTALCYLVVSGLCLLRKGWKDKA------AIAVA----------CLVVYWVLMRFV 179

Query: 231 YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEAC 290
            VP +    H +  +              P  N   ++DR ++   HLY           
Sbjct: 180 PVPGFGTPTH-EIPIND------------PNGNLTAWLDRLIFAPQHLYQQ--------- 217

Query: 291 TLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHW 350
                    +R           +PEGLLST+ AI +   G+  G  L   +  +A+    
Sbjct: 218 ---------VR-----------DPEGLLSTLPAISTALYGVLAGTWLRTTRSTTAK---- 253

Query: 351 VSMGFGLLIIAIILH---FTNAIPINKQLYSFSY 381
            ++G  L  +A+ +    ++   P+NK+L++ S+
Sbjct: 254 -AVGLALGGVAMTVAGWLWSYGFPLNKKLWTSSF 286


>gi|313203961|ref|YP_004042618.1| hypothetical protein Palpr_1487 [Paludibacter propionicigenes WB4]
 gi|312443277|gb|ADQ79633.1| hypothetical protein Palpr_1487 [Paludibacter propionicigenes WB4]
          Length = 382

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 99/353 (28%), Positives = 148/353 (41%), Gaps = 78/353 (22%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
            R   LD FRG+TV  MI+V+ +G     Y  + H+ WNG T  D V P FLF VG A+ 
Sbjct: 13  SRFTALDIFRGMTVCFMIIVNTSGNGATTYWPLMHADWNGFTPTDLVFPSFLFAVGNALG 72

Query: 118 LALKK--VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGV--DMKHIRWCGIL 173
            A+K+    K +  + KI  RT  +   G ++        +A S+ +   +   R  G+L
Sbjct: 73  FAMKRWDTMKQSDVLLKIFKRTALIFLIGYLMYWFPFFRLNAESHLILSPISQTRIMGVL 132

Query: 174 QRIALVYVVVA-LIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           QRIAL Y + A L+  L TKR                   W+G        +++  + +V
Sbjct: 133 QRIALCYGITALLVYYLGTKRT-----------------IWVG--------VVSLLAYWV 167

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
              +F E                       NAV  +D  L G +HLY             
Sbjct: 168 LLLAFGEAG--------------AEFSKTGNAVLRLDIWLLGTHHLYGGE---------- 203

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR-LKHWV 351
                             PF+PEG+LST+ A+ +   G   G  L   KG S   L   +
Sbjct: 204 ----------------GFPFDPEGVLSTLPALFNVIAGFAVGRYLQQQKGKSYESLAKLL 247

Query: 352 SMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEM 404
            +G GLL++A    + + +PINK+L++ SY   T G    CL ++  I+  + 
Sbjct: 248 LVGIGLLVLAYC--WNSWMPINKKLWTSSYAVLTVGL--DCLLLSVIIYFTDF 296


>gi|392390355|ref|YP_006426958.1| hypothetical protein Ornrh_0972 [Ornithobacterium rhinotracheale
           DSM 15997]
 gi|390521433|gb|AFL97164.1| hypothetical protein Ornrh_0972 [Ornithobacterium rhinotracheale
           DSM 15997]
          Length = 390

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 48/139 (34%), Positives = 74/139 (53%), Gaps = 6/139 (4%)

Query: 62  RVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R  +LD FRG TV LMILV++ G     +  + H+ W GCT  D V PFFLF VG A+A 
Sbjct: 4   RYYSLDVFRGATVALMILVNNPGSWSAMFKPLTHAEWAGCTPTDLVFPFFLFAVGNAMAF 63

Query: 119 ALKKVPKINGAV--KKIIFRTLKLLFWGIILQ-GGYSHAPDALSYGVDMKHIRWCGILQR 175
            + ++ K    V  +K++ RT  +   G++L    +    D +      +++R  G+LQR
Sbjct: 64  VIPRMQKAGSQVFWRKVLKRTFLIFIIGLLLNWFPFVQWKDGILTFKHWENVRILGVLQR 123

Query: 176 IALVYVVVALIETLTTKRR 194
           IA  Y   A+I     +++
Sbjct: 124 IAFAYFFAAIIAYYFKEKK 142


>gi|423282312|ref|ZP_17261197.1| hypothetical protein HMPREF1204_00735 [Bacteroides fragilis HMW
           615]
 gi|404581880|gb|EKA86575.1| hypothetical protein HMPREF1204_00735 [Bacteroides fragilis HMW
           615]
          Length = 387

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 85/358 (23%), Positives = 147/358 (41%), Gaps = 90/358 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           Q  +KR+  LD  RG+T+  MI+V++ G     YA + H+ W G T  D V PFF+FI+G
Sbjct: 3   QTVNKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMG 62

Query: 114 VAIALALKK--VPKINGAVKKIIFRT-------LKLLFWGIILQGGYSHAPDALSYGVDM 164
           ++  ++L+K      + A  KI+ RT       L + ++ +  +   S + + +S+   +
Sbjct: 63  ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLSSEDISFFSRL 122

Query: 165 K-------HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                   HIR  G++QR+AL Y   A+I  +   +                       +
Sbjct: 123 YESIWTFGHIRILGVMQRLALCYGATAIIALIMKHK-----------------------Y 159

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
           I ++I I+                   +  +I+     G    + N +  VDR + G  H
Sbjct: 160 IPYLIAILL------------------IGYFIILINGNGFEYNSSNILSIVDRTVLGEAH 201

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y D                               +PEGLLSTI +I    IG   G +L
Sbjct: 202 MYKD----------------------------NGIDPEGLLSTIPSIAHVLIGFCVGKLL 233

Query: 338 IHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           +  K    +++    +G  L     +L +    PI+K+++S ++   T G A   L +
Sbjct: 234 MEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLAL 289


>gi|319785830|ref|YP_004145305.1| transmembrane protein [Pseudoxanthomonas suwonensis 11-1]
 gi|317464342|gb|ADV26074.1| putative transmembrane protein [Pseudoxanthomonas suwonensis 11-1]
          Length = 357

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 54/131 (41%), Positives = 78/131 (59%), Gaps = 8/131 (6%)

Query: 55  LLQQKSKRVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFI 111
           +   + +R+A++DA RGLTV  M+LV+   D G  YA + H+ W+GCT AD V PFFL I
Sbjct: 1   MADARFRRLASVDALRGLTVAAMLLVNNPGDWGHVYAPLLHADWHGCTPADLVFPFFLAI 60

Query: 112 VGVAIALALKKVPKIN-GAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 170
           VGV+IAL +  VP+I  GA +  + RT+ +    I+  G   H      + +D  H R  
Sbjct: 61  VGVSIALGV--VPRIEAGADRAGLMRTVAVRPLRILAVGLLLHL--LAWWWLDQPHYRPW 116

Query: 171 GILQRIALVYV 181
           G+LQRI L ++
Sbjct: 117 GVLQRIGLCFL 127


>gi|325103749|ref|YP_004273403.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324972597|gb|ADY51581.1| hypothetical protein Pedsa_1010 [Pedobacter saltans DSM 12145]
          Length = 374

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 48/131 (36%), Positives = 73/131 (55%), Gaps = 6/131 (4%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R  +LD  RG TV  MI+V+  G     YA + H+PW+G T+ D V P FLF+VG A++ 
Sbjct: 11  RYLSLDVLRGATVAFMIIVNTPGSWSYVYAPLKHAPWHGFTVTDLVFPTFLFVVGNAMSF 70

Query: 119 ALKKVPKINGA--VKKIIFRTLKLLFWGIILQG-GYSHAPDALSYGVDMKHIRWCGILQR 175
            + K+ +   +  +KK+  RTLK+   G+ L    +    D +    D   IR  G+LQR
Sbjct: 71  GMGKLKEQGNSAFLKKVFSRTLKIFLIGLFLNMFPFVKWVDDVLVMKDFTEIRIWGVLQR 130

Query: 176 IALVYVVVALI 186
           IA+ Y + +L+
Sbjct: 131 IAVCYCIASLL 141



 Score = 43.5 bits (101), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 46/82 (56%), Gaps = 12/82 (14%)

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNA- 369
           PF+PEG+LSTI A+++  +G ++  + I  KG++       S  F L+   +IL    + 
Sbjct: 198 PFDPEGVLSTIPAVVNVILG-YFAGLFIQKKGNNK------STAFNLIGTGVILLLAASA 250

Query: 370 ----IPINKQLYSFSYVCFTAG 387
                PINK +++ SYV +T G
Sbjct: 251 WNLVFPINKPIWTSSYVLYTVG 272


>gi|329851960|ref|ZP_08266641.1| hypothetical protein ABI_47300 [Asticcacaulis biprosthecum C19]
 gi|328839809|gb|EGF89382.1| hypothetical protein ABI_47300 [Asticcacaulis biprosthecum C19]
          Length = 369

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 88/343 (25%), Positives = 145/343 (42%), Gaps = 77/343 (22%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
              +R  +LD FRGLTV  MI+V+ +G     +A++ H+ W G TLAD V P FLF VG 
Sbjct: 1   MAGQRFTSLDVFRGLTVAFMIVVNTSGPGAAPFAQLSHATWFGLTLADLVFPAFLFAVGN 60

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQ-GGYSHAPDALSYGVD-MKHIRWCGI 172
           A++    K       + K++ R   L   G ++    + HA  A  + ++ ++H R  G+
Sbjct: 61  AMSFGDPKSGPTGRYLGKVVKRAAILFLLGYLMYWFPFVHA-TADGWALNPVEHTRIPGV 119

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQRIAL ++  A+                      A +W  +   I     ++  Y    
Sbjct: 120 LQRIALCFLAAAI----------------------AVRWLDVPKLIGLSAVLLLGY---- 153

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRG-HLGPACNAVGYVDRELWGINHLYSDPVWSRLEACT 291
             W             ++  G  G  L P  N    +DR ++GINH+Y+           
Sbjct: 154 --WG-----------ALMVFGPPGEQLTPLGNIGALIDRAVFGINHMYAK---------- 190

Query: 292 LSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG-HVLIHFKGHSARLKHW 350
                 G             ++PEGL ST+ AI++   G   G ++       +  ++  
Sbjct: 191 ----GKG-------------YDPEGLFSTLPAIVNVLAGYLAGRYIRSQPDLRTVVIRLA 233

Query: 351 VSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCL 393
           V+ G   LI+A  L ++   P++K+L++ S+     G   G L
Sbjct: 234 VAAG---LIVAAALAWSLTFPLSKRLWTSSFALINIGIDLGLL 273


>gi|423259248|ref|ZP_17240171.1| hypothetical protein HMPREF1055_02448 [Bacteroides fragilis
           CL07T00C01]
 gi|423263781|ref|ZP_17242784.1| hypothetical protein HMPREF1056_00471 [Bacteroides fragilis
           CL07T12C05]
 gi|387776828|gb|EIK38928.1| hypothetical protein HMPREF1055_02448 [Bacteroides fragilis
           CL07T00C01]
 gi|392706047|gb|EIY99170.1| hypothetical protein HMPREF1056_00471 [Bacteroides fragilis
           CL07T12C05]
          Length = 387

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 86/359 (23%), Positives = 146/359 (40%), Gaps = 92/359 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           Q  +KR+  LD  RG+T+  MI+V++ G     Y  + H+ W G T  D V PFF+FI+G
Sbjct: 3   QTVNKRLLALDVLRGITIAGMIMVNNPGSWSYVYVPLGHAAWIGLTPTDLVFPFFMFIMG 62

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMK---- 165
           ++  ++L+K      + A  KI+ RT+ +   G+ +     +    ++LS G D+     
Sbjct: 63  ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSR 121

Query: 166 ---------HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                    HIR  G++QR+AL Y   A+I  +   +                       
Sbjct: 122 LYESVWTFGHIRILGVMQRLALCYGATAIIALIMKHK----------------------- 158

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
           +I ++I I+                   +  +I+     G    + N +  VDR + G  
Sbjct: 159 YIPYLIAILL------------------IGYFIILINGNGFEYNSSNILSIVDRTVLGEA 200

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEGLLSTI +I    IG   G +
Sbjct: 201 HMYKD----------------------------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           L+  K    +++    +G  L     +L +    PI+K+++S ++   T G A   L +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLAL 289


>gi|403236334|ref|ZP_10914920.1| hypothetical protein B1040_11244 [Bacillus sp. 10403023]
          Length = 373

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 87/345 (25%), Positives = 148/345 (42%), Gaps = 76/345 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAG-GAYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
                R+ +LD  RG+ VV  + +     G+Y    H+ W G TL DF++P F+ + GV 
Sbjct: 6   NSARSRIHSLDMARGIIVVFSVFLSSLPYGSYDFATHASWYGLTLVDFILPCFITVFGVG 65

Query: 116 IALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
           +A+A +K  K     K+ I RT+KL+ +G++           +++  D+  +R+ G+LQ 
Sbjct: 66  MAIAYQKGVK----WKRFISRTIKLILFGLLFN-------IIVAWSFDLSTLRFTGVLQM 114

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW 235
            AL+ +   LI      +RP  +    L IF+ Y         A ++Y+  T    +P  
Sbjct: 115 YALLGIGTVLITRFI--KRPITVSLVGLLIFSIYG--------AILLYMGQTCEGSLPQ- 163

Query: 236 SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSP 295
                                   P CN    VD  ++G +H+YS               
Sbjct: 164 ------------------------PGCNPSWLVDPVVFGESHIYS--------------- 184

Query: 296 NSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF 355
               L E         F+PEG+ +  SA+ +  +G   G ++I +K   A    W  +  
Sbjct: 185 ----LGERG-------FDPEGIPAIFSALGNVLLGFAAGRIIILYKEKGA---GWRLLLH 230

Query: 356 GLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
           G+L++A+       +P  K+L++ S+   TAGA    L + + IF
Sbjct: 231 GVLLLALAFLVEAFLPFGKRLWTPSFGLVTAGATSLLLALLHIIF 275


>gi|299140911|ref|ZP_07034049.1| membrane protein [Prevotella oris C735]
 gi|298577877|gb|EFI49745.1| membrane protein [Prevotella oris C735]
          Length = 370

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 88/336 (26%), Positives = 143/336 (42%), Gaps = 70/336 (20%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
              +R+ +LD  RGL +  MILV++ G     Y  ++HS WNG T  D V PFF+F +G+
Sbjct: 1   MTQQRLISLDMLRGLAMAGMILVNNPGSWSHIYVPLEHSVWNGLTPTDLVFPFFVFAMGM 60

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG-VDMKHIRWCGIL 173
           A+  + K +  +  +  + + +   LLF   ++    +     L+ G +    +R  G+L
Sbjct: 61  AMGFSTKNLTALRASYLRKVMKRSVLLF---VIGLLLTLLGRWLNTGELCFSQLRVMGVL 117

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QR++L Y+VVALI      RR   +      +       W+   +               
Sbjct: 118 QRLSLSYLVVALIV-----RRVKGVPTMTFVVVALLSGYWVLLLLG-------------- 158

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
                    HG                A N V  VDR L G +HLY +    RL      
Sbjct: 159 ---------HGFDF------------SANNIVAVVDRWLLGESHLYIE----RL------ 187

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 353
            P+  P+           F+PEGLLSTI  +    +G     +L   +    R+     +
Sbjct: 188 -PDGTPI----------AFDPEGLLSTIPCVAQVLLGYICSRLLCTSQELPQRILRLAVI 236

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           G  LL+  ++L +    P+NK+++S ++V  T+G A
Sbjct: 237 GALLLLSGLLLSYM--CPLNKKIWSSTFVMVTSGYA 270


>gi|262381451|ref|ZP_06074589.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262296628|gb|EEY84558.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 372

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 88/350 (25%), Positives = 144/350 (41%), Gaps = 101/350 (28%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA------------------YARIDHSPWNGC 98
           +Q+ +R+ +LDA RG  ++ ++     GGA                    +++H  WNG 
Sbjct: 5   KQQPQRLQSLDALRGFDMLFIM-----GGASLFVALATLFPNPFFQAIAGQMEHVEWNGL 59

Query: 99  TLADFVMPFFLFIVGVAIALALKK---VPKINGAV-KKIIFRTLKLLFWGIILQGGYSHA 154
              D + P FLFI G++   +L+K        GA+ KKI+ R + L+F G++  G     
Sbjct: 60  AHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNG----- 114

Query: 155 PDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWI 214
              LS+  D  H+R   +L RI L ++  AL                   +F  + W+  
Sbjct: 115 --LLSFEFD--HLRCASVLARIGLGWMFAAL-------------------LFVRFGWKAR 151

Query: 215 GGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWG 274
            G  A ++        +VP               +   G  G      N VGY+DR    
Sbjct: 152 AGITALILVGYWLAMAFVP---------------VPDAGGAGPFTLEGNLVGYIDR---- 192

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
                            L  P  G L E         F+PEGL ST+ AI +  +G+  G
Sbjct: 193 -----------------LFLP--GRLHETV-------FDPEGLFSTVPAIATAMLGMFTG 226

Query: 335 H-VLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVC 383
             + +  +G + R K    +G G +++ + L ++   PINK+L++ S+VC
Sbjct: 227 EWIKLRKEGLTDRKKVLCLVGAGAVLLIVGLLWSLVFPINKKLWTSSFVC 276


>gi|116331948|ref|YP_801666.1| hypothetical protein LBJ_2457 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. JB197]
 gi|116125637|gb|ABJ76908.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
           Hardjo-bovis str. JB197]
          Length = 363

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 85/330 (25%), Positives = 138/330 (41%), Gaps = 81/330 (24%)

Query: 72  LTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKING 128
           +TVV MILV++ G     Y+ + H+ WNGCT  D V PFFLF VG +I ++L     IN 
Sbjct: 1   MTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVGASIPISLYSKNGINR 60

Query: 129 AV--KKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVA 184
                 I  R + L+  G+ L   G ++ +            +R  G+LQRI  VY VVA
Sbjct: 61  IRIWIGICIRGISLILLGLFLNFFGEWTFS-----------ELRIPGVLQRIGFVYWVVA 109

Query: 185 LIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHG 244
            +  +   ++                       + F+I I     L V  W  +  +  G
Sbjct: 110 TLFLVFPGKKV----------------------LVFLIPI-----LLVHTWILTHIAPPG 142

Query: 245 VKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDA 304
                ++ G         +   ++DR ++G  HL+                      + +
Sbjct: 143 ESMVSLEQGK--------DIGAWIDRTIFGEKHLW----------------------KFS 172

Query: 305 PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL 364
            +W     +PEG LS I++I +   G+  G +L   +G     +     G G L   + L
Sbjct: 173 KTW-----DPEGFLSGIASIATSLFGVICGFILFRREGRGKN-RVLSIFGLGFLFTFVGL 226

Query: 365 HFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
            +  ++P+NK L++ SY  +T G A  C+G
Sbjct: 227 LWDRSLPMNKSLWTGSYAVYTTGFAFLCIG 256


>gi|255315516|ref|ZP_05357099.1| hypothetical protein CdifQCD-7_14229, partial [Clostridium
           difficile QCD-76w55]
          Length = 381

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 77/143 (53%), Gaps = 16/143 (11%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +  + R+ ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +G
Sbjct: 10  KLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLG 69

Query: 114 VAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRW 169
           V I +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R 
Sbjct: 70  VTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVRI 120

Query: 170 CGILQRIALVYVVVALIETLTTK 192
            G+LQR+ LVY V +L+  L  K
Sbjct: 121 LGVLQRMGLVYFVTSLVYLLLKK 143



 Score = 45.4 bits (106), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 196 KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FN 253

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGI 395
              P NK+L+S S+V   AG+    L I
Sbjct: 254 QYFPFNKRLWSSSFVLLMAGSYGMLLSI 281


>gi|255093765|ref|ZP_05323243.1| hypothetical protein CdifC_14056 [Clostridium difficile CIP 107932]
          Length = 505

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 77/143 (53%), Gaps = 16/143 (11%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +  + R+ ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +G
Sbjct: 134 KLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLG 193

Query: 114 VAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRW 169
           V I +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R 
Sbjct: 194 VTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVRI 244

Query: 170 CGILQRIALVYVVVALIETLTTK 192
            G+LQR+ LVY V +L+  L  K
Sbjct: 245 LGVLQRMGLVYFVTSLVYLLLKK 267



 Score = 45.4 bits (106), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 320 KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FN 377

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGI 395
              P NK+L+S S+V   AG+    L I
Sbjct: 378 QYFPFNKRLWSSSFVLLMAGSYGMLLSI 405


>gi|384362003|ref|YP_006199855.1| hypothetical protein CDBI1_13575 [Clostridium difficile BI1]
          Length = 485

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 77/143 (53%), Gaps = 16/143 (11%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +  + R+ ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +G
Sbjct: 114 KLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLG 173

Query: 114 VAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRW 169
           V I +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R 
Sbjct: 174 VTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVRI 224

Query: 170 CGILQRIALVYVVVALIETLTTK 192
            G+LQR+ LVY V +L+  L  K
Sbjct: 225 LGVLQRMGLVYFVTSLVYLLLKK 247



 Score = 45.4 bits (106), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 300 KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FN 357

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGI 395
              P NK+L+S S+V   AG+    L I
Sbjct: 358 QYFPFNKRLWSSSFVLLMAGSYGMLLSI 385


>gi|374374997|ref|ZP_09632655.1| hypothetical protein NiasoDRAFT_0408 [Niabella soli DSM 19437]
 gi|373231837|gb|EHP51632.1| hypothetical protein NiasoDRAFT_0408 [Niabella soli DSM 19437]
          Length = 395

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 96/359 (26%), Positives = 147/359 (40%), Gaps = 101/359 (28%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R  +LD FRG TV LMILV++ G     YA +DH+PW+G T  D V PFFLF VG A++ 
Sbjct: 4   RYRSLDVFRGATVCLMILVNNPGSWAHIYAPLDHAPWHGLTPTDLVFPFFLFAVGNAMSF 63

Query: 119 ALKKVPKINGAV--KKIIFRTLKLLFWGIIL--------QGGYSHAPDALSYGVDMKHIR 168
            + ++ +   A   KKI  RTL +   GI L         G    A   ++       IR
Sbjct: 64  VIPRLQEAGPAEFWKKITKRTLIIFGIGIFLNWSPFVRWNGDTLQAVTWVTDPAKNIGIR 123

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQRIA  Y   ++I           L+P+     TAY       F++ V+ ++  +
Sbjct: 124 IFGVLQRIAFCYFFASIIVYY--------LKPK-----TAY-------FLSLVL-LLAYW 162

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLE 288
            L +       +S            ++G  G        +D+ +  I H+Y         
Sbjct: 163 GLCILGNPADPYS------------LKGWFGTN------IDKAILHIPHMYKG------- 197

Query: 289 ACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLK 348
                        E  P      F+PEG  S++ AI+    G   G   ++ K  SA++ 
Sbjct: 198 -------------EGVP------FDPEGFASSLGAIVQIVFGYFVG---MYIKNSSAQIP 235

Query: 349 HWVS------------------MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
             ++                   G GLL+      +    PINK++++ SY  +T G A
Sbjct: 236 KDLTDKQDPRNPMFKMLTVLFVAGVGLLVTGFC--WDMVFPINKKIWTSSYTVYTTGLA 292


>gi|255651295|ref|ZP_05398197.1| hypothetical protein CdifQCD_14003 [Clostridium difficile
           QCD-37x79]
          Length = 461

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 77/143 (53%), Gaps = 16/143 (11%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +  + R+ ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +G
Sbjct: 90  KLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLG 149

Query: 114 VAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRW 169
           V I +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R 
Sbjct: 150 VTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVRI 200

Query: 170 CGILQRIALVYVVVALIETLTTK 192
            G+LQR+ LVY V +L+  L  K
Sbjct: 201 LGVLQRMGLVYFVTSLVYLLLKK 223



 Score = 45.4 bits (106), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 276 KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FN 333

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGI 395
              P NK+L+S S+V   AG+    L I
Sbjct: 334 QYFPFNKRLWSSSFVLLMAGSYGMLLSI 361


>gi|255518179|ref|ZP_05385855.1| hypothetical protein CdifQCD-_13768 [Clostridium difficile
           QCD-97b34]
          Length = 469

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 77/143 (53%), Gaps = 16/143 (11%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +  + R+ ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +G
Sbjct: 98  KLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLG 157

Query: 114 VAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRW 169
           V I +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R 
Sbjct: 158 VTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVRI 208

Query: 170 CGILQRIALVYVVVALIETLTTK 192
            G+LQR+ LVY V +L+  L  K
Sbjct: 209 LGVLQRMGLVYFVTSLVYLLLKK 231



 Score = 45.4 bits (106), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 284 KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FN 341

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGI 395
              P NK+L+S S+V   AG+    L I
Sbjct: 342 QYFPFNKRLWSSSFVLLMAGSYGMLLSI 369


>gi|254976379|ref|ZP_05272851.1| hypothetical protein CdifQC_13741 [Clostridium difficile QCD-66c26]
          Length = 459

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 77/143 (53%), Gaps = 16/143 (11%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +  + R+ ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +G
Sbjct: 88  KLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLG 147

Query: 114 VAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRW 169
           V I +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R 
Sbjct: 148 VTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVRI 198

Query: 170 CGILQRIALVYVVVALIETLTTK 192
            G+LQR+ LVY V +L+  L  K
Sbjct: 199 LGVLQRMGLVYFVTSLVYLLLKK 221



 Score = 45.4 bits (106), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 274 KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FN 331

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGI 395
              P NK+L+S S+V   AG+    L I
Sbjct: 332 QYFPFNKRLWSSSFVLLMAGSYGMLLSI 359


>gi|359686994|ref|ZP_09256995.1| hypothetical protein LlicsVM_01380 [Leptospira licerasiae serovar
           Varillal str. MMD0835]
 gi|418756670|ref|ZP_13312858.1| PF07786 family protein [Leptospira licerasiae serovar Varillal str.
           VAR 010]
 gi|384116341|gb|EIE02598.1| PF07786 family protein [Leptospira licerasiae serovar Varillal str.
           VAR 010]
          Length = 391

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 93/352 (26%), Positives = 147/352 (41%), Gaps = 95/352 (26%)

Query: 51  QLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPF 107
            + +L  +   R+ ++D  RGLTV  MILV++ G     Y  + H+ W+GCT  D V PF
Sbjct: 14  SIHKLKSENRIRILSIDLLRGLTVAGMILVNNPGTWSNMYWPLKHAKWDGCTPTDLVFPF 73

Query: 108 FLFIVGVAIALALKKVPKING--AVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVD 163
           FLF VG +I  ++      NG     KI+ R++ L+F G+ L   G +S +         
Sbjct: 74  FLFAVGASIPFSVS-----NGIQEFPKILKRSVILIFLGLFLNFFGEWSFS--------- 119

Query: 164 MKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIY 223
             ++R+ G+LQRI   Y   A+        R   L+ R                      
Sbjct: 120 --NLRFPGVLQRIGFAYFFSAI------AYREKNLKFR---------------------- 149

Query: 224 IITTYSLYVPNWSFSEH-SDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYS-D 281
           II   +L +  W   E     G  +  +K G         +   ++DRE++G  HL+   
Sbjct: 150 IILFLTLLISYWYLQEFIPPPGAAEPSMKEGK--------DWGAWLDREVFGQAHLWKFG 201

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL-IHF 340
            VW                            +PEGLL++ ++I S   GI  G  L +H 
Sbjct: 202 KVW----------------------------DPEGLLTSFTSIASVFCGIFAGEFLKVHL 233

Query: 341 KGHSARLK---HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           +   + L      V   F +L++  +  +    PINK L++ +Y  +TAG A
Sbjct: 234 EKKESPLSISGKIVLGAFAVLVVGGV--WGIYYPINKSLWTGTYSLWTAGWA 283


>gi|255013328|ref|ZP_05285454.1| hypothetical protein B2_05430 [Bacteroides sp. 2_1_7]
 gi|410103821|ref|ZP_11298742.1| hypothetical protein HMPREF0999_02514 [Parabacteroides sp. D25]
 gi|409236550|gb|EKN29357.1| hypothetical protein HMPREF0999_02514 [Parabacteroides sp. D25]
          Length = 372

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 87/350 (24%), Positives = 143/350 (40%), Gaps = 101/350 (28%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA------------------YARIDHSPWNGC 98
           +Q+ +R+ +LDA RG  ++ ++     GGA                    +++H  WNG 
Sbjct: 5   KQQPQRLQSLDALRGFDMLFIM-----GGASLFVALATLFPNPFFQAIAGQMEHVEWNGL 59

Query: 99  TLADFVMPFFLFIVGVAIALALKKVP---KINGAV-KKIIFRTLKLLFWGIILQGGYSHA 154
              D + P FLFI G++   +L+K        GA+ KKI+ R + L+F G++  G     
Sbjct: 60  AHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNG----- 114

Query: 155 PDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWI 214
              LS+  D  H+R   +L RI L ++  AL                   +F  + W+  
Sbjct: 115 --LLSFEFD--HLRCASVLARIGLGWMFAAL-------------------LFVRFGWKVR 151

Query: 215 GGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWG 274
            G    ++        +VP               +   G  G      N VGY+DR    
Sbjct: 152 AGITVLILVGYWLAMAFVP---------------VPDVGGAGPFTLEGNLVGYIDR---- 192

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
                            L  P  G L E         F+PEGL ST+ AI +  +G+  G
Sbjct: 193 -----------------LFLP--GRLHETV-------FDPEGLFSTVPAIATAMLGMFTG 226

Query: 335 H-VLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVC 383
             + +  +G + R K    +G G +++ + L ++   PINK+L++ S+VC
Sbjct: 227 EWIKLRKEGLTDRKKELCLVGAGAVLLIVGLLWSLVFPINKKLWTSSFVC 276


>gi|375356811|ref|YP_005109583.1| putative transmembrane protein [Bacteroides fragilis 638R]
 gi|383116724|ref|ZP_09937472.1| hypothetical protein BSHG_1191 [Bacteroides sp. 3_2_5]
 gi|251947990|gb|EES88272.1| hypothetical protein BSHG_1191 [Bacteroides sp. 3_2_5]
 gi|301161492|emb|CBW21032.1| putative transmembrane protein [Bacteroides fragilis 638R]
          Length = 387

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 86/359 (23%), Positives = 146/359 (40%), Gaps = 92/359 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           Q  +KR+  LD  RG+T+  MI+V++ G     YA + H+ W G T  D V PFF+FI+G
Sbjct: 3   QTVNKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMG 62

Query: 114 VAIALALKK--VPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMK---- 165
           ++  ++L+K      + A  KI+ RT+ +   G+ +     +    ++LS G D+     
Sbjct: 63  ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSR 121

Query: 166 ---------HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                    HIR  G++QR+AL Y   A+I  +   +                       
Sbjct: 122 LYESVWTFGHIRILGVMQRLALCYGATAIIALIMKHK----------------------- 158

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
           +I ++I I+                   +  +I+     G    + N +  VD  + G  
Sbjct: 159 YIPYLIAILL------------------IGYFIILINGNGFEYNSSNILSIVDHTVLGEA 200

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEGLLSTI +I    IG   G +
Sbjct: 201 HMYKD----------------------------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           L+  K    +++    +G  L     +L +    PI+K+++S ++   T G A   L +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY--GCPISKKIWSPTFAIVTCGLASSFLAL 289


>gi|296452402|ref|ZP_06894103.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
           difficile NAP08]
 gi|296258732|gb|EFH05626.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
           difficile NAP08]
          Length = 481

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 48/140 (34%), Positives = 76/140 (54%), Gaps = 16/140 (11%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + RV ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +GV I
Sbjct: 113 NSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTI 172

Query: 117 ALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
            +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R  G+
Sbjct: 173 PISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNTVRILGV 223

Query: 173 LQRIALVYVVVALIETLTTK 192
           LQR+ LVY V +L+  L  K
Sbjct: 224 LQRMGLVYFVTSLVYLLLKK 243



 Score = 45.4 bits (106), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 48/93 (51%), Gaps = 7/93 (7%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 296 KPDFEPDGFLTSIVAISSGMLGCMMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FN 353

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
              P NK+L+S S+V   AG+     GI  SIF
Sbjct: 354 QYFPFNKRLWSSSFVLLMAGSY----GILLSIF 382


>gi|357624248|gb|EHJ75102.1| putative heparan-alpha-glucosaminide N-acetyltransferase [Danaus
           plexippus]
          Length = 340

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 72/288 (25%), Positives = 125/288 (43%), Gaps = 42/288 (14%)

Query: 112 VGVAIALALK-----KVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKH 166
           +G A+ L+L       +P++N A+ ++  R+L L   GI L           S   +  +
Sbjct: 1   MGEAMVLSLNARLRTSLPRVN-ALGQVARRSLLLSLIGICLG----------SVNTNWSY 49

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQW-QWIGGFIAFVIYII 225
           +R+ G+LQR+A +Y++V  +E    +   N++  R L    A  W QW+   +   I + 
Sbjct: 50  VRFPGVLQRLAAMYLIVGSLECAFMRTSQNIIPGRSLFRDIAAGWQQWLATVLMVAIQLC 109

Query: 226 TTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWS 285
            T ++  P          G+ +     G            GY+DR + G NHLY    + 
Sbjct: 110 ITLTVAAPGCPVGYSGPGGLHR--TATGDFSLQNCTGGIAGYIDRLILGPNHLYQHGTFK 167

Query: 286 RLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA 345
            +    L                  P +PEG+L  +S +L    G H   +++ +    A
Sbjct: 168 SIYRTQL------------------PHDPEGILGILSGVLVVQAGAHAARIMLVYNHARA 209

Query: 346 RLKHWV--SMGFGLLIIAIILHFTNA--IPINKQLYSFSYVCFTAGAA 389
           R+  WV  S+ FG ++  ++  F++   IP+NK L+S SY   T+  A
Sbjct: 210 RIMRWVFWSVMFG-VVGGLLCKFSDGGYIPVNKNLWSVSYCLVTSSMA 256


>gi|296877751|ref|ZP_06901777.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
           difficile NAP07]
 gi|296431202|gb|EFH17023.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
           difficile NAP07]
          Length = 370

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 48/140 (34%), Positives = 76/140 (54%), Gaps = 16/140 (11%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + RV ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +GV I
Sbjct: 2   NSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTI 61

Query: 117 ALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
            +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R  G+
Sbjct: 62  PISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNTVRILGV 112

Query: 173 LQRIALVYVVVALIETLTTK 192
           LQR+ LVY V +L+  L  K
Sbjct: 113 LQRMGLVYFVTSLVYLLLKK 132



 Score = 45.4 bits (106), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 48/93 (51%), Gaps = 7/93 (7%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 185 KPDFEPDGFLTSIVAISSGMLGCMMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FN 242

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
              P NK+L+S S+V   AG+     GI  SIF
Sbjct: 243 QYFPFNKRLWSSSFVLLMAGSY----GILLSIF 271


>gi|404486905|ref|ZP_11022093.1| hypothetical protein HMPREF9448_02547 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335959|gb|EJZ62425.1| hypothetical protein HMPREF9448_02547 [Barnesiella intestinihominis
           YIT 11860]
          Length = 373

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 81/353 (22%), Positives = 138/353 (39%), Gaps = 100/353 (28%)

Query: 57  QQKSKRVATLDAFRGLTVVLMI---------------LVDDAGGAYARIDHSPWNGCTLA 101
            +K+ R+ +LD  RG  ++ ++                V D    + R  H PW+G T  
Sbjct: 3   TKKNTRLLSLDTLRGFDMLFIMGFAPLVVTLNALHPTAVGDVIAGHMR--HVPWDGFTQH 60

Query: 102 DFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDA 157
           D + P FLFI G++   +L K      + K I  R  +    L+  G +  G        
Sbjct: 61  DMIFPLFLFIAGISFPFSLAKQRGSGSSDKHIYLRVFRRGVTLVLLGFLYNGFLQ----- 115

Query: 158 LSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
               ++   +R   +L RI L ++  A I               ++S+  + Q+      
Sbjct: 116 ----LNFPDVRLASVLGRIGLAWMFGAFI---------------YMSLKKSVQY------ 150

Query: 218 IAFVIYIITTYSL---YVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWG 274
              +++I+  Y L   +VP    +  S   ++                N VGY+DR    
Sbjct: 151 -GLIVFILVGYWLLLAFVPAPDAAGASPLSIEG---------------NLVGYIDRHCLP 194

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
              +Y +                              F+PEGLLST+ AI++  +GI+ G
Sbjct: 195 GKLIYGN------------------------------FDPEGLLSTLPAIVTALLGIYAG 224

Query: 335 HVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            ++   +  S   K  +  G G++++AI L +    PINK L+S S+ CF  G
Sbjct: 225 EIVRSTRLGSGERKSLLLSGIGVVLVAIGLVWNTVFPINKMLWSSSFTCFVGG 277


>gi|383643230|ref|ZP_09955636.1| hypothetical protein SeloA3_10334 [Sphingomonas elodea ATCC 31461]
          Length = 382

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 93/340 (27%), Positives = 145/340 (42%), Gaps = 91/340 (26%)

Query: 62  RVATLDAFRGLTVVLMILVDDAG-GA--YARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R   LD FRG T+ LMILV+ +G GA  Y ++ H+ W G TLAD V P FLF +G A++ 
Sbjct: 17  RFLALDVFRGATIFLMILVNTSGPGAEPYPQLVHAKWIGFTLADLVFPTFLFAMGNAMSF 76

Query: 119 ALKKVPKINGAVKKIIFRTLKLLF--------WGIILQG--GYSHAPDALSYGVDMKHIR 168
           A +K P   G     +FR   ++F        +  + QG  G++  P AL+        R
Sbjct: 77  AFRK-PVATGPFLARLFRRGAIIFVLGYLMYWFPFVEQGPDGWALKPFALT--------R 127

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQR+AL YV+  L+           L+PR L +            IA ++   T  
Sbjct: 128 VPGVLQRLALCYVLAGLMI--------RWLKPRQLLL----------AGIAMLLGYWTIL 169

Query: 229 SLYVP-NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287
            ++ P   +F ++++ G +                     +D  L G  HLY        
Sbjct: 170 LVFSPAGMAFDKYANIGTQ---------------------IDLWLLGPGHLY-------- 200

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL 347
                        ++D      A F+PEGLL T+ A ++   G   G  ++        +
Sbjct: 201 -------------KKD------AGFDPEGLLGTLPATVNVIAGYLAGLAIVQGGDLRRTV 241

Query: 348 KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
                +G  L++    L ++   PI K+L++ SYV  T G
Sbjct: 242 GRMALVGAALVLAG--LAWSPWFPIAKKLWTGSYVLLTVG 279


>gi|298376668|ref|ZP_06986623.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
 gi|298266546|gb|EFI08204.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
          Length = 372

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 87/350 (24%), Positives = 144/350 (41%), Gaps = 101/350 (28%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA------------------YARIDHSPWNGC 98
           +Q+ +R+ +LDA RG  ++ ++     GGA                    +++H  WNG 
Sbjct: 5   KQQPQRLQSLDALRGFDMLFIM-----GGASLFVALATLFPNPFFQAIAGQMEHVEWNGL 59

Query: 99  TLADFVMPFFLFIVGVAIALALKK---VPKINGAV-KKIIFRTLKLLFWGIILQGGYSHA 154
              D + P FLFI G++   +L+K        GA+ KKI+ R + L+F G++  G     
Sbjct: 60  AHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNG----- 114

Query: 155 PDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWI 214
              LS+  D  H+R   +L RI L ++  AL                   +F  + W+  
Sbjct: 115 --LLSFEFD--HLRCASVLARIGLGWMFAAL-------------------LFVRFGWKAR 151

Query: 215 GGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWG 274
            G  A ++        +VP               +   G  G      N VGY+DR    
Sbjct: 152 AGITALILVGYWLVMAFVP---------------VPDAGGAGPFTLEGNLVGYIDR---- 192

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
                            L  P  G L E         F+PEGL ST+ AI +  +G+  G
Sbjct: 193 -----------------LFLP--GRLHETV-------FDPEGLFSTVPAIATAMLGMFTG 226

Query: 335 H-VLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVC 383
             + +  +G + R K    +G G +++ + L ++   P+NK+L++ S+VC
Sbjct: 227 EWIKLGKEGLTDRKKVLCLVGAGAVLLIVGLLWSLVFPVNKKLWTSSFVC 276


>gi|371776142|ref|ZP_09482464.1| hypothetical protein AnHS1_01923 [Anaerophaga sp. HS1]
          Length = 369

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/143 (36%), Positives = 77/143 (53%), Gaps = 13/143 (9%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            +K++R   LD  RG+T+ LMI V++ G     YA + HS W+GCT  D V PFFLF+VG
Sbjct: 1   MKKTERYLALDVLRGMTIALMITVNNPGSWKYIYAPLRHSSWHGCTPTDLVFPFFLFVVG 60

Query: 114 VAIALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
           V++  +  K        + K++  RTL +   G+ L    +  P    +  D   +R  G
Sbjct: 61  VSMFFSFSKYGNTLNKESFKRLGRRTLLIFAIGLFL----NSFPQ---WDRDYSTLRIMG 113

Query: 172 ILQRIALVYVVVALIETLTTKRR 194
           +LQRIAL Y   +LI  L+  R+
Sbjct: 114 VLQRIALAYGFGSLI-VLSVPRK 135



 Score = 42.0 bits (97), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 45/80 (56%), Gaps = 5/80 (6%)

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           PF+PEGLLSTI A+++  +G   G     F   + ++K    +    LI+AI       +
Sbjct: 188 PFDPEGLLSTIPAVVTVLLGYLTG----VFIKTTEKVKIPGQLALYGLIVAIAGRLWGLV 243

Query: 371 -PINKQLYSFSYVCFTAGAA 389
            PINK L++ SYV +TAG A
Sbjct: 244 FPINKPLWTGSYVLYTAGLA 263


>gi|427400072|ref|ZP_18891310.1| hypothetical protein HMPREF9710_00906 [Massilia timonae CCUG 45783]
 gi|425720812|gb|EKU83727.1| hypothetical protein HMPREF9710_00906 [Massilia timonae CCUG 45783]
          Length = 380

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 85/335 (25%), Positives = 141/335 (42%), Gaps = 80/335 (23%)

Query: 62  RVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ +LDAFRG T+  M+LV+   D G  +A++ H+ W+G T  D + PFFLFI GVA+AL
Sbjct: 7   RLTSLDAFRGFTIAAMVLVNNPGDWGHLHAQLAHAAWHGWTFTDTIFPFFLFIGGVAMAL 66

Query: 119 ALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           +L ++        +++ +  K    +   G +L          L    D   +R  G+LQ
Sbjct: 67  SLGRLAAAGAHKPQLLLKLAKRAALIFLIGFLLN---------LIPRFDFDSVRIPGVLQ 117

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RIAL  ++ A                  L ++  ++ Q +  F+   +Y +    + VP 
Sbjct: 118 RIALCTLLAA-----------------PLVVYLTWRGQALAVFLLLALYSVLMLLVPVP- 159

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
                    G+          G+L P  +A  ++DR L                      
Sbjct: 160 ---------GIGA--------GNLEPGRDAGAWIDRAL---------------------- 180

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMG 354
              G L   A +W     +PEGL+ST+ A+ S   G+  G +L+       ++   +  G
Sbjct: 181 -MDGHLWAQAKTW-----DPEGLVSTLPAVCSLLFGVLAGRLLLSALPRVEQVVWLMLSG 234

Query: 355 FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
              L +   L     +PINK L++ S+    +G A
Sbjct: 235 LACLALGSTLD-AVLMPINKSLWTPSFCLLMSGWA 268


>gi|423081105|ref|ZP_17069717.1| hypothetical protein HMPREF1122_00699 [Clostridium difficile
           002-P50-2011]
 gi|423085023|ref|ZP_17073481.1| hypothetical protein HMPREF1123_00624 [Clostridium difficile
           050-P50-2011]
 gi|357550878|gb|EHJ32683.1| hypothetical protein HMPREF1123_00624 [Clostridium difficile
           050-P50-2011]
 gi|357551414|gb|EHJ33204.1| hypothetical protein HMPREF1122_00699 [Clostridium difficile
           002-P50-2011]
          Length = 427

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/140 (33%), Positives = 76/140 (54%), Gaps = 16/140 (11%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + R+ ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +GV I
Sbjct: 59  NSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTI 118

Query: 117 ALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
            +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R  G+
Sbjct: 119 PISINSKIKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVRILGV 169

Query: 173 LQRIALVYVVVALIETLTTK 192
           LQR+ LVY V +L+  L  K
Sbjct: 170 LQRMGLVYFVTSLVYLLLKK 189



 Score = 45.4 bits (106), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 48/93 (51%), Gaps = 7/93 (7%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 242 KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKEDIGEYKKFFKILVMSIILLIGAFI--FN 299

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
              P NK+L+S S+V   AG+     GI  SIF
Sbjct: 300 QYFPFNKRLWSSSFVLLMAGSY----GILLSIF 328


>gi|427781073|gb|JAA55988.1| Putative heparan-alpha-glucosaminide n-acetyltransferase
           [Rhipicephalus pulchellus]
          Length = 337

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 76/297 (25%), Positives = 129/297 (43%), Gaps = 62/297 (20%)

Query: 112 VGVAIALALKKVPKINGAVKKIIFRTLK---LLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           +GV++A+ ++ + + +    +I  + +K   +LF       G     + LS  VD+  +R
Sbjct: 1   MGVSLAMTIRSLLRKSVTRGRIFLQIVKRSLILF-------GLGIMTNTLSGDVDLNTLR 53

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQR+A  Y+V A +  L  K     L    +    AY  +W+       +++  T+
Sbjct: 54  IPGVLQRLAFSYLVAATVHLLFAKPHEGQLVWAPVRDVLAYWPEWLLAIPMLALHLALTF 113

Query: 229 SLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPA----------CN--AVGYVDRELWGIN 276
            L VPN                 C  +G+LGP           C   A G++DR ++G +
Sbjct: 114 FLPVPN-----------------CP-QGYLGPGGLHLNSSFENCTGGAAGFIDRRIFGNS 155

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y  P    +    L                  P++PEG L  +++I    +G+  G +
Sbjct: 156 HIYQTPDMRHVYDTHL------------------PYDPEGTLGCLTSIFLVFLGLQAGKI 197

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIA-IILHFTNA---IPINKQLYSFSYVCFTAGAA 389
           L+ F    AR+  W   G    IIA ++ +F+     IPINK L+S S++  TA  A
Sbjct: 198 LLTFPEWKARVIRWCIWGLLCGIIAGVLCNFSKEEGWIPINKNLWSVSFILSTASTA 254


>gi|255307823|ref|ZP_05351994.1| hypothetical protein CdifA_14636 [Clostridium difficile ATCC 43255]
          Length = 483

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/140 (33%), Positives = 76/140 (54%), Gaps = 16/140 (11%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + R+ ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +GV I
Sbjct: 115 NSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTI 174

Query: 117 ALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
            +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R  G+
Sbjct: 175 PISINSKIKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVRILGV 225

Query: 173 LQRIALVYVVVALIETLTTK 192
           LQR+ LVY V +L+  L  K
Sbjct: 226 LQRMGLVYFVTSLVYLLLKK 245



 Score = 45.8 bits (107), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 48/93 (51%), Gaps = 7/93 (7%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 298 KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENLGEYKKFFKILVMSIILLIGAFI--FN 355

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
              P NK+L+S S+V   AG+     GI  SIF
Sbjct: 356 QYFPFNKRLWSSSFVLLMAGSY----GILLSIF 384


>gi|255101955|ref|ZP_05330932.1| hypothetical protein CdifQCD-6_14161 [Clostridium difficile
           QCD-63q42]
          Length = 469

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/140 (33%), Positives = 76/140 (54%), Gaps = 16/140 (11%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + R+ ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +GV I
Sbjct: 101 NSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTI 160

Query: 117 ALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
            +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R  G+
Sbjct: 161 PISINSKIKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVRILGV 211

Query: 173 LQRIALVYVVVALIETLTTK 192
           LQR+ LVY V +L+  L  K
Sbjct: 212 LQRMGLVYFVTSLVYLLLKK 231



 Score = 45.8 bits (107), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 48/93 (51%), Gaps = 7/93 (7%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 284 KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENLGEYKKFFKILVMSIILLIGAFI--FN 341

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
              P NK+L+S S+V   AG+     GI  SIF
Sbjct: 342 QYFPFNKRLWSSSFVLLMAGSY----GILLSIF 370


>gi|329963071|ref|ZP_08300851.1| hypothetical protein HMPREF9446_02444 [Bacteroides fluxus YIT
           12057]
 gi|328529112|gb|EGF56042.1| hypothetical protein HMPREF9446_02444 [Bacteroides fluxus YIT
           12057]
          Length = 381

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 94/359 (26%), Positives = 146/359 (40%), Gaps = 93/359 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDA---GGAYARIDHSPWNGCTLADFVMPFFLFIVG 113
            + ++R+  LD  RGLT+  MILV+        YA + H+ WNG T  D + PFFLF++G
Sbjct: 1   MKNNQRLVALDVMRGLTIAGMILVNTPETWSYVYAPLQHARWNGLTPTDVIFPFFLFMMG 60

Query: 114 VAIALALKKVP--KINGAVKKIIFRTLKLLFWGIIL---------------QGGYSHAPD 156
           V++ ++LKK      +  + KII R+L L   G  +               Q G+   P 
Sbjct: 61  VSMYISLKKCSFHLSSHLLMKIIRRSLILFLIGTAIYALATFLGTLRDACRQPGFEGNPW 120

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
             ++   +   R  G+LQR+ + Y + ++I          VL  RH      Y     GG
Sbjct: 121 KEAFA-SLPGTRIPGVLQRLGVCYGIGSII----------VLTCRH-----RYIPHLAGG 164

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
            +A    I+   + +V       HS                     N +  VDR L+G N
Sbjct: 165 ILAGYFLILLFGNGFV-------HSPE-------------------NILSVVDRTLFGDN 198

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
            +                 N G +            +PEG LST+ +I    IG   G +
Sbjct: 199 MI-----------------NDGGI------------DPEGALSTLPSIAQVLIGFCIGKI 229

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
            I       +L      G  +LI+  +  F+   P+NK++++ S+V  T G A   LGI
Sbjct: 230 CIETPDMREKLNKIFLYGSLMLIVGWL--FSYGCPLNKRVWTPSFVLVTCGFACLLLGI 286


>gi|357628855|gb|EHJ78009.1| putative heparan-alpha-glucosaminide N-acetyltransferase [Danaus
           plexippus]
          Length = 275

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/179 (29%), Positives = 88/179 (49%), Gaps = 25/179 (13%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
            R+ +LD FRG+ + LM           +  H+ WNG T+AD V P+F F +G A+ L+L
Sbjct: 81  SRLRSLDIFRGIAIALM--------QANKFSHAVWNGLTVADLVFPWFAFTMGEAMVLSL 132

Query: 121 K-----KVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
                  +P++N A+ ++  R+L L   GI L           S   +  ++R+ G+LQR
Sbjct: 133 NARLRTSLPRVN-ALGQVARRSLLLSLIGICLG----------SVNTNWSYVRFPGVLQR 181

Query: 176 IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQW-QWIGGFIAFVIYIITTYSLYVP 233
           +A +Y++V  +E    +   N++  R L    A  W QW+   +   I +  T ++  P
Sbjct: 182 LAAMYLIVGSLECAFMRTSQNIIPGRSLFRDIAAGWQQWLATVLMVAIQLCITLTVAAP 240


>gi|126700401|ref|YP_001089298.1| membrane protein [Clostridium difficile 630]
 gi|115251838|emb|CAJ69673.1| putative membrane protein [Clostridium difficile 630]
          Length = 370

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/140 (33%), Positives = 76/140 (54%), Gaps = 16/140 (11%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + R+ ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +GV I
Sbjct: 2   NSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTI 61

Query: 117 ALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
            +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R  G+
Sbjct: 62  PISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVRILGV 112

Query: 173 LQRIALVYVVVALIETLTTK 192
           LQR+ LVY V +L+  L  K
Sbjct: 113 LQRMGLVYFVTSLVYLLLKK 132



 Score = 45.1 bits (105), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 48/93 (51%), Gaps = 7/93 (7%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 185 KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKEDIGEYKKFFKILVMSIILLIGAFI--FN 242

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
              P NK+L+S S+V   AG+     GI  SIF
Sbjct: 243 QYFPFNKRLWSSSFVLLMAGSY----GILLSIF 271


>gi|301309930|ref|ZP_07215869.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|423340409|ref|ZP_17318148.1| hypothetical protein HMPREF1059_04073 [Parabacteroides distasonis
           CL09T03C24]
 gi|300831504|gb|EFK62135.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|409227844|gb|EKN20740.1| hypothetical protein HMPREF1059_04073 [Parabacteroides distasonis
           CL09T03C24]
          Length = 372

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 87/350 (24%), Positives = 143/350 (40%), Gaps = 101/350 (28%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA------------------YARIDHSPWNGC 98
           +Q+ +R+ +LDA RG  ++ ++     GGA                    +++H  WNG 
Sbjct: 5   KQQPQRLQSLDALRGFDMLFIM-----GGASLFVALATLFPNPFFQAIAGQMEHVEWNGL 59

Query: 99  TLADFVMPFFLFIVGVAIALALKK---VPKINGAV-KKIIFRTLKLLFWGIILQGGYSHA 154
              D + P FLFI G++   +L+K        GA+ KKI+ R + L+F G++  G     
Sbjct: 60  AHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNG----- 114

Query: 155 PDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWI 214
              LS+  D  H+R   +L RI L ++  AL                   +F  + W+  
Sbjct: 115 --LLSFEFD--HLRCASVLARIGLGWMFAAL-------------------LFVRFGWKVR 151

Query: 215 GGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWG 274
            G    ++        +VP               +   G  G      N VGY+DR    
Sbjct: 152 AGITVLILVGYWLAMAFVP---------------VPDAGGAGPFTLEGNLVGYIDR---- 192

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
                            L  P  G L E         F+PEGL ST+ AI +  +G+  G
Sbjct: 193 -----------------LFLP--GRLHETV-------FDPEGLFSTVPAIATAMLGMFTG 226

Query: 335 H-VLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVC 383
             + +  +G + R K    +G G +++ + L ++   PINK+L++ S+VC
Sbjct: 227 EWIKLRKEGLTDRKKVLCLVGAGAVLLIVGLLWSLVFPINKKLWTSSFVC 276


>gi|150007980|ref|YP_001302723.1| hypothetical protein BDI_1342 [Parabacteroides distasonis ATCC
           8503]
 gi|256840846|ref|ZP_05546354.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|423331513|ref|ZP_17309297.1| hypothetical protein HMPREF1075_01310 [Parabacteroides distasonis
           CL03T12C09]
 gi|149936404|gb|ABR43101.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
 gi|256738118|gb|EEU51444.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|409230083|gb|EKN22951.1| hypothetical protein HMPREF1075_01310 [Parabacteroides distasonis
           CL03T12C09]
          Length = 372

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 87/350 (24%), Positives = 143/350 (40%), Gaps = 101/350 (28%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGA------------------YARIDHSPWNGC 98
           +Q+ +R+ +LDA RG  ++ ++     GGA                    +++H  WNG 
Sbjct: 5   KQQPQRLQSLDALRGFDMLFIM-----GGASLFVALATLFPNPFFQAIAGQMEHVEWNGL 59

Query: 99  TLADFVMPFFLFIVGVAIALALKK---VPKINGAV-KKIIFRTLKLLFWGIILQGGYSHA 154
              D + P FLFI G++   +L+K        GA+ KKI+ R + L+F G++  G     
Sbjct: 60  AHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNG----- 114

Query: 155 PDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWI 214
              LS+  D  H+R   +L RI L ++  AL                   +F  + W+  
Sbjct: 115 --LLSFEFD--HLRCASVLARIGLGWMFAAL-------------------LFVRFGWKVR 151

Query: 215 GGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWG 274
            G    ++        +VP               +   G  G      N VGY+DR    
Sbjct: 152 AGITVLILVGYWLAMAFVP---------------VPDAGGAGPFTLEGNLVGYIDR---- 192

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
                            L  P  G L E         F+PEGL ST+ AI +  +G+  G
Sbjct: 193 -----------------LFLP--GRLHETV-------FDPEGLFSTVPAIATAMLGMFTG 226

Query: 335 H-VLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVC 383
             + +  +G + R K    +G G +++ + L ++   PINK+L++ S+VC
Sbjct: 227 EWIKLRKEGLTDRNKVLCLVGAGAVLLIVGLLWSLVFPINKKLWTSSFVC 276


>gi|339021122|ref|ZP_08645235.1| hypothetical protein ATPR_1543 [Acetobacter tropicalis NBRC 101654]
 gi|338751776|dbj|GAA08539.1| hypothetical protein ATPR_1543 [Acetobacter tropicalis NBRC 101654]
          Length = 377

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 86/336 (25%), Positives = 141/336 (41%), Gaps = 64/336 (19%)

Query: 70  RGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKI 126
           RG T+V M++V++ G     ++ +DH+ WNGCT AD V PFFLF++G  I  A  +  + 
Sbjct: 2   RGATIVFMVIVNNPGDWNRVWSPLDHAAWNGCTPADLVFPFFLFLMGCVIPFAFDRRLR- 60

Query: 127 NGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI 186
            GA +  +   +    W  +   G         +   + H+R+ G+L RIAL YV    +
Sbjct: 61  EGAQRSQLVSHIA---WRGLALVGLKLLLSLYPF-FHVTHLRFFGVLTRIALCYVAAVSL 116

Query: 187 ETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP--NWSFSEHSDHG 244
              + K              T +    IG  +  + Y    Y+L VP   W   + +   
Sbjct: 117 YLCSRK--------------TGFLVSVIG--LILLAYWAILYALPVPGLGWPGKDFAFLD 160

Query: 245 VKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDA 304
           + +               N   ++DR+                 A   +  ++G L E  
Sbjct: 161 LNR---------------NMAAWLDRQ---------------FSAWCQTWLHTGILYEK- 189

Query: 305 PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIIL 364
            +W     +PEGLLST+ AI +   G+  G V        AR +  + M  G   I + L
Sbjct: 190 -TW-----DPEGLLSTLPAIATTLSGVLAGQVFRRKDISPAR-RPMLFMAAGGASILVGL 242

Query: 365 HFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
            +    P+NK L++ S+   ++GAA   L + + IF
Sbjct: 243 LWGQVFPLNKSLWTSSFTLVSSGAALCSLAVCDGIF 278


>gi|326800650|ref|YP_004318469.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326551414|gb|ADZ79799.1| hypothetical protein Sph21_3257 [Sphingobacterium sp. 21]
          Length = 396

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/142 (35%), Positives = 79/142 (55%), Gaps = 17/142 (11%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           ++R  +LD FRG TV LMILV++ G    A++ + H+PW+GCT  D V PFFLF VG A+
Sbjct: 2   NQRYYSLDVFRGATVALMILVNNPGSWSYAFSPLKHAPWHGCTPTDLVFPFFLFAVGNAM 61

Query: 117 ALALKKVPKINGAV--KKIIFRTLKLLF------WGIILQGGYSHAPDALSYGVDM---- 164
           +  + ++    G V  KK++ RT+ +        W   +Q  +S+      Y ++     
Sbjct: 62  SFVIPRLRTQAGKVFWKKVLKRTILIFLIGLLLNWYPFVQ--WSNDTLLFKYWINPIKSD 119

Query: 165 KHIRWCGILQRIALVYVVVALI 186
             IR  G+LQRIAL Y   +++
Sbjct: 120 SGIRILGVLQRIALCYCFASIL 141



 Score = 41.2 bits (95), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 49/92 (53%), Gaps = 7/92 (7%)

Query: 310 APFEPEGLLSTISAILSGTIGIHYG-HVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTN 368
            PFEPEG+ ST +A++   IG   G ++    K     +  + ++   L++ A +L    
Sbjct: 200 VPFEPEGIASTFTAVIQVVIGFLVGQYIQTGTKAIEGPMLIYRTVS-TLMVTAALLTLGG 258

Query: 369 -----AIPINKQLYSFSYVCFTAGAADGCLGI 395
                A PINK++++ SYV +++G A   LG+
Sbjct: 259 MTWGLAFPINKKIWTSSYVLYSSGLAITALGV 290


>gi|359438686|ref|ZP_09228688.1| hypothetical protein P20311_2740 [Pseudoalteromonas sp. BSi20311]
 gi|359445329|ref|ZP_09235071.1| hypothetical protein P20439_1393 [Pseudoalteromonas sp. BSi20439]
 gi|358026628|dbj|GAA64937.1| hypothetical protein P20311_2740 [Pseudoalteromonas sp. BSi20311]
 gi|358040838|dbj|GAA71320.1| hypothetical protein P20439_1393 [Pseudoalteromonas sp. BSi20439]
          Length = 359

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 45/136 (33%), Positives = 73/136 (53%), Gaps = 9/136 (6%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
            R   LDA RGLT+ LMILV+  G     YA + H+ W+GCT  D + PFF+FI+G A+ 
Sbjct: 2   TRYKALDAMRGLTIALMILVNTPGSWSHVYAPLLHADWHGCTPTDVIFPFFMFIIGSAMF 61

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
            + KK      A +      L+L+  G I+      A +   +  +++++R  G+LQRI 
Sbjct: 62  FSFKKTNSAASASQ-----VLRLVKRGAII-FAIGLALNIYPFTTNIENLRILGVLQRIG 115

Query: 178 LVYVVVALIETLTTKR 193
           + Y++ ++      +R
Sbjct: 116 IAYILASICVLFLNRR 131


>gi|374310943|ref|YP_005057373.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358752953|gb|AEU36343.1| protein of unknown function DUF1624 [Granulicella mallensis
           MP5ACTX8]
          Length = 385

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/146 (34%), Positives = 77/146 (52%), Gaps = 18/146 (12%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +  SKR+ ++D  RG T+  MILV+ AG    AY  + H+ WNGCT  D V P FLF+ G
Sbjct: 12  ELTSKRIPSVDVLRGFTLAAMILVNAAGEWPHAYWPLKHAQWNGCTPTDLVFPTFLFLTG 71

Query: 114 VAIALALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGVDMKHIRW 169
            ++  + +         +++   TLK    L F G++L        +AL Y   +  +R 
Sbjct: 72  TSLVFSFRSRLARGVGKRELFLHTLKRSVILFFIGVLL--------NALPY-FHIGTLRI 122

Query: 170 CGILQRIALVYVVVALIETLTTKRRP 195
            G+LQRIAL Y+ V+++      RRP
Sbjct: 123 YGVLQRIALCYLCVSVLYLW--NRRP 146


>gi|392555555|ref|ZP_10302692.1| hypothetical protein PundN2_08983 [Pseudoalteromonas undina NCIMB
           2128]
          Length = 359

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 45/136 (33%), Positives = 73/136 (53%), Gaps = 9/136 (6%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
            R   LDA RGLT+ LMILV+  G     YA + H+ W+GCT  D + PFF+FI+G A+ 
Sbjct: 2   TRYKALDAMRGLTIALMILVNTPGSWSHVYAPLLHADWHGCTPTDVIFPFFMFIIGSAMF 61

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
            + KK      A +      L+L+  G I+      A +   +  +++++R  G+LQRI 
Sbjct: 62  FSFKKTNSAASASQ-----VLRLVKRGAII-FAIGLALNIYPFTTNIENLRILGVLQRIG 115

Query: 178 LVYVVVALIETLTTKR 193
           + Y++ ++      +R
Sbjct: 116 IAYILASICVLFLNRR 131


>gi|395213375|ref|ZP_10400182.1| hypothetical protein O71_05742 [Pontibacter sp. BAB1700]
 gi|394456744|gb|EJF11001.1| hypothetical protein O71_05742 [Pontibacter sp. BAB1700]
          Length = 391

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 73/135 (54%), Gaps = 7/135 (5%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q  +R  +LD  RGLT+ LM++V++ G     YA   H+ W+G T+ D V P FLF+VG 
Sbjct: 19  QTYERYLSLDVLRGLTIALMVVVNNPGSWGSIYAPFKHAAWHGFTVTDLVFPSFLFVVGN 78

Query: 115 AIALALKKVPKINGAV--KKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVDMKHIRWC 170
           A++ +++K      +V  +K++ RT  +   G+ L         P+      D   +R  
Sbjct: 79  AMSFSMRKFETQPDSVFLRKVLKRTALIFLIGLFLNLFPFVMRNPEGAIVMKDFTAVRIM 138

Query: 171 GILQRIALVYVVVAL 185
           G+LQRIAL Y + +L
Sbjct: 139 GVLQRIALCYFIASL 153



 Score = 38.5 bits (88), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 45/79 (56%), Gaps = 6/79 (7%)

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH--SARLKHWVSMGFGLLIIAIILHFTN 368
           PF+PEGLLST+ A ++   G ++  + I   G+  S   K  V+   G +++A+ L +  
Sbjct: 215 PFDPEGLLSTLPATVNVIAG-YFAGLFIQKNGNNLSTVFKLMVA---GAILVAMALVWDI 270

Query: 369 AIPINKQLYSFSYVCFTAG 387
             PINK +++ SYV  + G
Sbjct: 271 YFPINKPIWTSSYVLHSVG 289


>gi|408821750|ref|ZP_11206640.1| hypothetical protein PgenN_01470 [Pseudomonas geniculata N1]
          Length = 355

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 48/156 (30%), Positives = 79/156 (50%), Gaps = 17/156 (10%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ ++DA RG+TV  M+LV++ G     +A + HS W+GCT  D V PFFLF+VGV++A
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 66

Query: 118 LALK----KVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 173
            ++      V       + ++ R L+       +    +     + + +D  H R  G+L
Sbjct: 67  FSVAPRALDVSARPALARGVLERALR-------ILLAGALLHLLIWWALDTHHFRIWGVL 119

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAY 209
           QRIA   V  AL+  L    RP +     +++   Y
Sbjct: 120 QRIA---VCAALVGVLAVYARPRMQVGALIALLVGY 152


>gi|428299602|ref|YP_007137908.1| hypothetical protein Cal6303_2987 [Calothrix sp. PCC 6303]
 gi|428236146|gb|AFZ01936.1| hypothetical protein Cal6303_2987 [Calothrix sp. PCC 6303]
          Length = 104

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 51/85 (60%), Gaps = 3/85 (3%)

Query: 55  LLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFI 111
           L  Q S R+ +LD FRG+ +  MILV++ G     Y  ++H+ W+GCT  D V PFFLFI
Sbjct: 6   LPTQNSNRLVSLDVFRGIAIASMILVNNPGSWDSIYPPLEHAEWHGCTPTDLVFPFFLFI 65

Query: 112 VGVAIALALKKVPKINGAVKKIIFR 136
           VG+A+  +  K  K N    ++ +R
Sbjct: 66  VGMAMPFSFAKYTKENRPTARVYWR 90


>gi|323343595|ref|ZP_08083822.1| transmembrane protein [Prevotella oralis ATCC 33269]
 gi|323095414|gb|EFZ37988.1| transmembrane protein [Prevotella oralis ATCC 33269]
          Length = 384

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 85/350 (24%), Positives = 147/350 (42%), Gaps = 91/350 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            Q+ KR+  +D  RG+T+  MILV++ G    YA ++H+ W G T  D V PFF+FI+G+
Sbjct: 5   TQQKKRILAVDILRGMTIAGMILVNNPGTDTVYAPLEHAEWIGLTPTDLVFPFFMFIMGI 64

Query: 115 AIALALKKVPKINGAV---KKIIFRTLKLLFWGIILQGGYSHA-----PD--ALSYGVDM 164
              L+LKK  +   +V   +KI  R L L   G+ +   +        PD  ++ +G  +
Sbjct: 65  TTYLSLKKF-EFKWSVECGRKIAKRALLLWLIGLAISWLFMFCRGLLDPDMSSMPFGSRL 123

Query: 165 -------KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                    +R  G+L R+ + Y + A++  L+ K +                      +
Sbjct: 124 WASVNTFDQLRLLGVLPRLGICYGLAAVV-ALSVKHK----------------------Y 160

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
           I ++I II                   +  YI+     G+   A N +  VD  + G  H
Sbjct: 161 IPWLIAIIF------------------IGYYILLETCNGYAHDASNILAIVDDAVLGHGH 202

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y                     R ++P       +PEGLLST  A+    IG   G  +
Sbjct: 203 VY---------------------RWESP-------DPEGLLSTFPALAHVLIGFCVGRTV 234

Query: 338 IHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +  +  + +++    +G  L     +L +  A PI+K+L++ ++   T G
Sbjct: 235 MEMQNLNDKIERLFLIGALLTFAGFLLSY--ACPISKKLWTPTFAMVTCG 282


>gi|372221469|ref|ZP_09499890.1| hypothetical protein MzeaS_04069 [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 378

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 56/150 (37%), Positives = 77/150 (51%), Gaps = 23/150 (15%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG-GA--YARIDHSPWNGCTLADFVMPFFLFIVGV 114
            K+ R   LD FRGLT+ LMILV+  G GA  Y  + H+ W G TLAD V P FLF +G 
Sbjct: 6   NKTNRFLALDVFRGLTICLMILVNSPGTGADIYPYLSHANWFGFTLADLVFPSFLFAMGN 65

Query: 115 AIALALKKVPKINGA--VKKIIFRTLKLLFWGIIL---------QGGYSHAPDALSYGVD 163
           A++ ++ K+     +   KK++ RTL +   G ++           G+S  P A      
Sbjct: 66  AMSFSMAKLKTTPASKFYKKVVRRTLIIFLLGYLMYWFPFFEWSAQGFSLKPVA------ 119

Query: 164 MKHIRWCGILQRIALVYVVVA-LIETLTTK 192
               R  G+LQRIAL Y   + LI  L +K
Sbjct: 120 --ETRIMGVLQRIALCYFFASVLIYHLNSK 147


>gi|423089801|ref|ZP_17078150.1| hypothetical protein HMPREF9945_01335, partial [Clostridium
           difficile 70-100-2010]
 gi|357557565|gb|EHJ39099.1| hypothetical protein HMPREF9945_01335, partial [Clostridium
           difficile 70-100-2010]
          Length = 391

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 76/143 (53%), Gaps = 16/143 (11%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +  + R+ ++D  RGL++ LMI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +G
Sbjct: 20  KLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLG 79

Query: 114 VAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRW 169
           V I +++    K N +   II     R++ L+ +G  L   Y   P       D+  +R 
Sbjct: 80  VTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVRI 130

Query: 170 CGILQRIALVYVVVALIETLTTK 192
            G+LQR+ LVY V +L   L  K
Sbjct: 131 LGVLQRMGLVYFVTSLAYLLLKK 153



 Score = 45.1 bits (105), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 48/93 (51%), Gaps = 7/93 (7%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 206 KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENLGEYKKFFKILVMSIILLIGAFI--FN 263

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
              P NK+L+S S+V   AG+     GI  SIF
Sbjct: 264 QYFPFNKRLWSSSFVLLMAGSY----GILLSIF 292


>gi|344208862|ref|YP_004794003.1| hypothetical protein [Stenotrophomonas maltophilia JV3]
 gi|343780224|gb|AEM52777.1| Protein of unknown function DUF2261, transmembrane
           [Stenotrophomonas maltophilia JV3]
          Length = 360

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 82/153 (53%), Gaps = 11/153 (7%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+A++DA RG+TV  M+LV++ G     +A + HS W+GCT  D V PFFLF+VGV++A
Sbjct: 12  RRLASIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 71

Query: 118 LALKKVPK-INGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
            ++   P+ ++ A +  + R   +L   + +    +     + + +   H R  G+LQRI
Sbjct: 72  FSVA--PRALDAAARPALAR--GVLERALRILLAGALLHLLIWWALHTHHFRIWGVLQRI 127

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAY 209
           A   V  AL+  L    RP       +++   Y
Sbjct: 128 A---VCAALVGVLAVYARPRAQAAVLVTLLVGY 157



 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 52/101 (51%), Gaps = 8/101 (7%)

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPI 372
           +PEGLLST+ A+ S  +G+  G +L +  G SA L      G G+    + L     +P+
Sbjct: 198 DPEGLLSTLGALASTVLGLIAGGLLRN--GRSAALA-----GLGVATAVLGLLLATVLPL 250

Query: 373 NKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACF 413
           NKQL++ SYV +T G A   L + + + I +  W      F
Sbjct: 251 NKQLWTPSYVLWTGGLAALALWLGH-VLIDQKGWPALGRRF 290


>gi|386392672|ref|ZP_10077453.1| hypothetical protein DesU5LDRAFT_2079 [Desulfovibrio sp. U5L]
 gi|385733550|gb|EIG53748.1| hypothetical protein DesU5LDRAFT_2079 [Desulfovibrio sp. U5L]
          Length = 370

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/141 (34%), Positives = 73/141 (51%), Gaps = 14/141 (9%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
           +  R+A++D  RGL V  MIL ++ G     Y  + H+ W+G T ADF+ P FLF+VGV 
Sbjct: 5   RKTRLASVDGLRGLAVAGMILANNPGERGHVYRELQHAVWDGWTAADFIFPLFLFLVGVC 64

Query: 116 IALALKKVPKINGAVKKIIFRTLK---LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
           +ALA+ +     G   +   R L    +LF   +L+  Y          V  +++R  G+
Sbjct: 65  VALAVDRDTVRTGEAHRFWRRVLTRAIILFLLGLLENAYLR--------VSFENLRIPGV 116

Query: 173 LQRIALVYVVVALIETLTTKR 193
           LQRIA+VY+  A +      R
Sbjct: 117 LQRIAVVYLATAWLHVRCGNR 137


>gi|294949094|ref|XP_002786049.1| hypothetical protein Pmar_PMAR023775 [Perkinsus marinus ATCC 50983]
 gi|239900157|gb|EER17845.1| hypothetical protein Pmar_PMAR023775 [Perkinsus marinus ATCC 50983]
          Length = 277

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 74/299 (24%), Positives = 108/299 (36%), Gaps = 98/299 (32%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL 120
            R+  +D  RG + V   +VD  G     I H+PWNG  LAD VMP F+FI         
Sbjct: 30  PRIVAVDVMRGRSSVQ--IVDVCGKTVPSIGHAPWNGLHLADIVMPGFIFI--------- 78

Query: 121 KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVY 180
                                              D L+ G+D+   R  GILQRIA+ Y
Sbjct: 79  -----------------------------------DTLTLGLDLYTFRAPGILQRIAVCY 103

Query: 181 VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEH 240
               L+  L +   PN      L   +      + G +  +I           NW+    
Sbjct: 104 AAAVLLRKLVSDLSPNDTVKGALKNNSRV---LLMGLLCIII-----------NWAIMLL 149

Query: 241 SDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPL 300
                     +   RG L P CN    +DR ++G  H+YS P+W                
Sbjct: 150 GPQP------EGCSRGSLTPQCNVASNIDRMVFGPEHMYS-PLW---------------- 186

Query: 301 REDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLI 359
                       +PEGLLST+  + +  +G+  G  +     H+  L+    +G GLL+
Sbjct: 187 ------------DPEGLLSTLPTLATVALGLACGKFIQSRPSHTELLRL---VGCGLLL 230


>gi|282877735|ref|ZP_06286550.1| putative membrane protein [Prevotella buccalis ATCC 35310]
 gi|281300307|gb|EFA92661.1| putative membrane protein [Prevotella buccalis ATCC 35310]
          Length = 403

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 95/371 (25%), Positives = 152/371 (40%), Gaps = 103/371 (27%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           + +SKR+  +D  RG+T+  MILV++ G     +A ++H+ WNG T  D V PFF+F++G
Sbjct: 6   KPQSKRILAIDILRGITIAGMILVNNPGSWAHIFAPLEHAEWNGMTPTDLVFPFFMFVMG 65

Query: 114 VAIALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQGGYSH--------APDALSYGVD 163
           + I ++++K         V KII RTL L   GI + G +S           DA + G  
Sbjct: 66  MCIFISMQKYQFACNRQTVYKIIRRTLLLYLVGIFV-GWFSRFCYRWAFPLEDA-TLGQQ 123

Query: 164 MKH-------IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
           + H       IR  G+L R+A+ Y + AL+  +T + R                      
Sbjct: 124 IWHTVWSFDTIRLSGVLARLAICYGITALLA-ITVRHRY--------------------- 161

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
            ++ VI ++  Y++                  ++ CG  G      N +  VDR +    
Sbjct: 162 LLSIVITLLIGYTI------------------LLFCG-NGFAYDETNILSIVDRAVLTDA 202

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H+Y D                               +PEGLLST  AI    IG   G +
Sbjct: 203 HMYHD----------------------------NGIDPEGLLSTFPAIAHTLIGFLIGKL 234

Query: 337 LI-HFKGHSARLKHWVSMGF------GLLIIAIILHFTNAI-----PINKQLYSFSYVCF 384
                K  +A  ++    G        L I   IL F+  +     P+NK+++S ++V  
Sbjct: 235 AFSKQKTVTATTQNDPKTGLILHNIVPLFIAGTILTFSGLLLAYGCPLNKKIWSPTFVLT 294

Query: 385 TAGAADGCLGI 395
           + G A   L +
Sbjct: 295 SCGLASTLLAL 305


>gi|429738942|ref|ZP_19272716.1| hypothetical protein HMPREF9151_01157 [Prevotella saccharolytica
           F0055]
 gi|429158431|gb|EKY00988.1| hypothetical protein HMPREF9151_01157 [Prevotella saccharolytica
           F0055]
          Length = 400

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 86/368 (23%), Positives = 142/368 (38%), Gaps = 99/368 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
             KS R+  +D  RG+T+  MILV++ G     +A  +H+ WNG T  D V PFF+F++G
Sbjct: 5   TTKSSRILAIDILRGITIAGMILVNNPGNWGRIFAPFEHAEWNGMTPTDLVFPFFMFVMG 64

Query: 114 VAIALALKKVP-KINGAVKKIIFRTLKLLF-------WGIILQGGYSHAPDALSYGVDMK 165
           + I +A++K     N +    I + + L++       W       ++   +  S+G  + 
Sbjct: 65  MCIYIAMRKFDFTCNKSTVYKITKRMVLIYLVGLGIGWFAKFCFRWASPLEEASFGEQLW 124

Query: 166 HIRW-------CGILQRIALVYVVVALIE-TLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
           ++ W        G+L R+A+ Y + AL+  T+  K  P                      
Sbjct: 125 YMVWPFDSIRLTGVLARLAICYGITALLAVTVKHKNLP---------------------- 162

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
                YII T                 V  +I+     G      N +   DR +    H
Sbjct: 163 -----YIIVTLL---------------VGYFIILMAGNGFAYDETNILSIADRAVLTDVH 202

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y D                               +PEGLLST+ +I    +G   G +L
Sbjct: 203 MYHD----------------------------NGIDPEGLLSTLPSIAHTLLGFMVGSLL 234

Query: 338 I-----HFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAG 387
                 H +    R    +S    L ++  IL F+  +     P+NK+++S +YV  T G
Sbjct: 235 FKTTDEHSEHTDVRTGIILSKVVPLFVVGTILIFSGFLLSYGCPLNKKVWSPTYVLVTCG 294

Query: 388 AADGCLGI 395
            A   L +
Sbjct: 295 LASTLLAL 302


>gi|445497063|ref|ZP_21463918.1| putative transmembrane protein [Janthinobacterium sp. HH01]
 gi|444787058|gb|ELX08606.1| putative transmembrane protein [Janthinobacterium sp. HH01]
          Length = 353

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/130 (36%), Positives = 74/130 (56%), Gaps = 8/130 (6%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+ ++DA RGLTV  M+LV+DAG     Y  ++H+ W+GCT  DF+ P F+ IVG++I L
Sbjct: 2   RINSIDAVRGLTVAAMLLVNDAGDWSHVYPWLEHAEWHGCTPPDFIFPIFMLIVGISINL 61

Query: 119 ALKKVPKIN-GAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
           AL   P+++ GA    + R++ L    I+L G   H    L   ++ +  R  G+LQR  
Sbjct: 62  ALS--PRLDAGAATAPLARSVLLRAVRIVLLGLALHVVAMLL--LNGRGFRLFGVLQRTG 117

Query: 178 LVYVVVALIE 187
           + +    L+ 
Sbjct: 118 ICFAAAGLLA 127


>gi|254524630|ref|ZP_05136685.1| putative heparan-alpha-glucosaminide N-acetyltransferase
           (transmembrane protein 76) [Stenotrophomonas sp. SKA14]
 gi|219722221|gb|EED40746.1| putative heparan-alpha-glucosaminide N-acetyltransferase
           (transmembrane protein 76) [Stenotrophomonas sp. SKA14]
          Length = 355

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/141 (34%), Positives = 78/141 (55%), Gaps = 11/141 (7%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ ++DA RG+TV  M+LV++ G     +A + HS W+GCT  D V PFFLF+VGV++A
Sbjct: 7   RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 66

Query: 118 LALKKVPK-INGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
            ++   P+ ++ A +  + R   +L   + +    +     + + +   H R  G+LQRI
Sbjct: 67  FSVA--PRALDAAARPALAR--GVLERALRILLAGALLHLLIWWALHTHHFRIWGVLQRI 122

Query: 177 ALVYVVVALIETLTTKRRPNV 197
           A   V  AL+  L    RP V
Sbjct: 123 A---VCAALVGVLAVYARPRV 140



 Score = 39.3 bits (90), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 7/77 (9%)

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPI 372
           +PEGLLST+ A L+ T+       L+   G SA L      G G+    + L     +P+
Sbjct: 193 DPEGLLSTLGA-LASTVLGLLAGGLLR-NGRSAALG-----GLGVATAVLGLLLATVLPL 245

Query: 373 NKQLYSFSYVCFTAGAA 389
           NKQL++ SYV +T G A
Sbjct: 246 NKQLWTPSYVLWTGGLA 262


>gi|445498183|ref|ZP_21465038.1| putative membrane protein DUF1624 [Janthinobacterium sp. HH01]
 gi|444788178|gb|ELX09726.1| putative membrane protein DUF1624 [Janthinobacterium sp. HH01]
          Length = 370

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 73/137 (53%), Gaps = 6/137 (4%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIV 112
           ++ + +R   +D  RGLTV LMI+V+  G     YA   H+ W+G TL D V P F+F+V
Sbjct: 1   MKTQHQRSQAIDVLRGLTVALMIMVNMPGTPATTYAPFLHAEWHGLTLTDLVFPTFMFVV 60

Query: 113 GVAIALALKKVPKINGA--VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGV-DMKHIRW 169
           G A++  L+K   +  A  +KKI  RT  +   G ++      + D  S  V  +   R 
Sbjct: 61  GTALSFTLEKYEGMGEAAVLKKIFTRTALIFLCGFLMYWYPFFSTDGGSLTVLPLSGTRI 120

Query: 170 CGILQRIALVYVVVALI 186
            G+LQRIAL Y   +LI
Sbjct: 121 FGVLQRIALGYCAGSLI 137


>gi|395761203|ref|ZP_10441872.1| hypothetical protein JPAM2_05565 [Janthinobacterium lividum PAMC
           25724]
          Length = 373

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 54/134 (40%), Positives = 73/134 (54%), Gaps = 7/134 (5%)

Query: 60  SKRVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           S+R   LD  RGLTV LMI+V+   D G  YA   H+ W+G TL D V P FLF+VG A+
Sbjct: 5   SQRYLALDVLRGLTVALMIVVNTPGDWGSVYAPFLHAEWHGFTLTDLVFPSFLFVVGNAL 64

Query: 117 ALALKKVPKI-NGAV-KKIIFRTLKLLFWGIILQG-GYSHAPDALSYG-VDMKHIRWCGI 172
           A  L K   + +GAV  K+  R+  +   G +L    +    DA  +    +   R  G+
Sbjct: 65  AFVLGKYENLAHGAVLAKLCKRSALIFLLGFLLYWFPFFKIDDAGQFAWSSLSQTRIPGV 124

Query: 173 LQRIALVYVVVALI 186
           LQRIA+ Y+  ALI
Sbjct: 125 LQRIAVCYLAAALI 138


>gi|381188372|ref|ZP_09895934.1| N-acetylglucosamine related transporter, NagX [Flavobacterium
           frigoris PS1]
 gi|379650160|gb|EIA08733.1| N-acetylglucosamine related transporter, NagX [Flavobacterium
           frigoris PS1]
          Length = 430

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 34/90 (37%), Positives = 59/90 (65%), Gaps = 3/90 (3%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ +LD  RG+T+++M+LV++ G     +A ++H+ WNGCT  D V PFF+F++G AI 
Sbjct: 4   QRIISLDVLRGITIMMMVLVNNPGSWDNVFAPLEHANWNGCTPTDLVFPFFIFVLGAAIP 63

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIIL 147
           LA+        +  KI+ R+L+++  G+ L
Sbjct: 64  LAILTKELNQQSFLKILTRSLRIISLGLFL 93



 Score = 46.6 bits (109), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 46/78 (58%), Gaps = 4/78 (5%)

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM-GFGLLIIAIILHFTNAIP 371
           +PEG+LSTI AI SG IG+  G +L       A+ +  + M G GL ++   L +    P
Sbjct: 241 DPEGILSTIPAIASGIIGLLVGQLL---NSSLAKKEKGLKMFGAGLALVISGLIWNEFFP 297

Query: 372 INKQLYSFSYVCFTAGAA 389
           +NK L++ S+V +TAG A
Sbjct: 298 LNKSLWTSSFVLYTAGFA 315


>gi|410721825|ref|ZP_11361152.1| hypothetical protein B655_1618 [Methanobacterium sp. Maddingley
           MBC34]
 gi|410598366|gb|EKQ52947.1| hypothetical protein B655_1618 [Methanobacterium sp. Maddingley
           MBC34]
          Length = 372

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 93/351 (26%), Positives = 143/351 (40%), Gaps = 88/351 (25%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIV 112
           + +   R+ +LD FRG T+  MI+V+  G      + + H+ W G  LAD V PFFLFIV
Sbjct: 1   MVEVKDRLVSLDVFRGFTIAGMIMVNILGLYPDTPSLLQHASWIGLNLADLVFPFFLFIV 60

Query: 113 GVAIALALKKVPKINGAVK--KIIFRTLKLLFWGIILQGGYSHAPDALSYGV-DMKHIRW 169
           GV++  +     K     K  K +FR   L   G+ L  G         YGV D   IR 
Sbjct: 61  GVSMNFSFASRSKQPSWKKWGKFLFRVAALYLIGVALVFGLFF------YGVPDFSTIRI 114

Query: 170 CGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYS 229
            GILQ IAL  +  A +  L T+                    WI    + ++++     
Sbjct: 115 PGILQLIALSSLFAAPLARLRTR--------------------WIILAASVILFVQAAIL 154

Query: 230 LYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEA 289
           L+V        S  GV          G L  + N  G++D +++   HL           
Sbjct: 155 LWV--------SAPGVPA--------GSLEMSNNIAGWIDSQVFTPAHLLD--------- 189

Query: 290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARL-K 348
                      +E         F+PEG+++ I+      IG+  G  L        RL +
Sbjct: 190 -----------KEHV-------FDPEGMMAVINGTAMVLIGLACGRTL--------RLHR 223

Query: 349 HWVSMGF----GLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           +W  + +    GL+ + I L  +  +PI KQL++ S++   AG A   L +
Sbjct: 224 NWKGVQYLIIGGLIALTIGLIISPVMPIIKQLWTSSFILVNAGLAAIILAL 274


>gi|114048505|ref|YP_739055.1| hypothetical protein Shewmr7_3014 [Shewanella sp. MR-7]
 gi|113889947|gb|ABI43998.1| conserved hypothetical protein [Shewanella sp. MR-7]
          Length = 395

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 91/374 (24%), Positives = 154/374 (41%), Gaps = 86/374 (22%)

Query: 57  QQKSKRVATLDAFRGLTV-----------VLMILVDDAGGAYA--RIDHSPWNGCTLADF 103
           +Q   R+ +LDA RG  +            L+IL   AG  +   ++ HS WNG    D 
Sbjct: 24  RQSKPRLMSLDALRGFDMFWILGGEALFGALLILTGWAGWQWGDTQMHHSEWNGFRFYDL 83

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDALSYGV 162
           + P F+F+ GVA+ L+ K++ K+    +  ++R  +K LF  ++L   Y+H        V
Sbjct: 84  IFPLFIFLSGVALGLSPKRLDKLPMHERMPVYRHGIKRLFLLLLLGILYNHGWGT-GAPV 142

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
           D + +R+  +L RIA  +   AL+   T+ R   ++    L I  AY             
Sbjct: 143 DPEKVRYASVLGRIAFAWFFAALLVWHTSLRTQVLVA---LGILVAY------------- 186

Query: 223 YIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELW-GINHLYSD 281
                  L++P                   G  G L P  +   YVD  L  G+++    
Sbjct: 187 ---GAVQLWLP----------------FPGGQAGELSPTESINAYVDSLLLPGVSYQGRT 227

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--H 339
           P                              +PEG+LST+ A+++   G+  GH ++  H
Sbjct: 228 P------------------------------DPEGVLSTLPAVVNALAGVFVGHFIVKSH 257

Query: 340 FKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSI 399
            KG  A++      G   L +  +L     IP+NK+L++ S+V  T+G +   L +  +I
Sbjct: 258 PKGEWAKVGLLSVAGGVCLALGWLLD--GVIPVNKELWTSSFVLVTSGWSMLLLALFYAI 315

Query: 400 FIPEMDWDECDACF 413
            +  + W +    F
Sbjct: 316 -VDVLKWQKLAFIF 328


>gi|146302547|ref|YP_001197138.1| hypothetical protein Fjoh_4820 [Flavobacterium johnsoniae UW101]
 gi|146156965|gb|ABQ07819.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
          Length = 423

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 38/95 (40%), Positives = 59/95 (62%), Gaps = 5/95 (5%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
              +R+ +LD FRG T++LM +V++ G     Y  ++H+ W+GCT  D V PFF+FI+G 
Sbjct: 1   MTKERLTSLDVFRGFTILLMTIVNNPGSWSSIYPPLEHAEWHGCTPTDLVFPFFVFIMGT 60

Query: 115 AIALALKKVPKINGAV-KKIIFRTLKLLFWGIILQ 148
           AI  A+  V   +GAV  KI+ R+L++   G+ L 
Sbjct: 61  AIPFAM-PVKHFDGAVFNKILVRSLRIFCLGLFLS 94



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 2/83 (2%)

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPI 372
           +PEG+LST+ AI +G +G++ G +L      +  LK     G  LLI  ++ +     PI
Sbjct: 241 DPEGILSTLPAIGTGILGMYIGQLLNLQTNRTEILKKTAVTGVILLIGGLLWNII--FPI 298

Query: 373 NKQLYSFSYVCFTAGAADGCLGI 395
           NK L++ SYV +TAG A  CL +
Sbjct: 299 NKSLWTSSYVLYTAGIATLCLSL 321


>gi|421097001|ref|ZP_15557700.1| putative membrane protein [Leptospira borgpetersenii str.
           200901122]
 gi|410800246|gb|EKS02307.1| putative membrane protein [Leptospira borgpetersenii str.
           200901122]
          Length = 383

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 35/73 (47%), Positives = 48/73 (65%), Gaps = 3/73 (4%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q  +R+ +LD FRG+TV+ MILV++ G     Y+ + H+ WNGCT  D V PFFLF VGV
Sbjct: 7   QNKERILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGV 66

Query: 115 AIALALKKVPKIN 127
           +I ++L     IN
Sbjct: 67  SIPISLYSKNGIN 79



 Score = 46.6 bits (109), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 54/101 (53%), Gaps = 3/101 (2%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 371
           ++PEGLLS I++I +   G+  G +L   +G   + +     G G L   + L +  ++P
Sbjct: 195 WDPEGLLSGIASIATSLFGVLCGFILFLREG-VGKNRVLGIFGLGFLFTFVGLLWDQSLP 253

Query: 372 INKQLYSFSYVCFTAGAADGCLGITN--SIFIPEMDWDECD 410
           +NK L++ SY  +TAG +  C+G     ++ I   +W+  +
Sbjct: 254 MNKSLWTGSYAAYTAGLSFFCIGFFEYLNLLILLKEWNRLN 294


>gi|239907232|ref|YP_002953973.1| hypothetical protein DMR_25960 [Desulfovibrio magneticus RS-1]
 gi|239797098|dbj|BAH76087.1| hypothetical membrane protein [Desulfovibrio magneticus RS-1]
          Length = 371

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 84/343 (24%), Positives = 141/343 (41%), Gaps = 75/343 (21%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
            R++++D  RGL +  MI+V++ G     Y ++ H+ W+G TLAD V P FLF+VGV +A
Sbjct: 7   PRLSSVDTLRGLAIAAMIVVNNPGDRRFVYPQLLHAQWHGLTLADVVFPLFLFLVGVCVA 66

Query: 118 LALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIA 177
           LA+      +   +  ++R +      +   G   +A   LS+      +R  G+LQRIA
Sbjct: 67  LAIDPDKPRDAEARARLWRKILPRAAVLFALGLGENAYLRLSF----DELRLPGVLQRIA 122

Query: 178 LVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSF 237
           +VY+  A +++  + R   V+                 G +  + Y +   ++ VP    
Sbjct: 123 VVYLAAAWLQSRLSSRALAVV-----------------GAVTLLGYWLLLAAVPVPG--- 162

Query: 238 SEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNS 297
                HG     ++  ++G L         + R +W  +                     
Sbjct: 163 -----HGHPSLAMEPNLQGWLDQLV-----LGRHIWKFHT-------------------- 192

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL 357
                   SW     +PEG+LST  AI  G IG+  G  L        R      +    
Sbjct: 193 --------SW-----DPEGILSTFPAIALGLIGVLAGRWLRRGGDRPGRAGLLGLL---- 235

Query: 358 LIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
            +IAI L +    P+NK L + S+V  T G     L + +++ 
Sbjct: 236 -LIAIGLAWDAVFPLNKSLCTSSFVLLTGGLGLAMLAVAHAVL 277


>gi|315500593|ref|YP_004089395.1| hypothetical protein Astex_3616 [Asticcacaulis excentricus CB 48]
 gi|315418605|gb|ADU15244.1| protein of unknown function DUF1624 [Asticcacaulis excentricus CB
           48]
          Length = 372

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 86/335 (25%), Positives = 145/335 (43%), Gaps = 71/335 (21%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDA-GGA--YARIDHSPWNGCTLADFVMPFFLFIVGV 114
             + R   LD FRGLTV +MI+V+ +  GA  +A++ H+ W G TL D V P FLF +G 
Sbjct: 1   MSAARYTALDVFRGLTVCVMIVVNTSPAGAEPFAQLQHAQWFGFTLTDLVFPSFLFAIGN 60

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQ-GGYSHAPDALSYGV-DMKHIRWCGI 172
           ++  A +K       + K++ R+  +   G ++    + H     ++   D+ H R  G+
Sbjct: 61  SMVFAFRKPLPHKEFLLKVLRRSALIFLLGYLMYWFPFVHQTTDGAWAFNDIGHTRIMGV 120

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQRIAL Y+  +L            L  R L I +A                +  +  + 
Sbjct: 121 LQRIALCYLFASL--------AARYLSVRGLVILSA----------------LLLFGYWG 156

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
             ++F+  +D           M G+LG        +D+ + G++H+Y             
Sbjct: 157 LLYAFTPAAD--------ALTMTGNLGAK------IDQFVLGLDHMY------------- 189

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                   R  A       +EPEGLLST+ AI++   G   G +++       R    + 
Sbjct: 190 --------RGGAKG-----YEPEGLLSTLPAIVNVLAGYLCGRLIL--DSEDRRRTVMIL 234

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            G GL ++A  L ++   P +K+L++ S+   T G
Sbjct: 235 SGAGLALVAAALVWSFGFPFSKRLWTSSFAVLTIG 269


>gi|343083133|ref|YP_004772428.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342351667|gb|AEL24197.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 381

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 51/140 (36%), Positives = 76/140 (54%), Gaps = 9/140 (6%)

Query: 62  RVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R  +LD  RGLT+ LM++V+   D   ++  + H+ W+G TL D V P FLF+VG A++ 
Sbjct: 13  RYQSLDVLRGLTLALMVIVNTPGDGSTSFGPLTHADWHGLTLTDLVFPSFLFVVGNAMSF 72

Query: 119 ALKKVPKINGA---VKKIIFRTLKLLFWGIILQG-GYSHAPDALSYGVDMKHIRWCGILQ 174
           +L K  K+ G      K+  RT  +   G++L    +    D+     D   IR  G+LQ
Sbjct: 73  SLGKF-KLKGGKAYFSKVFKRTALIFIIGLLLTAFPFFRVNDSGVVPYDFTSIRILGVLQ 131

Query: 175 RIALVYVVVA-LIETLTTKR 193
           RIAL Y + A LI  L+ K+
Sbjct: 132 RIALCYGLGATLIYFLSPKK 151



 Score = 38.5 bits (88), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 51/93 (54%), Gaps = 4/93 (4%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 371
           F+PEGLLST  A+++  +G   G + I  +G       W++M F ++++ +   +    P
Sbjct: 206 FDPEGLLSTFPAMVNVLLGYWVG-LQIQKRGGDIETVLWLAM-FAVILLVVGYLWDYGFP 263

Query: 372 INKQLYSFSYVCFTAGAADGCLGITNSIFIPEM 404
           INK++++ S+   T G +   L +   +FI E+
Sbjct: 264 INKKIWTSSFTLVTVGYSTLTLALL--MFILEV 294


>gi|224064476|ref|XP_002301495.1| predicted protein [Populus trichocarpa]
 gi|222843221|gb|EEE80768.1| predicted protein [Populus trichocarpa]
          Length = 136

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 32/45 (71%), Positives = 39/45 (86%)

Query: 77  MILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 121
           MILVDDAGGA+  I+HSPW G TL+DFVMPFFLF+VG++I+L  K
Sbjct: 1   MILVDDAGGAFPCINHSPWFGVTLSDFVMPFFLFVVGLSISLVFK 45



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 34/63 (53%), Gaps = 4/63 (6%)

Query: 228 YSLYVPNWSFSEHSDH--GVKK--YIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPV 283
           Y LY P+W F   S H  G K     V CG++G L P CNA G +DR   G + LY  PV
Sbjct: 69  YGLYDPDWEFEVPSTHLFGYKSGTKTVNCGVKGSLEPPCNAAGLIDRFFLGEHPLYQHPV 128

Query: 284 WSR 286
           + R
Sbjct: 129 YRR 131


>gi|47213040|emb|CAF93449.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 297

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 112/243 (46%), Gaps = 43/243 (17%)

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTA------YQWQWIG 215
           V   ++R  G+LQR+AL Y+VVA ++ L  +R   V      + ++       Y   W+ 
Sbjct: 1   VSWDNLRIPGVLQRLALAYLVVACLDLLVARRFSCVFCVLQDAWWSQGIDILLYWPAWVC 60

Query: 216 GFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACN--AVGYVDRELW 273
             +   +++  T+ L VP+         G+       G  G L P C   A G++DR L 
Sbjct: 61  VLLLESVWLFITFLLPVPDCPTGYLGPGGI-------GDMG-LYPNCTGGAAGFIDRWLL 112

Query: 274 GINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHY 333
           G  H+Y +P    + A  L                  P++PEG+L +I++IL   +G+  
Sbjct: 113 GEKHIYQNPSSQGIYATHL------------------PYDPEGILGSINSILIAFLGLQA 154

Query: 334 GHVLIHFKG-HSARLKHWVSMGFGLLIIAIILHFTNA------IPINKQLYSFSYVCFTA 386
           G +++H +  H   +  ++  GF L II+ +L  TN       IPINK L+S SYV   A
Sbjct: 155 GKIILHHRDLHQGVISRFLIWGFLLGIISAVL--TNCSTNQGLIPINKNLWSLSYVTTLA 212

Query: 387 GAA 389
             A
Sbjct: 213 CFA 215


>gi|418023168|ref|ZP_12662153.1| hypothetical protein Sbal625DRAFT_1278 [Shewanella baltica OS625]
 gi|353537051|gb|EHC06608.1| hypothetical protein Sbal625DRAFT_1278 [Shewanella baltica OS625]
          Length = 384

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 93/374 (24%), Positives = 153/374 (40%), Gaps = 84/374 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVV-----------LMILVDDAGGAYA--RIDHSPWNGCTLAD 102
           ++    R+ +LDA RG  +            L+IL   AG  +   ++ HS W+G    D
Sbjct: 12  VKVNKPRLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFNFYD 71

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDALSYG 161
            + P F+F+ GVA+ L+ K++ K+  + +  ++R  +K LF  ++L   Y+H        
Sbjct: 72  LIFPLFIFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAP 130

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFV 221
            D + IR+  +L RIA  +   AL+   T+ R   ++    L I   Y            
Sbjct: 131 ADPEKIRYASVLGRIAFAWFFAALLVWHTSLRTQIIVA---LGILLGY------------ 175

Query: 222 IYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSD 281
                   L++P                   G  G L P  +   YVD  L         
Sbjct: 176 ----GAMQLWLP----------------FPGGQAGVLSPTESINAYVDSIL--------- 206

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--H 339
                L   +                 R P +PEGLLSTI AI++   G+  GH L+  H
Sbjct: 207 -----LPGVSYQG--------------RTP-DPEGLLSTIPAIVNALTGVFVGHFLVKSH 246

Query: 340 FKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSI 399
            KG  A++    + G  LL    +L     IP+NK+L++ S+V  T+G +   L +  ++
Sbjct: 247 PKGEWAKVGLLAAAGGILLAFGWLLDLV--IPVNKELWTSSFVLVTSGWSMILLAVFYAL 304

Query: 400 FIPEMDWDECDACF 413
            +  + W +    F
Sbjct: 305 -VDVLKWQKSAFFF 317


>gi|58583544|ref|YP_202560.1| hypothetical protein XOO3921 [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|58428138|gb|AAW77175.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
           10331]
          Length = 362

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 55/150 (36%), Positives = 76/150 (50%), Gaps = 16/150 (10%)

Query: 69  FRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALAL-KKVP 124
           FRGLT+ LMILV+ AG    AYA++ H+ W G TLAD V P FLF VG A++ AL   +P
Sbjct: 2   FRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSAMSFALATNMP 61

Query: 125 KIN--GAVKK---IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALV 179
            +   G V K   +I     L++W       +   PD       +  +R  G+LQRI L 
Sbjct: 62  HLQFLGRVSKRAALIALCGVLMYWFPF----FHLQPDGGWAFTTVDQVRLTGVLQRIGLC 117

Query: 180 YVVVALIETLTTKRRPNVLEPRHLSIFTAY 209
           Y+  AL   L     P  + P  +++   Y
Sbjct: 118 YLAAAL---LVRYLPPRGIAPACVALLLGY 144


>gi|386719962|ref|YP_006186288.1| N-acetylglucosamine related transporter, NagX [Stenotrophomonas
           maltophilia D457]
 gi|384079524|emb|CCH14124.1| N-acetylglucosamine related transporter, NagX [Stenotrophomonas
           maltophilia D457]
          Length = 352

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 48/153 (31%), Positives = 81/153 (52%), Gaps = 11/153 (7%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ ++DA RG+TV  M+LV++ G     +A + HS W+GCT  D V PFFLF+VGV++A
Sbjct: 4   RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 63

Query: 118 LALKKVPK-INGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
            ++   P+ ++ A +  + R   +L   + +    +     + + +   H R  G+LQRI
Sbjct: 64  FSVA--PRALDAAARPALAR--GVLERALRILLAGALLHLLIWWALHTHHFRIWGVLQRI 119

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAY 209
           A   V  A +  L    RP V     +++   Y
Sbjct: 120 A---VCAASVGVLAVYARPRVQAGVLVTLLVGY 149



 Score = 43.1 bits (100), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 8/101 (7%)

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPI 372
           +PEGLLST+ A+ S  +G+  G +L + +  +       +   GLL+  ++       P+
Sbjct: 190 DPEGLLSTLGALASTVLGLIAGGLLRNGRAAALAGLGVATAVLGLLLATVL-------PL 242

Query: 373 NKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACF 413
           NKQL++ SYV +T G A   L + + + I +  W      F
Sbjct: 243 NKQLWTPSYVLWTGGLAALALWLGH-VLIDQKRWPPLGRRF 282


>gi|16552925|dbj|BAB71412.1| unnamed protein product [Homo sapiens]
          Length = 367

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 68/235 (28%), Positives = 107/235 (45%), Gaps = 43/235 (18%)

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN--VLEPRHLSI--FTAYQWQWIGGF 217
           V    +R  G+LQR+ + Y VVA++E L  K  P     E   LS+   T+   QW+   
Sbjct: 70  VSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLIL 129

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLG--PACN--AVGYVDRELW 273
           +   +++  T+ L VP          G     +  G  G  G  P C   A GY+DR L 
Sbjct: 130 VLEGLWLGLTFLLPVP----------GCPTGYLGPGGIGDFGKYPNCTGGAAGYIDRLLL 179

Query: 274 GINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHY 333
           G +HLY  P  + L    ++                  ++PEG+L TI++I+   +G+  
Sbjct: 180 GDDHLYQHPSSAVLYHTEVA------------------YDPEGILGTINSIVMAFLGVQA 221

Query: 334 GHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILHFTNA--IPINKQLYSFSYV 382
           G +L+++K  +     R   W  +  GL+ +A+     N   IP+NK L+S SYV
Sbjct: 222 GKILLYYKARTKDILIRFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYV 275


>gi|395804141|ref|ZP_10483382.1| hypothetical protein FF52_19760 [Flavobacterium sp. F52]
 gi|395433785|gb|EJF99737.1| hypothetical protein FF52_19760 [Flavobacterium sp. F52]
          Length = 423

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/95 (38%), Positives = 58/95 (61%), Gaps = 5/95 (5%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
              +R+ +LD FRG T+ LM +V++ G     Y  ++H+ W+GCT  D V PFF+FI+G 
Sbjct: 1   MTKERLTSLDVFRGFTIFLMTIVNNPGSWSSIYPPLEHAEWHGCTPTDLVFPFFVFIMGT 60

Query: 115 AIALALKKVPKINGAV-KKIIFRTLKLLFWGIILQ 148
           AI  A+  V   +G+V  KI+ R+L++   G+ L 
Sbjct: 61  AIPFAM-PVKHFDGSVFNKILVRSLRIFCLGLFLS 94



 Score = 47.0 bits (110), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 52/97 (53%), Gaps = 7/97 (7%)

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFG 356
           +G L   + +W     +PEG+LST+ AI +G +G++ G +L         +K     G  
Sbjct: 230 NGHLWASSKTW-----DPEGILSTLPAIGTGILGMYIGQLLNLSVDKMEIVKKTAIAGTA 284

Query: 357 LLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCL 393
           L+I  +I +     PINK L++ SYV +TAG A  CL
Sbjct: 285 LVIGGLIWNI--FFPINKSLWTSSYVLYTAGIATLCL 319


>gi|254446502|ref|ZP_05059978.1| hypothetical protein VDG1235_4753 [Verrucomicrobiae bacterium
           DG1235]
 gi|198260810|gb|EDY85118.1| hypothetical protein VDG1235_4753 [Verrucomicrobiae bacterium
           DG1235]
          Length = 394

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 97/353 (27%), Positives = 151/353 (42%), Gaps = 84/353 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            K +R+  LDA RG T++ MI+V+  G     Y+ + H+ W+G T  D V PFFLF VGV
Sbjct: 1   MKRERLLALDALRGFTIIGMIIVNSPGSWSHVYSPLLHASWHGVTPTDLVFPFFLFFVGV 60

Query: 115 AIALAL--KKVPKIN--GAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 170
           +IALA   K+  K    G  +KI +R  K+   G+ L          L      + +R  
Sbjct: 61  SIALAYSGKRGTKRERVGKYRKIFWRVAKIFALGLFLN---------LWPYFYFEEMRVA 111

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           G+LQRIALV+ V A++  L T+ +               Q  W+G  I    + +    +
Sbjct: 112 GVLQRIALVFGVCAIL-FLNTRWK---------------QQLWVGASILLGYWALL---V 152

Query: 231 YVPNWSFSEHSDHGVKKYIV----------------KCGMRGHLGPACNAVGYVDRELWG 274
           +VP     E +   ++  IV                +  + G+  P  N   +VDR    
Sbjct: 153 WVPV-PLDEVNAGALETGIVERSYGTEVAVSVEARGETSIAGNFEPGVNIAAWVDRV--- 208

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
              L    +W R                         ++PEGLLST+ A+ +G  G+  G
Sbjct: 209 ---LLPGGMWER------------------------TWDPEGLLSTVPAVATGIFGMLVG 241

Query: 335 HVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            +++   G   R   WV    G++ + I   ++   P NK L+S S+V +  G
Sbjct: 242 ALILGV-GDPYRRVSWVFF-VGVVALLIGSAWSWVFPYNKNLWSSSFVLYAGG 292


>gi|355694569|gb|AER99714.1| heparan-alpha-glucosaminide N-acetyltransferase [Mustela putorius
           furo]
          Length = 296

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 68/231 (29%), Positives = 106/231 (45%), Gaps = 45/231 (19%)

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFT----AYQW-QWIGGFIAFV 221
           +R  G+LQR+ + Y VVA++E +  K  P        S F+     + W QW+   +   
Sbjct: 5   VRIPGVLQRLGVTYFVVAVLELIFAKPVPESCASER-SCFSLRDIIFSWPQWLFILMLES 63

Query: 222 IYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLG--PACN--AVGYVDRELWGINH 277
           I++  T+ L VP          G     +  G  G LG  P C   A GY+DR L G +H
Sbjct: 64  IWLALTFFLPVP----------GCPTGYLGPGGIGDLGKYPNCTGGAAGYIDRLLLGDDH 113

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y  P             +S  L        +  ++PEG+L +I++I+   +G+  G +L
Sbjct: 114 IYQHP-------------SSAVLYH-----TQVAYDPEGILGSINSIVMAFLGVQAGKIL 155

Query: 338 IHFKGHSA----RLKHWVSMGFGLLIIAIILHFTNA--IPINKQLYSFSYV 382
           +++K  +     R   W S   GL+ +A+     N   IPINK L+S SYV
Sbjct: 156 LYYKDQTKDILIRFTAW-SCFLGLISVALTKFSENEGFIPINKNLWSVSYV 205


>gi|373948546|ref|ZP_09608507.1| Protein of unknown function DUF2261, transmembrane [Shewanella
           baltica OS183]
 gi|386325609|ref|YP_006021726.1| hypothetical protein [Shewanella baltica BA175]
 gi|333819754|gb|AEG12420.1| Protein of unknown function DUF2261, transmembrane [Shewanella
           baltica BA175]
 gi|373885146|gb|EHQ14038.1| Protein of unknown function DUF2261, transmembrane [Shewanella
           baltica OS183]
          Length = 384

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 91/374 (24%), Positives = 154/374 (41%), Gaps = 84/374 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVV-----------LMILVDDAGGAYA--RIDHSPWNGCTLAD 102
           ++    R+ +LDA RG  +            L+IL   AG  +   ++ HS W+G    D
Sbjct: 12  VKVNKPRLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYD 71

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDALSYG 161
            + P F+F+ GVA+ L+ K++ K+  + +  ++R  +K LF  ++L   Y+H        
Sbjct: 72  LIFPLFIFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAP 130

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFV 221
            D + IR+  +L RIA  +   AL+   T+ R   ++    L I   Y            
Sbjct: 131 ADPEKIRYASVLGRIAFAWFFAALLVWHTSLRTQIIVA---LGILLGY------------ 175

Query: 222 IYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSD 281
                   L++P                   G  G L P  +   YVD  L         
Sbjct: 176 ----GAIQLWLP----------------FPGGQAGVLSPTESINAYVDSIL--------- 206

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--H 339
                L   +                 R P +PEGLLSTI AI++   G+  GH ++  H
Sbjct: 207 -----LPGVSYQG--------------RTP-DPEGLLSTIPAIVNALAGVFVGHFIVKSH 246

Query: 340 FKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSI 399
            KG  A++    + G   L++  +L     IP+NK+L++ S+V  T+G +   L +  ++
Sbjct: 247 PKGEWAKVGVLAAAGGIFLVLGWLLDLV--IPVNKELWTSSFVLVTSGWSMILLAVFYAL 304

Query: 400 FIPEMDWDECDACF 413
            +  + W +    F
Sbjct: 305 -VDVLKWQKAAFFF 317


>gi|298386962|ref|ZP_06996516.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|298260112|gb|EFI02982.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 376

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/355 (22%), Positives = 142/355 (40%), Gaps = 89/355 (25%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGC 98
           + +L +  + R+A+LD  RG  + L++       A AR              DH  W G 
Sbjct: 1   MSKLSENNTSRLASLDILRGFDLFLLVFFQPVFAALARQLNLPFLNDILYQFDHEVWEGF 60

Query: 99  TLADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHA 154
              D VMP FLF+ G ++  +L K   ++G+     ++I+ R   L  +G+I+QG     
Sbjct: 61  RFWDLVMPLFLFMTGASMPFSLSKYVGMSGSYWPVYRRILRRVFLLFIFGMIVQGNL--- 117

Query: 155 PDALSYGVDMKHIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQW 213
                 G+D  HI  +   LQ IA+ Y + A+I+              H S    ++WQ 
Sbjct: 118 -----LGLDSSHIYLYSNTLQSIAVGYFIAAVIQL-------------HFS----FRWQ- 154

Query: 214 IGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELW 273
           IG         IT   L++               Y +     G   PA N    VDR   
Sbjct: 155 IG---------ITLLLLFI---------------YWIPMTFLGDFTPAGNFAEQVDR--- 187

Query: 274 GINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAP-FEPEGLLSTISAILSGTIGIH 332
                           C L     G    +  +W  +P +    + S+++  ++  +G  
Sbjct: 188 ----------------CVLGRFRDGVFWNEDGTWSFSPYYNYTWIWSSLTFGVTVMLGAF 231

Query: 333 YGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            G ++   K +  ++   +S+  G+L++ + + ++  +PI K+L++ S    + G
Sbjct: 232 AGKIMKEGKANRKKVVQTLSV-IGVLLVGLAMLWSLQMPIIKRLWTGSMTLLSGG 285


>gi|295136516|ref|YP_003587192.1| hypothetical protein ZPR_4697 [Zunongwangia profunda SM-A87]
 gi|294984531|gb|ADF54996.1| conserved hypothetical protein [Zunongwangia profunda SM-A87]
          Length = 371

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/146 (34%), Positives = 77/146 (52%), Gaps = 10/146 (6%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R  +LD  RG+TV LMILV++ G     YA   H+ W+G TL D V P FLF+VG A++ 
Sbjct: 5   RYLSLDILRGMTVALMILVNNPGSWATIYAPFKHAAWHGFTLTDLVFPTFLFVVGNAMSF 64

Query: 119 ALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAP-----DALSYGVDMKHIRWCGIL 173
           + KK+   + +  + + +T K      ++  G S+ P     D      ++  IR  G+L
Sbjct: 65  SFKKMN--SWSTPEFLTKTFKRAAIIFLIGLGLSYYPFVRRTDGEFILKNILDIRIMGVL 122

Query: 174 QRIALVYVVVALIETLTTKRRPNVLE 199
           QRIA+ Y++ A+      K+   VL 
Sbjct: 123 QRIAVCYLLAAIAIRFLKKKWLVVLS 148



 Score = 38.1 bits (87), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 43/79 (54%), Gaps = 6/79 (7%)

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF--GLLIIAIILHFTN 368
           PF+PEGLLS + A+    + + +G++   F   S   K  V      GL +I++ L +  
Sbjct: 195 PFDPEGLLSCLPAV----VNVIFGYLAGRFIQQSVNKKKLVIQLVIAGLAMISLALVWDV 250

Query: 369 AIPINKQLYSFSYVCFTAG 387
            +P+NK +++ SYV  + G
Sbjct: 251 ILPVNKPIWTSSYVILSTG 269


>gi|325954677|ref|YP_004238337.1| hypothetical protein [Weeksella virosa DSM 16922]
 gi|323437295|gb|ADX67759.1| hypothetical protein Weevi_1050 [Weeksella virosa DSM 16922]
          Length = 402

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 94/361 (26%), Positives = 142/361 (39%), Gaps = 90/361 (24%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
            K+ R  +LD FRG T+ LMILV++ G     ++ + H+ W+GCT  D V PFFLF VG 
Sbjct: 1   MKTTRYYSLDVFRGATIALMILVNNPGSWSYMFSPLQHASWHGCTPTDLVFPFFLFAVGN 60

Query: 115 AIALALK--KVPKINGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGV----DMKH 166
           A++  +   K+   N   KKII RT+ +   G+ +          + L +      +   
Sbjct: 61  AMSFGMSHLKLQASNVFWKKIIKRTILIFAIGLFINWWPFLKWENNELVFRAWRESEENG 120

Query: 167 IRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIIT 226
           +R  G+LQRIA+     +   TL    R  V+    + I   Y                 
Sbjct: 121 VRIMGVLQRIAIANFFAS---TLAYYYRDRVVLKISILILLFY----------------- 160

Query: 227 TYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSR 286
                   W+ +     GV  Y     + G +G        +D  L G+ H+Y       
Sbjct: 161 --------WALTFFLG-GVDPY----SLEGFIGTK------IDVHLIGLAHMYKGE---- 197

Query: 287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKG---- 342
                                   PF+PEGL STI AI    +G   G V I  +G    
Sbjct: 198 ----------------------GVPFDPEGLYSTIPAISQILLGYLVG-VYIQKQGDIRW 234

Query: 343 -------HSARLKHWVSMGFGLLIIAIILHFTNAI--PINKQLYSFSYVCFTAGAADGCL 393
                   +  +   +S  F L   A+I+ +   +  P NK+++S SYV  T   A   +
Sbjct: 235 FSRSLPASNLPIYRMLSGLFVLGAFALIMGYIWQLDFPYNKKIWSSSYVIHTTALAIFTI 294

Query: 394 G 394
           G
Sbjct: 295 G 295


>gi|410463501|ref|ZP_11317013.1| hypothetical protein B193_1525 [Desulfovibrio magneticus str.
           Maddingley MBC34]
 gi|409983383|gb|EKO39760.1| hypothetical protein B193_1525 [Desulfovibrio magneticus str.
           Maddingley MBC34]
          Length = 371

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/142 (33%), Positives = 75/142 (52%), Gaps = 17/142 (11%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + R+ ++DA RGL +  MI+V++ G     Y ++ H+ W+G TLAD V P FLF+VGV +
Sbjct: 6   TSRLLSVDALRGLAIAAMIVVNNPGDRRFIYPQLLHAHWHGLTLADVVFPLFLFLVGVCV 65

Query: 117 ALA--LKKVPKINGAV---KKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
           ALA  L K     G     +KI+ R   L   G+          +     +    +R  G
Sbjct: 66  ALAIDLDKARDAKGRARLWRKILPRAAVLFALGL---------GETAYLRLSFDELRIPG 116

Query: 172 ILQRIALVYVVVALIETLTTKR 193
           +LQRIA+VY+  A +++  + R
Sbjct: 117 VLQRIAVVYLAAAWLQSRLSSR 138


>gi|152999681|ref|YP_001365362.1| hypothetical protein Shew185_1147 [Shewanella baltica OS185]
 gi|151364299|gb|ABS07299.1| conserved hypothetical protein [Shewanella baltica OS185]
          Length = 384

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 92/374 (24%), Positives = 153/374 (40%), Gaps = 84/374 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVV-----------LMILVDDAGGAYA--RIDHSPWNGCTLAD 102
           ++    R+ +LDA RG  +            L+IL   AG  +   ++ HS W+G    D
Sbjct: 12  VKVNKPRLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYD 71

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDALSYG 161
            + P F+F+ GVA+ L+ K++ K+  + +  ++R  +K LF  ++L   Y+H        
Sbjct: 72  LIFPLFIFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAP 130

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFV 221
            D + IR+  +L RIA  +   AL+   T+ R   ++    L I   Y            
Sbjct: 131 ADPEKIRYASVLGRIAFAWFFAALLVWHTSLRTQIIVA---LGILLGY------------ 175

Query: 222 IYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSD 281
                   L++P                   G  G L P  +   YVD  L         
Sbjct: 176 ----GAMQLWLP----------------FPGGQAGVLSPTESINAYVDSIL--------- 206

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--H 339
                L   +                 R P +PEGLLSTI AI++   G+  GH ++  H
Sbjct: 207 -----LPGVSYQG--------------RTP-DPEGLLSTIPAIVNALTGVFVGHFIVKSH 246

Query: 340 FKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSI 399
            KG  A++    + G  LL    +L     IP+NK+L++ S+V  T+G +   L +  ++
Sbjct: 247 PKGEWAKVGLLAAAGGILLAFGWLLDLV--IPVNKELWTSSFVLVTSGWSMILLAVFYAL 304

Query: 400 FIPEMDWDECDACF 413
            +  + W +    F
Sbjct: 305 -VDVLKWQKSAFFF 317


>gi|217974365|ref|YP_002359116.1| hypothetical protein Sbal223_3208 [Shewanella baltica OS223]
 gi|217499500|gb|ACK47693.1| conserved hypothetical protein [Shewanella baltica OS223]
          Length = 384

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 92/374 (24%), Positives = 153/374 (40%), Gaps = 84/374 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVV-----------LMILVDDAGGAYA--RIDHSPWNGCTLAD 102
           ++    R+ +LDA RG  +            L+IL   AG  +   ++ HS W+G    D
Sbjct: 12  VKVNKPRLVSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYD 71

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDALSYG 161
            + P F+F+ GVA+ L+ K++ K+  + +  ++R  +K LF  ++L   Y+H        
Sbjct: 72  LIFPLFIFLSGVALGLSPKRLDKLPISERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAP 130

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFV 221
            D + IR+  +L RIA  +   AL+   T+ R   ++    L I   Y            
Sbjct: 131 ADPEKIRYASVLGRIAFAWFFAALLVWHTSLRTQIIVA---LGILLGY------------ 175

Query: 222 IYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSD 281
                   L++P                   G  G L P  +   YVD  L         
Sbjct: 176 ----GAIQLWLP----------------FPGGQAGVLSPTESINAYVDSIL--------- 206

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--H 339
                L   +                 R P +PEGLLSTI AI++   G+  GH ++  H
Sbjct: 207 -----LPGVSYQG--------------RTP-DPEGLLSTIPAIVNALTGVFVGHFIVKSH 246

Query: 340 FKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSI 399
            KG  A++    + G  LL    +L     IP+NK+L++ S+V  T+G +   L +  ++
Sbjct: 247 PKGEWAKVGLLAAAGGILLAFGWLLDLV--IPVNKELWTSSFVLVTSGWSMILLAVFYAL 304

Query: 400 FIPEMDWDECDACF 413
            +  + W +    F
Sbjct: 305 -VDVLKWQKVAFFF 317


>gi|410663435|ref|YP_006915806.1| hypothetical protein M5M_04345 [Simiduia agarivorans SA1 = DSM
           21679]
 gi|409025792|gb|AFU98076.1| hypothetical protein M5M_04345 [Simiduia agarivorans SA1 = DSM
           21679]
          Length = 356

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 68/138 (49%), Gaps = 13/138 (9%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R   LDA RGLT+ LMI+V+  G     Y  + H+ W G T  D V PFFLFIVG ++ 
Sbjct: 3   QRYIALDALRGLTLALMIVVNTPGSWAHVYGPLLHADWMGWTFTDLVFPFFLFIVGASLY 62

Query: 118 LALKKVPKINGA--VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR 175
            + K +  +  A  ++KII R+L                 +   + V +  +R  G+LQR
Sbjct: 63  FSQKGMASLTRADQLRKIIRRSLL--------LIVLGVLLEYYPFIVSLHELRLPGVLQR 114

Query: 176 IALVYVVVALIETLTTKR 193
           I L + V AL+      R
Sbjct: 115 IGLAFGVAALLVVFVPAR 132


>gi|29348589|ref|NP_812092.1| hypothetical protein BT_3180 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340494|gb|AAO78286.1| putative transmembrane protein [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 376

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 78/355 (21%), Positives = 143/355 (40%), Gaps = 89/355 (25%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGC 98
           + +L ++ + R+A+LD  RG  + L++       A  R              DH  W G 
Sbjct: 1   MNKLSEKNTTRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGF 60

Query: 99  TLADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHA 154
              D VMP FLF+ G ++  +L K   ++G+     ++I+ R   L  +G+I+QG     
Sbjct: 61  RFWDLVMPLFLFMTGASMPFSLSKYVGMSGSYWLVYRRILRRVFLLFIFGMIVQGNL--- 117

Query: 155 PDALSYGVDMKHIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQW 213
                 G+D  HI  +   LQ IA+ Y++ A+I+              H S    ++WQ 
Sbjct: 118 -----LGLDSSHIYLYSNTLQSIAVGYLIAAVIQL-------------HFS----FRWQ- 154

Query: 214 IGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELW 273
           IG         IT   L++               Y +     G   PA N    VDR   
Sbjct: 155 IG---------ITLLLLFI---------------YWIPMTFLGDFTPAGNFAEQVDR--- 187

Query: 274 GINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAP-FEPEGLLSTISAILSGTIGIH 332
                           C L     G    +  +W  +P +    + S+++  ++  +G  
Sbjct: 188 ----------------CVLGRFRDGVFWNEDGTWSFSPYYNYTWIWSSLTFGVTVMLGAF 231

Query: 333 YGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            G ++   K +  ++   +S+  G+L++ + + ++  +PI K+L++ S    + G
Sbjct: 232 AGKIMKEGKANRKKVVQTLSV-IGVLLVGLAMLWSLQMPIIKRLWTGSMTLLSGG 285


>gi|311274235|ref|XP_003134250.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Sus scrofa]
          Length = 297

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 67/237 (28%), Positives = 106/237 (44%), Gaps = 45/237 (18%)

Query: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRH-----LSIFTAYQWQWIGGFIAFVI 222
           R  G+LQR+ + Y VVA++E L  K  P            L + +++  QW+   +   +
Sbjct: 6   RIPGVLQRLGVTYFVVAVLELLFAKPVPESCASERSCFSLLDVTSSWP-QWLFVLVLEGV 64

Query: 223 YIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHL 278
           ++  T+ L VP          G     +  G  G LG  P C   A GY+DR L G +HL
Sbjct: 65  WLALTFFLPVP----------GCPTGYLGPGGIGDLGKYPNCTGGAAGYIDRLLLGDDHL 114

Query: 279 YSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI 338
           Y  P  + L                     +  ++PEG+L TI++IL   +G+  G +L+
Sbjct: 115 YQHPSPAVLYHT------------------KVAYDPEGILGTINSILMAYLGVQAGKILL 156

Query: 339 HFKGHSA----RLKHWVSMGFGLLIIAIILHFTNA--IPINKQLYSFSYVCFTAGAA 389
           ++K  +     R   W     GL+ +A+     N   IP+NK L+S SYV   + +A
Sbjct: 157 YYKDRTKGILIRFAVWGCF-LGLISVALTKASENEGFIPVNKNLWSTSYVTTLSSSA 212


>gi|113971267|ref|YP_735060.1| hypothetical protein Shewmr4_2932 [Shewanella sp. MR-4]
 gi|113885951|gb|ABI40003.1| conserved hypothetical protein [Shewanella sp. MR-4]
          Length = 395

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 89/374 (23%), Positives = 152/374 (40%), Gaps = 86/374 (22%)

Query: 57  QQKSKRVATLDAFRGLTV-----------VLMILVDDAGGAYA--RIDHSPWNGCTLADF 103
            Q   R+ +LDA RG  +            L+IL   AG  +   ++ HS WNG    D 
Sbjct: 24  SQSKPRLMSLDALRGFDMFWILGGEALFGALLILTGWAGWQWGDTQMHHSEWNGFRFYDL 83

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDALSYGV 162
           + P F+F+ GVA+ L+ K++ K+    +  ++R  +K LF  ++L   Y+H         
Sbjct: 84  IFPLFIFLSGVALGLSPKRLDKLPMHERMPVYRHGIKRLFLLLLLGILYNHGWGT-GVPA 142

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
           D + +R+  +L RIA  +   AL+   T+ R   ++    L I  AY             
Sbjct: 143 DPEKVRYASVLGRIAFAWFFAALLVWHTSLRTQVLVA---LGILVAYG------------ 187

Query: 223 YIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELW-GINHLYSD 281
                  L++P                   G  G L P  +   YVD  L  G+++    
Sbjct: 188 ----AMQLWLP----------------FPGGQAGVLSPTESINAYVDSLLLPGVSYQGRT 227

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--H 339
           P                              +PEG+LST+ A+++   G+  GH ++  H
Sbjct: 228 P------------------------------DPEGVLSTLPAVVNALAGVFVGHFIVKSH 257

Query: 340 FKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSI 399
            KG  A++      G   L +  +L     IP+NK+L++ S+V  T+G +   L +  ++
Sbjct: 258 PKGEWAKVGLLSVAGGVCLALGWLLD--GVIPVNKELWTSSFVLVTSGWSMLLLALFYAL 315

Query: 400 FIPEMDWDECDACF 413
            +  + W +    F
Sbjct: 316 -VDVLKWQKLTFIF 328


>gi|440791267|gb|ELR12512.1| transmembrane protein [Acanthamoeba castellanii str. Neff]
          Length = 825

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 61/107 (57%), Gaps = 7/107 (6%)

Query: 19  QEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMI 78
           +E    K+  + ++   G  +++ Q+  K   ++       S RV +LDA RGL + +MI
Sbjct: 362 EEHQHKKNGGDALDSMYGGGQADRQNGPKPAPRV-------SSRVNSLDAVRGLAIAIMI 414

Query: 79  LVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPK 125
            V+  GG Y   +HS WNG T+AD V P+F++I+G ++A++   + K
Sbjct: 415 FVNYGGGGYWFFNHSAWNGITVADLVFPWFIWIMGTSMAISFTSLEK 461



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/187 (25%), Positives = 79/187 (42%), Gaps = 46/187 (24%)

Query: 212 QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACN-----AVG 266
            W+  F    +Y + T+ L VP                  CG RG+LGP  +     A G
Sbjct: 591 HWLVAFSLLFVYFMITFFLDVPG-----------------CG-RGYLGPDISTATGGAAG 632

Query: 267 YVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILS 326
           Y+D++++  +H+Y+ P       C              P +    ++PEG L  +++I  
Sbjct: 633 YIDKKIFTEDHIYNQPT------CQ-------------PLYLTGSYDPEGTLGNLTSIFM 673

Query: 327 GTIGIHYGHVLIHFKGHSARLKHW----VSMGFGLLIIAIILHFTNAIPINKQLYSFSYV 382
             +G+  G  L+ +K H  R+  W    + +GF  L +          P+NK L+S S++
Sbjct: 674 VFLGLQSGRTLMAWKDHKHRVVRWYIWSIVLGFIALGLCEAKQNGGFFPLNKNLWSPSFI 733

Query: 383 CFTAGAA 389
             TA  A
Sbjct: 734 LATAAMA 740


>gi|254784997|ref|YP_003072425.1| hypothetical protein TERTU_0813 [Teredinibacter turnerae T7901]
 gi|237684955|gb|ACR12219.1| putative membrane protein [Teredinibacter turnerae T7901]
          Length = 354

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/138 (36%), Positives = 69/138 (50%), Gaps = 9/138 (6%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           ++R   LDA RG+T+ +MILV+  G     Y  + H+ W+G T  DFV PFFLFIVG A+
Sbjct: 2   NERSLALDALRGITLAMMILVNTPGSWSHVYPPLLHANWHGVTPTDFVFPFFLFIVGCAL 61

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 176
             + +K  +++     I    LK +F   +L         A  Y       R  G+LQRI
Sbjct: 62  FFSNRKNHQLD-----IYTHALK-IFRRTVLLLLAGLGLHAYLYSGTFAEFRLPGVLQRI 115

Query: 177 ALVYVVVALIETLTTKRR 194
           AL Y   A I  L  + R
Sbjct: 116 ALAYGAAAFIVWLPVRAR 133



 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 5/78 (6%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 371
           F+PEGLLST+ AI++   G     +++        L    ++  G+   A++LH    +P
Sbjct: 183 FDPEGLLSTLPAIVTVLSGYEATRIIVERTTQQKVLVIIAALAIGM---ALLLH--PWVP 237

Query: 372 INKQLYSFSYVCFTAGAA 389
           INK L++ SYV  T+G A
Sbjct: 238 INKYLWTSSYVLLTSGVA 255


>gi|312131163|ref|YP_003998503.1| hypothetical protein Lbys_2486 [Leadbetterella byssophila DSM
           17132]
 gi|311907709|gb|ADQ18150.1| hypothetical protein Lbys_2486 [Leadbetterella byssophila DSM
           17132]
          Length = 413

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 95/376 (25%), Positives = 141/376 (37%), Gaps = 120/376 (31%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVG---- 113
           +R+ +LD FRGLT++LMI V++ G     YA + H+ W+G T  D V PFF+F +G    
Sbjct: 6   QRIVSLDVFRGLTMILMITVNNPGDWSNVYAPLLHAEWHGWTPTDLVFPFFVFAMGMALP 65

Query: 114 ------------------------VAIALALKKVPKIN----GAVKKIIFRTLKLLFWGI 145
                                   +A+ L L    KI       +  ++FR +   F G 
Sbjct: 66  FSMKPGSGLSKDDFLKILARSARLIALGLFLNFFSKIEFGNAQGITLLLFRLMITGFVGF 125

Query: 146 ILQGGYSHAPDALSYGV--------------DMKHIRWCGILQRIALVYVVVALIETLTT 191
           +L G +       +                     +R  G+LQR+  VY   A++    +
Sbjct: 126 LLMGNFPTKIKLYTALALLGLMLALAYSGLPHFAQVRIPGVLQRLGTVYFFAAILYLAFS 185

Query: 192 KRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
            R                  QW  G    VIY +    + VP          GV  +   
Sbjct: 186 LRV-----------------QWGIGLSVLVIYWLLLAYIPVPG--------SGVTGF--- 217

Query: 252 CGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAP 311
              +G   PA     ++D  + G      D VWS                         P
Sbjct: 218 --EKGENLPA-----WIDSIVLG------DHVWS----------------------SSKP 242

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 371
           ++PEG+LST+ AI+S  +G   G  L   K      K  +  G  LLI    L ++   P
Sbjct: 243 WDPEGVLSTLPAIISCLLGAWAGVFLREDK------KKLLLTGVILLICG--LAWSTFFP 294

Query: 372 INKQLYSFSYVCFTAG 387
           INK L++ S+V  TAG
Sbjct: 295 INKALWTSSFVLLTAG 310


>gi|117921549|ref|YP_870741.1| hypothetical protein Shewana3_3111 [Shewanella sp. ANA-3]
 gi|117613881|gb|ABK49335.1| conserved hypothetical protein [Shewanella sp. ANA-3]
          Length = 395

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 85/348 (24%), Positives = 143/348 (41%), Gaps = 85/348 (24%)

Query: 57  QQKSKRVATLDAFRGLTV-----------VLMILVDDAGGAYA--RIDHSPWNGCTLADF 103
           +Q   R+ +LDA RG  +            L++L   AG  +   ++ HS WNG    D 
Sbjct: 24  RQSKPRLMSLDALRGFDMFWILGGEALFGALLMLTGWAGWQWGDTQMHHSEWNGFRFYDL 83

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDALSYGV 162
           + P F+F+ GVA+ L+ K++ K+    +  ++R  +K LF  ++L   Y+H         
Sbjct: 84  IFPLFIFLSGVALGLSPKRLDKLPMQERMPVYRHGIKRLFLLLLLGILYNHGWGT-GAPA 142

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
           D + +R+  +L RIA  +   AL+   T+ R   ++    L I  AY             
Sbjct: 143 DPEKVRYASVLGRIAFAWFFAALLVWHTSLRTQVLVA---LGILVAY------------- 186

Query: 223 YIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELW-GINHLYSD 281
                  L++P                   G  G L P  +   YVD  L  G+++    
Sbjct: 187 ---GAVQLWLP----------------FPGGQAGVLSPTESINAYVDSLLLPGVSYQGRT 227

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--H 339
           P                              +PEG+LST+ A+++   G+  GH ++  H
Sbjct: 228 P------------------------------DPEGVLSTLPAVVNALAGVFVGHFIVKSH 257

Query: 340 FKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            KG  A++      G   L +  +L     IP+NK+L++ S+V  T+G
Sbjct: 258 PKGEWAKVGLLSVAGGVCLALGWLL--GGVIPVNKELWTSSFVLVTSG 303


>gi|322785719|gb|EFZ12357.1| hypothetical protein SINV_16151 [Solenopsis invicta]
          Length = 111

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 48/81 (59%), Gaps = 7/81 (8%)

Query: 40  SEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCT 99
           + V+DE   +       + K  RV  +D FRG++ + MI V+D  G+Y  ++H+ WNG  
Sbjct: 32  NNVKDESSNK-------EPKKNRVKAIDTFRGISTLFMIFVNDGSGSYTVLEHATWNGLL 84

Query: 100 LADFVMPFFLFIVGVAIALAL 120
           L D V P F++I+GV + +AL
Sbjct: 85  LGDLVFPCFIWIMGVCVPIAL 105


>gi|297567057|ref|YP_003686029.1| hypothetical protein [Meiothermus silvanus DSM 9946]
 gi|296851506|gb|ADH64521.1| Protein of unknown function DUF2261, transmembrane [Meiothermus
           silvanus DSM 9946]
          Length = 377

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 56/166 (33%), Positives = 82/166 (49%), Gaps = 24/166 (14%)

Query: 38  ERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVD----DAGGAYARIDHS 93
           +    QD+Q  E       ++ + R+ +LD FRGLT++LM+LV+    DA   Y  + H+
Sbjct: 6   DNPPTQDQQT-ETPFPS--RKTAMRLGSLDVFRGLTILLMLLVNNVALDANTPYL-LTHA 61

Query: 94  PWN-GCTLADFVMPFFLFIVGVAIALALKKVPKINGAV----KKIIFRTLKLLFWGIILQ 148
           PW  G  LAD V P+FL  VGVAI  A     K N        KII R++ L   G+++ 
Sbjct: 62  PWKGGVYLADLVFPWFLLAVGVAIPFAAASFRKKNLPSWRYDLKIIQRSIVLFGLGLLIV 121

Query: 149 GGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRR 194
              +  P    + +D        +LQ IA+ Y+V A +  L   RR
Sbjct: 122 SSIARRP---VFALD--------VLQLIAMAYLVAAWLYDLPAHRR 156


>gi|311031971|ref|ZP_07710061.1| hypothetical protein Bm3-1_15792 [Bacillus sp. m3-13]
          Length = 370

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 80/330 (24%), Positives = 135/330 (40%), Gaps = 81/330 (24%)

Query: 61  KRVATLDAFRGLTVVLMILVDD--AGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           KR  ++D  RG+ V++ + V     G  Y  + H+ W G T+ D V P FL + G+ +A+
Sbjct: 10  KRYRSIDVTRGIVVLVSVFVSALPGGAEYDFLRHAYWYGLTITDLVFPAFLTVYGIGLAI 69

Query: 119 ALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIAL 178
             +K  +     K ++ RT  L+ +G++            S+  D+  +R+ G+LQ  A+
Sbjct: 70  VYRKGVRW----KDLLRRTFLLVLYGLLFN-------LIASWSFDLSTLRFTGVLQLFAI 118

Query: 179 VYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY-SLYVPNWSF 237
             + V ++  L                  A  W+     +A  + I T Y S+ V +   
Sbjct: 119 TGLGVVVLSYL------------------AKGWK---SMLALGMVIATAYLSILVIS--- 154

Query: 238 SEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNS 297
           S   + GV +              CN  G VD  ++G  H+Y+                 
Sbjct: 155 SVGCEGGVPQ------------RDCNPSGVVDVLVFGEKHMYAQG--------------- 187

Query: 298 GPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGL 357
                         F+PEG+LS  SA+ +   G   G VL    G    L+    +  GL
Sbjct: 188 -----------EKGFDPEGILSIFSALSNVAFGFAVGLVL---NGRKQILQRVFGISIGL 233

Query: 358 LIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           + +A I  F N I +NK+L++ S+    +G
Sbjct: 234 ISLAFI--FNNFIELNKRLWTPSFAILASG 261


>gi|338212226|ref|YP_004656281.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336306047|gb|AEI49149.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 369

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 86/352 (24%), Positives = 142/352 (40%), Gaps = 101/352 (28%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAY-----------------ARIDHSPWNGCTLADF 103
            R+A+LDA RG  ++++      GGA+                 A+  H  W+G T  DF
Sbjct: 8   SRLASLDALRGFDMLMI----SGGGAFLSLMGGKTDSALLNAVAAQFHHPDWDGFTFYDF 63

Query: 104 VMPFFLFIVGVAIALALKK-----VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDAL 158
           + P FLF+ GV+++++LK      +P+    ++K+  R L L F G++ +    +AP  +
Sbjct: 64  IFPLFLFMAGVSLSISLKNGIAKGIPQYK-LMEKVFKRMLILFFLGLLDK----NAPIDI 118

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFI 218
              +D  HIR+  +L RI +   +VA+                 L + T ++ Q I  F 
Sbjct: 119 ---LDPAHIRYGTVLGRIGIATFLVAI-----------------LYLNTGWKTQLIVAF- 157

Query: 219 AFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHL 278
                  T   LY          D+G           G+L    N VG++DR        
Sbjct: 158 -------TILGLYYAALMLISVGDYG----------GGNLSFEGNLVGWIDRAF------ 194

Query: 279 YSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI 338
               +  RL+  T                    ++   + + +SA      G   G +L+
Sbjct: 195 ----MPGRLKQTT--------------------YDELAMTTQLSATCLTIFGSLAGKILL 230

Query: 339 HFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAD 390
                +A +K     G G++ IA  L +    PINK L+S S++  TAG A 
Sbjct: 231 --DKTTANIKLIRLAGMGVIGIAAGLAWATVFPINKHLWSSSFILLTAGMAS 280


>gi|373953356|ref|ZP_09613316.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
 gi|373889956|gb|EHQ25853.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
          Length = 404

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 76/306 (24%), Positives = 119/306 (38%), Gaps = 80/306 (26%)

Query: 89  RIDHSPWNGCTLADFVMPFFLFIVGVAIALA----LKKVPKINGAVK-KIIFRTLKLLFW 143
           ++ HSPWNG T  D + P F+FI G+++  +    + + P  N  +  ++I RT+ L+  
Sbjct: 79  QLHHSPWNGFTFYDLIFPLFIFIAGISMPFSYNRQVAQSPSSNKQIYVRLIKRTVLLILL 138

Query: 144 GIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHL 203
           G ++ G    A          +  R+  +L RIAL     A+I   ++ R          
Sbjct: 139 GTVVNGALHFA--------GYQQTRFASVLGRIALACFFAAVIYLNSSLR---------- 180

Query: 204 SIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACN 263
                  WQ I   +  + Y +    + VP         HG           G L P  N
Sbjct: 181 -------WQIIWFAVILLGYWLLMALVPVPG--------HGA----------GVLTPGAN 215

Query: 264 AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISA 323
              ++D+                           G L        R  ++PEGLLSTI A
Sbjct: 216 LSAWIDQHFL-----------------------PGKLH-------RKVYDPEGLLSTIPA 245

Query: 324 ILSGTIGIHYGHVLIHFKGH--SARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSY 381
           I +  +GI  GH L    G   S   K  +    G+ +I I L +  A PINK +++ S+
Sbjct: 246 IATAMMGIFTGHFLQWEPGERLSPLKKIGIMAAAGISLILIALIWNMAFPINKNMWTSSF 305

Query: 382 VCFTAG 387
             +  G
Sbjct: 306 TLYAGG 311


>gi|408673387|ref|YP_006873135.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
 gi|387855011|gb|AFK03108.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
          Length = 423

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 54/83 (65%), Gaps = 3/83 (3%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R+ ++D FRG+T++LM +V++ G     YA ++H+ W+GCT  D V PFFLFIVG++  
Sbjct: 3   QRLTSIDVFRGMTIMLMTIVNNPGDWSHIYAPLEHAEWHGCTPTDLVFPFFLFIVGISTV 62

Query: 118 LALKKVPKINGAVKKIIFRTLKL 140
           L+       +   ++II R L++
Sbjct: 63  LSSPVKRFDSNTFERIITRALRI 85



 Score = 44.7 bits (104), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 48/85 (56%), Gaps = 6/85 (7%)

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHF--TNAI 370
           +PEG+LSTI AI +   G+  G +L +    + +  + +S G    +I +++ F   +  
Sbjct: 240 DPEGILSTIPAIGTALAGVFTGKLLTNDFPKNKKAIYLLSAG----VIGVMIGFLWNDYF 295

Query: 371 PINKQLYSFSYVCFTAGAADGCLGI 395
           PINK L++ SYV + AG A   LG+
Sbjct: 296 PINKALWTSSYVLYVAGWALLVLGV 320


>gi|315498708|ref|YP_004087512.1| hypothetical protein Astex_1695 [Asticcacaulis excentricus CB 48]
 gi|315416720|gb|ADU13361.1| hypothetical protein Astex_1695 [Asticcacaulis excentricus CB 48]
          Length = 378

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 49/144 (34%), Positives = 73/144 (50%), Gaps = 22/144 (15%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARID--------HSPWNGCTLADFVMPFF 108
           +  ++R+ +LD  RGLTV+ MILV+   G Y  +         H+ W G  +AD V P F
Sbjct: 5   KTATQRLPSLDVLRGLTVIGMILVNATAGMYYGLQAKVFPLLLHAHWEGLKIADVVFPAF 64

Query: 109 LFIVGVAIALALKKVPKING----AVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGV 162
           L +VG++I +AL +     G      +KI  R L+L   G +L   G  +H         
Sbjct: 65  LTMVGLSIPMALNRAKMTTGLDVAQARKIGGRVLRLFLIGWLLSNLGWLAH--------F 116

Query: 163 DMKHIRWCGILQRIALVYVVVALI 186
           D +  R+ G+LQRI LVY   A++
Sbjct: 117 DGEPWRFWGVLQRIGLVYGAAAVL 140


>gi|408369302|ref|ZP_11167083.1| hypothetical protein I215_00330 [Galbibacter sp. ck-I2-15]
 gi|407745048|gb|EKF56614.1| hypothetical protein I215_00330 [Galbibacter sp. ck-I2-15]
          Length = 345

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 78/316 (24%), Positives = 131/316 (41%), Gaps = 78/316 (24%)

Query: 77  MILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGA--VK 131
           MI+V+  G     Y  + H+ W+G TL D V P FLF+VG A++ +++K  + + A  +K
Sbjct: 1   MIIVNTPGSWGSVYRPLSHASWHGFTLTDLVFPTFLFVVGNAMSFSMRKFEQTSQAAFLK 60

Query: 132 KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTT 191
           K+I RT  +   G +L          L     ++  R  G+LQRIAL Y   +L+     
Sbjct: 61  KVIKRTFVIFAIGFLLSWFPFFRDGQLK---PLEDARIFGVLQRIALCYFFASLV----- 112

Query: 192 KRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVK 251
                              +  I G + F +  +  Y L +  ++  +++  G       
Sbjct: 113 -----------------IHYFKIKGALVFSMVALLGYHLIM--YTMGDYTLEG------- 146

Query: 252 CGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAP 311
                      NA   +D  L G NHLY                               P
Sbjct: 147 -----------NAALKLDLWLLGPNHLYQGE--------------------------GIP 169

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 371
           F+PEGLLST+ A ++   G ++  + +   G + +    + +G G  ++ + L +    P
Sbjct: 170 FDPEGLLSTLPATVNVIFG-YFAGLFLQQSGKNFKTIALLMIG-GATLVFLALGWDLLFP 227

Query: 372 INKQLYSFSYVCFTAG 387
           INK++++ SYV  T G
Sbjct: 228 INKKIWTSSYVLLTVG 243


>gi|260790699|ref|XP_002590379.1| hypothetical protein BRAFLDRAFT_76652 [Branchiostoma floridae]
 gi|229275571|gb|EEN46390.1| hypothetical protein BRAFLDRAFT_76652 [Branchiostoma floridae]
          Length = 347

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 63/258 (24%), Positives = 112/258 (43%), Gaps = 46/258 (17%)

Query: 108 FLFIVGVAIALA---LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDM 164
           F+FI+G ++AL+   ++K       V ++I R+ KL   G  L  G+           D+
Sbjct: 66  FVFIMGTSMALSFRGMRKRTSTRRVVFRVITRSAKLFLVGFFLNAGHGRN--------DL 117

Query: 165 KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQ-----------W-Q 212
             +R  G+LQR+++ Y+V   IE    K R +  E   L+  T  +           W  
Sbjct: 118 GTVRVPGVLQRLSIAYLVSGFIECFVGKERKSSDERSRLTNPTLQKIHNALRDIVDNWAA 177

Query: 213 WIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDREL 272
           W+   +  VI++I T+ L VP       + +      +      +L     A GY+DR +
Sbjct: 178 WLLHLLILVIHLIITFLLPVPGCP----TGYLGPGGPLLGDGVEYLNCTGGAAGYIDRLI 233

Query: 273 WGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIH 332
            G +H+Y  P                          +  F+PEG+L +++ I +  +G+ 
Sbjct: 234 LG-SHMYQTPTVRVFYK------------------TKVAFDPEGILGSLTTIFNCFLGLQ 274

Query: 333 YGHVLIHFKGHSARLKHW 350
            G +L+++K HS+R+  W
Sbjct: 275 AGKILVYYKEHSSRIIRW 292


>gi|380512476|ref|ZP_09855883.1| hypothetical protein XsacN4_14717 [Xanthomonas sacchari NCPPB 4393]
          Length = 384

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 45/126 (35%), Positives = 66/126 (52%), Gaps = 15/126 (11%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R  +LD FRGLT+ LMILV+  G    A+ ++ H+PW G T AD V P FLF VG A++
Sbjct: 16  ERFLSLDVFRGLTIFLMILVNTPGAGADAFVQLRHTPWFGFTAADLVFPSFLFAVGNAMS 75

Query: 118 LALKKVPKINGAVKKI------IFRTLKLLFW-GIILQGGYSHAPDALSYGVDMKHIRWC 170
            AL +   +   ++++      IF    L++W   + QG   H          +   R  
Sbjct: 76  FALDRGQPLGAFLRRVGKRSALIFLLGFLMYWFPFVHQGADGHWSL-----TAIDQTRVP 130

Query: 171 GILQRI 176
           G+LQRI
Sbjct: 131 GVLQRI 136


>gi|329851798|ref|ZP_08266479.1| hypothetical protein ABI_45670 [Asticcacaulis biprosthecum C19]
 gi|328839647|gb|EGF89220.1| hypothetical protein ABI_45670 [Asticcacaulis biprosthecum C19]
          Length = 398

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 89/354 (25%), Positives = 153/354 (43%), Gaps = 81/354 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
           +  S R   LD  RGL ++ M+L ++AG     Y  +DH+ W+G TL D V P F+  VG
Sbjct: 22  KPASVRFEALDILRGLFIIGMLLANNAGDWSHIYTPLDHAEWHGFTLTDMVFPGFMTCVG 81

Query: 114 VAIALALKKVPKI----NGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGVDMK 165
           +++ L+L +  K      G    ++  +L+    L+  G+ L          L    D +
Sbjct: 82  LSMTLSLGRRQKTLNSQAGGKAALLVHSLRRAAILVGIGLFLN---------LLPQFDFE 132

Query: 166 HIRWCGILQRIALVYVVVALIETLTTKRRPN---VLEPRHLSIFTAYQWQWIGGFIAFVI 222
           H R  G+LQRI + Y + + +  L + +      +L  R L++       W  GF+  V 
Sbjct: 133 HWRLPGVLQRIGICYAIASGLVVLHSHQNQQGGLILHSRALAL-------WGVGFL--VA 183

Query: 223 YIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDP 282
           Y +    + VP+ + +   D              H  PA     +VD ++ G+NH     
Sbjct: 184 YTLLLKYVPVPDGAGANQWD------------AIHSWPA-----WVDMQVLGVNH----- 221

Query: 283 VWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKG 342
           VWS  +                       ++PEGLLS++ A  +   GI  G   ++   
Sbjct: 222 VWSGAKT----------------------YDPEGLLSSVPATSNILFGILMG---LYINT 256

Query: 343 HSARLKHWVSMGF-GLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
            + R   W  +   G+L++ + L   + +PI K+L++ S+V  + G A   L +
Sbjct: 257 RTPR-NAWGGVAIIGVLLMLLALVLDSYVPIIKKLWTPSFVLLSCGFAFTVLAV 309


>gi|88859970|ref|ZP_01134609.1| hypothetical protein PTD2_18200 [Pseudoalteromonas tunicata D2]
 gi|88817964|gb|EAR27780.1| hypothetical protein PTD2_18200 [Pseudoalteromonas tunicata D2]
          Length = 378

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 79/353 (22%), Positives = 141/353 (39%), Gaps = 95/353 (26%)

Query: 57  QQKSKRVATLDAFRGLTV-----------VLMILVDDAGG--AYARIDHSPWNGCTLADF 103
           + + +R+A+LDA RG+ +            L IL    G   A+ +  HS W+G T  D 
Sbjct: 7   KPQKRRLASLDALRGMDMFWILGGEKIFAALFILTGWTGWQVAHGQTLHSNWHGFTFYDL 66

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIF-RTLKLLF----WGIILQGGYSHAPDAL 158
           + P F+F+ GVA+ L+ K++  +    +++ + + LK LF    +G++   G+       
Sbjct: 67  IFPLFIFLAGVAMGLSPKRIDHLPFQERRVYYAKALKRLFLLAGFGVLYNHGWGTGIP-- 124

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFI 218
               +++ IR+  +L RIA+ + V  ++   T+ R                         
Sbjct: 125 ---FNLEEIRYASVLGRIAIAWFVCVMLVWHTSLRTQ----------------------- 158

Query: 219 AFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHL 278
                IIT  SL +  W              V  G  G L  A     +VD+ L      
Sbjct: 159 -----IITAVSLLIGYWLLLCFVP-------VPGGQAGDLTMAGTWNAWVDQHL------ 200

Query: 279 YSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI 338
                   L   T  +  +               +PEGLLS + A+++  +G+  G ++ 
Sbjct: 201 --------LPGITYQNRAT---------------DPEGLLSNVPAVVNALMGVFAGRLI- 236

Query: 339 HFKGHSARLKHWVSM----GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
                + +   W ++    G GL  +A+   +    P+NK L++ S+V  T G
Sbjct: 237 ---AKANQFGEWKTVSYLFGAGLTSLALGWAWNMVFPVNKDLWTSSFVLVTVG 286


>gi|116748970|ref|YP_845657.1| hypothetical protein Sfum_1534 [Syntrophobacter fumaroxidans MPOB]
 gi|116698034|gb|ABK17222.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB]
          Length = 374

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 68/132 (51%), Gaps = 12/132 (9%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + R+A+LDAFRG  +  MILV+  G     Y+++ H+ WNG T AD + P FLF+VGV++
Sbjct: 7   NTRLASLDAFRGAVIAGMILVNSPGRWVYTYSQLKHAQWNGWTFADTIFPAFLFVVGVSM 66

Query: 117 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDM--KHIRWCGILQ 174
             +  +  +      +++ +  +                  L+  +D    ++R  G+LQ
Sbjct: 67  VFSFSRRRECEEPAWRLVLQVFR-------RTSLIFLLGLLLNVMLDFHGSNLRIPGVLQ 119

Query: 175 RIALVYVVVALI 186
           RIA  Y V +LI
Sbjct: 120 RIAACYFVASLI 131


>gi|345322030|ref|XP_003430524.1| PREDICTED: hypothetical protein LOC100681967 [Ornithorhynchus
           anatinus]
          Length = 530

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 62/109 (56%), Gaps = 6/109 (5%)

Query: 43  QDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLAD 102
            D+ +G+  L   +    +R  T      L++ LM+ V+  GG Y   +H+PWNG T+AD
Sbjct: 103 SDQVRGQRHLLGCVGGGVRRTPTPSPL--LSLTLMVFVNYGGGGYWFFEHAPWNGLTVAD 160

Query: 103 FVMPFFLFIVGVAIALALKKVPK--INGA--VKKIIFRTLKLLFWGIIL 147
            VMP+F+FI+G ++ALA   + +  +N    ++K+ +RT  L+  G+  
Sbjct: 161 LVMPWFVFILGTSVALAFYAMRRRGVNRVQLLRKLTWRTAVLMIIGLFF 209


>gi|119774084|ref|YP_926824.1| hypothetical protein Sama_0947 [Shewanella amazonensis SB2B]
 gi|119766584|gb|ABL99154.1| conserved hypothetical protein [Shewanella amazonensis SB2B]
          Length = 378

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 84/356 (23%), Positives = 144/356 (40%), Gaps = 91/356 (25%)

Query: 53  QQLLQQKSK--RVATLDAFRGLTV--------VLMILVDDAGGAY-----ARIDHSPWNG 97
            Q  Q K+   R+ +LDA RG  +        + + L    G ++     A + HS W+G
Sbjct: 1   MQATQTKAAKPRLMSLDALRGFDMFWILGGEKLFIALFALTGWSFWQLADAEMHHSEWHG 60

Query: 98  CTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPD 156
            T  D + P F+F+ GVA+ L+ K++ K+  A +  I+R  +K LF  + L   Y+H   
Sbjct: 61  FTFYDLIFPLFIFLSGVALGLSPKRLDKLAPAERNPIYRHAVKRLFLLLALGVLYNHG-- 118

Query: 157 ALSYGVDM----KHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQ 212
              +G  +      +R+  +L RIA  +   AL+   T+ R           I TA    
Sbjct: 119 ---WGTGIPAHSDEVRYASVLGRIAFAWFFAALLVWHTSLRT---------QIATALA-- 164

Query: 213 WIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRE- 271
                   +++      L++P                V  G  G L P+ +   +VD   
Sbjct: 165 --------ILFGYAAIQLWLP----------------VPGGQAGVLTPSGSINAWVDTHF 200

Query: 272 LWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGI 331
           L GI + +                               P++PEG+LST+ AI++  +G+
Sbjct: 201 LPGITYQHR------------------------------PYDPEGILSTLPAIVNALMGV 230

Query: 332 HYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
             G  ++         K  +  G G L + +     + +P+NK L++ S+V  T G
Sbjct: 231 FVGRFIVKPDARGDWAKAGILTGAGGLSLVLGWSLDSVLPVNKDLWTSSFVLVTTG 286


>gi|24375008|ref|NP_719051.1| N-acetylglucosamine locus membrane protein of unknown function
           DUF1624 NagX [Shewanella oneidensis MR-1]
 gi|24349746|gb|AAN56495.1| N-acetylglucosamine locus membrane protein of unknown function
           DUF1624 NagX [Shewanella oneidensis MR-1]
          Length = 395

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 87/347 (25%), Positives = 142/347 (40%), Gaps = 83/347 (23%)

Query: 57  QQKSKRVATLDAFRGLTV-----------VLMILVDDAGGAYA--RIDHSPWNGCTLADF 103
            Q   R+ +LDA RG  +            L+I    AG  +   ++ HS W+G  L D 
Sbjct: 24  SQPKPRLMSLDALRGFDMFWILGGEALFGALLIFTGWAGWQWGDTQMHHSEWHGFRLYDL 83

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDALSYGV 162
           + P F+F+ GVA+ L+ K++ K+    +  ++R  +K LF  ++L   Y+H        V
Sbjct: 84  IFPLFIFLSGVALGLSPKRLDKLPLHERLPVYRHGVKRLFLLLLLGILYNHGWGT-GAPV 142

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
           D   IR+  +L RIA  +   AL+   T+ R   ++    L  + A Q            
Sbjct: 143 DPDKIRYASVLGRIAFAWFFAALLVWHTSLRTQVLVAVGILVGYGAMQ------------ 190

Query: 223 YIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDP 282
                  L++P                   G  G L P  +   YVD  L          
Sbjct: 191 -------LWLP----------------FPGGQAGVLSPTVSINAYVDSLL---------- 217

Query: 283 VWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--HF 340
               L   +                 R P +PEG+LST+ A+++   G+  GH ++  H 
Sbjct: 218 ----LPGVSYQG--------------RMP-DPEGVLSTLPAVVNALAGVFVGHFIVKSHP 258

Query: 341 KGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           KG  A++    + G   L +  +L     IP+NK+L++ S+V  T+G
Sbjct: 259 KGEWAKVGLLGAAGGVCLALGWLLD--AVIPVNKELWTSSFVLVTSG 303


>gi|383124758|ref|ZP_09945419.1| hypothetical protein BSIG_1496 [Bacteroides sp. 1_1_6]
 gi|251841090|gb|EES69171.1| hypothetical protein BSIG_1496 [Bacteroides sp. 1_1_6]
          Length = 376

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 79/355 (22%), Positives = 143/355 (40%), Gaps = 89/355 (25%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGC 98
           + +L +  + R+A+LD  RG  + L++       A  R              DH  W G 
Sbjct: 1   MSKLSENNTSRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGF 60

Query: 99  TLADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHA 154
              D VMP FLF+ G ++  +L K   ++G+     ++I+ R   L  +G+I+QG     
Sbjct: 61  RFWDLVMPLFLFMTGASMPFSLSKYVGMSGSYWPVYRRILRRVFLLFIFGMIVQGNL--- 117

Query: 155 PDALSYGVDMKHIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQW 213
                 G+D  HI  +   LQ IA+ Y + A+I+              H S    ++WQ 
Sbjct: 118 -----LGLDSSHIYLYSNTLQSIAVGYFIAAVIQL-------------HFS----FRWQ- 154

Query: 214 IGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELW 273
           IG         IT   L++               Y +     G   PA N    VDR  W
Sbjct: 155 IG---------ITLLLLFI---------------YWIPMTFLGDFTPAGNFAEQVDR--W 188

Query: 274 GINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAP-FEPEGLLSTISAILSGTIGIH 332
            +        W+                ED  +W  +P +    + S+++  ++  +G  
Sbjct: 189 VLGRFRDGVFWN----------------EDG-TWSFSPYYNYTWIWSSLTFGVTVMLGAF 231

Query: 333 YGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            G ++   K +  ++   +S+  G+L++ + + ++  +PI K+L++ S    + G
Sbjct: 232 AGKIMKEGKANRKKVVQTLSV-IGVLLVGLAMLWSLQMPIIKRLWTGSMTLLSGG 285


>gi|338211253|ref|YP_004655306.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336305072|gb|AEI48174.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 374

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 89/347 (25%), Positives = 137/347 (39%), Gaps = 91/347 (26%)

Query: 59  KSKRVATLDAFRGLTV--------VLMILVDDAGGAYA-----RIDHSPWNGCTLADFVM 105
           +  R+ +LDA RG  +        +  +L    G   A     ++ H  WNG    D + 
Sbjct: 9   RPHRLLSLDALRGFDMFWITGGEEIFHLLAKATGWTGAIIMAEQLSHPDWNGFRAYDLIF 68

Query: 106 PFFLFIVGV----AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG 161
           P FLF+ GV    ++ + L++       ++K+I R L L+  GII   G    P      
Sbjct: 69  PLFLFLSGVSAPYSLGVRLERGDDRGKMLRKVIQRGLTLVLLGIIYNNGLQIKP------ 122

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFV 221
             ++ +R+  +L RI L  +   +I   T+ R               Y W        FV
Sbjct: 123 --LEDMRFPSVLGRIGLAGMFAQIIYLYTSTR-------------VQYIW--------FV 159

Query: 222 IYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSD 281
                  SL +  W+F           +  CG  G +   CN V Y+DR           
Sbjct: 160 -------SLLLGYWAFVMLVP------VPGCG-AGLMTMECNPVSYLDR----------- 194

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK 341
                     L  P  G L +D         +PEGL+STI AI +G +GI  G++L   +
Sbjct: 195 ----------LIIP--GHLHKDI-------HDPEGLVSTIPAIATGLLGIFAGNLLRADE 235

Query: 342 GHSARL-KHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
             ++R  K  V    G+L + I   +    PINK L++ S+V    G
Sbjct: 236 RSTSRTQKVLVLFVAGILFLIIGKLWDYVFPINKNLWTSSFVMTVGG 282


>gi|326801867|ref|YP_004319686.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326552631|gb|ADZ81016.1| hypothetical protein Sph21_4499 [Sphingobacterium sp. 21]
          Length = 376

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 43/143 (30%), Positives = 73/143 (51%), Gaps = 19/143 (13%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIV 112
           ++++  R   LD FRG+T+  MI+V+  G     Y  ++H+ W+G T  D V P FLF V
Sbjct: 1   MKEQKLRFTALDVFRGMTICFMIIVNSPGSGATPYWPLNHATWHGFTPTDLVFPSFLFAV 60

Query: 113 GVAIALALKKVPKINGAVKKIIFRTLK----------LLFWGIILQGGYSHAPDALSYGV 162
           G A++ + +K   ++   K+++    K          L++W    +    H  + +S+  
Sbjct: 61  GNALSFSERKFQYLSS--KQVLLTIFKRAALIFLLGFLMYWFPFFKITEQH--EIISF-- 114

Query: 163 DMKHIRWCGILQRIALVYVVVAL 185
            +   R  G+LQRIAL Y+  AL
Sbjct: 115 PLHETRVFGVLQRIALCYLFTAL 137


>gi|195167204|ref|XP_002024424.1| GL15027 [Drosophila persimilis]
 gi|194107797|gb|EDW29840.1| GL15027 [Drosophila persimilis]
          Length = 493

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 28/50 (56%), Positives = 36/50 (72%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFF 108
           + KR+ +LD FRGL++VLMI V+  GG YA I+H+ WNG  LAD V PF 
Sbjct: 181 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADLVFPFL 230


>gi|380693009|ref|ZP_09857868.1| hypothetical protein BfaeM_03398 [Bacteroides faecis MAJ27]
          Length = 376

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 58/209 (27%), Positives = 92/209 (44%), Gaps = 45/209 (21%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLADF 103
           +  + R+A+LD  RG  + L++       A  R              DH  W G    D 
Sbjct: 6   ENNTSRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGFRFWDL 65

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLF----WGIILQGGYSHAPDALS 159
           VMP FLF+ G ++  +L K    +G+ + +  R LK +F    +G+I+QG          
Sbjct: 66  VMPLFLFMTGASMPFSLSKYIGTSGSYRPVYRRILKRVFLLFVFGMIVQGN--------L 117

Query: 160 YGVDMKHIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQW-IGGF 217
            G+D KHI  +   LQ IA+ Y++ A+I+              H S    ++WQ  I   
Sbjct: 118 LGLDGKHIYLYSNTLQSIAVGYLIAAVIQL-------------HFS----FKWQIGITLL 160

Query: 218 IAFVIYIITTY-SLYVPNWSFSEHSDHGV 245
           + FV +I  T+   + P  +F+E  D  V
Sbjct: 161 LLFVYWIPMTFLGDFTPAGNFAEQVDRWV 189


>gi|288929890|ref|ZP_06423732.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
           F0108]
 gi|288328709|gb|EFC67298.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
           F0108]
          Length = 399

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 83/170 (48%), Gaps = 19/170 (11%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            + + R+ ++D  RGLT+  MI V++ G     YA ++H+ WNG T  D V PFF+ ++G
Sbjct: 4   NKTTSRILSIDILRGLTIAGMITVNNPGSWSYMYAPLEHAEWNGLTPTDLVFPFFMCVMG 63

Query: 114 VAIALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVD------ 163
           + I +A++K         V KI+ R + +   G+ +     + +  ++   G D      
Sbjct: 64  MCIYIAMRKFDFACNRATVYKIVKRMVLIYLVGLAIGWFAKFCYRWNSPQEGADFFSQLW 123

Query: 164 -----MKHIRWCGILQRIALVYVVVALIE-TLTTKRRPNVLEPRHLSIFT 207
                   IR  G+L R+A+ Y + AL+  T+  K  P ++    L+ F 
Sbjct: 124 YMVWSFDKIRLTGVLARLAICYGITALLAITVRHKHLPYIIVGLLLTYFV 173


>gi|126173329|ref|YP_001049478.1| hypothetical protein Sbal_1087 [Shewanella baltica OS155]
 gi|386340088|ref|YP_006036454.1| hypothetical protein [Shewanella baltica OS117]
 gi|125996534|gb|ABN60609.1| conserved hypothetical protein [Shewanella baltica OS155]
 gi|334862489|gb|AEH12960.1| Protein of unknown function DUF2261, transmembrane [Shewanella
           baltica OS117]
          Length = 387

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 89/351 (25%), Positives = 141/351 (40%), Gaps = 86/351 (24%)

Query: 56  LQQKSKRVATLDAFRGLTVV-----------LMILVDDAGGAY-----ARIDHSPWNGCT 99
           ++    R+ +LDA RG  +            L+IL   AG A       ++ HS W+G  
Sbjct: 12  VKVNKPRLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWAGWQWGDEQMHHSQWHGFH 71

Query: 100 LADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDAL 158
             D + P F+F+ GVA+ L+ K++ K+    +  ++R  +K LF  ++L   Y+H     
Sbjct: 72  FYDLIFPLFIFLSGVALGLSPKRLDKLPMKERLPVYRHGIKRLFLLLLLGILYNHGWGT- 130

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFI 218
               D + IR+  +L RIA  +   AL+   T+ R   ++    L I   Y         
Sbjct: 131 GAPADPEKIRYASVLGRIAFAWFFAALLVWHTSLRTQIIVA---LGILLGY--------- 178

Query: 219 AFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHL 278
                      L++P                   G  G L P  +   YVD  L      
Sbjct: 179 -------GAMQLWLP----------------FPGGQAGVLSPTESINAYVDSIL------ 209

Query: 279 YSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI 338
                   L   +                 R P +PEGLLSTI AI++   G+  GH ++
Sbjct: 210 --------LPGVSYQG--------------RTP-DPEGLLSTIPAIVNALAGVFVGHFIV 246

Query: 339 --HFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
             H KG  A++    + G   L    +L     IP+NK+L++ S+V  T+G
Sbjct: 247 KSHPKGEWAKVGLLAAAGCVCLTFGWLLDLV--IPVNKELWTSSFVLVTSG 295


>gi|291516094|emb|CBK65304.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
          Length = 331

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 74/312 (23%), Positives = 122/312 (39%), Gaps = 77/312 (24%)

Query: 89  RIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK----LLFWG 144
           ++ H  WNG T  D + P FLFI GV+   +L K      + ++I+F+ ++    L+  G
Sbjct: 8   QMGHVSWNGLTQHDTIFPLFLFIAGVSFPFSLSKQRASGISERRILFKVIRRGMTLIVLG 67

Query: 145 IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLS 204
           +I  G +           D   +R   +L RI L ++  +L+            + R  +
Sbjct: 68  MIYNGLFRF---------DFASLRVASVLGRIGLAWMFASLLYMYC--------KVRTRA 110

Query: 205 IFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNA 264
           +F A         +  + Y +  Y +  P+      +  G             L  A N 
Sbjct: 111 VFAA---------VVLIGYSLLMYLVVAPD------APDGTDP----------LSVAGNI 145

Query: 265 VGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAI 324
            G+VDR+ W               AC                     F+PEGLLST+ AI
Sbjct: 146 AGWVDRQ-WLPGTF----------AC-------------------GSFDPEGLLSTLPAI 175

Query: 325 LSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNA-IPINKQLYSFSYVC 383
           +S   G+  G  L+  +   +  +  + M    + I +I    N  IPINK+L+S S+ C
Sbjct: 176 VSALFGMFTGEFLLRKRSSLSGEQIALCMALAAVAITVIGIIWNCWIPINKKLWSSSFTC 235

Query: 384 FTAGAADGCLGI 395
              G + G   +
Sbjct: 236 VVTGYSLGLFAL 247


>gi|189463416|ref|ZP_03012201.1| hypothetical protein BACCOP_04135 [Bacteroides coprocola DSM 17136]
 gi|189429845|gb|EDU98829.1| hypothetical protein BACCOP_04135 [Bacteroides coprocola DSM 17136]
          Length = 82

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 28/68 (41%), Positives = 46/68 (67%), Gaps = 3/68 (4%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           +K++R+ +LD  RG+T+  MI+V++ G     Y  + H+ WNG T  D V PFF+FI+G+
Sbjct: 2   EKAQRLISLDVLRGITIAGMIIVNNPGSWKHVYTPLTHAVWNGLTPTDLVFPFFMFIMGI 61

Query: 115 AIALALKK 122
           +  ++LKK
Sbjct: 62  STYISLKK 69


>gi|399069322|ref|ZP_10749357.1| Protein of unknown function (DUF1624), partial [Caulobacter sp.
           AP07]
 gi|398045229|gb|EJL37978.1| Protein of unknown function (DUF1624), partial [Caulobacter sp.
           AP07]
          Length = 233

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 63/115 (54%), Gaps = 20/115 (17%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAG--------GAYARIDHSPWNGCTLADFVMPFF 108
           + K+ R+A+LD  RGLT+V MI+V+ A           +A ++H+ W G T AD V P F
Sbjct: 4   RPKAARLASLDVLRGLTIVGMIVVNTASYLHYVSGYAVFAGLEHAEWRGFTAADAVFPAF 63

Query: 109 LFIVGVAIALALKKVPKING------------AVKKIIFRTLKLLFWGIILQGGY 151
           +F+ GV+I LAL  +   +G            A+++++ R+ +L   G+IL   Y
Sbjct: 64  VFMTGVSIPLALGPLALGDGPIERGMAGLDGAALRRLLVRSGRLFLLGLILSNLY 118


>gi|223936396|ref|ZP_03628308.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223894914|gb|EEF61363.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 383

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 72/140 (51%), Gaps = 25/140 (17%)

Query: 57  QQKSKRVATLDAFRGLTVVLMI----------LVDDAGGAYA---RIDHSPWNGCTLADF 103
           Q+ + R+ ++DA RG  +  ++           +DD+   +A   +++H  W G    D 
Sbjct: 25  QKANTRIISIDALRGFDMFWIMGGDQLVRSFQKIDDSAPTHALANQMEHCEWAGFHFYDL 84

Query: 104 VMPFFLFIVGVAIALALKKVPKING---AVKKIIFRTLKLLFWGIILQGGYSHAPDALSY 160
           + P F+F+ GV+I  ++ ++ + +G   AVK+I FR++ L  +GI   GG S+       
Sbjct: 85  IFPLFVFLAGVSIVFSITRLIEHSGRVAAVKRIAFRSVILFLFGIFYMGGVSNG------ 138

Query: 161 GVDMKHIRWCGILQRIALVY 180
               K+I   G+L RIA+ Y
Sbjct: 139 ---FKNIYLAGVLHRIAVAY 155


>gi|260910302|ref|ZP_05916976.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
           F0295]
 gi|260635554|gb|EEX53570.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
           F0295]
          Length = 399

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 139/368 (37%), Gaps = 99/368 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 113
            + + R+ ++D  RGLT+  MI V++ G     YA ++H+ WNG T  D V PFF+ ++G
Sbjct: 4   NKTTSRILSIDILRGLTIAGMITVNNPGSWSYMYAPLEHAEWNGLTPTDLVFPFFMCVMG 63

Query: 114 VAIALALKKVPKI--NGAVKKIIFRTLKLLFWGIILQ--GGYSHAPDALSYGVD------ 163
           + I +A+ K         V KI+ R + +   G+ +     + +  +    G D      
Sbjct: 64  MCIYIAMSKFNFACNRATVYKILKRMVLIYLVGLAIGWFAKFCYRWNNPQEGADFFSQLW 123

Query: 164 -----MKHIRWCGILQRIALVYVVVALIE-TLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                   IR  G+L R+A+ Y + AL+  T+  K  P +                +GG 
Sbjct: 124 YMVWSFDKIRLTGVLARLAVCYGITALLAITVRHKHLPYI----------------VGGL 167

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
           +                          +  +++     G      N +  VDR +    H
Sbjct: 168 L--------------------------LAYFVILMAGNGFAYDETNILSIVDRAVLTDAH 201

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           +Y D                               +PEGLLST+ +I    +G   G +L
Sbjct: 202 MYHD----------------------------NGIDPEGLLSTLPSIAHTLLGFIIGGML 233

Query: 338 IH-----FKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQLYSFSYVCFTAG 387
                   +   AR    ++    L ++   L F   +     PINK+++S ++V  T G
Sbjct: 234 FRKADAGVQQLDARANITLTKVVPLFVVGTSLLFAGYLLSYGCPINKKVWSPTFVLVTCG 293

Query: 388 AADGCLGI 395
            A   L +
Sbjct: 294 LASMLLAL 301


>gi|404485011|ref|ZP_11020215.1| hypothetical protein HMPREF9448_00625 [Barnesiella intestinihominis
           YIT 11860]
 gi|404340016|gb|EJZ66447.1| hypothetical protein HMPREF9448_00625 [Barnesiella intestinihominis
           YIT 11860]
          Length = 440

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 71/148 (47%), Gaps = 25/148 (16%)

Query: 57  QQKSKRVATLDAFRGLTVVLMIL-----------VDDA--GGAYARIDHSPWNGCTLADF 103
           Q   +R+A+LD  RG  + L++            VD +       + DH  W G    D 
Sbjct: 71  QPVGQRLASLDILRGFDLFLLVFLQPVLVSLGACVDSSVMNAVLYQFDHEVWEGFRFWDL 130

Query: 104 VMPFFLFIVGVAIALALKKVPKING---AVKKIIFRTLKLLFWGIILQGGYSHAPDALSY 160
           VMP FLF+ GV++  +  K  ++       KK++ R + L   G+++QG           
Sbjct: 131 VMPLFLFMTGVSMPFSFSKYERVESRRFIYKKVLRRFVILFLLGMVVQGNL--------L 182

Query: 161 GVDMKHIR-WCGILQRIALVYVVVALIE 187
           G+D+K+IR +   LQ IA  Y++ ALI+
Sbjct: 183 GLDLKYIRLYSNTLQAIAAGYLIAALIQ 210


>gi|393784535|ref|ZP_10372698.1| hypothetical protein HMPREF1071_03566 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665516|gb|EIY59040.1| hypothetical protein HMPREF1071_03566 [Bacteroides salyersiae
           CL02T12C01]
          Length = 378

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 50/153 (32%), Positives = 76/153 (49%), Gaps = 28/153 (18%)

Query: 54  QLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTL 100
           ++L+  S R+A+LD  RG  + L++       A  +              DH  W G  L
Sbjct: 5   KILENNSSRLASLDILRGFDLFLLVFFQPVFVALGQQLDLPFLNRLVYQFDHEAWVGFHL 64

Query: 101 ADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLF-WGIILQGGYSHAP 155
            D VMP FLF+ G ++  +L K  KI+ A    V + IFR + LLF +G+I+QG      
Sbjct: 65  WDLVMPLFLFMTGASMPFSLSKY-KISSAGCQFVYRRIFRRVVLLFLFGMIVQGN----- 118

Query: 156 DALSYGVDMKHIR-WCGILQRIALVYVVVALIE 187
                G D +HI  +   LQ IA+ Y++ A+I+
Sbjct: 119 ---LLGFDSQHIYLYSNTLQAIAVGYLIAAIIQ 148


>gi|154492358|ref|ZP_02031984.1| hypothetical protein PARMER_01992 [Parabacteroides merdae ATCC
           43184]
 gi|423722056|ref|ZP_17696232.1| hypothetical protein HMPREF1078_00295 [Parabacteroides merdae
           CL09T00C40]
 gi|154087583|gb|EDN86628.1| hypothetical protein PARMER_01992 [Parabacteroides merdae ATCC
           43184]
 gi|409242758|gb|EKN35518.1| hypothetical protein HMPREF1078_00295 [Parabacteroides merdae
           CL09T00C40]
          Length = 370

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 87/348 (25%), Positives = 141/348 (40%), Gaps = 98/348 (28%)

Query: 57  QQKSKRVATLDAFRG-----------LTVVLMILVDDA-----GGAYARIDHSPWNGCTL 100
           Q++S+R+ +LDA RG           L V L  L  ++     GG   ++DH  W+G T 
Sbjct: 4   QKQSRRLLSLDALRGFDMFFIMGGASLFVALATLFPNSFFQAIGG---QMDHVEWDGLTH 60

Query: 101 ADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHAPD 156
            D + P FLFI G++   +L+K  +   +     K+I+ R + L+  G +  G       
Sbjct: 61  HDTIFPLFLFIAGISFPFSLEKQREQGKSESDIYKRIVRRGITLVLLGCVYNGLLQF--- 117

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                 D  ++R   +L RI L ++  AL   L    R +V      +I   Y W WI  
Sbjct: 118 ------DFANLRCASVLARIGLGWMFAAL---LFVHFRTSVRAWIAGTILVGY-WVWIA- 166

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
                         ++P                V     G      N VGYVDR L    
Sbjct: 167 --------------FIP----------------VPGAEAGPFTLEGNWVGYVDRLL---- 192

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
                                G L +         F+PEGLLST+  I++  +G+  G  
Sbjct: 193 -------------------LPGRLHQGF-------FDPEGLLSTLPGIVTAMLGMFTGEF 226

Query: 337 L-IHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVC 383
           + +  +G + + K    +  G+L++ + + ++   PINK+L+S S+VC
Sbjct: 227 IKLRKEGLTDKRKVAGLVLAGVLLLLVGVLWSLVFPINKKLWSSSFVC 274


>gi|386312853|ref|YP_006009018.1| N-acetylglucosamine related transporter, NagX [Shewanella
           putrefaciens 200]
 gi|319425478|gb|ADV53552.1| N-acetylglucosamine related transporter, NagX [Shewanella
           putrefaciens 200]
          Length = 384

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 86/368 (23%), Positives = 149/368 (40%), Gaps = 84/368 (22%)

Query: 62  RVATLDAFRGLTVV-----------LMILVDDAGGAYA--RIDHSPWNGCTLADFVMPFF 108
           R+ +LDA RG  +            L+IL   AG  +   ++ HS W+G    D + P F
Sbjct: 18  RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDLIFPLF 77

Query: 109 LFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDALSYGVDMKHI 167
           +F+ GVA+ L+ K++ K+    +  ++R  +K LF  ++L   Y+H         D + +
Sbjct: 78  IFLSGVALGLSPKRLDKLPMKDRLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKV 136

Query: 168 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITT 227
           R+  +L RIA  +   AL+   T+ R   ++    L I   Y                  
Sbjct: 137 RYASVLGRIAFAWFFAALLVWHTSLRTQVIVA---LGILLGY----------------GA 177

Query: 228 YSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRL 287
             L++P                   G  G L P  +   YVD  L              L
Sbjct: 178 MQLWLP----------------FPSGQAGVLSPTQSINAYVDSIL--------------L 207

Query: 288 EACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--HFKGHSA 345
              +                 R P +PEGLLSTI A+++   G+  G+ ++  H +G   
Sbjct: 208 PGVSYQG--------------RTP-DPEGLLSTIPAVVNALAGVFVGYFIVKSHPQGEWV 252

Query: 346 RLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMD 405
           ++    + G   L +  +L     IP+NK+L++ S+V  T+G +   L +  ++ +  + 
Sbjct: 253 KVGLLATAGGAWLALGWLLD--GVIPVNKELWTSSFVLVTSGWSMILLALFYAL-VDVLK 309

Query: 406 WDECDACF 413
           W +    F
Sbjct: 310 WQKAAFFF 317


>gi|375149723|ref|YP_005012164.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361063769|gb|AEW02761.1| Protein of unknown function DUF2261, transmembrane [Niastella
           koreensis GR20-10]
          Length = 368

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 31/66 (46%), Positives = 44/66 (66%), Gaps = 3/66 (4%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           +KR  +LD FRGL + LMI+V+  G    ++A + H+ WNG TLAD V P FLF VG A+
Sbjct: 4   NKRFISLDVFRGLIICLMIIVNTPGSHDTSFALLQHANWNGFTLADLVFPSFLFAVGNAL 63

Query: 117 ALALKK 122
             +++K
Sbjct: 64  FFSMQK 69



 Score = 42.4 bits (98), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 50/95 (52%), Gaps = 6/95 (6%)

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSAR-LKHWVSMGFGLLIIAIILHFTN 368
            PFEPEG+LST+ AI +   G   G   I   G + R L   ++ G GL  + +  ++  
Sbjct: 197 VPFEPEGILSTLPAITNVVAGYLVGWY-IQTAGKTKRMLLRLIATGAGLTFLGLCWNY-- 253

Query: 369 AIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPE 403
            IPINK L++ S+V  + G    CL +   I+I +
Sbjct: 254 VIPINKNLWTSSFVVHSTGL--DCLLLAAIIYIAD 286


>gi|146292182|ref|YP_001182606.1| hypothetical protein Sputcn32_1079 [Shewanella putrefaciens CN-32]
 gi|145563872|gb|ABP74807.1| conserved hypothetical protein [Shewanella putrefaciens CN-32]
          Length = 384

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 85/373 (22%), Positives = 150/373 (40%), Gaps = 84/373 (22%)

Query: 57  QQKSKRVATLDAFRGLTVV-----------LMILVDDAGGAYA--RIDHSPWNGCTLADF 103
           +    R+ +LDA RG  +            L+IL   AG  +   ++ HS W+G    D 
Sbjct: 13  KANKPRLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDL 72

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDALSYGV 162
           + P F+F+ GVA+ L+ K++ K+  + +  ++R  +K LF  ++L   Y+H         
Sbjct: 73  IFPLFIFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPA 131

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
           D + IR+  +L RIA  +   AL+   T+ R   ++    L I   Y             
Sbjct: 132 DPEKIRYASVLGRIAFAWFFAALLVWHTSLRTQVIVA---LGILLGY------------- 175

Query: 223 YIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDP 282
                  L++P                   G  G   P  +   YVD  L          
Sbjct: 176 ---GAMQLWLP----------------FPSGQAGVFSPTQSINAYVDSIL---------- 206

Query: 283 VWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--HF 340
               L   +                 R P +P+GLLSTI A+++   G+  G+ ++  H 
Sbjct: 207 ----LPGVSYQG--------------RTP-DPQGLLSTIPAVVNALAGVFVGYFIVKSHP 247

Query: 341 KGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIF 400
           +G   ++    + G   L +  +L     IP+NK+L++ S+V  T+G +   L +  ++ 
Sbjct: 248 QGEWVKVGLLATAGGAWLALGWLLD--GVIPVNKELWTSSFVLVTSGWSMILLALFYAL- 304

Query: 401 IPEMDWDECDACF 413
           +  + W +    F
Sbjct: 305 VDVLKWQKVAFFF 317


>gi|336312505|ref|ZP_08567454.1| N-acetylglucosamine transporter, NagX [Shewanella sp. HN-41]
 gi|335864011|gb|EGM69129.1| N-acetylglucosamine transporter, NagX [Shewanella sp. HN-41]
          Length = 384

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 86/378 (22%), Positives = 149/378 (39%), Gaps = 92/378 (24%)

Query: 56  LQQKSKRVATLDAFRGLTVV-----------LMILVDDAGGAYA--RIDHSPWNGCTLAD 102
           ++    R+ +LDA RG  +            L+IL   AG  +   ++ HS W+G    D
Sbjct: 12  IKVTKPRLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYD 71

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIF-----RTLKLLFWGIILQGGYSHAPDA 157
            + P F+F+ GVA+ L+ K++ K+  + +  ++     R L LL  GI+   G+     A
Sbjct: 72  LIFPLFIFLSGVALGLSPKRLDKLPLSERLPVYRHGIKRLLLLLLLGILYNHGWGTGAPA 131

Query: 158 LSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                D + +R+  +L RIA  +   AL+   T+ R   ++    L I   Y        
Sbjct: 132 -----DPEKVRYASVLGRIAFAWFFAALLVWHTSLRTQIIVA---LGILLGYG------- 176

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
                       L++P                   G  G L P  +   Y+D  L     
Sbjct: 177 ---------AIQLWLP----------------FSGGQAGVLSPTESINAYIDSIL----- 206

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
                    L   +       P               EGLLSTI A+++   G+  GH +
Sbjct: 207 ---------LPGVSYQGRTLDP---------------EGLLSTIPAVVNALAGVFVGHFI 242

Query: 338 I--HFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           +  H +G  A++    + G   L +  +L     IP+NK+L++ S+V  T+G +   L +
Sbjct: 243 VKSHPQGEWAKVGLLAAAGGVCLALGWLLD--GVIPVNKELWTSSFVLVTSGWSMVLLAV 300

Query: 396 TNSIFIPEMDWDECDACF 413
             ++ +  + W +    F
Sbjct: 301 FYAL-VDVLKWQKAAFFF 317


>gi|393787642|ref|ZP_10375774.1| hypothetical protein HMPREF1068_02054 [Bacteroides nordii
           CL02T12C05]
 gi|392658877|gb|EIY52507.1| hypothetical protein HMPREF1068_02054 [Bacteroides nordii
           CL02T12C05]
          Length = 373

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 80/357 (22%), Positives = 133/357 (37%), Gaps = 85/357 (23%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSP-------------WNGCTLAD 102
           +   +KR+A+LD  RG  +  ++++      +  I+++P             W G    D
Sbjct: 1   MATANKRLASLDLLRGFDLFCLLMLQPILMTWLEIENNPSLDPITNQFTHVEWQGVAFWD 60

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDAL 158
            +MP F+F+ G+ I  A+ K  +     +   FR  K    L F G ++QG      + L
Sbjct: 61  LIMPLFMFMSGITIPFAMSKYKQGEKIDRHFYFRLFKRFFVLFFLGWVVQG------NLL 114

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFI 218
           +  +   HI +   LQ IA+ YVV AL+    + R                        I
Sbjct: 115 ALDIRQFHI-FANTLQAIAVGYVVAALLYVWCSFRTQ----------------------I 151

Query: 219 AFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHL 278
           +F +    TY L      F+   D   +             P  N    +DR        
Sbjct: 152 SFTVLCFITYLL-----VFATIGDMNYE-------------PGTNIAEEIDR-------- 185

Query: 279 YSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI 338
                      C L S   G    +      + +    +LS+++ I++  +G   GH+L 
Sbjct: 186 -----------CVLGSLRDGVTWTNGTWSFDSSYHYTWILSSLNFIVTVMLGSFAGHILR 234

Query: 339 HFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
             K    RLK  V +  G  ++A+ L      PI K+++S S   F  G     +GI
Sbjct: 235 LRKEPVQRLK--VLLLTGGFLVAVALLMDPLFPIIKRIWSSSMTLFYGGVCFLLMGI 289


>gi|225875032|ref|YP_002756491.1| hypothetical protein ACP_3497 [Acidobacterium capsulatum ATCC
           51196]
 gi|225792728|gb|ACO32818.1| putative membrane protein [Acidobacterium capsulatum ATCC 51196]
          Length = 378

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 43/143 (30%), Positives = 78/143 (54%), Gaps = 20/143 (13%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
            +KR+ ++D  RG+T+  MILV++ G    A+  + H+ WNG T  D V P F+F+VG++
Sbjct: 10  NAKRMVSIDLLRGITIAFMILVNNNGDEAHAFWALKHAQWNGFTPTDLVFPTFIFVVGIS 69

Query: 116 IALALKKVPKINGAVKKIIF-----RTLKLLFWGIILQG-GYSHAPDALSYGVDMKHIRW 169
           +  + +   +  G  + +I      R++ L   G+++ G  Y H      +G     +R 
Sbjct: 70  LVFSTEARLR-RGQSRLLIAAHALRRSVILFLLGLVVNGFPYFH------FGT----LRI 118

Query: 170 CGILQRIALVYVVVALIETLTTK 192
            G+LQRIA+ Y+  +L+  L+ +
Sbjct: 119 YGVLQRIAICYLFGSLLYLLSRR 141


>gi|237722081|ref|ZP_04552562.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|293373568|ref|ZP_06619919.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|299145142|ref|ZP_07038210.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
 gi|229448950|gb|EEO54741.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|292631466|gb|EFF50093.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|298515633|gb|EFI39514.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
          Length = 377

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 44/153 (28%), Positives = 70/153 (45%), Gaps = 26/153 (16%)

Query: 53  QQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCT 99
            +L +  S R+A+LD  RG  + L++       A AR              DH  W G  
Sbjct: 3   NKLPENNSIRLASLDILRGFDLFLLVFFQPVFVALARQMNMSFLDSILYQFDHEVWEGFR 62

Query: 100 LADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHAP 155
             D VMP FLF+ G ++  +L K     G+     ++I+ R   L  +G+I+QG      
Sbjct: 63  FWDLVMPLFLFMTGASMPFSLSKYIGTTGSYWPVYRRILKRVFLLFIFGMIVQGNL---- 118

Query: 156 DALSYGVDMKHIR-WCGILQRIALVYVVVALIE 187
                G+D  H+  +   LQ IA+ Y++ A+I+
Sbjct: 119 ----LGLDATHLYLYSNTLQSIAVGYLIAAVIQ 147


>gi|171914858|ref|ZP_02930328.1| hypothetical protein VspiD_26815 [Verrucomicrobium spinosum DSM
           4136]
          Length = 379

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 78/312 (25%), Positives = 124/312 (39%), Gaps = 53/312 (16%)

Query: 88  ARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKING---AVKKIIFRTLKLLFWG 144
           +++ H  W G    D + P FLF+VGV+I L++ ++    G   A+ +I+ R+  L   G
Sbjct: 28  SQLQHVEWEGFRFYDAIFPLFLFLVGVSIVLSVDRMVARVGRSRALARIVRRSALLFAVG 87

Query: 145 IILQGGYSHA-PDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHL 203
           +   GG +   PD          ++  G+L RIAL Y+V A +  L  ++   +     L
Sbjct: 88  VFYYGGIARPWPD----------VQLSGVLPRIALCYLVAATLYVLLPRKGIVIATAACL 137

Query: 204 SIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACN 263
             + A                     L+VP  +    +  G +K +           A +
Sbjct: 138 LGYWALM-------------------LFVPFPNVDIKTPAGARKQVEAKTREELFAGAAS 178

Query: 264 AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISA 323
                 +E   + H Y D  W       L   N                  EGLLSTI A
Sbjct: 179 TTKGTFKEGLNLAH-YVDACWLPGRKRNLYYSN------------------EGLLSTIPA 219

Query: 324 ILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVC 383
           + S   G+  G VL H +  S R K    +G GL  + I L +    P+ K+L++ S+  
Sbjct: 220 VASTLFGVLAGWVLTHGR-RSGRWKVGWLVGSGLAGVVIGLLWGLEFPVIKRLWTSSFCM 278

Query: 384 FTAGAADGCLGI 395
             AG +   LG+
Sbjct: 279 VAAGFSAVLLGL 290


>gi|91794054|ref|YP_563705.1| hypothetical protein Sden_2703 [Shewanella denitrificans OS217]
 gi|91716056|gb|ABE55982.1| conserved hypothetical protein [Shewanella denitrificans OS217]
          Length = 400

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 90/377 (23%), Positives = 147/377 (38%), Gaps = 89/377 (23%)

Query: 27  SENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRG-----------LTVV 75
           S   +N +  +  +  + + +    L +       R+ +LDA RG           L   
Sbjct: 5   STTDVNSKVAVRVNSTEHKGQTSTSLNK------PRLKSLDALRGFDMFWIIGGEGLFAA 58

Query: 76  LMILVDDAGGAYA--RIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKI 133
           L  L   AG   A  ++ HS W+G TL D + P F+F+ GVA+ L+ K++ +   AV   
Sbjct: 59  LFTLTGWAGWNIASRQMQHSQWHGFTLYDLIFPLFIFLSGVALGLSPKRLDQQAFAVALP 118

Query: 134 IFR-TLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTK 192
           +++   K L   I L   Y+H         D+  IR+  +L RI   +   A+   L   
Sbjct: 119 LYQHACKRLILLIALGILYNHGWGT-GIPADLDKIRYSSVLARIGFAWFFAAM---LVWH 174

Query: 193 RRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKC 252
            R ++     +SI   Y                T   LY+P                V  
Sbjct: 175 TRLSIQVIVSVSIIGLY----------------TLAQLYLP----------------VPG 202

Query: 253 GMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPF 312
           G  G      +   YVD                            G LR       R P 
Sbjct: 203 GQAGQFTLDASINTYVD----------------------------GLLRPGIAYQDR-PL 233

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLI--HFKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           +PEG+LST+ A+++  +G+  G  +I  H +G  A++   V +  G+L++ +       I
Sbjct: 234 DPEGILSTVPAVINAMVGVFAGQFIIRAHSRGDWAKVG--VLIACGVLLLVLAWLLEPMI 291

Query: 371 PINKQLYSFSYVCFTAG 387
           P+NK L++ S+V  T+G
Sbjct: 292 PVNKDLWTTSFVLVTSG 308


>gi|260684359|ref|YP_003215644.1| hypothetical protein CD196_2626 [Clostridium difficile CD196]
 gi|260688018|ref|YP_003219152.1| hypothetical protein CDR20291_2673 [Clostridium difficile R20291]
 gi|260210522|emb|CBA65033.1| putative membrane protein [Clostridium difficile CD196]
 gi|260214035|emb|CBE06182.1| putative membrane protein [Clostridium difficile R20291]
          Length = 352

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 64/123 (52%), Gaps = 16/123 (13%)

Query: 77  MILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKI 133
           MI+ ++ G     Y ++ H+ W+G TLADF  PFF+  +GV I +++    K N +   I
Sbjct: 1   MIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTIPISINSKLKNNKSTLSI 60

Query: 134 IF----RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL 189
           I     R++ L+ +G  L   Y   P       D+  +R  G+LQR+ LVY V +L+  L
Sbjct: 61  ILSIFKRSILLILFGFFLN--YLGNP-------DLNSVRILGVLQRMGLVYFVTSLVYLL 111

Query: 190 TTK 192
             K
Sbjct: 112 LKK 114



 Score = 45.1 bits (105), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFT 367
           +  FEP+G L++I AI SG +G   G VL+    G   +    + M   LLI A I  F 
Sbjct: 167 KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFI--FN 224

Query: 368 NAIPINKQLYSFSYVCFTAGAADGCLGI 395
              P NK+L+S S+V   AG+    L I
Sbjct: 225 QYFPFNKRLWSSSFVLLMAGSYGMLLSI 252


>gi|433678126|ref|ZP_20510025.1| Heparan-alpha-glucosaminide N-acetyltransferase [Xanthomonas
           translucens pv. translucens DSM 18974]
 gi|430816762|emb|CCP40477.1| Heparan-alpha-glucosaminide N-acetyltransferase [Xanthomonas
           translucens pv. translucens DSM 18974]
          Length = 384

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 42/125 (33%), Positives = 67/125 (53%), Gaps = 13/125 (10%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R  +LD FRGL + LMIL +  G    A+ ++ H+PW G T AD   P FLF+VG A++
Sbjct: 16  ERFLSLDVFRGLMIFLMILGNTPGAGADAFVQLRHAPWLGFTAADVGFPSFLFVVGNAMS 75

Query: 118 LALKKVPKINGAVKKI------IFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
            AL +   +   ++++      IF    L++W   +  G   A  + S+ + +   R  G
Sbjct: 76  FALDRSQPLGAFLRRVGKRSALIFLLGFLMYWFPFVHQG---ADGSWSF-IAIDQTRVPG 131

Query: 172 ILQRI 176
           +LQRI
Sbjct: 132 VLQRI 136


>gi|392548092|ref|ZP_10295229.1| hypothetical protein PrubA2_17028 [Pseudoalteromonas rubra ATCC
           29570]
          Length = 373

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 84/357 (23%), Positives = 139/357 (38%), Gaps = 101/357 (28%)

Query: 56  LQQKSKRVATLDAFRGLTV-----------VLMILVDDAGGAY--ARIDHSPWNGCTLAD 102
           +    KR+A+LDA RG+ +            L +L    G     A+  HS W+G T  D
Sbjct: 1   MSNNKKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQGWKIFEAQTLHSAWHGFTFYD 60

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIF-----RTLKLLFWGIILQGGYSHAPDA 157
            + P F+F+ GVA+ L  K++  +  A +K I+     R   L  +G++   G+     A
Sbjct: 61  LIFPLFIFLSGVAMGLRPKRIDHLPMAERKPIYIKAIKRLGLLCLFGVLYNHGWGTGIPA 120

Query: 158 LSYGVDMKHIRWCGILQRIALVYVVVALIE---TLTTKRRPNVLEPRHLSIFTAYQWQWI 214
                D   IR+  +L RIA+ +   A++    +L T     V       I  AY   W+
Sbjct: 121 -----DFGEIRYASVLGRIAIAWFFCAMLVWHCSLKTTALTGV------GILLAY---WL 166

Query: 215 GGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWG 274
                           ++P                V  G  G L PA +   +VD+ L  
Sbjct: 167 -------------LLCFIP----------------VPGGSAGELTPAGSWNAWVDQAL-- 195

Query: 275 INHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYG 334
                                  G   ++ P       +PEG+LS+  AI++   G+  G
Sbjct: 196 ---------------------LPGITYQNRP------VDPEGILSSFPAIVNAIAGVFAG 228

Query: 335 HVLIHFKGHSARLKHWVSMG----FGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
            ++      S +L  W   G     G++ +A+   +    P+NK+L++ S+V  T G
Sbjct: 229 QLI----AQSDKLGQWQVAGRLFAAGIVSLALGWLWDLQFPVNKELWTSSFVLVTVG 281


>gi|423345097|ref|ZP_17322786.1| hypothetical protein HMPREF1060_00458 [Parabacteroides merdae
           CL03T12C32]
 gi|409222883|gb|EKN15820.1| hypothetical protein HMPREF1060_00458 [Parabacteroides merdae
           CL03T12C32]
          Length = 370

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 87/348 (25%), Positives = 140/348 (40%), Gaps = 98/348 (28%)

Query: 57  QQKSKRVATLDAFRG-----------LTVVLMILVDDA-----GGAYARIDHSPWNGCTL 100
           Q++S+R+ +LDA RG           L V L  L  ++     GG   ++DH  W+G T 
Sbjct: 4   QKQSRRLLSLDALRGVDMFFIMGGASLFVALATLFPNSFFQAIGG---QMDHVEWDGLTH 60

Query: 101 ADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHAPD 156
            D + P FLFI G++   +L+K  +   +     K+I+ R + L+  G +  G       
Sbjct: 61  HDTIFPLFLFIAGISFPFSLEKQREQGKSESDIYKRIVRRGITLVLLGCVYNGLLQF--- 117

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                 D  ++R   +L RI L ++  AL   L    R +V       I   Y W WI  
Sbjct: 118 ------DFANLRCASVLARIGLGWMFAAL---LFVHFRTSVRAWIAGVILVGY-WVWIA- 166

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
                         ++P                V     G      N VGYVDR L    
Sbjct: 167 --------------FIP----------------VPGAEAGPFTLEGNWVGYVDRLL---- 192

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
                                G L +         F+PEGLLST+  I++  +G+  G  
Sbjct: 193 -------------------LPGRLHQGF-------FDPEGLLSTLPGIVTAMLGMFTGEF 226

Query: 337 L-IHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVC 383
           + +  +G + + K    +  G+L++ + + ++   PINK+L+S S+VC
Sbjct: 227 IKLRKEGLTDKRKVAGLVLAGVLLLLVGVLWSLVFPINKKLWSSSFVC 274


>gi|322436067|ref|YP_004218279.1| hypothetical protein AciX9_2466 [Granulicella tundricola MP5ACTX9]
 gi|321163794|gb|ADW69499.1| hypothetical protein AciX9_2466 [Granulicella tundricola MP5ACTX9]
          Length = 391

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 30/65 (46%), Positives = 43/65 (66%), Gaps = 3/65 (4%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + RV ++D  RGLT+ LMILV+D G A   Y ++ H+ WNG T AD V P FLF+ G ++
Sbjct: 14  APRVLSIDVLRGLTIALMILVNDPGDAGCVYPQLQHAEWNGYTAADLVFPNFLFLGGASL 73

Query: 117 ALALK 121
             +L+
Sbjct: 74  VFSLQ 78


>gi|423282787|ref|ZP_17261672.1| hypothetical protein HMPREF1204_01210 [Bacteroides fragilis HMW
           615]
 gi|404581658|gb|EKA86354.1| hypothetical protein HMPREF1204_01210 [Bacteroides fragilis HMW
           615]
          Length = 375

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 56/205 (27%), Positives = 85/205 (41%), Gaps = 44/205 (21%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYA-------------RIDHSPWNGCTLADFVMP 106
           S R+A+LD  RG  + L++         A             + DH  W G    D VMP
Sbjct: 9   SPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMP 68

Query: 107 FFLFIVGVAIALALKKV---PKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
            FLF+ G ++  +  K    P      +KII R + L  +G+I+QG           G+D
Sbjct: 69  LFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL--------LGLD 120

Query: 164 MKHIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
            KH+  +   LQ IA  Y++ A+I+                 +   ++WQ I   +  VI
Sbjct: 121 PKHLYLYSNTLQAIATGYLIAAIIQ-----------------LHCNFRWQLIVTALLLVI 163

Query: 223 YIITTYSL--YVPNWSFSEHSDHGV 245
           Y I    L  + P  +F+E  D  V
Sbjct: 164 YWIPMTFLGDFTPEGNFAEKVDRLV 188


>gi|440804580|gb|ELR25457.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acanthamoeba
           castellanii str. Neff]
          Length = 446

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 95/351 (27%), Positives = 144/351 (41%), Gaps = 98/351 (27%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG----------------AYARIDHSPWNGCTLAD 102
           K  R+ +LD FRG+T++ MILVD+ G                 A  R   +  +    AD
Sbjct: 56  KKPRLQSLDVFRGVTMLGMILVDNQGNFDHVVRPLDESIVRHPAPPRPPTNARSWVDPAD 115

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVK---KIIFRTLKLLFWGIILQGGYSHAPDALS 159
               +  F V +A+     K+P   G +K   +++ R   L   G++L    S+  D   
Sbjct: 116 HCAQWDGFAVALAMNGFWDKIPDRRGKIKAWARVLQRIGTLFVVGLLLNAFGSNPWDKWP 175

Query: 160 YGVDMKHIRWCGILQRIALVY--VVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
           +     H R  G   RIAL Y  V V  + T T  +R  +L       FTA         
Sbjct: 176 HW----HFRIMGC--RIALCYGTVTVLFLATSTIVQRVVML------CFTA--------- 214

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
               IY+   Y L VP                 KCG RG+L P CNA G++DR ++G   
Sbjct: 215 ----IYVGLMYGLDVP-----------------KCG-RGNLTPGCNAGGFIDRSIFG--- 249

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
                                        W   P +PEGLLST++A L+  +G+ +G +L
Sbjct: 250 ----------------------------DWMIRPNDPEGLLSTLTATLTCYLGLEFGRIL 281

Query: 338 IHFKGHSARLK-HWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
             ++ +   L   WV +  GL+ +A+ L     +PINK+++S  +     G
Sbjct: 282 HKYRANQLELVCRWVMLALGLIGLALFLWLW--MPINKKMWSVPFALMMGG 330


>gi|403717790|ref|ZP_10942873.1| hypothetical protein KILIM_074_00050 [Kineosphaera limosa NBRC
           100340]
 gi|403208927|dbj|GAB97556.1| hypothetical protein KILIM_074_00050 [Kineosphaera limosa NBRC
           100340]
          Length = 461

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 72/304 (23%), Positives = 117/304 (38%), Gaps = 77/304 (25%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
            S+R  +LD  RGL +V+ + V+    A A  +H+ W G    D V P F+ + G  +A 
Sbjct: 78  PSRRFISLDVARGLMLVVSVAVNAWITAPAWFEHAAWAGVHPVDLVFPTFVALSGAGLAF 137

Query: 119 A-LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG-VDMKHIRWCGILQRI 176
           A  +++P     ++ ++ R + L   G+     Y+     LS G +D    R  G+LQ  
Sbjct: 138 AYARRIP-----LRPLLSRVIVLALAGL----AYNAHAQYLSTGQLDWATFRIPGVLQLY 188

Query: 177 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 236
           A + +V+AL+  +  +                  W W      F I   T ++L      
Sbjct: 189 AAIVLVIALLHFVLRR-----------------WWAW----PLFTIVAATCFAL------ 221

Query: 237 FSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPN 296
                   + ++   C   G L P CN  G  D  L+G+ H+Y                 
Sbjct: 222 -------ALNRFAAGCP-GGALTPECNPSGLFDPALFGVEHIYHQ--------------- 258

Query: 297 SGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFG 356
                       R   +PEG+++   A +    G   GH+L      S+RL        G
Sbjct: 259 -----------GRFGHDPEGIVALAGATIVACAGASLGHLL-----KSSRLPARRQALTG 302

Query: 357 LLII 360
           LLI+
Sbjct: 303 LLIV 306


>gi|265767324|ref|ZP_06094990.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
 gi|263252629|gb|EEZ24141.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
          Length = 375

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 56/205 (27%), Positives = 85/205 (41%), Gaps = 44/205 (21%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYA-------------RIDHSPWNGCTLADFVMP 106
           S R+A+LD  RG  + L++         A             + DH  W G    D VMP
Sbjct: 9   SPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMP 68

Query: 107 FFLFIVGVAIALALKKV---PKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
            FLF+ G ++  +  K    P      +KII R + L  +G+I+QG           G+D
Sbjct: 69  LFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL--------LGLD 120

Query: 164 MKHIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
            KH+  +   LQ IA  Y++ A+I+                 +   ++WQ I   +  VI
Sbjct: 121 PKHLYLYSNTLQAIATGYLIAAIIQ-----------------LHCNFRWQLIVTALLLVI 163

Query: 223 YIITTYSL--YVPNWSFSEHSDHGV 245
           Y I    L  + P  +F+E  D  V
Sbjct: 164 YWIPMTFLGDFTPEGNFAEKVDRLV 188


>gi|53715734|ref|YP_101726.1| hypothetical protein BF4455 [Bacteroides fragilis YCH46]
 gi|423271955|ref|ZP_17250924.1| hypothetical protein HMPREF1079_04006 [Bacteroides fragilis
           CL05T00C42]
 gi|423276040|ref|ZP_17254983.1| hypothetical protein HMPREF1080_03636 [Bacteroides fragilis
           CL05T12C13]
 gi|52218599|dbj|BAD51192.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
 gi|392696310|gb|EIY89506.1| hypothetical protein HMPREF1079_04006 [Bacteroides fragilis
           CL05T00C42]
 gi|392699545|gb|EIY92721.1| hypothetical protein HMPREF1080_03636 [Bacteroides fragilis
           CL05T12C13]
          Length = 375

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 56/205 (27%), Positives = 85/205 (41%), Gaps = 44/205 (21%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYA-------------RIDHSPWNGCTLADFVMP 106
           S R+A+LD  RG  + L++         A             + DH  W G    D VMP
Sbjct: 9   SPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMP 68

Query: 107 FFLFIVGVAIALALKKV---PKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
            FLF+ G ++  +  K    P      +KII R + L  +G+I+QG           G+D
Sbjct: 69  LFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL--------LGLD 120

Query: 164 MKHIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
            KH+  +   LQ IA  Y++ A+I+                 +   ++WQ I   +  VI
Sbjct: 121 PKHLYLYSNTLQAIATGYLIAAIIQ-----------------LHCNFRWQLIVTALLLVI 163

Query: 223 YIITTYSL--YVPNWSFSEHSDHGV 245
           Y I    L  + P  +F+E  D  V
Sbjct: 164 YWIPMTFLGDFTPEGNFAEKVDRLV 188


>gi|242062184|ref|XP_002452381.1| hypothetical protein SORBIDRAFT_04g024713 [Sorghum bicolor]
 gi|241932212|gb|EES05357.1| hypothetical protein SORBIDRAFT_04g024713 [Sorghum bicolor]
          Length = 96

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 53/97 (54%), Gaps = 4/97 (4%)

Query: 171 GILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSL 230
           GILQRIA+ Y++ A+ E +  K   +V       +   Y++Q   G +  + Y I  Y +
Sbjct: 3   GILQRIAIAYLLAAICE-IWLKGDDDV--DSGYGLLRRYRYQLFVGLVLSIAYTILLYGI 59

Query: 231 YVPNWSFSEHSDHGVKK-YIVKCGMRGHLGPACNAVG 266
           YVP+W +        +K + VKCG+RG  G ACNAVG
Sbjct: 60  YVPDWEYKISGPGSTEKSFSVKCGVRGDTGLACNAVG 96


>gi|392551353|ref|ZP_10298490.1| hypothetical protein PspoU_08780 [Pseudoalteromonas spongiae
           UST010723-006]
          Length = 379

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 45/155 (29%), Positives = 78/155 (50%), Gaps = 23/155 (14%)

Query: 57  QQKSKRVATLDAFRGLTV-----------VLMILVDDAGG--AYARIDHSPWNGCTLADF 103
           ++  KR+A+LDA RG  +            L +L   AG   A A+  HS W+G T  D 
Sbjct: 6   KKTKKRLASLDALRGFDMFWILGGEKIFAALFVLTGWAGWKVAEAQTLHSQWHGFTFYDL 65

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIF-----RTLKLLFWGIILQGGYSHAPDAL 158
           + P F+F+ GVA+ L+ K++  +    +K I+     R L L F+G++   G+       
Sbjct: 66  IFPLFIFLSGVAMGLSPKRIDHLPFVDRKPIYIKAFKRLLLLCFFGVLYNHGWGTGVP-- 123

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
              ++ + +R+  +L RIA+ + V A++   T+ R
Sbjct: 124 ---LNPEEVRYASVLGRIAVAWFVAAMLVWHTSFR 155



 Score = 38.5 bits (88), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 38/77 (49%)

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           P +PEG+LS+I AI +   G+  G  +   +         +    GL+ +A+   +    
Sbjct: 211 PMDPEGILSSIPAIANALFGVIAGRYIKQAQERGEWKTAGILFAAGLVALAVGWLWNMVF 270

Query: 371 PINKQLYSFSYVCFTAG 387
           P+NK L++ S+V  T G
Sbjct: 271 PVNKDLWTSSFVMVTVG 287


>gi|294139796|ref|YP_003555774.1| hypothetical protein SVI_1025 [Shewanella violacea DSS12]
 gi|293326265|dbj|BAJ00996.1| conserved hypothetical protein [Shewanella violacea DSS12]
          Length = 378

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 89/382 (23%), Positives = 151/382 (39%), Gaps = 96/382 (25%)

Query: 54  QLLQQKSKRVATLDAFRGLTVVLMILVDDA--------------GGAYARIDHSPWNGCT 99
            L +   +R+ +LDA RG  +   IL  +A                A A++ HS W+G T
Sbjct: 4   SLTKAPKRRLMSLDALRGFDM-FWILGGEALFAGLLAWSSWQGWQWADAQMHHSQWHGFT 62

Query: 100 LADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF-----RTLKLLFWGIILQGGY-SH 153
             D + P F+F+ GVA+ L+ K++ K+  A +  ++     R   LLF+G++   G+ + 
Sbjct: 63  FYDLIFPLFIFLSGVALGLSPKRLDKLPIAQRMPLYKHSVKRLFLLLFFGVLYNHGWGTG 122

Query: 154 APDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQW 213
           AP      V +  +R+  +L RIA  +   A++   T+ R    +    L I  AY    
Sbjct: 123 AP------VAIDEVRYASVLGRIAFAWFFAAMLVWHTSFRTQVFVT---LGILIAY---- 169

Query: 214 IGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELW 273
                           L++P                   G+ G   P      YVD    
Sbjct: 170 ------------GLLQLFMP----------------FPGGVGGVFTPQGTINAYVDTHF- 200

Query: 274 GINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHY 333
                                   G   ++ P       +PEG+LSTI A+ +   G+  
Sbjct: 201 ----------------------LPGITYQNRP------LDPEGILSTIPAVANAMAGVFV 232

Query: 334 GHVLI--HFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADG 391
           GH ++  H +G  A++     +  G  I+A+       IP+NK L++ S+V  T G +  
Sbjct: 233 GHFIVKEHKQGEWAKVV--CLLLSGAFILALGWWVNLIIPVNKDLWTSSFVLVTTGWSII 290

Query: 392 CLGITNSIFIPEMDWDECDACF 413
            L I  ++ +  + W +    F
Sbjct: 291 LLAIFYAL-VDVLKWQKIAFPF 311


>gi|440731410|ref|ZP_20911431.1| membrane protein [Xanthomonas translucens DAR61454]
 gi|440373102|gb|ELQ09871.1| membrane protein [Xanthomonas translucens DAR61454]
          Length = 384

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/125 (34%), Positives = 65/125 (52%), Gaps = 13/125 (10%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVAIA 117
           +R  +LD FRGL + LMIL +  G    A+ ++ H+PW G T AD   P FLF+VG A++
Sbjct: 16  ERFLSLDVFRGLMIFLMILGNTPGAGADAFVQLRHAPWLGFTAADVGFPSFLFVVGNAMS 75

Query: 118 LALKKVPKIN------GAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCG 171
            AL +   +       G    +IF    L++W   +  G   A  + S+ + +   R  G
Sbjct: 76  FALDRSQPLGAFLCRVGKRSALIFLLGFLMYWFPFVHQG---ADGSWSF-IAIDQTRVPG 131

Query: 172 ILQRI 176
           +LQRI
Sbjct: 132 VLQRI 136


>gi|218260820|ref|ZP_03475939.1| hypothetical protein PRABACTJOHN_01603 [Parabacteroides johnsonii
           DSM 18315]
 gi|423344001|ref|ZP_17321714.1| hypothetical protein HMPREF1077_03144 [Parabacteroides johnsonii
           CL02T12C29]
 gi|218224343|gb|EEC96993.1| hypothetical protein PRABACTJOHN_01603 [Parabacteroides johnsonii
           DSM 18315]
 gi|409213863|gb|EKN06876.1| hypothetical protein HMPREF1077_03144 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 371

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 88/353 (24%), Positives = 142/353 (40%), Gaps = 107/353 (30%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYA------------------RIDHSPWNGC 98
           Q++S+R+ +LDA RG  +  ++     GGA                    ++ H  W+G 
Sbjct: 4   QKQSRRLLSLDALRGFDMFFIM-----GGASLFVALATLFPNPFFQVIGDQMHHVKWDGL 58

Query: 99  TLADFVMPFFLFIVGVAIALALKKVP---KINGAV-KKIIFRTLKLLFWGIILQGGYSHA 154
           T  D + P FLFI G++   +L+K     K +  + +KII R L L+  G +  G  +  
Sbjct: 59  THHDTIFPLFLFIAGISFPFSLEKQREQGKTDADIYRKIIRRGLTLVVLGFVYNGLLNF- 117

Query: 155 PDALSYGVDMKHIRWCGILQRIALVYVVVALI--ETLTTKRRPNVLEPRHLSIFTAYQWQ 212
                   D +H R+  +L RI L ++  ALI   T T  R                   
Sbjct: 118 --------DFEHQRYASVLGRIGLAWMFGALIFVNTRTITR------------------V 151

Query: 213 WIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDREL 272
           WI       + I+  Y L +      + +  GV        M G L      VGYVDR L
Sbjct: 152 WI------TVAILVGYWLLLAFVPAPDGNGAGVFT------MEGSL------VGYVDRLL 193

Query: 273 W-GINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGI 331
             G  HL                                  +PEG+LST+ A+ +  +G+
Sbjct: 194 LPGRLHLTVH-------------------------------DPEGILSTVPAVATALLGM 222

Query: 332 HYGHVL-IHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVC 383
             G  + +  +G + + K    +  G +++A+ L ++   PINK L++ S+VC
Sbjct: 223 FTGEFIKMQREGLTDKKKVGGLVIAGAVLLAVGLLWSLFFPINKNLWTSSFVC 275


>gi|408673239|ref|YP_006872987.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
 gi|387854863|gb|AFK02960.1| Protein of unknown function DUF2261, transmembrane [Emticicia
           oligotrophica DSM 17448]
          Length = 370

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 80/349 (22%), Positives = 130/349 (37%), Gaps = 92/349 (26%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSP-------------WNGCTLADF 103
              ++R+ +LDA RG  +  ++  +D   A +   H P             WNG    D 
Sbjct: 4   SSSTQRLYSLDALRGFDMFWIMGGEDFFHALSEATHHPAAIWIATQLSHVAWNGFRFYDL 63

Query: 104 VMPFFLFIVGV----AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS 159
           + P FLFI GV    ++   ++K       +++II R L L+  G+I   G         
Sbjct: 64  IFPLFLFISGVSTPYSVGREIEKGIDKQAILRRIIKRGLILVLLGVIYNNGLQIK----- 118

Query: 160 YGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIA 219
              ++  IR+  +L RI L Y+   +I    ++R               +Q+ W  G   
Sbjct: 119 ---ELSQIRFPSVLGRIGLAYMFACIIYVYASQR---------------WQYVWFSG--- 157

Query: 220 FVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCG-MRGHLGPACNAVGYVDRELWGINHL 278
               ++  Y L              + K+    G + G L    N   +VDR L      
Sbjct: 158 ----LLIGYYL--------------IMKFNAAPGFVAGDLTIQGNFASWVDRML------ 193

Query: 279 YSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI 338
                           P    L+           +PEGL STI AI +G +GI  G+ L 
Sbjct: 194 ---------------VPGRLHLKIH---------DPEGLFSTIPAIGTGLLGIFAGNYLK 229

Query: 339 HFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +    +   K       GL+ + I   +    PINK L++ S++ +  G
Sbjct: 230 NVTTKTPTQKAATLAILGLVCLIIGGSWGIVFPINKNLWTSSFMLYAGG 278


>gi|383119755|ref|ZP_09940493.1| hypothetical protein BSHG_3425 [Bacteroides sp. 3_2_5]
 gi|423252290|ref|ZP_17233284.1| hypothetical protein HMPREF1066_04294 [Bacteroides fragilis
           CL03T00C08]
 gi|423252861|ref|ZP_17233792.1| hypothetical protein HMPREF1067_00436 [Bacteroides fragilis
           CL03T12C07]
 gi|251944624|gb|EES85099.1| hypothetical protein BSHG_3425 [Bacteroides sp. 3_2_5]
 gi|392647563|gb|EIY41262.1| hypothetical protein HMPREF1066_04294 [Bacteroides fragilis
           CL03T00C08]
 gi|392659230|gb|EIY52856.1| hypothetical protein HMPREF1067_00436 [Bacteroides fragilis
           CL03T12C07]
          Length = 375

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 85/205 (41%), Gaps = 44/205 (21%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYA-------------RIDHSPWNGCTLADFVMP 106
           S R+A+LD  RG  + L++         A             + DH  W G    D VMP
Sbjct: 9   SPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMP 68

Query: 107 FFLFIVGVAIALALKKV---PKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
            FLF+ G ++  +  K    P      +KII R + L  +G+I+QG           G+D
Sbjct: 69  LFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL--------LGLD 120

Query: 164 MKHIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
            KH+  +   LQ IA  Y++ A+I+                 +   ++WQ I   +  +I
Sbjct: 121 PKHLYLYSNTLQAIATGYLIAAIIQ-----------------LHCNFRWQLIVTALLLLI 163

Query: 223 YIITTYSL--YVPNWSFSEHSDHGV 245
           Y I    L  + P  +F+E  D  V
Sbjct: 164 YWIPMTFLGDFTPEGNFAEKVDRLV 188


>gi|336411649|ref|ZP_08592112.1| hypothetical protein HMPREF1018_04130 [Bacteroides sp. 2_1_56FAA]
 gi|335941083|gb|EGN02943.1| hypothetical protein HMPREF1018_04130 [Bacteroides sp. 2_1_56FAA]
          Length = 375

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 85/205 (41%), Gaps = 44/205 (21%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYA-------------RIDHSPWNGCTLADFVMP 106
           S R+A+LD  RG  + L++         A             + DH  W G    D VMP
Sbjct: 9   SPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMP 68

Query: 107 FFLFIVGVAIALALKKV---PKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
            FLF+ G ++  +  K    P      +KII R + L  +G+I+QG           G+D
Sbjct: 69  LFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL--------LGLD 120

Query: 164 MKHIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
            KH+  +   LQ IA  Y++ A+I+                 +   ++WQ I   +  +I
Sbjct: 121 PKHLYLYSNTLQAIATGYLIAAIIQ-----------------LHCNFRWQLIVTALLLLI 163

Query: 223 YIITTYSL--YVPNWSFSEHSDHGV 245
           Y I    L  + P  +F+E  D  V
Sbjct: 164 YWIPMTFLGDFTPEGNFAEKVDRLV 188


>gi|374312990|ref|YP_005059420.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358755000|gb|AEU38390.1| protein of unknown function DUF1624 [Granulicella mallensis
           MP5ACTX8]
          Length = 408

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 57/105 (54%), Gaps = 17/105 (16%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPW---------NGCTLADFVMPFFL 109
           ++ RVA++D FRGLT+ +MI V+D  G    +   PW         +  T  D V PFFL
Sbjct: 14  RTTRVASIDIFRGLTMAIMIFVNDLDG----VQGLPWWTHHAKANIDVMTYVDMVFPFFL 69

Query: 110 FIVGVAIALA----LKKVPKINGAVKKIIFRTLKLLFWGIILQGG 150
           FI+G+++ LA    LKK P I      ++ R++ L+  G+IL   
Sbjct: 70  FIIGLSMPLAIRQRLKKNPSIPQLWLHVLIRSVSLVALGVILANA 114


>gi|343086706|ref|YP_004776001.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342355240|gb|AEL27770.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 368

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 53/103 (51%), Gaps = 16/103 (15%)

Query: 53  QQLLQQKS-KRVATLDAFRGLTVVLMI------------LVDDAGG---AYARIDHSPWN 96
             +LQ  S KR+ +LDA+RG+T+ L++            L  +  G    + +  H PWN
Sbjct: 1   MTVLQSASNKRLVSLDAYRGITMFLLVAESARLYGAFEGLFPEVSGWQMFFTQFTHHPWN 60

Query: 97  GCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK 139
           G    D + PFF+FIVGVA+  +L K  +  G  +K+    LK
Sbjct: 61  GLRFWDLIQPFFMFIVGVAMPFSLNKRLEKQGDRRKVTLHILK 103


>gi|410657728|ref|YP_006910099.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp.
           DCA]
 gi|409020083|gb|AFV02114.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp.
           DCA]
          Length = 370

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 76/352 (21%), Positives = 138/352 (39%), Gaps = 79/352 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAY--ARIDHSPWNGCTLADFVMPFFLFIVG 113
           +++ + R+  LD  R L+V+L+ L     G    A I H+PW G T  DF  P F+ + G
Sbjct: 1   MKRITNRIKALDFARALSVLLLFLTFVPEGPLYGAYITHAPWFGYTAIDFAFPAFVTLSG 60

Query: 114 VAIALALKK-VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
            ++A+A +K VP +     ++I R   L+  G+I           +S+   +  +R+ G+
Sbjct: 61  TSMAIAYRKHVPWV-----RLIRRFFVLIIIGLIFN-------SLVSWEFHLSQLRFTGV 108

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQ +A   ++  LI  ++ K                + W +  G +    Y+        
Sbjct: 109 LQVLAFTGIMTTLITRVSGK----------------WFWPFTAGLLILAAYL-------- 144

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                      G+  Y  +        P  N  G +D  ++  +HLY             
Sbjct: 145 -----------GILLYTSQSFPGSLPSPDHNLSGMIDPFIFTKSHLYVH----------- 182

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                            A ++PEG+ +  SAI S   G   G  L      +        
Sbjct: 183 ---------------GDAGYDPEGICTLFSAIASTLFGYTAGLFL---NNKNIGRNFLKI 224

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEM 404
           +    +++ +    +N IPI K+L++ S+V  ++GA    L   + I+ P++
Sbjct: 225 LALAAVLLLLTPLLSNFIPIGKRLWTPSFVTLSSGATILVLAFAHLIWDPQI 276


>gi|167623085|ref|YP_001673379.1| hypothetical protein Shal_1151 [Shewanella halifaxensis HAW-EB4]
 gi|167353107|gb|ABZ75720.1| conserved hypothetical protein [Shewanella halifaxensis HAW-EB4]
          Length = 398

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/161 (30%), Positives = 78/161 (48%), Gaps = 24/161 (14%)

Query: 45  EQKGELQLQQLLQQKSK-RVATLDAFRGLTV-----------VLMILVDDAGGAY--ARI 90
           E   +LQ Q + + ++K R+ +LDA RG  +            L++L   AG  +   ++
Sbjct: 14  ESHLKLQTQSIAKSEAKPRLKSLDALRGFDMFWILGGEAIFAALLVLTGWAGFKWFDGQM 73

Query: 91  DHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAV-----KKIIFRTLKLLFWGI 145
            HS WNG T  D + P F+F+ GVA+ L+ K++ K+         K  I R   LL +G+
Sbjct: 74  HHSVWNGFTFYDLIFPLFIFLSGVALGLSPKRLDKLPLPPRLPLYKHAIKRLFLLLLFGV 133

Query: 146 ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI 186
           I   G+       S+ V    IR+  +L RIA  +   AL+
Sbjct: 134 IYNHGWGTGA---SFAVG--DIRYASVLGRIAFAWFFCALL 169



 Score = 47.0 bits (110), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 49/84 (58%), Gaps = 4/84 (4%)

Query: 306 SWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--HFKGHSARLKHWVSMGFGLLIIAII 363
           ++  A  +PEG+LSTI A+++G  G+  GH ++  H KG   +L   V    G+ ++A+ 
Sbjct: 225 TYQNAAVDPEGILSTIPAVVNGLFGVFVGHFIVKPHVKGEWFKLA--VLAASGVALVALG 282

Query: 364 LHFTNAIPINKQLYSFSYVCFTAG 387
              +  IP+NK L++ S+   ++G
Sbjct: 283 WLISPLIPVNKTLWTSSFTLVSSG 306


>gi|440747820|ref|ZP_20927075.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
 gi|436483562|gb|ELP39602.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
          Length = 372

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 81/356 (22%), Positives = 143/356 (40%), Gaps = 92/356 (25%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVD------DAGGAYARID-------HSPWNGC 98
           ++ LL+Q SKR+ ++DA RG  ++++   D      +   ++A +D       H  W G 
Sbjct: 1   MEILLKQPSKRLVSIDALRGFDMLMICGADAFFRSLEGKTSFAWVDVLARQFEHPEWIGF 60

Query: 99  TLADFVMPFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHA 154
           T  DF+ P FLF+ GV+I  +L K    N +     KK + RTL L+  G++ +    +A
Sbjct: 61  TFYDFIFPLFLFVAGVSIPFSLGKSLAENVSKREIYKKALSRTLLLIGLGMLDK----NA 116

Query: 155 PDALSYGVDMKHIRWCGILQRIALV-YVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQW 213
           P       D + IR   +L RI +  +V V L      ++R  ++               
Sbjct: 117 PFPF---FDWEQIRLGSVLGRIGIAGFVTVFLFLNFPARQRLGIVG-------------- 159

Query: 214 IGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELW 273
               +  + Y    + + VP +                    G+L    N  G++DR   
Sbjct: 160 ----LVLIAYYAAVFLIPVPGFG------------------AGNLSFEGNLAGWIDRTF- 196

Query: 274 GINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHY 333
                    +  RL                     +  F+  G+L+T  AI    +G   
Sbjct: 197 ---------LPGRL--------------------LQGSFDELGILTTFPAICLTILGAQA 227

Query: 334 GHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
           G +L +    S + K   ++ FG++ I + L +    PI K++++ S++   AG A
Sbjct: 228 GEILRN-AQLSEQQKVVRTLLFGVVCIGLALIWHLHFPIFKRMWTSSFILLNAGMA 282


>gi|375360501|ref|YP_005113273.1| putative transmembrane protein [Bacteroides fragilis 638R]
 gi|301165182|emb|CBW24752.1| putative transmembrane protein [Bacteroides fragilis 638R]
          Length = 373

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 85/203 (41%), Gaps = 42/203 (20%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYA-------------RIDHSPWNGCTLADFVMP 106
           S R+A+LD  RG  + L++         A             + DH  W G    D VMP
Sbjct: 9   SPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMP 68

Query: 107 FFLFIVGVAIALA-LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMK 165
            FLF+ G ++  +  K  P      +KII R + L  +G+I+QG           G+D K
Sbjct: 69  LFLFMTGASMPFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL--------LGLDPK 120

Query: 166 HIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYI 224
           H+  +   LQ IA  Y++ A+I+                 +   ++WQ I   +  +IY 
Sbjct: 121 HLYLYSNTLQAIATGYLIAAIIQ-----------------LHCNFRWQLIVTALLLLIYW 163

Query: 225 ITTYSL--YVPNWSFSEHSDHGV 245
           I    L  + P  +F+E  D  V
Sbjct: 164 IPMTFLGDFTPEGNFAEKVDRLV 186


>gi|393788826|ref|ZP_10376952.1| hypothetical protein HMPREF1068_03232 [Bacteroides nordii
           CL02T12C05]
 gi|392653932|gb|EIY47582.1| hypothetical protein HMPREF1068_03232 [Bacteroides nordii
           CL02T12C05]
          Length = 376

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 69/154 (44%), Gaps = 26/154 (16%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGC 98
           + ++ +    R+A+LD  RG  + L++       A AR              DH  W G 
Sbjct: 1   MNKVPENDFTRLASLDILRGFDLFLLVFFQPVFVALARQLNLPFLDEVLYQFDHEVWEGF 60

Query: 99  TLADFVMPFFLFIVGVAIALALKKVPKIN----GAVKKIIFRTLKLLFWGIILQGGYSHA 154
              D VMP FLF+ G ++  +L K    +       ++I+ R + L  +G+I+QG     
Sbjct: 61  RFWDLVMPLFLFMTGASMPFSLSKYKTASVDYWPVYRRILKRVILLFIFGMIVQGN---- 116

Query: 155 PDALSYGVDMKHIR-WCGILQRIALVYVVVALIE 187
                 G D KHI  +   LQ IA+ Y + A+I+
Sbjct: 117 ----LLGFDSKHIYFYSNTLQSIAVGYFIAAVIQ 146


>gi|410660784|ref|YP_006913155.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp. CF]
 gi|409023140|gb|AFV05170.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp. CF]
          Length = 368

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 77/352 (21%), Positives = 137/352 (38%), Gaps = 79/352 (22%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAY--ARIDHSPWNGCTLADFVMPFFLFIVG 113
           +++ + R+  LD  R L+V+L+ L     G    A I H+PW G T  DF  P F+ + G
Sbjct: 1   MKRITNRIKALDFARALSVLLLFLTFVPEGPLYGAYITHAPWFGYTAIDFAFPAFVTLSG 60

Query: 114 VAIALALKK-VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGI 172
            ++A+  +K VP +     ++I R   LL  G+I           +S+   +  +R+ G+
Sbjct: 61  TSMAIVYRKHVPWV-----RLIRRFFVLLIIGLIFN-------SLVSWEFQLSELRFTGV 108

Query: 173 LQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYV 232
           LQ +A   ++ ALI  ++ K                + W +  G +    Y+        
Sbjct: 109 LQVLAFTGIMTALITRVSGK----------------WFWPFTAGLLILAAYL-------- 144

Query: 233 PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL 292
                      G+  Y  +        P  N  G  D  ++  +HLY             
Sbjct: 145 -----------GILLYTSQSFPGSLPSPDHNLSGMTDPFIFTKSHLYVH----------- 182

Query: 293 SSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVS 352
                            A ++PEG+ +  SAI S   G   G  L      +        
Sbjct: 183 ---------------GDAGYDPEGICTLFSAIASTLFGYTAGLFL---NNKNIGRDFLKI 224

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEM 404
           +    +++ +    +N IPI K+L++ S+V  ++GA    L   + I+ P++
Sbjct: 225 LALAAVLLLLTPLLSNFIPIGKRLWTPSFVTLSSGATILILAFAHLIWDPQI 276


>gi|320333679|ref|YP_004170390.1| hypothetical protein [Deinococcus maricopensis DSM 21211]
 gi|319754968|gb|ADV66725.1| hypothetical protein Deima_1072 [Deinococcus maricopensis DSM
           21211]
          Length = 376

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 51/87 (58%), Gaps = 8/87 (9%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAY---ARIDHSPW-NGCTLADFVMPFFLFIVGV 114
           +  R+A LDA+RGLTV+LM+LV++    +     + H+PW  G TLAD V P+FLF  G 
Sbjct: 24  RGARLAALDAWRGLTVLLMLLVNNVALDWRTPKELMHAPWGGGATLADLVFPWFLFCAGT 83

Query: 115 AIALALKKVPKIN----GAVKKIIFRT 137
           A+  +L    +        V+K++ RT
Sbjct: 84  ALPFSLASARRAGVRGWALVRKLLTRT 110


>gi|408382946|ref|ZP_11180487.1| heparan-alpha-glucosaminide N-acetyltransferase [Methanobacterium
           formicicum DSM 3637]
 gi|407814484|gb|EKF85111.1| heparan-alpha-glucosaminide N-acetyltransferase [Methanobacterium
           formicicum DSM 3637]
          Length = 382

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 47/130 (36%), Positives = 63/130 (48%), Gaps = 25/130 (19%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARI-----DHSPWNGCTLADFVMPFFLFIVG 113
           K +RV +LD FRGL V  MI V+    A++       +H+ WNG T AD V P FLFIVG
Sbjct: 20  KKRRVISLDVFRGLAVAAMIFVNAM--AFSEFTPGIFEHATWNGLTFADLVFPSFLFIVG 77

Query: 114 VAIALALKKVPKINGAVKK-----IIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           V++A +     K +   K+      +FR   L   G+ L    S          D   +R
Sbjct: 78  VSMAYSFAARSKNS---KRDLWGHFLFRVGALFTIGVALNWFTS----------DFSMVR 124

Query: 169 WCGILQRIAL 178
             G+LQ IAL
Sbjct: 125 IPGVLQLIAL 134


>gi|60683670|ref|YP_213814.1| hypothetical protein BF4252 [Bacteroides fragilis NCTC 9343]
 gi|423259842|ref|ZP_17240765.1| hypothetical protein HMPREF1055_03042 [Bacteroides fragilis
           CL07T00C01]
 gi|423267497|ref|ZP_17246478.1| hypothetical protein HMPREF1056_04165 [Bacteroides fragilis
           CL07T12C05]
 gi|60495104|emb|CAH09923.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
 gi|387775880|gb|EIK37984.1| hypothetical protein HMPREF1055_03042 [Bacteroides fragilis
           CL07T00C01]
 gi|392696971|gb|EIY90158.1| hypothetical protein HMPREF1056_04165 [Bacteroides fragilis
           CL07T12C05]
          Length = 375

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 54/205 (26%), Positives = 85/205 (41%), Gaps = 44/205 (21%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYA-------------RIDHSPWNGCTLADFVMP 106
           + R+A+LD  RG  + L++         A             + DH  W G    D VMP
Sbjct: 9   APRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMP 68

Query: 107 FFLFIVGVAIALALKKV---PKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
            FLF+ G ++  +  K    P      +KII R + L  +G+I+QG           G+D
Sbjct: 69  LFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL--------LGLD 120

Query: 164 MKHIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
            KH+  +   LQ IA  Y++ A+I+                 +   ++WQ I   +  +I
Sbjct: 121 PKHLYLYSNTLQAIATGYLIAAIIQ-----------------LHCNFRWQLIVTALLLLI 163

Query: 223 YIITTYSL--YVPNWSFSEHSDHGV 245
           Y I    L  + P  +F+E  D  V
Sbjct: 164 YWIPMTFLGDFTPEGNFAEKVDRLV 188


>gi|255037955|ref|YP_003088576.1| hypothetical protein Dfer_4208 [Dyadobacter fermentans DSM 18053]
 gi|254950711|gb|ACT95411.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 371

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 87/382 (22%), Positives = 144/382 (37%), Gaps = 105/382 (27%)

Query: 60  SKRVATLDAFRGLTV--------VLMILVDDAGGAYA-----RIDHSPWNGCTLADFVMP 106
           S+R+ +LD  RG  +        +  +L    G ++A     +  H  WNG    D + P
Sbjct: 7   SQRLLSLDTLRGFDMFWISGGEEIFHVLAKVTGWSWAIVLAHQFTHPDWNGFRAYDLIFP 66

Query: 107 FFLFIVGVAIALAL-----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG 161
            FLF+ GV+   +L     K VP  +  V+K+I R + L+F GII   G         + 
Sbjct: 67  TFLFMAGVSTPFSLGSRLEKGVPP-SQLVRKVIQRGIILVFLGIIYNNG--------IFE 117

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFV 221
            +   +R+  +L RI L  +   +I      R                +W W GG     
Sbjct: 118 TEWSQMRYPSVLARIGLAGMFAQIIYLYFGYRA---------------RWIWFGG----- 157

Query: 222 IYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSD 281
             ++  Y L++  +             +  CG  G L   CN   Y D            
Sbjct: 158 --LLIGYYLFMMFYP------------VPGCG-AGLLTIDCNPASYFD------------ 190

Query: 282 PVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI--- 338
                    +L  P    L            +PEGL+STI AI +G +GI  G +L    
Sbjct: 191 ---------SLIIPGRLHLTIH---------DPEGLVSTIPAIATGLMGIFAGELLRTSH 232

Query: 339 HFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNS 398
                 +++ + V  G   L++ ++  +    PINK L++ S+V         C G  ++
Sbjct: 233 EVLSQKSKVIYLVFAGVVSLLVCLVWDYF--FPINKNLWTSSFVL--------CAGGFST 282

Query: 399 IFIPEMDWDECDACFRAWSTRY 420
           + +    W      +R W+  +
Sbjct: 283 LLLALFYWIVDVLNYRKWTLFF 304


>gi|383753678|ref|YP_005432581.1| hypothetical protein SELR_08500 [Selenomonas ruminantium subsp.
           lactilytica TAM6421]
 gi|381365730|dbj|BAL82558.1| hypothetical protein SELR_08500 [Selenomonas ruminantium subsp.
           lactilytica TAM6421]
          Length = 384

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 71/136 (52%), Gaps = 14/136 (10%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVD---DAGGAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           +  +R+A +D FRGL + +M+LV+   +   A+  + H+PW G T+AD   P F+FI+GV
Sbjct: 6   KNKRRLAAIDIFRGLAIAIMLLVNALPNFEQAWPLLVHAPWAGLTIADLAFPGFVFIMGV 65

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLF----------WGIILQGGYSHAPDALSYGVDM 164
           + +L   K  + +G+ +K      + L           + ++LQ  +   P        +
Sbjct: 66  SASLWFPKHEQ-DGSGEKFCIILKRSLLLILLGFFLCQFPLVLQHVFQPEPGGSLIKDIV 124

Query: 165 KHIRWCGILQRIALVY 180
           +H R  G+LQR+ LVY
Sbjct: 125 EHGRIPGVLQRLGLVY 140


>gi|442611023|ref|ZP_21025729.1| N-acetylglucosamine related transporter, NagX [Pseudoalteromonas
           luteoviolacea B = ATCC 29581]
 gi|441746951|emb|CCQ11791.1| N-acetylglucosamine related transporter, NagX [Pseudoalteromonas
           luteoviolacea B = ATCC 29581]
          Length = 373

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 84/352 (23%), Positives = 140/352 (39%), Gaps = 91/352 (25%)

Query: 56  LQQKSKRVATLDAFRGLTVV-----------LMILVDDAGGAY--ARIDHSPWNGCTLAD 102
           +    KR+A+LDA RG  ++           L +L    G     A   HS W+G T  D
Sbjct: 1   MTTTKKRLASLDALRGFDMMWILGGQGIFAALFVLTGWTGWRTFEAHTVHSDWHGFTFYD 60

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRT-----LKLLFWGIILQGGYSHAPDA 157
            + P F+F+ GVA+ L+ K++  +  + +  I+R      + L   GI+   G+     A
Sbjct: 61  LIFPLFIFLSGVAMGLSPKRIDHLPMSERTPIYRKALKRFVLLCLLGILYNHGWGTGIPA 120

Query: 158 LSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
                D   IR+  +L RIA  +++ AL+    + ++   +    LSI   Y W W+   
Sbjct: 121 -----DFSEIRYSSVLGRIAFAWLICALLVWHFSLKQVAYIG---LSILITY-WIWL--- 168

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
                        Y+P                V  G  G L    +   ++D+     N 
Sbjct: 169 ------------CYIP----------------VPGGDAGDLSIGGSWNAWIDQ-----NF 195

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL 337
           L                   G   +D P       +PEG+LS++ A+++   G+  G  L
Sbjct: 196 L------------------PGVRYQDRPV------DPEGILSSLPAVVNAIAGLFAGQ-L 230

Query: 338 IHFKGHSARLKHWVSMGF--GLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           I         K  V++ F  G+L IA+   +    P+NK+L++ S+   T G
Sbjct: 231 IKRAPEKGEWKC-VALLFSGGVLFIALGWLWDLVFPVNKELWTSSFTLVTIG 281


>gi|343087500|ref|YP_004776795.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342356034|gb|AEL28564.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 386

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 83/348 (23%), Positives = 133/348 (38%), Gaps = 95/348 (27%)

Query: 61  KRVATLDAFRGLTVVL-------MILVDDAGGAYA------RIDHSPWNGCTLADFVMPF 107
           KR+ ++DA RG  ++L       ++L+    G  A      +  H  WNG +  DF+ P 
Sbjct: 25  KRLLSIDALRGFDMLLIAGAGAFLVLLKGKTGIPAIDWIAGQFYHPAWNGFSFYDFIFPL 84

Query: 108 FLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
           FLFI GV++  +L K   +  +     KK   R L L+  GI+    Y ++P  +    +
Sbjct: 85  FLFIAGVSLTFSLNKGRNLGMSKPTLYKKTFSRMLVLILLGIL----YKNSPVPI---FE 137

Query: 164 MKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIY 223
              IR+  +L RI +   V  L+         N    + L I  A            V+Y
Sbjct: 138 PSQIRYGSVLGRIGIATFVTTLVYL-------NFDFYKRLGIAMA----------ILVLY 180

Query: 224 IITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPV 283
               + + VP +   + S  G                  N VG+ DR             
Sbjct: 181 YAALFLIPVPGYGAGDLSIEG------------------NLVGWFDRTF----------- 211

Query: 284 WSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH 343
                         G L+++        ++  GLL+ I A+     G   G +L      
Sbjct: 212 ------------MPGILKQEI-------YDELGLLTQIPALCLTIFGTLAGEILTKAWLD 252

Query: 344 SARLKHWVSMGFGLLIIAII--LHFTNAIPINKQLYSFSYVCFTAGAA 389
           + ++K     G   L + +I  LHF    PINK L+S S++  T+G A
Sbjct: 253 TKKIKQLAIAGVISLTLGLIWDLHF----PINKHLWSSSFILLTSGMA 296


>gi|456861512|gb|EMF80162.1| hypothetical protein LEP1GSC188_2620 [Leptospira weilii serovar
           Topaz str. LT2116]
          Length = 88

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 38/60 (63%), Gaps = 3/60 (5%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFF 108
           +++   Q   R+ +LD FRG+TV+ MILV++ G     Y+ + H+ WNGCT  D V PFF
Sbjct: 1   MEKKSTQNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAKWNGCTPTDLVFPFF 60


>gi|313225183|emb|CBY20977.1| unnamed protein product [Oikopleura dioica]
          Length = 335

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 58/118 (49%), Gaps = 27/118 (22%)

Query: 70  RGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGA 129
           RG+ + +MI V+  GG Y   DH+ W G T+AD  MP+F+F++GV++  +       N  
Sbjct: 2   RGIAIGIMIFVNYGGGGYWFFDHAVWFGLTVADLAMPWFMFMMGVSLTFSF------NSM 55

Query: 130 VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIE 187
           VKK++  +  L                     V+     + G+LQR A+ Y VV+ ++
Sbjct: 56  VKKVLRLSYNL---------------------VNPTFGTFPGVLQRFAICYAVVSPLQ 92


>gi|223936398|ref|ZP_03628310.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223894916|gb|EEF61365.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 427

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 47/150 (31%), Positives = 66/150 (44%), Gaps = 27/150 (18%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARI-------------DHSPWNGCTLADFVMPF 107
           +R+ ++DA RG  +  +I  D    A  R+             DH  W G    D + P 
Sbjct: 24  QRLMSVDALRGFDMFWIIGADSLVYALHRLSQNRVTDFLGLQLDHCDWAGFHFYDLIFPL 83

Query: 108 FLFIVGVAIALALKKVPKING---AVKKIIFRTLKLLFWGIILQGGYSHA-PDALSYGVD 163
           F+FI+GV++  +L K  +  G   AVK++  R+  L    +I  GG   A PD       
Sbjct: 84  FVFIMGVSVVFSLTKAIQQLGRAEAVKRVFRRSALLFVVALIYSGGVRSAWPD------- 136

Query: 164 MKHIRWCGILQRIALVYVVVALIETLTTKR 193
              IR  G+L RIAL Y V  LI      R
Sbjct: 137 ---IRLLGVLNRIALCYFVGGLIFCFFKPR 163


>gi|223937685|ref|ZP_03629587.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223893657|gb|EEF60116.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 413

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 76/348 (21%), Positives = 132/348 (37%), Gaps = 72/348 (20%)

Query: 58  QKSKRVATLDAFRGLTVVLM---------ILVDDAGGAY-----ARIDHSPWNGCTLADF 103
           +K+ R+ +LDA+RG  ++LM         +   +   ++      + +H  W GC L D 
Sbjct: 25  EKATRLISLDAYRGFVMLLMASEGFNMWRMAEQNPNSSFWQFLKYQTEHVDWRGCALWDL 84

Query: 104 VMPFFLFIVGVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS 159
           + P F+F+VGVA+  +L     K    N  +   ++R++ L+F GI L+          S
Sbjct: 85  IQPSFMFMVGVAMPFSLASRRAKGQSFNTMLGHTLWRSIALVFIGIFLR----------S 134

Query: 160 YGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIA 219
            G    +  +  +L +I L Y  + L+    TK R           FTA     +G + A
Sbjct: 135 VGRHQTYFTFEDVLTQIGLGYTFLFLLA--WTKLRVQ---------FTAAMLILVGYWAA 183

Query: 220 FVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLY 279
           F +Y +              +  H         G   H     N    VD+  W +N   
Sbjct: 184 FALYPLPVNDFDYQKVGIPANWHH-------LTGFAAHWDKNTNLAAAVDQ--WFLNLF- 233

Query: 280 SDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIH 339
                          P   P   +   +    F P   L+T+        G+  G  +  
Sbjct: 234 ---------------PREHPFVFNGGGYLTLSFVPS--LATM------IFGLLAGQFMRE 270

Query: 340 FKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
               S++++  V  G   L +  +L  T   P  K++++ S+V F+ G
Sbjct: 271 QSTQSSKVRLLVGAGIACLALGAVLDMTGICPSVKRIWTPSWVIFSTG 318


>gi|423280893|ref|ZP_17259805.1| hypothetical protein HMPREF1203_04022 [Bacteroides fragilis HMW
           610]
 gi|404583534|gb|EKA88212.1| hypothetical protein HMPREF1203_04022 [Bacteroides fragilis HMW
           610]
          Length = 375

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 47/174 (27%), Positives = 76/174 (43%), Gaps = 25/174 (14%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYA-------------RIDHSPWNGC 98
           + + L     R+A+LD  RG  + L++       A A             + DH  W G 
Sbjct: 1   MNKPLSTTPPRLASLDILRGFDLFLLVFFQPVLWALAHQLNAPWLNSILSQFDHEVWEGF 60

Query: 99  TLADFVMPFFLFIVGVAIALALKKV---PKINGAVKKIIFRTLKLLFWGIILQGGYSHAP 155
              D VMP FLF+ G ++  +  K    P      +KII R + L  +G+I+QG      
Sbjct: 61  RFWDLVMPLFLFMTGASMPFSFSKFKDDPDKGTIYRKIIRRFILLFIFGMIVQGNL---- 116

Query: 156 DALSYGVDMKHIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTA 208
                G++ K++  +   LQ IA  Y++ A+I    + RR  ++    L I+ A
Sbjct: 117 ----LGLNPKYLYLYSNTLQAIATGYLIAAIILLHCSFRRQLIVTALLLLIYWA 166


>gi|424665544|ref|ZP_18102580.1| hypothetical protein HMPREF1205_01419 [Bacteroides fragilis HMW
           616]
 gi|404574617|gb|EKA79366.1| hypothetical protein HMPREF1205_01419 [Bacteroides fragilis HMW
           616]
          Length = 375

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/153 (27%), Positives = 69/153 (45%), Gaps = 25/153 (16%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYA-------------RIDHSPWNGC 98
           + + L   S R+A+LD  RG  + L++       A A             + DH  W G 
Sbjct: 1   MNKPLSSPSPRLASLDILRGFDLFLLVFFQPVLWALAHQLNLPWLNSILFQFDHEVWEGF 60

Query: 99  TLADFVMPFFLFIVGVAIALALKKV---PKINGAVKKIIFRTLKLLFWGIILQGGYSHAP 155
              D VMP FLF+ G ++  +  K    P      +KI+ R + L  +G+I+QG      
Sbjct: 61  RFWDLVMPLFLFMTGASMPFSFSKFKDDPDKGPIYRKILKRFILLFIFGMIVQGNL---- 116

Query: 156 DALSYGVDMKHIR-WCGILQRIALVYVVVALIE 187
                G++ K++  +   LQ IA  Y++ A+I+
Sbjct: 117 ----LGLNPKYLYLYSNTLQAIATGYLIAAIIQ 145


>gi|313149262|ref|ZP_07811455.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|313138029|gb|EFR55389.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
          Length = 375

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/174 (27%), Positives = 76/174 (43%), Gaps = 25/174 (14%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYA-------------RIDHSPWNGC 98
           + + L     R+A+LD  RG  + L++       A A             + DH  W G 
Sbjct: 1   MNKPLSTTPPRLASLDILRGFDLFLLVFFQPVLWALAHQLNAPWLNSILSQFDHEVWEGF 60

Query: 99  TLADFVMPFFLFIVGVAIALALKKV---PKINGAVKKIIFRTLKLLFWGIILQGGYSHAP 155
              D VMP FLF+ G ++  +  K    P      +KII R + L  +G+I+QG      
Sbjct: 61  RFWDLVMPLFLFMTGASMPFSFSKFKDDPDKGPIYRKIIRRFILLFIFGMIVQGNL---- 116

Query: 156 DALSYGVDMKHIR-WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTA 208
                G++ K++  +   LQ IA  Y++ A+I    + RR  ++    L I+ A
Sbjct: 117 ----LGLNPKYLYLYSNTLQAIATGYLIAAIILLHCSFRRQLIVTALLLLIYWA 166


>gi|224027055|ref|ZP_03645421.1| hypothetical protein BACCOPRO_03816, partial [Bacteroides
           coprophilus DSM 18228]
 gi|224020291|gb|EEF78289.1| hypothetical protein BACCOPRO_03816 [Bacteroides coprophilus DSM
           18228]
          Length = 373

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 80/352 (22%), Positives = 131/352 (37%), Gaps = 100/352 (28%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYA----------------RIDHSPWNGCTL 100
           + KS+R+ +LD  RG     M  +   G  +A                +++H PW+G   
Sbjct: 9   EVKSQRLQSLDVLRGFD---MFFIMGGGALFAGLATCCPIPFFQAIARQMEHVPWHGVAF 65

Query: 101 ADFVMPFFLFIVGVAIALALKKVPKIN----GAVKKIIFRTLKLLFWGIILQGGYSHAPD 156
            D + P FLFI G++   +L+K            +K+I R L L+  G I  G       
Sbjct: 66  EDMIFPLFLFIAGISFPYSLEKQKACGMSSAAIYRKVIRRGLVLVLLGCIYNGLLDF--- 122

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALI-ETLTTKRRPNVLEPRHLSIFTAYQWQWIG 215
                 D  H+R+  +L RI L ++  AL+   + T+ R  V+      +F  Y W  + 
Sbjct: 123 ------DFAHLRYASVLGRIGLSWMFAALLFLNVRTRVRMGVVA----LLFIGY-WALLA 171

Query: 216 GFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGI 275
            F A             P+ +    S  G                  + VGYVDR ++  
Sbjct: 172 CFPA-------------PDGNGDPFSMEG------------------SLVGYVDR-MFLP 199

Query: 276 NHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGH 335
             LY                                 +PEGLL  I A+ + ++G+  G 
Sbjct: 200 GQLY-----------------------------LGIHDPEGLLGIIPAVGTASLGMLTGE 230

Query: 336 VLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
             I  +G     K  +    G++++ +   +    PINK L++ S+ C   G
Sbjct: 231 -WIKREGLPELRKVVLLACAGVMLVVVGWIWDLVFPINKNLWTSSFACLVGG 281


>gi|170725675|ref|YP_001759701.1| hypothetical protein Swoo_1314 [Shewanella woodyi ATCC 51908]
 gi|169811022|gb|ACA85606.1| conserved hypothetical protein [Shewanella woodyi ATCC 51908]
          Length = 378

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 88/354 (24%), Positives = 143/354 (40%), Gaps = 97/354 (27%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDA--------------GGAYARIDHSPWNGCTLAD 102
           +   +R+ +LDA RG  +   IL  +A                A A++ HS W+G T  D
Sbjct: 7   KTPKRRLMSLDALRGFDM-FWILGGEALFAGLLLWTGWHGWQWADAQMHHSQWHGFTFYD 65

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIF-----RTLKLLFWGIILQGGY-SHAPD 156
            + P F+F+ GVA+ L+ K++ K+  A +  ++     R L LLF+G++   G+ + AP 
Sbjct: 66  LIFPLFIFLSGVALGLSPKRLDKLPMAQRMPLYKHSVKRLLLLLFFGVLYNHGWGTGAP- 124

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGG 216
                V +  +R+  +L RIA  +   A++    T  R  VL    +             
Sbjct: 125 -----VAIDEVRYASVLGRIAFAWFFAAML-VWHTSFRTQVLVTLGI------------- 165

Query: 217 FIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGIN 276
               V+Y +    + VP +     S  G     V      HL P    + Y +R L    
Sbjct: 166 ---LVLYGLAQLLIPVPGYGAGIFSPEGSINAYVDT----HLLP---GIAYQNRAL---- 211

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
               DP                                EG+LST+ AI++   G+  GH 
Sbjct: 212 ----DP--------------------------------EGILSTLPAIVNAMAGVFVGHF 235

Query: 337 LI--HFKGHSARLKHWVSMGF-GLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           ++  H KG   ++   V+M   G L++         IP+NK L++ S+V  T G
Sbjct: 236 IVKEHAKGEWYKV---VTMLIAGALVLGCGWLLNLVIPVNKDLWTSSFVLVTTG 286


>gi|440749360|ref|ZP_20928608.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
 gi|436482365|gb|ELP38488.1| N-acetylglucosamine related transporter, NagX [Mariniradius
           saccharolyticus AK6]
          Length = 401

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/98 (39%), Positives = 56/98 (57%), Gaps = 9/98 (9%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDA---GGAYARIDH--SPWNGCTLADFVMPFFLFI 111
               +RV ++D FR +T++LMI V+D     G  A + H  +  +G  LAD V P FL I
Sbjct: 7   NANPRRVYSIDVFRAITMMLMIFVNDLWTLEGIPAWLGHVDAKEDGMGLADVVFPAFLVI 66

Query: 112 VGVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGI 145
           VG++I  AL    +K  ++ G +K I FRTL LL  G+
Sbjct: 67  VGLSIPFALSKRIEKGERLAGTLKHIFFRTLALLTMGV 104


>gi|311746093|ref|ZP_07719878.1| hypothetical protein ALPR1_06685 [Algoriphagus sp. PR1]
 gi|126576311|gb|EAZ80589.1| hypothetical protein ALPR1_06685 [Algoriphagus sp. PR1]
          Length = 367

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 41/80 (51%), Gaps = 15/80 (18%)

Query: 58  QKSKRVATLDAFRGLTVVLMIL------------VDDAG---GAYARIDHSPWNGCTLAD 102
            KS R+ +LD FRGLT+ L+I               D G     + +  H PWNG    D
Sbjct: 6   SKSGRLVSLDTFRGLTMFLLIAEAAFVYESLLEAFPDPGILNSFFTQFTHHPWNGLRFWD 65

Query: 103 FVMPFFLFIVGVAIALALKK 122
            + PFF+FIVGVA+  +L K
Sbjct: 66  LIQPFFMFIVGVAMPFSLNK 85


>gi|289422375|ref|ZP_06424221.1| hypothetical protein HMPREF0631_1471 [Peptostreptococcus anaerobius
           653-L]
 gi|289157210|gb|EFD05829.1| hypothetical protein HMPREF0631_1471 [Peptostreptococcus anaerobius
           653-L]
          Length = 463

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 88/188 (46%), Gaps = 24/188 (12%)

Query: 17  VEQEQDDGKDSENGINK--EKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTV 74
           +++      D E+ I K      E+ +   E K +L ++     +  RV ++D  RGL V
Sbjct: 5   IKKTNMANNDEESWIRKIVYNYYEKLKSYYEYKADLDIKY----EYMRVQSIDYMRGLLV 60

Query: 75  VLMILVDDAGG----AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAV 130
           +L + + + G     +YA   +S WNG TL D ++P FL ++G +I   +KK  + N  +
Sbjct: 61  ILSMFMINQGLENQISYA-FQNSKWNGMTLNDILVPMFLLVIGSSIPFYVKKHYEENEDI 119

Query: 131 KKII----FRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI 186
           + I+     +++ +   G+I    Y  A D         ++R  G +Q +  VY++  L+
Sbjct: 120 RHIVKMSFIKSIIVFLIGLIFSCIYYPAND---------YVRLTGPIQMMVFVYIMSLLL 170

Query: 187 ETLTTKRR 194
                K R
Sbjct: 171 YIGFLKMR 178


>gi|291295418|ref|YP_003506816.1| hypothetical protein [Meiothermus ruber DSM 1279]
 gi|290470377|gb|ADD27796.1| conserved hypothetical protein [Meiothermus ruber DSM 1279]
          Length = 399

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/138 (33%), Positives = 67/138 (48%), Gaps = 18/138 (13%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGAYARID---HSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+  LD  RGLTV LM+LV++     A  D   H+P+ G TLAD V P+FLF +G AI  
Sbjct: 24  RLLALDGLRGLTVFLMLLVNNLALQEATPDQLVHAPFGGVTLADLVFPWFLFCMGAAIPY 83

Query: 119 ALKKVPK----INGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           A     K    +   + +I+ RT  +   G+ L    +  P           +   G+LQ
Sbjct: 84  AASSFDKQKLPLWRRLLRILRRTSLIFLLGLFLTSALARTP-----------VFALGVLQ 132

Query: 175 RIALVYVVVALIETLTTK 192
            IAL Y + AL   ++ +
Sbjct: 133 LIALAYCLAALFYLISPR 150


>gi|429727718|ref|ZP_19262477.1| hypothetical protein HMPREF9998_00424 [Peptostreptococcus
           anaerobius VPI 4330]
 gi|429151771|gb|EKX94625.1| hypothetical protein HMPREF9998_00424 [Peptostreptococcus
           anaerobius VPI 4330]
          Length = 463

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/188 (26%), Positives = 88/188 (46%), Gaps = 24/188 (12%)

Query: 17  VEQEQDDGKDSENGINK--EKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTV 74
           +++      D E+ I K      E+ +   E K +L  +     +  RV ++D  RGL V
Sbjct: 5   IKKTNMANNDEESWIRKIVYNYYEKLKSYYEYKADLDAKY----EDMRVQSIDYMRGLLV 60

Query: 75  VLMILVDDAGG----AYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAV 130
           +L + + + G     +YA   +S WNG TL D ++P FL ++G +I   +KK  + N  +
Sbjct: 61  ILSMFMINQGLENQISYA-FQNSKWNGMTLLDILVPMFLLVIGSSIPFYVKKHYEENEDL 119

Query: 131 KKII----FRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI 186
           + I+     +++ +   G+I    Y  A D         ++R  G +Q +A VY++  L+
Sbjct: 120 RHIVKMSFIKSIIVFVIGLIFSCIYYPAND---------YVRLTGPIQMMAFVYIMSLLL 170

Query: 187 ETLTTKRR 194
                K R
Sbjct: 171 YIGFLKMR 178


>gi|196233857|ref|ZP_03132695.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
 gi|196222051|gb|EDY16583.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
          Length = 437

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/187 (24%), Positives = 74/187 (39%), Gaps = 31/187 (16%)

Query: 27  SENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGA 86
           ++  ++   G    EV       + +     Q + R+ +LDA RG     M  +  AG  
Sbjct: 9   AKANLDAASGASLEEVAQIPSPNVPVSPETGQPAGRLVSLDALRGFD---MFWIVGAGAV 65

Query: 87  YARID----------------HSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAV 130
              +D                H  W G    D + P FLFI+G++I  +L K     G  
Sbjct: 66  IQSLDKMCRTPFTAGLAWQFKHVHWKGLHCYDVIFPLFLFIIGISIVFSLDKALATGGKA 125

Query: 131 K---KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIE 187
           +   ++  R++ L   G++  GG+              +++  G+L RIAL Y+  ALI 
Sbjct: 126 QVLTRVARRSVLLFALGVLYYGGFMK---------PWPNVQLGGVLPRIALCYLAAALIY 176

Query: 188 TLTTKRR 194
           T     R
Sbjct: 177 TFIRSTR 183


>gi|392402534|ref|YP_006439146.1| Protein of unknown function DUF2261, transmembrane [Turneriella
           parva DSM 21527]
 gi|390610488|gb|AFM11640.1| Protein of unknown function DUF2261, transmembrane [Turneriella
           parva DSM 21527]
          Length = 396

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/145 (31%), Positives = 68/145 (46%), Gaps = 28/145 (19%)

Query: 62  RVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R  +LD  RGLT+ LMI+V++ G     +A + H+ W+G   AD V P FLF+ G A AL
Sbjct: 2   RNFSLDLLRGLTIALMIIVNNPGDWKAMFAVLRHAEWHGFLGADIVFPLFLFVAGYAAAL 61

Query: 119 ALKKV--------PKINGAVK-----------KIIFRTLKLLFWGIILQG-GYSHAPDA- 157
            + ++        P    A+             ++ R   L   G+ L        PD  
Sbjct: 62  KIDRLYGPTTAGGPHCASALTLEERELPAYYLPLMRRAAILFLIGLFLNAWPLGLLPDTE 121

Query: 158 LSYGVDMKHIRWCGILQRIALVYVV 182
            S+G    H+R  G+LQRIA+  +V
Sbjct: 122 FSFG----HLRVLGVLQRIAICVLV 142


>gi|317503636|ref|ZP_07961655.1| conserved hypothetical protein, partial [Prevotella salivae DSM
           15606]
 gi|315665261|gb|EFV04909.1| conserved hypothetical protein [Prevotella salivae DSM 15606]
          Length = 59

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 24/59 (40%), Positives = 37/59 (62%), Gaps = 2/59 (3%)

Query: 55  LLQQKSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFI 111
           +  +K+ R+  +D  RG+T+  MILV++ GG   Y  ++H+ W G T  D V PFF+FI
Sbjct: 1   MTTKKTSRIEAVDILRGITIAGMILVNNPGGQPVYTPLEHAEWLGLTPTDLVFPFFMFI 59


>gi|317505448|ref|ZP_07963366.1| ABC superfamily ATP binding cassette transporter permease subunit
           [Prevotella salivae DSM 15606]
 gi|315663361|gb|EFV03110.1| ABC superfamily ATP binding cassette transporter permease subunit
           [Prevotella salivae DSM 15606]
          Length = 380

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 43/156 (27%), Positives = 75/156 (48%), Gaps = 27/156 (17%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDD-----------AGGA-----YARIDHSPWNGCTLA 101
           QK  R+ +LD  RG  + +++LV             A G        ++ H PW G    
Sbjct: 8   QKPNRLLSLDILRGADLAMLVLVQPILLKALETMQPAEGTVGHFIMGQLLHLPWEGFCFW 67

Query: 102 DFVMPFFLFIVGVAIALAL---KKVPKINGAV-KKIIFRTLKLLFWGIILQGGYSHAPDA 157
           D +MP F+F+ G+ I  A+   K+  +I+G+  ++I+ R + L   G++ QG      + 
Sbjct: 68  DIIMPLFMFMSGITIPFAMARYKRGERIDGSFYRRILKRFVVLWILGMVCQG------NL 121

Query: 158 LSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
           L++ +   H+ +   LQ IA+ YV VA +    + R
Sbjct: 122 LAFDLQQLHL-YSNTLQSIAVGYVAVAFLYVFCSLR 156


>gi|413922900|gb|AFW62832.1| hypothetical protein ZEAMMB73_935848 [Zea mays]
          Length = 1241

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 35/57 (61%)

Query: 250  VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPS 306
            VKCG+RG     CNAVG +DR++ GI HLY  PV++R +    ++  +   R  AP+
Sbjct: 969  VKCGVRGDTSSGCNAVGMIDRKILGIQHLYGRPVYARSKNYRKNTLAASSSRRKAPA 1025


>gi|456985620|gb|EMG21387.1| hypothetical protein LEP1GSC150_0590 [Leptospira interrogans
           serovar Copenhageni str. LT2050]
          Length = 77

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 25/50 (50%), Positives = 33/50 (66%), Gaps = 3/50 (6%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFF 108
           R+ +LD FRG+TV  MILV++ G     Y+ + H+ WNGCT  D V PFF
Sbjct: 9   RILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFF 58


>gi|255037019|ref|YP_003087640.1| hypothetical protein Dfer_3263 [Dyadobacter fermentans DSM 18053]
 gi|254949775|gb|ACT94475.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 380

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 85/356 (23%), Positives = 142/356 (39%), Gaps = 101/356 (28%)

Query: 62  RVATLDAFRGLTVVLMI----LVDDAGG---------AYARIDHSPWNGCTLADFVMPFF 108
           R+A++DA RG  ++++      +   GG           A+ +H  WNG T  DF+ P F
Sbjct: 20  RLASIDALRGFDMLMIAGGGQFIATLGGKTGISFIDAVAAQFEHPAWNGFTFYDFIFPLF 79

Query: 109 LFIVGVAIALAL-----KKVPKINGAVKKIIF-RTLKLLFWGIILQGGYSHAPDALSYGV 162
           LF+ G ++A ++     K +P     ++  +F R L L+  GI+ +    +AP  +    
Sbjct: 80  LFLAGTSLAFSVTGGLAKGIPP--SVIRNKVFKRMLILIALGILDK----NAPMDI---F 130

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
           D  HIR+  +L RI L   + A++                   F   Q  +IG  I  V 
Sbjct: 131 DPAHIRYGSVLGRIGLATFISAILYMK----------------FGTNQRLYIGVGI-LVA 173

Query: 223 YIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDP 282
           Y +T   + VP +   + +  G                  N VG++DR            
Sbjct: 174 YYLTLILVPVPGFGSGDLTFEG------------------NLVGWIDRNFM--------- 206

Query: 283 VWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI-HFK 341
                          G L++         ++   LL+   A+     G   G VL+   +
Sbjct: 207 --------------PGILKQGT-------YDELALLTQFPALCLTLFGTVAGDVLLRENR 245

Query: 342 GHS--ARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           G+S   +L  + S G     I+I + +  A PINK L+S S++  T+G A   L +
Sbjct: 246 GNSKIGKLLLFASTG-----ISIGILWNFAFPINKHLWSSSFIMLTSGMAFAMLAL 296


>gi|445497064|ref|ZP_21463919.1| hypothetical protein Jab_2c06620 [Janthinobacterium sp. HH01]
 gi|444787059|gb|ELX08607.1| hypothetical protein Jab_2c06620 [Janthinobacterium sp. HH01]
          Length = 381

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 62/115 (53%), Gaps = 12/115 (10%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAY---ARIDHSP--WNGCTLADFVMPFFLFI 111
             K  RV  +DAFRG+T+++MI V+   G     A ++H+P   +  T  D V P FLFI
Sbjct: 3   NSKPARVLAIDAFRGITILVMIFVNTLAGVRGMPAWMEHAPADADAMTFPDVVFPAFLFI 62

Query: 112 VGVAI--ALALKKVPKINGAV--KKIIFRTLKLLFWGIIL---QGGYSHAPDALS 159
           VG++I  A+A ++      A   + ++ R   LL  G+ +   +GGY+ A   +S
Sbjct: 63  VGMSIPFAMAQRQAAGDTPAARWRHVLARAAGLLVLGVFMVNAEGGYNEAAMGMS 117


>gi|149176468|ref|ZP_01855081.1| hypothetical protein PM8797T_29827 [Planctomyces maris DSM 8797]
 gi|148844581|gb|EDL58931.1| hypothetical protein PM8797T_29827 [Planctomyces maris DSM 8797]
          Length = 518

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 85/436 (19%), Positives = 162/436 (37%), Gaps = 100/436 (22%)

Query: 21  QDDGKDSEN-GINKEKGLERSEVQDEQKGELQLQQLLQQKSK-----RVATLDAFRGLTV 74
             D ++SE   + +  G+E S   +  K   +    L++K K     R+ +LDA+RG  +
Sbjct: 25  NSDSQNSETIPLQQTPGVEVSAEVEPAKAVTEKDVSLKEKKKPETNQRLVSLDAYRGFVM 84

Query: 75  VLMI---------------LVDDAGGAY-------------ARIDHSPWNGCTLADFVMP 106
           + M                ++D   G                ++ H  W G    D + P
Sbjct: 85  LAMASGGLAIASVVRNSPEVLDQYNGTQWESSWKTLWQTLSYQLSHVEWTGAGFWDLIQP 144

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKI----IFRTLKLLFWGIILQGGYSHAPDALSYGV 162
            F+F+VGV++  +++K  +   +  KI    IFR + L+  G+ L   +S         V
Sbjct: 145 SFMFMVGVSMPFSVRKRRQKGDSTFKIWMHAIFRAILLVALGVFLSSQFSPERGFTYEDV 204

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
              +  +  +L +I L Y+VV                     +  ++  Q IG     V+
Sbjct: 205 PQTNFTFANVLCQIGLGYLVV------------------FFYVNRSFATQMIG-----VV 241

Query: 223 YIITTYSLYVPNWSFSEHSDHGVKKYIVKC------------GMRGHLGPACNAVGYVDR 270
            I+  Y  +   +   E     VK Y+ +             G+        NA   VDR
Sbjct: 242 TILGGYWFFFYQYMPPEDELAAVKTYLKEVQHKDEAEWSQFSGIGSAWNKHTNAAAAVDR 301

Query: 271 ELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIG 330
           +L  +   Y +             P   P + D     +  ++    + +I+ +L    G
Sbjct: 302 QLLNMFPRYDN-------------PKDDPDQGDTFWVNKGGYQTLNFIPSIATML---FG 345

Query: 331 IHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFT-----------NAIPINKQLYSF 379
           +  G +LI  +    ++K  +  G     ++++L  +           + +PI K+++S 
Sbjct: 346 LMAGQLLISNRLEKMKVKWLLQAGLICFGVSMLLDTSIWPVNINNWEWHLVPIVKRIWSP 405

Query: 380 SYVCFTAGAADGCLGI 395
            +  F+AG A   L +
Sbjct: 406 GWAIFSAGWAFWFLAV 421


>gi|374384982|ref|ZP_09642493.1| hypothetical protein HMPREF9449_00879 [Odoribacter laneus YIT
           12061]
 gi|373227040|gb|EHP49361.1| hypothetical protein HMPREF9449_00879 [Odoribacter laneus YIT
           12061]
          Length = 382

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 45/169 (26%), Positives = 74/169 (43%), Gaps = 25/169 (14%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLADF 103
              +KR+ +LDA RG  +  ++ +     + AR              +H  W G +  D 
Sbjct: 8   HSAAKRLESLDALRGFDLFFLVALGPLMNSLARAADAEWFNNWMGIFNHVSWEGFSPWDL 67

Query: 104 VMPFFLFIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGG-YSHAPDALS 159
           +MP FLF+ G+++  AL   K +P     ++++  R L L  +G+I QG      PD   
Sbjct: 68  IMPLFLFMSGISMPFALARYKSMPDKRPLLRRLGKRILLLWIFGMICQGNLLGLNPD--- 124

Query: 160 YGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTA 208
                K   +   LQ IA  Y++ AL+   T +R   +     L I+ A
Sbjct: 125 -----KIYLYSNTLQAIAAGYLITALLFLFTRRRTQLITAVLLLLIYWA 168


>gi|94985055|ref|YP_604419.1| hypothetical protein Dgeo_0949 [Deinococcus geothermalis DSM 11300]
 gi|94555336|gb|ABF45250.1| hypothetical protein Dgeo_0949 [Deinococcus geothermalis DSM 11300]
          Length = 573

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 51/97 (52%), Gaps = 7/97 (7%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGA---YARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q   R+  LDA+RGLTV+LM+LV++         ++ H+P+ G TL D V P+FLF  G 
Sbjct: 214 QTRVRLTALDAWRGLTVLLMLLVNNVALGDLTPPQLQHAPFGGLTLTDLVFPWFLFCAGA 273

Query: 115 AIALALKKVPKIN----GAVKKIIFRTLKLLFWGIIL 147
           A+  +   + +        V++++ R   L   G  L
Sbjct: 274 ALPFSQAAMRRAGVTGWARVRRLVTRAALLYLVGAFL 310


>gi|325106033|ref|YP_004275687.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324974881|gb|ADY53865.1| hypothetical protein Pedsa_3330 [Pedobacter saltans DSM 12145]
          Length = 397

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 70/138 (50%), Gaps = 18/138 (13%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDA--GGAYARIDHSPWN--GCTLADFVMPFFLFIV 112
           ++ S R+ ++D  RGLT+ LM+ V+D    G    + H+  N  G  LAD+V P FLF+V
Sbjct: 3   EKLSVRILSIDIMRGLTLFLMLFVNDLYEPGVPKWLVHTKANVDGMGLADWVFPGFLFMV 62

Query: 113 GVAIALALK-KVPKINGAVK---KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           G++I  A+K +  K     K    I+ R L LLF GI++       P+       M  + 
Sbjct: 63  GLSIPYAVKARKAKGESGFKIFVHILLRALSLLFIGILMLNADRVNPEL----TGMNKLL 118

Query: 169 WCGILQRIALVYVVVALI 186
           W G      LVY+ V L+
Sbjct: 119 WAG------LVYISVFLV 130


>gi|430745463|ref|YP_007204592.1| hypothetical protein Sinac_4725 [Singulisphaera acidiphila DSM
           18658]
 gi|430017183|gb|AGA28897.1| hypothetical protein Sinac_4725 [Singulisphaera acidiphila DSM
           18658]
          Length = 391

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/156 (30%), Positives = 79/156 (50%), Gaps = 24/156 (15%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLADFV 104
           + S+R+ ++DA RG  ++ +I  +    A AR             ++H+ W+G  L D +
Sbjct: 24  KPSERLLSIDALRGFDMLWIIGGERLAKALARWSDSSAGKVVQEQLEHAEWHGFRLNDLI 83

Query: 105 MPFFLFIVGVAIALALKKVP-KINGA-VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGV 162
            P FLF+VG  +  +L K+  +  GA  ++I  RTL L   G++  G        L +  
Sbjct: 84  FPLFLFLVGTVLPFSLGKLQGQGRGAEYRRIARRTLLLFALGLLCNG-------VLKF-- 134

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVL 198
           D  ++R  G+LQRIAL Y + ALI    ++R   +L
Sbjct: 135 DWANLRVAGVLQRIALCYGIAALISLWFSRRGVAIL 170


>gi|15806610|ref|NP_295325.1| hypothetical protein DR_1602 [Deinococcus radiodurans R1]
 gi|6459373|gb|AAF11168.1|AE002004_7 hypothetical protein DR_1602 [Deinococcus radiodurans R1]
          Length = 388

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/135 (33%), Positives = 69/135 (51%), Gaps = 18/135 (13%)

Query: 62  RVATLDAFRGLTVVLMILVDDA--GGAYAR-IDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           R+  LDA+RGLTV+LM+LV++   G +  R + H+ + G TL D V P+FLF  G A+  
Sbjct: 33  RLTALDAWRGLTVLLMLLVNNVALGDSTPRQLSHAHFGGLTLTDLVFPWFLFCAGAALPF 92

Query: 119 ALKKVPK--ING--AVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           +   + K  + G    ++++ R   L   G  +    SH    L+ G+        G+LQ
Sbjct: 93  SAAAMNKAGVTGWPLYRRLLERAALLYLMGAFVTSVTSH---RLTLGL--------GVLQ 141

Query: 175 RIALVYVVVALIETL 189
            IAL     AL+  L
Sbjct: 142 LIALASFFAALLGQL 156


>gi|153805867|ref|ZP_01958535.1| hypothetical protein BACCAC_00106 [Bacteroides caccae ATCC 43185]
 gi|149130544|gb|EDM21750.1| hypothetical protein BACCAC_00106 [Bacteroides caccae ATCC 43185]
          Length = 377

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 63/148 (42%), Gaps = 25/148 (16%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLAD 102
           L   S R+A+LD  RG  + L++       A  +              DH  W G    D
Sbjct: 7   LSPVSGRLASLDILRGFDLFLLVFFQPVFVALGQRLNFPWLNDILYQFDHESWIGFRFWD 66

Query: 103 FVMPFFLFIVGVAIALA---LKKVPKINGAVKKIIFRTLKLLFWGIILQGG-YSHAPDAL 158
            VMP FLF+ G ++  +    K  P      +KII R + L  +G+I+QG      PD+L
Sbjct: 67  LVMPLFLFMTGASMPFSFSKFKNAPNKWHIYRKIIKRFVLLFIFGMIVQGNLLGLNPDSL 126

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALI 186
                     +   LQ IA  Y++ A+I
Sbjct: 127 Y--------LYSNTLQAIATGYLIAAII 146


>gi|423219681|ref|ZP_17206177.1| hypothetical protein HMPREF1061_02950 [Bacteroides caccae
           CL03T12C61]
 gi|392624886|gb|EIY18964.1| hypothetical protein HMPREF1061_02950 [Bacteroides caccae
           CL03T12C61]
          Length = 375

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 63/148 (42%), Gaps = 25/148 (16%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLAD 102
           L   S R+A+LD  RG  + L++       A  +              DH  W G    D
Sbjct: 5   LSPVSGRLASLDILRGFDLFLLVFFQPVFVALGQRLNFPWLNDILYQFDHESWIGFRFWD 64

Query: 103 FVMPFFLFIVGVAIALA---LKKVPKINGAVKKIIFRTLKLLFWGIILQGG-YSHAPDAL 158
            VMP FLF+ G ++  +    K  P      +KII R + L  +G+I+QG      PD+L
Sbjct: 65  LVMPLFLFMTGASMPFSFSKFKNAPNKWHIYRKIIKRFVLLFIFGMIVQGNLLGLNPDSL 124

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALI 186
                     +   LQ IA  Y++ A+I
Sbjct: 125 Y--------LYSNTLQAIATGYLIAAII 144


>gi|410099161|ref|ZP_11294133.1| hypothetical protein HMPREF1076_03311 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409219183|gb|EKN12146.1| hypothetical protein HMPREF1076_03311 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 377

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 42/155 (27%), Positives = 73/155 (47%), Gaps = 27/155 (17%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLADF 103
           +   KR+ +LDA RG  +  ++ +     + AR              +H  W G +  D 
Sbjct: 3   KTTYKRLESLDALRGFDLFFLVALGPLAHSLARAADVGWLNDCMWAFNHVQWEGFSPWDL 62

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK---LLFW--GIILQGGYSHAPDAL 158
           +MP FLF+ G ++  AL +   ++   KK +FR L    LL W  G++ QG      + L
Sbjct: 63  IMPLFLFMSGASMPFALSRFKGVSD--KKTLFRRLGKRILLLWIFGMMCQG------NLL 114

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
            +  D  ++ +   LQ IA  Y++ A++   T++R
Sbjct: 115 GFDPDRIYL-YSNTLQSIAAGYLITAVLFLYTSRR 148


>gi|375255119|ref|YP_005014286.1| hypothetical protein BFO_1396 [Tannerella forsythia ATCC 43037]
 gi|363406141|gb|AEW19827.1| putative membrane protein [Tannerella forsythia ATCC 43037]
          Length = 375

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 78/359 (21%), Positives = 131/359 (36%), Gaps = 90/359 (25%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILV------------DDAGGAYAR-IDHSPWNGCTLADF 103
               KR+ +LD  RG  +  ++++            + A    AR   H  W G    D 
Sbjct: 4   SSSEKRLVSLDLLRGFDLFCLLMLQPILMTWLEIADNPAWAPLARQFTHVEWRGVAFWDL 63

Query: 104 VMPFFLFIVGVAIALALKKVPKINGA------VKKIIFRTLKLLFWGIILQGGYSHAPDA 157
           +MP F+F+ G+ +  AL K  +  GA        K++ R + L F G I+QG      + 
Sbjct: 64  IMPLFMFMSGITVPFALSKYKR--GAKPGHSFYLKLLKRFVILFFLGWIVQG------NL 115

Query: 158 LSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGF 217
           L+   +  HI +   LQ IA+ YVV A      + R                  Q     
Sbjct: 116 LALDPNRFHI-FANTLQAIAVGYVVTAFCYVRFSFRV-----------------QLGATV 157

Query: 218 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINH 277
           + F+ Y++   ++   NW                        P  N    +DR       
Sbjct: 158 LFFIAYLLVFATVGGMNWE-----------------------PGTNIAEEIDR------- 187

Query: 278 LYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAP-FEPEGLLSTISAILSGTIGIHYGHV 336
                       C L     G + E   SW   P +    +LS+++ +++   G   GH+
Sbjct: 188 ------------CVLGRFRDGIITEADGSWKFDPAYHYTWILSSLNFVVTVMTGSFAGHI 235

Query: 337 LIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGI 395
           L   K    RL   +  G  L++ A+++      P+ K+++S S   F  G     +G+
Sbjct: 236 LRLRKTARQRLMRLLITGVSLVVAALLMD--PVFPLIKRIWSSSMTLFYGGVCFLLMGL 292


>gi|456890770|gb|EMG01561.1| hypothetical protein LEP1GSC123_2562 [Leptospira borgpetersenii
           str. 200701203]
          Length = 74

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 24/46 (52%), Positives = 31/46 (67%), Gaps = 3/46 (6%)

Query: 66  LDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFF 108
           +D FRG+TVV MILV++ G     Y+ + H+ WNGCT  D V PFF
Sbjct: 1   MDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFF 46


>gi|372268269|ref|ZP_09504317.1| hypothetical protein AlS89_10220 [Alteromonas sp. S89]
          Length = 365

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 39/145 (26%), Positives = 71/145 (48%), Gaps = 15/145 (10%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMI--------LVDDAGGA-----YARIDHSPWNGCTLAD 102
           + +K +R+A++DA RG  +  +I        L    G +     +A++ H+PW+G T  D
Sbjct: 1   MTKKKQRLASVDALRGFDMFWIIGGEALFLPLFALTGWSIFQFGHAQMQHTPWHGFTFYD 60

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDALSYG 161
            + P F+F+ GV + LA K +  +  + +  ++R   K L   ++L   Y+H        
Sbjct: 61  LIFPLFIFLSGVTLGLANKSLRGLPVSQRAPVYRKATKRLLLLVLLGILYNHGWGT-GIP 119

Query: 162 VDMKHIRWCGILQRIALVYVVVALI 186
            D+  IR+  +L RI   +   A+I
Sbjct: 120 ADLSEIRYASVLARIGFAWFFAAMI 144



 Score = 43.1 bits (100), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 41/75 (54%), Gaps = 3/75 (4%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKG-HSARLKHWVSMGFGLLIIAIILHFTNAI 370
           ++PEG+LSTI AI +   G+  G  L    G H A LK   + G   L+   + H  +  
Sbjct: 199 YDPEGVLSTIPAIANALFGVFAGRWLSKHAGDHKAILKGLFAAGVACLVAGYVWH--SVY 256

Query: 371 PINKQLYSFSYVCFT 385
           P+NK+L++ S+V  T
Sbjct: 257 PVNKELWTSSFVLIT 271


>gi|284036950|ref|YP_003386880.1| hypothetical protein Slin_2036 [Spirosoma linguale DSM 74]
 gi|283816243|gb|ADB38081.1| hypothetical protein Slin_2036 [Spirosoma linguale DSM 74]
          Length = 404

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 9/102 (8%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYAR---IDHSP--WNGCTLADFVMPFFLFIVGV 114
           S RV  +D  R +T++LMI V+D     A    ++H P   +G  LAD V P FLFIVG+
Sbjct: 18  SMRVDAIDILRAMTMILMIFVNDLWSLTAIPDWLEHVPHGVDGIGLADVVFPGFLFIVGM 77

Query: 115 AIALAL----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYS 152
           ++  A+    +K    +  V  II R++ LL  G+ L  G S
Sbjct: 78  SLPFAMNARRQKGDTNSALVSHIIMRSIALLVMGVFLVNGES 119


>gi|343082821|ref|YP_004772116.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342351355|gb|AEL23885.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
           marinum DSM 745]
          Length = 367

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 48/108 (44%), Gaps = 17/108 (15%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDD-------------AGGAYARIDHSPWNGCTLADFVM 105
           K  R+ ++DA RG  ++ +I  D               G    +  H  W G T  D +M
Sbjct: 7   KPNRILSIDALRGFDMLFIIFADRFFALLHKGGQTPFTGFLANQFSHPDWFGSTFYDIIM 66

Query: 106 PFFLFIVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIILQG 149
           P FLF+VG  I  +L K  + N       KK+  R L L F G I+QG
Sbjct: 67  PLFLFMVGAVIPFSLSKRMQENTGKAQIYKKLFKRVLILFFLGWIVQG 114


>gi|333379187|ref|ZP_08470911.1| hypothetical protein HMPREF9456_02506 [Dysgonomonas mossii DSM
           22836]
 gi|332885455|gb|EGK05704.1| hypothetical protein HMPREF9456_02506 [Dysgonomonas mossii DSM
           22836]
          Length = 395

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 56/98 (57%), Gaps = 9/98 (9%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDD---AGGAYARIDHSPWNGCTL--ADFVMPFFLFI 111
            +K KR+A++D +R LT+  MI V+D          ++H+  N   L  +D V P FLFI
Sbjct: 6   TEKPKRIASIDIYRALTMFFMIFVNDLWSVSNVPHWLEHAAANEDMLGFSDIVFPSFLFI 65

Query: 112 VGVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGI 145
           +G++I LA+    KK    +G +K II R++ LL  G+
Sbjct: 66  LGMSIPLAIEIRKKKGDSNSGILKHIIIRSIALLVMGL 103


>gi|392544017|ref|ZP_10291154.1| hypothetical protein PpisJ2_19642 [Pseudoalteromonas piscicida JCM
           20779]
          Length = 377

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 42/155 (27%), Positives = 68/155 (43%), Gaps = 23/155 (14%)

Query: 57  QQKSKRVATLDAFRGLTV-----------VLMILVDDAG--GAYARIDHSPWNGCTLADF 103
             K KR+A+LDA RG+ +            L +L    G     A   HSPW+G T  D 
Sbjct: 4   NNKPKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQGWKAFEAHTLHSPWHGFTFYDL 63

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLF-----WGIILQGGYSHAPDAL 158
           + P F+F+ GVA+ L+ K++  +    +K  +            +G++   G+       
Sbjct: 64  IFPLFIFLSGVAMGLSPKRIDHLPFNERKPFYLKALKRLLLLCAFGVLYNHGWGTGIP-- 121

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
              +D   IR+  +L RIA  +   AL+   T+ R
Sbjct: 122 ---MDPDGIRYASVLGRIAFAWFFCALLVWHTSLR 153



 Score = 42.4 bits (98), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 25/94 (26%), Positives = 50/94 (53%)

Query: 306 SWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILH 365
           S+   P +PEG+LS++ AI++   G+  G  + + +         +  G G+L++A+   
Sbjct: 204 SYQNRPVDPEGVLSSLPAIVNAIAGVFAGRAIANAQTQGEWKTVGILAGSGVLVLALGWL 263

Query: 366 FTNAIPINKQLYSFSYVCFTAGAADGCLGITNSI 399
           +    P+NK+L++ S+V  T G +   L +  +I
Sbjct: 264 WDMQFPVNKELWTSSFVLVTVGWSAILLAVFYAI 297


>gi|292609605|ref|XP_002660455.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Danio
           rerio]
          Length = 292

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 31/46 (67%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMP 106
           +R+ +LD FRGL++V+M+ V+  GG Y    H  WNG T+AD V P
Sbjct: 246 RRLRSLDTFRGLSLVIMVFVNYGGGRYWFFRHESWNGLTVADLVFP 291


>gi|134025078|gb|AAI35092.1| Unknown (protein for IMAGE:7224994) [Danio rerio]
          Length = 291

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 31/46 (67%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMP 106
           +R+ +LD FRGL++V+M+ V+  GG Y    H  WNG T+AD V P
Sbjct: 245 RRLRSLDTFRGLSLVIMVFVNYGGGRYWFFRHESWNGLTVADLVFP 290


>gi|319902718|ref|YP_004162446.1| hypothetical protein Bache_2925 [Bacteroides helcogenes P 36-108]
 gi|319417749|gb|ADV44860.1| hypothetical protein Bache_2925 [Bacteroides helcogenes P 36-108]
          Length = 380

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 69/158 (43%), Gaps = 26/158 (16%)

Query: 46  QKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDH 92
            K  LQ Q  L+  S R+A+LD  RG  + L++       +  +              DH
Sbjct: 1   MKNALQ-QDSLKISSSRLASLDVLRGFDLFLLVFFQPVLMSLGQQLNLPFMDVVLYQFDH 59

Query: 93  SPWNGCTLADFVMPFFLFIVGVAIALALKKV---PKINGAVKKIIFRTLKLLFWGIILQG 149
             W G    D +MP FLF+ GV++  +  K    P      +KI  R L L   G+++QG
Sbjct: 60  EVWEGFRFWDLIMPLFLFMTGVSMPFSFAKYQSSPDKCIIYRKIFRRVLLLFLLGMVVQG 119

Query: 150 GYSHAPDALSYGVDMKHIR-WCGILQRIALVYVVVALI 186
                      G++ KHI  +   LQ IA+ Y++  +I
Sbjct: 120 NL--------LGLNPKHIYFYTNTLQAIAVGYLIAGMI 149


>gi|340617673|ref|YP_004736126.1| hypothetical protein zobellia_1684 [Zobellia galactanivorans]
 gi|339732470|emb|CAZ95738.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
          Length = 346

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 3/69 (4%)

Query: 89  RIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK---VPKINGAVKKIIFRTLKLLFWGI 145
           ++ H PWNG    D + PFF+FIVGVA+  +L+K        G  K I+ R   L  +G 
Sbjct: 30  QLHHHPWNGLRFWDLIQPFFMFIVGVAMPFSLRKRLASGDKKGVTKHILRRCFLLFAFGA 89

Query: 146 ILQGGYSHA 154
           +L   YSHA
Sbjct: 90  LLHCVYSHA 98


>gi|409203840|ref|ZP_11232043.1| hypothetical protein PflaJ_21058 [Pseudoalteromonas flavipulchra
           JG1]
          Length = 377

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 41/155 (26%), Positives = 68/155 (43%), Gaps = 23/155 (14%)

Query: 57  QQKSKRVATLDAFRGLTV-----------VLMILVDDAG--GAYARIDHSPWNGCTLADF 103
             K KR+A+LDA RG+ +            L +L    G     A   HSPW+G T  D 
Sbjct: 4   NNKPKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQGWKAFEAHTLHSPWHGFTFYDL 63

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLF-----WGIILQGGYSHAPDAL 158
           + P F+F+ GVA+ L+ K++  +    +K  +            +G++   G+       
Sbjct: 64  IFPLFIFLSGVAMGLSPKRIDHLPFNERKSFYLKALKRLLLLCAFGVLYNHGWGTGIP-- 121

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
              +D   +R+  +L RIA  +   AL+   T+ R
Sbjct: 122 ---MDPDGVRYASVLGRIAFAWFFCALLVWHTSLR 153



 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 25/94 (26%), Positives = 49/94 (52%)

Query: 306 SWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILH 365
           S+   P +PEG+LS++ AI++   G+  G  + + +         +  G G+L +A+   
Sbjct: 204 SYQNRPVDPEGVLSSLPAIVNAIAGVFAGRAIANAQTQGEWKIVGILAGSGVLALALGWL 263

Query: 366 FTNAIPINKQLYSFSYVCFTAGAADGCLGITNSI 399
           +    P+NK+L++ S+V  T G +   L +  +I
Sbjct: 264 WDMQFPVNKELWTSSFVLVTVGWSAILLAVFYAI 297


>gi|456985619|gb|EMG21386.1| putative membrane protein [Leptospira interrogans serovar
           Copenhageni str. LT2050]
          Length = 296

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 37/138 (26%), Positives = 65/138 (47%), Gaps = 28/138 (20%)

Query: 257 HLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEG 316
           +L P  +   ++DR ++G NHL+                      + + +W     +PEG
Sbjct: 67  YLEPGKDIGAWIDRNVFGENHLW----------------------KFSKTW-----DPEG 99

Query: 317 LLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQL 376
             S IS+I +  +G+  G +L   K +  + +     GFG L + + L +   +P+NK L
Sbjct: 100 FFSGISSITTSLLGVFCGSILSS-KTNETKKQILSIFGFGTLFVLVGLLWNQNLPMNKSL 158

Query: 377 YSFSYVCFTAGAADGCLG 394
           ++ SYV +TAG A   +G
Sbjct: 159 WTGSYVIYTAGLAFLSIG 176


>gi|320105641|ref|YP_004181231.1| hypothetical protein AciPR4_0402 [Terriglobus saanensis SP1PR4]
 gi|319924162|gb|ADV81237.1| hypothetical protein AciPR4_0402 [Terriglobus saanensis SP1PR4]
          Length = 406

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 43/115 (37%), Positives = 58/115 (50%), Gaps = 18/115 (15%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPW---------NGCTLADFVMPFFLFI 111
           +R+ +LD FRGL + LMI V++     A I   PW         +  T  D V P FLFI
Sbjct: 13  QRILSLDIFRGLNIALMIFVNEL----AEIKGLPWWTYHAPGKVDVMTYVDMVFPGFLFI 68

Query: 112 VGVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGV 162
           +G+AI LAL    +K       +  I  R+  LL  GIIL+ G    P AL +G+
Sbjct: 69  LGMAIPLALNARIRKGDSPATLLGYIALRSAALLVLGIILENGGRGDP-ALMHGL 122


>gi|345517559|ref|ZP_08797028.1| hypothetical protein BSFG_03809 [Bacteroides sp. 4_3_47FAA]
 gi|254837353|gb|EET17662.1| hypothetical protein BSFG_03809 [Bacteroides sp. 4_3_47FAA]
          Length = 359

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 44/153 (28%), Positives = 68/153 (44%), Gaps = 25/153 (16%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLADFV 104
           + S R+ +LD  RGL + L++       ++ +              +H  W G    D V
Sbjct: 5   KTSSRLDSLDMLRGLDLFLLVFFQPVLMSFGQQTDFPWMTSILYQFEHEVWVGFRFWDLV 64

Query: 105 MPFFLFIVGVAIALALKKVPKI---NGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG 161
           MP FLF+ GV++  +  K   I   N   +KI  R L L   G+++QG           G
Sbjct: 65  MPLFLFMTGVSMPFSFAKYRDISDRNAVYRKITRRFLLLFLLGMVVQGNL--------LG 116

Query: 162 VDMKHIR-WCGILQRIALVYVVVALIETLTTKR 193
           +D +HI  +   LQ IA  Y++ AL+    T R
Sbjct: 117 LDWEHIYLYNNTLQAIAAGYLIAALLLLHCTVR 149


>gi|284041428|ref|YP_003391358.1| heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
           DSM 74]
 gi|283820721|gb|ADB42559.1| Heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
           DSM 74]
          Length = 381

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 123/344 (35%), Gaps = 91/344 (26%)

Query: 61  KRVATLDAFRGLTV--------VLMILVDDAGGAYA-----RIDHSPWNGCTLADFVMPF 107
           KR+ +LD  RG  +        +  +L    G A A     +  H  WNG    D + P 
Sbjct: 19  KRLLSLDTLRGFDMFWIMGGEEIFHVLAKTTGWAGAILLADQFSHPAWNGFRAYDLIFPL 78

Query: 108 FLFIVGVAIALA----LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
           F+F+ GV+   +    L +        +KII R L L+  GII   G         +   
Sbjct: 79  FMFMAGVSTPFSVGSRLDQGTDKAKIARKIISRGLILVVLGIIYNNGL--------FNRV 130

Query: 164 MKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIY 223
            + +R+  +L RI L  +   LI             PR        Q+ W  G +  + Y
Sbjct: 131 FEDMRFPSVLGRIGLAGMFAQLIYL--------YFRPRA-------QYIWFVGLL--LGY 173

Query: 224 IITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPV 283
                 + VP                  CG  G L   CN   ++DR L    HLY    
Sbjct: 174 WALMMLVPVPG-----------------CG-AGVLTMECNLASFIDRMLVP-GHLY---- 210

Query: 284 WSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH 343
                                    +   +PEGL ST+ AI +  +GI  G  L    G 
Sbjct: 211 -------------------------KTIHDPEGLFSTLPAIDNTLLGIFAGTFL-RTHGR 244

Query: 344 SARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
           +   K  + +G G   + +   +    PINK L++ S+V  T G
Sbjct: 245 TGNQKTALLLGAGAAFVLLGWLWDFVFPINKNLWTSSFVLVTGG 288


>gi|260911058|ref|ZP_05917694.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
           F0295]
 gi|260634862|gb|EEX52916.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
           F0295]
          Length = 409

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 34/112 (30%), Positives = 57/112 (50%), Gaps = 12/112 (10%)

Query: 87  YARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKIN-----GAVKKIIFRTLKLL 141
           Y +I H PW G    D +MP F+F+ G+ I  ++ K  +       G + +++ R + L 
Sbjct: 81  YGQITHVPWQGFCFWDIIMPLFMFMSGITIPFSMAKYQRGESKAGVGFLLRLLKRFVVLW 140

Query: 142 FWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
             G+++QG      + L+      H+ +   LQ IA+ YVVVAL+   T+ R
Sbjct: 141 VLGMVVQG------NLLALDARQLHL-YSNTLQSIAVGYVVVALLFVYTSWR 185


>gi|300123408|emb|CBK24681.2| unnamed protein product [Blastocystis hominis]
          Length = 213

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 44/81 (54%), Gaps = 2/81 (2%)

Query: 30  GINKEKGLERSEVQDEQKGELQLQQLLQQKSK-RVATLDAFRGLTVVLMILVDDAGGAYA 88
           G N+E     +   +E   + + Q ++ Q  K RV ++D FRG+T+ +MI  +   G Y+
Sbjct: 131 GRNEEASKTEASSVNEPLIQKEKQSVVSQPMKSRVQSIDVFRGITICIMIFANYGAGQYS 190

Query: 89  R-IDHSPWNGCTLADFVMPFF 108
             + H+ W+G T ADF  P +
Sbjct: 191 HSLMHAAWDGITFADFAFPLY 211


>gi|313145390|ref|ZP_07807583.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|423279948|ref|ZP_17258861.1| hypothetical protein HMPREF1203_03078 [Bacteroides fragilis HMW
           610]
 gi|424661980|ref|ZP_18099017.1| hypothetical protein HMPREF1205_02366 [Bacteroides fragilis HMW
           616]
 gi|313134157|gb|EFR51517.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|404578291|gb|EKA83026.1| hypothetical protein HMPREF1205_02366 [Bacteroides fragilis HMW
           616]
 gi|404584284|gb|EKA88949.1| hypothetical protein HMPREF1203_03078 [Bacteroides fragilis HMW
           610]
          Length = 377

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 50/169 (29%), Positives = 77/169 (45%), Gaps = 25/169 (14%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSP-------------WNGCTLADF 103
           Q   +R+ +LDA RGL +  ++ +        R   SP             W G +  D 
Sbjct: 3   QPIRQRLESLDALRGLDLFFLVALGPLLRTLVRAIDSPHLDGVNWCLRHVDWIGFSPWDL 62

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK--LLFW--GIILQGGYSHAPDALS 159
           +MP FLF+ G++I  AL +  K      K+I+R  K  LL W  G++ QG      + LS
Sbjct: 63  IMPLFLFMSGISIPFALSRF-KGEADKSKLIYRLCKRVLLLWIFGMMCQG------NLLS 115

Query: 160 YGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTA 208
           +  D  ++ +   LQ IA  Y+  AL+   T ++   VL    L I+ A
Sbjct: 116 FDPDHLYL-YTNTLQSIATGYIAAALLFLYTGRKTQIVLCVALLLIYWA 163


>gi|157960931|ref|YP_001500965.1| hypothetical protein Spea_1103 [Shewanella pealeana ATCC 700345]
 gi|157845931|gb|ABV86430.1| conserved hypothetical protein [Shewanella pealeana ATCC 700345]
          Length = 394

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/151 (30%), Positives = 76/151 (50%), Gaps = 25/151 (16%)

Query: 62  RVATLDAFRGLTV-----------VLMILVDDAGGAY--ARIDHSPWNGCTLADFVMPFF 108
           R+ +LDA RG  +            L++L   AG  +  +++ HS W+G T  D + P F
Sbjct: 28  RLKSLDALRGFDMFWILGGEAIFAALLLLTGWAGFNWFDSQMHHSTWHGFTFYDLIFPLF 87

Query: 109 LFIVGVAIALALKKVPKINGAVKKIIF-----RTLKLLFWGIILQGGY-SHAPDALSYGV 162
           +F+ GVA+ L+ K++ K+    +  ++     R L LL +G+I   G+ + AP AL    
Sbjct: 88  IFLSGVALGLSPKRLDKLPLPQRMPLYQHAIKRLLLLLLFGVIYNHGWGTGAPFALG--- 144

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTTKR 193
               IR+  +L RIA  +   AL+   T+ R
Sbjct: 145 ---DIRYASVLGRIAFAWFFCALLVWHTSLR 172


>gi|288800484|ref|ZP_06405942.1| conserved hypothetical protein [Prevotella sp. oral taxon 299 str.
           F0039]
 gi|288332697|gb|EFC71177.1| conserved hypothetical protein [Prevotella sp. oral taxon 299 str.
           F0039]
          Length = 409

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/155 (27%), Positives = 70/155 (45%), Gaps = 28/155 (18%)

Query: 60  SKRVATLDAFRGLTVVLMILVDD----------------AGGAYARIDHSPWNGCTLADF 103
           S R+ +LD  RG  + L++L                       + +I H PW G    D 
Sbjct: 38  SGRLLSLDLLRGADLALLVLFQPIIYQWVEASEPTPGSFGEMVFGQITHVPWEGFCFWDI 97

Query: 104 VMPFFLFIVGVAIALAL-----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDAL 158
           +MP F+F+ G+ I  ++      KV    G + +++ R + L   G+I QG      + L
Sbjct: 98  IMPLFMFMSGITIPFSMGKYQQGKVKADKGFLWRLLKRFVVLWVLGMIAQG------NLL 151

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
            +   + H+ +   LQ IA+ YV+VAL+   T+ R
Sbjct: 152 LFDPRLIHL-YSNTLQSIAVGYVMVALLFVYTSWR 185


>gi|392965639|ref|ZP_10331058.1| hypothetical protein BN8_02168 [Fibrisoma limi BUZ 3]
 gi|387844703|emb|CCH53104.1| hypothetical protein BN8_02168 [Fibrisoma limi BUZ 3]
          Length = 411

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPW--NGCTLADFVMPFFLFI 111
           Q   KRV ++D FR LT++ MI V+D     G    ++HSP   +   LAD V P FLFI
Sbjct: 15  QLPEKRVHSIDIFRALTMLFMIFVNDLWTLIGIPDWLEHSPADVDFLGLADVVFPCFLFI 74

Query: 112 VGVAIALA----LKKVPKINGAVKKIIFRTLKLLFWGI 145
           VG++I  A    L K       ++ I+ R++ LL  G+
Sbjct: 75  VGMSIPFAIQGRLAKGDSYGLIIRHIVVRSVALLIMGV 112


>gi|430744438|ref|YP_007203567.1| hypothetical protein Sinac_3623 [Singulisphaera acidiphila DSM
           18658]
 gi|430016158|gb|AGA27872.1| hypothetical protein Sinac_3623 [Singulisphaera acidiphila DSM
           18658]
          Length = 454

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 73/156 (46%), Gaps = 16/156 (10%)

Query: 4   LRIVEEGLG----RTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQK 59
           +R   EG      R     +E   G+ S     + +G ++ +      GE         K
Sbjct: 1   MRPAPEGFSSGEHRAMWCSRELRSGRHSSKKRRERQGFKQRQGVRVSNGEAA-----GTK 55

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYA---RIDHSPWNGCTLADFVMPFFLFIVGVAI 116
           + R+ ++D FRG TV  M +V+  GG  A    + H+  N  + AD +MP FLF  G + 
Sbjct: 56  TGRIVSMDQFRGYTVAGMCVVNFLGGLQAIHPVLKHNN-NYFSYADTIMPSFLFACGFSY 114

Query: 117 AL-ALKKVPKINGAV--KKIIFRTLKLLFWGIILQG 149
            L ALK++ +   A   ++ ++R+L L+   +++ G
Sbjct: 115 RLTALKRLDQFGPAAMYRRFVWRSLGLVLLSLMMYG 150


>gi|392308231|ref|ZP_10270765.1| hypothetical protein PcitN1_06167 [Pseudoalteromonas citrea NCIMB
           1889]
          Length = 375

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 71/151 (47%), Gaps = 15/151 (9%)

Query: 57  QQKSKRVATLDAFRGLTV-----------VLMILVDDAG--GAYARIDHSPWNGCTLADF 103
           +   KR+A+LDA RG+ +            L +L    G     A   HS W+G T  D 
Sbjct: 4   KTDKKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQGWKAFEAHTVHSAWHGFTFYDL 63

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIF-RTLKLLFWGIILQGGYSHAPDALSYGV 162
           + P F+F+ GVA+ L+ K++  +  + ++  + + LK LF    L   Y+H        V
Sbjct: 64  IFPLFIFLSGVAMGLSPKRIDHLPFSERRGYYNKALKRLFLLSALGVLYNHGWGT-GIPV 122

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTTKR 193
            +  IR+  +L RIA+ +    L+   T+ R
Sbjct: 123 ALGEIRYASVLGRIAIAWFFCMLLVWHTSLR 153


>gi|431799248|ref|YP_007226152.1| hypothetical protein Echvi_3932 [Echinicola vietnamensis DSM 17526]
 gi|430790013|gb|AGA80142.1| Protein of unknown function (DUF1624) [Echinicola vietnamensis DSM
           17526]
          Length = 412

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 57/112 (50%), Gaps = 17/112 (15%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNG--------CTLADFVMPFFLF 110
           ++KR   +D FR +T++LMI V+D    +    +  W G           +D + P FLF
Sbjct: 21  EAKRSYAIDVFRAVTMLLMIFVND---LWTLEGYPDWLGHAAVGEDRLGFSDVIFPAFLF 77

Query: 111 IVGVAIALAL-----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDA 157
           IVG++I  AL     K++PKI  A + II R L LL  GI      S+A  A
Sbjct: 78  IVGLSIPFALQNRFRKRIPKIKLA-EHIILRGLALLVMGIFHVNLESYAAQA 128


>gi|345881756|ref|ZP_08833266.1| hypothetical protein HMPREF9431_01930 [Prevotella oulorum F0390]
 gi|343918415|gb|EGV29178.1| hypothetical protein HMPREF9431_01930 [Prevotella oulorum F0390]
          Length = 380

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/161 (26%), Positives = 75/161 (46%), Gaps = 27/161 (16%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDD-----------AGGA-----YARIDHSPWNGCTL 100
             + +R+ +LD  RG  + +++L+             A G        ++ H PW G   
Sbjct: 7   SSQPQRLLSLDILRGADLAMLVLIQPILFRALKTAHPAEGTIGHFIMGQLSHLPWEGFCF 66

Query: 101 ADFVMPFFLFIVGVAIALAL---KKVPKINGAVK-KIIFRTLKLLFWGIILQGGYSHAPD 156
            D +MP F+F+ G+ I  A+   K+  +I+G    +II R + L   G+++QG      +
Sbjct: 67  WDIIMPLFMFMSGITIPFAMSRYKRGARIDGQFYWRIIKRFVVLWVLGMVVQG------N 120

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNV 197
            L++ +   H+ +   LQ IA+ YV VA +    + R   V
Sbjct: 121 LLAFDLRQLHL-FSNTLQSIAVGYVAVAFLFVFCSLRTQIV 160


>gi|380025576|ref|XP_003696546.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Apis florea]
          Length = 298

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 63/237 (26%), Positives = 106/237 (44%), Gaps = 41/237 (17%)

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHL-SIFTAY-QWQWIGGFIAF 220
            +  +R+ G+LQ + + Y V A++ET+  K             I  ++ QW  + G    
Sbjct: 9   SLHDLRFPGVLQLLGVSYFVCAILETIFMKPHSQFGRFAMFRDILESWPQWLIMAG---- 64

Query: 221 VIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCG---MRG-HLGPACNAVGYVDRELWGIN 276
              I+TT++L     +F     +  K Y    G    RG ++     A GY+DR ++G N
Sbjct: 65  ---IVTTHTLI----TFLLPISNCPKGYFGPGGEYHFRGKYMNCTAGAAGYIDRLIFG-N 116

Query: 277 HLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHV 336
           H Y+          T +      LR D          PEGL++TISAI    +G+H G +
Sbjct: 117 HTYNH---------TENFLYGQILRYD----------PEGLMNTISAIFIVYLGVHAGKI 157

Query: 337 LIHFKGHSARLKHWVSMG-FGLLIIAIILHFTNA---IPINKQLYSFSYVCFTAGAA 389
           L+ +   ++R+  W     F  ++  I+ +F      IPI+K++ + SYV   +  A
Sbjct: 158 LLLYYQCNSRVIRWFLWTIFTGIVAGILCNFETQGGIIPISKRMMTLSYVLICSSFA 214


>gi|291514403|emb|CBK63613.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
          Length = 376

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 83/348 (23%), Positives = 135/348 (38%), Gaps = 93/348 (26%)

Query: 60  SKRVATLDAFRGLTVVLMI--------LVDDAGGAY-----ARIDHSPWNGCTLADFVMP 106
           S+R+ +LDA RG  ++ ++        L     G +     A++ H+ WNG    D + P
Sbjct: 11  SQRLLSLDALRGFDMLFIMGFAGLVTALCKLCPGEFSDWMTAQMGHADWNGFFHHDTIFP 70

Query: 107 FFLFIVGVAIALALKKVPKINGAVK----KIIFRTLKLLFWGIILQGGYSHAPDALSYGV 162
            FLFI G++   +L K  +   + +    K+I R L L+  G +  G +          +
Sbjct: 71  LFLFIAGISFPFSLAKQREKGMSERSIYLKVIRRGLTLVALGFVYSGLFK---------L 121

Query: 163 DMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVI 222
           D   +R   +L RI L ++  AL+      R   V+                        
Sbjct: 122 DFATLRLPSVLGRIGLAWMFAALLFVNFNVRTRAVIAAA--------------------- 160

Query: 223 YIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDP 282
            I+  Y L +    F    D          G  G L    N VGYVDR +   +HL    
Sbjct: 161 -ILLGYGLLL---QFVAAPD---------AGGAGPLTLEGNIVGYVDRIVMP-SHLLGG- 205

Query: 283 VWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKG 342
                                        F+PEGLLST+ AI++  +G+  G  +   + 
Sbjct: 206 ---------------------------RGFDPEGLLSTLPAIVTAMLGMFTGEFVRRSEE 238

Query: 343 HS--ARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGA 388
            +  +R   W++ G   L+   +       PINK+L++ ++V F AGA
Sbjct: 239 RTSGSRKTLWMAAGAVALLALALCLDPLQ-PINKKLWTPAFV-FAAGA 284


>gi|406834451|ref|ZP_11094045.1| hypothetical protein SpalD1_22506 [Schlesneria paludicola DSM
           18645]
          Length = 358

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 16/105 (15%)

Query: 88  ARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK-------VPKINGAVKKIIFRTLKL 140
            +++H  W+G    D + P FLF+VGV +  +L K       VP  +G   +II RTL L
Sbjct: 25  TQLEHVKWDGFHFYDLIFPLFLFLVGVVLPFSLTKYQTAGELVPNRSGVYARIIRRTLLL 84

Query: 141 LFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVAL 185
           +  G+I  G            +D  + RW G+LQRI + Y   AL
Sbjct: 85  IALGLIGNGILQ---------LDFTNFRWPGVLQRIGICYFFAAL 120


>gi|269978070|ref|ZP_06185020.1| putative membrane protein [Mobiluncus mulieris 28-1]
 gi|269933579|gb|EEZ90163.1| putative membrane protein [Mobiluncus mulieris 28-1]
          Length = 442

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 89/392 (22%), Positives = 135/392 (34%), Gaps = 98/392 (25%)

Query: 14  TQLVEQEQDDGKDSENGINKEKGLERSEVQDE---------QKGELQLQQLLQQKSKRVA 64
           T +      D  +      +E G   +  Q E           GE    +    K  R+ 
Sbjct: 28  TSVSRGAISDSSEVSGMAPRETGNPSAATQSEAATTRQTEPNTGETNQTETKPAKPGRIT 87

Query: 65  TLDAFRGLTVVLMILVDDAGGAYARID---HSPWNGCTLADFVMPFFLFIVGVAIALALK 121
           +LD  RG    L++ V  A     R D   H+PW G    D + P F+ + G+ +A A  
Sbjct: 88  SLDVGRGW--FLIMSVTSAAWLLPRPDWLIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH 145

Query: 122 KVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYV 181
                   +++I+   +  L +  +  G +           D    R+ G LQ  A   V
Sbjct: 146 NRVSFKVTLRRIVVLVVVGLLYNGVSSGQW-----------DPATFRFTGPLQVYA---V 191

Query: 182 VVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHS 241
           +VA+I T             HL    A  W    G  A V  + T               
Sbjct: 192 IVAIIATC------------HL---FARNWMAWAGITAGVAVLQT--------------- 221

Query: 242 DHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLR 301
             G+  +       G L P+CN  G  DR L G +  Y                  G L 
Sbjct: 222 --GLLTWWAGTCPSGVLSPSCNPSGMWDRALLGAHMYY-----------------GGFLG 262

Query: 302 EDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIA 361
            D          PEGL++   A+L+   G   GH+ +     S+R   W +    LL +A
Sbjct: 263 HD----------PEGLVAITGALLTAAAGTTAGHLAL-----SSRRLGWKTGPVKLLALA 307

Query: 362 IILH-----FTNAIPINKQLYSFSYVCFTAGA 388
             +       T  +P  K+L++ S+    AGA
Sbjct: 308 AAMSVFGLILTIWVPAFKRLWTPSF-SLIAGA 338


>gi|305665830|ref|YP_003862117.1| hypothetical protein FB2170_06080 [Maribacter sp. HTCC2170]
 gi|88710601|gb|EAR02833.1| hypothetical protein FB2170_06080 [Maribacter sp. HTCC2170]
          Length = 346

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 39/73 (53%), Gaps = 3/73 (4%)

Query: 85  GAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK---VPKINGAVKKIIFRTLKLL 141
           G   ++ H PWNG    D + PFF+FIVGVA+  +L+K         A + I+ R   L 
Sbjct: 26  GLAHQLHHHPWNGLRFWDLIQPFFMFIVGVAMPFSLRKRLASGSRKSATRHILKRCFLLF 85

Query: 142 FWGIILQGGYSHA 154
            +G +L   YSHA
Sbjct: 86  AFGALLHCVYSHA 98


>gi|390958852|ref|YP_006422609.1| hypothetical protein Terro_3042 [Terriglobus roseus DSM 18391]
 gi|390413770|gb|AFL89274.1| Protein of unknown function (DUF1624) [Terriglobus roseus DSM
           18391]
          Length = 406

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 55/108 (50%), Gaps = 23/108 (21%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPW---------NGCTLADFVMPFFL 109
           K +R+ +LD FRGL + LMI V++       I   PW         N  T  D V P FL
Sbjct: 10  KPQRIQSLDIFRGLNIALMIFVNE----LHEIKGLPWWTYHAPGAANVMTYVDMVFPAFL 65

Query: 110 FIVGVAIALALK-------KVPKINGAVKKIIFRTLKLLFWGIILQGG 150
            IVG+++ LAL+       + P++   +  ++ R++ L+  G+ILQ  
Sbjct: 66  VIVGMSLPLALQARIRRGDETPQL---IWYVVLRSVALIVLGLILQNA 110


>gi|307700906|ref|ZP_07637931.1| putative membrane protein [Mobiluncus mulieris FB024-16]
 gi|307613901|gb|EFN93145.1| putative membrane protein [Mobiluncus mulieris FB024-16]
          Length = 442

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 88/384 (22%), Positives = 133/384 (34%), Gaps = 98/384 (25%)

Query: 22  DDGKDSENGINKEKGLERSEVQDE---------QKGELQLQQLLQQKSKRVATLDAFRGL 72
            D  +      +E G   +  Q E           GE    +    K  R+ +LD  RG 
Sbjct: 36  SDSSEVSGMAPRETGNPSAATQSEAATTRQTEPNTGETNQAETKPAKPGRITSLDVGRGW 95

Query: 73  TVVLMILVDDAGGAYARID---HSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGA 129
              L++ V  A     R D   H+PW G    D + P F+ + G+ +A A          
Sbjct: 96  --FLIMSVTSAAWLLPRPDWLIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYHNRVSFKVT 153

Query: 130 VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL 189
           +++I+   +  L +  +  G +           D    R+ G LQ  A   V+VA+I T 
Sbjct: 154 LRRIVVLVVVGLLYNGVSSGQW-----------DPATFRFTGPLQVYA---VIVAIIATC 199

Query: 190 TTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYI 249
                       HL    A  W    G  A V  + T                 G+  + 
Sbjct: 200 ------------HL---FARNWMAWAGITAGVAVLQT-----------------GLLTWW 227

Query: 250 VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCR 309
                 G L P+CN  G  DR L G +  Y                  G L  D      
Sbjct: 228 AGTCPSGVLSPSCNPSGMWDRALLGAHMYY-----------------GGFLGHD------ 264

Query: 310 APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILH---- 365
               PEGL++   A+L+   G   GH+ +     S+R   W +    LL +A  +     
Sbjct: 265 ----PEGLVAITGALLTAAAGTTAGHLAL-----SSRRLGWKTGPVKLLALAAAMSVFGL 315

Query: 366 -FTNAIPINKQLYSFSYVCFTAGA 388
             T  +P  K+L++ S+    AGA
Sbjct: 316 ILTIWVPAFKRLWTPSF-SLIAGA 338


>gi|212557932|gb|ACJ30386.1| Conserved hypothetical protein [Shewanella piezotolerans WP3]
          Length = 387

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 76/156 (48%), Gaps = 25/156 (16%)

Query: 57  QQKSKRVATLDAFRGLTV----------VLMILVDDAGGAY---ARIDHSPWNGCTLADF 103
           ++   R+ +LDA RG  +            +I++   GG +    ++ HS W+G T  D 
Sbjct: 16  KKNKVRLKSLDALRGFDMFWILGGEAIFAALIVLTGWGGLHWLDKQMHHSAWHGFTFYDL 75

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIF-----RTLKLLFWGIILQGGY-SHAPDA 157
           + P F+F+ GVA+ L+ K++ K+    +  ++     R L LL  G+I   G+ + AP A
Sbjct: 76  IFPLFIFLSGVALGLSPKRLDKLPMVQRMPLYQHAVKRLLLLLLLGVIYNHGWGTGAPMA 135

Query: 158 LSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
           L        IR+  +L RIA  +   AL+   T+ R
Sbjct: 136 LG------DIRYASVLGRIAFAWFFCALLVWHTSLR 165



 Score = 43.9 bits (102), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 49/81 (60%), Gaps = 4/81 (4%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLI--HFKGHSARLKHWVSMGFGLLIIAIILHF 366
            A  +PEG+LSTI A+++G +G+  GH ++  H KG   +L   V    GL ++A+    
Sbjct: 217 NAAVDPEGILSTIPAVVNGLLGVFVGHFIVKTHAKGEWYKLA--VMCLLGLGLLALGWLI 274

Query: 367 TNAIPINKQLYSFSYVCFTAG 387
           +  IP+NK L++ S+V  T+G
Sbjct: 275 SPIIPVNKTLWTSSFVLVTSG 295


>gi|256424049|ref|YP_003124702.1| hypothetical protein Cpin_5069 [Chitinophaga pinensis DSM 2588]
 gi|256038957|gb|ACU62501.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
          Length = 390

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 44/72 (61%), Gaps = 5/72 (6%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR---IDH--SPWNGCTLADFVMPFFLF 110
           + + S+R+ ++DAFR LT++ MI V+D  G       IDH  +  +G   AD V P FLF
Sbjct: 1   MLKPSQRLLSIDAFRALTMLTMIFVNDVSGVKNIPEWIDHVKAQDDGMGFADTVFPAFLF 60

Query: 111 IVGVAIALALKK 122
           IVG++I  A+ K
Sbjct: 61  IVGLSIPFAIGK 72


>gi|375148919|ref|YP_005011360.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361062965|gb|AEW01957.1| Protein of unknown function DUF2261, transmembrane [Niastella
           koreensis GR20-10]
          Length = 397

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/66 (43%), Positives = 43/66 (65%), Gaps = 5/66 (7%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWN--GCTLADFVMPFFLFIVGV 114
           ++R+A++D FR LT++LMI V+D G        ++H+  N  G  LAD V P FLFIVG+
Sbjct: 5   TQRLASIDVFRALTMLLMIFVNDLGTLKNIPLWLEHTKANEDGMGLADTVFPAFLFIVGL 64

Query: 115 AIALAL 120
           +I  A+
Sbjct: 65  SIPFAI 70


>gi|320450186|ref|YP_004202282.1| hypothetical protein TSC_c11130 [Thermus scotoductus SA-01]
 gi|320150355|gb|ADW21733.1| putative membrane protein [Thermus scotoductus SA-01]
          Length = 334

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/59 (47%), Positives = 37/59 (62%), Gaps = 2/59 (3%)

Query: 60  SKRVATLDAFRGLTVVLMILVDD-AGGAYARIDHSPWNGCT-LADFVMPFFLFIVGVAI 116
           S R   LDAFRGLTV LM+ V++   GA   ++H P+ G   LAD V P++L  +G AI
Sbjct: 4   SARSLALDAFRGLTVALMLFVNNLPPGAPPYLEHGPFGGSVYLADLVFPWYLLAMGAAI 62


>gi|326799399|ref|YP_004317218.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326550163|gb|ADZ78548.1| hypothetical protein Sph21_1988 [Sphingobacterium sp. 21]
          Length = 398

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 57/97 (58%), Gaps = 8/97 (8%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDA--GGAYARIDHSPWN--GCTLADFVMPFFLFIV 112
           +  S+R+ ++D  RGLT++LM+ V+D    G  A + H+  +  G  LAD+V P FLFIV
Sbjct: 5   KVASERILSVDIMRGLTLLLMLFVNDLFEPGVPAWLLHTKVDVDGMGLADWVFPGFLFIV 64

Query: 113 GVAIALALKKVPKINGAVKKII----FRTLKLLFWGI 145
           GV++  A++       + ++II     RTL LL  G+
Sbjct: 65  GVSVPYAIRSRLNKGESKRQIIGHIAVRTLSLLIIGV 101


>gi|374372786|ref|ZP_09630447.1| hypothetical protein NiasoDRAFT_3432 [Niabella soli DSM 19437]
 gi|373234862|gb|EHP54654.1| hypothetical protein NiasoDRAFT_3432 [Niabella soli DSM 19437]
          Length = 357

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 22/102 (21%)

Query: 65  TLDAFRGLTVVLMILVDDAGGAYARIDHS----------------PWNGCTLADFVMPFF 108
           +LD  RGL +VL+ L  ++ G Y  + H+                PW+G    D + P F
Sbjct: 3   SLDFMRGLIMVLLAL--ESTGLYEHLSHASAGTWFEGIMQQFFHHPWHGLHFWDLIQPGF 60

Query: 109 LFIVGVAIALALKKVPK----INGAVKKIIFRTLKLLFWGII 146
           +F+ GVA+A +L+K  +     N ++KK + R+  L FWG++
Sbjct: 61  MFMAGVAMAYSLQKQKQRDYTWNRSLKKTLRRSGWLFFWGVL 102


>gi|409198223|ref|ZP_11226886.1| hypothetical protein MsalJ2_14356 [Marinilabilia salmonicolor JCM
           21150]
          Length = 394

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 15/103 (14%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG--------AYARIDHSPWNGCTLADFVMPFF 108
           +  + R+ ++D  R +TV LM+ V+D  G         +A   H   +G  LAD V P F
Sbjct: 4   KTGTHRIKSIDILRAITVALMVFVNDLPGIRDIPQWLGHASAGH---DGMFLADIVFPLF 60

Query: 109 LFIVGVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGIIL 147
           LF VG++I LA+    KK       V+ I+ RT  L+F G+++
Sbjct: 61  LFWVGMSIPLAVDGRQKKGDSDLTIVRHILKRTFSLVFIGVLM 103


>gi|329960675|ref|ZP_08299018.1| conserved domain protein [Bacteroides fluxus YIT 12057]
 gi|328532548|gb|EGF59342.1| conserved domain protein [Bacteroides fluxus YIT 12057]
          Length = 394

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 59/112 (52%), Gaps = 21/112 (18%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG--------AYARIDHSPWNGCTLADFVMPFFLFIV 112
           +RVA +D FR LT+ LM+ V+D  G         +A+ID    +    +D + P FLF +
Sbjct: 7   QRVAAVDVFRALTMFLMLFVNDIPGLKNVPHWLMHAKIDE---DMLGFSDTIFPAFLFCM 63

Query: 113 GVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGII------LQGGYSHA 154
           G++++LA+    KK       +  I +RT+ LL  G+       ++GG SH+
Sbjct: 64  GMSVSLAIQNRYKKGNTTLQVISHIFWRTIALLAMGLFSLNSGGIEGGLSHS 115


>gi|406831132|ref|ZP_11090726.1| hypothetical protein SpalD1_05831 [Schlesneria paludicola DSM
           18645]
          Length = 508

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 56/106 (52%), Gaps = 10/106 (9%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYA---RIDHSPWNGCTLADFVMPFFLFIVGV 114
           + S R+ +LD FRG T++ M+LV+  GG  A   RI     + C+ AD +MP FLF  G 
Sbjct: 10  KPSLRLTSLDQFRGYTMLGMLLVNFIGGYKAVSPRILLHTHDYCSYADTIMPHFLFAAGF 69

Query: 115 AIALALKKVPKING------AVKKIIFRTLKLLFW-GIILQGGYSH 153
           A+ L+L +  +  G      A+++I+   L  + W G    GG  H
Sbjct: 70  ALRLSLGRRMEAGGKMPWGRAIRRILGLALVAIIWYGYCDWGGVVH 115


>gi|322785713|gb|EFZ12351.1| hypothetical protein SINV_07836 [Solenopsis invicta]
          Length = 106

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 43/86 (50%), Gaps = 4/86 (4%)

Query: 318 LSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLY 377
           L  ++AI    +G+  G +L+ +KG   R+  W+        +  ILHFTN IPINK L+
Sbjct: 10  LRILTAIFQVFLGVQAGVILMIYKGWKERVIRWLLWAVFYGCMGCILHFTNIIPINKSLW 69

Query: 378 SFSYV----CFTAGAADGCLGITNSI 399
           S S+V     F      GC  + + I
Sbjct: 70  SLSFVFVSTSFALAFLSGCYLLIDVI 95


>gi|407789242|ref|ZP_11136344.1| hypothetical protein B3C1_03120 [Gallaecimonas xiamenensis 3-C-1]
 gi|407207220|gb|EKE77163.1| hypothetical protein B3C1_03120 [Gallaecimonas xiamenensis 3-C-1]
          Length = 364

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 35/126 (27%), Positives = 61/126 (48%), Gaps = 13/126 (10%)

Query: 89  RIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQ 148
           ++ HS W+G T  D + P F+F+ GV + LA K+   + G  ++ ++R+       ++L 
Sbjct: 48  QMAHSDWHGLTAYDGIFPLFIFLSGVTLGLADKRASALGGGARRALYRSALRRLLLLLLL 107

Query: 149 GG-YSHAPDALSYGVDMK----HIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHL 203
           G  Y+H      +G  +      +R+  +L RI L + V A+   L    RP V +   L
Sbjct: 108 GVLYNH-----GWGTGLPGHWDEVRYASVLGRIGLAWFVAAM---LVWHCRPKVWQGVAL 159

Query: 204 SIFTAY 209
           +I   Y
Sbjct: 160 AILLGY 165


>gi|406980095|gb|EKE01754.1| hypothetical protein ACD_21C00059G0003 [uncultured bacterium]
          Length = 295

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 51/92 (55%), Gaps = 7/92 (7%)

Query: 306 SWCRAP---FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAI 362
           +W  AP   F+PEG+LST+ AI +   G+  G +L   K    +   W+ + FG  +I +
Sbjct: 103 TWVNAPAPGFDPEGILSTLPAIATMLFGVLTGQLL---KSSFTQKTVWMLI-FGGALIFL 158

Query: 363 ILHFTNAIPINKQLYSFSYVCFTAGAADGCLG 394
            L  ++ +PINK L++ SY  FT+G A    G
Sbjct: 159 GLVMSHWLPINKNLWTSSYAVFTSGMASVVFG 190


>gi|338211620|ref|YP_004655673.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336305439|gb|AEI48541.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 393

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 74/344 (21%), Positives = 132/344 (38%), Gaps = 71/344 (20%)

Query: 62  RVATLDAFRGLTVVLMI------------LVDDAGGAYARI--DHSPWNGCTLADFVMPF 107
           R++++DA+RG  + LM+            L D +  A+     DH  W GC+L D + P 
Sbjct: 8   RISSVDAYRGFVMFLMMAEVLEFGHISKALPDSSFWAFLAYNQDHVEWVGCSLHDLIQPS 67

Query: 108 FLFIVGVA----IALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
           F F+VGVA    IA  + K           + R+L L+F GI L+          S   +
Sbjct: 68  FSFLVGVALPYSIASRMAKGQNFGSMFGHTVQRSLILIFLGIFLR----------SMHRE 117

Query: 164 MKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIY 223
             +  +   L +I L Y V+  +   T + +   L     +I   Y       ++ F  Y
Sbjct: 118 QTNFTFEDTLTQIGLGYPVLFWLGFKTFRTQLIALS----TILVGY-------WLFFAAY 166

Query: 224 IITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPV 283
            +        +   +   +H ++      G   H     NA    D+  W +N       
Sbjct: 167 PLPGADFDYASVGVTADWEHNLQ------GFAAHWNKNTNAAWSFDQ--WFMNLF----- 213

Query: 284 WSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH 343
                      P   P   +   +    F P   L T+       +G+  G  L      
Sbjct: 214 -----------PREKPFIRNGGGYSTLSFIPT--LGTM------VLGLMAGQWLKSDTAP 254

Query: 344 SARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAG 387
             ++K + ++G GLL++  +L++    P  K++++ ++V F+ G
Sbjct: 255 MEKVKRFAALGVGLLVVGSLLNWLGICPNVKRIWTPTWVLFSGG 298


>gi|408370371|ref|ZP_11168148.1| hypothetical protein I215_05677 [Galbibacter sp. ck-I2-15]
 gi|407744129|gb|EKF55699.1| hypothetical protein I215_05677 [Galbibacter sp. ck-I2-15]
          Length = 394

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 8/98 (8%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDD---AGGAYARIDHSPW-NGCTLADFVMPFFLFIVG 113
             S R+ ++D  RG+T+ LM+ V+D    G     +    W +G  LAD+V P FLF+VG
Sbjct: 1   MNSNRIMSIDIMRGITLFLMLFVNDLFIPGVPKWLVHTQEWEDGMGLADWVFPGFLFMVG 60

Query: 114 VAIALALKKVPKINGAVKK----IIFRTLKLLFWGIIL 147
           ++I  A+K       +  +    +I RTL LL  GI++
Sbjct: 61  LSIPYAMKARKNKGQSNLRLWSHVIMRTLSLLLIGILM 98


>gi|404404699|ref|ZP_10996283.1| hypothetical protein AJC13_04673 [Alistipes sp. JC136]
          Length = 376

 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 67/292 (22%), Positives = 109/292 (37%), Gaps = 89/292 (30%)

Query: 61  KRVATLDAFRGLTVVLMI--------LVDDAGGAYA-----RIDHSPWNGCTLADFVMPF 107
           +R+ +LDA RG  +  ++        L     G++A     ++ H+ W+G    D + P 
Sbjct: 12  QRLLSLDALRGFDMFFIMGFAGLVVALCKLRPGSFADWMSAQMGHAAWDGFFHHDTIFPL 71

Query: 108 FLFIVGVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
           FLFI G++   +L    +K  +      K+I R L L+  G++  G          + +D
Sbjct: 72  FLFIAGISFPFSLAKQREKGVRERSIYTKVIRRGLTLVALGLVYNG---------LFNLD 122

Query: 164 MKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIY 223
              +R   +L RI L ++  A++      R                        IA    
Sbjct: 123 FATLRLPSVLGRIGLAWMFAAMLFIRFGIRTR----------------------IALAAV 160

Query: 224 IITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPV 283
           I+  Y L +    F    D      + + G         N VGY+DR +    HLY    
Sbjct: 161 ILVGYGLLL---QFVAAPDAAGAGPLTEAG---------NIVGYIDRTIMP-AHLYG--- 204

Query: 284 WSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGH 335
                       N G             F+PEGLLST+ AI++  +G+  G 
Sbjct: 205 ------------NRG-------------FDPEGLLSTLPAIVTAMLGMFTGE 231


>gi|326663866|ref|XP_696425.5| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
           [Danio rerio]
          Length = 170

 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 49/87 (56%), Gaps = 9/87 (10%)

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLIHFKG-HSARLKHWVSMGFGLLIIAIIL----H 365
           PF+PEG+L +I++IL   +G+  G +L+H++  H   +  ++  G  L II+ +L     
Sbjct: 2   PFDPEGVLGSINSILMAFLGLQAGKILLHYRDQHRQIITRFLMWGLILGIISAVLTKCSR 61

Query: 366 FTNAIPINKQLYSFSYV----CFTAGA 388
               IP+NK L+S SYV    CF   A
Sbjct: 62  NDGFIPVNKNLWSLSYVTTLSCFAFVA 88


>gi|406831133|ref|ZP_11090727.1| hypothetical protein SpalD1_05836 [Schlesneria paludicola DSM
           18645]
          Length = 415

 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 49/94 (52%), Gaps = 9/94 (9%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
            S R+ +LD FRG T+V MILV+  G       R+     + C+ AD +MP F F VG A
Sbjct: 11  SSPRLTSLDQFRGYTMVGMILVNYLGAYKEVTPRLFRHTNDYCSYADTIMPHFFFAVGFA 70

Query: 116 IALALKKVPKING------AVKKIIFRTLKLLFW 143
           + L+L K  +  G      A+++I+   L  + W
Sbjct: 71  MRLSLGKRIEAGGKMPWGRAIRRILGLALVAIVW 104


>gi|306818439|ref|ZP_07452162.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35239]
 gi|304648612|gb|EFM45914.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35239]
          Length = 399

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 87/373 (23%), Positives = 131/373 (35%), Gaps = 98/373 (26%)

Query: 33  KEKGLERSEVQDE---------QKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           +E G   +  Q E           GE    +    K  R+ +LD  RG    L++ V  A
Sbjct: 4   RETGNPSAATQSEAATTRQTEPNTGETNQTETKPAKPGRITSLDVGRGW--FLIMSVTSA 61

Query: 84  GGAYARID---HSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKL 140
                R D   H+PW G    D + P F+ + G+ +A A          +++I+   +  
Sbjct: 62  AWLLPRPDWLIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYHNRVSFKVTLRRIVVLVVVG 121

Query: 141 LFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEP 200
           L +  +  G +           D    R+ G LQ  A   V+VA+I T            
Sbjct: 122 LLYNGVSSGQW-----------DPATFRFTGPLQVYA---VIVAIIATC----------- 156

Query: 201 RHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGP 260
            HL    A  W    G  A V  + T                 G+  +       G L P
Sbjct: 157 -HL---FARNWMVWAGITAGVAVLQT-----------------GLLTWWAGTCPSGVLSP 195

Query: 261 ACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLST 320
           +CN  G  DR L G +  Y                  G L  D          PEGL++ 
Sbjct: 196 SCNPSGMWDRALLGAHMYY-----------------GGFLGHD----------PEGLVAI 228

Query: 321 ISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILH-----FTNAIPINKQ 375
             A+L+   G   GH+ +     S+R   W +    LL +A  +       T  +P  K+
Sbjct: 229 TGALLTAAAGTTAGHLAL-----SSRRLGWKTGPVKLLALAAAMSVFGLILTIWVPAFKR 283

Query: 376 LYSFSYVCFTAGA 388
           L++ S+    AGA
Sbjct: 284 LWTPSF-SLIAGA 295


>gi|430744193|ref|YP_007203322.1| hypothetical protein Sinac_3363 [Singulisphaera acidiphila DSM
           18658]
 gi|430015913|gb|AGA27627.1| hypothetical protein Sinac_3363 [Singulisphaera acidiphila DSM
           18658]
          Length = 368

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 47/94 (50%), Gaps = 5/94 (5%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVAI 116
              R+ +LD FRG TVV M+ V+  G   A   +     + C+ AD +MP F F VG A 
Sbjct: 19  SGSRIVSLDQFRGYTVVGMLFVNFLGNFDALPAVFKHHNSYCSYADTIMPQFFFAVGFAY 78

Query: 117 ALALKK---VPKINGAVKKIIFRTLKLLFWGIIL 147
            L   +      I GAV  ++ R+L L+  G ++
Sbjct: 79  RLTFLRRLETSGIGGAVAAVLRRSLGLILLGFVI 112


>gi|338209612|ref|YP_004653659.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336303425|gb|AEI46527.1| Protein of unknown function DUF2261, transmembrane [Runella
           slithyformis DSM 19594]
          Length = 398

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 58/99 (58%), Gaps = 9/99 (9%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDD---AGGAYARIDHSPW--NGCTLADFVMPFFLF 110
           +Q   KRV ++DAFR LT++LMI V+D     G    ++H+    +    +D + P FLF
Sbjct: 1   MQITLKRVPSIDAFRALTMLLMIFVNDFWSLSGIPYWLEHAKAEEDFLGFSDIIFPCFLF 60

Query: 111 IVGVAIALALK-KVPKING---AVKKIIFRTLKLLFWGI 145
           I+G+AI  A++ ++ K +     V+ II R++ L+  GI
Sbjct: 61  ILGMAIPFAVQNRIAKGDTRWQIVRHIILRSVALIVMGI 99


>gi|160874301|ref|YP_001553617.1| hypothetical protein Sbal195_1181 [Shewanella baltica OS195]
 gi|378707545|ref|YP_005272439.1| hypothetical protein [Shewanella baltica OS678]
 gi|160859823|gb|ABX48357.1| conserved hypothetical protein [Shewanella baltica OS195]
 gi|315266534|gb|ADT93387.1| hypothetical protein Sbal678_1209 [Shewanella baltica OS678]
          Length = 384

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 74/152 (48%), Gaps = 15/152 (9%)

Query: 56  LQQKSKRVATLDAFRGLTVV-----------LMILVDDAGGAYA--RIDHSPWNGCTLAD 102
           ++    R+ +LDA RG  +            L+IL   AG  +   ++ HS W+G    D
Sbjct: 12  VKVNKPRLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYD 71

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIFR-TLKLLFWGIILQGGYSHAPDALSYG 161
            + P F+F+ GVA+ L+ K++ K+    +  ++R  +K LF  ++L   Y+H        
Sbjct: 72  LIFPLFIFLSGVALGLSPKRLDKLPMKERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAP 130

Query: 162 VDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
            D + IR+  +L RIA  +   AL+   T+ R
Sbjct: 131 ADPEKIRYASVLGRIAFAWFFAALLVWHTSLR 162



 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 57/107 (53%), Gaps = 6/107 (5%)

Query: 309 RAPFEPEGLLSTISAILSGTIGIHYGHVLI--HFKGHSARLKHWVSMGFGLLIIAIILHF 366
           R P +PEGLLSTI AI++   G+  GH ++  H KG  A++    + G   L    +L  
Sbjct: 215 RTP-DPEGLLSTIPAIVNALAGVFVGHFIVKSHPKGEWAKVGLLAAAGCVCLAFGWLLDL 273

Query: 367 TNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACF 413
              IP+NK+L++ S+V  T+G +   L +  ++ +  + W +    F
Sbjct: 274 V--IPVNKELWTSSFVLVTSGWSMILLAVFYAL-VDVLKWQKAAFFF 317


>gi|334364999|ref|ZP_08513969.1| putative membrane protein [Alistipes sp. HGB5]
 gi|313158791|gb|EFR58176.1| putative membrane protein [Alistipes sp. HGB5]
          Length = 383

 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 71/151 (47%), Gaps = 32/151 (21%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMI---------------LVDDAGGAYARIDHSPWNGCTL 100
           + ++S+R+ +LDA RG  ++ ++                V DA  A A + H  W+G   
Sbjct: 17  IMKQSERLMSLDALRGFDMLFIMGFASLVVAVCGLWPSAVTDA--AAASMSHVAWDGFAH 74

Query: 101 ADFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRTLKLLFWGIILQGGYSHAP 155
            D + P FLFI GV+   ++ K  +  G  +     KI+ R L L+  G++  G +    
Sbjct: 75  HDTIFPLFLFIAGVSFPYSVAK-QRAGGMSEGRIYAKIVRRGLTLVVLGMVYNGLFK--- 130

Query: 156 DALSYGVDMKHIRWCGILQRIALVYVVVALI 186
                 +D +++R   +L RI L + + A++
Sbjct: 131 ------LDFENLRIASVLGRIGLAWSIAAVL 155



 Score = 42.0 bits (97), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 371
           F+PEGLLST+ A+++  +G+  G  +        R   W++      ++A  L F+  +P
Sbjct: 217 FDPEGLLSTVPAVVTAMLGMFTGEFVRRSDIRGGRKTLWMAA-AAAALLAAGLAFSGVLP 275

Query: 372 INKQLYSFSYVCFTAGAADGCLGI 395
           +NK+L+S ++VC     + G   +
Sbjct: 276 VNKKLWSSTFVCVVGAYSLGMFAL 299


>gi|428319838|ref|YP_007117720.1| hypothetical protein Osc7112_5038 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428243518|gb|AFZ09304.1| hypothetical protein Osc7112_5038 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 482

 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 41/126 (32%), Positives = 53/126 (42%), Gaps = 34/126 (26%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWN-----------------GCTLAD 102
           S+R   LDA RG+ V+ M+L     G  AR     W                  G T  D
Sbjct: 14  SQRADALDALRGIAVLAMVL----SGTIARKTLPAWMYHAQLPPPDHIFNNKLPGLTWVD 69

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGV 162
            V PFFLF +G AI LAL +        KK+I           ILQ G+  A    S+ +
Sbjct: 70  LVFPFFLFAMGAAIPLALSRRIAKGWDTKKVILS---------ILQRGFLLA----SFAI 116

Query: 163 DMKHIR 168
            ++HIR
Sbjct: 117 FLQHIR 122


>gi|115770385|ref|XP_001180412.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Strongylocentrotus purpuratus]
          Length = 78

 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 43/75 (57%), Gaps = 9/75 (12%)

Query: 83  AGGAYAR-----IDHSPWNGCTLADFVMPFFLFIVGVAIALA----LKKVPKINGAVKKI 133
           AGGAY       + H+ W+G T+ADF+ P+F+FI+G +I L+    L K        KK+
Sbjct: 4   AGGAYGDGHYWFVSHAIWSGITVADFMFPWFVFIMGTSIHLSINILLSKGQSYPSIYKKL 63

Query: 134 IFRTLKLLFWGIILQ 148
           + R++ L   G+ +Q
Sbjct: 64  VSRSITLFIMGVCIQ 78


>gi|373954275|ref|ZP_09614235.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
 gi|373890875|gb|EHQ26772.1| Protein of unknown function DUF2261, transmembrane
           [Mucilaginibacter paludis DSM 18603]
          Length = 397

 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 53/95 (55%), Gaps = 9/95 (9%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYAR---IDHSP--WNGCTLADFVMPFFLFIVGVA 115
            RV ++D FR +T+ LMI V+D  G       I H+    +G  LAD V P FLFIVG++
Sbjct: 7   NRVHSIDIFRAVTMFLMIFVNDIDGVPGVPEWIKHAGERTDGLGLADIVFPAFLFIVGLS 66

Query: 116 IALALK-KVPKINGAVK---KIIFRTLKLLFWGII 146
           I  A++ ++ + +   K    I+ R L L+F G I
Sbjct: 67  IPHAIQSRISRGDSKTKIAAYIVMRALALIFIGFI 101


>gi|334366956|ref|ZP_08515871.1| putative membrane protein [Alistipes sp. HGB5]
 gi|313156833|gb|EFR56273.1| putative membrane protein [Alistipes sp. HGB5]
          Length = 370

 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 40/155 (25%), Positives = 69/155 (44%), Gaps = 26/155 (16%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMI----LVDDAGGAY---------ARIDHSPWNGCTLAD 102
           +   ++R+ +LD  RG+ +  ++    LV      +         +++ H+ WNG T+ D
Sbjct: 1   MMNPNRRLLSLDTLRGVDMFFIMGFSGLVTSLCALWPGSFTDMLASQMQHAAWNGLTIQD 60

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKI---IFRT-LKLLFWGIILQGGYSHAPDAL 158
            + P FLFI GVA   +L K        K+I   IFR  L L   G++  G +       
Sbjct: 61  TIFPLFLFIAGVAFPFSLAKQRARGFGRKRILDRIFRRGLILALLGMVYNGLFE------ 114

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
              ++   +R   +L RI L ++  AL+    + R
Sbjct: 115 ---LNFSSLRIASVLGRIGLAWMFAALLCVYCSVR 146



 Score = 43.5 bits (101), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 1/81 (1%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIH-FKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           F+PEG+LST+ A++S   G+  G  L+   +G S   K +      L I    L +   +
Sbjct: 202 FDPEGILSTLPAVVSALFGMFTGEFLLDGRRGLSGSWKAFYMAVAALAITTAGLCWNLIM 261

Query: 371 PINKQLYSFSYVCFTAGAADG 391
           P+NK L+S S+ C  +G + G
Sbjct: 262 PVNKNLWSSSFTCVVSGYSLG 282


>gi|329849634|ref|ZP_08264480.1| hypothetical protein ABI_25290 [Asticcacaulis biprosthecum C19]
 gi|328841545|gb|EGF91115.1| hypothetical protein ABI_25290 [Asticcacaulis biprosthecum C19]
          Length = 410

 Score = 47.8 bits (112), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 9/103 (8%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDA---GGAYARIDH--SPWNGCTLADFVMPFFLFI 111
           + +  RV  +D  R LT+VLMI V+D     G    ++H  S  +G  L+D V P FLFI
Sbjct: 16  KNQFSRVGAIDLVRALTMVLMIFVNDLWSLKGVPVWLEHVASGVDGMGLSDVVFPAFLFI 75

Query: 112 VGVAIALALK----KVPKINGAVKKIIFRTLKLLFWGIILQGG 150
           VG+++  A+     +   +   V  I+ R++ LL  G+ L  G
Sbjct: 76  VGLSLPFAVSSRQARGDSLGSTVLHILGRSVALLVMGVFLVNG 118


>gi|333382416|ref|ZP_08474086.1| hypothetical protein HMPREF9455_02252 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332828727|gb|EGK01419.1| hypothetical protein HMPREF9455_02252 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 394

 Score = 47.4 bits (111), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 55/100 (55%), Gaps = 11/100 (11%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDD---AGGAYARIDHSPWNGCTL--ADFVMPFFLF 110
           L  K  RVA++D FR LT+  MI V+D     G    ++H+  +   L  +D V P FLF
Sbjct: 5   LIAKPVRVASIDIFRALTMFFMIFVNDFWSVSGVPHWLEHAAASEDMLGFSDVVFPSFLF 64

Query: 111 IVGVAIALALKKVPKINGAVKK-----IIFRTLKLLFWGI 145
           I+G++I LA++   K  G  KK     I+ R++ LL  G+
Sbjct: 65  ILGMSIPLAMESRMK-KGETKKQILWHIVVRSVALLVMGL 103


>gi|390946391|ref|YP_006410151.1| hypothetical protein Alfi_1113 [Alistipes finegoldii DSM 17242]
 gi|390422960|gb|AFL77466.1| hypothetical protein Alfi_1113 [Alistipes finegoldii DSM 17242]
          Length = 366

 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 70/150 (46%), Gaps = 32/150 (21%)

Query: 57  QQKSKRVATLDAFRGLTVVLMI---------------LVDDAGGAYARIDHSPWNGCTLA 101
            ++S+R+ +LDA RG  ++ ++                V DA  A A + H  W+G    
Sbjct: 1   MKQSERLMSLDALRGFDMLFIMGFASLVVAVCGLWPSAVTDA--AAASMSHVAWDGFAHH 58

Query: 102 DFVMPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRTLKLLFWGIILQGGYSHAPD 156
           D + P FLFI GV+   ++ K  +  G  +     KI+ R L L+  G++  G +     
Sbjct: 59  DTIFPLFLFIAGVSFPYSVAK-QRAGGMSEGRIYAKIVRRGLTLVVLGMVYNGLFK---- 113

Query: 157 ALSYGVDMKHIRWCGILQRIALVYVVVALI 186
                +D +++R   +L RI L + + A++
Sbjct: 114 -----LDFENLRIASVLGRIGLAWSIAAVL 138



 Score = 41.6 bits (96), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 371
           F+PEGLLST+ A+++  +G+  G  +        R   W++      ++A  L F+  +P
Sbjct: 200 FDPEGLLSTVPAVVTAMLGMFTGEFVRRGDIRGGRKTLWMAA-AAAALLAAGLAFSGVLP 258

Query: 372 INKQLYSFSYVCFTAGAADGCLGI 395
           +NK+L+S ++VC     + G   +
Sbjct: 259 VNKKLWSSTFVCVVGAYSLGMFAL 282


>gi|187735009|ref|YP_001877121.1| transmembrane protein [Akkermansia muciniphila ATCC BAA-835]
 gi|187425061|gb|ACD04340.1| putative transmembrane protein [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 373

 Score = 47.4 bits (111), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 62/151 (41%), Gaps = 25/151 (16%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMI--LVDDAGGAYARIDHSP-----------WNGC 98
           +  L   + +R+A +DA RG  +  +   L     G     D SP           W G 
Sbjct: 1   MSSLSDTRPQRIAAIDALRGFDMFFLTGGLALVVAGINLFYDRSPEWLVKHSTHVAWEGF 60

Query: 99  TLADFVMPFFLFIVGVAIALALKKVPKINGAVK---KIIFRTLKLLFWGIILQGG-YSHA 154
              D VMP FLFIVG A+  +  K        K   K+  R + L   G+++QG   S  
Sbjct: 61  AAWDLVMPLFLFIVGTAMPFSFSKRIGSEPLWKIYLKVARRVVVLFLLGMVVQGNLLSFE 120

Query: 155 PDALSYGVDMKHIRWCGILQRIALVYVVVAL 185
           P  +S         +C  LQ IA  Y++ A+
Sbjct: 121 PSRMSL--------YCNTLQAIASGYLIAAI 143


>gi|404403948|ref|ZP_10995532.1| hypothetical protein AJC13_00860 [Alistipes sp. JC136]
          Length = 369

 Score = 47.4 bits (111), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 37/151 (24%), Positives = 68/151 (45%), Gaps = 34/151 (22%)

Query: 57  QQKSKRVATLDAFRGLTVVLMI----------------LVDDAGGAYARIDHSPWNGCTL 100
            + ++R+ +LDA RG  ++ ++                  D   G+   + H+ W+G T 
Sbjct: 1   MKPNQRLLSLDALRGFDMLFIMGFSGLVASLCALWPNPFTDAVAGS---MGHAAWDGLTH 57

Query: 101 ADFVMPFFLFIVGVAIALALKKVPKING-----AVKKIIFRTLKLLFWGIILQGGYSHAP 155
            D + P FLFI GV+   +L K  + NG      + K+I R + L+  G++  G +    
Sbjct: 58  HDTIFPLFLFIAGVSFPFSLAK-QRANGLGERAILGKVIRRGVTLVVLGLVYNGLFK--- 113

Query: 156 DALSYGVDMKHIRWCGILQRIALVYVVVALI 186
                 +D   +R   +L RI L ++  A++
Sbjct: 114 ------LDFASLRVASVLGRIGLAWMFAAIL 138



 Score = 44.7 bits (104), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 48/104 (46%), Gaps = 2/104 (1%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFK-GHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           F+PEGLLST+ AI+S   G+  G  L   + G +   K          I  + + ++  +
Sbjct: 201 FDPEGLLSTVPAIVSAMFGMFTGEFLRRERPGLTGDRKALYMALAAAAITLVGIAWSGVM 260

Query: 371 PINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFR 414
           PINK+L+S S+ C   G + G   +   + I    W      FR
Sbjct: 261 PINKKLWSSSFTCVVTGYSLGMFALFYYL-IDVRGWKRWTLFFR 303


>gi|390946357|ref|YP_006410117.1| hypothetical protein Alfi_1078 [Alistipes finegoldii DSM 17242]
 gi|390422926|gb|AFL77432.1| hypothetical protein Alfi_1078 [Alistipes finegoldii DSM 17242]
          Length = 369

 Score = 47.4 bits (111), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 40/154 (25%), Positives = 68/154 (44%), Gaps = 26/154 (16%)

Query: 57  QQKSKRVATLDAFRGLTVVLMI----LVDDAGGAY---------ARIDHSPWNGCTLADF 103
              ++R+ +LD  RG+ +  ++    LV      +         +++ H+ WNG T+ D 
Sbjct: 1   MNPNRRLLSLDTLRGVDMFFIMGFSGLVTSLCALWPGSFTDMLASQMQHAAWNGLTIQDT 60

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKI---IFRT-LKLLFWGIILQGGYSHAPDALS 159
           + P FLFI GVA   +L K        K+I   IFR  L L   G++  G +        
Sbjct: 61  IFPLFLFIAGVAFPFSLAKQRARGFGRKRILDRIFRRGLILALLGMVYNGLFE------- 113

Query: 160 YGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
             ++   +R   +L RI L ++  AL+    + R
Sbjct: 114 --LNFSSLRIASVLGRIGLAWMFAALLCVYCSVR 145



 Score = 43.5 bits (101), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 1/81 (1%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIH-FKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           F+PEG+LST+ A++S   G+  G  L+   +G S   K +      L I    L +   +
Sbjct: 201 FDPEGILSTLPAVVSALFGMFTGEFLLDGRRGLSGSWKAFYMAVAALAITTAGLCWNLIM 260

Query: 371 PINKQLYSFSYVCFTAGAADG 391
           P+NK L+S S+ C  +G + G
Sbjct: 261 PVNKNLWSSSFTCVVSGYSLG 281


>gi|332666399|ref|YP_004449187.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332335213|gb|AEE52314.1| Protein of unknown function DUF2261, transmembrane
           [Haliscomenobacter hydrossis DSM 1100]
          Length = 369

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 66/153 (43%), Gaps = 27/153 (17%)

Query: 58  QKSKRVATLDAFRGLTV--------VLMILVDDAG-----GAYARIDHSPWNGCTLADFV 104
              +R+ +LDA RG  +        V+  L    G      A  ++ H  W+G  L D +
Sbjct: 5   STGQRLYSLDALRGFDMFWIMGAEAVVHSLATATGSSVWEAAAHQLSHPDWHGFRLYDLI 64

Query: 105 MPFFLFIVGVAIALALKKVPKINGAVK-----KIIFRTLKLLFWGIILQGGYSHAPDALS 159
            P FLF+ GVA   ++ +  + NG  K     ++I R L L+  GII   G    P    
Sbjct: 65  FPLFLFLAGVATPYSVGRDLE-NGKPKQQLLLRVIRRGLVLVLLGIIYNNGLVLKP---- 119

Query: 160 YGVDMKHIRWCGILQRIALVYVVVALIETLTTK 192
               +  IR+  +L RI L Y+   +I   T +
Sbjct: 120 ----LAEIRFPSVLGRIGLAYMFANIIYLYTKQ 148


>gi|410099160|ref|ZP_11294132.1| hypothetical protein HMPREF1076_03310 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409219182|gb|EKN12145.1| hypothetical protein HMPREF1076_03310 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 371

 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 66/151 (43%), Gaps = 33/151 (21%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAY----------------ARIDHSPWN-GCT 99
           Q++S+R+ +LDA RG     M  +   G  +                A++ H+ W  G T
Sbjct: 4   QKQSQRLLSLDALRGFD---MFFIMGGGSLFVALATLVPTPFFESIAAQMSHAKWGAGFT 60

Query: 100 LADFVMPFFLFIVGVAIALALKKVPKIN----GAVKKIIFRTLKLLFWGIILQGGYSHAP 155
             D + P FLFI G++   +L+K  +         KKII R + L+  G +  G      
Sbjct: 61  FEDIIFPLFLFIAGISFPFSLEKQRERGMSEAAIYKKIIRRGITLVVLGFVYNG------ 114

Query: 156 DALSYGVDMKHIRWCGILQRIALVYVVVALI 186
                 ++ +  R+  +L RI L ++  ALI
Sbjct: 115 ---LLNLNFETQRYASVLARIGLGWMFGALI 142



 Score = 41.6 bits (96), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 61/111 (54%), Gaps = 17/111 (15%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGH-VLIHFKG--HSARLKHWVSMGFGLLIIAII--LHF 366
           ++PEG+LST+ AI +  +G+  G  V +  +G   + ++ + +++G  LL+I ++  L F
Sbjct: 203 YDPEGILSTLPAIGTALLGMFTGEFVKLRREGLTETKKVVYMLAVGGCLLVIGLLWGLFF 262

Query: 367 TNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAWS 417
               PINK L++ S+VC   G +     I  ++F   +D  EC    R W+
Sbjct: 263 ----PINKYLWTSSFVCTVGGIS----AILFAVFYYIVDVKEC----RGWT 301


>gi|300867270|ref|ZP_07111930.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300334747|emb|CBN57096.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 486

 Score = 47.0 bits (110), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 52/126 (41%), Gaps = 34/126 (26%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPW-----------------NGCTLAD 102
           +KR   LDA RG+ V+ M+L     G  AR     W                  G T  D
Sbjct: 18  NKRADALDALRGIAVLAMVL----SGTIARKTLPAWMYHAQEPPPSHLFNPKLAGLTWVD 73

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGV 162
            V PFFLF +G AI LAL +        KK+I           ILQ G+       S+ +
Sbjct: 74  LVFPFFLFAMGAAIPLALSRRIAKGWDTKKVILS---------ILQRGFLLG----SFAI 120

Query: 163 DMKHIR 168
            ++HIR
Sbjct: 121 FLQHIR 126


>gi|149178821|ref|ZP_01857402.1| phosphoribosylaminoimidazole synthetase [Planctomyces maris DSM
           8797]
 gi|148842362|gb|EDL56744.1| phosphoribosylaminoimidazole synthetase [Planctomyces maris DSM
           8797]
          Length = 405

 Score = 47.0 bits (110), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 51/98 (52%), Gaps = 9/98 (9%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGG----AYARIDHSPWNGCTLADFVMPFFLFIV 112
           ++ +KR+ +LD FRG TV  M LV+  G           H+ +  C+ AD +MP FLF V
Sbjct: 11  EKFNKRIVSLDQFRGYTVAGMFLVNYMGFFVVCPVVLKHHNTY--CSYADTIMPHFLFAV 68

Query: 113 GVAIALALKKVPKINGAVK---KIIFRTLKLLFWGIIL 147
           G A  L   +  +  GAV    +++ R L L+   +I+
Sbjct: 69  GFAFRLTFGRRVQTAGAVSAYARVVRRLLGLVLVSLII 106


>gi|157374353|ref|YP_001472953.1| hypothetical protein Ssed_1214 [Shewanella sediminis HAW-EB3]
 gi|157316727|gb|ABV35825.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3]
          Length = 378

 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 311 PFEPEGLLSTISAILSGTIGIHYGHVLI--HFKGHSARLKHWVSMGFGLLIIAIILHFTN 368
           P +PEG+LSTI A+ +   G+  GH +I  H KG   ++ + +  G   L +  +L F  
Sbjct: 210 PLDPEGILSTIPAVANALAGVFVGHFIIKPHPKGEWFKVVYMLVAGAAFLGLGWLLDFI- 268

Query: 369 AIPINKQLYSFSYVCFTAG 387
            +P+NK+L++ S+   T G
Sbjct: 269 -VPVNKELWTSSFTLVTIG 286



 Score = 43.1 bits (100), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 41/155 (26%), Positives = 72/155 (46%), Gaps = 23/155 (14%)

Query: 57  QQKSKRVATLDAFRGLTVV-----------LMILVDDAGGAY--ARIDHSPWNGCTLADF 103
           +   +R+ +LDA RG  +            L+      G  +   ++ HS W+G T  D 
Sbjct: 7   KVSKRRLMSLDALRGFDMFWILGGEVLFAGLLAWTGWQGWQWFDTQMHHSEWHGFTFYDL 66

Query: 104 VMPFFLFIVGVAIALALKKVPKINGAVKKIIF-----RTLKLLFWGIILQGGYSHAPDAL 158
           + P F+F+ GVA+ L+ K++ K+  A +  ++     R L LLF+GI+   G+       
Sbjct: 67  IFPLFIFLSGVALGLSPKRLDKLPIAKRMPLYIHAVKRLLLLLFFGILYNHGWGTGVP-- 124

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
              V +  +R+  +L RIA  +   A++   T  R
Sbjct: 125 ---VVLDEVRYASVLGRIAFAWFFAAILVWHTCLR 156


>gi|406831131|ref|ZP_11090725.1| hypothetical protein SpalD1_05826 [Schlesneria paludicola DSM
           18645]
          Length = 520

 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 51/96 (53%), Gaps = 8/96 (8%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAY--ARIDHSPWNGCTLADFVMPFFLFIVG 113
           +   S R+ +LD FRG T++ M+LV+  G  +   +I     + C+ AD +MP FLF  G
Sbjct: 9   VPAPSARLTSLDQFRGYTMLGMLLVNYLGSYHVCPQILKHSHDYCSYADTIMPQFLFAAG 68

Query: 114 VAIALALKKVPKING------AVKKIIFRTLKLLFW 143
            A+ L+L K   + G      A+++I+   L  + W
Sbjct: 69  FAMRLSLGKRLAVGGFAPWGRAIRRILGLALVAILW 104


>gi|227875179|ref|ZP_03993321.1| possible heparan-alpha-glucosaminide N-acetyltransferase
           [Mobiluncus mulieris ATCC 35243]
 gi|227844084|gb|EEJ54251.1| possible heparan-alpha-glucosaminide N-acetyltransferase
           [Mobiluncus mulieris ATCC 35243]
          Length = 399

 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 81/373 (21%), Positives = 126/373 (33%), Gaps = 98/373 (26%)

Query: 33  KEKGLERSEVQDE---------QKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDA 83
           +E G   +  Q E           GE    +    K  R+ +LD  RG    L++ V  A
Sbjct: 4   RETGNPSAATQSEAATTRQTEPNTGETNQAETKPAKPGRITSLDVGRGW--FLIMSVTSA 61

Query: 84  GGAYARID---HSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKL 140
                R D   H+PW G    D + P F+ + G+ +A A          +++I+   +  
Sbjct: 62  AWLLPRPDWLIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYHNRVSFKVTLRRIVVLVVVG 121

Query: 141 LFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEP 200
           L +  +  G +           D    R+ G LQ  A++  ++A                
Sbjct: 122 LLYNGVSSGQW-----------DPATFRFTGPLQVYAVIVTIIATCHLF----------- 159

Query: 201 RHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGP 260
                  A  W    G  A V  + T                 G+  +       G L P
Sbjct: 160 -------ARNWMAWAGITAGVAVLQT-----------------GLLTWWAGTCPSGVLSP 195

Query: 261 ACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLST 320
           +CN  G  DR L G +  Y                  G L  D          PEGL++ 
Sbjct: 196 SCNPSGMWDRALLGAHMYY-----------------GGFLGHD----------PEGLVAI 228

Query: 321 ISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI-----PINKQ 375
             A+L+   G   GH+ +     S+R   W +    LL +A  +     I     P  K+
Sbjct: 229 TGALLTAAAGTTAGHLAL-----SSRRLGWKTGPVKLLALAAAMSVFGLILNIWVPAFKR 283

Query: 376 LYSFSYVCFTAGA 388
           L++ S+    AGA
Sbjct: 284 LWTPSF-SLIAGA 295


>gi|256840847|ref|ZP_05546355.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|298376669|ref|ZP_06986624.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
 gi|256738119|gb|EEU51445.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|298266547|gb|EFI08205.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
          Length = 378

 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 37/146 (25%), Positives = 68/146 (46%), Gaps = 23/146 (15%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLADF 103
           ++K  R+ +LD  RG  +  +++++      A                H  W G +  D 
Sbjct: 3   KKKLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWDL 62

Query: 104 VMPFFLFIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSY 160
           VMP FLF+ GV++  +L   K +P      ++I  R L L  +G++ QG      + L+ 
Sbjct: 63  VMPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQG------NLLAL 116

Query: 161 GVDMKHIRWCGILQRIALVYVVVALI 186
             D  ++ +   LQ IA+ Y++ +L+
Sbjct: 117 DPDRVYL-YSNTLQSIAMGYLIASLL 141


>gi|301309931|ref|ZP_07215870.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|423340410|ref|ZP_17318149.1| hypothetical protein HMPREF1059_04074 [Parabacteroides distasonis
           CL09T03C24]
 gi|300831505|gb|EFK62136.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|409227845|gb|EKN20741.1| hypothetical protein HMPREF1059_04074 [Parabacteroides distasonis
           CL09T03C24]
          Length = 378

 Score = 46.6 bits (109), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 37/146 (25%), Positives = 68/146 (46%), Gaps = 23/146 (15%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLADF 103
           ++K  R+ +LD  RG  +  +++++      A                H  W G +  D 
Sbjct: 3   KKKLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWDL 62

Query: 104 VMPFFLFIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSY 160
           VMP FLF+ GV++  +L   K +P      ++I  R L L  +G++ QG      + L+ 
Sbjct: 63  VMPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQG------NLLAL 116

Query: 161 GVDMKHIRWCGILQRIALVYVVVALI 186
             D  ++ +   LQ IA+ Y++ +L+
Sbjct: 117 DPDRVYL-YSNTLQSIAMGYLIASLL 141


>gi|317477968|ref|ZP_07937151.1| hypothetical protein HMPREF1007_00267 [Bacteroides sp. 4_1_36]
 gi|316905882|gb|EFV27653.1| hypothetical protein HMPREF1007_00267 [Bacteroides sp. 4_1_36]
          Length = 394

 Score = 46.6 bits (109), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 57/111 (51%), Gaps = 21/111 (18%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG--------AYARIDHSPWNGCTLADFVMPFFLFIV 112
           +R+A +D FR LT+ LM+ V+D  G         +AR+D    +    +D + P FLF +
Sbjct: 7   QRIAAVDVFRALTMFLMLFVNDIPGLKNVPHWLMHARMDE---DMMGFSDTIFPAFLFCM 63

Query: 113 GVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGII------LQGGYSH 153
           G++++ A+    KK       V  I +RT+ L+  G+       ++GG SH
Sbjct: 64  GMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114


>gi|423344000|ref|ZP_17321713.1| hypothetical protein HMPREF1077_03143 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409213862|gb|EKN06875.1| hypothetical protein HMPREF1077_03143 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 376

 Score = 46.6 bits (109), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 64/148 (43%), Gaps = 25/148 (16%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLAD 102
           +    KR+ +LD  RG  +  ++ ++       R               H  W G +  D
Sbjct: 1   MNSTYKRLESLDVLRGFDLFCLVALEGILHPLGRAIDASWYNDFLWGFSHVQWEGFSSWD 60

Query: 103 FVMPFFLFIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS 159
            VMP F+F+ GV++  AL   K +P      ++II R   L  +G++ QG          
Sbjct: 61  LVMPLFMFMAGVSMPFALSRYKAMPDKWAVYRRIIKRVALLWIFGMMCQGNL-------- 112

Query: 160 YGVDMKHIR-WCGILQRIALVYVVVALI 186
            G+D   I  +   LQ IA+ Y++ A++
Sbjct: 113 LGLDPDRIYLYSNTLQAIAMGYLIAAML 140


>gi|150007979|ref|YP_001302722.1| transmembrane protein [Parabacteroides distasonis ATCC 8503]
 gi|423331514|ref|ZP_17309298.1| hypothetical protein HMPREF1075_01311 [Parabacteroides distasonis
           CL03T12C09]
 gi|149936403|gb|ABR43100.1| putative transmembrane protein [Parabacteroides distasonis ATCC
           8503]
 gi|409230084|gb|EKN22952.1| hypothetical protein HMPREF1075_01311 [Parabacteroides distasonis
           CL03T12C09]
          Length = 378

 Score = 46.6 bits (109), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 37/146 (25%), Positives = 68/146 (46%), Gaps = 23/146 (15%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLADF 103
           ++K  R+ +LD  RG  +  +++++      A                H  W G +  D 
Sbjct: 3   KKKLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWDL 62

Query: 104 VMPFFLFIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSY 160
           VMP FLF+ GV++  +L   K +P      ++I  R L L  +G++ QG      + L+ 
Sbjct: 63  VMPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQG------NLLAL 116

Query: 161 GVDMKHIRWCGILQRIALVYVVVALI 186
             D  ++ +   LQ IA+ Y++ +L+
Sbjct: 117 DPDRVYL-YSNTLQSIAMGYLIASLL 141


>gi|270294981|ref|ZP_06201182.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270274228|gb|EFA20089.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 394

 Score = 46.2 bits (108), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 57/111 (51%), Gaps = 21/111 (18%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG--------AYARIDHSPWNGCTLADFVMPFFLFIV 112
           +R+A +D FR LT+ LM+ V+D  G         +AR+D    +    +D + P FLF +
Sbjct: 7   QRIAAVDVFRALTMFLMLFVNDIPGLKNVPHWLMHARMDE---DMMGFSDTIFPAFLFCM 63

Query: 113 GVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGII------LQGGYSH 153
           G++++ A+    KK       V  I +RT+ L+  G+       ++GG SH
Sbjct: 64  GMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114


>gi|254445881|ref|ZP_05059357.1| hypothetical protein VDG1235_4128 [Verrucomicrobiae bacterium
           DG1235]
 gi|198260189|gb|EDY84497.1| hypothetical protein VDG1235_4128 [Verrucomicrobiae bacterium
           DG1235]
          Length = 394

 Score = 46.2 bits (108), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 55/96 (57%), Gaps = 9/96 (9%)

Query: 61  KRVATLDAFRGLTVVLMILVDD---AGGAYARIDHSPWNGCTL--ADFVMPFFLFIVGVA 115
            R+ ++D FRGLT++LMI V+D          ++H+P +   +  +D + P FLFIVG++
Sbjct: 3   SRIHSIDIFRGLTMLLMIWVNDFWSLTNVPTWLEHAPGDADAMGFSDIIFPAFLFIVGLS 62

Query: 116 IALALK-KVPKINGA---VKKIIFRTLKLLFWGIIL 147
           I  AL+ ++ K +     +  I+ R+  LL  G ++
Sbjct: 63  IPFALRSRLAKGDSKPTIITHILARSFALLLMGFLM 98


>gi|291514624|emb|CBK63834.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
          Length = 352

 Score = 46.2 bits (108), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 13/110 (11%)

Query: 88  ARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKI---IFRT-LKLLFW 143
           +++ H+ WNG T+ D + P FLFI GVA   +L K        K+I   IFR  L L   
Sbjct: 28  SQMQHAAWNGLTIQDTIFPLFLFIAGVAFPFSLAKQRARGFGRKRILDRIFRRGLILALL 87

Query: 144 GIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKR 193
           G++  G +          ++   +R   +L RI L ++  AL+    + R
Sbjct: 88  GMVYNGLFE---------LNFSSLRIASVLGRIGLAWMFAALLCVYCSVR 128



 Score = 43.1 bits (100), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 1/81 (1%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIH-FKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           F+PEG+LST+ A++S   G+  G  L+   +G S   K +      L I    L +   +
Sbjct: 184 FDPEGILSTLPAVVSALFGMFTGEFLLDGRRGLSGSWKAFYMAVAALAITTAGLCWNLIM 243

Query: 371 PINKQLYSFSYVCFTAGAADG 391
           P+NK L+S S+ C  +G + G
Sbjct: 244 PVNKNLWSSSFTCVVSGYSLG 264


>gi|386821099|ref|ZP_10108315.1| hypothetical protein JoomaDRAFT_3082 [Joostella marina DSM 19592]
 gi|386426205|gb|EIJ40035.1| hypothetical protein JoomaDRAFT_3082 [Joostella marina DSM 19592]
          Length = 395

 Score = 46.2 bits (108), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 36/99 (36%), Positives = 54/99 (54%), Gaps = 8/99 (8%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDA--GGAYARIDHSPW--NGCTLADFVMPFFLFIV 112
            +   R+ ++D  RGLT+ LM+ V+D    G    + HS    +   LAD+V P FLF+V
Sbjct: 1   MKTPTRILSIDIMRGLTLFLMLFVNDLFEPGVPKWLVHSKATEDAMGLADWVFPGFLFMV 60

Query: 113 GVAIALA-LKKVPKING---AVKKIIFRTLKLLFWGIIL 147
           G++I  A L +  K  G    +K I+ RTL LL  G+ +
Sbjct: 61  GLSIPFAFLSRRKKGEGDLEILKHILVRTLSLLLIGVFM 99


>gi|436836802|ref|YP_007322018.1| hypothetical protein FAES_3417 [Fibrella aestuarina BUZ 2]
 gi|384068215|emb|CCH01425.1| hypothetical protein FAES_3417 [Fibrella aestuarina BUZ 2]
          Length = 401

 Score = 46.2 bits (108), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 29/66 (43%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYAR---IDHSP--WNGCTLADFVMPFFLFIVGVA 115
            RV ++D  R LT+VLMI V+D     A    ++H P   +G  LAD V P FLFIVG++
Sbjct: 16  TRVDSIDVLRALTMVLMIFVNDLWSLTAIPGWLEHVPEGADGIGLADVVFPAFLFIVGLS 75

Query: 116 IALALK 121
           I  A++
Sbjct: 76  IPFAIQ 81


>gi|305666718|ref|YP_003863005.1| hypothetical protein FB2170_10666 [Maribacter sp. HTCC2170]
 gi|88708942|gb|EAR01176.1| hypothetical protein FB2170_10666 [Maribacter sp. HTCC2170]
          Length = 395

 Score = 45.8 bits (107), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 57/107 (53%), Gaps = 11/107 (10%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDD---AGGAYARIDHSPWNGCTL--ADFVMPFFLFI 111
           + K+ R+A++D  R LT++LMI V+D          + H+  N   L  +D + P FLFI
Sbjct: 3   KSKTLRIASIDVLRALTMLLMIWVNDFWTLTQVPKWLTHAKPNEDYLGFSDIIFPLFLFI 62

Query: 112 VGVAIALAL-----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSH 153
           VG++I  A+     K  P+ +   K I+ R++ LL  G+ +    +H
Sbjct: 63  VGLSIPFAINNRMAKGEPR-SIMFKHIVIRSISLLIIGVFMVNYETH 108


>gi|218260819|ref|ZP_03475938.1| hypothetical protein PRABACTJOHN_01602 [Parabacteroides johnsonii
           DSM 18315]
 gi|218224342|gb|EEC96992.1| hypothetical protein PRABACTJOHN_01602 [Parabacteroides johnsonii
           DSM 18315]
          Length = 376

 Score = 45.8 bits (107), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 64/148 (43%), Gaps = 25/148 (16%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLAD 102
           +    KR+ +LD  RG  +  ++ ++       R               H  W G +  D
Sbjct: 1   MNSTYKRLESLDVLRGFDLFCLVALEGILHPLGRAIDASWYNDFLWGFSHVQWEGFSSWD 60

Query: 103 FVMPFFLFIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS 159
            VMP F+F+ GV++  AL   K +P      ++II R   L  +G++ QG          
Sbjct: 61  LVMPLFMFMAGVSMPFALSRYKAMPDKWAVYRRIIKRVALLWIFGMMCQGNL-------- 112

Query: 160 YGVDMKHIR-WCGILQRIALVYVVVALI 186
            G+D   I  +   LQ IA+ Y++ A++
Sbjct: 113 LGLDPGRIYLYSNTLQAIAMGYLIAAML 140


>gi|389866878|ref|YP_006369119.1| hypothetical protein MODMU_5285 [Modestobacter marinus]
 gi|388489082|emb|CCH90660.1| conserved transmembrane protein of unknown function [Modestobacter
           marinus]
          Length = 327

 Score = 45.8 bits (107), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 35/58 (60%), Gaps = 2/58 (3%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYARID--HSPWNGCTLADFVMPFFLFIVGVAI 116
           +R+  +D  RGL VV M++VD+ G A       H+ W+G  +AD V P FL + GV++
Sbjct: 2   RRLHGVDVLRGLAVVGMLVVDNRGNASIATQWHHAAWDGLHVADVVFPAFLLVAGVSM 59


>gi|456861595|gb|EMF80245.1| hypothetical protein LEP1GSC188_2621 [Leptospira weilii serovar
           Topaz str. LT2116]
          Length = 179

 Score = 45.4 bits (106), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 43/77 (55%), Gaps = 1/77 (1%)

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPI 372
           +PEGLLS +++I +   G+  G +L   +    R + +   G G L   + L +  ++P+
Sbjct: 101 DPEGLLSGVASIATSLFGVLCGFILF-LRESGGRSRVFSIFGLGFLFTFVGLLWDQSLPM 159

Query: 373 NKQLYSFSYVCFTAGAA 389
           NK L++ SY  +TAG A
Sbjct: 160 NKSLWTGSYAAYTAGLA 176


>gi|255013329|ref|ZP_05285455.1| putative transmembrane protein [Bacteroides sp. 2_1_7]
 gi|410103820|ref|ZP_11298741.1| hypothetical protein HMPREF0999_02513 [Parabacteroides sp. D25]
 gi|409236549|gb|EKN29356.1| hypothetical protein HMPREF0999_02513 [Parabacteroides sp. D25]
          Length = 378

 Score = 45.4 bits (106), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 36/146 (24%), Positives = 68/146 (46%), Gaps = 23/146 (15%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLADF 103
           ++K  R+ +LD  RG  +  +++++      A                H  W G +  D 
Sbjct: 3   KKKLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWDL 62

Query: 104 VMPFFLFIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSY 160
           VMP FLF+ GV++  +L   K +P      ++I  R + L  +G++ QG      + L+ 
Sbjct: 63  VMPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVVLLWVFGMMCQG------NLLAL 116

Query: 161 GVDMKHIRWCGILQRIALVYVVVALI 186
             D  ++ +   LQ IA+ Y++ +L+
Sbjct: 117 DPDRVYL-YSNTLQSIAMGYLIASLL 141


>gi|399028715|ref|ZP_10729871.1| hypothetical protein PMI10_01698 [Flavobacterium sp. CF136]
 gi|398073551|gb|EJL64721.1| hypothetical protein PMI10_01698 [Flavobacterium sp. CF136]
          Length = 382

 Score = 45.4 bits (106), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 43/79 (54%), Gaps = 1/79 (1%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVL-IHFKGHSARLKHWVSMGFGLLIIAIILHFTNAI 370
           ++PEGL STI A+ +  +G+  G  L I     S   K  + +   +++I I L +    
Sbjct: 213 YDPEGLFSTIPAVATALLGMFLGTFLKIKANHFSTNKKILIMVASAIVLIGIGLIWNYDF 272

Query: 371 PINKQLYSFSYVCFTAGAA 389
           PINK L+S S+VCF  G +
Sbjct: 273 PINKHLWSSSFVCFVGGCS 291



 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 61/157 (38%), Gaps = 45/157 (28%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLADFVMPFF 108
           R+ +LDA RG  +  ++  +    A A+             + H+ W G T  D + P F
Sbjct: 9   RLVSLDALRGFVMFWIMSGEHIIHALAKAAPIPVFVWMSSQLHHTEWEGITFYDMIFPIF 68

Query: 109 LFIVGVAIALALKKVPKINGA----------VKKIIFRTLK----LLFWGIILQG----- 149
           LF+ GV++  + +K   I G            KKI    LK    L+F G I+ G     
Sbjct: 69  LFVAGVSMPYSFEKKMSIAGVNTPMELPAKEKKKIYLSMLKRTCILIFLGFIVNGLLRFD 128

Query: 150 GYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI 186
           GY                R+  +L RI L +    +I
Sbjct: 129 GYDQT-------------RFASVLGRIGLAWFFAGII 152


>gi|182412825|ref|YP_001817891.1| hypothetical protein Oter_1003 [Opitutus terrae PB90-1]
 gi|177840039|gb|ACB74291.1| conserved hypothetical protein [Opitutus terrae PB90-1]
          Length = 411

 Score = 45.4 bits (106), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 44/136 (32%), Positives = 65/136 (47%), Gaps = 27/136 (19%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDD---AGGAYA----------RIDHSPWNGCTLADFV 104
               R+ ++DA RG  +  ++  D    A GA +          +++H  W G    D +
Sbjct: 7   SSPPRLVSVDALRGFDMFWILGADALVLALGAMSLSPTLRALAGQLEHKDWAGFAFYDLI 66

Query: 105 MPFFLFIVGVAIALALKKVPKING---AVKKIIFRTLKLLFWGIILQGGYSHA-PDALSY 160
            P F+FIVGV+   +L  +    G   AVK+I+ RTL LL +GI   GG +H  PD    
Sbjct: 67  FPLFVFIVGVSTVFSLTSLVAREGRAAAVKRILRRTLLLLAFGIFYNGGLAHQWPD---- 122

Query: 161 GVDMKHIRWCGILQRI 176
                 +R  G+LQRI
Sbjct: 123 ------VRLVGVLQRI 132


>gi|262381452|ref|ZP_06074590.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262296629|gb|EEY84559.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 378

 Score = 45.4 bits (106), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 36/146 (24%), Positives = 68/146 (46%), Gaps = 23/146 (15%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLADF 103
           ++K  R+ +LD  RG  +  +++++      A                H  W G +  D 
Sbjct: 3   KKKLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWDL 62

Query: 104 VMPFFLFIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSY 160
           VMP FLF+ GV++  +L   K +P      ++I  R + L  +G++ QG      + L+ 
Sbjct: 63  VMPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVVLLWVFGMMCQG------NLLAL 116

Query: 161 GVDMKHIRWCGILQRIALVYVVVALI 186
             D  ++ +   LQ IA+ Y++ +L+
Sbjct: 117 DPDRVYL-YSNTLQSIAMGYLIASLL 141


>gi|387789753|ref|YP_006254818.1| hypothetical protein Solca_0510 [Solitalea canadensis DSM 3403]
 gi|379652586|gb|AFD05642.1| hypothetical protein Solca_0510 [Solitalea canadensis DSM 3403]
          Length = 389

 Score = 45.4 bits (106), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 55/115 (47%), Gaps = 15/115 (13%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPW--------NGCTLADFVMPF 107
           + +   R+ ++D  R LT+ LMI V+D    ++ ++   W        +G  L+D + P 
Sbjct: 1   MTKTINRLGSIDVIRALTMFLMIFVNDL---WSLVNVPKWLEHVDVQTDGMGLSDVIFPA 57

Query: 108 FLFIVGVAIALA----LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDAL 158
           FLFIVG++I  +    +KK       +K I  R+  LL  G       S+ P AL
Sbjct: 58  FLFIVGLSIPFSVENRIKKGDSTIQLLKHIFIRSFALLVIGFFHVNLESYNPGAL 112


>gi|423345098|ref|ZP_17322787.1| hypothetical protein HMPREF1060_00459 [Parabacteroides merdae
           CL03T12C32]
 gi|409222884|gb|EKN15821.1| hypothetical protein HMPREF1060_00459 [Parabacteroides merdae
           CL03T12C32]
          Length = 376

 Score = 45.1 bits (105), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 36/148 (24%), Positives = 66/148 (44%), Gaps = 25/148 (16%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLAD 102
           ++   +R+ +LD  RG  +  ++ ++    +  R               H  W G +  D
Sbjct: 1   MKPAYQRLESLDVLRGFDLFCLVALEGVLHSLGRAIDAPWYNDFLWGFSHVQWEGFSSWD 60

Query: 103 FVMPFFLFIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS 159
            VMP F+F+ GV++  AL   K +P      ++I+ R   L  +G++ QG          
Sbjct: 61  LVMPLFMFMAGVSMPFALSRYKAMPDKWAVYRRIVKRVALLWIFGMMCQGNL-------- 112

Query: 160 YGVDMKHIR-WCGILQRIALVYVVVALI 186
            G+D   I  +   LQ IA+ Y++ A++
Sbjct: 113 LGLDPDRIYLYSNTLQAIAMGYLISAML 140


>gi|323343607|ref|ZP_08083834.1| hypothetical protein HMPREF0663_10369 [Prevotella oralis ATCC
           33269]
 gi|323095426|gb|EFZ38000.1| hypothetical protein HMPREF0663_10369 [Prevotella oralis ATCC
           33269]
          Length = 468

 Score = 44.7 bits (104), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 45/142 (31%), Positives = 67/142 (47%), Gaps = 22/142 (15%)

Query: 58  QKSKRVATLDAFRGLTVVLMIL-------VDDAGGAYARI---DH---SPWNGCTLADFV 104
            K +R   LDA RG  ++ MIL       V  A   +A++   DH       G T  D +
Sbjct: 1   MKQERAHALDALRGYAIMTMILSATEAFRVLPAWMYHAQVPPPDHVFNPSIYGITWVDLI 60

Query: 105 MPFFLFIVGVAIALALKKVPKINGAVKKI----IFRTLKLLFWGIILQGGYSHAPDALSY 160
            PFFLF +G AI L+L +  K   +++K+      R LKL F+ I +   Y   P  L Y
Sbjct: 61  FPFFLFSMGAAIPLSLGRQYKAGASLRKLCRKSAIRWLKLAFFAIFI---YHTFPFMLGY 117

Query: 161 GVDMKHIRWCGILQRIALVYVV 182
               + +R+   L   AL++V+
Sbjct: 118 --RQEWLRYAVPLAGFALMFVL 137


>gi|332668157|ref|YP_004450945.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332336971|gb|AEE54072.1| Protein of unknown function DUF2261, transmembrane
           [Haliscomenobacter hydrossis DSM 1100]
          Length = 387

 Score = 44.7 bits (104), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 53/103 (51%), Gaps = 17/103 (16%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSP-W--------NGCTLADFVMPFF 108
            K++R+ ++D  R +T++LMI V+D       + H P W        +G  L+D V P F
Sbjct: 1   MKNQRLPSIDILRAVTMLLMIFVND----LWSLTHVPHWLLHTAAEEDGMGLSDVVFPAF 56

Query: 109 LFIVGVAIALALK----KVPKINGAVKKIIFRTLKLLFWGIIL 147
           LFIVG++I  ALK    K       +  I+ RT  LL  G+ +
Sbjct: 57  LFIVGLSIPHALKARLEKGASKGSVMLHILSRTFALLVMGLFM 99


>gi|127512051|ref|YP_001093248.1| hypothetical protein Shew_1118 [Shewanella loihica PV-4]
 gi|126637346|gb|ABO22989.1| conserved hypothetical protein [Shewanella loihica PV-4]
          Length = 387

 Score = 44.7 bits (104), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 23/143 (16%)

Query: 62  RVATLDAFRG-----------LTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFF 108
           R+ +LDA RG           L   L++     G   A A++ HS W+G T  D + P F
Sbjct: 16  RLMSLDALRGFDMFWILGGEALFAALLVWTGWQGWRIADAQMHHSQWHGFTFYDLIFPLF 75

Query: 109 LFIVGVAIALALKKV-----PKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
           +F+ GVA+ L+ K++     P+     +  I R + LL +G++   G+      +    D
Sbjct: 76  IFLSGVALGLSPKRLDSLPWPERLPLYRHAIKRLMLLLLFGVLYNHGWG---TGMPMAAD 132

Query: 164 MKHIRWCGILQRIALVYVVVALI 186
              +R+  +L RIA  +   AL+
Sbjct: 133 --EVRYASVLGRIAFAWFFAALL 153


>gi|167764058|ref|ZP_02436185.1| hypothetical protein BACSTE_02441 [Bacteroides stercoris ATCC
           43183]
 gi|167698174|gb|EDS14753.1| hypothetical protein BACSTE_02441 [Bacteroides stercoris ATCC
           43183]
          Length = 394

 Score = 44.7 bits (104), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 56/112 (50%), Gaps = 21/112 (18%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG--------AYARIDHSPWNGCTLADFVMPFFLFIV 112
           +RVA +D FR LT+ LM+ V+D  G         +A  D    +    +D + P FLF +
Sbjct: 7   QRVAAVDVFRALTMFLMLFVNDIPGLKNVPHWLMHAAADE---DMLGFSDTIFPAFLFCM 63

Query: 113 GVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGII------LQGGYSHA 154
           G++++ A+    KK       +  I +RT+ L+  G+       ++GG SH+
Sbjct: 64  GMSVSFAIQNRYKKGDTTTQVIAHIFWRTVALIAMGLFSLNSGGIEGGLSHS 115


>gi|154492357|ref|ZP_02031983.1| hypothetical protein PARMER_01991 [Parabacteroides merdae ATCC
           43184]
 gi|154087582|gb|EDN86627.1| hypothetical protein PARMER_01991 [Parabacteroides merdae ATCC
           43184]
          Length = 376

 Score = 44.7 bits (104), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 36/148 (24%), Positives = 66/148 (44%), Gaps = 25/148 (16%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLAD 102
           ++   +R+ +LD  RG  +  ++ ++       R               H  W+G +  D
Sbjct: 1   MKPSYQRLESLDVLRGFDLFCLVALEGVLHPLGRAIDAPWYNDFLWCFSHVQWDGFSSWD 60

Query: 103 FVMPFFLFIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS 159
            VMP F+F+ GV++  AL   K +P      ++I+ R   L  +G++ QG          
Sbjct: 61  LVMPLFMFMAGVSMPFALSRYKVMPDKWAVYRRIVKRVALLWIFGMMCQGNL-------- 112

Query: 160 YGVDMKHIR-WCGILQRIALVYVVVALI 186
            G+D   I  +   LQ IA+ Y++ A++
Sbjct: 113 LGLDPDRIYLYSNTLQAIAMGYLISAML 140


>gi|374373619|ref|ZP_09631279.1| hypothetical protein NiasoDRAFT_2435 [Niabella soli DSM 19437]
 gi|373234592|gb|EHP54385.1| hypothetical protein NiasoDRAFT_2435 [Niabella soli DSM 19437]
          Length = 397

 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 58/107 (54%), Gaps = 8/107 (7%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDA--GGAYARIDHSP--WNGCTLADFVMPFFLFIVG 113
           Q + R+ ++D  RG+T+ LM+ V+D    G    + H+    +   LAD+V P FLF+VG
Sbjct: 4   QFAGRIRSIDIMRGITLCLMLFVNDLYEPGVPHWLVHTKAETDSMGLADWVFPGFLFMVG 63

Query: 114 VAIALAL----KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPD 156
           ++I  A+    +K  +    V  I+FR++ LL  G+++  G    P 
Sbjct: 64  LSIPFAIDSRRRKGDEWPQLVLHILFRSVSLLIIGLLMLNGGRVNPQ 110


>gi|255035026|ref|YP_003085647.1| hypothetical protein Dfer_1233 [Dyadobacter fermentans DSM 18053]
 gi|254947782|gb|ACT92482.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 401

 Score = 44.3 bits (103), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 52/101 (51%), Gaps = 9/101 (8%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR---IDHSPW--NGCTLADFVMPFFLF 110
           +   S R+ ++D FR +T++LMI V+D     A    ++HS    +   L+D V P FLF
Sbjct: 4   VASSSLRLDSIDVFRAVTMLLMIFVNDFWTLEAVPKWLEHSKAEEDAMGLSDVVFPAFLF 63

Query: 111 IVGVAIALALKKVPKINGA----VKKIIFRTLKLLFWGIIL 147
           IVG++I  A+    K        ++ I  RT  LL  GI +
Sbjct: 64  IVGLSIPFAISNRRKKGDGNALIIRHIAERTFALLLMGIFI 104


>gi|456890764|gb|EMG01555.1| putative membrane protein [Leptospira borgpetersenii str.
           200701203]
          Length = 270

 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 43/82 (52%), Gaps = 1/82 (1%)

Query: 313 EPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPI 372
           +PEG LS I++I +   G+  G +L   +G     +     G G L   + L +  ++P+
Sbjct: 96  DPEGFLSGIASIATSLFGVICGFILFRREGRGKN-RVLSIFGLGFLFTFVGLLWDRSLPM 154

Query: 373 NKQLYSFSYVCFTAGAADGCLG 394
           NK L++ SY  +T G +  C+G
Sbjct: 155 NKSLWTGSYAVYTTGLSFLCIG 176


>gi|423722057|ref|ZP_17696233.1| hypothetical protein HMPREF1078_00296 [Parabacteroides merdae
           CL09T00C40]
 gi|409242759|gb|EKN35519.1| hypothetical protein HMPREF1078_00296 [Parabacteroides merdae
           CL09T00C40]
          Length = 376

 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 36/148 (24%), Positives = 65/148 (43%), Gaps = 25/148 (16%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLAD 102
           ++   +R+ +LD  RG  +  ++ ++       R               H  W G +  D
Sbjct: 1   MKPSYQRLESLDVLRGFDLFCLVALEGVLHPLGRAIDAPWYNDFLWCFSHVQWEGFSSWD 60

Query: 103 FVMPFFLFIVGVAIALAL---KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALS 159
            VMP F+F+ GV++  AL   K +P      ++I+ R   L  +G++ QG          
Sbjct: 61  LVMPLFMFMAGVSMPFALSRYKVMPDKWAVYRRIVKRVALLWIFGMMCQGNL-------- 112

Query: 160 YGVDMKHIR-WCGILQRIALVYVVVALI 186
            G+D   I  +   LQ IA+ Y++ A++
Sbjct: 113 LGLDPDRIYLYSNTLQAIAMGYLISAML 140


>gi|302759308|ref|XP_002963077.1| hypothetical protein SELMODRAFT_404643 [Selaginella moellendorffii]
 gi|300169938|gb|EFJ36540.1| hypothetical protein SELMODRAFT_404643 [Selaginella moellendorffii]
          Length = 293

 Score = 43.9 bits (102), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 14/85 (16%)

Query: 344 SARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAADGCLGITNSIFIPE 403
           +AR+ +++     LL +  +LH    I +NK LYSFSY+CFTAGAA G +     I +  
Sbjct: 7   NARIINFIVPAVILLALGYVLHLL-GIKMNKPLYSFSYMCFTAGAA-GAVFCLLYILVDV 64

Query: 404 MDWDECDACFRAWSTRYPSRICKWM 428
            D             RYP+ + +WM
Sbjct: 65  YD------------IRYPTLLLEWM 77


>gi|430747657|ref|YP_007206786.1| hypothetical protein Sinac_7036 [Singulisphaera acidiphila DSM
           18658]
 gi|430019377|gb|AGA31091.1| hypothetical protein Sinac_7036 [Singulisphaera acidiphila DSM
           18658]
          Length = 418

 Score = 43.5 bits (101), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 39/145 (26%), Positives = 63/145 (43%), Gaps = 28/145 (19%)

Query: 60  SKRVATLDAFRGLTVVLMIL----VDDAGGAYARID----------HSPWNGCTLADFVM 105
           S+R+A++DAFRG  + L++     +     ++ + +          H  W GC+L D + 
Sbjct: 31  SRRLASIDAFRGFVMFLLLAEWLKLPQVAKSFPKSELWALLSRHQQHVEWVGCSLHDLIQ 90

Query: 106 PFFLFIVGVA----IALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG 161
           P F F+VGVA    IA  L +            +R L L+  GI L+          S G
Sbjct: 91  PSFSFLVGVALPFSIASRLARGQSTTRMAGHAFWRALVLVLLGIFLR----------SMG 140

Query: 162 VDMKHIRWCGILQRIALVYVVVALI 186
            D  +  +   L +I L Y  + L+
Sbjct: 141 KDRTNFTFEDTLTQIGLGYGFLFLL 165


>gi|423304305|ref|ZP_17282304.1| hypothetical protein HMPREF1072_01244 [Bacteroides uniformis
           CL03T00C23]
 gi|423310581|ref|ZP_17288565.1| hypothetical protein HMPREF1073_03315 [Bacteroides uniformis
           CL03T12C37]
 gi|392681752|gb|EIY75109.1| hypothetical protein HMPREF1073_03315 [Bacteroides uniformis
           CL03T12C37]
 gi|392684891|gb|EIY78211.1| hypothetical protein HMPREF1072_01244 [Bacteroides uniformis
           CL03T00C23]
          Length = 394

 Score = 43.5 bits (101), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 56/111 (50%), Gaps = 21/111 (18%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGG--------AYARIDHSPWNGCTLADFVMPFFLFIV 112
           +R+A +D FR LT+ LM+ V+D            +AR+D    +    +D + P FLF +
Sbjct: 7   QRIAAVDVFRALTMFLMLFVNDIPRLKNVPHWLMHARMDE---DMMGFSDTIFPAFLFCM 63

Query: 113 GVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGII------LQGGYSH 153
           G++++ A+    KK       V  I +RT+ L+  G+       ++GG SH
Sbjct: 64  GMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114


>gi|224536805|ref|ZP_03677344.1| hypothetical protein BACCELL_01681 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521571|gb|EEF90676.1| hypothetical protein BACCELL_01681 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 394

 Score = 43.1 bits (100), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 56/108 (51%), Gaps = 15/108 (13%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYAR---IDHSPWNGCTL--ADFVMPFFLFIVGVA 115
           +RVA +D FR LT+ LM+ V+D  G       + H+  N   L  +D + P FLF +G++
Sbjct: 7   QRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLKHAEMNEDMLGFSDTIFPAFLFCMGMS 66

Query: 116 IALAL----KKVPKINGAVKKIIFRTLKLLFWGII------LQGGYSH 153
           ++ A+    +K       +  I +RT+ L+  G+       ++GG SH
Sbjct: 67  VSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIEGGISH 114


>gi|149277363|ref|ZP_01883505.1| hypothetical protein PBAL39_10746 [Pedobacter sp. BAL39]
 gi|149232240|gb|EDM37617.1| hypothetical protein PBAL39_10746 [Pedobacter sp. BAL39]
          Length = 396

 Score = 43.1 bits (100), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 50/100 (50%), Gaps = 15/100 (15%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPW--------NGCTLADFVMPFF 108
           +   +R+ ++DA R L ++LMI V+D    ++ ID   W        N   LAD V P F
Sbjct: 5   KNVQQRLVSIDALRALVMLLMIFVND---LWSLIDIPGWLEHAPGDANYMGLADVVFPAF 61

Query: 109 LFIVGVAIALAL----KKVPKINGAVKKIIFRTLKLLFWG 144
           L IVG+++  A+    +K          I++RT+ LL  G
Sbjct: 62  LVIVGLSVPYAIDSRRRKGDGNRAIFLHIVYRTIALLVMG 101


>gi|423223322|ref|ZP_17209791.1| hypothetical protein HMPREF1062_01977 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638858|gb|EIY32689.1| hypothetical protein HMPREF1062_01977 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 394

 Score = 43.1 bits (100), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 56/108 (51%), Gaps = 15/108 (13%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYAR---IDHSPWNGCTL--ADFVMPFFLFIVGVA 115
           +RVA +D FR LT+ LM+ V+D  G       + H+  N   L  +D + P FLF +G++
Sbjct: 7   QRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLKHAEMNEDMLGFSDTIFPAFLFCMGMS 66

Query: 116 IALAL----KKVPKINGAVKKIIFRTLKLLFWGII------LQGGYSH 153
           ++ A+    +K       +  I +RT+ L+  G+       ++GG SH
Sbjct: 67  VSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIEGGISH 114


>gi|149437198|ref|XP_001516670.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
           partial [Ornithorhynchus anatinus]
          Length = 176

 Score = 43.1 bits (100), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 15/93 (16%)

Query: 308 CRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS----ARLKHW-VSMGFGLLIIAI 362
            +  ++PEG+L TI++I+   +G+  G +L+ +K        R   W V MG   LI  +
Sbjct: 5   TKVAYDPEGILGTINSIVMAFLGVQAGKILLFYKEQHRQIMLRFLTWSVVMG---LISGV 61

Query: 363 ILHFTNA---IPINKQLYSFSYV----CFTAGA 388
           +  F+     +PINK L+S SYV    CF   A
Sbjct: 62  LTKFSQNEGFVPINKNLWSISYVTTLSCFAFVA 94


>gi|427384705|ref|ZP_18881210.1| hypothetical protein HMPREF9447_02243 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727966|gb|EKU90825.1| hypothetical protein HMPREF9447_02243 [Bacteroides oleiciplenus YIT
           12058]
          Length = 398

 Score = 43.1 bits (100), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 57/108 (52%), Gaps = 15/108 (13%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYAR---IDHSPWNGCTL--ADFVMPFFLFIVGVA 115
           +RVA +D FR LT+ LM+ V+D  G       ++H+  N   +  +D + P FLF +G++
Sbjct: 11  QRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLEHAEMNEDMMGFSDTIFPAFLFCMGMS 70

Query: 116 IALAL----KKVPKINGAVKKIIFRTLKLLFWGII------LQGGYSH 153
           ++ A+    +K       +  + +RT+ L+  G+       ++GG SH
Sbjct: 71  VSFAIQNRYRKGDTTLQVIAHVFWRTVALIAMGLFSLNSGGIEGGLSH 118


>gi|146302719|ref|YP_001197310.1| hypothetical protein Fjoh_4992 [Flavobacterium johnsoniae UW101]
 gi|146157137|gb|ABQ07991.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
          Length = 395

 Score = 42.7 bits (99), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 48/90 (53%), Gaps = 15/90 (16%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSP-W--------NGCTLADFVMPFFLF 110
           ++R+ ++DA RG+T+ +MI V++     A I + P W        +  T  D V P FLF
Sbjct: 9   NQRIISIDALRGITIFVMIFVNEL----ASIQNVPQWMKHMPADADAMTFVDLVFPAFLF 64

Query: 111 IVGVAIALALKKVPKINGAVKKIIF-RTLK 139
           IVG+++  A      I G   K+I+  TLK
Sbjct: 65  IVGMSVPFAFNA-RLIKGDSPKVIWTHTLK 93


>gi|189463412|ref|ZP_03012197.1| hypothetical protein BACCOP_04131 [Bacteroides coprocola DSM 17136]
 gi|189429841|gb|EDU98825.1| hypothetical protein BACCOP_04131 [Bacteroides coprocola DSM 17136]
          Length = 219

 Score = 42.7 bits (99), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 57/131 (43%), Gaps = 31/131 (23%)

Query: 259 GPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLL 318
           GP  N +  +D+ + G +HLY+D                               +PEG+L
Sbjct: 86  GPQ-NIIAVIDQYILGASHLYND----------------------------HGIDPEGIL 116

Query: 319 STISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYS 378
           STI +I    IG   G + I  +   ++L+    +G  LL    +  F+   PINK+++S
Sbjct: 117 STIPSISHTLIGYCIGKICIEKENIHSKLEKLFLIGTVLLFAGYL--FSYGCPINKKIWS 174

Query: 379 FSYVCFTAGAA 389
            +YV  T GA 
Sbjct: 175 PTYVFMTCGAG 185


>gi|373850799|ref|ZP_09593600.1| Protein of unknown function DUF2261, transmembrane [Opitutaceae
           bacterium TAV5]
 gi|372476964|gb|EHP36973.1| Protein of unknown function DUF2261, transmembrane [Opitutaceae
           bacterium TAV5]
          Length = 401

 Score = 42.7 bits (99), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 42/68 (61%), Gaps = 5/68 (7%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDD---AGGAYARIDHSP--WNGCTLADFVMPFFLFIVG 113
            + RVA++D  R LT+VLMI+V+D        A + HS    +G  +AD V P FLF+VG
Sbjct: 9   NAGRVASIDILRALTMVLMIIVNDLFTLKNTPAWLGHSASGVDGIGVADVVFPAFLFLVG 68

Query: 114 VAIALALK 121
           +++  AL+
Sbjct: 69  LSLPHALE 76


>gi|336417194|ref|ZP_08597521.1| hypothetical protein HMPREF1017_04629 [Bacteroides ovatus
           3_8_47FAA]
 gi|423297816|ref|ZP_17275876.1| hypothetical protein HMPREF1070_04541 [Bacteroides ovatus
           CL03T12C18]
 gi|335936514|gb|EGM98440.1| hypothetical protein HMPREF1017_04629 [Bacteroides ovatus
           3_8_47FAA]
 gi|392664453|gb|EIY57991.1| hypothetical protein HMPREF1070_04541 [Bacteroides ovatus
           CL03T12C18]
          Length = 466

 Score = 42.7 bits (99), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 54/123 (43%), Gaps = 20/123 (16%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDA------GGAY-------ARIDHSPWNGCTLADFVMP 106
           +KR   LDA RG  ++ M+L          G  Y        R   +  +G T  D V P
Sbjct: 2   TKRAYALDALRGYAIITMVLSATVAWNSLPGWMYHAQTPPPDRAFDASLSGITWVDLVFP 61

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGV 162
           FFLF +G A   ++KK  +      ++++  +K    L F+ I +Q  Y H    LS   
Sbjct: 62  FFLFAMGAAFPFSIKKRFEKGDTKLRLVYEAIKRGAQLTFFAIFIQHFYPH---VLSNPQ 118

Query: 163 DMK 165
           DM+
Sbjct: 119 DMR 121


>gi|406832166|ref|ZP_11091760.1| hypothetical protein SpalD1_11017 [Schlesneria paludicola DSM
           18645]
          Length = 413

 Score = 42.7 bits (99), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 63/134 (47%), Gaps = 23/134 (17%)

Query: 38  ERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMIL----VDDAGGAY--ARI- 90
            +SE++     EL           R+ ++DA+RG  ++LM+     + D   A   +R+ 
Sbjct: 7   NKSEIEGPATLELPAGG---AAPSRLVSVDAYRGWVMLLMMAEVLRLRDVAKALPESRLW 63

Query: 91  -------DHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGA-----VKKIIFRTL 138
                   H  W GC L D + P F F+VGVA+ L+L++   ++G           +R+L
Sbjct: 64  AFLAQQQSHVTWVGCVLHDMIQPSFSFLVGVALPLSLRR-RSLSGQPLWQRTAHAAWRSL 122

Query: 139 KLLFWGIILQGGYS 152
            L+  G+ L+  +S
Sbjct: 123 VLILLGVFLRSTHS 136


>gi|395803959|ref|ZP_10483200.1| hypothetical protein FF52_18830 [Flavobacterium sp. F52]
 gi|395433603|gb|EJF99555.1| hypothetical protein FF52_18830 [Flavobacterium sp. F52]
          Length = 396

 Score = 42.4 bits (98), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 39/71 (54%), Gaps = 13/71 (18%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSP-W--------NGCTLADFVMPFFLF 110
           ++R+ ++D+ RG+T+ +MI V++     A I + P W        +  T  D V P FLF
Sbjct: 9   NQRIVSIDSLRGITIFVMIFVNEL----ASIQNVPQWMKHMPADADAMTFVDLVFPAFLF 64

Query: 111 IVGVAIALALK 121
           IVG++I  A  
Sbjct: 65  IVGMSIPFAFN 75


>gi|312131791|ref|YP_003999131.1| hypothetical protein Lbys_3117 [Leadbetterella byssophila DSM
           17132]
 gi|311908337|gb|ADQ18778.1| hypothetical protein Lbys_3117 [Leadbetterella byssophila DSM
           17132]
          Length = 361

 Score = 42.4 bits (98), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 19/109 (17%)

Query: 57  QQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPW--------NGCTLADFVMPFF 108
            QK  R+ ++D FR LT+  MI V+D    +   +   W        +G   +D + P F
Sbjct: 5   DQKKNRLLSIDIFRALTMFFMIFVND---LFTVKNVPKWMLHTEMHEDGMGFSDVIFPIF 61

Query: 109 LFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDA 157
           L IVG++I       P      K I  RT  LL  G+ L   Y + P  
Sbjct: 62  LLIVGMSI-------PFAKADWKGIGMRTFALLVMGVFLV-NYEYFPSG 102


>gi|359686399|ref|ZP_09256400.1| hypothetical protein Lsan2_17893 [Leptospira santarosai str.
           2000030832]
          Length = 329

 Score = 42.4 bits (98), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 65/291 (22%), Positives = 117/291 (40%), Gaps = 83/291 (28%)

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGA--VKKIIFRTLKLLFWGIILQ--GGYSHAPDAL 158
           F +PFFLF VG +I ++L     IN +     I  R+  L+  G+ L   G +S A    
Sbjct: 9   FGVPFFLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFA---- 64

Query: 159 SYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFI 218
                   +R  G+LQRI  VY VVA +  +   ++                        
Sbjct: 65  -------ELRIPGVLQRIGFVYWVVASLCLVFPGKK------------------------ 93

Query: 219 AFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHL 278
              I + +   L +  W  ++ +  G  + +V       +G       ++DR ++G  HL
Sbjct: 94  ---ILVFSVPILLIHTWILTQIALPG--ESVVSLEQGKDIG------AWIDRTIFGEKHL 142

Query: 279 YSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLI 338
           +                        + +W     +PEG LS ++++++   G+  G +L 
Sbjct: 143 W----------------------RFSKTW-----DPEGFLSGVASVVTTLFGVLCGFILF 175

Query: 339 HFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
                  R +    +G G+L   + L +  ++P+NK L++ SY  +TAG +
Sbjct: 176 ------LRERKNKILGLGILFSFVGLLWDLSLPMNKSLWTGSYSVYTAGLS 220


>gi|332662942|ref|YP_004445730.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332331756|gb|AEE48857.1| Protein of unknown function DUF2261, transmembrane
           [Haliscomenobacter hydrossis DSM 1100]
          Length = 394

 Score = 42.4 bits (98), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 14/75 (18%)

Query: 62  RVATLDAFRGLTVVLMI------------LVDDAGGAYARI--DHSPWNGCTLADFVMPF 107
           R+ ++D +RGL + LM+              D    A+      H PW GC+L D + P 
Sbjct: 9   RLGSVDVYRGLVMFLMMAEVLEFGHVAKAFPDSGFWAFLHFHQSHVPWVGCSLHDLIQPS 68

Query: 108 FLFIVGVAIALALKK 122
           F F+VGVA+  +L  
Sbjct: 69  FSFLVGVALPYSLAS 83


>gi|403174292|ref|XP_003333277.2| hypothetical protein PGTG_14197 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375170913|gb|EFP88858.2| hypothetical protein PGTG_14197 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 386

 Score = 42.4 bits (98), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 47/90 (52%), Gaps = 4/90 (4%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYAR-IDH--SPWNGCTLADFVMPFFLFIVGVAI 116
           +KR  ++D  RGLT + M+LV+ AG      + H  S     T AD + P F+F  G+A 
Sbjct: 24  AKRDRSIDVLRGLTCLAMVLVNTAGPVRPSWLSHPTSIHQSITFADTLFPCFVFTSGLAS 83

Query: 117 ALALKKVPK-INGAVKKIIFRTLKLLFWGI 145
           A + K      N ++K+ + R +KL   GI
Sbjct: 84  AQSKKNEQNGRNPSLKRTLIRAIKLNLIGI 113


>gi|357510839|ref|XP_003625708.1| hypothetical protein MTR_7g102400 [Medicago truncatula]
 gi|355500723|gb|AES81926.1| hypothetical protein MTR_7g102400 [Medicago truncatula]
          Length = 76

 Score = 42.0 bits (97), Expect = 0.53,   Method: Composition-based stats.
 Identities = 21/33 (63%), Positives = 27/33 (81%), Gaps = 4/33 (12%)

Query: 43 QDEQKGELQLQQLLQQKSKRVATLDAFRGLTVV 75
          QD+Q+  L    L++QK+KRVATLDAFRGLT+V
Sbjct: 46 QDDQQPVL----LVKQKTKRVATLDAFRGLTIV 74


>gi|237717694|ref|ZP_04548175.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|299149194|ref|ZP_07042255.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
 gi|229453013|gb|EEO58804.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|298512861|gb|EFI36749.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
          Length = 466

 Score = 42.0 bits (97), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 49/112 (43%), Gaps = 17/112 (15%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDA------GGAY-------ARIDHSPWNGCTLADFVMP 106
           +KR   LDA RG  ++ M+L          G  Y        R   +  +G T  D V P
Sbjct: 2   NKRAYALDALRGYAIITMVLSATVAWNSLPGWMYHAQTPPPDRAFDASLSGITWVDLVFP 61

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHA 154
           FFLF +G A   ++KK  +      ++++  +K    L F+ I +Q  Y H 
Sbjct: 62  FFLFAMGAAFPFSIKKRFEKGDTKLRLVYEAIKRGVQLTFFAIFIQHFYPHV 113


>gi|255532593|ref|YP_003092965.1| hypothetical protein Phep_2702 [Pedobacter heparinus DSM 2366]
 gi|255345577|gb|ACU04903.1| hypothetical protein Phep_2702 [Pedobacter heparinus DSM 2366]
          Length = 390

 Score = 42.0 bits (97), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 36/66 (54%), Gaps = 5/66 (7%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGAYAR---IDH--SPWNGCTLADFVMPFFLFIVGVAI 116
           R   +D  R LT+ LMI V+D G        +DH  +  +G   AD + P FLFIVG+++
Sbjct: 8   RFQAVDVLRALTMFLMIFVNDVGSVKYLPHWVDHVEADVDGMGFADTIFPAFLFIVGLSL 67

Query: 117 ALALKK 122
             AL+ 
Sbjct: 68  PFALQS 73


>gi|377572860|ref|ZP_09801940.1| hypothetical protein MOPEL_003_01300 [Mobilicoccus pelagius NBRC
           104925]
 gi|377538518|dbj|GAB47105.1| hypothetical protein MOPEL_003_01300 [Mobilicoccus pelagius NBRC
           104925]
          Length = 439

 Score = 42.0 bits (97), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 68/136 (50%), Gaps = 11/136 (8%)

Query: 59  KSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIAL 118
           +  R+ +LD  RG+ +V+ ++V+    A    +H+ W G    D V P F+ + G  +A+
Sbjct: 8   RGGRLESLDVCRGVMLVVSVVVNAWFTAPEWFEHAAWTGVHPVDLVFPAFVTLSGAGMAI 67

Query: 119 AL-KKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYG-VDMKHIRWCGILQRI 176
           A  ++VP     V + + R L L   G+     ++ A   L  G VD+  +R+ G+LQ  
Sbjct: 68  AFARRVP-----VARQVRRVLVLTAAGL----AFAVAGQVLGTGAVDVATLRFTGVLQLY 118

Query: 177 ALVYVVVALIETLTTK 192
           A + + + L+  +  +
Sbjct: 119 AFLVLALGLVAVVVRR 134


>gi|116623829|ref|YP_825985.1| hypothetical protein Acid_4741 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116226991|gb|ABJ85700.1| conserved hypothetical protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 376

 Score = 42.0 bits (97), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 55/118 (46%), Gaps = 20/118 (16%)

Query: 57  QQKSKRVATLDAFRGLTVVLMIL-------VDDA--GGAYARI-----DHSPWNGCTLAD 102
            +K  R   +D +RG  ++LM+        V  A  G A+         H  W GC+L D
Sbjct: 1   MEKPARNLAIDTYRGFVMLLMMAEVLQLSRVAKALPGSAFWSFLAYHQTHVEWAGCSLHD 60

Query: 103 FVMPFFLFIVGVAIALALKKVPKINGAVKKI----IFRTLKLLFWGIILQGGYSHAPD 156
            + P F F+VGVA+  ++       GA + +    ++R+  L+  GI L+   +HAP 
Sbjct: 61  TIQPGFSFLVGVALPYSIAARLAKGGAFRAMFLHALWRSFLLIALGIFLRS--THAPQ 116


>gi|423213223|ref|ZP_17199752.1| hypothetical protein HMPREF1074_01284 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693683|gb|EIY86913.1| hypothetical protein HMPREF1074_01284 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 469

 Score = 41.6 bits (96), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 54/122 (44%), Gaps = 19/122 (15%)

Query: 60  SKRVATLDAFRGLTVVLMIL------------VDDAGGAYARIDHSPWNGCTLADFVMPF 107
           + R   LDA RG  ++ M+L            +  A      + +    G T  D V PF
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPDHVFNPLLPGITWVDLVFPF 61

Query: 108 FLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGVD 163
           FLF +G A   ++KK  +   +  K+++    R ++L F+ I +Q  Y   P  LS   D
Sbjct: 62  FLFAMGAAFPFSIKKRAEKGDSKLKLVYEAGKRGIQLTFFAIFIQHFY---PYMLSSPQD 118

Query: 164 MK 165
           M+
Sbjct: 119 MR 120


>gi|373459116|ref|ZP_09550883.1| hypothetical protein Calab_2944 [Caldithrix abyssi DSM 13497]
 gi|371720780|gb|EHO42551.1| hypothetical protein Calab_2944 [Caldithrix abyssi DSM 13497]
          Length = 471

 Score = 41.6 bits (96), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 54/120 (45%), Gaps = 22/120 (18%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDD-------AGGAYA------RIDHSPWNGCTLAD 102
           ++  ++RV  LDA RG  ++ M+L          A   +A      RI +    G T  D
Sbjct: 1   MKSANERVLNLDALRGFAILTMVLAGTIPYTGLPAWMYHAQLPPPERIFNPNLPGFTWVD 60

Query: 103 FVMPFFLFIVGVAIALALKK------VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPD 156
            V P FLF +G AI LAL+K      +P++      I+ RT  L F+ I L     H  D
Sbjct: 61  LVFPLFLFSLGAAIPLALEKRLTRQSLPRVG---LHIVERTFLLAFFAIFLFHVRPHIID 117


>gi|189464971|ref|ZP_03013756.1| hypothetical protein BACINT_01315 [Bacteroides intestinalis DSM
           17393]
 gi|189437245|gb|EDV06230.1| hypothetical protein BACINT_01315 [Bacteroides intestinalis DSM
           17393]
          Length = 394

 Score = 41.6 bits (96), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 56/108 (51%), Gaps = 15/108 (13%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYAR---IDHSPWNGCTL--ADFVMPFFLFIVGVA 115
           +RVA +D FR LT+ LM+ V+D  G       ++H+  N   +  +D + P FLF +G++
Sbjct: 7   QRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLEHADINEDMMGFSDTIFPAFLFCMGMS 66

Query: 116 IALAL----KKVPKINGAVKKIIFRTLKLLFWGII------LQGGYSH 153
           ++ A+    +K       +  I +RT+ L+  G+       + GG SH
Sbjct: 67  VSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIAGGISH 114


>gi|374309893|ref|YP_005056323.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358751903|gb|AEU35293.1| hypothetical protein AciX8_0944 [Granulicella mallensis MP5ACTX8]
          Length = 399

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 57/119 (47%), Gaps = 21/119 (17%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMI--LVDDAGGAYARID------------HSPWNGCTLA 101
           LQQ++  V   DA+RG  + LM+  +   A  A +  D            H  W G +L 
Sbjct: 11  LQQRNSAV---DAYRGFVMALMLAEVFRFAFVAKSFPDNFLLHILAYNQSHVEWTGMSLH 67

Query: 102 DFVMPFFLFIVGVAIALALKKVPKINGAVKKI----IFRTLKLLFWGIILQGGYSHAPD 156
           D + P F F+VGVA+  +L+   +   + K +    I+R+  L+  GI L+  +S A D
Sbjct: 68  DMIQPSFTFLVGVALPYSLRSRRRKGESFKYMLGHTIWRSFLLVALGIFLRSIHSTATD 126


>gi|423287389|ref|ZP_17266240.1| hypothetical protein HMPREF1069_01283 [Bacteroides ovatus
           CL02T12C04]
 gi|392672504|gb|EIY65971.1| hypothetical protein HMPREF1069_01283 [Bacteroides ovatus
           CL02T12C04]
          Length = 470

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 59/123 (47%), Gaps = 20/123 (16%)

Query: 60  SKRVATLDAFRGLTVVLMIL-------VDDAGGAYARI---DH--SPW-NGCTLADFVMP 106
           + R   LDA RG  ++ M+L       V     ++A+    DH  +P   G T  D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGV 162
           FFLF +G A   ++KK  +   +  K+++    R ++L F+ I +Q  Y   P  LS   
Sbjct: 62  FFLFAMGAAFPFSIKKRAEKGDSKLKLVYEAGKRGIQLTFFAIFIQHFY---PYMLSSPQ 118

Query: 163 DMK 165
           DM+
Sbjct: 119 DMR 121


>gi|406835226|ref|ZP_11094820.1| hypothetical protein SpalD1_26403 [Schlesneria paludicola DSM
           18645]
          Length = 508

 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 55  LLQQKSKRVATLDAFRGLTVVLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFI 111
           L      RV ++D FRG  V  MI V+  GG    ++   H+  N  + AD +M  F+F+
Sbjct: 7   LTTDSPDRVISMDQFRGYAVAAMIFVNFVGGFGVVHSVFKHND-NYLSYADTIMANFMFM 65

Query: 112 VGVAIALA-LKKVPKIN--GAVKKIIFRTLKLLFWGIILQG 149
           VG +  L  L+++ +++        + R+L L+F   +L G
Sbjct: 66  VGFSFRLTMLRRLKRMSWLATCWSYVRRSLLLVFVSTLLYG 106


>gi|168705120|ref|ZP_02737397.1| hypothetical protein GobsU_36644 [Gemmata obscuriglobus UQM 2246]
          Length = 387

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 9/68 (13%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGG-AYARID------HSPWNGCTLADFVMPFFLFIVGV 114
           R+A+LD FRG TV+ M+LV+  G  A  + D      H  +  C+ AD +MP FLF VG 
Sbjct: 20  RLASLDQFRGYTVLGMLLVNFVGSFAVIKADVPVLAHHHTY--CSYADTIMPQFLFAVGF 77

Query: 115 AIALALKK 122
           A  L   +
Sbjct: 78  AFRLTFAR 85


>gi|146300862|ref|YP_001195453.1| hypothetical protein Fjoh_3117 [Flavobacterium johnsoniae UW101]
 gi|146155280|gb|ABQ06134.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
          Length = 380

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 44/77 (57%), Gaps = 2/77 (2%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWV-SMGFGLLIIAIILHFTNAI 370
           ++PEG+ STI AI +  +G+  G  L      S  +K  + ++   +LIIA ++   N  
Sbjct: 212 YDPEGIFSTIPAIATALLGVFIGTFLKAKCPFSINIKLLLMALAAVVLIIAGLIWDIN-F 270

Query: 371 PINKQLYSFSYVCFTAG 387
           PINK L++ S+VCF  G
Sbjct: 271 PINKHLWTSSFVCFVGG 287



 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 51/113 (45%), Gaps = 22/113 (19%)

Query: 88  ARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKING---------AVKKIIF--- 135
           +++ H+ WNG T  D + P FLF+ GV++  + +K  K+ G         A K+ I+   
Sbjct: 47  SQLHHAEWNGITFYDMIFPVFLFVAGVSMPFSFEKKMKLAGVKEPKDLPKAEKRKIYLSM 106

Query: 136 --RTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALI 186
             RT  LL  G ++ G           G D    R+  +L RI L +    +I
Sbjct: 107 LRRTCILLVLGFVVNGLLRFD------GFD--QTRFASVLGRIGLAWFFAGII 151


>gi|395804714|ref|ZP_10483949.1| hypothetical protein FF52_22629 [Flavobacterium sp. F52]
 gi|395433102|gb|EJF99060.1| hypothetical protein FF52_22629 [Flavobacterium sp. F52]
          Length = 380

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 37/152 (24%), Positives = 64/152 (42%), Gaps = 35/152 (23%)

Query: 62  RVATLDAFRGLTVVLMILVDDAGGAYAR-------------IDHSPWNGCTLADFVMPFF 108
           R+ +LDA RG  +  ++  +    A A+             + H+ WNG T  D + P F
Sbjct: 8   RLISLDALRGFVMFWIMSGEHIIHALAKAAPIPIFLWMSSQLHHAEWNGITFYDMIFPVF 67

Query: 109 LFIVGVAIALALKKVPKINGAV---------KKIIF-----RTLKLLFWGIILQGGYSHA 154
           LF+ GV++  + +K   + G           K+ I+     RT+ L+  G ++ G     
Sbjct: 68  LFVAGVSMPYSFEKKMNLAGVSTPQELPSKEKRKIYLSMLRRTIILVVLGFVVNGLLRFD 127

Query: 155 PDALSYGVDMKHIRWCGILQRIALVYVVVALI 186
                 G D  H R+  +L RI + +    +I
Sbjct: 128 ------GFD--HTRFASVLGRIGIAWFFAGMI 151



 Score = 38.5 bits (88), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 39/76 (51%)

Query: 312 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 371
           ++PEG+ ST+ AI +  +G+  G  L      S   K  +     +L+I   L +    P
Sbjct: 212 YDPEGIFSTLPAISTALLGVFTGTFLKAKNQFSINAKLILMALTAVLLIIAGLIWDIDFP 271

Query: 372 INKQLYSFSYVCFTAG 387
           INK L++ S+VCF  G
Sbjct: 272 INKHLWTSSFVCFVGG 287


>gi|296121958|ref|YP_003629736.1| hypothetical protein Plim_1707 [Planctomyces limnophilus DSM 3776]
 gi|296014298|gb|ADG67537.1| conserved hypothetical protein [Planctomyces limnophilus DSM 3776]
          Length = 378

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 16/98 (16%)

Query: 91  DHSPWNGCTLADFVMPFFLFIVGVAIALAL-----KKVPKINGAVKKIIFRTLKLLFWGI 145
           +H  W GC+L D V P F+F+VGV+I  +L     K V    G V+  ++R++ L+  GI
Sbjct: 32  EHVAWRGCSLWDMVQPSFMFLVGVSIPWSLAAQKSKNVSTGQGWVRA-VWRSVLLVVLGI 90

Query: 146 ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVV 183
            L      + D            +  +L +I L Y+VV
Sbjct: 91  FLISNNKPSTD----------FSFVNVLTQIGLGYLVV 118


>gi|347736758|ref|ZP_08869314.1| hypothetical protein AZA_88975 [Azospirillum amazonense Y2]
 gi|346919651|gb|EGY01093.1| hypothetical protein AZA_88975 [Azospirillum amazonense Y2]
          Length = 175

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 42/79 (53%), Gaps = 6/79 (7%)

Query: 318 LSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILH--FTNAIPINKQ 375
           +ST+ A+ +  +G+  G  L       AR++  +  G    ++ ++L   +  A PINK 
Sbjct: 1   MSTLPAVATALLGVQAGRWLARPVPLPARIRGLIVAG----VVGVLLGEVWDLAFPINKN 56

Query: 376 LYSFSYVCFTAGAADGCLG 394
           L++ S+V  TAG A  CLG
Sbjct: 57  LWTSSFVALTAGMACLCLG 75


>gi|256423178|ref|YP_003123831.1| hypothetical protein Cpin_4173 [Chitinophaga pinensis DSM 2588]
 gi|256038086|gb|ACU61630.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
          Length = 349

 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 47/102 (46%), Gaps = 14/102 (13%)

Query: 57  QQKSKRVATLDAFRGLTVVLMI---------LVDDAGGAYAR-IDHSPWNGCTLADFVMP 106
              S R+ +LD  RGL ++L+          L +    A+ R   H PW+G    D V P
Sbjct: 1   MNNSGRLLSLDVMRGLIMILLAGESCRVYESLHEWHDNAFIRQFFHHPWHGLRFWDLVQP 60

Query: 107 FFLFIVGVAIALA----LKKVPKINGAVKKIIFRTLKLLFWG 144
            F+ + G A+ ++    L+K    +   K I+ R+LKL   G
Sbjct: 61  AFMLMAGTAMYISYQSKLRKGVSWSQNFKHILIRSLKLFLLG 102


>gi|350546587|ref|ZP_08915965.1| hypothetical protein GUU_02077 [Mycoplasma iowae 695]
 gi|349503825|gb|EGZ31390.1| hypothetical protein GUU_02077 [Mycoplasma iowae 695]
          Length = 427

 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 53/114 (46%), Gaps = 13/114 (11%)

Query: 33  KEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVD-----DAGGAY 87
           K   L +  ++D+ K  + L         R+ ++DAFRG+ V  M++        + G  
Sbjct: 2   KFNLLSQGNIKDKSK-MIFLSDDYFNGVTRLKSIDAFRGMCVFCMLIFQFLKNFPSLGIL 60

Query: 88  ARI-DHSPWNGC------TLADFVMPFFLFIVGVAIALALKKVPKINGAVKKII 134
           +RI +HS   G       TLADF+ P F+F +G+  +L+        G +K  I
Sbjct: 61  SRIANHSLEKGIVILPGMTLADFIAPAFIFAIGLTFSLSFINRKNRQGTLKAFI 114


>gi|325102778|ref|YP_004272432.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324971626|gb|ADY50610.1| hypothetical protein Pedsa_0021 [Pedobacter saltans DSM 12145]
          Length = 466

 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 40/80 (50%), Gaps = 13/80 (16%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDD------AGGAYARIDHSPWN-------GCTLAD 102
           ++   KR ++LD+ RG+ ++LM+L          G  Y      P +       G T  D
Sbjct: 1   MKLTVKRDSSLDSLRGIAIILMVLSGSIAFSILPGWMYHAQVPPPGHKFMPEIPGITWVD 60

Query: 103 FVMPFFLFIVGVAIALALKK 122
            V PFFLF +G AI LA+KK
Sbjct: 61  LVFPFFLFSMGAAIPLAMKK 80


>gi|224025513|ref|ZP_03643879.1| hypothetical protein BACCOPRO_02253, partial [Bacteroides
           coprophilus DSM 18228]
 gi|224018749|gb|EEF76747.1| hypothetical protein BACCOPRO_02253 [Bacteroides coprophilus DSM
           18228]
          Length = 377

 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 31/117 (26%), Positives = 53/117 (45%), Gaps = 20/117 (17%)

Query: 53  QQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYAR---------------IDHSPWNG 97
           + + + + +R+ +LD  RGL + +++         A+                 H  W G
Sbjct: 1   KNMRKIQKERLESLDILRGLDLFILVGFQSVFMYLAQATGENNWIKTIFDVLFTHVEWEG 60

Query: 98  CTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-KIIFRTLK--LLFW--GIILQG 149
             L D VMP FLF+ G +I  A+ +  +    +  K+ +R LK  +L W  G I+QG
Sbjct: 61  FHLWDQVMPLFLFMAGTSIPYAMARYKRKEEEISGKLFYRVLKRVVLLWIFGAIVQG 117


>gi|424665794|ref|ZP_18102830.1| hypothetical protein HMPREF1205_01669 [Bacteroides fragilis HMW
           616]
 gi|404574047|gb|EKA78798.1| hypothetical protein HMPREF1205_01669 [Bacteroides fragilis HMW
           616]
          Length = 385

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 50/100 (50%), Gaps = 15/100 (15%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNG--------CTLADFVMPFFL 109
              +RVA +D FR LT+ LM+ V+D  G    I H  W G           +D + P FL
Sbjct: 4   SSPQRVAAVDVFRALTMFLMLFVNDIPGL-RNIPH--WLGHAAMTEDMLGFSDTIFPAFL 60

Query: 110 FIVGVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGI 145
           F +G++I+ A+    +K   +   +  I +RT+ L+  G+
Sbjct: 61  FCMGMSISFAVQNRYQKGDSLLQVIMHIFWRTVALVVMGL 100


>gi|313148038|ref|ZP_07810231.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|313136805|gb|EFR54165.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
          Length = 385

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 50/100 (50%), Gaps = 15/100 (15%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNG--------CTLADFVMPFFL 109
              +RVA +D FR LT+ LM+ V+D  G    I H  W G           +D + P FL
Sbjct: 4   SSPQRVAAVDVFRALTMFLMLFVNDIPGL-RNIPH--WLGHAAMTEDMLGFSDTIFPAFL 60

Query: 110 FIVGVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGI 145
           F +G++I+ A+    +K   +   +  I +RT+ L+  G+
Sbjct: 61  FCMGMSISFAVQNRYQKGDSLLQVIMHIFWRTVALVVMGL 100


>gi|284035350|ref|YP_003385280.1| hypothetical protein Slin_0417 [Spirosoma linguale DSM 74]
 gi|283814643|gb|ADB36481.1| Protein of unknown function DUF2261, transmembrane [Spirosoma
           linguale DSM 74]
          Length = 389

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 14/75 (18%)

Query: 62  RVATLDAFRGLTVVLM------------ILVDDAGGAYA--RIDHSPWNGCTLADFVMPF 107
           R+ ++DA+RG  +VLM               D A  A+      H  W GC+L D + P 
Sbjct: 4   RLMSMDAYRGFVMVLMAAEMLQFDHLHETFPDSAFWAFLAHHQSHVAWAGCSLHDLIQPS 63

Query: 108 FLFIVGVAIALALKK 122
           F F+VGVA+  ++  
Sbjct: 64  FSFLVGVALLFSMAS 78


>gi|336405631|ref|ZP_08586307.1| hypothetical protein HMPREF0127_03620 [Bacteroides sp. 1_1_30]
 gi|335937114|gb|EGM99020.1| hypothetical protein HMPREF0127_03620 [Bacteroides sp. 1_1_30]
          Length = 470

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 59/123 (47%), Gaps = 20/123 (16%)

Query: 60  SKRVATLDAFRGLTVVLMIL-------VDDAGGAYARI---DH--SPW-NGCTLADFVMP 106
           + R   LDA RG  ++ M+L       V     ++A+    DH  +P   G T  D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGV 162
           FFLF +G A   +++K  +   +  K+++    R ++L F+ I +Q  Y   P  LS   
Sbjct: 62  FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQHFY---PYMLSSPQ 118

Query: 163 DMK 165
           DM+
Sbjct: 119 DMR 121


>gi|373954327|ref|ZP_09614287.1| hypothetical protein Mucpa_2712 [Mucilaginibacter paludis DSM
           18603]
 gi|373890927|gb|EHQ26824.1| hypothetical protein Mucpa_2712 [Mucilaginibacter paludis DSM
           18603]
          Length = 473

 Score = 40.0 bits (92), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 37/81 (45%), Gaps = 14/81 (17%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDD-------AGGAYARIDHSPWN-------GCTLA 101
           +Q   +R  +LDA RG  ++LM+L           G  Y      P +       G T  
Sbjct: 6   IQTPPQRANSLDALRGTAILLMVLSGSIAFGGILPGWMYHAQVPPPAHQFKPDLPGITWV 65

Query: 102 DFVMPFFLFIVGVAIALALKK 122
           D V PFFLF +G AI LAL K
Sbjct: 66  DLVFPFFLFAMGAAIPLALVK 86


>gi|160883830|ref|ZP_02064833.1| hypothetical protein BACOVA_01803 [Bacteroides ovatus ATCC 8483]
 gi|156110915|gb|EDO12660.1| hypothetical protein BACOVA_01803 [Bacteroides ovatus ATCC 8483]
          Length = 470

 Score = 40.0 bits (92), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 59/123 (47%), Gaps = 20/123 (16%)

Query: 60  SKRVATLDAFRGLTVVLMIL-------VDDAGGAYARI---DH--SPW-NGCTLADFVMP 106
           + R   LDA RG  ++ M+L       V     ++A+    DH  +P   G T  D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGV 162
           FFLF +G A   +++K  +   +  K+++    R ++L F+ I +Q  Y   P  LS   
Sbjct: 62  FFLFAMGTAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQHFY---PYMLSSPQ 118

Query: 163 DMK 165
           DM+
Sbjct: 119 DMR 121


>gi|295087641|emb|CBK69164.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
          Length = 470

 Score = 40.0 bits (92), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 59/123 (47%), Gaps = 20/123 (16%)

Query: 60  SKRVATLDAFRGLTVVLMIL-------VDDAGGAYARI---DH--SPW-NGCTLADFVMP 106
           + R   LDA RG  ++ M+L       V     ++A+    DH  +P   G T  D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSHAPDALSYGV 162
           FFLF +G A   +++K  +   +  K+++    R ++L F+ I +Q  Y   P  LS   
Sbjct: 62  FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQHFY---PYMLSSPQ 118

Query: 163 DMK 165
           DM+
Sbjct: 119 DMR 121


>gi|293369243|ref|ZP_06615833.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|292635668|gb|EFF54170.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
          Length = 466

 Score = 40.0 bits (92), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 49/112 (43%), Gaps = 17/112 (15%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDA------GGAY-------ARIDHSPWNGCTLADFVMP 106
           +KR   LDA RG  ++ M+L          G  Y        R   +  +G T  D V P
Sbjct: 2   NKRAYALDALRGYAIITMVLSATVAWNSLPGWMYHAQTPPPDRAFDASLSGITWVDLVFP 61

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIFRTLK----LLFWGIILQGGYSHA 154
           FFLF +G A   ++KK  +      ++++  +K    L F+ I +Q  Y + 
Sbjct: 62  FFLFAMGAAFPFSIKKRFEKGDTKLRLVYEAIKRGVQLTFFAIFIQHFYPYV 113


>gi|308050627|ref|YP_003914193.1| hypothetical protein Fbal_2917 [Ferrimonas balearica DSM 9799]
 gi|307632817|gb|ADN77119.1| conserved hypothetical protein [Ferrimonas balearica DSM 9799]
          Length = 366

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 33/139 (23%), Positives = 60/139 (43%), Gaps = 13/139 (9%)

Query: 61  KRVATLDAFRGLTVVLMILVDDAGGAYA-------------RIDHSPWNGCTLADFVMPF 107
           +R+  LDA RG  +  +I  +    A               ++ HSPW+G T  D + P 
Sbjct: 6   QRLQALDALRGFDMFWIIGGEKLFAALLLLTGWPLWQVAADQMLHSPWHGFTFYDLIFPL 65

Query: 108 FLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHI 167
           F+F+ GV I L  + +  I  + ++  +R        + L G   +        + +  I
Sbjct: 66  FIFLSGVTIGLQRQSLIGIAWSDRQPHYRKALKRLLLLALLGVLYNHGWGTGMPMALDEI 125

Query: 168 RWCGILQRIALVYVVVALI 186
           R+  +L RI + + + A+I
Sbjct: 126 RYASVLGRIGMAWFLAAMI 144


>gi|210622217|ref|ZP_03293007.1| hypothetical protein CLOHIR_00953 [Clostridium hiranonis DSM 13275]
 gi|210154351|gb|EEA85357.1| hypothetical protein CLOHIR_00953 [Clostridium hiranonis DSM 13275]
          Length = 483

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 81/388 (20%), Positives = 136/388 (35%), Gaps = 94/388 (24%)

Query: 18  EQEQDDGKDSENGINKEKGLE--RSEVQDEQ----KGELQLQQLLQQKS-----KRVATL 66
           E  +D     +    +EK +E  R E   EQ    +GE +  Q  +  +     +R  T+
Sbjct: 60  ESRKDTSPKQKKESKREKKVELSREESSTEQTVINRGEKEQPQAREVVTGDPLKRRYTTV 119

Query: 67  DAFRGLTVVLMIL---VDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKV 123
           +   G+ V+ +I    +   G   A +  S WNG +  D  +P  L  V   I   ++  
Sbjct: 120 ELIMGVAVIAIICSSGIGVLGEMPAFLAFSKWNGISFGDLGLPLLLASVCFMIPTEVELD 179

Query: 124 PKINGAVKKIIFRTLK----LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALV 179
            K   + K+I  + +K    L   GI++          L    +    R  GILQ IA+V
Sbjct: 180 VKRKKSFKEICIKKVKVGIILFVIGILIN---------LIGAWNFNSFRIMGILQMIAVV 230

Query: 180 YVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSE 239
           Y++ +L+  L   RR N          TA     + G+             Y  + +F +
Sbjct: 231 YMLGSLLYVLF--RRFNFKSSVIAVFLTAIGVVGLAGY-------------YFASANFGD 275

Query: 240 HSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGP 299
            +   +  ++    M GH+G                                        
Sbjct: 276 TAKTCLAYFVDSKVMPGHIG---------------------------------------- 295

Query: 300 LREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLI 359
                       FE  G++STISA+  G + +  G  L + +  + R K    +  G+  
Sbjct: 296 -----------DFERYGIMSTISALCGGCLAMAAGSFLCNRRVEN-RDKSNKILIMGMFF 343

Query: 360 IAIILHFTNAIPINKQLYSFSYVCFTAG 387
           + I L      P N  + S S+V    G
Sbjct: 344 VIIALLMERNCPYNASVMSPSFVMIVLG 371


>gi|423299515|ref|ZP_17277540.1| hypothetical protein HMPREF1057_00681 [Bacteroides finegoldii
           CL09T03C10]
 gi|408473324|gb|EKJ91846.1| hypothetical protein HMPREF1057_00681 [Bacteroides finegoldii
           CL09T03C10]
          Length = 467

 Score = 40.0 bits (92), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 47/109 (43%), Gaps = 18/109 (16%)

Query: 61  KRVATLDAFRGLTVVLMILVD--------------DAGGAYARIDHSPWNGCTLADFVMP 106
           KR  +LDAFRG  +V M+L                  G     I      G T  D V P
Sbjct: 2   KRAISLDAFRGYAIVTMVLSGTIASGVLPGWMYHAQMGPRSNYIFDPQLYGITWVDLVFP 61

Query: 107 FFLFIVGVAIALA----LKKVPKINGAVKKIIFRTLKLLFWGIILQGGY 151
           FFLF +G AI  +    ++K   +   + + + R ++L F+ I +Q  Y
Sbjct: 62  FFLFAMGAAIPFSVGGKIEKGENLWKIIGECVLRGIRLAFFAIFIQHLY 110


>gi|298384751|ref|ZP_06994311.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|383122973|ref|ZP_09943661.1| hypothetical protein BSIG_0281 [Bacteroides sp. 1_1_6]
 gi|251841928|gb|EES70008.1| hypothetical protein BSIG_0281 [Bacteroides sp. 1_1_6]
 gi|298263030|gb|EFI05894.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 472

 Score = 39.7 bits (91), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 31/115 (26%), Positives = 52/115 (45%), Gaps = 20/115 (17%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGA-------YARI--------DHSPWNGCTL 100
           +   S R ++LDA RG  ++ M+L              +A++        D S + G T 
Sbjct: 1   MNNNSTRASSLDALRGYAILTMVLSGSVAWGVLPGWMYHAQVGPRSNFVFDGSIY-GITW 59

Query: 101 ADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGY 151
            D V PFFLF +G A   ++    +   + ++II+    R  +L F+ I +Q  Y
Sbjct: 60  VDLVFPFFLFAMGAAFPFSIGNKYRKGSSRRRIIYDSLLRGFRLTFFAIFIQHIY 114


>gi|149276664|ref|ZP_01882807.1| hypothetical protein PBAL39_14829 [Pedobacter sp. BAL39]
 gi|149232333|gb|EDM37709.1| hypothetical protein PBAL39_14829 [Pedobacter sp. BAL39]
          Length = 359

 Score = 39.7 bits (91), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 49/109 (44%), Gaps = 18/109 (16%)

Query: 65  TLDAFRGLTVVLM------ILVD--------DAGGAYARIDHSPWNGCTLADFVMPFFLF 110
           +LD  RGL ++L+      + V          A G   +  H PW+G    D V P F+F
Sbjct: 2   SLDVMRGLIMILLCAESCLLYVSLQHLNPAWPASGLVEQFFHHPWHGLRFWDLVQPAFMF 61

Query: 111 IVGVAIALA----LKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAP 155
           + G A+ ++    L+K    +     I+ R+LKL   G+ L   Y+  P
Sbjct: 62  MAGAAMYISYSRKLEKGSSWSQNWNHILIRSLKLFLCGVGLHCVYAGKP 110


>gi|320107689|ref|YP_004183279.1| hypothetical protein AciPR4_2506 [Terriglobus saanensis SP1PR4]
 gi|319926210|gb|ADV83285.1| hypothetical protein AciPR4_2506 [Terriglobus saanensis SP1PR4]
          Length = 419

 Score = 39.3 bits (90), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 51/110 (46%), Gaps = 18/110 (16%)

Query: 61  KRVATLDAFRGLTVVLM---------ILVDDAGGAYARI-----DHSPWNGCTLADFVMP 106
           +R   +DA+RGL ++LM         +          RI      H  W G  L D + P
Sbjct: 33  QRNVAVDAYRGLVMLLMMGEVMQFEVVARSFPSSTIWRILSFNQSHVQWVGMGLHDMIQP 92

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKI----IFRTLKLLFWGIILQGGYS 152
            F F+VGVA+  +L+   K   + +KI    I+R+  L+  GI L+  +S
Sbjct: 93  SFTFLVGVALPYSLRSRQKKGQSFQKIVGHTIWRSFLLVALGIFLRSIHS 142


>gi|344238550|gb|EGV94653.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cricetulus
           griseus]
          Length = 423

 Score = 39.3 bits (90), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 36/74 (48%), Gaps = 3/74 (4%)

Query: 24  GKDSENGINKEKGLERSEVQDEQKGELQLQ-QLLQQKSKRVATLDAFRGLTVVLMILVDD 82
            ++++  IN E G       D   G+ Q + +       R+  +D FRG+ ++LM+ V+ 
Sbjct: 163 SRETDRLINSELG--SPSRADPLSGDCQPETRHTSALPYRLRCVDTFRGIALILMVFVNY 220

Query: 83  AGGAYARIDHSPWN 96
            GG Y    HS WN
Sbjct: 221 GGGKYWYFKHSSWN 234


>gi|255036257|ref|YP_003086878.1| hypothetical protein Dfer_2495 [Dyadobacter fermentans DSM 18053]
 gi|254949013|gb|ACT93713.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 379

 Score = 39.3 bits (90), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 60/299 (20%), Positives = 108/299 (36%), Gaps = 44/299 (14%)

Query: 91  DHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGG 150
            H PW GC+L D + P F F+VGVA+  ++      + +V  +   T++           
Sbjct: 32  SHVPWVGCSLHDLIQPSFSFLVGVALPYSMASRASKDQSVATMWAHTIRR------SLIL 85

Query: 151 YSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQ 210
                   S   +  +  +   L +I L Y ++  +   + K + + L            
Sbjct: 86  ILLGIFLRSMHSEQTNFTFEDTLTQIGLGYPILFALGLASEKTQRDAL-----------V 134

Query: 211 WQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDR 270
           W  +G  +     +   Y L  P + +S+       ++ +K G   H     NA    DR
Sbjct: 135 WGALGIILIGYAGVFAAYPLPGPGFDWSQTGTTADWEHNLK-GFAAHWNKNTNAAWAFDR 193

Query: 271 ELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIG 330
             W +N                  P   P   +   +    F P     T+  ++ G I 
Sbjct: 194 --WFLNLF----------------PREKPFEFNGGGYSTLSFIP-----TLGTMILGLIA 230

Query: 331 IHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAA 389
              G  L      +  LK +      L ++A+  HFT   PI K++++ ++  F+ G A
Sbjct: 231 ---GKWLKTAVSSTWLLKRYAITAGVLFLLALAFHFTGLNPIVKRIWTPAWTLFSGGWA 286


>gi|225013130|ref|ZP_03703543.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
 gi|225002750|gb|EEG40733.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
          Length = 365

 Score = 39.3 bits (90), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 37/73 (50%), Gaps = 18/73 (24%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAG---GAYARID-------------HSPWNGCTLADF 103
           S+R+ +LD FRG  VV+ +LV +     G + + +             H  W G    D 
Sbjct: 8   SQRLRSLDFFRG--VVMFLLVAEFSHLFGVFMKTENETITAAADFLFHHVQWEGLHFWDL 65

Query: 104 VMPFFLFIVGVAI 116
           + PFF+FIVGV+I
Sbjct: 66  IQPFFMFIVGVSI 78


>gi|320104555|ref|YP_004180146.1| hypothetical protein Isop_3032 [Isosphaera pallida ATCC 43644]
 gi|319751837|gb|ADV63597.1| hypothetical protein Isop_3032 [Isosphaera pallida ATCC 43644]
          Length = 399

 Score = 39.3 bits (90), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 42/97 (43%), Gaps = 5/97 (5%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVGVA 115
               R   LD FRG TV  MI+V+  GG  A   I       C+ AD +MP F   VG A
Sbjct: 12  SSPSRWDALDQFRGYTVAGMIVVNFVGGLAAVPAILKHHNTYCSYADTIMPQFFLAVGFA 71

Query: 116 IALALKKVPKING---AVKKIIFRTLKLLFWGIILQG 149
                    +  G   AV+  + R L LL  G ++ G
Sbjct: 72  YRWTFLNRLERGGWQAAVRHALGRNLGLLLVGFLMYG 108


>gi|423281270|ref|ZP_17260181.1| hypothetical protein HMPREF1203_04398 [Bacteroides fragilis HMW
           610]
 gi|404583178|gb|EKA87860.1| hypothetical protein HMPREF1203_04398 [Bacteroides fragilis HMW
           610]
          Length = 385

 Score = 39.3 bits (90), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 38/73 (52%), Gaps = 11/73 (15%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNG--------CTLADFVMPFFL 109
              +RVA +D FR LT+ LM+ V+D  G    I H  W G           +D + P FL
Sbjct: 4   SSPQRVAAVDVFRALTMFLMLFVNDIPGL-RNIPH--WLGHAAMTEDMLGFSDTIFPAFL 60

Query: 110 FIVGVAIALALKK 122
           F +G++I+ A++ 
Sbjct: 61  FCMGMSISFAVQN 73


>gi|383112620|ref|ZP_09933412.1| hypothetical protein BSGG_0509 [Bacteroides sp. D2]
 gi|313692974|gb|EFS29809.1| hypothetical protein BSGG_0509 [Bacteroides sp. D2]
          Length = 472

 Score = 39.3 bits (90), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 53/115 (46%), Gaps = 20/115 (17%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDD------AGGAY-ARI--------DHSPWNGCTL 100
           +   + R ++LDA RG  ++ M+L          G  Y A++        D S + G T 
Sbjct: 1   MNNNNTRASSLDALRGYAILTMVLSGSIAWGVLPGWMYHAQVGPRSNFVFDGSIY-GITW 59

Query: 101 ADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGY 151
            D V PFFLF +G A   ++    +   + +KII+    R  +L F+ I +Q  Y
Sbjct: 60  VDLVFPFFLFAMGAAFPFSIGNKYRKGSSRRKIIYDSFLRGFRLTFFAIFIQHIY 114


>gi|160883836|ref|ZP_02064839.1| hypothetical protein BACOVA_01809 [Bacteroides ovatus ATCC 8483]
 gi|237717686|ref|ZP_04548167.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|156110921|gb|EDO12666.1| hypothetical protein BACOVA_01809 [Bacteroides ovatus ATCC 8483]
 gi|229453005|gb|EEO58796.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
          Length = 467

 Score = 38.9 bits (89), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 47/110 (42%), Gaps = 18/110 (16%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAG-------------GAYARIDHSP-WNGCTLADFVM 105
           +KR  +LDA RG  ++ M+L                  G  +     P + G T  D V 
Sbjct: 2   NKRAFSLDALRGYAIITMVLSGTIASGVLPGWMYHAQVGPRSNFAFDPSFYGITWVDLVF 61

Query: 106 PFFLFIVGVAIALAL-KKVPKINGAVK---KIIFRTLKLLFWGIILQGGY 151
           PFFLF +G A   ++  K+ K    +K     I R  +L F+ I +Q  Y
Sbjct: 62  PFFLFAMGAAFPFSIGNKLEKGESKLKIAWDCILRGFRLTFFAIFIQHMY 111


>gi|404404857|ref|ZP_10996441.1| hypothetical protein AJC13_05463 [Alistipes sp. JC136]
          Length = 392

 Score = 38.9 bits (89), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 57/121 (47%), Gaps = 18/121 (14%)

Query: 56  LQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPW--------NGCTLADFVMPF 107
           + ++  R+A++D FRGLT+  M+ V+     ++  D   W        +    +D + P 
Sbjct: 1   MTEQRNRIASIDVFRGLTMFFMLWVNS---FWSLSDVPHWLQHAARGEDMLGFSDTIFPA 57

Query: 108 FLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLFWGII---LQGGYSHAPDALSY 160
           FLFI+G ++ LA+        +  KI++    RT  L+  G++       +S A   LS 
Sbjct: 58  FLFIMGASVPLAVGSRRAKGDSTVKIVWHVFTRTFALVVMGLLTVNFGDAFSAAGTGLSR 117

Query: 161 G 161
           G
Sbjct: 118 G 118


>gi|333380436|ref|ZP_08472127.1| hypothetical protein HMPREF9455_00293 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826431|gb|EGJ99260.1| hypothetical protein HMPREF9455_00293 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 469

 Score = 38.9 bits (89), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 57/124 (45%), Gaps = 21/124 (16%)

Query: 62  RVATLDAFRGLTVVLMILVDDA------GGAY-------ARIDHSPWNGCTLADFVMPFF 108
           R   LDA RG  +V M+L          G  Y         + +    G T  D V PFF
Sbjct: 4   RSCALDALRGYAIVTMVLSGAVVYGVLPGWMYHAQVPPPTHVFNPAAPGITWVDLVFPFF 63

Query: 109 LFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGY----SHAPDALSY 160
           LF +G A   +++K  +   +  K+I+    R+++L F+ I ++  Y    S+  DA ++
Sbjct: 64  LFAMGSAFPFSIRKRLERGESKLKLIYDALKRSIQLTFFAIFIRHFYPYVLSNPEDARAW 123

Query: 161 GVDM 164
           G+ +
Sbjct: 124 GLSL 127


>gi|436833713|ref|YP_007318929.1| Protein of unknown function DUF2261,transmembrane [Fibrella
           aestuarina BUZ 2]
 gi|384065126|emb|CCG98336.1| Protein of unknown function DUF2261,transmembrane [Fibrella
           aestuarina BUZ 2]
          Length = 415

 Score = 38.5 bits (88), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 14/78 (17%)

Query: 59  KSKRVATLDAFRGLTVVLM---ILVDD-------AGGAYARI----DHSPWNGCTLADFV 104
            + R+ ++DA+RG  ++LM   +L  D         G +A +     H  W+GC+L D +
Sbjct: 27  PAGRLLSVDAYRGFVMLLMMGEVLHFDHLHEAFPGSGFWALLAYHQSHVDWSGCSLHDLI 86

Query: 105 MPFFLFIVGVAIALALKK 122
            P F F+VGVA+  ++  
Sbjct: 87  QPSFSFLVGVALPYSIAS 104


>gi|358344082|ref|XP_003636122.1| Magnesium transporter NIPA2 [Medicago truncatula]
 gi|355502057|gb|AES83260.1| Magnesium transporter NIPA2 [Medicago truncatula]
          Length = 328

 Score = 38.5 bits (88), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 23/36 (63%)

Query: 144 GIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALV 179
           G +  GGY H    L++GVD+K IR  GILQR  +V
Sbjct: 105 GGVFTGGYVHRVSDLTFGVDLKQIRLMGILQRENIV 140


>gi|226947615|ref|YP_002802706.1| acyltransferase family protein [Clostridium botulinum A2 str.
           Kyoto]
 gi|226843694|gb|ACO86360.1| acyltransferase family protein [Clostridium botulinum A2 str.
           Kyoto]
          Length = 343

 Score = 38.5 bits (88), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 22/80 (27%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 52  LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSP--WNGCT----LADFVM 105
           ++++++ K KR+ ++D  R L+ + +IL+   G      ++    +N       LA F +
Sbjct: 1   MKEVIKIKDKRIVSMDLLRALSTIAVILIHVTGTILYNSNNKSLTYNSSLVLNQLARFSV 60

Query: 106 PFFLFIVGVAIALALKKVPK 125
           P F+F+ G  +AL+ KK  K
Sbjct: 61  PAFIFLSGFGLALSYKKESK 80


>gi|315126665|ref|YP_004068668.1| hypothetical protein PSM_A1588 [Pseudoalteromonas sp. SM9913]
 gi|315015179|gb|ADT68517.1| conserved hypothetical protein [Pseudoalteromonas sp. SM9913]
          Length = 307

 Score = 38.5 bits (88), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 59/287 (20%), Positives = 110/287 (38%), Gaps = 85/287 (29%)

Query: 109 LFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           +FI+G A+  + KK      A +      L+L+  G I+      A +   +   ++++R
Sbjct: 1   MFIIGSAMFFSFKKTNSTANASQ-----VLRLIKRGAIIFA-IGLALNMYPFTTSIENLR 54

Query: 169 WCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTY 228
             G+LQRI + Y++ ++   L  +R                      G I+  + I+  Y
Sbjct: 55  ILGVLQRIGIAYILASICVLLLNRR----------------------GVISLSVVILIAY 92

Query: 229 SLYV----PNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVW 284
            L +    PN +++  +                     N V  VD  + G +HL+     
Sbjct: 93  WLLLLSVGPNNAYTLEN---------------------NLVRTVDIAVLGESHLWQGK-- 129

Query: 285 SRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS 344
                                      F+PEGLLST+ +++S   G     +L       
Sbjct: 130 ------------------------GLAFDPEGLLSTLPSVVSVLFGFEVTRLLTSSSSQW 165

Query: 345 ARLKHWVSMGFGLLIIAIILHFTNAI--PINKQLYSFSYVCFTAGAA 389
             +K  + +G    ++ I L    ++  PINK L++ ++V +T+G A
Sbjct: 166 TSIKRLLVIG----VVGIALGQLGSLIMPINKSLWTSTFVIYTSGIA 208


>gi|302796998|ref|XP_002980260.1| hypothetical protein SELMODRAFT_419933 [Selaginella moellendorffii]
 gi|300151876|gb|EFJ18520.1| hypothetical protein SELMODRAFT_419933 [Selaginella moellendorffii]
          Length = 312

 Score = 38.5 bits (88), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 13/59 (22%)

Query: 370 IPINKQLYSFSYVCFTAGAADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWM 428
           I +NK LYSFSY+CFTAGAA     +   ++I              +  RYP+ + +WM
Sbjct: 40  IKMNKPLYSFSYMCFTAGAAGAVFCL---LYI----------LVDVYDIRYPTLLLEWM 85


>gi|395801313|ref|ZP_10480573.1| hypothetical protein FF52_05535 [Flavobacterium sp. F52]
 gi|395436726|gb|EJG02660.1| hypothetical protein FF52_05535 [Flavobacterium sp. F52]
          Length = 468

 Score = 38.5 bits (88), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 36/122 (29%), Positives = 56/122 (45%), Gaps = 23/122 (18%)

Query: 62  RVATLDAFRGLTVVLMILVDD-------AGGAYARID-HSPWN----GCTLADFVMPFFL 109
           R  ++DA RG  ++ MIL          A  A+A++  +S +N    G T  D V PFFL
Sbjct: 4   RALSIDALRGFAIIAMILSGQMILTYLPAWMAHAQVPPNSSFNPNIYGITWVDLVFPFFL 63

Query: 110 FIVGVAIALALKKVPKINGAVKK------IIFRTLKLLFWGIILQGGYSHAPDALSYGVD 163
           F +G A   +L    K+   + K        +R ++L+F+ I         P  +S  VD
Sbjct: 64  FSMGAAFPFSLGS--KLENGINKWSLCLDSFYRGIQLVFFAIFFM---HMKPFVISSPVD 118

Query: 164 MK 165
           +K
Sbjct: 119 IK 120


>gi|299149192|ref|ZP_07042253.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
 gi|298512859|gb|EFI36747.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
          Length = 470

 Score = 38.5 bits (88), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 17/111 (15%)

Query: 60  SKRVATLDAFRGLTVVLMIL-------VDDAGGAYARI---DH--SPW-NGCTLADFVMP 106
           + R   LDA RG  ++ M+L       V     ++A+    DH  +P   G T  D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSH 153
           FFLF +G A   +++K  +   +  K+++    R ++L F+ I +Q  Y +
Sbjct: 62  FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQHFYPY 112


>gi|293369241|ref|ZP_06615831.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|336417197|ref|ZP_08597524.1| hypothetical protein HMPREF1017_04632 [Bacteroides ovatus
           3_8_47FAA]
 gi|423297813|ref|ZP_17275873.1| hypothetical protein HMPREF1070_04538 [Bacteroides ovatus
           CL03T12C18]
 gi|292635666|gb|EFF54168.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
 gi|335936517|gb|EGM98443.1| hypothetical protein HMPREF1017_04632 [Bacteroides ovatus
           3_8_47FAA]
 gi|392664450|gb|EIY57988.1| hypothetical protein HMPREF1070_04538 [Bacteroides ovatus
           CL03T12C18]
          Length = 470

 Score = 38.5 bits (88), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 17/111 (15%)

Query: 60  SKRVATLDAFRGLTVVLMIL-------VDDAGGAYARI---DH--SPW-NGCTLADFVMP 106
           + R   LDA RG  ++ M+L       V     ++A+    DH  +P   G T  D V P
Sbjct: 2   NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61

Query: 107 FFLFIVGVAIALALKKVPKINGAVKKIIF----RTLKLLFWGIILQGGYSH 153
           FFLF +G A   +++K  +   +  K+++    R ++L F+ I +Q  Y +
Sbjct: 62  FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQHFYPY 112


>gi|375148433|ref|YP_005010874.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361062479|gb|AEW01471.1| hypothetical protein Niako_5234 [Niastella koreensis GR20-10]
          Length = 483

 Score = 38.1 bits (87), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 56/131 (42%), Gaps = 18/131 (13%)

Query: 62  RVATLDAFRGLTVVLMILVDDAG----GAYARIDHSP---------WNGCTLADFVMPFF 108
           R  +LDA RG  ++ M+L          A+     +P           G T  D V PFF
Sbjct: 23  RNKSLDALRGFAILAMVLSSSIAFCILPAWMYHAQTPPPGHVYTPELPGITWVDLVFPFF 82

Query: 109 LFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIR 168
           LF +G AI LAL K  K       +++  ++  F G++    ++    ++S   +     
Sbjct: 83  LFSMGAAIPLALHKKVKEGSGTAMVLYTAIR-RFAGLVFFALFTMHARSMSKEPE----S 137

Query: 169 WCGILQRIALV 179
           W  +LQ ++ V
Sbjct: 138 WHYVLQLVSFV 148


>gi|196234160|ref|ZP_03132993.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
 gi|196221811|gb|EDY16348.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
          Length = 417

 Score = 38.1 bits (87), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 39/156 (25%), Positives = 66/156 (42%), Gaps = 36/156 (23%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYARI--------------DHSPWNGCTLADFVM 105
           S R+ ++DA+RGL + L++       + A+               +H  W G  L D + 
Sbjct: 30  SVRLGSIDAYRGLVMFLLLAEQFRTASVAKALPDSSFWRFLATQQEHVTWTGAVLHDMIQ 89

Query: 106 PFFLFIVGVAIALAL----KKVPKINGAVKKIIFRTLKLLFWGIILQG-GYSHA----PD 156
           P F F+VGVA+  ++     +             R L L+  GI L+  G+S       D
Sbjct: 90  PSFSFLVGVALPFSIGNRRARGQSPEATTGHAFLRALILVLLGIFLRSTGHSQTNFTFED 149

Query: 157 ALS---------YGVDMKHIRWCGILQRIALVYVVV 183
            L+         Y + ++ +R    +Q IALV ++V
Sbjct: 150 TLTQIGLGYGFLYLIALRSVR----VQWIALVVILV 181


>gi|262381364|ref|ZP_06074502.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262296541|gb|EEY84471.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 410

 Score = 38.1 bits (87), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 57/109 (52%), Gaps = 10/109 (9%)

Query: 60  SKRVATLDAFRGLTVVLMILVDDAGGAYA---RIDHSPW--NGCTLADFVMPFFLFIVGV 114
           ++R   +D  R +T+ +MI V+D    +     ++H+ +  +   LAD V P FLF VG+
Sbjct: 9   TQRNIAIDILRAVTMCVMIFVNDFWTVHDVPHYLEHAAYGEDFMGLADVVFPAFLFAVGM 68

Query: 115 AIALALKK--VPKING--AVKKIIFRTLKLLFWG-IILQGGYSHAPDAL 158
           +I  A+++     ++G   +  I+ RTL LL  G  I+      A DAL
Sbjct: 69  SIPFAIERRYAKGMSGESTILHILSRTLALLIMGAFIVNSEAGMADDAL 117


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.325    0.141    0.452 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,057,595,239
Number of Sequences: 23463169
Number of extensions: 301266256
Number of successful extensions: 1148120
Number of sequences better than 100.0: 912
Number of HSP's better than 100.0 without gapping: 649
Number of HSP's successfully gapped in prelim test: 263
Number of HSP's that attempted gapping in prelim test: 1144616
Number of HSP's gapped (non-prelim): 1686
length of query: 430
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 285
effective length of database: 8,957,035,862
effective search space: 2552755220670
effective search space used: 2552755220670
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 78 (34.7 bits)