BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 007204
         (613 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255573091|ref|XP_002527475.1| conserved hypothetical protein [Ricinus communis]
 gi|223533115|gb|EEF34873.1| conserved hypothetical protein [Ricinus communis]
          Length = 840

 Score =  941 bits (2431), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 450/603 (74%), Positives = 513/603 (85%), Gaps = 15/603 (2%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
           S   PLK+TFNGPAKH+TD+IPIGNGR+GAM+ GG+ SE ++LNEDTLWTGVPG+YTNP+
Sbjct: 20  SYNKPLKVTFNGPAKHWTDSIPIGNGRIGAMISGGMQSEIIQLNEDTLWTGVPGNYTNPN 79

Query: 68  APKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
           A +ALS+VR LVD G YAEATAASVK FG+PADVYQLLGD++LEFDDSHL YA+ETY RE
Sbjct: 80  ALEALSEVRKLVDDGLYAEATAASVKFFGNPADVYQLLGDVKLEFDDSHLTYADETYYRE 139

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           LDL+TATARV+YSVG+V+FT+E+F+SNPDQV V KISGS+SGSLSF VSLDS LD+H YV
Sbjct: 140 LDLDTATARVQYSVGDVKFTKEYFASNPDQVAVIKISGSKSGSLSFTVSLDSKLDHHCYV 199

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
           N  NQIIMEG CP KRIPPK +AN++PKGI+FSA+L++ +SD  G I  L++KKLKVEGS
Sbjct: 200 NVENQIIMEGSCPEKRIPPKMSANENPKGIKFSAVLDLHVSDGVGVIHVLDNKKLKVEGS 259

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           DW VLLL ASSSF+ P   PSDSKKDPTSES+ AL++I NLSYSDLY RHL DYQKLFHR
Sbjct: 260 DWGVLLLAASSSFESPLTKPSDSKKDPTSESLRALKAITNLSYSDLYARHLHDYQKLFHR 319

Query: 308 VSIQLSRSPKDIVTDTCSEENI---------------DTVPSAERVKSFQTDEDPSLVEL 352
           VS QL +S   IV D     N                D VP+ ER+KSFQ+DEDPSLVEL
Sbjct: 320 VSFQLWKSSNRIVGDESQLTNNLIPSANALYVKGIKDDAVPTVERIKSFQSDEDPSLVEL 379

Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           LFQFGRYLLIS SRPGTQVANLQG+WN+DL PTWDSAPH+NINLEMNYW SLPCNL+ECQ
Sbjct: 380 LFQFGRYLLISCSRPGTQVANLQGVWNKDLEPTWDSAPHLNINLEMNYWLSLPCNLNECQ 439

Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
           EPLFDF+  LS+NGSKTAQVNY ASGWVIHHK+DIWAKSSADRG  VWALWP+GGAWLCT
Sbjct: 440 EPLFDFIKSLSVNGSKTAQVNYGASGWVIHHKSDIWAKSSADRGDAVWALWPIGGAWLCT 499

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK 532
           HLWEHYNYTMD++FLE  AY LLEGC SFLLDWL+EG +GYLETNPSTSPEH FI PDGK
Sbjct: 500 HLWEHYNYTMDKEFLENEAYFLLEGCVSFLLDWLVEGSEGYLETNPSTSPEHMFITPDGK 559

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            ACVSYSSTMDMAIIREVFS+ +SA+EVL +N+D LV+ V  +LPRLRPTKIAEDGSIME
Sbjct: 560 PACVSYSSTMDMAIIREVFSSFVSASEVLGRNKDVLVQNVHTALPRLRPTKIAEDGSIME 619

Query: 593 WVQ 595
           WV+
Sbjct: 620 WVR 622


>gi|224103693|ref|XP_002313157.1| predicted protein [Populus trichocarpa]
 gi|222849565|gb|EEE87112.1| predicted protein [Populus trichocarpa]
          Length = 836

 Score =  918 bits (2373), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 442/610 (72%), Positives = 512/610 (83%), Gaps = 16/610 (2%)

Query: 1   MMNAEST--STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG 58
           M N  ST    + PLKIT  GPAK++TDAIPIGNGRLGAMVWGGV SE ++LNEDTLWTG
Sbjct: 17  MWNPTSTYLEDSKPLKITSTGPAKYWTDAIPIGNGRLGAMVWGGVSSELIQLNEDTLWTG 76

Query: 59  VPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLK 118
            P DYTNPDAP+AL++VR+LVDSG++AEA+ A+ KL G  A+VYQLLGDI+LEFD  +L 
Sbjct: 77  TPIDYTNPDAPEALAEVRNLVDSGEFAEASDAAAKLSGTNANVYQLLGDIKLEFD-GYLM 135

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
            AEETY RELDL+TATARVKYSVG+VEFTREHF+S PDQVIVTKI+GS+ GS+SF VSLD
Sbjct: 136 CAEETYYRELDLDTATARVKYSVGDVEFTREHFASYPDQVIVTKIAGSKEGSVSFTVSLD 195

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
           S LD+H Y+   +QI+MEGRCPGKRIPPK  ANDDPKGI F+A+L ++ISD  G +S L+
Sbjct: 196 SKLDHHCYITDESQIVMEGRCPGKRIPPKVKANDDPKGILFAAVLGLQISDGAGLMSVLD 255

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D +LKVEG++W VL +VASSSF+GPF  PS+S+KDP S S+SAL+SI+N SYS+LY+RHL
Sbjct: 256 DGRLKVEGANWVVLHMVASSSFEGPFTKPSESEKDPASVSLSALKSIKNQSYSELYSRHL 315

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDT-------------CSEENIDTVPSAERVKSFQTDE 345
           DDYQ LFHRVS+QL +     + D              C E N D VP+ +R++SFQ+DE
Sbjct: 316 DDYQNLFHRVSLQLCKGSDRNIGDRSLEIKNLMPSGKRCVEGNKDVVPTVDRIRSFQSDE 375

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
           DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN+DL P WDSAPH+NINLEMNYW SLP
Sbjct: 376 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNKDLEPKWDSAPHLNINLEMNYWPSLP 435

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
           CNLSECQEPLF+F+  LSING KTAQVNY  SGWV+HHK+DIWAK SAD+G+VVWA+WPM
Sbjct: 436 CNLSECQEPLFEFIKSLSINGCKTAQVNYKTSGWVVHHKSDIWAKPSADKGEVVWAIWPM 495

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHE 525
           GGAWLCTHLWEHY+YTMD DFL  +AYPLLEGCASFLLDWLIEGH GYLETNPSTSPEH 
Sbjct: 496 GGAWLCTHLWEHYSYTMDEDFLRNKAYPLLEGCASFLLDWLIEGHGGYLETNPSTSPEHM 555

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
           FIAPDGK A VSYSSTMDMA+I+EVFSAIISA+EVL +NEDA V+KV K+ PRL PTKI 
Sbjct: 556 FIAPDGKSASVSYSSTMDMALIKEVFSAIISASEVLGRNEDAFVQKVHKAQPRLYPTKID 615

Query: 586 EDGSIMEWVQ 595
           E+GSIMEW Q
Sbjct: 616 EEGSIMEWAQ 625


>gi|224103687|ref|XP_002313154.1| predicted protein [Populus trichocarpa]
 gi|222849562|gb|EEE87109.1| predicted protein [Populus trichocarpa]
          Length = 803

 Score =  912 bits (2357), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/594 (73%), Positives = 510/594 (85%), Gaps = 7/594 (1%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           M + ++   +  LKITFNGPAKH+TDAIPIGNGRLGAM+WGGV  ETL+LNEDTLWTG P
Sbjct: 1   MDDDDNGENSRSLKITFNGPAKHWTDAIPIGNGRLGAMIWGGVSLETLQLNEDTLWTGTP 60

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
           G+YTNP AP+ALS VR LVD+GQYA+AT A+ KL   P+DVYQLLGDI+LEFD+SHLKY 
Sbjct: 61  GNYTNPHAPEALSVVRKLVDNGQYADATTAAEKLSHDPSDVYQLLGDIKLEFDNSHLKYV 120

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
           E++Y RELDL+TATARVKYSVG+VE+TRE+F+SNP+QVI TKISGS+SGS+SF V LDS 
Sbjct: 121 EKSYHRELDLDTATARVKYSVGDVEYTREYFASNPNQVIATKISGSKSGSVSFTVYLDSK 180

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
           + ++SYV G NQIIMEG CPGKRIPPK NA+D+PKGIQF+AIL ++IS+ RG +  L+ +
Sbjct: 181 MHHYSYVKGENQIIMEGSCPGKRIPPKLNADDNPKGIQFTAILNLQISNSRGVVHVLDGR 240

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
           KLKVEGSDWA+LLLV+SSSFDGPF  P DSKKDPTS+S+SAL+SI NLSY+DLY  HLDD
Sbjct: 241 KLKVEGSDWAILLLVSSSSFDGPFTKPIDSKKDPTSDSLSALKSINNLSYTDLYAHHLDD 300

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           YQ LFHRVS+QLS+S K       SE+N  TV +AERVKSF+TDEDPSLVELLFQ+GRYL
Sbjct: 301 YQSLFHRVSLQLSKSSK-----RRSEDN--TVSTAERVKSFKTDEDPSLVELLFQYGRYL 353

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LIS SRPGTQVANLQGIWN+D+ P WD A H+NINL+MNYW +LPCNL ECQ+PLF++++
Sbjct: 354 LISCSRPGTQVANLQGIWNKDIEPPWDGAQHLNINLQMNYWPALPCNLKECQDPLFEYIS 413

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            LSINGSKTA+VNY A GWV H  +DIWAK+S DRG+ VWALWPMGGAWLCTHLWEHY Y
Sbjct: 414 SLSINGSKTAKVNYDAKGWVAHQVSDIWAKTSPDRGQAVWALWPMGGAWLCTHLWEHYTY 473

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
           TMD+DFL+ +AYPLLEGC+ FLLDWLIEG  GYLETNPSTSPEH FI PDGK A VSYSS
Sbjct: 474 TMDKDFLKNKAYPLLEGCSLFLLDWLIEGRGGYLETNPSTSPEHMFIDPDGKPASVSYSS 533

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           TMDM+II+EVFSAIISAAE+L KNED +V+KV ++ PRL PT+IA DGSIMEW 
Sbjct: 534 TMDMSIIKEVFSAIISAAEILGKNEDEIVQKVREAQPRLLPTRIARDGSIMEWA 587


>gi|224056204|ref|XP_002298754.1| predicted protein [Populus trichocarpa]
 gi|222846012|gb|EEE83559.1| predicted protein [Populus trichocarpa]
          Length = 808

 Score =  906 bits (2341), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/594 (71%), Positives = 505/594 (85%), Gaps = 1/594 (0%)

Query: 2   MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           M   +  ++ PL++TF+GPAKH+TDAIPIGNGRLGAM+WGGV  ETL+LNEDTLWTG+PG
Sbjct: 1   MEDNNGESSKPLRVTFSGPAKHWTDAIPIGNGRLGAMIWGGVALETLQLNEDTLWTGIPG 60

Query: 62  DYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE 121
           DYTNP+AP AL +VR LVD+GQYAEAT A+ KL G+ +DVYQLLGDI+LEFDDSHLKY E
Sbjct: 61  DYTNPNAPAALLEVRKLVDNGQYAEATTAAEKLSGNQSDVYQLLGDIKLEFDDSHLKYDE 120

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
           +TY+RELDL+TATARVKYSV ++E+TREHF+SNP+QVIVTKISGS+ GS+SF VSLDS +
Sbjct: 121 KTYKRELDLDTATARVKYSVADIEYTREHFASNPNQVIVTKISGSKPGSVSFTVSLDSKM 180

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
            +HSYV G NQII+EG CPG R   K N ND P+GIQF+AIL++++S+ RG +   ED K
Sbjct: 181 SHHSYVKGENQIIIEGSCPGNRYAQKLNENDSPQGIQFTAILDLQVSEARGLVRVSEDSK 240

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L+VEGSDWAVLLLV+SSSFDGPF  P DSKK+PTS+S+S L+SI NLSY DLY  HLDDY
Sbjct: 241 LRVEGSDWAVLLLVSSSSFDGPFTKPIDSKKNPTSDSLSVLKSIGNLSYVDLYAHHLDDY 300

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q LFHRVS+QLS+S K+        E+ DTV +AERVK+FQTDEDPSLVELLFQ+GRYLL
Sbjct: 301 QSLFHRVSLQLSKSSKNSDISLNGSED-DTVSTAERVKAFQTDEDPSLVELLFQYGRYLL 359

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           IS SRPGTQVANLQGIWN+DL+P WD A H+NINL+MNYW SL CNL ECQEPLF++++ 
Sbjct: 360 ISCSRPGTQVANLQGIWNKDLTPPWDGAQHLNINLQMNYWPSLSCNLKECQEPLFEYISS 419

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           LSI+GS+TA+VNY A GWV H  +D+WAK+S D G+ +WALWPMGGAWLCTHLWEHY Y 
Sbjct: 420 LSISGSRTAKVNYEAKGWVAHQVSDLWAKTSPDAGQALWALWPMGGAWLCTHLWEHYTYA 479

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
            D+DFL  +AYPLLEGC SFLLDWLIEG  GYLETNPSTSPEH FIAPDGK A VSYSST
Sbjct: 480 KDKDFLRDKAYPLLEGCTSFLLDWLIEGPGGYLETNPSTSPEHMFIAPDGKPASVSYSST 539

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           MDM+II+EVFSAI+SAA++L +NED LV+KVL++LPRL PTKIA DGSIMEW Q
Sbjct: 540 MDMSIIKEVFSAIVSAAKILGRNEDELVQKVLEALPRLLPTKIARDGSIMEWAQ 593


>gi|359475494|ref|XP_002270199.2| PREDICTED: alpha-L-fucosidase 2-like [Vitis vinifera]
          Length = 817

 Score =  884 bits (2284), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/584 (74%), Positives = 498/584 (85%), Gaps = 9/584 (1%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ F GPAKH+TDA+PIGNGRLGAMVWGGV SETL+LNE TLWTG PG+YTNPDAPKA
Sbjct: 34  PLKVRFFGPAKHWTDALPIGNGRLGAMVWGGVASETLQLNEGTLWTGTPGNYTNPDAPKA 93

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           LS+VR LVD+G Y  AT A+VKL G+P+DVYQLLGDI LEF+DSHL YAEETY RELDL+
Sbjct: 94  LSEVRKLVDNGDYVAATEAAVKLSGNPSDVYQLLGDINLEFEDSHLAYAEETYSRELDLD 153

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TAT  +KYSVG+VE+TREHF+S PDQVIVTKISGS+ GS+SF VSLDS   +HS  +G +
Sbjct: 154 TATVTIKYSVGDVEYTREHFASYPDQVIVTKISGSKPGSVSFTVSLDSKSHHHSNSSGKS 213

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           QIIMEG CPGKRIPPK   ND+P+GI FSA+L+++ISD RG I+ L+DKKLKVEGSDWAV
Sbjct: 214 QIIMEGSCPGKRIPPKVYENDNPQGILFSAVLDLQISDGRGVINVLDDKKLKVEGSDWAV 273

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           L LVASSSFDGPF  P DSK +PTSE++S L+SI N SYSDLY RHL+DYQ LFHRVS+Q
Sbjct: 274 LYLVASSSFDGPFTKPIDSKINPTSEALSTLKSIGNFSYSDLYARHLNDYQNLFHRVSLQ 333

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           LS+S K +         ++ V +A RVKSF TDEDPSLVELLFQ+GRYLLIS SRPG+Q 
Sbjct: 334 LSKSSKSV---------MNRVSTAARVKSFGTDEDPSLVELLFQYGRYLLISCSRPGSQP 384

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
           ANLQGIWN+D+ P WD APH+NINL+MNYW SLPCNLSECQEPLFD+++ LSINGSKTA+
Sbjct: 385 ANLQGIWNKDIEPAWDGAPHLNINLQMNYWPSLPCNLSECQEPLFDYMSSLSINGSKTAK 444

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
           VNY ASGWV H  +DIWAK+S DRG+ VWALWPMGGAWLCTHLWEHY +TMD+DFL+ +A
Sbjct: 445 VNYEASGWVTHQVSDIWAKTSPDRGQAVWALWPMGGAWLCTHLWEHYTFTMDKDFLKNKA 504

Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
           YPLLEGCA FLLDWLIEG  GYLETNPSTSPEH FIAPDGK A VSYS+TMD+AIIREVF
Sbjct: 505 YPLLEGCARFLLDWLIEGRGGYLETNPSTSPEHMFIAPDGKPASVSYSTTMDIAIIREVF 564

Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           SA++SAAEVL KNED LV+KV ++ P+L PTKIA DGSIMEW Q
Sbjct: 565 SAVVSAAEVLGKNEDELVQKVRQAQPKLPPTKIARDGSIMEWAQ 608


>gi|356574288|ref|XP_003555281.1| PREDICTED: alpha-L-fucosidase 2-like [Glycine max]
          Length = 876

 Score =  867 bits (2241), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 410/598 (68%), Positives = 491/598 (82%), Gaps = 14/598 (2%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+TF  PA H+TDAIPIGNGRLGAMVWG VPSE L+LNEDTLWTG+PGDYTN  A +A
Sbjct: 65  PLKVTFAEPATHWTDAIPIGNGRLGAMVWGAVPSEALQLNEDTLWTGIPGDYTNKSAQQA 124

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           L++VR LVD  +++EATAA+VKL G P+DVYQLLGDI+LEF DSHL Y++E+Y RELDL+
Sbjct: 125 LAEVRKLVDDRKFSEATAAAVKLSGDPSDVYQLLGDIKLEFHDSHLNYSKESYYRELDLD 184

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TATA++KYSVG+VEFTREHF+SNPDQVIVT++S S+ GSLSF V  DS + + S V+G N
Sbjct: 185 TATAKIKYSVGDVEFTREHFASNPDQVIVTRLSTSKPGSLSFTVYFDSKMHHDSRVSGQN 244

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           QII+EGRCPG RI P  N+ D+P+GIQFSA+L+++IS D+G I  L+DKKL+VEGSDWA+
Sbjct: 245 QIIIEGRCPGSRIRPIVNSIDNPQGIQFSAVLDMQISKDKGVIHVLDDKKLRVEGSDWAI 304

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           LLL ASSSFDGPF  P DSKKDP SES+S + S++ +SY DLY RHL DYQ LFHRVS+Q
Sbjct: 305 LLLTASSSFDGPFTKPEDSKKDPASESLSRMVSVKKISYGDLYARHLADYQNLFHRVSLQ 364

Query: 312 LSRSPKDI----VTD----TCSEENI------DTVPSAERVKSFQTDEDPSLVELLFQFG 357
           LS+S K +    V D      S+ NI      DT+P++ RVKSFQTDEDPS VELLFQ+G
Sbjct: 365 LSKSSKTVSGKSVLDRRKLVSSQTNISQMGGDDTIPTSARVKSFQTDEDPSFVELLFQYG 424

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLIS SRPGTQVANLQGIWN+D+ P WD APH+NINL+MNYW SL CNL ECQEPLFD
Sbjct: 425 RYLLISCSRPGTQVANLQGIWNKDVEPAWDGAPHLNINLQMNYWPSLACNLHECQEPLFD 484

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           F++ LS+ G KTA+VNY A+GWV+H  +DIW K+S DRG+ VWALWPMGGAWLCTHLWEH
Sbjct: 485 FISSLSVIGKKTAKVNYEANGWVVHQVSDIWGKTSPDRGEAVWALWPMGGAWLCTHLWEH 544

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y YTMD+ FL+ +AYPLLEGC SFLLDWLIEG  G LETNPSTSPEH F APDGK A VS
Sbjct: 545 YTYTMDKVFLKNKAYPLLEGCTSFLLDWLIEGRGGLLETNPSTSPEHMFTAPDGKTASVS 604

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           YSSTMD++II+EVFS IISAAEVL ++ D ++++V +   +L PTK+A DGSIMEW +
Sbjct: 605 YSSTMDISIIKEVFSMIISAAEVLGRHNDTIIKRVTEYQSKLPPTKVARDGSIMEWAE 662


>gi|356575686|ref|XP_003555969.1| PREDICTED: alpha-L-fucosidase 2-like [Glycine max]
          Length = 874

 Score =  863 bits (2229), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 408/609 (66%), Positives = 495/609 (81%), Gaps = 16/609 (2%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           + N ES     PLK+TF  PA H+TDAIPIGNGRLGAMVWG VPSE L+LNEDTLWTG+P
Sbjct: 54  LTNGESPP--RPLKVTFAEPATHWTDAIPIGNGRLGAMVWGAVPSEALQLNEDTLWTGIP 111

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
            DYTN  AP+AL++VR LVD  +++EATAA+VKL G P++VYQLLGDI+LEF DSHL Y+
Sbjct: 112 RDYTNSSAPQALAEVRKLVDDRKFSEATAAAVKLSGDPSEVYQLLGDIKLEFHDSHLNYS 171

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
           +E+Y RELDL+TATA +KYSVG+VEFTREHF+SNPDQVIVT++S S+ GSLSF V  DS 
Sbjct: 172 KESYYRELDLDTATANIKYSVGDVEFTREHFASNPDQVIVTRLSTSKPGSLSFTVYFDSK 231

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
           + + S V+G NQIIMEGRCPG RIPP+ N+ D+P+GIQFSA+L+++IS D+G I  L+DK
Sbjct: 232 MHHDSRVSGQNQIIMEGRCPGSRIPPRVNSIDNPQGIQFSAVLDMQISKDKGFIHVLDDK 291

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
           KL+VEGSDWA+LLL ASSSFDGPF  P DSKKDP SES+S + S++ +SY DLY RHL D
Sbjct: 292 KLRVEGSDWAILLLTASSSFDGPFTKPEDSKKDPASESLSRMVSVKKISYGDLYARHLAD 351

Query: 301 YQKLFHRVSIQLSRSPKDI----VTD----TCSEENI------DTVPSAERVKSFQTDED 346
           YQ LFHRVS+QLS+S K +    V D      S+ NI      DT+P++ RVKSFQTDED
Sbjct: 352 YQNLFHRVSLQLSKSSKTVSGKSVLDRRKLVSSQTNISQMGGDDTIPTSARVKSFQTDED 411

Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
           PS VELLFQ+GRYLLIS SRPGTQVANLQGIWN+D+ P W+ APH+NINL++NYW SL C
Sbjct: 412 PSFVELLFQYGRYLLISCSRPGTQVANLQGIWNKDVEPAWEGAPHLNINLQINYWPSLAC 471

Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
           NL ECQEPLFDF++ LS+ G KTA+V+Y A+GWV HH +DIW K+S  +G+ VWA+WPMG
Sbjct: 472 NLHECQEPLFDFISSLSVIGKKTAKVSYEANGWVAHHVSDIWGKTSPGQGQAVWAVWPMG 531

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEF 526
           GAWLCTHLWEHY YT+D+DFL+ +AYPLLEGC SFLLDWLIEG  G LETNPSTSPEH F
Sbjct: 532 GAWLCTHLWEHYTYTLDKDFLKNKAYPLLEGCTSFLLDWLIEGRGGLLETNPSTSPEHMF 591

Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
            APDGK A VSYSSTMD++II+EVFS IISAAEVL ++ D ++++  +   +L PTK+A 
Sbjct: 592 TAPDGKTASVSYSSTMDISIIKEVFSMIISAAEVLGRHNDTIIKRATEYQSKLPPTKVAR 651

Query: 587 DGSIMEWVQ 595
           DGSIMEW +
Sbjct: 652 DGSIMEWAE 660


>gi|356536151|ref|XP_003536603.1| PREDICTED: alpha-L-fucosidase 2-like [Glycine max]
          Length = 877

 Score =  862 bits (2228), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 405/598 (67%), Positives = 486/598 (81%), Gaps = 14/598 (2%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+TF  PA H+TDAIPIGNGRLGAMVWG VPSE L+LNEDTLWTG+PGDYTN  AP+A
Sbjct: 66  PLKVTFAEPATHWTDAIPIGNGRLGAMVWGAVPSEALQLNEDTLWTGIPGDYTNKSAPQA 125

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           L++VR LV+  ++AEATAA+VKL G P+DV+QLLGDI+LEF DSHL Y++E+Y RELDL+
Sbjct: 126 LAEVRKLVNDRKFAEATAAAVKLSGEPSDVFQLLGDIKLEFHDSHLNYSKESYYRELDLD 185

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TATA++KYSVG+VEFTREHF+SNPDQVIVT++S S+ GSLSF V  DS + + S V+G N
Sbjct: 186 TATAKIKYSVGDVEFTREHFASNPDQVIVTRLSASKPGSLSFTVYFDSKMHHDSRVSGQN 245

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           QI +EGRCPG RI P+ N+ D+P+GIQFSA+L+++IS D+G I  L+DKKL+VEGSD A+
Sbjct: 246 QIKIEGRCPGSRIRPRVNSIDNPQGIQFSAVLDMQISKDKGVIHVLDDKKLRVEGSDSAI 305

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           LLL ASSSFDGPF  P DSKKDP SES+S + S++  SY DLY RHL DYQ LFHRVS+Q
Sbjct: 306 LLLTASSSFDGPFTKPEDSKKDPASESLSRMVSVKKFSYDDLYARHLADYQNLFHRVSLQ 365

Query: 312 LSRSPK--------------DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
           LS+S K                 T+   +   DT+P++ RVKSFQTDEDPS VELLFQ+G
Sbjct: 366 LSKSSKTGSGKSVLEGRKLVSSQTNISQKRGDDTIPTSARVKSFQTDEDPSFVELLFQYG 425

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLIS SRPGTQVANLQGIWN+D+ P WD APH+NINL+MNYW SL CNL ECQEPLFD
Sbjct: 426 RYLLISCSRPGTQVANLQGIWNKDVEPAWDGAPHLNINLQMNYWPSLACNLHECQEPLFD 485

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           F++ LS+ G KTA+VNY A+GWV H  +DIW K+S DRG+ VWALWPMGGAWLCTHLWEH
Sbjct: 486 FISSLSVIGKKTAKVNYEANGWVAHQVSDIWGKTSPDRGEAVWALWPMGGAWLCTHLWEH 545

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y YTMD+DFL+ +AYPLLEGC +FLLDWLIEG  G LETNPSTSPEH F APDGK A VS
Sbjct: 546 YIYTMDKDFLKNKAYPLLEGCTTFLLDWLIEGRGGLLETNPSTSPEHMFTAPDGKTASVS 605

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           YSSTMD++II+EVFS IISAAEVL ++ D ++++V K   +L PTK+A DGSIMEW +
Sbjct: 606 YSSTMDISIIKEVFSMIISAAEVLGRHNDTIIKRVTKYQSKLPPTKVARDGSIMEWAE 663


>gi|224056206|ref|XP_002298755.1| predicted protein [Populus trichocarpa]
 gi|222846013|gb|EEE83560.1| predicted protein [Populus trichocarpa]
          Length = 843

 Score =  861 bits (2225), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 410/596 (68%), Positives = 496/596 (83%), Gaps = 14/596 (2%)

Query: 10  TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           + PLK+TF+GPAK++TD IPIGNGRLGAMVWGGV SE ++LNEDTLWTG P D+T+P  P
Sbjct: 28  SRPLKVTFSGPAKYWTDGIPIGNGRLGAMVWGGVSSELIQLNEDTLWTGTPTDFTDPAIP 87

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
           +ALS+VR+LVDSG+++EAT A+ ++FG   +VY+LLGDI+LEF+ S   YAE TY RELD
Sbjct: 88  QALSEVRNLVDSGKFSEATKAAARMFGKYTNVYKLLGDIKLEFNGS--TYAEGTYYRELD 145

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L+TAT RVKY+V +VEFTREHF+SNPDQVIVTKISGS++ S+SF VSLDS+L++  Y+  
Sbjct: 146 LDTATGRVKYTVDDVEFTREHFASNPDQVIVTKISGSKAQSVSFAVSLDSILEHQCYLTD 205

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
            NQ++MEG CPGKR+  +  ANDDPKG++F+A+L+++IS+    +  L+D KLKV G+DW
Sbjct: 206 ENQLVMEGICPGKRMTTEVKANDDPKGMKFTAVLDLQISNGARLVRLLDDNKLKVVGADW 265

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           AVLLLVASSSF+GPF++PSDSKK+PTS+S+ A+ SI+ LSYS LY+RHLDD+Q LFHRVS
Sbjct: 266 AVLLLVASSSFEGPFVDPSDSKKNPTSDSLQAMNSIKKLSYSQLYSRHLDDFQNLFHRVS 325

Query: 310 IQLSRSP---------KDIVTDTCS--EENIDTV-PSAERVKSFQTDEDPSLVELLFQFG 357
           +QL +S          K+++       E N D V P+ ER+KSF++DEDPSLVELLFQFG
Sbjct: 326 LQLEKSSAIGDGVSEIKNLMPSVIEDFEGNKDVVVPTVERIKSFESDEDPSLVELLFQFG 385

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLIS SRPGTQVANLQGIWN+DL P WDSAP +NINLEMNYW SLPCNL ECQEPLFD
Sbjct: 386 RYLLISCSRPGTQVANLQGIWNKDLYPAWDSAPTLNINLEMNYWPSLPCNLRECQEPLFD 445

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           F+  LSINGSK AQVNY+ SGWV HH++DIW K+SAD G   WA+WPM GAW+CTHLWEH
Sbjct: 446 FIKSLSINGSKVAQVNYITSGWVAHHRSDIWEKASADMGNPKWAIWPMAGAWVCTHLWEH 505

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y YT+D+DFL   AYPLLEGCASFL+DWLIEG+DGYLETNPSTSPEH FIAPDG  A VS
Sbjct: 506 YTYTLDKDFLINTAYPLLEGCASFLMDWLIEGNDGYLETNPSTSPEHMFIAPDGNSASVS 565

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           YSSTMDMAII EVFSAI+SA+EVL ++EDALV+KVLK+ PRL P KIA DGSIMEW
Sbjct: 566 YSSTMDMAIINEVFSAIVSASEVLGRSEDALVQKVLKAQPRLYPPKIAPDGSIMEW 621


>gi|255573093|ref|XP_002527476.1| conserved hypothetical protein [Ricinus communis]
 gi|223533116|gb|EEF34874.1| conserved hypothetical protein [Ricinus communis]
          Length = 849

 Score =  856 bits (2212), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 412/599 (68%), Positives = 496/599 (82%), Gaps = 15/599 (2%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLKI F+GPAKH+TDAIPIGNGRLGAMV+GGV SETL++NEDTLWTG PG+YTNP+AP+A
Sbjct: 36  PLKIVFSGPAKHWTDAIPIGNGRLGAMVFGGVASETLRINEDTLWTGTPGNYTNPNAPEA 95

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           L+ VR LV   +YAEAT  +VKL G P+++YQ+LGDI+LEFDDSHL Y E+TY+RELDL+
Sbjct: 96  LTQVRKLVGDRKYAEATTEAVKLSGLPSEIYQVLGDIKLEFDDSHLSYDEKTYQRELDLD 155

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TATARVKYS+G+VE+TREHF+SNP+QV+VTKI+ S+ GS+SF V LDS L +HSY  G N
Sbjct: 156 TATARVKYSLGDVEYTREHFASNPNQVVVTKIAASKPGSVSFTVLLDSELHHHSYTKGEN 215

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           QI +EG CPGKR PP+  A+D PKGI+F+AIL+++IS+ RG I  L+D+KLKVEGSDWAV
Sbjct: 216 QIFIEGSCPGKRAPPQIYASDGPKGIEFAAILKLQISEGRGKIHVLDDRKLKVEGSDWAV 275

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           L LVASSSFDGPF  PS SKKDPTS  + AL  ++NLSY+DLY RHLDDYQ LFHRVS++
Sbjct: 276 LSLVASSSFDGPFTMPSASKKDPTSACLHALDLVKNLSYTDLYARHLDDYQTLFHRVSLR 335

Query: 312 LSRSPKDIVTD---------------TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           LS+S K I+ +               + +E   DT+ +AERVKSF+TDEDPSLVELLFQ+
Sbjct: 336 LSKSSKSILGNGPLNMKKFLSFKNYLSLNESKDDTISTAERVKSFRTDEDPSLVELLFQY 395

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLIS SRPGTQVANLQGIW++D +P WD A H+NINL+MNYW +L CNL EC EPLF
Sbjct: 396 GRYLLISCSRPGTQVANLQGIWSKDNAPPWDGAQHLNINLQMNYWPALSCNLHECHEPLF 455

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
           ++++ LSINGS TA+VNY A+GWV H  +D+WAK+S DRG+ VWALWPMGGAWLC HLWE
Sbjct: 456 EYMSSLSINGSMTAKVNYEANGWVAHQVSDLWAKTSPDRGEAVWALWPMGGAWLCIHLWE 515

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           HY YTMD+DFL+ +AYPLLEGCA+FLLDWLIEG  GYLETNPSTSPEH FIAPDGK A V
Sbjct: 516 HYTYTMDKDFLKNKAYPLLEGCATFLLDWLIEGPGGYLETNPSTSPEHMFIAPDGKPASV 575

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S S+TMD+ II+EVFS I+SAAEVL + ED L++KV ++ PRLRP KIA DGSIMEW Q
Sbjct: 576 SNSTTMDVEIIQEVFSEIVSAAEVLGRKEDELIQKVREAQPRLRPIKIARDGSIMEWAQ 634


>gi|449446103|ref|XP_004140811.1| PREDICTED: alpha-L-fucosidase 2-like [Cucumis sativus]
          Length = 803

 Score =  852 bits (2202), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 406/588 (69%), Positives = 486/588 (82%), Gaps = 2/588 (0%)

Query: 9   TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
           +++PLK+TFN PAKH+TDAIPIGNGRLGAMVWGGV +E L+LNEDTLWTG P DYTNPDA
Sbjct: 4   SSDPLKLTFNAPAKHWTDAIPIGNGRLGAMVWGGVDTEILQLNEDTLWTGTPADYTNPDA 63

Query: 69  PKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
           P+AL +VR LVD G+YAEAT A+VKL G P+DVYQLLGDI+LEF+ SH  Y  ETY REL
Sbjct: 64  PEALREVRKLVDDGKYAEATEAAVKLSGKPSDVYQLLGDIKLEFEVSHQSYTPETYHREL 123

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV- 187
           DLNTATARVKYSVG+VEFTREHF+SNPDQ IVTKI+ S+ GSL+F VS+DS L + S+V 
Sbjct: 124 DLNTATARVKYSVGDVEFTREHFASNPDQAIVTKIAASKPGSLTFIVSIDSKLHHSSHVV 183

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
           +G + I++ G C G RIPPK + +D+PKGIQ+SA+L +++SD    +  L++KKLKV GS
Sbjct: 184 DGQSLIVLHGSCRGVRIPPKMDFDDNPKGIQYSAVLSLQVSDGSVVVHDLDEKKLKVNGS 243

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           DWAVL LVASSSF GPF  PS S KDP+SES++ ++ I+ LSYS+LY RHL+DYQ LF R
Sbjct: 244 DWAVLRLVASSSFKGPFTQPSLSGKDPSSESLATMKKIKGLSYSNLYARHLNDYQSLFQR 303

Query: 308 VSIQLSRSPKDIVTDTC-SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           VS+ LS+S K+  +      + +    +AERVKSFQTDEDPSLVELLFQ+ RYLLIS SR
Sbjct: 304 VSLHLSKSSKNESSSPNSGGKEVRVASTAERVKSFQTDEDPSLVELLFQYSRYLLISCSR 363

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PGTQVANLQGIWN+++ P WD APH+NINL+MNYW SL CNL ECQEPLFDF ++LS+NG
Sbjct: 364 PGTQVANLQGIWNKNVEPAWDGAPHLNINLQMNYWPSLSCNLKECQEPLFDFTSFLSVNG 423

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            KTA+ NY ASGWV H  +DIWAKSS DRG+ VWALWPMGGAWLCTHLWEHY YTMD++F
Sbjct: 424 RKTAKANYEASGWVAHQVSDIWAKSSPDRGQAVWALWPMGGAWLCTHLWEHYTYTMDKNF 483

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           L+ +AYPL+EGCASFLLDWLI+G DGYLETNPSTSPEH FIAPDGK A VSYS+TMDMAI
Sbjct: 484 LKNKAYPLMEGCASFLLDWLIDGKDGYLETNPSTSPEHMFIAPDGKPASVSYSTTMDMAI 543

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            +EVFS+IISAAE+L K +D  ++KV K+  RL P KIA+DGS+MEW 
Sbjct: 544 TKEVFSSIISAAEILGKTKDTFIDKVRKAQARLLPYKIAKDGSLMEWA 591


>gi|158302693|dbj|BAF85832.1| alpha-1,2-fucosidase [Lilium longiflorum]
          Length = 854

 Score =  839 bits (2167), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 407/617 (65%), Positives = 482/617 (78%), Gaps = 34/617 (5%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
            PLK+ F  PAKH+TDA PIGNGRLGAMVWGGVP+ETL+LN+DTLWTGVPG+YTNPDAP 
Sbjct: 31  QPLKLRFLEPAKHWTDAAPIGNGRLGAMVWGGVPTETLQLNDDTLWTGVPGNYTNPDAPT 90

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            LS VR LVD G+YAEA+ A+  L GHP+DVYQ LG + LEF DSH+ Y+   Y+RELDL
Sbjct: 91  VLSKVRKLVDDGKYAEASLAAFDLSGHPSDVYQPLGTMNLEFGDSHVAYS--NYQRELDL 148

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
            TATA+V YS+G+VEFTREHFSSNP QV+VTKIS ++SGSLSF VSLDS L + S  +G 
Sbjct: 149 TTATAKVTYSLGDVEFTREHFSSNPHQVLVTKISANKSGSLSFIVSLDSKLHHQSSADGV 208

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           N+IIMEG CPG+RI PK N  ++ KGIQFSA+L++KI  +   +  LED KLKVEGSDWA
Sbjct: 209 NRIIMEGSCPGRRIAPKGNLFENNKGIQFSAVLDLKIGGNDSNVQVLEDMKLKVEGSDWA 268

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           VLLL ASSSF+GPFINPSDS+KDP S S+  L +I+ +S+S L+T H++DYQ LFH V++
Sbjct: 269 VLLLAASSSFEGPFINPSDSEKDPKSASLDTLNAIQKISFSQLFTHHVEDYQSLFHCVTL 328

Query: 311 QLSRSPKD---------------IVTDTCSEENIDTV----PS-------------AERV 338
           QLS+                   I+  TCS  N++ V    PS             AERV
Sbjct: 329 QLSKGSNSGGRTTVPLSQSYDSSILGTTCSLNNMEKVNTSNPSYSDQLTEEVLISTAERV 388

Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
           KSF+ DEDPSLVELLF +GRYLLIS SRPGTQ+ANLQGIW++D+ P WD+APH+NINL+M
Sbjct: 389 KSFKVDEDPSLVELLFHYGRYLLISCSRPGTQIANLQGIWSKDIEPAWDAAPHLNINLQM 448

Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKV 458
           NYW SL CNLSECQEPLFD++  L+ING+KTA+VNY ASGWV H  +DIWAK+S DRG  
Sbjct: 449 NYWPSLSCNLSECQEPLFDYIASLAINGAKTAKVNYEASGWVAHQVSDIWAKTSPDRGDP 508

Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
           VWALWPMGGAWLCTHLWEHY ++MD+ FLE  AYPLLEGCASFLLDWLIEG  GYLETNP
Sbjct: 509 VWALWPMGGAWLCTHLWEHYTFSMDKVFLENTAYPLLEGCASFLLDWLIEGRGGYLETNP 568

Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
           STSPEH FIAPD K A VSYSSTMDMAIIREVFS  IS+AE+L + E  LV+++ K++PR
Sbjct: 569 STSPEHSFIAPDSKTASVSYSSTMDMAIIREVFSEFISSAEILGRVESKLVKQIKKAIPR 628

Query: 579 LRPTKIAEDGSIMEWVQ 595
           L PTKIA DG+IMEW Q
Sbjct: 629 LPPTKIARDGTIMEWAQ 645


>gi|357479527|ref|XP_003610049.1| Macrophage migration inhibitory factor-like protein [Medicago
           truncatula]
 gi|355511104|gb|AES92246.1| Macrophage migration inhibitory factor-like protein [Medicago
           truncatula]
          Length = 855

 Score =  827 bits (2137), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 392/609 (64%), Positives = 486/609 (79%), Gaps = 29/609 (4%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           + NA+    + PLK+TF+  AK++TDAIPIGNGRLGAM+WGG+ SE L+LNEDTLWTG+P
Sbjct: 22  LANADDDEPSMPLKVTFSRSAKYWTDAIPIGNGRLGAMIWGGIQSEVLQLNEDTLWTGIP 81

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
           G+YT+ +AP+AL++VR LVD  +Y+EAT A++KL G P +VYQLLGDIEL+FDDSHLKY+
Sbjct: 82  GNYTDKNAPEALAEVRKLVDDRKYSEATTAALKLLGPPGEVYQLLGDIELQFDDSHLKYS 141

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
           EE+Y RELDL+ AT               HF+SNPDQV+VTK S S SGSLSF VSLDS 
Sbjct: 142 EESYHRELDLDNAT---------------HFASNPDQVLVTKFSTSNSGSLSFTVSLDSK 186

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
           L +++ ++  NQIIMEG CPGKRIPP+ N++D+PKGIQFSA+L+++IS+++G I  L+DK
Sbjct: 187 LHHNTRLSSKNQIIMEGSCPGKRIPPQVNSSDEPKGIQFSAVLDVQISNEKGVIHVLDDK 246

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
           KL+VEGSDWA+LLL ASSSFDGPF NP +SKKD TSES+S ++ + +L Y D+Y RHLDD
Sbjct: 247 KLRVEGSDWAILLLTASSSFDGPFTNPENSKKDLTSESLSKMKFVTSLKYDDIYARHLDD 306

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEE--------NI------DTVPSAERVKSFQTDED 346
           YQ LFHRVS+QLS+S K ++     +E        NI      D VP++ R+KSFQ DED
Sbjct: 307 YQNLFHRVSLQLSKSSKTVLGKPILDEGKMVSCQTNISQLRGGDIVPTSSRIKSFQNDED 366

Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
           PS VELLFQ+GRYLLI+ SRPGTQVANLQGIWN+D+ P WD APH+NINL+MNYW SL C
Sbjct: 367 PSFVELLFQYGRYLLIACSRPGTQVANLQGIWNKDVVPKWDGAPHLNINLQMNYWPSLSC 426

Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
           NL ECQEPLFD ++ LS+NGSKTA+VNY A+GWV HH +D+WAK+S  RG  VWALWPMG
Sbjct: 427 NLHECQEPLFDCISSLSVNGSKTAKVNYDANGWVAHHVSDLWAKTSTYRGPAVWALWPMG 486

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEF 526
           GAWLCTHLWEHY YT D++FL+ +AYPLLEGC SFLLDWLIEG  G LETNPSTSPEH F
Sbjct: 487 GAWLCTHLWEHYTYTTDKEFLKNKAYPLLEGCTSFLLDWLIEGPGGLLETNPSTSPEHMF 546

Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
           IA D K A VSYSSTMD++II+EVFS +ISAAE+L + +DA++++V +S  +L P KIA 
Sbjct: 547 IASDQKRASVSYSSTMDISIIKEVFSIVISAAEILGRQDDAIIKRVFESQSKLPPIKIAR 606

Query: 587 DGSIMEWVQ 595
           DGSIMEW +
Sbjct: 607 DGSIMEWAE 615


>gi|356495827|ref|XP_003516773.1| PREDICTED: alpha-L-fucosidase 2-like [Glycine max]
          Length = 802

 Score =  809 bits (2089), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 395/593 (66%), Positives = 468/593 (78%), Gaps = 11/593 (1%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           AE   + N LKI F    KH+TDA+PIGNGRLGAMV G V SET+ LNEDTLWTG P DY
Sbjct: 2   AEGRGSRN-LKIRFREGGKHWTDAVPIGNGRLGAMVCGHVHSETIHLNEDTLWTGTPADY 60

Query: 64  TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA-EE 122
           TN  AP ALS VR+LV    Y +ATAAS  L G+P++ Y LLGDI+L+FD SHL    ++
Sbjct: 61  TNSKAPPALSHVRNLVHRQHYPQATAASSALTGNPSEAYLLLGDIQLDFDYSHLTPGLQQ 120

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            Y RELDL+TAT +V+YSVG+V+FTREHF+S PDQ+IVT+IS S+   LSF VSL S + 
Sbjct: 121 PYERELDLDTATVKVRYSVGDVQFTREHFASYPDQLIVTQISSSKPAKLSFTVSLLSKII 180

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
           N +YVN  NQIIM+G CPGKRI        +P GIQFSAIL++KI    G I  L++ KL
Sbjct: 181 NQTYVNAPNQIIMKGSCPGKRI------QHNPHGIQFSAILDLKIGGTDGVIHILDNNKL 234

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
           KVE SDWAVLLLVASSSF GPF  PSDSKKDPTS+  + L SI N+SYS LY RHL+DYQ
Sbjct: 235 KVEASDWAVLLLVASSSFSGPFTAPSDSKKDPTSQCFTTLSSISNVSYSHLYARHLNDYQ 294

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
            LFHRVS+QL RS +  +++   +  +    +++RVKSFQTDEDPSLVELLFQ+GRYLLI
Sbjct: 295 GLFHRVSLQLMRSTRPNISE---DSTVTQASTSDRVKSFQTDEDPSLVELLFQYGRYLLI 351

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           SSSRPGTQVANLQGIWN+DL P WD APH+NINLEMNYW +LPCNLSECQEPLFD+++ L
Sbjct: 352 SSSRPGTQVANLQGIWNKDLEPVWDGAPHLNINLEMNYWPALPCNLSECQEPLFDYISLL 411

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           S+NGSKTA VNY A+GWV H K+DIWA++SA +G VVWALWPMGGAWLCTHLWEHY YTM
Sbjct: 412 SVNGSKTAHVNYQANGWVAHSKSDIWARTSAGQGDVVWALWPMGGAWLCTHLWEHYAYTM 471

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           D DFL+ +AYPL+EGC SFLL WLIE  +GYLETNPSTSPEH FIAP+G+ ACVS SSTM
Sbjct: 472 DEDFLKYKAYPLMEGCVSFLLSWLIEDSEGYLETNPSTSPEHYFIAPNGEPACVSQSSTM 531

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           D+AII EVFS  +SAAEV+ + +D +V +V K+ PRLRP  IA+DGSIMEWV+
Sbjct: 532 DVAIINEVFSTFLSAAEVIGRTKDNIVGEVRKAQPRLRPINIAQDGSIMEWVK 584


>gi|297802554|ref|XP_002869161.1| hypothetical protein ARALYDRAFT_912968 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314997|gb|EFH45420.1| hypothetical protein ARALYDRAFT_912968 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 844

 Score =  802 bits (2072), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 380/595 (63%), Positives = 469/595 (78%), Gaps = 17/595 (2%)

Query: 10  TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           + PLK+TF GP++++TDAIPIGNGRLGA +WGGV SETL +NEDT+WTGVP DYTNP+AP
Sbjct: 48  SRPLKLTFGGPSRNWTDAIPIGNGRLGATIWGGVSSETLNINEDTIWTGVPADYTNPNAP 107

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
           +AL++VR LVD   YAEAT+ +VKL G P+DVYQL+GD+ LEF  SH KY + +YRRELD
Sbjct: 108 EALAEVRRLVDEKNYAEATSEAVKLSGQPSDVYQLVGDLNLEFGSSHRKYTQTSYRRELD 167

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L TA A+V YSVG V+F+RE F+SNPDQVIV KI  S+ GSLSF VS DS L +HS  N 
Sbjct: 168 LETAVAKVSYSVGAVDFSREFFASNPDQVIVAKIYASKPGSLSFKVSFDSELHHHSETNP 227

Query: 190 N-NQIIMEGRCPGKRIP----PKANAN----DDPKGIQFSAILEIKISDDRGTISALEDK 240
             NQI+M G C  KR+P       NA     DD KG+QF++ILE+++S+  G++S+L  K
Sbjct: 228 KANQILMRGSCRPKRLPVNLKKSINATNIPYDDHKGLQFASILEVRVSNG-GSVSSLGGK 286

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
           KL VE +DWAVLLL ASS+FDGPF  P+DSK+DP  E    + S++  SYSDLY RHL D
Sbjct: 287 KLSVEKADWAVLLLAASSNFDGPFTMPADSKRDPAKECAKRISSVQKYSYSDLYARHLGD 346

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           YQKLF+RVS+QLS S  +      +        +AERV+SF+TDEDP+LVELLFQ+GRYL
Sbjct: 347 YQKLFNRVSLQLSGSSGNKTVQQAAS-------TAERVRSFKTDEDPALVELLFQYGRYL 399

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSRPGTQVANLQGIWN D+ P WD APH+NINL+MNYW SLP N+ ECQEPLFD+++
Sbjct: 400 LISSSRPGTQVANLQGIWNRDIQPPWDGAPHLNINLQMNYWHSLPGNIRECQEPLFDYMS 459

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            L+ING KTAQ+NY ASGWV H  +DIWAK+S DRG+ VWALWPMGGAWLCTH WEHY Y
Sbjct: 460 ALAINGRKTAQMNYGASGWVAHQVSDIWAKTSPDRGEAVWALWPMGGAWLCTHAWEHYTY 519

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
           TMD++FL+K+ YPLLEGC SFLLDWLI+G DG+L+TNPSTSPEH F AP+GK A VSYSS
Sbjct: 520 TMDKEFLKKKGYPLLEGCTSFLLDWLIKGKDGFLQTNPSTSPEHMFTAPNGKPASVSYSS 579

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           TMD+AII+EVF+ I++A+E+L K  D L+ KV+ +  +L PT+I++DGSIMEW +
Sbjct: 580 TMDIAIIKEVFADIVTASEILGKTNDTLIGKVIAAQAKLPPTRISKDGSIMEWAE 634


>gi|30689979|ref|NP_195152.2| alpha-L-fucosidase 2 [Arabidopsis thaliana]
 gi|75245768|sp|Q8L7W8.1|FUCO2_ARATH RecName: Full=Alpha-L-fucosidase 2; AltName:
           Full=Alpha-1,2-fucosidase 2; AltName:
           Full=Alpha-L-fucoside fucohydrolase 2; Flags: Precursor
 gi|21928117|gb|AAM78086.1| AT4g34260/F10M10_30 [Arabidopsis thaliana]
 gi|27363438|gb|AAO11638.1| At4g34260/F10M10_30 [Arabidopsis thaliana]
 gi|332660949|gb|AEE86349.1| alpha-L-fucosidase 2 [Arabidopsis thaliana]
          Length = 843

 Score =  791 bits (2042), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/595 (63%), Positives = 465/595 (78%), Gaps = 17/595 (2%)

Query: 10  TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           + PLK+TF GP++++TDAIPIGNGRLGA +WGGV SE L +NEDT+WTGVP DYTN  AP
Sbjct: 49  SRPLKLTFGGPSRNWTDAIPIGNGRLGATIWGGVSSEILNINEDTIWTGVPADYTNQKAP 108

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
           +AL++VR LVD   YAEAT+ +VKL G P+DVYQ++GD+ LEFD SH KY + +YRRELD
Sbjct: 109 EALAEVRRLVDERNYAEATSEAVKLSGQPSDVYQIVGDLNLEFDSSHRKYTQASYRRELD 168

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L TA A+V YSVG V+F+RE F+SNPDQVI+ KI  S+ GSLSF VS DS L +HS  N 
Sbjct: 169 LETAVAKVSYSVGAVDFSREFFASNPDQVIIAKIYASKPGSLSFKVSFDSELHHHSETNP 228

Query: 190 N-NQIIMEGRCPGKRIP----PKANAN----DDPKGIQFSAILEIKISDDRGTISALEDK 240
             NQI+M G C  KR+P       NA     DD KG+QF++ILE+++S+  G++S+L  K
Sbjct: 229 KANQILMRGSCRPKRLPVNLKKSINATNIPYDDHKGLQFASILEVRVSNG-GSVSSLGGK 287

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
           KL VE +DWAVLLL ASS+FDGPF  P DSK DP  E ++ + S++  SYSDLY RHL D
Sbjct: 288 KLSVEKADWAVLLLAASSNFDGPFTMPVDSKIDPAKECVNRISSVQKYSYSDLYARHLGD 347

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           YQKLF+RVS+ LS S       + +E       +AERV+SF+TD+DPSLVELLFQ+GRYL
Sbjct: 348 YQKLFNRVSLHLSGS-------STNETVQQATSTAERVRSFKTDQDPSLVELLFQYGRYL 400

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSRPGTQVANLQGIWN D+ P WD APH+NINL+MNYW SLP N+ ECQEPLFD+++
Sbjct: 401 LISSSRPGTQVANLQGIWNRDIQPPWDGAPHLNINLQMNYWHSLPGNIRECQEPLFDYMS 460

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            L+ING KTAQVNY ASGWV H  +DIWAK+S DRG+ VWALWPMGGAWLCTH WEHY Y
Sbjct: 461 ALAINGRKTAQVNYGASGWVAHQVSDIWAKTSPDRGEAVWALWPMGGAWLCTHAWEHYTY 520

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
           TMD++FL+K+ YPLLEGC SFLLDWLI+G DG+L+TNPSTSPEH F AP GK A VSYSS
Sbjct: 521 TMDKEFLKKKGYPLLEGCTSFLLDWLIKGKDGFLQTNPSTSPEHMFTAPIGKPASVSYSS 580

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           TMD+AII+EVF+ I+SA+E+L K  D L+ KV+ +  +L PT+I++DGSI EW +
Sbjct: 581 TMDIAIIKEVFADIVSASEILGKTNDTLIGKVIAAQAKLPPTRISKDGSIREWAE 635


>gi|449531868|ref|XP_004172907.1| PREDICTED: alpha-L-fucosidase 2-like, partial [Cucumis sativus]
          Length = 764

 Score =  773 bits (1997), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/546 (68%), Positives = 444/546 (81%), Gaps = 3/546 (0%)

Query: 52  EDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELE 111
           EDTLWTG P DYTNPDAP+AL +VR LVD G+YAEAT A+VKL G P+DVYQLLGDI+LE
Sbjct: 7   EDTLWTGTPADYTNPDAPEALREVRKLVDDGKYAEATEAAVKLSGKPSDVYQLLGDIKLE 66

Query: 112 FDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
           F+ SH  Y  ETY RELDLNTATARVKYSVG+VEFTREHF+SNPDQ IVTKI+ S+ GSL
Sbjct: 67  FEVSHQSYTPETYHRELDLNTATARVKYSVGDVEFTREHFASNPDQAIVTKIAASKPGSL 126

Query: 172 SFNVSLDSLLDNHSYV-NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD 230
           +F VS+DS L + S+V +G + I++ G C G RIPPK + +D+PKGIQ+SA+L +++SD 
Sbjct: 127 TFIVSIDSKLHHSSHVVDGQSLIVLHGSCRGVRIPPKMDFDDNPKGIQYSAVLSLQVSDG 186

Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY 290
              +  L++KKLKV GSDWAVL LVASSSF GPF  PS S KDP+SES++ ++ I+ LSY
Sbjct: 187 SVVVHDLDEKKLKVNGSDWAVLRLVASSSFKGPFTQPSLSGKDPSSESLATMKKIKGLSY 246

Query: 291 SDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC-SEENIDTVPSAERVKSFQTDEDPSL 349
           S+LY RHL+DYQ LF RVS+ LS+S K+  +      + +    +AERVKSFQTDEDPSL
Sbjct: 247 SNLYARHLNDYQSLFQRVSLHLSKSSKNESSSPNSGGKEVRVASTAERVKSFQTDEDPSL 306

Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
           VELLFQ+ RYLLIS SRPGTQVANLQGIWN+++ P WD APH+NINL+MNYW SL CNL 
Sbjct: 307 VELLFQYSRYLLISCSRPGTQVANLQGIWNKNVEPAWDGAPHLNINLQMNYWPSLSCNLK 366

Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
           ECQEPLFDF ++LS+NG KTA+ NY ASGWV H  +DIWAKSS DRG+ VWALWPMGGAW
Sbjct: 367 ECQEPLFDFTSFLSVNGRKTAKANYEASGWVAHQVSDIWAKSSPDRGQAVWALWPMGGAW 426

Query: 470 LCTHLWEHYNYTMDR-DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIA 528
           LCTHLWEHY YTMD+  F + +AYPL+EGCASFLLDWLI+G DGYLETNPSTSPEH FIA
Sbjct: 427 LCTHLWEHYTYTMDKVKFFKNKAYPLMEGCASFLLDWLIDGKDGYLETNPSTSPEHMFIA 486

Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
           PDGK A VSYS+TMDMAI +EVFS+IISAAE+L K +D  ++KV K+  RL P KIA+DG
Sbjct: 487 PDGKPASVSYSTTMDMAITKEVFSSIISAAEILGKTKDTFIDKVRKAQARLLPYKIAKDG 546

Query: 589 SIMEWV 594
           S+MEW 
Sbjct: 547 SLMEWA 552


>gi|4455171|emb|CAB36703.1| hypothetical protein [Arabidopsis thaliana]
 gi|7270376|emb|CAB80143.1| hypothetical protein [Arabidopsis thaliana]
          Length = 847

 Score =  760 bits (1963), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/600 (62%), Positives = 460/600 (76%), Gaps = 23/600 (3%)

Query: 10  TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           + PLK+TF GP++++TDAIPIGNGRLGA +WGGV SE L +NEDT+WTGVP DYTN  AP
Sbjct: 49  SRPLKLTFGGPSRNWTDAIPIGNGRLGATIWGGVSSEILNINEDTIWTGVPADYTNQKAP 108

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
           +AL++VR LVD   YAEAT+ +VKL G P+DVYQ++GD+ LEFD SH KY + +YRRELD
Sbjct: 109 EALAEVRRLVDERNYAEATSEAVKLSGQPSDVYQIVGDLNLEFDSSHRKYTQASYRRELD 168

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L TA A+V YSVG V+F+RE F+SNPDQVI+ KI  S+ GSLSF VS DS L +HS  N 
Sbjct: 169 LETAVAKVSYSVGAVDFSREFFASNPDQVIIAKIYASKPGSLSFKVSFDSELHHHSETNP 228

Query: 190 N-NQIIMEGRCPGKRIP----PKANAN----DDPKGIQFSAILEIKISDDRGTISALEDK 240
             NQI+M G C  KR+P       NA     DD KG+QF++ILE+++S+  G++S+L  K
Sbjct: 229 KANQILMRGSCRPKRLPVNLKKSINATNIPYDDHKGLQFASILEVRVSNG-GSVSSLGGK 287

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
           KL VE +DWAVLLL ASS+FDGPF  P DSK DP  E ++ + S++  SYSDLY RHL D
Sbjct: 288 KLSVEKADWAVLLLAASSNFDGPFTMPVDSKIDPAKECVNRISSVQKYSYSDLYARHLGD 347

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           YQKLF+RVS+ LS S       + +E       +AERV+SF+TD+DPSLVELLFQ+GRYL
Sbjct: 348 YQKLFNRVSLHLSGS-------STNETVQQATSTAERVRSFKTDQDPSLVELLFQYGRYL 400

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTW-----DSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           LISSSRPGTQVANLQ  +   L+P         APH+NINL+MNYW SLP N+ ECQEPL
Sbjct: 401 LISSSRPGTQVANLQA-FVVSLTPLLLLRYCSGAPHLNINLQMNYWHSLPGNIRECQEPL 459

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
           FD+++ L+ING KTAQVNY ASGWV H  +DIWAK+S DRG+ VWALWPMGGAWLCTH W
Sbjct: 460 FDYMSALAINGRKTAQVNYGASGWVAHQVSDIWAKTSPDRGEAVWALWPMGGAWLCTHAW 519

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           EHY YTMD++FL+K+ YPLLEGC SFLLDWLI+G DG+L+TNPSTSPEH F AP GK A 
Sbjct: 520 EHYTYTMDKEFLKKKGYPLLEGCTSFLLDWLIKGKDGFLQTNPSTSPEHMFTAPIGKPAS 579

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           VSYSSTMD+AII+EVF+ I+SA+E+L K  D L+ KV+ +  +L PT+I++DGSI EW +
Sbjct: 580 VSYSSTMDIAIIKEVFADIVSASEILGKTNDTLIGKVIAAQAKLPPTRISKDGSIREWAE 639


>gi|78708252|gb|ABB47227.1| large secreted protein, putative, expressed [Oryza sativa Japonica
           Group]
 gi|222612646|gb|EEE50778.1| hypothetical protein OsJ_31136 [Oryza sativa Japonica Group]
          Length = 851

 Score =  749 bits (1934), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/612 (59%), Positives = 457/612 (74%), Gaps = 32/612 (5%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PL++ F  P+++FTDA PIGNG LGA+VWGGV SE L+LN DTLWTG PG+YTNP AP  
Sbjct: 34  PLEVVFASPSRYFTDAAPIGNGSLGALVWGGVASEKLQLNHDTLWTGGPGNYTNPKAPAV 93

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
           LS VR LV+ GQYA+ATA +  L G    VYQ LGDI+L FD+    + E+T Y+R LDL
Sbjct: 94  LSKVRDLVNRGQYAKATAVAYGLSGDQTQVYQPLGDIDLAFDE----HVEDTNYKRNLDL 149

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
            TAT  V Y++G V  +REHFSSNP QVIVTKIS  + G++SF VSL + L++   V   
Sbjct: 150 RTATVNVSYTIGEVVHSREHFSSNPHQVIVTKISADKPGNVSFTVSLTTPLNHQIRVTNA 209

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           N+IIMEG CPG+R     NA+D P GI+FSAIL +++S   GT+  L DK LK+ G+D A
Sbjct: 210 NEIIMEGYCPGERPTEYGNASDHPVGIKFSAILYLQMSGSNGTVEILNDKMLKLVGADSA 269

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           VLLL A++SF+GPF+NPS+SK DPT+ +++ L   RN+SYS L   H+DDYQ LF RVS+
Sbjct: 270 VLLLAAATSFEGPFVNPSESKLDPTASALTTLTVARNMSYSQLKAYHVDDYQNLFQRVSL 329

Query: 311 QLSRS-------------PKDIVTDT-----------CSEE---NIDTVPSAERVKSFQT 343
           QLSR              P++ + +T           CS     N    P+ +R+ SF+ 
Sbjct: 330 QLSRDSNDALGGNGLVNLPENSLQETSVSDYAVQMVECSRFQGFNNSGKPTVDRILSFRD 389

Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
           DEDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIWN++ SP WD+APH NINL+MNYW +
Sbjct: 390 DEDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWNDETSPPWDAAPHPNINLQMNYWPA 449

Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
           LPCNLSECQEPLFDF+  LS+NG+KTA+VNY ASGWV H  TD+WAK+S D G  +WALW
Sbjct: 450 LPCNLSECQEPLFDFIGSLSVNGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPMWALW 509

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
           PMGG WL THLWEHY+YTMD+ FLEK AYPLLEG ASFLLDWLIEG+  YLETNPSTSPE
Sbjct: 510 PMGGPWLATHLWEHYSYTMDKQFLEKTAYPLLEGSASFLLDWLIEGNGDYLETNPSTSPE 569

Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
           H FIAPDG+ ACVSYS+TMDM+IIREVFSA++ ++++L K++  +V+++ K++PRL P K
Sbjct: 570 HYFIAPDGRKACVSYSTTMDMSIIREVFSAVLMSSDILGKSDSDMVQRIKKAIPRLPPIK 629

Query: 584 IAEDGSIMEWVQ 595
           +A DG+IMEW Q
Sbjct: 630 VARDGTIMEWAQ 641


>gi|218197301|gb|EEC79728.1| hypothetical protein OsI_21058 [Oryza sativa Indica Group]
          Length = 815

 Score =  748 bits (1930), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/586 (59%), Positives = 456/586 (77%), Gaps = 5/586 (0%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ F+ PA+HFTDA PIGNG LGAMVWG V SE L+LN DTLWTGVPG+YT+P+AP A
Sbjct: 21  PLKVVFDSPAEHFTDAAPIGNGSLGAMVWGSVASEKLQLNHDTLWTGVPGNYTDPNAPYA 80

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           L+ VR LVD  ++ +AT A+  LFG P +VYQ LGDI LEFD S L Y   +Y+RELDL 
Sbjct: 81  LAVVRKLVDGEKFVDATEAASGLFGGPTEVYQPLGDINLEFDSSSLGYT--SYKRELDLR 138

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TAT  + Y++G V+++REHF SNP QV  TKIS ++SG +SF +SL+S L+++  +   N
Sbjct: 139 TATVCISYNIGEVQYSREHFCSNPHQVFATKISANKSGHVSFTLSLNSQLNHNVRITNAN 198

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           ++IM+G CPG+R     N  +D  GI+F+  + ++I      ++ ++D+KL+++ +DW V
Sbjct: 199 EMIMQGTCPGRRPALHHNGANDAIGIKFATAVGLQIGGTSAKVTIIDDQKLRIDAADWVV 258

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           LL+ A+SSFDGPF+NPS+SK +P   +++ L   RN ++S L   HL+DYQ LFHRV++Q
Sbjct: 259 LLVAAASSFDGPFVNPSESKLNPEVAALNTLNISRNATFSQLKAAHLEDYQGLFHRVTLQ 318

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           LS++   +  D   E + D   +AER+ SF++DEDPSLVELLFQ+GRYLLISSSRPGTQV
Sbjct: 319 LSQASM-LEKDILEEVDHDVKTTAERINSFRSDEDPSLVELLFQYGRYLLISSSRPGTQV 377

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
           +NLQGIWN+D +P W+++PH+NINLEMNYW +LPCNLSECQEPLFD +  L++NG+KTA+
Sbjct: 378 SNLQGIWNQDFAPAWEASPHLNINLEMNYWPTLPCNLSECQEPLFDLIGSLAVNGTKTAK 437

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
           VNY ASGWV HH TDIWAKSSA     ++ALWPMGGAWLCTHLWE+Y Y++D++FLEKRA
Sbjct: 438 VNYQASGWVTHHVTDIWAKSSAYYVDAMYALWPMGGAWLCTHLWENYQYSLDKEFLEKRA 497

Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIRE 549
           YPLLEGCA FL+DWLI+G   YLETNPSTSPEH FIAP   G LA VSYS+TMD++IIRE
Sbjct: 498 YPLLEGCAMFLIDWLIKGPGDYLETNPSTSPEHPFIAPGTGGHLASVSYSTTMDISIIRE 557

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           VF A+IS+AEVL K++  LVE++ K+LP L P KI++DG+IMEW Q
Sbjct: 558 VFLAVISSAEVLGKSDTNLVERIKKALPMLPPVKISKDGTIMEWAQ 603


>gi|110288916|gb|ABG66022.1| large secreted protein, putative, expressed [Oryza sativa Japonica
           Group]
 gi|222612642|gb|EEE50774.1| hypothetical protein OsJ_31132 [Oryza sativa Japonica Group]
          Length = 815

 Score =  746 bits (1927), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/586 (59%), Positives = 456/586 (77%), Gaps = 5/586 (0%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ F+ PA+HFTDA PIGNG LGAMVWG V SE L+LN DTLWTGVPG+YT+P+AP A
Sbjct: 21  PLKVVFDSPAEHFTDAAPIGNGSLGAMVWGSVASEKLQLNHDTLWTGVPGNYTDPNAPYA 80

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           L+ VR LVD  ++ +AT A+  LFG P +VYQ LGDI LEFD S L Y   +Y+RELDL 
Sbjct: 81  LAVVRKLVDGEKFVDATEAASGLFGGPTEVYQPLGDINLEFDSSSLGYT--SYKRELDLR 138

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TAT  + Y++G V+++REHF SNP QV  TKIS ++SG +SF +SL+S L+++  +   N
Sbjct: 139 TATVCISYNIGEVQYSREHFCSNPHQVFATKISANKSGHVSFTLSLNSQLNHNVRITNAN 198

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           ++IM+G CPG+R     N  +D  GI+F+  + ++I      ++ ++D+KL+++ +DW V
Sbjct: 199 EMIMQGTCPGRRPALHHNGANDAIGIKFATAVGLQIGGTSAKVTIIDDQKLRIDAADWVV 258

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           LL+ A+SSFDGPF+NPS+SK +P   +++ L   RN ++S L   HL+DYQ LFHRV++Q
Sbjct: 259 LLVAAASSFDGPFVNPSESKLNPEVAALNTLNISRNATFSQLKAAHLEDYQGLFHRVTLQ 318

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           LS++   +  D   E + D   +AER+ SF++DEDPSLVELLFQ+GRYLLISSSRPGTQV
Sbjct: 319 LSQASM-LEKDILEEVDHDVKTTAERINSFRSDEDPSLVELLFQYGRYLLISSSRPGTQV 377

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
           +NLQGIWN+D +P W+++PH+NINLEMNYW +LPCNL+ECQEPLFD +  L++NG+KTA+
Sbjct: 378 SNLQGIWNQDFAPAWEASPHLNINLEMNYWPTLPCNLTECQEPLFDLIGSLAVNGTKTAK 437

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
           VNY ASGWV HH TDIWAKSSA     ++ALWPMGGAWLCTHLWE+Y Y++D++FLEKRA
Sbjct: 438 VNYQASGWVTHHVTDIWAKSSAYYVDAMYALWPMGGAWLCTHLWENYQYSLDKEFLEKRA 497

Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIRE 549
           YPLLEGCA FL+DWLI+G   YLETNPSTSPEH FIAP   G LA VSYS+TMD++IIRE
Sbjct: 498 YPLLEGCAMFLIDWLIKGPGDYLETNPSTSPEHPFIAPGTGGHLASVSYSTTMDISIIRE 557

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           VF A+IS+AEVL K++  LVE++ K+LP L P KI++DG+IMEW Q
Sbjct: 558 VFLAVISSAEVLGKSDTNLVERIKKALPMLPPVKISKDGTIMEWAQ 603


>gi|218184333|gb|EEC66760.1| hypothetical protein OsI_33136 [Oryza sativa Indica Group]
          Length = 851

 Score =  746 bits (1927), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/612 (59%), Positives = 456/612 (74%), Gaps = 32/612 (5%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PL++ F  P+++FTDA PIGNG LGA+VWGGV SE L+LN DTLWTG PG+YTNP AP  
Sbjct: 34  PLEVVFASPSRYFTDAAPIGNGSLGALVWGGVASEKLQLNHDTLWTGGPGNYTNPKAPAV 93

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
           LS VR LV+ GQYA+ATA +  L G    VYQ LGDI+L FD+    + E+T Y+R LDL
Sbjct: 94  LSKVRDLVNRGQYAKATAVAYGLSGDQTQVYQPLGDIDLAFDE----HVEDTNYKRNLDL 149

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
            TAT  V Y++G V  +REHFSSNP QVIVTKIS  + G++SF VSL + L++   V   
Sbjct: 150 RTATVNVSYTIGGVVHSREHFSSNPHQVIVTKISADKPGNVSFTVSLTTPLNHQIRVTNA 209

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           N+IIMEG CPG+R     NA+D P GI+FSAIL +++S   GT+  L DK LK+ G+D A
Sbjct: 210 NEIIMEGYCPGERPTEYGNASDHPVGIKFSAILYLQMSGSNGTVEILNDKMLKLVGADSA 269

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           VLLL AS+SF+GPF+NPS+SK DPT+ +++ L   RN+ YS L   H+DDYQ LF RVS+
Sbjct: 270 VLLLAASTSFEGPFVNPSESKLDPTASALTTLTVARNMPYSQLKAYHVDDYQNLFQRVSL 329

Query: 311 QLSRS-------------PKDIVTDT-----------CSEE---NIDTVPSAERVKSFQT 343
           QLS+              P++ + +T           CS     N    P+ +R+ SF+ 
Sbjct: 330 QLSQDSNDALGGNGLVNLPENSLQETSVSDYAVQMVECSRFQGFNNSGKPTVDRILSFRD 389

Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
           DEDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIWN++ SP WD+APH NINL+MNYW +
Sbjct: 390 DEDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWNDETSPPWDAAPHPNINLQMNYWPA 449

Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
           LPCNLSECQEPLFDF+  LS+NG+KTA+VNY ASGWV H  TD+WAK+S D G  +WALW
Sbjct: 450 LPCNLSECQEPLFDFIGSLSVNGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPMWALW 509

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
           PMGG WL THLWEHY+YTMD+ FLEK AYPLLEG ASFLLDWLIEG+  YLETNPSTSPE
Sbjct: 510 PMGGPWLATHLWEHYSYTMDKQFLEKTAYPLLEGSASFLLDWLIEGNGDYLETNPSTSPE 569

Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
           H FIAPDG+ ACVSYS+TMDM+IIREVFSA++ ++++L K++  +V+++ K++PRL P K
Sbjct: 570 HYFIAPDGRKACVSYSTTMDMSIIREVFSAVLMSSDILGKSDSDMVQRIKKAIPRLPPIK 629

Query: 584 IAEDGSIMEWVQ 595
           +A DG+IMEW Q
Sbjct: 630 VARDGTIMEWAQ 641


>gi|15451592|gb|AAK98716.1|AC090483_6 Hypothetical protein [Oryza sativa Japonica Group]
          Length = 872

 Score =  738 bits (1905), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/644 (57%), Positives = 461/644 (71%), Gaps = 59/644 (9%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PL++ F  P+++FTDA PIGNG LGA+VWGGV SE L+LN DTLWTG PG+YTNP AP  
Sbjct: 34  PLEVVFASPSRYFTDAAPIGNGSLGALVWGGVASEKLQLNHDTLWTGGPGNYTNPKAPAV 93

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
           LS VR LV+ GQYA+ATA +  L G    VYQ LGDI+L FD+    + E+T Y+R LDL
Sbjct: 94  LSKVRDLVNRGQYAKATAVAYGLSGDQTQVYQPLGDIDLAFDE----HVEDTNYKRNLDL 149

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
            TAT  V Y++G V  +REHFSSNP QVIVTKIS  + G++SF VSL + L++   V   
Sbjct: 150 RTATVNVSYTIGEVVHSREHFSSNPHQVIVTKISADKPGNVSFTVSLTTPLNHQIRVTNA 209

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           N+IIMEG CPG+R     NA+D P GI+FSAIL +++S   GT+  L DK LK+ G+D A
Sbjct: 210 NEIIMEGYCPGERPTEYGNASDHPVGIKFSAILYLQMSGSNGTVEILNDKMLKLVGADSA 269

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           VLLL A++SF+GPF+NPS+SK DPT+ +++ L   RN+SYS L   H+DDYQ LF RVS+
Sbjct: 270 VLLLAAATSFEGPFVNPSESKLDPTASALTTLTVARNMSYSQLKAYHVDDYQNLFQRVSL 329

Query: 311 QLSRS-------------PKDIVTDT-----------CSEE---NIDTVPSAERVKSFQT 343
           QLSR              P++ + +T           CS     N    P+ +R+ SF+ 
Sbjct: 330 QLSRDSNDALGGNGLVNLPENSLQETSVSDYAVQMVECSRFQGFNNSGKPTVDRILSFRD 389

Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
           DEDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIWN++ SP WD+APH NINL+MNYW +
Sbjct: 390 DEDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWNDETSPPWDAAPHPNINLQMNYWPA 449

Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
           LPCNLSECQEPLFDF+  LS+NG+KTA+VNY ASGWV H  TD+WAK+S D G  +WALW
Sbjct: 450 LPCNLSECQEPLFDFIGSLSVNGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPMWALW 509

Query: 464 PMGGAWLCTHLWEHYNYTMD--------------------RDFLEKRAYPLLEGCASFLL 503
           PMGG WL THLWEHY+YTMD                    + FLEK AYPLLEG ASFLL
Sbjct: 510 PMGGPWLATHLWEHYSYTMDKKENVFRPNKVDMIVLKDAKKQFLEKTAYPLLEGSASFLL 569

Query: 504 DWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK 563
           DWLIEG+  YLETNPSTSPEH FIAPDG+ ACVSYS+TMDM+IIREVFSA++ ++++L K
Sbjct: 570 DWLIEGNGDYLETNPSTSPEHYFIAPDGRKACVSYSTTMDMSIIREVFSAVLMSSDILGK 629

Query: 564 NEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRLNTSFSTCKL 607
           ++  +V+++ K++PRL P K+A DG+IMEW+       FS C L
Sbjct: 630 SDSDMVQRIKKAIPRLPPIKVARDGTIMEWL-------FSECLL 666


>gi|296083105|emb|CBI22509.3| unnamed protein product [Vitis vinifera]
          Length = 781

 Score =  737 bits (1903), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/622 (61%), Positives = 439/622 (70%), Gaps = 121/622 (19%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ F GPAKH+TDA+PIGNGRLGAMVWGGV SETL+LNE TLWTG PG+YTNPDAPKA
Sbjct: 34  PLKVRFFGPAKHWTDALPIGNGRLGAMVWGGVASETLQLNEGTLWTGTPGNYTNPDAPKA 93

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPAD------------------------------- 100
           LS+VR LVD+G Y  AT A+VKL G+P+D                               
Sbjct: 94  LSEVRKLVDNGDYVAATEAAVKLSGNPSDDELPSLLLDSFFDCDHVGLEVCVKYAPLLMG 153

Query: 101 -------VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSS 153
                  VYQLLGDI LEF+DSHL YAEETY RELDL+TAT  +KYSVG+VE+TREHF+S
Sbjct: 154 YLKFNFGVYQLLGDINLEFEDSHLAYAEETYSRELDLDTATVTIKYSVGDVEYTREHFAS 213

Query: 154 NPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDD 213
            PDQVIVTKISGS+ GS+SF VSLDS                       +IPPK      
Sbjct: 214 YPDQVIVTKISGSKPGSVSFTVSLDS-----------------------KIPPKV----- 245

Query: 214 PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
                             G I+ L+DKKLKVEGSDWAV                      
Sbjct: 246 ------------------GVINVLDDKKLKVEGSDWAVF--------------------- 266

Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
                   L+SI N SYSDLY RHL+DYQ LFHRVS+QLS+S K +         ++ V 
Sbjct: 267 -------TLKSIGNFSYSDLYARHLNDYQNLFHRVSLQLSKSSKSV---------MNRVS 310

Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
           +A RVKSF TDEDPSLVELLFQ+GRYLLIS SRPG+Q ANLQGIWN+D+ P WD APH+N
Sbjct: 311 TAARVKSFGTDEDPSLVELLFQYGRYLLISCSRPGSQPANLQGIWNKDIEPAWDGAPHLN 370

Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
           INL+MNYW SLPCNLSECQEPLFD+++ LSINGSKTA+VNY ASGWV H  +DIWAK+S 
Sbjct: 371 INLQMNYWPSLPCNLSECQEPLFDYMSSLSINGSKTAKVNYEASGWVTHQVSDIWAKTSP 430

Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
           DRG+ VWALWPMGGAWLCTHLWEHY +TMD+DFL+ +AYPLLEGCA FLLDWLIEG  GY
Sbjct: 431 DRGQAVWALWPMGGAWLCTHLWEHYTFTMDKDFLKNKAYPLLEGCARFLLDWLIEGRGGY 490

Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
           LETNPSTSPEH FIAPDGK A VSYS+TMD+AIIREVFSA++SAAEVL KNED LV+KV 
Sbjct: 491 LETNPSTSPEHMFIAPDGKPASVSYSTTMDIAIIREVFSAVVSAAEVLGKNEDELVQKVR 550

Query: 574 KSLPRLRPTKIAEDGSIMEWVQ 595
           ++ P+L PTKIA DGSIMEW Q
Sbjct: 551 QAQPKLPPTKIARDGSIMEWAQ 572


>gi|357146134|ref|XP_003573887.1| PREDICTED: alpha-L-fucosidase 2-like [Brachypodium distachyon]
          Length = 857

 Score =  735 bits (1898), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/613 (59%), Positives = 445/613 (72%), Gaps = 30/613 (4%)

Query: 10  TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           + PLK+ F  PAK+FTDA PIGNGRLGAMVWGGV SE L+LN DTLWTG PG+YTNP+AP
Sbjct: 38  SRPLKVVFASPAKYFTDAAPIGNGRLGAMVWGGVASERLQLNHDTLWTGGPGNYTNPNAP 97

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
             LS VRSLV  G YAEATA +  L G    +YQ LGDI+L F   H+KY    Y+R LD
Sbjct: 98  TVLSKVRSLVGKGLYAEATAVAYDLSGDQTQIYQPLGDIDLAFGQ-HIKYTN--YKRYLD 154

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L +AT  V Y+VG V ++REHFSSNP QVI TK+S ++ G++SF VSL + LD+  +V  
Sbjct: 155 LESATVNVTYTVGEVVYSREHFSSNPHQVIATKVSANKPGAVSFTVSLATPLDHRIHVTD 214

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
            N+IIMEG C G+R     +A+DDP GI+F AIL ++IS   GT+  L D  LK++G+D 
Sbjct: 215 TNEIIMEGCCAGERPVGDDSASDDPTGIKFCAILYLQISGANGTLQVLNDNMLKLDGADS 274

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           AVLLL A++SF+GPF+ PS+S  +P + + + L   R +SYS L   H+DDYQ LF RVS
Sbjct: 275 AVLLLAAATSFEGPFVKPSESTLNPKTSAFTTLNMARTMSYSQLKAYHMDDYQSLFQRVS 334

Query: 310 IQLSR-----------------SPKDIVTDTCSEE----------NIDTVPSAERVKSFQ 342
           +QLSR                 S +DI    C E+          N    P+ +R+ SF 
Sbjct: 335 LQLSRGSDNVLRGNSLPNSPENSCQDIAVSHCVEQISDRSWLKELNNSDKPTVDRIISFV 394

Query: 343 TDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
            DEDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIW+ D  P WD+APH NINL+MNYW 
Sbjct: 395 DDEDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWSNDTRPPWDAAPHPNINLQMNYWP 454

Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
           +LPCNLSECQEPLFDF+  LSING+KTA+VNY ASGWV H  TD+WAK+S D G  +WAL
Sbjct: 455 ALPCNLSECQEPLFDFIESLSINGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPMWAL 514

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
           WPMGG+WL THLWEHY++T+D  FLEK AYPLLEG ASFLL WLIEG  G LETNPSTSP
Sbjct: 515 WPMGGSWLATHLWEHYSFTLDTQFLEKTAYPLLEGSASFLLSWLIEGQGGQLETNPSTSP 574

Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
           EH FIAPDGK ACVSYS+TMDM++IREVFSA++ +A++L K+   +V+++ K+LPRL P 
Sbjct: 575 EHYFIAPDGKKACVSYSTTMDMSVIREVFSAVLLSADILGKSGTDVVQRIKKALPRLPPI 634

Query: 583 KIAEDGSIMEWVQ 595
           KIA D +IMEW +
Sbjct: 635 KIARDITIMEWAR 647


>gi|326518094|dbj|BAK07299.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 832

 Score =  731 bits (1887), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/588 (59%), Positives = 451/588 (76%), Gaps = 7/588 (1%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ F+ PA++FTDA PIGNG LGAMVWGGV S+ L+LN DTLWTGVPG+YT+P AP  
Sbjct: 34  PLKVAFSSPAEYFTDAAPIGNGSLGAMVWGGVSSDKLQLNHDTLWTGVPGNYTDPKAPGV 93

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           L++VR LVD G++A+ATA++  LFG  ++VYQ LG++ +EF  S   Y  ++Y+RELDL+
Sbjct: 94  LAEVRGLVDQGRFADATASAKGLFGGLSEVYQPLGELNIEFSTSEQVY--DSYKRELDLH 151

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TATA V Y++G V++TREHF SNP Q IVT+ S S  G +S  +SL S L++   V   N
Sbjct: 152 TATALVTYNIGGVQYTREHFCSNPHQAIVTRFSASTPGHVSCTLSLSSQLNHSVTVINEN 211

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           ++IMEG CPG+R   + N  D+  GI+F+A L +++       + L D+KL+++ +DW V
Sbjct: 212 EMIMEGICPGQRPGMRENGGDNVTGIRFTAALGLQMGGSAAKSTVLNDQKLRLDSADWVV 271

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
            ++ A+SSF GP +NP+DSK DPTS ++S L   RN ++  L   HLDDYQ LF+RV++Q
Sbjct: 272 FVVAAASSFYGPHVNPADSKLDPTSLALSMLNHSRNFTFDQLKAAHLDDYQSLFNRVTLQ 331

Query: 312 LSRSPKDI---VTDTCSEENI--DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           LS+   D    VT T  +E +  D   SA+RVKSF +DEDPSLVELLFQ+GRYLLIS SR
Sbjct: 332 LSQGSNDACTSVTRTDIQEQVAEDIRTSADRVKSFSSDEDPSLVELLFQYGRYLLISCSR 391

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PGTQV+NLQGIW++D++P WD+APH+NINL+MNYW +LPCNLSECQEPLFDFL  L++NG
Sbjct: 392 PGTQVSNLQGIWSQDIAPEWDAAPHLNINLQMNYWPALPCNLSECQEPLFDFLGSLAVNG 451

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
           +KTA+VNY A GWV HH +DIWAKSSA       A+WPMGGAWLCTHLWEHY +++D+DF
Sbjct: 452 TKTAKVNYQAGGWVTHHVSDIWAKSSAFLKNPKHAVWPMGGAWLCTHLWEHYQFSLDKDF 511

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           LE  AYPLLEGCA+FL+DWLIEG  GYLETNPSTSPEH F+APDGK A VSYS+TMD++I
Sbjct: 512 LENTAYPLLEGCANFLVDWLIEGPGGYLETNPSTSPEHAFVAPDGKPASVSYSTTMDVSI 571

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           IREVF A++S+AE+L K +  LVE++ K+LPRL P +IA D ++MEW 
Sbjct: 572 IREVFLAVLSSAELLGKADIDLVERIKKALPRLPPIQIARDRTVMEWA 619


>gi|326508462|dbj|BAJ95753.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 857

 Score =  728 bits (1878), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/611 (58%), Positives = 444/611 (72%), Gaps = 30/611 (4%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ F  PA++FTDA PIGNGRLGA+VWGGV SE L+LN DTLWTG PG+YTNP AP  
Sbjct: 40  PLKVVFASPARYFTDAAPIGNGRLGALVWGGVTSEKLQLNHDTLWTGGPGNYTNPKAPTV 99

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           LS+VRSLVD G Y EATA +  L G     YQ LGDI+L F + H+KY    Y R LDL 
Sbjct: 100 LSEVRSLVDKGLYPEATAVAYGLSGDETQSYQPLGDIDLAFGE-HIKYTN--YTRYLDLE 156

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           +AT  V YSVG V ++REHFSSNP QVI TKIS ++ G++S  VSL + LD+   V   N
Sbjct: 157 SATVNVTYSVGEVVYSREHFSSNPHQVIATKISANKPGAVSCTVSLATPLDHRIRVTDAN 216

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           +IIMEG CPG++     NA+D P G++F AIL + +S   G +  L DK LK++G+D AV
Sbjct: 217 EIIMEGSCPGEKPAGDGNASDHPPGMRFCAILYLLMSGANGQVQVLNDKMLKLDGADSAV 276

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           LLL A++SF+GPF+ P++S  DP + + + L   R++SY+ L   H+DDYQ LF RVS+Q
Sbjct: 277 LLLAAATSFEGPFVKPTESTLDPVASAFTTLNMARSMSYAQLKAYHMDDYQSLFQRVSLQ 336

Query: 312 LSRS-------------PKDIVTDT----CSEENIDTV----------PSAERVKSFQTD 344
           LSRS             P++I  DT    C+ + +D            P+ +R+ SF+ D
Sbjct: 337 LSRSSNDVLGGSTLARLPENISQDTAVSDCTVQMVDCSRLNELNNSEKPTVDRIISFRHD 396

Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
           EDPSLVELLFQFGRYLLIS SRPGTQV+NLQGIWN + +  W +APH NINL+MNYW SL
Sbjct: 397 EDPSLVELLFQFGRYLLISCSRPGTQVSNLQGIWNNETNAPWGAAPHPNINLQMNYWPSL 456

Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
           PCNLSECQ+PLFDF+  LS+NG+KTA+VNY  SGWV H  TD+WAK+S D G   WALWP
Sbjct: 457 PCNLSECQDPLFDFIGSLSVNGAKTAKVNYGVSGWVSHQVTDLWAKTSPDAGDPSWALWP 516

Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
           MGG WL THLWEHY++TMDR+FLE+ AYPLLEG ASFLL WLIEG +GYLETNPSTSPEH
Sbjct: 517 MGGPWLATHLWEHYSFTMDREFLERTAYPLLEGSASFLLSWLIEGQEGYLETNPSTSPEH 576

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
            FIAPDGK A VSYS+TMDM+IIREVFSA++ +A++L K+   +V+++  +LPRL P KI
Sbjct: 577 YFIAPDGKRASVSYSTTMDMSIIREVFSAVLLSADILGKSSTDVVQRIKAALPRLPPIKI 636

Query: 585 AEDGSIMEWVQ 595
             DG+IMEW +
Sbjct: 637 GRDGTIMEWAR 647


>gi|357116946|ref|XP_003560237.1| PREDICTED: alpha-L-fucosidase 2-like [Brachypodium distachyon]
          Length = 818

 Score =  721 bits (1861), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/586 (58%), Positives = 436/586 (74%), Gaps = 5/586 (0%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ F  PA+HFTDA PIGNG LGAMVWGGV SE L+LN DTLWTGVPG+YT+P  P A
Sbjct: 20  PLKVVFASPAEHFTDAAPIGNGSLGAMVWGGVASEKLQLNLDTLWTGVPGNYTDPSVPSA 79

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           ++ VR LV   Q+ +AT A+  L+G P +VYQ LGD+ +EF  S   Y+  +Y+RELDL+
Sbjct: 80  VAVVRKLVHDRQFVDATNAASGLYGGPTEVYQPLGDVNIEFGTSSQDYS--SYKRELDLH 137

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TAT  V Y++G V++TREHF SNP QVIVTK+S ++SG +S  +SLDS L +   V   N
Sbjct: 138 TATVLVTYNIGEVQYTREHFCSNPHQVIVTKLSANKSGHISCTLSLDSKLTHSVRVTNAN 197

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           ++IM+G CPG+R   + N  +D  GI+F+A+L +++         L D  L+++ +DW +
Sbjct: 198 EMIMDGTCPGQRHVLQQNETNDATGIKFTAVLSLQMGGAMAKAEVLNDHNLRIDNADWVL 257

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           LL+ A+SSF GPFINPS+SK DP S ++  L   RN+++  L   HL DYQ LFHRVS+ 
Sbjct: 258 LLVTAASSFSGPFINPSNSKIDPESVALRNLNMSRNVTFDQLKAAHLKDYQGLFHRVSLI 317

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           LS +P  I     +E       +AERV SF+++EDPSLVELLFQ+GRYLLIS SRPGTQV
Sbjct: 318 LSHAPA-IEKTNLNETGEAIKITAERVNSFRSNEDPSLVELLFQYGRYLLISCSRPGTQV 376

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
           +NLQGIWN+DLSP W SAPH+NINL+MNYW +LPCNL ECQEPL DF+  L++NG+KTA+
Sbjct: 377 SNLQGIWNQDLSPAWQSAPHLNINLQMNYWPTLPCNLGECQEPLIDFIAALAVNGTKTAK 436

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
           +NY  SGWV HH +DIWAKSSA      +A+WPMGGAWLCTHLWEHY Y++D++FL+  A
Sbjct: 437 INYQTSGWVTHHVSDIWAKSSAFNEDAKYAVWPMGGAWLCTHLWEHYQYSLDKEFLKNTA 496

Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD--GKLACVSYSSTMDMAIIRE 549
           YPLLEGCA FL DWL EG +GYLETNPS SPEH FIAPD  G+ A VSYS+TMD++IIRE
Sbjct: 497 YPLLEGCALFLADWLTEGRNGYLETNPSISPEHSFIAPDSGGQQASVSYSTTMDVSIIRE 556

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +F AIIS+AEVL K++  LV K+ K+L RL P  IA+D +IMEW Q
Sbjct: 557 IFMAIISSAEVLGKSDSTLVPKIKKALSRLTPIMIAKDHTIMEWAQ 602


>gi|414868290|tpg|DAA46847.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
          Length = 727

 Score =  717 bits (1850), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/611 (59%), Positives = 444/611 (72%), Gaps = 30/611 (4%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ F  PAK+FTDA PIGNGRLGAMVWG V SE L+LN DTLWTG PG+YTNP+AP  
Sbjct: 41  PLKVVFGSPAKYFTDAAPIGNGRLGAMVWGCVESERLQLNHDTLWTGGPGNYTNPNAPAV 100

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           LS VRSLV++G+Y EAT+A+  L G    V+Q LGDI+L F +  +KY    YRRELDL+
Sbjct: 101 LSKVRSLVENGKYPEATSAAYDLSGDQTQVFQPLGDIDLVFGED-IKYTN--YRRELDLH 157

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TAT  V Y+VG++ +TREHFSSNP QVIVTKIS ++ G++SF VSL S LD+   V   N
Sbjct: 158 TATVTVTYTVGDIVYTREHFSSNPHQVIVTKISANKPGNVSFTVSLTSPLDHKIRVTHAN 217

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           +IIMEG CPG+R      A D P GI+FSAIL ++I+    T+  L D  LK++ +D  V
Sbjct: 218 EIIMEGSCPGQRPEEIKTAADQPIGIKFSAILYLQINGANSTVEVLNDNMLKLDCADSVV 277

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           LLL A++SF   FI PS+SK DPT  + + L   R  SYS L   H+DDYQ LF RVS+Q
Sbjct: 278 LLLAATTSFQSAFIKPSESKLDPTVSAFTTLSIARRTSYSQLKAYHIDDYQTLFQRVSLQ 337

Query: 312 LS-------RSPKDIVTDTCSEENIDTV--------------------PSAERVKSFQTD 344
           LS       R  + + +   S +  +                      P+ ER+ +F+ +
Sbjct: 338 LSQGSNYDLRRSRLVQSAETSSQGANVSDYGFQISGCTRLTSLNSFVKPTVERIVTFKDN 397

Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
           EDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIW+ D SP WD+APH NINL+MNYW +L
Sbjct: 398 EDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWSNDTSPPWDAAPHPNINLQMNYWPAL 457

Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
           PCNLSECQEPLFDF+  LSING+KTA+VNY ASGWV H  TD+WAK+S D G  VWALWP
Sbjct: 458 PCNLSECQEPLFDFIGSLSINGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPVWALWP 517

Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
           MGG WL THLWEHY +T+D+ FLEK AYPLLEG A FLLDWLIEGH GYLETNPSTSPEH
Sbjct: 518 MGGPWLATHLWEHYCFTLDKHFLEKTAYPLLEGSARFLLDWLIEGHRGYLETNPSTSPEH 577

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
            FIAPDGK ACVSYS+TMD++IIREVFSA+I +A++L K++  +V+++ K+LP L P K+
Sbjct: 578 YFIAPDGKEACVSYSTTMDISIIREVFSALILSADILGKSDTNVVQRIKKALPNLPPMKV 637

Query: 585 AEDGSIMEWVQ 595
           A DG+IMEW Q
Sbjct: 638 ARDGTIMEWAQ 648


>gi|326493958|dbj|BAJ85441.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 636

 Score =  711 bits (1834), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/597 (58%), Positives = 434/597 (72%), Gaps = 30/597 (5%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ F  PA++FTDA PIGNGRLGA+VWGGV SE L+LN DTLWTG PG+YTNP AP  
Sbjct: 40  PLKVVFASPARYFTDAAPIGNGRLGALVWGGVTSEKLQLNHDTLWTGGPGNYTNPKAPTV 99

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           LS+VRSLVD G Y EATA +  L G     YQ LGDI+L F + H+KY    Y R LDL 
Sbjct: 100 LSEVRSLVDKGLYPEATAVAYGLSGDETQSYQPLGDIDLAFGE-HIKYTN--YTRYLDLE 156

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           +AT  V YSVG V ++REHFSSNP QVI TKIS ++ G++S  VSL + LD+   V   N
Sbjct: 157 SATVNVTYSVGEVVYSREHFSSNPHQVIATKISANKPGAVSCTVSLATPLDHRIRVTDAN 216

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           +IIMEG CPG++     NA+D P G++F AIL + +S   G +  L DK LK++G+D AV
Sbjct: 217 EIIMEGSCPGEKPAGDGNASDHPPGMRFCAILYLLMSGANGQVQVLNDKMLKLDGADSAV 276

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           LLL A++SF+GPF+ P++S  DP + + + L   R++SY+ L   H+DDYQ LF RVS+Q
Sbjct: 277 LLLAAATSFEGPFVKPTESTLDPVASAFTTLNMARSMSYAQLKAYHMDDYQSLFQRVSLQ 336

Query: 312 LSRS-------------PKDIVTDT----CSEENIDTV----------PSAERVKSFQTD 344
           LSRS             P++I  DT    C+ + +D            P+ +R+ SF+ D
Sbjct: 337 LSRSSNDVLGGSTLARLPENISQDTAVSDCTVQMVDCSRLNELNNSEKPTVDRIISFRHD 396

Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
           EDPSLVELLFQFGRYLLIS SRPGTQV+NLQGIWN + +  W +APH NINL+MNYW SL
Sbjct: 397 EDPSLVELLFQFGRYLLISCSRPGTQVSNLQGIWNNETNAPWGAAPHPNINLQMNYWPSL 456

Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
           PCNLSECQ+PLFDF+  LS+NG+KTA+VNY  SGWV H  TD+WAK+S D G   WALWP
Sbjct: 457 PCNLSECQDPLFDFIGSLSVNGAKTAKVNYGVSGWVSHQVTDLWAKTSPDAGDPSWALWP 516

Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
           MGG WL THLWEHY++TMDR+FLE+ AYPLLEG ASFLL WLIEG +GYLETNPSTSPEH
Sbjct: 517 MGGPWLATHLWEHYSFTMDREFLERTAYPLLEGSASFLLSWLIEGQEGYLETNPSTSPEH 576

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
            FIAPDGK A VSYS+TMDM+IIREVFSA++ +A++L K+   +V+++  +LPRL P
Sbjct: 577 YFIAPDGKRASVSYSTTMDMSIIREVFSAVLLSADILGKSSTDVVQRIKAALPRLPP 633


>gi|326513306|dbj|BAK06893.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 815

 Score =  710 bits (1832), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/594 (57%), Positives = 440/594 (74%), Gaps = 7/594 (1%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           A+      PLK+ F  PA+HFTDA PIGNG LGAMVWGGV S+ L+LN DTLWTGVPGDY
Sbjct: 12  ADEAEEERPLKVVFASPAEHFTDAAPIGNGSLGAMVWGGVASDKLQLNLDTLWTGVPGDY 71

Query: 64  TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
           T+P AP AL+ VR LVD G++ +AT+A+  LFG   +VYQ LGD+ LEFD S+ +Y+  +
Sbjct: 72  TDPKAPAALAAVRKLVDDGRFVDATSAASGLFGGQTEVYQPLGDMNLEFDISNQEYS--S 129

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           Y+RELDL+TAT  + Y++G V+ TREHF SNP QVIVTKIS ++S  +S  +SL+S L++
Sbjct: 130 YKRELDLHTATTVITYNIGEVQHTREHFCSNPHQVIVTKISANKSEHVSLTLSLNSKLNH 189

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
              V   N++IMEG CP  R+    N   D  GI F+A+L +++S     +  L D+KL+
Sbjct: 190 RVRVMNANEMIMEGSCPVHRL--HENEASDASGIGFAAVLSLQMSGAAAKVVVLNDQKLR 247

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           ++ +DW +L + A+SSF+GP +NPSDSK DP S ++ A+   RNL++  L   HL DYQ 
Sbjct: 248 IDNADWVLLRVTAASSFNGPSVNPSDSKLDPESAALRAMNMSRNLTFDQLKASHLKDYQG 307

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           LFHRVS++LS+SP  I      E       +AERV  F++DED SLVELLFQ+GRYLLIS
Sbjct: 308 LFHRVSLRLSQSPA-IEKINMKEVGEAIKTTAERVNGFRSDEDSSLVELLFQYGRYLLIS 366

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
            SRPGTQ++NLQGIWN+DL P W+ APH+NINL+MNYW +LPCNL ECQEPL DF+  L+
Sbjct: 367 CSRPGTQISNLQGIWNQDLLPQWECAPHLNINLQMNYWPTLPCNLIECQEPLLDFIASLA 426

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
           +NG+KTA++NY ASGWV HH TDIWAKSSA      +++WPMGGAWLCTHLWEHY Y +D
Sbjct: 427 VNGTKTAKINYQASGWVTHHVTDIWAKSSAFNEDAKYSVWPMGGAWLCTHLWEHYQYLLD 486

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG--KLACVSYSST 541
           +DFL+  AYPLLEGCA FL DWLIEG  G LETNPSTSPEH FIAP      A VSYS+T
Sbjct: 487 KDFLKNTAYPLLEGCALFLTDWLIEGPRGLLETNPSTSPEHAFIAPGSGDHQASVSYSTT 546

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           MD+AIIRE+FSA+IS+AE+L K++  LV+K+ ++LPRL    IA+D +++EW Q
Sbjct: 547 MDIAIIREIFSAVISSAEILGKSDTPLVQKIKEALPRLPQNTIAKDQTLVEWAQ 600


>gi|242047972|ref|XP_002461732.1| hypothetical protein SORBIDRAFT_02g007180 [Sorghum bicolor]
 gi|241925109|gb|EER98253.1| hypothetical protein SORBIDRAFT_02g007180 [Sorghum bicolor]
          Length = 864

 Score =  686 bits (1769), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/618 (56%), Positives = 446/618 (72%), Gaps = 37/618 (5%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PL + F  PA++FTDA PIGNG LG MVWGGV ++ L+LN DTLWTG PG YT+PDAP A
Sbjct: 47  PLTVVFASPAENFTDAAPIGNGSLGGMVWGGVATDKLQLNHDTLWTGAPGSYTDPDAPAA 106

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEF--DDSHLKYAEETYRRELD 129
           L+ VR LVD G++A+ATAA+ +LFG  ++VYQ +GD+ LE     S  + A ++Y+RELD
Sbjct: 107 LAAVRELVDQGRFADATAAATRLFGGQSEVYQPMGDVNLELGGSGSDQQPAYDSYKRELD 166

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L+TAT  V YSVG V++TREHF SNP QVI+T+I+ SE G +S  +SL S L N   V  
Sbjct: 167 LHTATVLVTYSVGPVQYTREHFCSNPHQVIITRIAASEPGHVSCTLSLSSQLKNTVTVTN 226

Query: 190 NNQIIMEGRCPG-------------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
            NQ++MEG CP                     + +    GI+F+A+L +++  D+   + 
Sbjct: 227 ANQVVMEGVCPRQRPPAPPRLMLLRNSSSGDDDDDLTTGGIKFAAVLGVQMGGDKAKAAV 286

Query: 237 LEDK-KLKVEGSDWAVLLLVASSSFDGPFINPSDSK-KDPTSESMSALQSIRNLSYSDLY 294
           L D+ KL +E +DW VL++ ASSSFDGPF++PSDS+  DPTS +++ L    +L+Y  L 
Sbjct: 287 LNDENKLSLESADWIVLIVAASSSFDGPFVSPSDSRLDDPTSAAVATLNRATSLTYEQLK 346

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTD-------------------TCSEENIDTVPSA 335
             HLDDYQ+LFHRV+++LS     ++ D                      +E I    SA
Sbjct: 347 AAHLDDYQRLFHRVTLRLSPPGGGLLEDARGGGLMMTGGKETMLKRGVGGDEGIIRT-SA 405

Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNIN 395
           +RVKSF TDEDPSLVELLFQ+GRYLLIS SRPGTQV+NLQGIWN++++P WD+APH+NIN
Sbjct: 406 DRVKSFATDEDPSLVELLFQYGRYLLISCSRPGTQVSNLQGIWNQEVAPAWDAAPHLNIN 465

Query: 396 LEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADR 455
           L+MNYW +LPCNLSECQEPLFDFL  L++NG+KTA+VNY A GWV HH +DIWAKSSA  
Sbjct: 466 LQMNYWPTLPCNLSECQEPLFDFLQSLAVNGTKTAKVNYQARGWVTHHVSDIWAKSSAFI 525

Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLE 515
                A+WPMGGAWLCTHLWEHY Y++D+DFLE  AYPLLEGCA+FL+DWLIEG  G+L+
Sbjct: 526 KNPKHAVWPMGGAWLCTHLWEHYQYSLDKDFLEYTAYPLLEGCATFLVDWLIEGPGGFLQ 585

Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
           TNPSTSPEH F APDGK A VSYS+TMD++IIREV SA++ +AE+LEK++  LVEK+ K+
Sbjct: 586 TNPSTSPEHAFTAPDGKPASVSYSTTMDISIIREVSSAVLLSAEILEKSDTDLVEKIKKA 645

Query: 576 LPRLRPTKIAEDGSIMEW 593
           LPRL P + A D +IMEW
Sbjct: 646 LPRLPPIQFARDNTIMEW 663


>gi|110288917|gb|ABG66023.1| large secreted protein, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 708

 Score =  627 bits (1617), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 292/497 (58%), Positives = 386/497 (77%), Gaps = 5/497 (1%)

Query: 101 VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIV 160
           VYQ LGDI LEFD S L Y   +Y+RELDL TAT  + Y++G V+++REHF SNP QV  
Sbjct: 3   VYQPLGDINLEFDSSSLGYT--SYKRELDLRTATVCISYNIGEVQYSREHFCSNPHQVFA 60

Query: 161 TKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFS 220
           TKIS ++SG +SF +SL+S L+++  +   N++IM+G CPG+R     N  +D  GI+F+
Sbjct: 61  TKISANKSGHVSFTLSLNSQLNHNVRITNANEMIMQGTCPGRRPALHHNGANDAIGIKFA 120

Query: 221 AILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMS 280
             + ++I      ++ ++D+KL+++ +DW VLL+ A+SSFDGPF+NPS+SK +P   +++
Sbjct: 121 TAVGLQIGGTSAKVTIIDDQKLRIDAADWVVLLVAAASSFDGPFVNPSESKLNPEVAALN 180

Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
            L   RN ++S L   HL+DYQ LFHRV++QLS++   +  D   E + D   +AER+ S
Sbjct: 181 TLNISRNATFSQLKAAHLEDYQGLFHRVTLQLSQASM-LEKDILEEVDHDVKTTAERINS 239

Query: 341 FQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
           F++DEDPSLVELLFQ+GRYLLISSSRPGTQV+NLQGIWN+D +P W+++PH+NINLEMNY
Sbjct: 240 FRSDEDPSLVELLFQYGRYLLISSSRPGTQVSNLQGIWNQDFAPAWEASPHLNINLEMNY 299

Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
           W +LPCNL+ECQEPLFD +  L++NG+KTA+VNY ASGWV HH TDIWAKSSA     ++
Sbjct: 300 WPTLPCNLTECQEPLFDLIGSLAVNGTKTAKVNYQASGWVTHHVTDIWAKSSAYYVDAMY 359

Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
           ALWPMGGAWLCTHLWE+Y Y++D++FLEKRAYPLLEGCA FL+DWLI+G   YLETNPST
Sbjct: 360 ALWPMGGAWLCTHLWENYQYSLDKEFLEKRAYPLLEGCAMFLIDWLIKGPGDYLETNPST 419

Query: 521 SPEHEFIAP--DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
           SPEH FIAP   G LA VSYS+TMD++IIREVF A+IS+AEVL K++  LVE++ K+LP 
Sbjct: 420 SPEHPFIAPGTGGHLASVSYSTTMDISIIREVFLAVISSAEVLGKSDTNLVERIKKALPM 479

Query: 579 LRPTKIAEDGSIMEWVQ 595
           L P KI++DG+IMEW Q
Sbjct: 480 LPPVKISKDGTIMEWAQ 496


>gi|302773137|ref|XP_002969986.1| hypothetical protein SELMODRAFT_171005 [Selaginella moellendorffii]
 gi|300162497|gb|EFJ29110.1| hypothetical protein SELMODRAFT_171005 [Selaginella moellendorffii]
          Length = 791

 Score =  619 bits (1596), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 296/588 (50%), Positives = 411/588 (69%), Gaps = 10/588 (1%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + F  PA+++ +A+P+GNGRLGAMV+GG  S+ ++LNEDTLW+G P D+ NP+A + L
Sbjct: 5   LSVDFFDPARYWVEALPVGNGRLGAMVFGGSSSDLIQLNEDTLWSGGPRDWNNPNAVQVL 64

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
             VR LV   +YAEA+  S ++ G   +VYQ LGDI+L+F  SH  Y  ++Y R+LDLNT
Sbjct: 65  PKVRQLVWDEKYAEASDLSKEMLGPYTEVYQPLGDIKLDFGASHATYDAQSYHRQLDLNT 124

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           A   V Y+VG + +TRE F+S P QVIV +I+ S++G++SF+ +LDS L  ++YV  +N 
Sbjct: 125 ALVSVSYAVGGINYTREVFASYPHQVIVIRITSSKAGAVSFSATLDSPLQTNAYVKDSNF 184

Query: 193 IIMEGRCPGKRIPPKANA----NDDPKGIQFSAILEIKISDDRGT-ISALEDKKLKVEGS 247
           I+++G+CP     P  ++    +D   G+ F+A++E++ S   G+ I+ L  ++++VE  
Sbjct: 185 IVVQGQCPLHVEEPTLSSPRCESDQKTGMSFAAVMEVRTSSGAGSVITKLGIQQVRVENV 244

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           DWA+L+L ASSSFDGPF +P+ + KDP + S++ L+ +  LSY  LY  HL DYQ LFHR
Sbjct: 245 DWAMLVLAASSSFDGPFKDPTSTGKDPVAASLATLKLVEALSYKKLYAAHLKDYQALFHR 304

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           VS+Q+++  ++    + +  +       ER+++F ++EDP++V LLFQFGRYLLISSSRP
Sbjct: 305 VSLQINKKSRENSVVSSTSMSTQ-----ERIQAFASNEDPAMVVLLFQFGRYLLISSSRP 359

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           GT VANLQGIWN+DL P W   PH+NINLEMNYW +  CNL+EC EPLFDF++ ++INGS
Sbjct: 360 GTFVANLQGIWNKDLKPAWRCVPHLNINLEMNYWPAEVCNLAECHEPLFDFVSSMAINGS 419

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
            TA+VNY   GWV HH  DIW +++   G  V+AL+PMGGAWLC HLWEHY +++D +FL
Sbjct: 420 HTAKVNYNMRGWVTHHNADIWVQTAPIGGDPVYALFPMGGAWLCLHLWEHYRFSLDMEFL 479

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
             +AYPLL GCA FL DWL   + G L TNPSTSPEH FIAPDGK A VSY+S MDMAII
Sbjct: 480 RSKAYPLLTGCAQFLFDWLTGDNHGMLVTNPSTSPEHVFIAPDGKEASVSYASAMDMAII 539

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           R VF A  SAA +L++        +  +   L P +I+  G +MEW +
Sbjct: 540 RAVFDATSSAATILQEPNSQFTANLKHATENLFPPEISSSGLLMEWAK 587


>gi|302799394|ref|XP_002981456.1| hypothetical protein SELMODRAFT_178891 [Selaginella moellendorffii]
 gi|300150996|gb|EFJ17644.1| hypothetical protein SELMODRAFT_178891 [Selaginella moellendorffii]
          Length = 788

 Score =  616 bits (1588), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 300/589 (50%), Positives = 413/589 (70%), Gaps = 15/589 (2%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + F  PA+++ +A+P+GNGRLGAMV+GG  S+ ++LN DTLW+G P D+ NP+A + L
Sbjct: 5   LSVDFFDPARYWVEALPVGNGRLGAMVFGGSSSDLIQLN-DTLWSGGPRDWNNPNAVQVL 63

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
             VR LV   +YAEA+  S ++ G   +VYQ LGDI+L+F  SH  Y  ++Y R+LDLN 
Sbjct: 64  PKVRQLVWDEKYAEASDLSKQMLGPYTEVYQPLGDIKLDFGTSHATYDAQSYHRQLDLNA 123

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           A   V+Y++G V +TRE F+S P QVIV +IS S++G++SF+ +LDS L  ++YV  +N 
Sbjct: 124 ALVSVRYAIGGVNYTREVFASYPHQVIVIRISSSKAGAVSFSATLDSPLQTNAYVKDSNF 183

Query: 193 IIMEGRCPGKRIPPKANA----NDDPKGIQFSAILEIKISDDRGT-ISALEDKKLKVEGS 247
           I+++G+CP     P  ++    +D   G+ F+A++E++ S   G+ I+ L  ++++VE  
Sbjct: 184 IVVQGQCPLHVEEPTLSSPRCESDQKTGMSFAAVMEVRTSSGAGSVITKLGIQQVRVENV 243

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           DWA+L+L ASSSFDGPF NP+   KDP + S++ L+S+  LSY  LY  HL DYQ LFHR
Sbjct: 244 DWAMLVLAASSSFDGPFKNPTG--KDPVAASLATLKSVEALSYEKLYATHLKDYQALFHR 301

Query: 308 VSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           VS+++++ S ++ V  T S      + + ER+++F ++EDP++V LLFQFGRYLLISSSR
Sbjct: 302 VSLRINKKSGENSVASTTS------MSTQERIQAFASNEDPAMVSLLFQFGRYLLISSSR 355

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PGT VANLQGIWN+DL P W   PH+NINLEMNYW +  CNL+EC EPLFDF++ ++ING
Sbjct: 356 PGTFVANLQGIWNKDLKPAWRCVPHLNINLEMNYWPAEVCNLAECHEPLFDFVSSMAING 415

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
           S TA+VNY   GWV HH  DIW +++   G  V+AL+PMGGAWLC HLWEHY +++D +F
Sbjct: 416 SHTAKVNYNMRGWVTHHNADIWVQTAPIGGDPVYALFPMGGAWLCLHLWEHYRFSLDMEF 475

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           L  +AYPLL GCA FL DWL   + G L TNPSTSPEH FIAPDGK A VSY+S MDMAI
Sbjct: 476 LRSKAYPLLTGCAQFLFDWLTGDNHGMLVTNPSTSPEHVFIAPDGKQASVSYASAMDMAI 535

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           IR VF A  SAA +L++        +  +   L P +I+  G +MEW +
Sbjct: 536 IRSVFDATSSAAAILQEPNSQFTANLKHATENLFPPEISSSGLLMEWAK 584


>gi|168043560|ref|XP_001774252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674379|gb|EDQ60888.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 818

 Score =  584 bits (1506), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 287/590 (48%), Positives = 390/590 (66%), Gaps = 32/590 (5%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGH 97
           MV GGV SE ++LNEDTLW+G P D+ NP A + L  VR LV  G+YAEAT  + K+ G 
Sbjct: 1   MVHGGVKSELVQLNEDTLWSGGPTDWNNPKALETLPRVRELVKEGKYAEATTEAQKMLGP 60

Query: 98  PADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
             +VYQ LGD++LEFDDSH  Y +E+YRR+LDL+TA   V Y +G+V + R+ F+S P Q
Sbjct: 61  DPEVYQPLGDLKLEFDDSHNTYDKESYRRQLDLDTAMTYVNYEIGDVSYLRQAFTSYPHQ 120

Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP--GKRIPPKANANDDPK 215
           V   +I+GS+SGS+SF+V+LDS L     V G+  I ++G+CP    ++   A+     K
Sbjct: 121 VFAMRIAGSKSGSVSFSVTLDSQLMLGKEVVGSKYIALKGQCPIDSNKVTEVASPTRSSK 180

Query: 216 --GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
             G++F A+L++++S + G +  ++ + LKV  +DWAVL L ASSSFDGPF +PS S  +
Sbjct: 181 KQGMEFVAVLQVEVSGEAGRLQVVDKQTLKVHQADWAVLYLTASSSFDGPFKDPSISGIE 240

Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD-----------IVTD 322
           PTS + +AL ++ +LS+ D+   HL DYQ LFHRVS+ +    KD           IV  
Sbjct: 241 PTSLAFAALANLVDLSFDDILAAHLADYQTLFHRVSLHVDNEEKDLGLWELIVPSEIVES 300

Query: 323 TCSEENI-----------------DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
              E                    + + + +R+ +F  DEDP LV LLFQFGRYLLI+SS
Sbjct: 301 KTVESGAQVSTGVDGEVYPQNAWKERISTRDRILNFDGDEDPDLVVLLFQFGRYLLIASS 360

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RP + V+NLQG+W+  L P W   P +NINLEMNYW +  C+L+EC  PLFDFL  +++ 
Sbjct: 361 RPNSFVSNLQGVWSNSLHPAWRCCPTLNINLEMNYWPAETCSLAECHLPLFDFLEQIAVT 420

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G+ TA+VNY   GWV HH  DIWA S+   G  VWALWPM GAW+C HLWEHY ++ D +
Sbjct: 421 GATTAKVNYGLGGWVSHHNADIWAHSAPVSGDPVWALWPMSGAWICLHLWEHYTFSQDEE 480

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           FL  RAYPL +GCA F ++WL+E   G+L TNPSTSPEH FIAPDG+ ACVSY STMDMA
Sbjct: 481 FLRNRAYPLFKGCAEFFVNWLVEDGKGHLVTNPSTSPEHHFIAPDGQSACVSYGSTMDMA 540

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           I+   F+A++SAA+++ ++E  LV +V  ++ RL P KI  DG ++EWV+
Sbjct: 541 ILHNFFNAVVSAAKIVGQDEAELVSEVKSAVGRLLPAKIGSDGRLLEWVE 590


>gi|414868291|tpg|DAA46848.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
          Length = 567

 Score =  553 bits (1424), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 285/500 (57%), Positives = 350/500 (70%), Gaps = 30/500 (6%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ F  PAK+FTDA PIGNGRLGAMVWG V SE L+LN DTLWTG PG+YTNP+AP  
Sbjct: 41  PLKVVFGSPAKYFTDAAPIGNGRLGAMVWGCVESERLQLNHDTLWTGGPGNYTNPNAPAV 100

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           LS VRSLV++G+Y EAT+A+  L G    V+Q LGDI+L F +  +KY    YRRELDL+
Sbjct: 101 LSKVRSLVENGKYPEATSAAYDLSGDQTQVFQPLGDIDLVFGED-IKYTN--YRRELDLH 157

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TAT  V Y+VG++ +TREHFSSNP QVIVTKIS ++ G++SF VSL S LD+   V   N
Sbjct: 158 TATVTVTYTVGDIVYTREHFSSNPHQVIVTKISANKPGNVSFTVSLTSPLDHKIRVTHAN 217

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           +IIMEG CPG+R      A D P GI+FSAIL ++I+    T+  L D  LK++ +D  V
Sbjct: 218 EIIMEGSCPGQRPEEIKTAADQPIGIKFSAILYLQINGANSTVEVLNDNMLKLDCADSVV 277

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           LLL A++SF   FI PS+SK DPT  + + L   R  SYS L   H+DDYQ LF RVS+Q
Sbjct: 278 LLLAATTSFQSAFIKPSESKLDPTVSAFTTLSIARRTSYSQLKAYHIDDYQTLFQRVSLQ 337

Query: 312 LS-------RSPKDIVTDTCSEENIDTV--------------------PSAERVKSFQTD 344
           LS       R  + + +   S +  +                      P+ ER+ +F+ +
Sbjct: 338 LSQGSNYDLRRSRLVQSAETSSQGANVSDYGFQISGCTRLTSLNSFVKPTVERIVTFKDN 397

Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
           EDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIW+ D SP WD+APH NINL+MNYW +L
Sbjct: 398 EDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWSNDTSPPWDAAPHPNINLQMNYWPAL 457

Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
           PCNLSECQEPLFDF+  LSING+KTA+VNY ASGWV H  TD+WAK+S D G  VWALWP
Sbjct: 458 PCNLSECQEPLFDFIGSLSINGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPVWALWP 517

Query: 465 MGGAWLCTHLWEHYNYTMDR 484
           MGG WL THLWEHY +T+D+
Sbjct: 518 MGGPWLATHLWEHYCFTLDK 537


>gi|337750325|ref|YP_004644487.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
 gi|336301514|gb|AEI44617.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
          Length = 831

 Score =  470 bits (1210), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 247/596 (41%), Positives = 360/596 (60%), Gaps = 38/596 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+T++ PA+ +T+A+P GNGRLGAMV+GGV  E L+LNEDTLW+G PGD+ NP A + L
Sbjct: 31  MKLTYDKPARVWTEALPAGNGRLGAMVFGGVEHELLQLNEDTLWSGAPGDHNNPRAREVL 90

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
            +VR L   G+Y EA     ++ G     Y  LGD+ L F   H  +A + Y R LD+  
Sbjct: 91  PEVRRLALEGRYREADRLCKEMLGPYTQSYLPLGDLSLRF--HHGDHAGD-YERHLDVEG 147

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           +  R  Y +G V +TRE F S+PDQV+V +++    G+LSF   LDS L + +  +  + 
Sbjct: 148 SILRTSYRIGAVTYTRELFVSHPDQVLVLRLTADRPGALSFTAKLDSALKHRTAADAGD- 206

Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
           ++++GR P K + P     D+P          G++F A L ++     G    ++   L 
Sbjct: 207 LVLKGRAPAK-VDPNYYRTDEPVRYAEGDAGGGMRFEARLRVQAD---GAELQVDGGALH 262

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           VE +    LLL A++SF+G    P++  +D +  +   L++   L+Y +L  RH DDY+ 
Sbjct: 263 VERATEVTLLLTAATSFNGYDKQPAEQGRDESRAAADDLRAASGLTYDELLQRHQDDYRA 322

Query: 304 LFHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           LF RV++ L  SR+P+ + TD              R+  +    DP L ELLF +GRYLL
Sbjct: 323 LFGRVTLSLGASRAPEGMPTD-------------RRIAEYGAS-DPGLAELLFHYGRYLL 368

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           ISSSR GTQ ANLQGIWN+++   W S   +NIN +MNYW +  CNLSEC EPL  F+  
Sbjct: 369 ISSSREGTQPANLQGIWNKEVRAPWSSNYTLNINAQMNYWPAETCNLSECHEPLLGFIGR 428

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
           L++NG+KT  VNY   GW  HH +DIWA+S+       G  VWA WPM GAWL  HLWEH
Sbjct: 429 LAVNGAKTVSVNYGLRGWTAHHNSDIWAQSAPVGAYGHGDPVWAYWPMAGAWLSAHLWEH 488

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y +  + D+L ++AYP+++  A F LDWL+E  DG+L ++PSTSPEH F+  +G+LA V+
Sbjct: 489 YAFCREEDYLREQAYPVMKEAALFCLDWLVEDADGFLVSSPSTSPEHRFVTAEGELAAVT 548

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            ++TMD+A++ ++F+  I AA  L  + +     +  +L RL+P +I + G + EW
Sbjct: 549 AAATMDLALVHDLFTNCIEAARTLGTDVE-FSAGLQDALDRLQPLQIGQYGQLQEW 603


>gi|386726157|ref|YP_006192483.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
 gi|384093282|gb|AFH64718.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
          Length = 801

 Score =  470 bits (1210), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 246/596 (41%), Positives = 360/596 (60%), Gaps = 38/596 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+T++ PA+ +T+A+P GNGRLGAMV+GG+  E L+LNEDTLW+G PGD+ NP A + L
Sbjct: 1   MKLTYDKPARVWTEALPAGNGRLGAMVFGGMEHELLQLNEDTLWSGAPGDHNNPRAREVL 60

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
            +VR L   G+Y EA     ++ G     Y  LGD+ L F   H  +A + Y R LD+  
Sbjct: 61  PEVRRLALEGRYREADRLCKEMLGPYTQSYLPLGDLSLRF--HHGDHAGD-YERHLDVEG 117

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           +  R  Y +G V +TRE F S+PDQV+V +++    G+LSF   LDS L + +  +  + 
Sbjct: 118 SILRTSYRIGAVTYTRELFVSHPDQVLVLRLTTDRPGALSFTAKLDSALKHRTAADAGD- 176

Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
           ++++GR P K + P     D+P          G++F A L ++     G    ++   L 
Sbjct: 177 LVLKGRAPVK-VDPNYYRTDEPVRYAEGDAGGGMRFEARLRVQAD---GAELQVDGGALH 232

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           VE +    LLL A++SF+G    P++  +D +  + + L++   L+Y +L  RH DDY+ 
Sbjct: 233 VERATEVTLLLTAATSFNGYDKQPAEQGRDESRAAANDLRAASGLTYEELLQRHQDDYRA 292

Query: 304 LFHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           LF RV++ L  SR+P+ + TD              R+  +    DP L ELLF +GRYLL
Sbjct: 293 LFGRVTLSLGASRAPEGMPTD-------------RRITEYGAS-DPGLAELLFHYGRYLL 338

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           ISSSR GTQ ANLQGIWN+++   W S   +NIN +MNYW +  CNLSEC EPL  F+  
Sbjct: 339 ISSSREGTQPANLQGIWNKEVRAPWSSNYTLNINAQMNYWPAETCNLSECHEPLLGFIGR 398

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
           L++NG+KT  VNY   GW  HH +DIWA+S+       G  VWA WPM GAWL  HLWEH
Sbjct: 399 LAVNGAKTVSVNYGLRGWTAHHNSDIWAQSAPVGAYGHGDPVWAYWPMAGAWLSAHLWEH 458

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y +  + D+L ++AYP+++  A F LDWL+E  DG+L + PSTSPEH F+  +G+LA V+
Sbjct: 459 YAFCREEDYLREQAYPVMKEAALFCLDWLVEDADGFLVSAPSTSPEHRFVMAEGELAAVT 518

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            ++TMD+A++ ++F+  I AA  L  + +     +  +L RL+P +I + G + EW
Sbjct: 519 AAATMDLALVHDLFTNCIEAARTLGTDVE-FSAGLQDALDRLQPLQIGQYGQLQEW 573


>gi|379723425|ref|YP_005315556.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
 gi|378572097|gb|AFC32407.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
          Length = 801

 Score =  469 bits (1208), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 247/596 (41%), Positives = 360/596 (60%), Gaps = 38/596 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+T++ PA+ +T+A+P GNGRLGAMV+GGV  E L+LNEDTLW+G PGD+ NP A + L
Sbjct: 1   MKLTYDKPARVWTEALPAGNGRLGAMVFGGVEHELLQLNEDTLWSGAPGDHNNPRAREVL 60

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
            +VR L   G+Y EA     ++ G     Y  LGD+ L F   H  +A + Y R LD+  
Sbjct: 61  PEVRRLALEGRYREADRLCKEMLGPYTQSYLPLGDLSLRF--HHGDHAGD-YERHLDVEG 117

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           +  R  Y +G V +TRE F S+PDQV+V +++    G+LSF   LDS L + +  +  + 
Sbjct: 118 SILRTSYRIGAVTYTRELFVSHPDQVLVLRLTADRPGALSFTAKLDSALKHRTAADAGD- 176

Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
           ++++GR P K + P     D+P          G++F A L ++     G    ++   L 
Sbjct: 177 LVLKGRAPAK-VDPNYYRTDEPVRYAEGDAGGGMRFEARLRVQAD---GAELQVDSGALH 232

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           VE +    LLL A++SF+G    P++  +D +  +   L++   L+Y +L  RH DDY+ 
Sbjct: 233 VERATEVTLLLTAATSFNGYDKQPAEQGRDESRAAADDLRAASGLTYDELLQRHQDDYRA 292

Query: 304 LFHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           LF RV++ L  SR+P+ + TD              R+  +    DP L ELLF +GRYLL
Sbjct: 293 LFGRVTLSLGASRAPEGMPTD-------------RRIAEYGAS-DPGLAELLFHYGRYLL 338

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           ISSSR GTQ ANLQGIWN+++   W S   +NIN +MNYW +  CNLSEC EPL  F+  
Sbjct: 339 ISSSREGTQPANLQGIWNKEVRAPWSSNYTLNINAQMNYWPAETCNLSECHEPLLGFIGR 398

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
           L++NG+KT  VNY   GW  HH +DIWA+S+       G  VWA WPM GAWL  HLWEH
Sbjct: 399 LAVNGTKTVSVNYGLRGWTAHHNSDIWAQSAPVGAYGHGDPVWAYWPMAGAWLSAHLWEH 458

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y +  + D+L ++AYP+++  A F LDWL+E  DG+L ++PSTSPEH F+  +G+LA V+
Sbjct: 459 YAFCREEDYLREQAYPVMKEAALFCLDWLVEDADGFLVSSPSTSPEHRFVTAEGELAAVT 518

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            ++TMD+A++ ++F+  I AA  L  + +     +  +L RL+P +I + G + EW
Sbjct: 519 AAATMDLALVHDLFTNCIEAARTLGTDVE-FSAGLQDALDRLQPLQIGQYGQLQEW 573


>gi|379721553|ref|YP_005313684.1| hypothetical protein PM3016_3718 [Paenibacillus mucilaginosus 3016]
 gi|378570225|gb|AFC30535.1| hypothetical protein PM3016_3718 [Paenibacillus mucilaginosus 3016]
          Length = 806

 Score =  459 bits (1181), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 246/595 (41%), Positives = 352/595 (59%), Gaps = 37/595 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + I F  PA ++T+A+P+GNGRLGAM++GGV  E + LNEDTLW+G P D+ NP+A + L
Sbjct: 14  MNIQFQSPAVYWTEALPVGNGRLGAMIFGGVEKERIALNEDTLWSGYPTDWNNPEAREVL 73

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
             VR L+   +Y EA      + G     Y   GD+ +  +  H +     Y R+LDL+T
Sbjct: 74  PKVRQLIAEQRYEEADRFCKFMMGPFTQSYLPFGDLHILME--HGQVCGRGYERKLDLST 131

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
               V Y +G+V +TRE F+S+PDQVIV +++ S+ G LSF   LDS L + S  + ++ 
Sbjct: 132 GIVTVTYDIGDVSYTREVFASHPDQVIVVRLTASKEGLLSFRAKLDSPLRSSSKPDADH- 190

Query: 193 IIMEGRCPGKRIPPKANANDD--------PKGIQFSAILEIKISDDRGTISALEDKKLKV 244
             + G  P    P   N  +         PK ++F   L    +   G    +E   L +
Sbjct: 191 YTLSGIAPEYVAPNYYNVKNPVHYGDQQAPKSLKFYGRLS---AVHEGGNMKVEADGLSI 247

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G+  A L   A++SFD P I  S + + P   +  A+Q+I    YSD+   H+DD+ +L
Sbjct: 248 VGATSATLYFSAATSFD-PLIGASSTNRVPEQVTEEAIQAILGKKYSDIRKHHVDDHSRL 306

Query: 305 FHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           FHRV + L  S +P+D+ TD             +R+  + +  DP LVELLF +GRYL+I
Sbjct: 307 FHRVDLHLGESSAPQDLPTD-------------QRIAEYGS-RDPGLVELLFHYGRYLMI 352

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           +SSRPGTQ ANLQGIWNED    W S   +NIN EMNYW +  CN++E  EPL DF+  L
Sbjct: 353 ASSRPGTQPANLQGIWNEDTRAPWSSNYTLNINAEMNYWPAETCNMAELHEPLIDFIGRL 412

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHY 478
           ++NG KTA+VNY A GWV HH +D+WA+++       G  VWA WP+GG WL  HLWEHY
Sbjct: 413 AVNGRKTAEVNYGARGWVAHHNSDVWAQTAPVGDYGHGDPVWAFWPLGGVWLTQHLWEHY 472

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
            ++ +  FL   AYP+++  A F LDWL    DGY  T+PSTSPEH+F+  D + A V  
Sbjct: 473 AFSGNEAFLRDTAYPIMKQAALFCLDWLTPNEDGYWITSPSTSPEHKFMIGDQRYA-VGA 531

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           ++TMD+A+I E+FS  I++AE L+ +E+     +L++  +L P +I + G + EW
Sbjct: 532 AATMDLALIGELFSNCITSAETLQVDEE-FANTLLETKQKLLPMQIGKKGQLQEW 585


>gi|337748528|ref|YP_004642690.1| hypothetical protein KNP414_04288 [Paenibacillus mucilaginosus
           KNP414]
 gi|336299717|gb|AEI42820.1| hypothetical protein KNP414_04288 [Paenibacillus mucilaginosus
           KNP414]
          Length = 806

 Score =  458 bits (1179), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 246/595 (41%), Positives = 351/595 (58%), Gaps = 37/595 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + I F  PA ++T+A+P+GNGRLGAM++GGV  E + LNEDTLW+G P D+ NP+A + L
Sbjct: 14  MNIQFQSPAVYWTEALPVGNGRLGAMIFGGVEKERIALNEDTLWSGYPTDWNNPEAREVL 73

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
             VR L+   +Y EA      + G     Y   GD+ +  +  H +     Y R+LDL+T
Sbjct: 74  PKVRQLIAEQRYEEADRFCKFMMGPFTQSYLPFGDLHIVME--HGQVCGRGYERKLDLST 131

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
               V Y +G+V +TRE F+S+PDQVIV +++ S+ G LSF   LDS L + S  + ++ 
Sbjct: 132 GIVTVTYDIGDVSYTREVFASHPDQVIVVRLTASKEGLLSFRAKLDSPLRSSSKPDADH- 190

Query: 193 IIMEGRCPGKRIPPKANANDD--------PKGIQFSAILEIKISDDRGTISALEDKKLKV 244
             + G  P    P   N  +         PK ++F   L    +   G    +E   L +
Sbjct: 191 YTLSGIAPEYVAPNYYNVKNPVHYGDQQAPKSLKFYGRLS---AVHEGGNMKVEADGLSI 247

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G+  A L   A++SFD P I  S + + P   +  A+Q+I    YSD+   H+DD+ +L
Sbjct: 248 VGATSATLYFSAATSFD-PLIGASSTNRMPEQVTEEAIQAILGKKYSDIRKHHVDDHSRL 306

Query: 305 FHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           FHRV + L  S +P+D+ TD              R+  + +  DP LVELLF +GRYL+I
Sbjct: 307 FHRVDLHLGESSAPQDLPTD-------------RRIAEYGS-RDPGLVELLFHYGRYLMI 352

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           +SSRPGTQ ANLQGIWNED    W S   +NIN EMNYW +  CN++E  EPL DF+  L
Sbjct: 353 ASSRPGTQPANLQGIWNEDTRAPWSSNYTLNINAEMNYWPAETCNMAELHEPLIDFIGRL 412

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHY 478
           ++NG KTA+VNY A GWV HH +D+WA+++       G  VWA WP+GG WL  HLWEHY
Sbjct: 413 AVNGRKTAEVNYGARGWVAHHNSDVWAQTAPVGDYGHGDPVWAFWPLGGVWLTQHLWEHY 472

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
            ++ +  FL   AYP+++  A F LDWL    DGY  T+PSTSPEH+F+  D + A V  
Sbjct: 473 AFSGNEAFLRDTAYPIMKQAALFCLDWLTPNEDGYWITSPSTSPEHKFMIGDQRYA-VGA 531

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           ++TMD+A+I E+FS  I++AE L+ +E+     +L++  +L P +I + G + EW
Sbjct: 532 AATMDLALIGELFSNCITSAETLQVDEE-FANTLLETKQKLLPMQIGKKGQLQEW 585


>gi|261406875|ref|YP_003243116.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
 gi|261283338|gb|ACX65309.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
          Length = 802

 Score =  453 bits (1166), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 239/588 (40%), Positives = 346/588 (58%), Gaps = 29/588 (4%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA  + DA+ +GNGRLG MV+GG+  E + LNEDTLW+G P D  N +A   L  V+
Sbjct: 16  YRNPAAEWVDALAVGNGRLGGMVYGGIFRERISLNEDTLWSGHPYDPNNREAAAYLETVQ 75

Query: 77  SLVDSGQYAEATAA-SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
            LV  G+Y EA       + G  ++ YQ LGD+ LE +++      E YRRELDLN A  
Sbjct: 76  KLVFEGKYPEAQRTIEEHMLGPWSESYQPLGDLYLELEETG---KAEHYRRELDLNDAVC 132

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
           R ++++  V + RE F S  DQV+V + +  + G ++ + SLDS L + +     +++ M
Sbjct: 133 RTRFTLNGVRYVRETFVSAVDQVMVVRFTADQPGRIAVSASLDSQLRHQALRVSADKLAM 192

Query: 196 EGRCPGKRIPPKANAND-----DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           +GR P    P  A +ND     + +GI+F A  ++    + G  +   + ++++EG+D  
Sbjct: 193 KGRSPSHVEPLHARSNDPVIYEEGRGIRFEA--QLLALPEGGATTEDGEGRIRIEGADAV 250

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
             LL AS+SF+G   NP    ++P     S L +   LSY +L  RH+ DY+ L+ RV +
Sbjct: 251 TFLLAASTSFNGFDKNPVLEGRNPAELCRSCLDAAAKLSYGELLDRHVQDYRALYGRVEL 310

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGT 369
           +L  +P            +  +P+ ER+++ + D+ D  L  L FQFGRYLL+SSSRPGT
Sbjct: 311 ELD-AP-----------GLQHLPTDERIRALREDKTDEQLAVLFFQFGRYLLLSSSRPGT 358

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN+ + P W     VNIN +MNYW +  CNL+EC EPLF  L  L I G +T
Sbjct: 359 QAANLQGIWNQSMRPPWSCNYTVNINTQMNYWPAEVCNLAECHEPLFRLLEDLRIAGRET 418

Query: 430 AQVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           A  +Y A GWV HH  D+W  ++       G   WA WPMGGAWL  H+WEHY +  DR 
Sbjct: 419 ASAHYKARGWVSHHAVDLWRITTPSGGPSGGPASWAYWPMGGAWLSQHVWEHYRFGGDRT 478

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           FL +  YP+++  A F LD+L+E  DGYL +NPSTSPE+ F  PDG+ A VS  +TMD+A
Sbjct: 479 FLSQVGYPIMKEAALFFLDYLVEDADGYLVSNPSTSPENTFALPDGRKAAVSMDATMDIA 538

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           ++RE+F   + A++ L  + +  +E +  +  RLRP +I   G + EW
Sbjct: 539 LLRELFGNCMEASDHLGIDRELRLE-LAAARARLRPFQIGRRGQLQEW 585


>gi|15613405|ref|NP_241708.1| hypothetical protein BH0842 [Bacillus halodurans C-125]
 gi|10173457|dbj|BAB04561.1| BH0842 [Bacillus halodurans C-125]
          Length = 795

 Score =  444 bits (1141), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 248/596 (41%), Positives = 343/596 (57%), Gaps = 39/596 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +KI F+ PA  +T+A+PIGNG LGAMV+G V  E + LNEDTLW+G P D+ NP A + L
Sbjct: 1   MKIQFDFPASFWTEALPIGNGNLGAMVFGKVEKERIALNEDTLWSGYPKDWNNPKAKEVL 60

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
             VR L+   +Y EA   S  + G     Y   GD+ +  D  H +     Y RELDL+T
Sbjct: 61  PKVRELIAQEKYEEADQLSRDMMGPYTQSYLPFGDLNIFMD--HGQVVAPHYHRELDLST 118

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
               V Y++G V++TRE F + PD+ IV +++ S+ G LSF   LDSLL + S V G   
Sbjct: 119 GIVTVTYTIGGVQYTRELFVTYPDRAIVVRLTASKEGFLSFRAKLDSLLRHVSSV-GAEH 177

Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
             + G  P + + P     ++P         +G+ F   L    + + G    ++   L 
Sbjct: 178 YTISGTAP-EHVSPSYYDEENPVRYGHPDMSQGMTFHGRL---AAVNEGGSLKVDADGLH 233

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V G+  A L   AS+SFD P    S  ++DP+  ++  +++I    Y ++  RHL+DY K
Sbjct: 234 VMGATCATLYFSASTSFD-PSTGASCLERDPSLRTIETIKAICKRGYKEIVNRHLEDYTK 292

Query: 304 LFHRVSIQLSRS--PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           LF+RVS+ L  S  P D+ TD             +R+K + +  D  LVELLFQ+GRYL+
Sbjct: 293 LFNRVSLHLGESIAPADMSTD-------------QRIKEYGS-RDLGLVELLFQYGRYLM 338

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I+SSRPGTQ ANLQGIWNE+    W S   +NIN EMNYW +  CNL+E  +PL  F+  
Sbjct: 339 IASSRPGTQPANLQGIWNEETRAPWSSNYTLNINAEMNYWPAETCNLAELHKPLIHFIER 398

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
           L+ NG KTA++NY A GWV HH  D+W +++       G  VWA WPMGG WL  HLWEH
Sbjct: 399 LAANGKKTAEINYGARGWVAHHNADLWGQTAPVGDFGHGDPVWAFWPMGGVWLTQHLWEH 458

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y +  D  +L   AYP+++  A F LDWLIE   GYL T+PSTSPE  F   + K   VS
Sbjct: 459 YTFGEDEAYLRDTAYPIMKEAALFCLDWLIENEAGYLVTSPSTSPEQRFRIGE-KGYAVS 517

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            ++TMD+++I E F   I AA+ L  +ED  V+ +  +  RL P +I + G + EW
Sbjct: 518 SATTMDLSLIAECFDNCIQAAKRLSIDED-FVKALSDAKQRLLPLQIGKRGQLQEW 572


>gi|251797558|ref|YP_003012289.1| alpha-L-fucosidase [Paenibacillus sp. JDR-2]
 gi|247545184|gb|ACT02203.1| Alpha-L-fucosidase [Paenibacillus sp. JDR-2]
          Length = 790

 Score =  439 bits (1128), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 240/596 (40%), Positives = 347/596 (58%), Gaps = 39/596 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +N  +  +TDA+P GNGRLGAM++GG   E ++LNEDTLW+G P    N +A K L
Sbjct: 1   MKLQYNRASVRWTDALPTGNGRLGAMMFGGSEMERIQLNEDTLWSGGPRYGDNDNAVKVL 60

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
            +VR L++ GQYA A     ++ G     Y  + D+ ++F   +     + YRR L L  
Sbjct: 61  PEVRKLIEEGQYAAADRLCKQMMGTYTQSYLPMADLYIKFLHGNTM---KNYRRALHLGD 117

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           AT+ V+Y +GNV +TR  F S PDQV+V ++  S+ G L+F   L+S L   +  +  + 
Sbjct: 118 ATSTVEYQIGNVTYTRRLFVSYPDQVVVLRLEASQPGKLNFLARLESPLRYETAFD-QDA 176

Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
           +I+ G  P +++ P     D P           ++F   +  ++  D G  S   D  L+
Sbjct: 177 LILRGDAP-EQVDPSYYDTDMPVKYGEPGSANAMRFEGRMAARL--DEGQASYGHDG-LR 232

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V G+    L+  A++SF+G   +P    KD ++ + + L+  + LSY  L  RH++D++K
Sbjct: 233 VTGATAVTLIFSAATSFNGYDRSPGSEGKDESAAASAYLEQAKKLSYESLLQRHVEDHRK 292

Query: 304 LFHRVSIQLSRS--PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           LF+RV + L  S  P D  TD              R++ +    DP LVELL+ +GRYL+
Sbjct: 293 LFNRVELSLGESVAPPDYPTDA-------------RIRDYGAS-DPGLVELLYHYGRYLM 338

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I SSR GTQ ANLQGIWNE+    W     +NIN EMNYW +  CNL++C  PL DF+  
Sbjct: 339 IGSSRKGTQPANLQGIWNEETRAPWSGNYTLNINAEMNYWPAETCNLADCHTPLLDFIGN 398

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
           LS NG KTA  NY A+GW  HH +DIW +S+       G   WA WPMGG WLC HLWEH
Sbjct: 399 LSKNGRKTASTNYGAAGWTAHHNSDIWCQSAPAGDYGHGDPGWAFWPMGGVWLCQHLWEH 458

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y + +D  FL  +AYP+++  A F LDWL E  DG L T+PSTSPEH+F   +G LA VS
Sbjct: 459 YAFGLDEAFLRDKAYPVMKEAALFCLDWLHEDKDGRLITSPSTSPEHKFRTAEG-LAAVS 517

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            +STMD+++I ++F+ +I A+ +L  +E    E++  +  RL P +I E+G + EW
Sbjct: 518 AASTMDLSLIWDLFTNLIEASTILGVDE-PFRERLADTRSRLHPLQIGENGRLQEW 572


>gi|298246864|ref|ZP_06970669.1| Alpha-L-fucosidase [Ktedonobacter racemifer DSM 44963]
 gi|297549523|gb|EFH83389.1| Alpha-L-fucosidase [Ktedonobacter racemifer DSM 44963]
          Length = 809

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 240/596 (40%), Positives = 346/596 (58%), Gaps = 33/596 (5%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ +  PA  + +A+P+GNG LGAMV GG+  E L+LNEDTLW+G P D  NPDA   
Sbjct: 15  PLKLWYRQPATQWLEALPVGNGHLGAMVHGGISEEVLQLNEDTLWSGEPYDTDNPDAVTH 74

Query: 72  LSDVRSLV-DSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           L ++R L+ +   Y  A   + ++ G   + YQ LG + L+F+    +   + Y+R LDL
Sbjct: 75  LPELRRLILEERDYVAAQELAHRMQGPYNESYQPLGYVRLKFEQ---RGEVQAYQRALDL 131

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           NTA A V+Y  G++ F+RE FSS  D ++V +++     +LS    L+SL        G+
Sbjct: 132 NTALATVQYKAGDILFSREVFSSAADDLLVIRLTSDTPHALSLTAHLESLHPFTCAPAGS 191

Query: 191 NQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKK 241
           N+I M GRCP + + P   +  DP          G++F   L+  +  + G ISA  D  
Sbjct: 192 NKIRMTGRCP-RHVDPDYLSTSDPVIYDHGEDGHGMRFETQLQAMV--EGGRISADVDGA 248

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L+VE +      L A++S+ G    P  S      +  + L +  +  Y  L   H++DY
Sbjct: 249 LRVENAHAVTFFLSAATSYRGFASRPDLSAHVLEQQCTTRLAAGMSKGYEVLRAAHINDY 308

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYL 360
           Q+LF RV++ L  S            +   +P+ ER+ + Q    D +L+ L FQ+GRYL
Sbjct: 309 QQLFQRVTLDLGTS------------DGQELPTDERLAAVQKGASDDALLALYFQYGRYL 356

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LI+SSRPGTQ ANLQGIWN+ + P W S   +NIN +MNYW +  CNL+EC  PLFD L 
Sbjct: 357 LIASSRPGTQSANLQGIWNDHVRPAWSSNYTININTQMNYWLAETCNLAECHSPLFDLLE 416

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEH 477
             S++G +TAQV Y   GWV HH  D+W  ++      G   WA W MGGAWLC HLWEH
Sbjct: 417 EASVSGERTAQVYYGCRGWVAHHNMDLWRNTAPVGNGSGGPQWANWNMGGAWLCQHLWEH 476

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y ++ DR FL +RAYP+++  A FLLD+L+E   G+L T PST+PE+ FI   G+L+ VS
Sbjct: 477 YAFSGDRSFLSQRAYPIMKKAAQFLLDFLVEDKQGHLTTCPSTAPENLFITESGELSGVS 536

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             STMD+AI  E+F+  I+A++VL+ ++     ++ ++L RL    I   G + EW
Sbjct: 537 AGSTMDIAITHELFTHCIAASQVLDIDQ-GFAHELAQALARLPQPGIGSYGQLQEW 591


>gi|182419971|ref|ZP_02951207.1| twin-arginine translocation pathway signal [Clostridium butyricum
           5521]
 gi|237666001|ref|ZP_04525989.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
           (Alpha-1,2-fucosidase 2) [Clostridium butyricum E4 str.
           BoNT E BL5262]
 gi|182376222|gb|EDT73807.1| twin-arginine translocation pathway signal [Clostridium butyricum
           5521]
 gi|237658948|gb|EEP56500.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
           (Alpha-1,2-fucosidase 2) [Clostridium butyricum E4 str.
           BoNT E BL5262]
          Length = 799

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 237/594 (39%), Positives = 348/594 (58%), Gaps = 27/594 (4%)

Query: 10  TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDA 68
            + L++ +  PA+ + +A+P+GNGR+GAMV+GGV  E L+LNEDTLW+GVP  + T+ + 
Sbjct: 2   NDKLRLWYTKPAEKWVEALPLGNGRIGAMVFGGVYRERLQLNEDTLWSGVPITEETDENF 61

Query: 69  PKALSDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
              L   R L+  G+Y ++    + KL G   + Y  LG++  +FD+    Y +  Y R+
Sbjct: 62  IDDLEKARKLIFEGKYCKSENIINNKLLGPWNESYLPLGNLYFDFDNEG-DYVD--YERD 118

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L+L  A++ VKY++ N+ + R  F S  D  IV K   S+ G +SF  S DSLL      
Sbjct: 119 LNLEDASSCVKYTMNNIRYKRTTFISKSDNAIVIKFESSKEGKISFKASFDSLLRYTVVT 178

Query: 188 NGNNQIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALEDKKL 242
              N I + G+ P   +P   +       DD +G+ F A+LE+  +   G I + E+  L
Sbjct: 179 ENKNSISLLGKAPIHVLPSYEDGEKPVIYDDKRGMNFKAVLEV--NGINGDIKS-ENGIL 235

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
           KV+ +D  ++ +V  +SF+G         KD      +++Q IR+ +Y +LY  H  +Y+
Sbjct: 236 KVKDADEVIIKIVVHTSFNGYKNEAGTQGKDVNDLCENSIQKIRDKTYVNLYNAHKIEYK 295

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLL 361
            LF R+   L+    D           ++ P+ +R+++F+ ++ D  L+ L FQ+GRYLL
Sbjct: 296 SLFDRLQFTLNSDFTD-----------NSTPTDKRIENFKENKNDLGLISLYFQYGRYLL 344

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           ISSSR GTQ ANLQGIWNEDL P W S    NINLEMNYW +  CNL EC EPLF F+  
Sbjct: 345 ISSSRKGTQPANLQGIWNEDLRPAWSSNYTTNINLEMNYWLAEVCNLQECHEPLFKFIRE 404

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           +S  G +TA++ Y   GW  +H  D+W ++S   G   WA WPM GAWLC+H+WEHY +T
Sbjct: 405 VSEVGKETAKIRYNCRGWTANHNIDLWRQTSPAGGSTEWAYWPMAGAWLCSHIWEHYEFT 464

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
            D  FL K  YP+++ CA FL+DWL+E  +GYL T PS SPE+ FI  +G+ +CVS +ST
Sbjct: 465 NDVKFL-KEMYPIMKSCAEFLVDWLMEDENGYLVTCPSISPENNFITEEGEKSCVSIAST 523

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           MDM+I + +F   I AA +LE ++    E +      L P KI + G + EW +
Sbjct: 524 MDMSITKNLFKNCIDAANILEIDKKFRSE-LKNYYNNLYPYKIGKFGQLQEWFK 576


>gi|298246866|ref|ZP_06970671.1| Alpha-L-fucosidase [Ktedonobacter racemifer DSM 44963]
 gi|297549525|gb|EFH83391.1| Alpha-L-fucosidase [Ktedonobacter racemifer DSM 44963]
          Length = 809

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 241/610 (39%), Positives = 349/610 (57%), Gaps = 36/610 (5%)

Query: 1   MMNAESTSTTN---PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
           M  A++   +    PLK+ +  PA  + +A+P+GNG LGAM+ GG+  E L+LNEDTLW+
Sbjct: 1   MYQAQAAGVSQDKPPLKLWYRQPATQWLEALPVGNGHLGAMIHGGIGEEVLQLNEDTLWS 60

Query: 58  GVPGDYTNPDAPKALSDVRSLV-DSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSH 116
           G P D  NPDA   L ++R L+ +   Y  A   + ++ G   + YQ LG + L+F+   
Sbjct: 61  GEPYDTDNPDAVTLLPELRRLILEERDYVAAQELAHRMQGPYNESYQPLGYVRLKFEQ-- 118

Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
            +   + Y+R LDLNTA A V+Y  G++ F+RE FSS  D ++V +++     +LS    
Sbjct: 119 -RGEVQAYQRALDLNTALATVQYKAGDILFSREVFSSAADDLLVIRLTSDTPHALSLTAH 177

Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKI 227
           L+SL        G+N+I M GRCP + + P      DP          G++F   L+  +
Sbjct: 178 LESLHPFTCAPAGSNKIRMTGRCP-RHVDPDYLPTSDPVIYDHGEDGHGMRFETQLQAMV 236

Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN 287
             + G ISA  D  L+VE +      L A++S+ G    P  S      +  + L    +
Sbjct: 237 --EGGRISADVDGALRVENAHTVTFFLSAATSYRGFASRPDLSAHVLEQQCTTRLAVGMS 294

Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-ED 346
             Y  L   H+ DYQ+LF RV++ L RS            + + +P+ ER+ + Q    D
Sbjct: 295 KGYEVLRAAHISDYQRLFQRVTLDLGRS------------DGENLPTDERLVAVQKGASD 342

Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
            +L+ L FQ+GRYLLISSSRPGTQ A+LQGIWN+ + P W S   +N+N +MNYW +  C
Sbjct: 343 DALLALFFQYGRYLLISSSRPGTQPAHLQGIWNDHVRPAWSSNWTINMNTQMNYWPAETC 402

Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA---DRGKVVWALW 463
           NL+EC  PLFD L   S++G +TAQV Y   GWV HH  D+W  ++      G   WA W
Sbjct: 403 NLAECHSPLFDLLEEASVSGERTAQVYYGCRGWVAHHNMDLWRNTAPVGNGSGDPQWANW 462

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
            MGGAWLC HLWEHY ++ DR FL +RAYP+++  A FLLD+L+E   G+L T PS SPE
Sbjct: 463 NMGGAWLCQHLWEHYAFSGDRSFLSQRAYPIMKKAAQFLLDFLVEDRQGHLTTCPSMSPE 522

Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
           + FI   G+L+ VS  STMD+AI  E+F+  I+A++VL+ ++     ++ ++L RL    
Sbjct: 523 NLFITESGELSGVSAGSTMDIAITHELFTHCIAASQVLDIDQ-GFAHELAQALARLPQPG 581

Query: 584 IAEDGSIMEW 593
           I   G + EW
Sbjct: 582 IGSYGQLQEW 591


>gi|253574360|ref|ZP_04851701.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
           str. D14]
 gi|251846065|gb|EES74072.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
           str. D14]
          Length = 817

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 246/598 (41%), Positives = 353/598 (59%), Gaps = 39/598 (6%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA  +T+A+P+GNGRLGAM++GGV  ET+ LNEDTLW+G P D+ NP A + L 
Sbjct: 6   KLQYDRPATVWTEALPVGNGRLGAMIYGGVERETISLNEDTLWSGYPRDWNNPSARQVLP 65

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           +VR LV  G+Y EA     ++ G   + Y   GD++L F+      A  +YRR LDL  A
Sbjct: 66  EVRKLVREGRYEEADQLGRQMLGPYTESYLPFGDLQLTFEHGA---ACRSYRRTLDLADA 122

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
               +Y+VG V + RE F S+PD++I  +++ S+ G+L+F+  LDS L + + V  +   
Sbjct: 123 IHVTEYTVGKVSYKREIFVSHPDRIIAMRLTCSQPGALAFHARLDSPLRHIAAVE-DGIF 181

Query: 194 IMEGRCPGKRIPPKANAN-----DDPK---GIQFSAILEIKISDDRGTISALEDKKLKVE 245
           +M G  P +  P   NA+      DP     + F   L +  +D R ++   +   ++V 
Sbjct: 182 VMRGTAPERVEPNYVNADRPIRYGDPAVSPAMAFEGRLAVTETDGRVSV---DGDGIRVL 238

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS------YSDLYTRHLD 299
            +  AVL   A++SFD     P   + +     ++A ++  +L+      Y ++  RH++
Sbjct: 239 DATEAVLYFSAATSFDRFDQIPGAGRPESVPADVAAARARADLTGALANRYLEIRARHIE 298

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DYQ LF RVS++L         +T + E +DT    ER        DP LVELLF +GRY
Sbjct: 299 DYQALFSRVSLRLG--------ETAAPEGLDT----ERRIVEYGAADPGLVELLFHYGRY 346

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LLI+SSRPGTQ ANLQGIWN    P W S   +NIN EMNYW +  CNL+EC  PL + +
Sbjct: 347 LLIASSRPGTQAANLQGIWNAMTRPPWSSNWTLNINAEMNYWPAEVCNLAECHWPLLEMI 406

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLW 475
             L+ NG+KTA VNY   GWV HH +DIW +++       G  VWALWP+GG WL  HLW
Sbjct: 407 GNLAENGAKTAAVNYGTRGWVAHHNSDIWGQTAPVGDFGGGDPVWALWPLGGVWLTQHLW 466

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           EHY +  D  +L   AYP+L+  A F LDWLIE   G+L T+PSTSPEH+F   +G +A 
Sbjct: 467 EHYVFGGDVAYLHDFAYPILKDAALFALDWLIEDESGHLVTSPSTSPEHKFRTANG-VAA 525

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +S  STMD+++I E+F+  I AA VL  +E A  E++ ++  RL P ++ + G + EW
Sbjct: 526 ISEGSTMDLSLIWELFTNCIEAAGVLGIDE-AFREELRQARERLLPLQVGKYGQLQEW 582


>gi|261406536|ref|YP_003242777.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
 gi|261282999|gb|ACX64970.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
          Length = 806

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 241/600 (40%), Positives = 354/600 (59%), Gaps = 41/600 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +  PA  +T+A+PIGNGRLG MV+G V  ET+ LNEDTLW+G P D+ NP A +AL
Sbjct: 1   MKLQYVKPATVWTEALPIGNGRLGGMVYGCVERETISLNEDTLWSGYPRDWNNPSALEAL 60

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
            ++R L   G+Y EA     K+ G   + Y  LGD+ L FD   + +   +YRR LD+  
Sbjct: 61  PEIRELASQGRYMEADQLGRKMMGPYTESYLPLGDLCLRFDHGGVFH---SYRRTLDIAN 117

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           A  R +Y +G V +TRE F+S+PDQ+I  +++ S + +L+F+  L+S L  ++     + 
Sbjct: 118 AVQRTEYRIGEVTYTRECFASSPDQMIALRLTSSAACALNFHAYLESPL-RYTVKTEEDM 176

Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
             M G  P +R+ P   ++D P           + F+  L +  +D R T+   +   + 
Sbjct: 177 YAMSGFAP-ERVEPSYVSSDHPIRYGDPDHTAAMAFNGRLAVAETDGRVTV---DSAGIH 232

Query: 244 VEGSDWAVLLLVASSSFDGPFINPS--DSKKDPTSE----SMSALQSIRNLSYSDLYTRH 297
           V  +  AV+   A++SF+G    P   D    P +     +   +++  + S+++L  RH
Sbjct: 233 VLDASEAVIYFTAATSFNGFDQIPGHRDGGDHPAAAAAALTAGTMKAACSQSWTELRDRH 292

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
           ++DY+ LF RVS++L         +T + E++DT    ER++ F    DP LVELLF +G
Sbjct: 293 INDYRSLFDRVSLRLG--------ETLAAEDMDT---GERIERFGA-RDPGLVELLFHYG 340

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLISSSRPGTQ ANLQGIWN    P W S   +NIN +MNYW +  CNL+EC +PL +
Sbjct: 341 RYLLISSSRPGTQAANLQGIWNASTRPPWSSNWTLNINAQMNYWPAEVCNLAECHQPLLE 400

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTH 473
            +  LS+NG++TA V+Y   GW +HH TDIWA ++       G   WALW MGG WL  H
Sbjct: 401 LIRSLSVNGAETAAVHYGTRGWTVHHNTDIWAHTAPVGNYGDGDPSWALWQMGGIWLTQH 460

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           LWEHY Y+ D  +L   AYPL++  + F LDWLIE   G+L T+PSTSPEH+F   +G +
Sbjct: 461 LWEHYAYSGDEAYLRSFAYPLMKEASLFALDWLIENDAGHLVTSPSTSPEHKFRTSEG-M 519

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A +S  +TMD+++I E+F+  + AA +L  +E+   E+      RL P K+   G + EW
Sbjct: 520 AAISEGATMDISLIWELFTNCMEAAGILGVDEE-FREEWSSKRERLLPLKVGRYGQLQEW 578


>gi|329926959|ref|ZP_08281359.1| hypothetical protein HMPREF9412_5205 [Paenibacillus sp. HGF5]
 gi|328938789|gb|EGG35165.1| hypothetical protein HMPREF9412_5205 [Paenibacillus sp. HGF5]
          Length = 812

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 243/600 (40%), Positives = 352/600 (58%), Gaps = 41/600 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +  PA  +T+A+PIGNGRLG MV+GGV  ET+ LNEDTLW+G P D+ NP A +AL
Sbjct: 5   MKLQYVKPATVWTEALPIGNGRLGGMVYGGVERETISLNEDTLWSGYPRDWNNPSAREAL 64

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
            ++R L   G+Y EA     K+ G     Y  LGD+ L FD   + +   +YRR LD+  
Sbjct: 65  PEIRELASQGRYMEADQLGRKMMGPYTQSYLPLGDLCLRFDHGGVFH---SYRRTLDIAN 121

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           A  R +Y +G V +TRE F+S+PDQ+I  +++ S + SL+F+  L+S L  ++     + 
Sbjct: 122 AVQRTEYRIGEVTYTRECFASSPDQMIALRLTSSAACSLNFHAYLESPL-RYTVKTEEDM 180

Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
             M G  P +R+ P   ++D P           + F   L +  +D R T+ A     + 
Sbjct: 181 YAMSGFAP-ERVEPSYVSSDRPIRYGDPEHTAAMAFDGRLAVAETDGRVTMDA---AGIH 236

Query: 244 VEGSDWAVLLLVASSSFDGPFINPS--DSKKDPTSESM----SALQSIRNLSYSDLYTRH 297
           V  +  AV+   A++SF+G    P   D    P + +       +++  + S+++L  RH
Sbjct: 237 VLEASEAVIYFTAATSFNGFDQIPGHRDGGDHPAAAAAAIAAGTMKAACSQSWTELRDRH 296

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
           ++DY+ LF RVS++L         +T +  ++DT    ER++ F    DP LVELLF +G
Sbjct: 297 VNDYRSLFDRVSLRLG--------ETLAVGDMDT---EERIERFGA-RDPGLVELLFHYG 344

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLISSSRPGTQ ANLQGIWN    P W S   +NIN +MNYW +  CNL+EC +PL +
Sbjct: 345 RYLLISSSRPGTQAANLQGIWNASTRPPWSSNWTLNINAQMNYWPAEVCNLAECHQPLLE 404

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTH 473
            +  LS+NG++TA V+Y   GW +HH TDIWA ++       G   WALW MGG WL  H
Sbjct: 405 LIRSLSVNGAETAAVHYGTRGWTVHHNTDIWAHTAPVGNYGDGDPSWALWQMGGIWLTQH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           LWEHY Y+ D  +L   AYPL++  + F +DWLIE   G+L T+PSTSPEH+F   +G L
Sbjct: 465 LWEHYAYSGDEAYLRSFAYPLMKEASLFAMDWLIENDAGHLLTSPSTSPEHKFRTSEG-L 523

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A VS  +TMD+++I E+F+  + AA +L  +E+   E+      RL P ++   G + EW
Sbjct: 524 AAVSEGATMDISLIWELFTNCMEAAVILGVDEE-FREEWSSKRERLLPLQVGRYGQLQEW 582


>gi|410456476|ref|ZP_11310337.1| alpha-L-fucosidase [Bacillus bataviensis LMG 21833]
 gi|409928145|gb|EKN65268.1| alpha-L-fucosidase [Bacillus bataviensis LMG 21833]
          Length = 789

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 231/593 (38%), Positives = 336/593 (56%), Gaps = 33/593 (5%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           +++   A H+T+A+P+GNGR+GAM +GGV +E  +LNEDTLW+G P      +   +L  
Sbjct: 4   LSYKKAASHWTEALPLGNGRIGAMHFGGVETERFQLNEDTLWSGPPQHKREYNDQASLKK 63

Query: 75  VRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
           VR L+D  +Y +A + +  +FG   + Y  LG++ + +       A + Y+R LD+NTA 
Sbjct: 64  VRKLLDEEKYEDAISETKNMFGPYTESYMPLGNLFIHYLHGD---AAQKYQRTLDINTAI 120

Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQII 194
           + VKY+VG + +TRE F S+P QV+  +++ S +  L+ N+SLDSLL  +   N    + 
Sbjct: 121 STVKYTVGKINYTREAFISHPHQVLAIQLTSSAANQLNVNISLDSLL-KYQTANSKEALS 179

Query: 195 MEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLKVE 245
           ++G CP K  P   N ++ P         K I F   L + + D     S   + +L ++
Sbjct: 180 LQGVCPEKCAPVYFNESETPIVYGEFGETKAIHFEGRLGLVLEDGTALTS---NGRLSIQ 236

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            +   VL    ++SF G    P    ++   ++ + L    ++ Y  L   H+ DYQ L+
Sbjct: 237 DATRVVLYFSVATSFKGYDQLPGTDFEELIQKNEAILAKAMSIPYEQLRETHIQDYQTLY 296

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
           +RV   L         +  SEE +DT    ERV  +  D D  +VELLF +GRYLLI+SS
Sbjct: 297 NRVGFSLG--------NKQSEEMLDT---DERVTKYSAD-DLEMVELLFHYGRYLLIASS 344

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           R GTQ ANLQGIWN+     W S   +NIN EMNYW +   NL+EC  PL   +  LS+ 
Sbjct: 345 REGTQPANLQGIWNDITRAPWSSNYTLNINTEMNYWPAEVTNLAECHRPLLQAIKELSVT 404

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           G       Y   GW  HH TD+W  +        G   WA WPM G WLC HLWEHY Y+
Sbjct: 405 GENMVNQRYGLHGWTAHHNTDLWRHAHPVGDERHGDPNWAFWPMSGPWLCRHLWEHYQYS 464

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
            DRDFLEK A+P+++G A F L+WL+E  +GYL T+PSTSPEH F   DG+L  V+  ST
Sbjct: 465 QDRDFLEKEAFPVMKGAAQFCLEWLVEDENGYLITSPSTSPEHHFYTEDGQLGSVTKGST 524

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           MD+ II ++FS  I AAE+   +E+  +++V ++  RL P +I + G + EW+
Sbjct: 525 MDLQIIWDLFSNCIEAAEICGVDEE-WIQQVREAKDRLHPNQIGKYGQLQEWL 576


>gi|253574718|ref|ZP_04852058.1| twin-arginine translocation pathway signal [Paenibacillus sp. oral
           taxon 786 str. D14]
 gi|251845764|gb|EES73772.1| twin-arginine translocation pathway signal [Paenibacillus sp. oral
           taxon 786 str. D14]
          Length = 799

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 233/594 (39%), Positives = 338/594 (56%), Gaps = 24/594 (4%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA  + +A+P+GNGR+G MV+GG+  E + LNEDTLW+G P D  N DA + L 
Sbjct: 13  KLWYDRPASRWEEALPVGNGRIGGMVFGGIHRERIALNEDTLWSGFPRDPQNYDALRHLG 72

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAE---ETYRRELD 129
             R L+ +G+Y EA      K+ G   + YQ LGD+ LE  DS  +      + +RRELD
Sbjct: 73  PARELIFAGKYKEAEKLIDAKMLGRRTESYQPLGDLWLEQGDSATEADGNELQGFRRELD 132

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN- 188
           L T  A   Y +G  E+ RE F S  DQV+V +I+   S  ++   SLDSLL + ++   
Sbjct: 133 LATGIATTTYRIGGAEYRREVFISAVDQVMVLRITALGSEPVNMAASLDSLLRHQAFGGP 192

Query: 189 -GNNQIIMEGRCPG------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
               +I M G+ P       +   P++   +D  G+ F A L + + +  GT+ A    +
Sbjct: 193 AETARICMRGQAPSHIADNYRGDHPQSVLYEDGLGLTFEAQL-LALPEGGGTVQADASGR 251

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L V G+    LLL A++ + G    P     DP     +AL +   L Y  L  RH  D+
Sbjct: 252 LTVSGAKAVTLLLAAATDYAGYDQAPGSGGIDPAERCQAALDAAAALGYEQLRQRHEADH 311

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYL 360
           ++LF RV ++L                    P+ ER+++++  E D  L  L F +GRYL
Sbjct: 312 RRLFGRVELRLG--------RAEEAAERAARPTDERLEAYRRGESDLGLESLYFHYGRYL 363

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           L++SSR GT+ A+LQGIWN  + P W+     NIN +MNYW +    L++C EPLF+ + 
Sbjct: 364 LMASSRTGTEAAHLQGIWNPHVQPPWNCGYTTNINTQMNYWHAEVAGLADCHEPLFELIR 423

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            LS+ G++TA+++Y A GWV HH  D+W +S+   G+  WA WPMGG WLC HLWEHY +
Sbjct: 424 DLSVTGARTARIHYGARGWVAHHNVDVWRQSTPSDGEASWAFWPMGGVWLCRHLWEHYEF 483

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC-VSYS 539
            +D  FL + AYPL++G A F  DWL+ G DG L T PSTSPE++F+ PDG   C VS  
Sbjct: 484 GLDEQFLRETAYPLMKGAAEFCQDWLVPGPDGQLVTAPSTSPENKFLTPDGGEPCSVSAG 543

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           STMD+ +IRE+    I A+E+L  +E A  +++   L R+   +I  DG + EW
Sbjct: 544 STMDLFLIRELLEHTIQASEILGVDE-AWRQELSHMLARMAEPQIGPDGRLQEW 596


>gi|261409383|ref|YP_003245624.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
 gi|261285846|gb|ACX67817.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
          Length = 799

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 234/603 (38%), Positives = 357/603 (59%), Gaps = 28/603 (4%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           + +AE  +TT    + +  PA  + +A+P+GNGRLGAMV+GGV  E ++ NEDTLW+G P
Sbjct: 3   LYSAEHRNTT----LWYRKPAAKWEEALPLGNGRLGAMVFGGVQEECMQWNEDTLWSGFP 58

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKY 119
            D  N +A + L+  R L+ SG+YAEA      ++ G   + +  LGD+ +    S +  
Sbjct: 59  RDTNNYEALRYLAKARELIASGKYAEAEQLIEGRMVGRNTESFLPLGDLLIR--QSGIGD 116

Query: 120 AEETYRRELDLNTATARVKYSVG--NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
           +   YRREL+L+   A  ++  G  N  F+R+ F S  DQV V +   S SGS+   + L
Sbjct: 117 SCSEYRRELNLDMGIASTRFQGGGSNHIFSRDMFISAVDQVGVIRYECSGSGSIQLEIGL 176

Query: 178 DSLLDNHSYVNGNNQIIMEGRCPG------KRIPPKANANDDPKGIQFSAILEIKISDDR 231
            S L + +    +  +++ G  P       +   P +   +D  GI++   + +    D 
Sbjct: 177 RSPLQHRTRTEEDGTLVLHGHAPTHIADNYRGDHPGSVLYEDGLGIRYE--MRLLALTDS 234

Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
           G ++ ++D  +++  +    LL+ A+++F+G   +P     DP+      LQ      + 
Sbjct: 235 GQVT-VDDSGMRICAAGSVTLLIAAATNFEGFDRSPGSGGTDPSGICRERLQDAMRHGFE 293

Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLV 350
            L +RH+ D+Q LF RV +QL R P++       E +I  + + ER+++++   ED +L 
Sbjct: 294 QLRSRHVQDHQALFRRVELQLGR-PEN-------ERSIAALATDERMEAYREGREDSALE 345

Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
            L+FQFGRYLLI+SSRPGTQ A+LQGIWN  + P W+S    NIN EMNYW +    L+E
Sbjct: 346 ALMFQFGRYLLIASSRPGTQPAHLQGIWNPHVQPPWNSDYTTNINTEMNYWPAETTRLNE 405

Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
           C EPL   +  LS++G++TA+++Y A GWV HH  D+W  +S   G+ +WA WPMGGAWL
Sbjct: 406 CHEPLIQMIRELSVSGARTAKIHYGARGWVAHHNVDLWRMASPSDGRAMWAFWPMGGAWL 465

Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
           C HLWE Y +  D ++L + AYPL+ G A F LD LIE  +G+L T+PSTSPE++F+  +
Sbjct: 466 CRHLWERYQFQPDLEYLRETAYPLMRGAALFCLDLLIEDGEGHLVTSPSTSPENQFLTAE 525

Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
           G    VS  STMDMAIIR++F   I A+++LE++ D L E+   ++ RL P  I ++G +
Sbjct: 526 GLPCSVSAGSTMDMAIIRDLFHNCIEASQLLEQD-DELREEWKAAVARLLPYAIDDEGRL 584

Query: 591 MEW 593
           MEW
Sbjct: 585 MEW 587


>gi|374374701|ref|ZP_09632359.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
 gi|373231541|gb|EHP51336.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
          Length = 855

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 234/609 (38%), Positives = 354/609 (58%), Gaps = 35/609 (5%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           A+ + ++  LK+ +  PA  + +A+P+GNG+ GAMV+GGV +E  +LN++TLW+G P   
Sbjct: 20  AQRSQSSQELKLWYTKPASIWEEALPLGNGKTGAMVFGGVGTERFQLNDNTLWSGAPNPG 79

Query: 64  TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
             P  P  L+ VR LV +GQY  A     ++ G  +  Y  + D+ L+   +        
Sbjct: 80  NTPGGPAILAAVRKLVFAGQYDSAAVVWKQMHGPYSARYLPMADLWLKLKGADT--IASA 137

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           Y R+LDL+TATA V Y++  V +TR+ F S PD+ +V +I+  +  ++SF  +L S L  
Sbjct: 138 YYRDLDLHTATATVNYTLHGVRYTRQTFISYPDKAMVIRITADKKNAVSFTAALSSKLKY 197

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALE 238
              +NG N ++++G+ P K +  +A        DD  G   +  +++K+    GT++   
Sbjct: 198 KVALNGKNGLLLKGKAP-KFVANRAYEKEQVVYDDWNGEGTNFEVQVKVIAQEGTVNG-A 255

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D++L V  ++   + L  ++SF+G   +P    KDP  E+ + +Q ++ + +  L   H 
Sbjct: 256 DEQLTVSNANAVTIYLTNATSFNGFDKSPGKEGKDPHVEATATMQRVQVMPFERLLQNHT 315

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFG 357
            DY++LF+RVS  +     +             +P+ ER+K F +  +D  L  L +QFG
Sbjct: 316 TDYRRLFNRVSFAIENRSANA-----------KLPTNERLKVFTKAPDDFGLQTLYYQFG 364

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYL+I++SRPG+Q  NLQGIWN+ + P W S   VNIN EMNYW +   NLSEC +PLFD
Sbjct: 365 RYLMIAASRPGSQPTNLQGIWNDQVQPPWGSNYTVNINTEMNYWPAENTNLSECHQPLFD 424

Query: 418 FLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRG--------KVVWALWPMGGA 468
           F+  L++NG+ TA+VNY +  GW +HH +DIWAK+S   G        K  W+ WPM G 
Sbjct: 425 FMKELAVNGAVTAKVNYGIKEGWTVHHNSDIWAKTSPPGGQGWVDPSAKTRWSCWPMAGG 484

Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPSTSPEHEFI 527
           W  THLWEHY YT D  FL   AYPL++G A FL  WL++    GY  TNPSTSPE+  +
Sbjct: 485 WFSTHLWEHYLYTGDEAFLRNTAYPLMKGAAQFLQHWLVKDPVTGYWVTNPSTSPENT-M 543

Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAE 586
             +GK   V+ +STMDM+IIRE+F+ +I AA VL+   DA     L ++  +L P  I +
Sbjct: 544 KVNGKEYEVAMASTMDMSIIRELFTDVIKAAAVLK--TDAAFAATLSTIKEKLYPFHIGQ 601

Query: 587 DGSIMEWVQ 595
            G + EW +
Sbjct: 602 YGQLQEWFK 610


>gi|338213674|ref|YP_004657729.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
 gi|336307495|gb|AEI50597.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
          Length = 880

 Score =  427 bits (1098), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 241/605 (39%), Positives = 356/605 (58%), Gaps = 41/605 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ F  PA+ + +A+P+GNG+ GAMV+G V  E  +LN++TLW+G P +  NP+ P  L
Sbjct: 43  LKLWFTQPARIWEEALPLGNGKTGAMVFGRVNRERYQLNDNTLWSGYPIEGNNPNGPTVL 102

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFD--DSHLKYAEETYRRELDL 130
            +VR  +  G+Y +A +   K+ G     Y  +GD+ L+F   DS        Y RELDL
Sbjct: 103 PEVRKAIFEGKYDKADSLWKKMQGPYCARYLPMGDLHLDFGFRDS----TATDYYRELDL 158

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           NTA A VKY+VG V +TRE F S+P  V+V +I+ ++  S++ + +L S L         
Sbjct: 159 NTAVAIVKYTVGGVTYTRETFISHPASVMVVRITANKKNSINMSAALSSRLRFSVLPGET 218

Query: 191 NQIIMEGRCPGKRI------PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
           N+I+++G+ P K +      P +   +DDPKG   +  L +K   + G I+  ++ KL +
Sbjct: 219 NEIVLKGKAP-KHVAHRAAEPQQIVYDDDPKGEGTNFELRVKAQTEGGKITN-QNGKLLI 276

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G++     +  ++SF+G   +P    KDP+ E+ + L+   + SY+ L + H+ DYQ+L
Sbjct: 277 SGANAVTYYVAGATSFNGFDKSPGREGKDPSVETNAILKKAGSQSYAQLKSAHISDYQRL 336

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER-VKSFQTDEDPSLVELLFQFGRYLLIS 363
           F RVS+ L   P+ +            +P+ ER ++      D  L  L +QFGRYLLI+
Sbjct: 337 FQRVSLDLGTDPEAL-----------KLPTDERLIRQQNGPADTHLQTLYYQFGRYLLIA 385

Query: 364 SSRPGTQ-----VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           SSR G        ANLQGIWN+ + P W S    NIN EMNYW +   NLSEC  P+  F
Sbjct: 386 SSRNGASGAAGTPANLQGIWNDHIQPPWGSNFTTNINFEMNYWLAENANLSECHLPMLQF 445

Query: 419 LTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSAD-------RGKVVWALWPMGGAWL 470
           + +L++NG+KTA+VNY +  GW+ HH TDIWAK+SA        R +  W+ W M GAWL
Sbjct: 446 IGHLAVNGAKTAKVNYGINEGWITHHGTDIWAKTSAGGGYEWDPRSRGSWSSWLMAGAWL 505

Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
            THLWEHY +T D+ FL  + YPL++  A F+L WL+E   G+L TNPS+SPE+  +   
Sbjct: 506 STHLWEHYQFTGDQTFLRDQGYPLMKSAAQFMLHWLLEDGQGHLITNPSSSPENT-VKIS 564

Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
           GK   ++ +STMDMAIIRE+FS  I AA+ L K + A   ++ ++  RL P +I + G +
Sbjct: 565 GKEYQITMASTMDMAIIRELFSDCIQAAKQL-KTDAAFQTQLEQAKARLYPYQIGQYGQL 623

Query: 591 MEWVQ 595
            EW +
Sbjct: 624 QEWYR 628


>gi|158430814|pdb|2RDY|A Chain A, Crystal Structure Of A Putative Glycoside Hydrolase Family
           Protein From Bacillus Halodurans
 gi|158430815|pdb|2RDY|B Chain B, Crystal Structure Of A Putative Glycoside Hydrolase Family
           Protein From Bacillus Halodurans
          Length = 803

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 245/595 (41%), Positives = 332/595 (55%), Gaps = 37/595 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LKI F+ PA  +T+A+PIGNG LGA V+G V  E + LNEDTLW+G P D+ NP A + L
Sbjct: 3   LKIQFDFPASFWTEALPIGNGNLGAXVFGKVEKERIALNEDTLWSGYPKDWNNPKAKEVL 62

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
             VR L+   +Y EA   S    G     Y   GD+ +  D  H +     Y RELDL+T
Sbjct: 63  PKVRELIAQEKYEEADQLSRDXXGPYTQSYLPFGDLNIFXD--HGQVVAPHYHRELDLST 120

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
               V Y++G V++TRE F + PD+ IV +++ S+ G LSF   LDSLL + S V G   
Sbjct: 121 GIVTVTYTIGGVQYTRELFVTYPDRAIVVRLTASKEGFLSFRAKLDSLLRHVSSV-GAEH 179

Query: 193 IIMEGRCP--------GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
             + G  P         +  P +    D  +G  F   L    + + G    ++   L V
Sbjct: 180 YTISGTAPEHVSPSYYDEENPVRYGHPDXSQGXTFHGRL---AAVNEGGSLKVDADGLHV 236

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G+  A L   AS+SFD P    S  ++DP+  ++  +++I    Y ++  RHL+DY KL
Sbjct: 237 XGATCATLYFSASTSFD-PSTGASCLERDPSLRTIETIKAICKRGYKEIVNRHLEDYTKL 295

Query: 305 FHRVSIQLSRS--PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           F+RVS+ L  S  P D  TD             +R+K + +  D  LVELLFQ+GRYL I
Sbjct: 296 FNRVSLHLGESIAPADXSTD-------------QRIKEYGS-RDLGLVELLFQYGRYLXI 341

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           +SSRPGTQ ANLQGIWNE+    W S   +NIN E NYW +  CNL+E  +PL  F+  L
Sbjct: 342 ASSRPGTQPANLQGIWNEETRAPWSSNYTLNINAEXNYWPAETCNLAELHKPLIHFIERL 401

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHY 478
           + NG KTA++NY A GWV HH  D+W +++       G  VWA WP GG WL  HLWEHY
Sbjct: 402 AANGKKTAEINYGARGWVAHHNADLWGQTAPVGDFGHGDPVWAFWPXGGVWLTQHLWEHY 461

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
            +  D  +L   AYP+ +  A F LDWLIE   GYL T+PSTSPE  F   + K   VS 
Sbjct: 462 TFGEDEAYLRDTAYPIXKEAALFCLDWLIENEAGYLVTSPSTSPEQRFRIGE-KGYAVSS 520

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           ++T D+++I E F   I AA+ L  +ED  V+ +  +  RL P +I + G + EW
Sbjct: 521 ATTXDLSLIAECFDNCIQAAKRLSIDED-FVKALSDAKQRLLPLQIGKRGQLQEW 574


>gi|375148572|ref|YP_005011013.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
 gi|361062618|gb|AEW01610.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
          Length = 850

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 235/597 (39%), Positives = 347/597 (58%), Gaps = 31/597 (5%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +N PA  + +A+P+GNG+ GAMV+GGV +E L+LN++TLW+G P    NP+ P  L
Sbjct: 25  LKLWYNKPADAWEEALPLGNGKTGAMVFGGVATERLQLNDNTLWSGYPEAGNNPNGPTVL 84

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
             VR  V  G Y +A A   K+ G  +  Y  LGD+           A  TY RELDLN 
Sbjct: 85  PQVRQAVFEGDYEKAAALWKKMQGPYSARYLPLGDLWWRVQSKDTLPA--TYYRELDLNK 142

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           A + V+Y +G V + RE F S P +++V +I+  + G +   + L S L         + 
Sbjct: 143 AVSTVRYKIGEVTYQRETFISYPSKLLVMRITADKKGVIDGVLDLTSKLHFKVTTTDADY 202

Query: 193 IIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
           +++ G+ P     +   P+    D   G   +  + +KI  + G +    +  LKV G++
Sbjct: 203 LVLRGKAPKFVANRDYEPQQVGYDSANGEGMNFEVHVKIKTEGGKVEQ-SNNALKVSGAN 261

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
              + L  ++SF+G   +P    KDP++E+ + LQ    L+Y  L   H+ DYQ LF RV
Sbjct: 262 TVTIYLSEATSFNGFNKSPGLEGKDPSTEAKANLQKALRLTYEQLKAAHMRDYQNLFKRV 321

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRP 367
            + L                   +P+ ER+K + ++  D  L  L +QFGRYLLI+SSRP
Sbjct: 322 ELNLGPG-----------NGAAKLPTDERLKQYASNPTDQQLQVLYYQFGRYLLIASSRP 370

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           G++ ANLQGIWN+ + P W S    NIN EMNYW +   NLSEC +PLFDF+  L++NG+
Sbjct: 371 GSRPANLQGIWNDHIQPPWGSNYTTNINTEMNYWLAENTNLSECHQPLFDFMKELAVNGA 430

Query: 428 KTAQVNY-LASGWVIHHKTDIWAKSSA-------DRGKVVWALWPMGGAWLCTHLWEHYN 479
           +TA+VNY ++ GWV+HH +D+WAK+S         +G   W+ WPM GAWL THLWEHY 
Sbjct: 431 QTAKVNYNISEGWVVHHNSDLWAKTSPPGGWDWDPKGMPRWSAWPMAGAWLSTHLWEHYL 490

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
           YT D+ FL K A+PL++G A F++ WLI +  +G L TNPSTSPE+  +   GK   V  
Sbjct: 491 YTGDKTFL-KNAWPLMKGAAQFMIHWLITDPANGLLVTNPSTSPENT-MKIKGKEYQVGM 548

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           ++TMDM+IIRE+F+A+I  + VL + +    ++V+K+  +L P  I + G + EW +
Sbjct: 549 ATTMDMSIIRELFTAVIKTS-VLLQTDAVFRDQVIKAKEKLYPFHIGQYGQLQEWFK 604


>gi|326800263|ref|YP_004318082.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
 gi|326551027|gb|ADZ79412.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
          Length = 855

 Score =  424 bits (1089), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 239/599 (39%), Positives = 362/599 (60%), Gaps = 34/599 (5%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PA  + +A+P+GN + GAMV+GGV  E  +LN++TLW+G P    NP+ PK L
Sbjct: 30  LKLWYTKPASVWEEALPLGNAKTGAMVFGGVQVERYQLNDNTLWSGFPNPGNNPNGPKIL 89

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFD--DSHLKYAEETYRRELDL 130
             VR  +  G Y +A +   ++ G  +  Y  LGD+ L+F   DS       +Y+R+LDL
Sbjct: 90  PRVRRAIFDGDYEKAASLWKQMQGPYSARYLPLGDLLLDFHRPDS----LTTSYQRDLDL 145

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           + A + +KY+   V +TRE F S PD+ +  +I+ ++ G+++F+V+L S L + +    +
Sbjct: 146 DKALSTIKYTYRGVMYTRETFISRPDKTMAIRITANKPGAVAFDVALTSKLKHQTKAARH 205

Query: 191 NQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           + +I++G+ P     +   P+    DD  G   +  + +K+    G +   +D +L V G
Sbjct: 206 DYLILQGKAPKFVANREYEPQQIVYDDRDGEGMNFEIHVKVQAIGGEVKT-DDNRLCVSG 264

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D  +L L  ++SF+G   +P  + KDP  E+ + ++     SY ++ +RH+ D+  LF 
Sbjct: 265 ADSVILWLTEATSFNGFDKSPGLNGKDPAVEAAACMERASKSSYQEVKSRHIADHAALFR 324

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISSS 365
           RVSI L + P+ +            +P  ER+ +  +   D +L  L +Q+GRYLLI+SS
Sbjct: 325 RVSIDLGKDPEAV-----------RLPIDERMLRLAEGKSDNALQALYYQYGRYLLIASS 373

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPG + ANLQGIWN+ + P W S    NIN EMNYW +   NLSEC +PLFDF+  L++N
Sbjct: 374 RPGGRPANLQGIWNDMVQPPWGSNYTTNINTEMNYWLAENTNLSECHQPLFDFMKELAVN 433

Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSSAD-------RGKVVWALWPMGGAWLCTHLWEH 477
           G+ TA+VNY +  GWV HH +D+WAK+S         +G   W+ WPM GAW CTHLWEH
Sbjct: 434 GAVTAKVNYNIDDGWVTHHNSDLWAKTSPPGGYDWDPKGMPRWSAWPMAGAWFCTHLWEH 493

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACV 536
           Y YT D+ FL++ AYPL++G ASF+L WLIE     YL TNPSTSPE+  +   GK   +
Sbjct: 494 YLYTGDKKFLKEEAYPLMKGAASFMLHWLIEDPGSHYLITNPSTSPENT-VKIAGKEYQL 552

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S +STMDMAIIRE+F+A I +A++L  ++D   EK++ +  +L P  I + G + EW Q
Sbjct: 553 SMASTMDMAIIRELFNACIRSADILGSDKD-FKEKLIMAKAKLYPYHIGQYGQLQEWYQ 610


>gi|315649545|ref|ZP_07902630.1| Alpha-L-fucosidase [Paenibacillus vortex V453]
 gi|315275018|gb|EFU38393.1| Alpha-L-fucosidase [Paenibacillus vortex V453]
          Length = 796

 Score =  421 bits (1082), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 232/590 (39%), Positives = 345/590 (58%), Gaps = 24/590 (4%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +  PA  + +A+P+GNGRLGAMV+GGV  E ++ NEDTLW+G P D  N +A + L+
Sbjct: 10  KLWYREPAAKWEEALPLGNGRLGAMVFGGVEEERIQWNEDTLWSGFPRDTNNYEARRHLA 69

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
             R L+ SG+Y EA      K+ G   + +  LGD+ +     H    E  YRRELDL+T
Sbjct: 70  AARKLITSGKYKEAEELIEDKMVGRGTESFLPLGDLLIRQSGIHGHRTE--YRRELDLDT 127

Query: 133 ATARVKY-SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY-VNGN 190
             A V++ S G+  + R+ F S  DQV V + +G     +  ++ LDS L + +     +
Sbjct: 128 GIASVRFQSGGSATYARDMFISAVDQVAVIRCAGPNYEDIRLDIRLDSPLRHGTRRCAED 187

Query: 191 NQIIMEGRCPG------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
             +++ G  P       K   P +   ++  GI++   + +    D G ++ ++D+ + +
Sbjct: 188 GSLVLYGHAPTHIADNYKGDHPGSVLYEEGLGIRYE--MRLLALPDSGQVT-VDDRGMHI 244

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            GS    LL+ A+++F G   +P     DP+      LQ      Y +L  RH+ D+Q L
Sbjct: 245 NGSGPVTLLIAAATNFAGFDRSPGSGGIDPSVICRKRLQDAVQHGYEELRARHVKDHQAL 304

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLIS 363
           F RV ++L        +  C E + ++  + ER+K++ +  EDP+L  L+FQFGRYLL++
Sbjct: 305 FRRVDLRLE-------SLDC-ERSTESAATDERMKAYREGQEDPALEALMFQFGRYLLMA 356

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSRPGTQ A+LQGIWN  + P W+S    NIN EMNYW +   +LSEC EPL   +  LS
Sbjct: 357 SSRPGTQPAHLQGIWNPHVQPPWNSDYTTNINTEMNYWPAETTHLSECHEPLIQMIRELS 416

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
           ++G +TA+++Y A GWV HH  D+W  +S   G+ +WA WPMGGAWLC HLWE Y +  D
Sbjct: 417 VSGRRTAKIHYGARGWVAHHNVDLWRMASPSDGRAMWAFWPMGGAWLCRHLWERYQFQPD 476

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
            ++L   AYPL+   A F LDWLIE   G+L T+PSTSPE++F+  +G    VS  STMD
Sbjct: 477 LEYLRGTAYPLMREAALFCLDWLIEDGKGHLVTSPSTSPENQFLTAEGVPCSVSAGSTMD 536

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           MAIIR++F   I A+++L ++ D L E+   +  RL P  +  +G +MEW
Sbjct: 537 MAIIRDLFHNCIEASQLLGQDAD-LREEWESAAARLLPYGMDGEGKLMEW 585


>gi|325103197|ref|YP_004272851.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
 gi|324972045|gb|ADY51029.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
          Length = 868

 Score =  420 bits (1080), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 235/606 (38%), Positives = 356/606 (58%), Gaps = 36/606 (5%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           A+S S  N L + +  P+K + +A+PIGNG  GAMV+GGV  E  +LN  TLW+G P   
Sbjct: 20  AQSKSDPN-LVLWYKEPSKIWEEALPIGNGFQGAMVFGGVGKERFQLNNGTLWSGFPNPG 78

Query: 64  TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEE 122
            NP  P AL  VR  +D G YA+A     K    P    Y  + D+ L+F+  H     +
Sbjct: 79  NNPKGPAALPQVRKAIDDGDYAKAAEIWKKNNQGPYSARYLTMADLYLDFN--HKDSDVQ 136

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            Y+R LDLN+A   V Y VG V + RE   SNPD+V+  +++  +  +LSF   L S L 
Sbjct: 137 AYKRSLDLNSAVHTVTYKVGGVTYKRETLMSNPDKVMAIRLTADKKNALSFTTDLISKLK 196

Query: 183 NHSYVNGNNQIIMEGRCPGKRI------PPKANANDDPKGIQFSAILEIKISDDRGTISA 236
             +   G N +I++G+ P K +      P +   +++ +G+ F   + +K+ ++ GT+  
Sbjct: 197 YKTNAVGQNALILKGKAP-KHVAHRPTEPEQIIYDENGEGMTFE--VHLKVLNEGGTVKT 253

Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
           + +K + V+ ++   + L + +SF+G   +P+ + K+P+ E+ + L +     Y  +   
Sbjct: 254 VGNK-ITVQNANAVTIYLSSGTSFNGFDKSPTIAGKNPSIEASANLAAAVGKKYDVMKQA 312

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQ 355
           H+ DY KLF+RV ++L   P           ++  +P+  R+ +  Q   D  L  L FQ
Sbjct: 313 HIADYSKLFNRVVLKLGNRP-----------DLANLPTNIRLSRQGQKGNDQELQVLYFQ 361

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRYL+ISSSRPG+Q  NLQG+WN+ + P W S   VNIN EMNYW +   NLSE   PL
Sbjct: 362 FGRYLMISSSRPGSQATNLQGLWNDHVQPPWGSNYTVNINTEMNYWLAENTNLSELHYPL 421

Query: 416 FDFLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSAD-------RGKVVWALWPMGG 467
           FDFL  L++NG +TA++NY +  GWV+HH TDIWAK+S         +G   W+ WPMGG
Sbjct: 422 FDFLERLAVNGKETAKINYNINKGWVLHHNTDIWAKTSPTGGYDWDPKGSPRWSAWPMGG 481

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
           AWL THL++HY +T D+ FL+++AYPL++G A FLL WL+    GYL TNPSTSPE+ F 
Sbjct: 482 AWLSTHLYDHYLFTGDKRFLKEKAYPLMKGAAEFLLAWLVPDQSGYLITNPSTSPENTFT 541

Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
             + K   +S  +TMD+ I+ E+F+A I +A+ L+ + +  V+++  +  +L P +I + 
Sbjct: 542 I-NKKQYEISKGTTMDLGIMLELFNACIQSAKALDTDAN-FVKQLEAAKAKLYPYQIGKY 599

Query: 588 GSIMEW 593
           G + EW
Sbjct: 600 GQLQEW 605


>gi|403743768|ref|ZP_10953247.1| aliphatic sulfonates family ABC transporter, periplasmic
           ligand-binding protein [Alicyclobacillus hesperidum
           URH17-3-68]
 gi|403122358|gb|EJY56572.1| aliphatic sulfonates family ABC transporter, periplasmic
           ligand-binding protein [Alicyclobacillus hesperidum
           URH17-3-68]
          Length = 804

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 239/596 (40%), Positives = 329/596 (55%), Gaps = 30/596 (5%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP-DAPKA 71
           + + +  PA  +TDA+PIGNGRLG MV+GG+  E + LNEDTLW+G P     P  A + 
Sbjct: 6   VALWYEKPAVAWTDALPIGNGRLGGMVFGGIEHERIHLNEDTLWSGYPRTLAVPRKAEET 65

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           L  VR LV +G+Y EA  AS  L G  ++ Y  LG +EL F+   L +    YRR LDL 
Sbjct: 66  LRQVRELVLAGRYQEAHEASRGLSGPYSESYLPLGWLELVFEHGDLAH---DYRRSLDLR 122

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TA A V Y +G  +FTRE F S+PD+ +V  ++      L+F + + S L  H+      
Sbjct: 123 TAVATVSYRIGRTQFTREMFVSHPDEAMVIHLTADGPLPLAFTLCMGSKL-RHAIAEMAG 181

Query: 192 QIIMEGRCPGKRIPP--------KANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
            + + G+ P    P         +  A DDP+ I+F+A + +   D  GT++   D  L+
Sbjct: 182 DLALTGQAPIHVAPSYEVDDHPIQYAAPDDPRPIRFAARITVARCD--GTVAWCGDG-LR 238

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           +EG+    LLL A ++F    + P D   D ++     L  +R   +++L +RH+ D+Q+
Sbjct: 239 IEGATRVTLLLGAGTNFRSFALRP-DEALDVSANLGRQLADLRTTPFAELKSRHVADHQR 297

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           LF RV   L+    D        E    +P+ E +  +       LVELLF +GRYLLI+
Sbjct: 298 LFDRVEFVLADPRPD------ENEGYRDLPTDELIARYGVHAK-RLVELLFHYGRYLLIA 350

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSRPGTQ ANLQGIWN+   P W S   +NIN EMN+W    CN+ EC EPL   +  L+
Sbjct: 351 SSRPGTQPANLQGIWNDATRPPWSSNLTLNINAEMNFWPVEVCNIGECHEPLLRMIGELA 410

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYN 479
             G + A+  Y   GWV HH TDIW  + A     RG   W++WPM G WLC HLWEHY 
Sbjct: 411 QTGREVAK-RYGCRGWVAHHNTDIWRMAHAAGGDGRGDPSWSMWPMAGPWLCAHLWEHYL 469

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           ++ D  FL+  AYPL+   A F +DWL     G     PSTSPEH F+  DG+ A VS S
Sbjct: 470 FSRDHAFLQNVAYPLMRDAALFCIDWLASDPSGRGLAIPSTSPEHHFVTQDGQKAAVSAS 529

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           STMD+ ++RE+FS  I AA  L  + +   E       RLRP +I  DG + EW++
Sbjct: 530 STMDVMLMRELFSHCIEAASTLGVDAELSAEWAAWQ-ERLRPLRIGRDGRLQEWME 584


>gi|329930748|ref|ZP_08284172.1| Alpha-L-fucosidase 2 family protein [Paenibacillus sp. HGF5]
 gi|328934680|gb|EGG31180.1| Alpha-L-fucosidase 2 family protein [Paenibacillus sp. HGF5]
          Length = 673

 Score =  412 bits (1060), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 232/585 (39%), Positives = 331/585 (56%), Gaps = 51/585 (8%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
           PA  + +A+PIGNGRLGAM++GG+  E L+LNED++W G P D  N DA   L  +R LV
Sbjct: 21  PATDWNEALPIGNGRLGAMIFGGIAEEKLQLNEDSVWYGGPRDRNNEDALPHLPVIRELV 80

Query: 80  DSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATAR 136
            +G+  EA A A + + G P     Y  LGD+ + FD   +    + Y RELDL    +R
Sbjct: 81  MNGRLHEAEALAGMAMAGLPESQRHYLPLGDLLISFDRHEMA---KDYERELDLEHGVSR 137

Query: 137 VKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ---- 192
             Y +G + +TRE F+S PDQ I+ +IS  + G++S     +    N  Y+   ++    
Sbjct: 138 SSYRIGEIRYTRELFASYPDQAIIMRISADKPGAVSLKARFNR--RNWRYMEKTDKWDQQ 195

Query: 193 -IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
            ++M+G C GK             G  F AI++   +   G +     + L VE +D   
Sbjct: 196 GLVMQGECGGK------------GGSSFCAIVK---ALSEGGVCKTIGEYLLVENADAVT 240

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           LLL A ++F  P         DP       L+ +  +SY++L  RH+ DY +LF RV++ 
Sbjct: 241 LLLTAGTTFRHP---------DPELYGKRRLEELSQVSYTELLVRHIKDYTELFGRVTLS 291

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
           LS SP             +T+P+ +R+K + + +ED  L+E  FQFGRYLLISSSRPG+ 
Sbjct: 292 LSESPGK-----------NTLPTDDRLKRYREGEEDNGLIETYFQFGRYLLISSSRPGSL 340

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWN+  +P WDS   +NIN +MNYW +  CNL+EC EPLF+ +  +   G  TA
Sbjct: 341 PANLQGIWNDSYTPPWDSKFTININTQMNYWPAENCNLAECHEPLFELIERMREPGRVTA 400

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
            V Y   G+  HH TDIWA ++     +  + WPMG AWLC HLWEHY +  DR FL  R
Sbjct: 401 GVMYGCRGFTAHHNTDIWADTAPQDTYLPASFWPMGAAWLCLHLWEHYRFGQDRYFL-AR 459

Query: 491 AYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
           AY  ++  A FLLD+LIE  +G L T PS SPE+ +  P+G+   +   +TMD  II  +
Sbjct: 460 AYETMKEAALFLLDYLIEDGEGRLVTCPSVSPENRYKLPNGETGVLCAGATMDFQIIEAL 519

Query: 551 FSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           F A I + E++EK+E A  E++  +L RL   +I + G I EW++
Sbjct: 520 FEACIRSGEIIEKDE-AFREELAAALKRLPKPQIGKYGQIQEWME 563


>gi|374603684|ref|ZP_09676661.1| alpha-L-fucosidase [Paenibacillus dendritiformis C454]
 gi|374390787|gb|EHQ62132.1| alpha-L-fucosidase [Paenibacillus dendritiformis C454]
          Length = 818

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 237/624 (37%), Positives = 332/624 (53%), Gaps = 44/624 (7%)

Query: 2   MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           M    T+    LK+ +  PA  +T+A+P+GNGR GAMV+GGV  E ++LNEDTLW G P 
Sbjct: 1   MATSKTARDEDLKLWYTRPADKWTEALPLGNGRFGAMVFGGVRRERIQLNEDTLWAGHPV 60

Query: 62  DYTNPDAPKALSDVRSLVDSGQYAEAT----AASVKLFGHPADVYQLLGDIELEFDDSHL 117
              NP A + L + R L+ +G+YAEA        V   GH    YQ LG++ LEFD    
Sbjct: 61  SEYNPAAGELLPEARQLLHAGKYAEAMELIGTRMVGTEGHGIQPYQPLGNVYLEFDGPEA 120

Query: 118 KYAEET-------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS 170
                        Y+REL L  A A      G+    R  F S  DQV+V ++       
Sbjct: 121 TGGAAGGKPAAPAYKRELQLKQALAVTSCQAGDSLEKRTVFVSAADQVMVVRLESDSPYG 180

Query: 171 LSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGK-------RIPP------KANANDDPKGI 217
           +   VSLDS L++    +    ++M GRCP +        +PP       A + +  + +
Sbjct: 181 VRVTVSLDSRLEHSVAADDEGGLVMTGRCPQRVRNHNNSAVPPIAYDGDGAESEESGRAL 240

Query: 218 QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSE 277
           +F+  + +   D    +  + D +LK+ G     LL  A++SF G    P ++   P   
Sbjct: 241 RFAVKMAVLEEDGETRVRCI-DNRLKIGGGRAVTLLFAAATSFRGYDRMPDEAAVPPAER 299

Query: 278 SMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER 337
             + L+     SY  L   H+ DY++LF RVS++L     D   D   +     +P+ ER
Sbjct: 300 CHAVLKEALRRSYGQLLDAHIQDYRRLFERVSLEL-----DDADDAGRK-----LPTDER 349

Query: 338 VKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINL 396
           ++       D  +  LLFQ+GRYLLISSSRPGTQ ANLQGIWN+++ P W+   H+NINL
Sbjct: 350 LRRIGAGGSDNGIYALLFQYGRYLLISSSRPGTQAANLQGIWNDEVQPPWNCDYHLNINL 409

Query: 397 EMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSAD-R 455
           +MNYW +  C+L EC +PLF  +  L++ G+  ++V+Y   GW+ H  TD W   +    
Sbjct: 410 QMNYWLAEVCHLQECHDPLFRLMEELAVTGAAASRVHYGCGGWMAHAMTDQWRNHNVGPS 469

Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYL 514
           G   WA WPMGGAWLC HLWEHY YT DR FL +RA+PLL G A+FLLDW++ E  DG L
Sbjct: 470 GDPSWAYWPMGGAWLCRHLWEHYEYTRDRAFLAERAWPLLRGAAAFLLDWVVQEDEDGRL 529

Query: 515 ETNPSTSPEHEFIAPDG----KLAC-VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALV 569
            T+PS SPE+ F+ P      K  C VS SS MDM I  +++  +  A +VL  + D   
Sbjct: 530 MTSPSVSPENAFLIPGAEEGEKQTCTVSQSSAMDMQIAYDLWMIVKQANDVLGLD-DTFA 588

Query: 570 EKVLKSLPRLRPTKIAEDGSIMEW 593
                +  RL   +I   G +MEW
Sbjct: 589 RACEAAALRLPQPRIGARGQLMEW 612


>gi|379721956|ref|YP_005314087.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
 gi|378570628|gb|AFC30938.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
          Length = 768

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 234/592 (39%), Positives = 334/592 (56%), Gaps = 33/592 (5%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA  + +A+P GNGRLGAMV+GG   E + LNEDTLW+G P D    DA   L   R
Sbjct: 12  YEQPAGSWFEALPAGNGRLGAMVFGGTCRERIALNEDTLWSGEPRDTVREDAHLHLDPAR 71

Query: 77  SLVDSGQYAEATAASVKLFGHP-ADVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTAT 134
            L+  G++AEA     +    P  + Y  LGD+EL+ D    K  E T YRREL L+ A 
Sbjct: 72  KLIFEGRHAEAEEIIQQYMQGPDIESYLPLGDLELQSD----KEGEITDYRRELILDEAV 127

Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQII 194
            R +Y       TRE F S  DQV+  +I   +   L+  +SL S L       G++ + 
Sbjct: 128 VRTQYRTDGALQTRELFVSAADQVLALRIESEQP--LNLTISLGSPLQYAVRRTGSSGMA 185

Query: 195 MEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
           + GRCP  R+ P    +D+P      +GI F A L +  + ++G I +    +++V    
Sbjct: 186 LSGRCP-VRVLPNTVRSDEPARYEEGRGIAFEAALHV--TAEKGRIES-SGGRIRVVSGR 241

Query: 249 WAVLLLVASSSFDGPFINPSDSK--KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
              LLL A++S+DG   +P+ +     P +     L+    L YS L  RHL ++ + + 
Sbjct: 242 GVTLLLAAATSYDGFDRDPAAASLAGRPRALCAERLREAAGLGYSRLKERHLREHAEKYG 301

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSS 365
           RV ++L        +   S  + D +P+  R+++  Q  +DP L  L FQ+GRYLL+SSS
Sbjct: 302 RVDLELG------GSAADSGADADALPTDARIRAAAQGADDPGLAALFFQYGRYLLLSSS 355

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPGTQ ANLQGIWN+ L P W S+   NIN++MNYW +   NL+EC EPL  F+  L  +
Sbjct: 356 RPGTQPANLQGIWNDKLQPPWCSSWTANINVQMNYWPAEAANLAECHEPLLRFVDDLRES 415

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G + A V+Y   GW  HH  D+W  ++   G   WA WPM GAWLC HLWEHY ++ D  
Sbjct: 416 GRRAASVHYRCRGWTAHHNIDLWRTATPVGGSPSWAFWPMAGAWLCEHLWEHYAFSRDEK 475

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           +L  R YP+L+  A F LDWL+EG DG+L T PSTSPE+ F+  DG   CV+Y+STMD+A
Sbjct: 476 YL-ARVYPVLKEAAQFGLDWLVEGPDGFLVTCPSTSPENHFLTADGSQGCVTYASTMDIA 534

Query: 546 IIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           ++R +F   + A+  L+K+     L+E+ L+ +P   P +I   G + EW +
Sbjct: 535 LLRNLFGRCMEASRQLQKDTAFRVLLEQTLRRMP---PYRIGRHGQLQEWAE 583


>gi|375310399|ref|ZP_09775670.1| alpha-L-fucosidase, partial [Paenibacillus sp. Aloe-11]
 gi|375077548|gb|EHS55785.1| alpha-L-fucosidase, partial [Paenibacillus sp. Aloe-11]
          Length = 643

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 227/613 (37%), Positives = 341/613 (55%), Gaps = 48/613 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L++ F  PA+ + +A+P+GNGRLGAMV+GG+  E L+LNEDTLW+G P D    DA + L
Sbjct: 10  LRLWFRQPAEVWEEALPVGNGRLGAMVFGGIRKERLQLNEDTLWSGFPRDGVQYDALRYL 69

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
             VR L+ +G+Y +A    +  + G   + YQ LGD+ +    +   + E T Y RELDL
Sbjct: 70  KPVRELIAAGKYKDAEHLINTHMLGCDTEAYQPLGDLWI----TQKGFGEITHYERELDL 125

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--------DSLLD 182
            T TA V +    + +TRE  +S+PD +I+  ++   +G ++ +V +        +S  D
Sbjct: 126 PTGTAAVAFHSDGIRYTREVIASSPDGIIMVSLTADRAGQINASVRITTPHPCEDESGED 185

Query: 183 NHSYV---------------NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL---- 223
            H  V                  N I + GR P        +  D P+ + +   L    
Sbjct: 186 EHFAVLSQWDSDVAEGLSDEATRNCITLNGRAPSH--VESNDHGDHPQSVVYEHDLGMAF 243

Query: 224 --EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
             ++++  + G ++A +D  + V G+D   + L A++ F G  + P     +        
Sbjct: 244 AVQVRMVSEGGIVTAKDDGTVIVSGADTLTVYLAAATGFRGFDVMPDSDPAESAEACQIT 303

Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
           L    +L    +  RH  D++ LF RV+++L        +DT +EE I  +P+  R++ +
Sbjct: 304 LDKAISLGSEQVRQRHEQDHRTLFERVALELG-------SDTRTEELI--LPTDLRLERY 354

Query: 342 -QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
            Q + DP L  LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S    NIN +MNY
Sbjct: 355 KQGEADPGLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQMNY 414

Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
           W +  CNL+EC EPL   +  +S  G + A VNY A GW  HH  D+W  +    G   W
Sbjct: 415 WPAEICNLAECHEPLLHMVGEISRTGRRVASVNYGAQGWAAHHNVDLWRYAGPSGGHASW 474

Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
           A WP+GG WL  HLWE Y +T D  +L ++AYPL++G A+F +DWLIEG DG+L T+PST
Sbjct: 475 AFWPLGGVWLTAHLWERYLFTQDTAYLAEQAYPLMKGAAAFCMDWLIEGPDGWLVTSPST 534

Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
           SPE++FI   G+   +S  STMDM +IRE+    I AA++LE +E+    +  ++  RL 
Sbjct: 535 SPENKFITSSGEECSISMGSTMDMTLIRELLGNCIQAADLLELDEE-FRNRCEETQQRLL 593

Query: 581 PTKIAEDGSIMEW 593
           P ++   G + EW
Sbjct: 594 PYQMGRHGQLQEW 606


>gi|337748987|ref|YP_004643149.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
 gi|336300176|gb|AEI43279.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
          Length = 827

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 233/590 (39%), Positives = 334/590 (56%), Gaps = 29/590 (4%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA  + +A+P GNGRLGAMV+GG   E + LNEDTLW+G P D    DA   L   R
Sbjct: 12  YEQPAGSWFEALPAGNGRLGAMVFGGTCRERIALNEDTLWSGEPRDTVREDAHLHLDPAR 71

Query: 77  SLVDSGQYAEATAASVKLFGHP-ADVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTAT 134
            L+  G++AEA     +    P  + Y  LGD+EL+ D    K  E T YRREL L+ A 
Sbjct: 72  KLIFEGRHAEAEEIIEQYMQGPDIESYLPLGDLELQSD----KEGEITDYRRELILDDAV 127

Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQII 194
            R +Y        RE F S  DQV+  +I   +   L+  +SL S L       G++ + 
Sbjct: 128 IRTQYRTDGALQIRELFVSAADQVLALRIESEQP--LNLTISLGSPLQYAVRRTGSSGMA 185

Query: 195 MEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
           + GRCP  R+ P    +D+P      +GI F A L +  + ++G I +    +++V    
Sbjct: 186 LSGRCP-VRVLPNTVRSDEPARYEEGRGIAFEAALHV--TAEKGRIES-SGGRIRVVSGR 241

Query: 249 WAVLLLVASSSFDGPFINPSDSK--KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
              LLL A++S+DG   +P+ +     P +     L+    L YS L  RHL ++ + + 
Sbjct: 242 GVTLLLAAATSYDGFDRDPAAASLAGRPRALCAERLREAAGLGYSRLKERHLREHAEKYG 301

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSS 365
           RV ++L        +   S  + D +P+  R+++  Q  +DP L  L FQ+GRYLL+SSS
Sbjct: 302 RVDLELG------GSAADSGADADALPTDARIRAAAQGADDPGLAALFFQYGRYLLLSSS 355

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPGTQ ANLQGIWN+ L P W S+   NIN++MNYW +   NL+EC EPL  F+  L  +
Sbjct: 356 RPGTQPANLQGIWNDKLQPPWCSSWTANINVQMNYWPAEAANLAECHEPLLRFVDDLRES 415

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G + A V+Y   GW  HH  D+W  ++   G   WA WPM GAWLC HLWEHY ++ D +
Sbjct: 416 GRRAASVHYRCRGWTAHHNIDLWRTATPVGGSPSWAFWPMAGAWLCEHLWEHYAFSRDEE 475

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           +L  R YP+L+  A F LDWL+EG DG+L T PSTSPE+ F+  DG   CV+Y+STMD+A
Sbjct: 476 YL-ARVYPVLKEAAQFGLDWLVEGPDGFLVTCPSTSPENHFLTADGSQGCVTYASTMDIA 534

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           ++R +F   + A+  L+K+  A  E + ++L R+ P +I   G + EW +
Sbjct: 535 LLRNLFGRCMEASRQLQKD-TAFRELLEQTLRRMPPYRIGRHGQLQEWAE 583


>gi|255532589|ref|YP_003092961.1| alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
 gi|255345573|gb|ACU04899.1| Alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
          Length = 868

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 232/598 (38%), Positives = 355/598 (59%), Gaps = 37/598 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PAK + +A+P+GNG+ GAMV+G V  E  +LN++TLW+G P    NP  P  L
Sbjct: 29  LKLWYTQPAKVWEEALPLGNGKTGAMVFGRVNKERFQLNDNTLWSGSPEAGNNPKGPANL 88

Query: 73  SDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
             VR  V  G YA A A   K L G  +  Y  + D+ L+F+   LK +  T Y RELD+
Sbjct: 89  PLVRQAVFEGDYARAAALWKKNLQGPYSARYLTMADLFLDFN---LKDSIPTAYHRELDI 145

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           + A + V Y+VG + + RE   S PD+ +V +I+  +  +L+F+ S+ S L   +   G 
Sbjct: 146 DNAISTVTYTVGGITYKRESLISYPDKAVVIRITTDQKNALNFSTSISSKLKYTARAVGA 205

Query: 191 NQIIMEGRCPGKRIPPKAN-----ANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
           + ++++G+ P K +  +A        DD +G+ F   ++++I  + GT +A +  ++ V 
Sbjct: 206 DLLVLKGKAP-KHVAHRATEAAQVVYDDKEGMTFE--VDVRIKAEGGTTTA-KGTEILVS 261

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            ++   + L  ++SF+G   +P    K+P +E+   L+ +    YS + T H+ DY+ LF
Sbjct: 262 KANAVTIYLSGATSFNGYNKSPGLEGKNPATEAAGILKKVYPKPYSTIKTAHVADYKALF 321

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISS 364
            RVS  L            S   ++ +P+  R+ +      D  L  L +QFGRYL+I+S
Sbjct: 322 DRVSFSLG-----------SNAELEGLPTNVRLSRQGAMGNDQGLQVLYYQFGRYLMIAS 370

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           SRPG+Q  NLQGIWN+ + P W S   VN N +MNYW +   NLSE  +PLFDF+  +++
Sbjct: 371 SRPGSQATNLQGIWNDHVQPPWGSNYTVNANTQMNYWLAEQTNLSELHQPLFDFIGRMAV 430

Query: 425 NGSKTAQVNY-LASGWVIHHKTDIWAKSSAD-------RGKVVWALWPMGGAWLCTHLWE 476
           NG+KTA++NY +  GWV+HH TDIWAKSS         +G   W+ WPMGGAWL THL++
Sbjct: 431 NGAKTAKINYDIRQGWVVHHNTDIWAKSSPTGGYDWDPKGAPRWSAWPMGGAWLTTHLYD 490

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLAC 535
           HY +T D+ FL+++ YPL++G A F+L WL++     YL TNPSTSPE+ F   +GK   
Sbjct: 491 HYLFTGDKQFLKEKGYPLMKGAAEFMLKWLVKDDKTEYLVTNPSTSPENIFKI-EGKEYE 549

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           VS ++TMDM II+E+F+  I+A+++L+ + D  VE + K+  +L P  I   G + EW
Sbjct: 550 VSKATTMDMGIIKELFTDCIAASKILDMDADFRVE-LEKAKAKLYPFNIGRYGQLQEW 606


>gi|294054095|ref|YP_003547753.1| alpha-L-fucosidase [Coraliomargarita akajimensis DSM 45221]
 gi|293613428|gb|ADE53583.1| Alpha-L-fucosidase [Coraliomargarita akajimensis DSM 45221]
          Length = 783

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 229/591 (38%), Positives = 340/591 (57%), Gaps = 41/591 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + ++ PA  +TDA+P+GNG +GAMV+GG+  E ++ N+DTLW G P  Y + DA   L
Sbjct: 26  LTLRYDRPADAWTDALPVGNGSMGAMVFGGIEKERIQFNQDTLWAGEPRSYAHEDAVDVL 85

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
            ++R+L+  G+ AEAT  A  +    P     YQ  GD+ ++F  ++ +  E  Y R LD
Sbjct: 86  PEIRTLLFDGKQAEATKLAGERFMSEPLRQAAYQPFGDLWIQFP-AYGQAGE--YERSLD 142

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L+ A A   Y++G+VEFTR  F+S PD VI  +I  S+ G ++F   L +   ++S V  
Sbjct: 143 LDGALATTSYTIGDVEFTRTVFASYPDGVIAIRIEASKPGMVNFTAGLTTPHQSNSVVEP 202

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
            N+  +  R        K         ++F A  ++++  D G   A     ++V G+  
Sbjct: 203 LNRNTLRLRGQVDAFTDKKETFTFEGAMRFEA--QLRVYTDGGMCQA-SGGVVEVGGATS 259

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L LVA++ F     N      +P S   + L+++ + SY+D+  RH  D++ LF R S
Sbjct: 260 ATLYLVAATDF----TNYKRLAGNPNSRCTTTLRALNSASYADVLQRHQADHRALFRRAS 315

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           I+L  +            + +T+P+ ER+  +Q   DPSLV LLFQ+GRYLLI+SSRPG+
Sbjct: 316 IELGGT------------DANTMPTNERLNQYQAKPDPSLVALLFQYGRYLLIASSRPGS 363

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           + ANLQG+WNE   P W+S   +NIN EMNYW +   NLSEC EPLFD +  LS+ G++ 
Sbjct: 364 EAANLQGLWNESQQPAWESKYTLNINAEMNYWPAELTNLSECHEPLFDLIEDLSVTGAEV 423

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+++Y A GWV HH TD+W + +A        +WP GGAWLCTHLWEH+ YT DR FL+ 
Sbjct: 424 AELHYDARGWVAHHNTDLW-RGAAPINAANHGIWPTGGAWLCTHLWEHFLYTGDRQFLKS 482

Query: 490 RAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           RAYPL++G A F +D L+E     +G+L + PS SPE            +    TMD  I
Sbjct: 483 RAYPLMKGAAQFFVDTLVEDPVFDEGWLISGPSNSPER---------GGLVMGPTMDHQI 533

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWVQR 596
           IR +F A   AA+VL +  DA     L+ L  ++ P+++ ++G + EW+ +
Sbjct: 534 IRSLFHATADAADVLGR--DAAFAAELRELAAKITPSQVGQEGQVKEWLYK 582


>gi|386724573|ref|YP_006190899.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
 gi|384091698|gb|AFH63134.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
          Length = 714

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 235/586 (40%), Positives = 329/586 (56%), Gaps = 42/586 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           F  PAK + +A+P+GNGRLGAMV+G    E ++LNEDT+W G P D  NPDA + L ++R
Sbjct: 8   FKQPAKDWNEALPLGNGRLGAMVFGLPRKERIQLNEDTVWYGGPVDRHNPDALRYLPEIR 67

Query: 77  SLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
             + SG+ AEA   A++ L G P     Y  LGD+ +  D  H     E YRRELDL+  
Sbjct: 68  EKLLSGRLAEAHKLAAMALSGIPESQRHYMPLGDLWITMD--HPPGVAEEYRRELDLSKG 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGN 190
            A + Y +G+  F RE F S+PDQ +V +I     G++ F   LD   S   +     G 
Sbjct: 126 VAGLHYRIGDTAFIRETFISHPDQALVLRIRADRPGAVGFTARLDRGKSRYLDEIEAAGP 185

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           N ++M G C GK             G  F A L    +D  G    +  + L VEG+D  
Sbjct: 186 NMLVMRGNCGGK------------GGSDFRAALR---ADAEGGSVRIIGEHLIVEGADAV 230

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            L L A+++F          ++DP +  ++ L S     Y+ L  RH +DY+ L+ RV +
Sbjct: 231 TLYLSAATTF---------RQEDPEAYCLNTLSSAAARGYASLLERHTEDYRGLYDRVQL 281

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
            L     ++ TD  +   +  +P+ ER++  +   EDP L+ L FQ+GRYLLISSSRPG+
Sbjct: 282 SL-----ELQTDEAAAAAV--LPTDERLELVKKGGEDPGLIPLYFQYGRYLLISSSRPGS 334

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
             ANLQGIWNE + P WDS   +NIN +MNYW +  C+LSEC EPLFD +  +S  GS+T
Sbjct: 335 LPANLQGIWNEQMRPPWDSKYTININTQMNYWPAESCHLSECHEPLFDLIQRMSERGSRT 394

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+V Y   GW  HH TD+W  ++     +    WP+GGAWLC HLWEHY +      L +
Sbjct: 395 AEVMYGCRGWTAHHNTDLWGDTAPQDIYLPATHWPLGGAWLCLHLWEHYRFGGGTARLAE 454

Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
             YP+++G A FLLD++IE  DG+L T PS SPE+ +I P+G+   +     MD  I RE
Sbjct: 455 -FYPVMKGAARFLLDYMIEAKDGHLITCPSVSPENTYILPNGESGTLCAGPAMDSQIARE 513

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +F A   AA  L  +ED   E  L +L R+   ++AE G + EW++
Sbjct: 514 LFQACREAARELGTDEDFRSELEL-ALQRIPLPQVAEGGYLQEWLE 558


>gi|374320465|ref|YP_005073594.1| putative alpha-l-fucosidase; glycoside hydrolase family 95
           [Paenibacillus terrae HPL-003]
 gi|357199474|gb|AET57371.1| putative alpha-l-fucosidase; glycoside hydrolase family 95
           [Paenibacillus terrae HPL-003]
          Length = 829

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 228/611 (37%), Positives = 335/611 (54%), Gaps = 41/611 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L++ +  PAK + +A+P+GNGRLGAMV+GG+  E L+LNEDTLW+G P D    DA + L
Sbjct: 10  LRLWYRQPAKVWEEALPVGNGRLGAMVFGGIGEERLQLNEDTLWSGFPRDGVQYDALRYL 69

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
             VR L+  G+Y +A    +  + G   + YQ LGD+ +   +     AE  Y RELDL 
Sbjct: 70  KPVRELIADGKYKDAEHLINANMLGRDTEAYQPLGDLWIT-QEGLGSIAE--YERELDLV 126

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL------------DS 179
           T TA V +  G + +TRE  +S PD +I+ +++    G ++  V +            D+
Sbjct: 127 TGTAAVTFQGGGIRYTREVIASAPDGIIMVRLTADTPGKINATVRITTPHSCEAEAGEDA 186

Query: 180 LLDNHSYVNGNNQ-----------IIMEGRCPGK------RIPPKANANDDPKGIQFSAI 222
              + S  + + +           I + GR P           P++   +D  G+ F+  
Sbjct: 187 HFGDSSEWDNDKEDDSSGEPERDLITLTGRAPSHVESDYHGYHPQSVVYEDELGMAFA-- 244

Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL 282
           ++ +I  + GT++   D  ++V G+D   + L A++ F G    P     + T      L
Sbjct: 245 IQARIIAEGGTLTRGADGVIRVAGADKLTVYLAAATGFRGFDTQPDIDATESTGVCEVTL 304

Query: 283 QSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ 342
               +L Y  +  RH  D+ +LF RV ++L    +   TD  ++  I T    E+ +  Q
Sbjct: 305 ARAVSLGYEQVRHRHEQDHWELFGRVELELGDEGR---TDPSTKRQIPTDLRLEQYREGQ 361

Query: 343 TDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
            D D  L   LFQ+GRYLLI+SSR G+Q ANLQGIWN+ + P W+S    NIN +MNYW 
Sbjct: 362 ADLD--LEVTLFQYGRYLLIASSRSGSQPANLQGIWNDHVQPPWNSDYTTNINTQMNYWP 419

Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
           +  CNL+EC EPL   +  +S  G + A + Y A GW  HH  D+W  +    G   WA 
Sbjct: 420 AEICNLAECHEPLLHMVGEVSRTGRRVASIYYGAQGWTAHHNVDVWRYAGPSGGHASWAF 479

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
           WP+GG WL  HLWE Y  T D  +L ++AYPL++G A+F +DWL+EG DG+L T+PSTSP
Sbjct: 480 WPLGGVWLTAHLWERYLLTQDTAYLAEQAYPLMKGAAAFCMDWLVEGPDGWLVTSPSTSP 539

Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
           E++FI PDG+   +S  STMDM +IRE+ S  I A E+LE + D    +  ++L RL P 
Sbjct: 540 ENKFITPDGEHCSISMGSTMDMTLIRELLSNCIQATELLELD-DEFRNRCEETLQRLLPY 598

Query: 583 KIAEDGSIMEW 593
           +I   G + EW
Sbjct: 599 QIGRHGQLQEW 609


>gi|390452435|ref|ZP_10237963.1| candidate alpha-l-fucosidase; glycoside hydrolase family 95
           [Paenibacillus peoriae KCTC 3763]
          Length = 826

 Score =  407 bits (1046), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 223/615 (36%), Positives = 342/615 (55%), Gaps = 48/615 (7%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
            PL++ +  PA+ + +A+P+GNGRLGAMV+GG+  E L+LNEDTLW+G P D    DA +
Sbjct: 8   QPLRLWYRQPAEVWEEALPVGNGRLGAMVFGGIREERLQLNEDTLWSGFPRDGVQYDALR 67

Query: 71  ALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRREL 128
            L  VR L+ +G+Y +A    +  + G   + YQ LGD+ +    +     E T Y REL
Sbjct: 68  YLKPVRELIAAGKYKDAEHLINTHMLGCDTEAYQPLGDLWI----AQEGLGEITHYEREL 123

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--------DSL 180
           DL T TA V +    + +TRE  +S+PD +I+  ++ + +G ++ +V +        ++ 
Sbjct: 124 DLPTGTAAVAFHSDGIRYTREVIASSPDGIIMVSLTANRAGQINASVRITTPHPCEDEAG 183

Query: 181 LDNHSYV---------------NGNNQIIMEGRCPGKRIP------PKANANDDPKGIQF 219
            D H  V                  N I + GR P           P++   +   G+ F
Sbjct: 184 EDEHFAVLSQWDSDVAEGPSDEAARNCITLTGRAPSHVESNYHGDHPQSVVYEHDLGMAF 243

Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM 279
           +  ++ ++  + G ++   D  + V G+D   + L A++ F G    P     +      
Sbjct: 244 A--VQARMVSEGGIVTTKADGTVIVSGADTLTIYLAAATGFRGFHTMPDSDPAESAEVCQ 301

Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
             L  + +L    +  RH  D++ LF RV+++L         DT +EE+I  +P+  R++
Sbjct: 302 VTLDKVISLGSEQVRQRHEQDHRALFDRVALELG-------GDTRTEESI--LPTDLRLE 352

Query: 340 SF-QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
            + Q + DP L  LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S    NIN +M
Sbjct: 353 RYKQGEADPRLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQM 412

Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKV 458
           NYW +  CNL+EC EPL   +  +S  G + A VNY A GW  HH  D+W  +    G  
Sbjct: 413 NYWPAEVCNLAECHEPLLHMIGEISRTGRRVASVNYGAQGWAAHHNVDLWRYAGPSGGHA 472

Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
            WA WP+GG WL  HLW+ Y +T D  +L ++AYPL++G A+F +DWL+EG +G+L T+P
Sbjct: 473 SWAFWPLGGVWLTAHLWDRYLFTQDTAYLAEQAYPLMKGAAAFCMDWLVEGPNGWLVTSP 532

Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
           STSPE++FI P G+   +S  STMDM +IRE+    I AA++LE +E+    +  ++  R
Sbjct: 533 STSPENKFITPSGEECSISMGSTMDMTLIRELLGNCIQAADLLELDEE-FRNRCEETQQR 591

Query: 579 LRPTKIAEDGSIMEW 593
           L P ++   G + EW
Sbjct: 592 LLPYQMGRHGQLQEW 606


>gi|251795949|ref|YP_003010680.1| alpha-L-fucosidase [Paenibacillus sp. JDR-2]
 gi|247543575|gb|ACT00594.1| Alpha-L-fucosidase [Paenibacillus sp. JDR-2]
          Length = 787

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 234/595 (39%), Positives = 332/595 (55%), Gaps = 37/595 (6%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +  PA  + +A+PIGNGR+G MV+ G   + + LNEDTLW G P D  N +A + L+
Sbjct: 8   KLWYEQPASVWEEALPIGNGRIGGMVFAGTEIDQILLNEDTLWAGFPRDPINYEAQRYLA 67

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
             R L+ SG+YAEA       + G   + Y  LG + +   +   + A   Y+REL LN 
Sbjct: 68  KARQLIFSGKYAEAERLIESTMQGRDVEPYLPLGGLSIVRREDR-ESAVSQYKRELHLNE 126

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
             A   Y  G+V    ++F S PDQ +V +   +  G+L+ ++ +DSLL       G  Q
Sbjct: 127 GIAAACYQDGDVTVQSQYFVSVPDQALVVRYEAA-GGTLNRDIVMDSLLQYRLEEAGERQ 185

Query: 193 IIMEGRCPGK------RIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           + + G+ P        +  P     ++  G+ F   + +K+  D GT+   E K L+V  
Sbjct: 186 LHLIGQAPSHVAGNYHKDHPMDVLYEEGLGLPFE--IRVKVETD-GTVKNGE-KGLEVRN 241

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR-----NLSYSDLYTRHLDDY 301
           + +  + L A + F G         + P  E+ SA  SIR      L +  L +RH +D+
Sbjct: 242 AAYLHIYLTAETGFAG-------YDQSPDQEACSARCSIRLEKAAALGFEGLLSRHTEDH 294

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYL 360
           ++LF RVS  L+            E +    P+  R+  +QT  +D  L  L F FGRYL
Sbjct: 295 RQLFDRVSFSLA-----------DETDGSDKPTDRRLADYQTTKQDSHLEALYFHFGRYL 343

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           L+ SSRPGTQ ANLQGIWN  +SP W S   +NIN +MNYW +  CNLSEC EPLF  L 
Sbjct: 344 LMGSSRPGTQPANLQGIWNHHVSPPWHSDYTININTQMNYWPAEVCNLSECHEPLFTMLR 403

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            +S  GS+TA+++Y + GW  HH  DIW  ++   G   WA WP+GGAWL   +WE Y Y
Sbjct: 404 EMSEAGSRTARIHYGSRGWTAHHNVDIWRMTTPTGGSASWAFWPLGGAWLVRQVWESYLY 463

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
            MD+DFL ++AYPLL+G A F LDWL+EG +G L TNPSTSPE++F+  +G+   VSY S
Sbjct: 464 NMDKDFLGEKAYPLLKGAALFCLDWLVEGPNGDLVTNPSTSPENKFLTSEGEPCSVSYGS 523

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           TMD+AIIR++F   + A + L   E    +++L SL RL   KI   G + EW +
Sbjct: 524 TMDIAIIRDLFQNCLEAIDALGVEEAEFRDELLASLDRLPAYKIGRHGQLQEWYE 578


>gi|337748853|ref|YP_004643015.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
 gi|336300042|gb|AEI43145.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
          Length = 762

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 234/586 (39%), Positives = 330/586 (56%), Gaps = 42/586 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           F  PAK + +A+P+GNGRLGAMV+G    E ++LNEDT+W G P D  NPDA + L ++R
Sbjct: 8   FRQPAKDWNEALPLGNGRLGAMVFGLPRKERIQLNEDTVWYGGPVDRHNPDALRYLPEIR 67

Query: 77  SLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
             + SG+ AEA   A++ L G P     Y  LGD+ +  D  H     E YRRELDL+ +
Sbjct: 68  EKLLSGRLAEAHKLAAMALSGIPESQRHYMPLGDLWITMD--HPPGEAEEYRRELDLSKS 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGN 190
            A + Y +G+  F RE F S+PDQ +V ++     G++     LD   S   +     G 
Sbjct: 126 VAGLHYRIGDTAFIRETFISHPDQALVLRLRADRPGAIGLTARLDRGKSRYLDEIEAAGP 185

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           N ++M G C GK             G  F A L    +D  G    +  + L VEG+D  
Sbjct: 186 NVLVMRGNCGGK------------GGSDFRAALR---ADAEGGSVRIIGEHLIVEGADAV 230

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            L L A+++F          ++DP +  ++ L S     Y+ L  RH +DY+ L+ RV +
Sbjct: 231 TLYLSAATTF---------RQEDPEAYCLNTLSSAAARGYASLLERHTEDYRGLYDRVQL 281

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
            L     ++ TD  +   +  +P+ ER++  +   EDP L+ L FQ+GRYLLISSSRPG+
Sbjct: 282 SL-----ELQTDEAAAAAV--LPTDERLELVKKGGEDPGLIPLYFQYGRYLLISSSRPGS 334

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
             ANLQGIWNE + P WDS   +NIN +MNYW +  C+LSEC EPLFD +  +S  GS+T
Sbjct: 335 LPANLQGIWNEQMRPPWDSKYTININTQMNYWPAESCHLSECHEPLFDLIQRMSERGSRT 394

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+V Y   GW  HH TD+W  ++     +    WP+GGAWLC HLWEHY +  D   L +
Sbjct: 395 AEVMYGCRGWTAHHNTDLWGDTAPQDIYLPATHWPLGGAWLCLHLWEHYRFGGDTQRLAE 454

Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
             YP+++G A FLLD++IE  DG+L T PS SPE+ +I P+G+   +     MD  I RE
Sbjct: 455 -FYPVMKGAARFLLDYMIEAKDGHLITCPSVSPENTYILPNGESGTLCAGPAMDSQIARE 513

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +F A   AA  L  +ED   E  L +L R+   ++AE G + EW++
Sbjct: 514 LFQACREAARELGTDEDFRSELEL-ALQRIPLPQLAEGGYLQEWLE 558


>gi|149199357|ref|ZP_01876394.1| putative large secreted protein [Lentisphaera araneosa HTCC2155]
 gi|149137599|gb|EDM26015.1| putative large secreted protein [Lentisphaera araneosa HTCC2155]
          Length = 840

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 228/600 (38%), Positives = 324/600 (54%), Gaps = 33/600 (5%)

Query: 2   MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           ++ E+ +  N L + +  PA H+ +A+P+GNGRLGAMV+GG+  E L+LNEDT+W+G P 
Sbjct: 60  LSGEAVAPANDLSLWYRKPASHWVEALPVGNGRLGAMVYGGINKEWLQLNEDTMWSGEPV 119

Query: 62  DYTNPDAPKALSDVRSLVDSGQYAEATAASVKL-----FGHPADVYQLLGDIELEFDDSH 116
           +   P+    +++ R L+   +Y EA     +       G     YQ++ D+EL F    
Sbjct: 120 ERDKPNVQAGIAEARKLLFDEKYVEAQKVVEEKVMGTSLGRGTHNYQMMADLELIFPK-- 177

Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
            +     YRR+L+L  A + V+Y      + RE FSS  DQ I  ++S  E   +SF+ S
Sbjct: 178 -RDEVSNYRRDLNLENAISSVQYEFAGTTYKRELFSSAVDQAIYLRLSSDEKAKISFSAS 236

Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
           L     +   +  N  ++++G+    +           KG+ F     +K+ ++ G I  
Sbjct: 237 LTRPQSSQLKMMENGALVLKGQARTSKKKVIEQFPSAAKGVAFET--HLKVLNEGGKIFY 294

Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
            ED  ++VE +D   L+LVASS + G         K  T+     L      SY    T 
Sbjct: 295 EEDS-IRVENADAVTLVLVASSDYYG--------DKKLTASCQKQLNHATQKSYHQARTD 345

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           H+ DYQKLF RV + L  SP         +  ID +         +   D  L E  FQ+
Sbjct: 346 HIQDYQKLFKRVDLDLGASPS--AHKPTDQRLIDLI---------KGQYDAQLFEQYFQY 394

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLISSSRPGT  ANLQG+W + L P W+S  H+NIN +MNYW +   NLSEC  P F
Sbjct: 395 GRYLLISSSRPGTMPANLQGLWTDGLMPAWNSDFHININFQMNYWHAETTNLSECHMPAF 454

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
             L  L   G + AQ N+   GW   H TD W  +S   GK  + +WP+GGAW   HLWE
Sbjct: 455 YLLERLQERGREVAQKNFGCRGWTAGHTTDAWFFASLI-GKPQYGMWPVGGAWCSRHLWE 513

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLAC 535
           HY +  D+DFL  RAYP+++G A F +DWL+E    G L + PSTSPE+ F  PDGK A 
Sbjct: 514 HYEFNGDKDFLRNRAYPIMKGAALFCMDWLVENPATGLLVSGPSTSPENRFKTPDGKEAN 573

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           ++   TMD  I+R++F+  I +AE+L  +++   E  L  L +L PTKIA+DG IMEW +
Sbjct: 574 LTMGPTMDHQIMRDLFTNTIKSAEILNIDQEFRKELNL-ILQKLSPTKIAKDGRIMEWAE 632


>gi|320107748|ref|YP_004183338.1| alpha-L-fucosidase [Terriglobus saanensis SP1PR4]
 gi|319926269|gb|ADV83344.1| Alpha-L-fucosidase [Terriglobus saanensis SP1PR4]
          Length = 814

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 232/591 (39%), Positives = 333/591 (56%), Gaps = 31/591 (5%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L +    PA  + DA+P+GNGRLGAMV+G    E + LNEDTLW G P D TNPDA   L
Sbjct: 35  LTLWMETPAAQWADALPLGNGRLGAMVFGEPLKERIALNEDTLWAGQPRDTTNPDAKNHL 94

Query: 73  SDVRSLV-DSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETY-RRELDL 130
             VR LV +   Y  A     K+ G     ++ LGD+ +E    HL   E T+ +R LDL
Sbjct: 95  PIVRKLVLEDKNYVAADKECQKMQGPENFAFEPLGDLHIE----HLGLTEATHLKRSLDL 150

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           +TA A+  +    V F+RE F S PDQV+  +I+ S+  SL+  +SL   +   +  + +
Sbjct: 151 DTAVAKTSFQSSGVTFSREVFVSFPDQVVALRITASKPSSLNLRLSLTCEMPAKTSAHAD 210

Query: 191 NQIIMEGRCPGKRIPPKANA----NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
             +++ G+ P +  P  +++      D +G++F+A+L  K   + GT+   E   L +  
Sbjct: 211 GTLLLAGKVPTENNPQISDSIRYSEVDGEGMRFAAVLSAKA--EGGTVQP-EGDTLAISK 267

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    LLL A++ F G F  P D+      E      + ++ +Y+ L T+H+ D++ LF 
Sbjct: 268 ATSVTLLLTAATGFRG-FAFPPDTPAAALEEKCRKGLAGKS-AYAVLKTKHVADHRALFR 325

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV   L+ +  D             +P+  R+K+F T +DP+L+ L FQ+GRYLLI+SSR
Sbjct: 326 RVGANLNSTVPDGAN----------LPTDARLKNFPTTQDPALLALYFQYGRYLLIASSR 375

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PGTQ ANLQGIWN+ + P W S    NIN++MNYW     NL+E   PL D    +++ G
Sbjct: 376 PGTQPANLQGIWNDLVRPPWSSNWTANINIQMNYWPVFTANLAELNGPLVDLTQDMTVTG 435

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
           +KTA VNY A GW  HH  D+W ++S      G   WA + M G WLC HL+EH+ +T D
Sbjct: 436 AKTASVNYGARGWCSHHNIDLWRQASPVGMGSGDPTWANFAMSGPWLCQHLYEHFQFTGD 495

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
            D+L KR YP+L   A F LDWL+   DG L T PS S E+ F  P  + A VS   T+D
Sbjct: 496 VDYLRKRVYPILRSSALFCLDWLVPAGDGTLTTCPSFSTENNFFTPQHQKAVVSAGCTLD 555

Query: 544 MAIIREVFSAIISAAEVLEKNED-ALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +A+I E+F   ISA++VL  NED A  +K+  +L +L P K+   G + EW
Sbjct: 556 LALIHELFGNCISASQVL--NEDQAFADKLKAALAKLPPYKVGSAGELQEW 604


>gi|379721830|ref|YP_005313961.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
 gi|378570502|gb|AFC30812.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
          Length = 781

 Score =  404 bits (1037), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 233/586 (39%), Positives = 329/586 (56%), Gaps = 42/586 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           F  PAK + +A+P+GNGRLGAMV+G    E ++LNEDT+W G P D  NPDA + L ++R
Sbjct: 8   FRQPAKDWNEALPLGNGRLGAMVFGLPRKERIQLNEDTVWYGGPVDRHNPDALRYLPEIR 67

Query: 77  SLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
             + SG+ AEA   A++ L G P     Y  LGD+ +  D  H     E YRRELDL+ +
Sbjct: 68  EKLLSGRLAEAHKLAAMALSGIPESQRHYMPLGDLWITMD--HPPGEAEEYRRELDLSKS 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGN 190
            A + Y +G+  F RE F S+PDQ +V ++     G++     LD   S   +     G 
Sbjct: 126 VAGLHYRIGDTAFIRETFISHPDQALVLRLRADRPGAIGLTARLDRGKSRYLDEIEAAGP 185

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           N ++M G C GK             G  F A L    +D  G    +  + L VEG+D  
Sbjct: 186 NVLVMRGNCGGK------------GGSDFRAALR---ADAEGGSVRIIGEHLIVEGADAV 230

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            L L A+++F          ++DP +  ++ L S     Y+ L  RH +DY+ L+ RV +
Sbjct: 231 TLYLSAATTF---------RQEDPEAYCLNTLSSAAARGYASLLERHTEDYRGLYDRVQL 281

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
            L     ++ TD  +   +  +P+ ER++  +   EDP L+ L FQ+GRYLLISSSRPG+
Sbjct: 282 SL-----ELQTDEAAAAAV--LPTDERLELVKKGGEDPGLIPLYFQYGRYLLISSSRPGS 334

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
             ANLQGIWNE + P WDS   +NIN +MNYW +  C+LSEC EPLFD +  +S  GS+T
Sbjct: 335 LPANLQGIWNEQMRPPWDSKYTININTQMNYWPAESCHLSECHEPLFDLIKRMSERGSRT 394

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+V Y   GW  HH TD+W  ++     +    WP+GGAWLC HLWEHY +      L +
Sbjct: 395 AEVMYGCRGWTAHHNTDLWGDTAPQDIYLPATHWPLGGAWLCLHLWEHYRFGGGTARLAE 454

Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
             YP+++G A FLLD++IE  DG+L T PS SPE+ +I P+G+   +     MD  I RE
Sbjct: 455 -FYPVMKGAARFLLDYMIEAKDGHLITCPSVSPENTYILPNGESGTLCAGPAMDSQIARE 513

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +F A   AA  L  +ED   E  L +L R+   ++AE G + EW++
Sbjct: 514 LFQACREAARELGTDEDFRSELEL-ALQRIPLPQVAEGGYLQEWLE 558


>gi|261407087|ref|YP_003243328.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
 gi|261283550|gb|ACX65521.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
          Length = 755

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 226/591 (38%), Positives = 332/591 (56%), Gaps = 50/591 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + +A+PIGNGRLGAM++GG   E L+LNED++W G P D  N DA   L 
Sbjct: 12  RLWYRKPAAEWNEALPIGNGRLGAMIFGGTAEEKLQLNEDSVWYGGPRDRNNEDALPHLP 71

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
           ++R L+  G+  EA   A++ + G P     Y  LGD+ L F   H + AE+ Y RELDL
Sbjct: 72  EIRKLIMEGRLQEAEELAAMTMAGLPEAQRHYVPLGDLLLSFG-QHGQLAED-YMRELDL 129

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
               +RV Y +G + +TRE F+S PDQ +V +I+  +  +++F    +    N  YV   
Sbjct: 130 ERGVSRVSYRIGGIRYTRELFASYPDQAVVIRITADKQEAVTFKARFNR--RNWRYVEKT 187

Query: 191 NQ-----IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
           ++     ++M G C G+             G  FSA+L+   +   G +     + L V+
Sbjct: 188 DKWEASGLVMRGDCGGE------------GGSSFSAVLK---AVPEGGVCRTLGEYLLVD 232

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G+    LLL A ++F  P         DP  +    L+ +  + Y++L  RH+ DY++L+
Sbjct: 233 GASSVTLLLAAGTTFRHP---------DPELDGKRRLEELSRVPYAELLARHVADYRELY 283

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGRYLLISS 364
            RV ++L  +P               +P+ ER+K FQ  +ED  L+   FQFGRYLLI+S
Sbjct: 284 GRVELKLPENPDKAA-----------LPTDERLKRFQHGEEDHGLIATYFQFGRYLLIAS 332

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           SRPG+  ANLQGIWN+  +P WDS   +NIN +MNYW +  CNL+EC EPLF+ +  +  
Sbjct: 333 SRPGSLPANLQGIWNDSFTPPWDSKFTININAQMNYWHAENCNLAECHEPLFELIERMRE 392

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA V Y   G+  HH TDIWA ++     +  + WPMG AWLC HLWEHY +  DR
Sbjct: 393 PGRVTAGVMYGCRGFTAHHNTDIWADTAPQDTYLPASFWPMGAAWLCLHLWEHYRFGQDR 452

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
            FL  RAY  ++  A FLLD+LIE  +G L T PS SPE+ +  P+G+   +   +TMD 
Sbjct: 453 YFL-ARAYETMKEAALFLLDYLIEDGEGRLVTCPSVSPENRYKLPNGETGVLCTGATMDF 511

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            II  +F A + +AE+  ++E A  E++  +L RL   +I + G I EW++
Sbjct: 512 QIIEALFDACMQSAEIFGRDE-AFREELAAALKRLPKPQIGKYGQIQEWME 561


>gi|329926814|ref|ZP_08281220.1| hypothetical protein HMPREF9412_3407 [Paenibacillus sp. HGF5]
 gi|328938931|gb|EGG35301.1| hypothetical protein HMPREF9412_3407 [Paenibacillus sp. HGF5]
          Length = 764

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 219/566 (38%), Positives = 332/566 (58%), Gaps = 24/566 (4%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
           MV+GGV  E ++ NEDTLW+G P D  N +A + L+  R L+ SG+YAEA      ++ G
Sbjct: 1   MVFGGVQEECIQWNEDTLWSGFPRDTNNYEALRYLAKARELIASGKYAEAEQLIEGRMVG 60

Query: 97  HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVE--FTREHFSSN 154
              + +  LGD+ +    S +  +   YRREL+L+T  A  ++ V   +  F+R+ F S 
Sbjct: 61  RNTESFLPLGDLLIR--QSGIGDSCSEYRRELNLDTGIASTRFQVSGSDPIFSRDMFISA 118

Query: 155 PDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG------KRIPPKA 208
            DQV V +   + S S+   + L S L + +    +  +++ G  P       +   P +
Sbjct: 119 VDQVGVIRYESTGSSSVQLEIGLRSPLQHRTRTEEDGTLVLHGHAPTHIADNYRGDHPGS 178

Query: 209 NANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPS 268
              +D  GI++   + +    D G ++ ++D  +++  +    LL+ A+++F+G    P 
Sbjct: 179 VLYEDGLGIRYE--MRLLALTDSGQVT-VDDSGMRISAAGSVTLLIAAATNFEGFDRFPG 235

Query: 269 DSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEEN 328
               DP+      LQ      +  L +RH+ D+Q LF RV +QL R P++       E +
Sbjct: 236 SGGTDPSGICRERLQDAMRHGFEQLRSRHVQDHQALFRRVELQLGR-PEN-------ERS 287

Query: 329 IDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWD 387
           I  + + ER+++++   ED +L  L+FQFGRYLLI+SSRPGTQ A+LQGIWN  + P W+
Sbjct: 288 IAALATDERMEAYREGREDAALEALMFQFGRYLLIASSRPGTQPAHLQGIWNPHVQPPWN 347

Query: 388 SAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDI 447
           S    NIN EMNYW +    LSEC EPL   +  LS++G++TA+++Y A GWV HH  D+
Sbjct: 348 SDYTTNINTEMNYWPAETTRLSECHEPLIQMIRELSVSGARTAKIHYGARGWVAHHNVDL 407

Query: 448 WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI 507
           W  +S   G+ +WA WPMGGAWLC HLWE Y +  D ++L + AYPL+ G A F LDWLI
Sbjct: 408 WRMASPSDGRAMWAYWPMGGAWLCRHLWERYQFQPDIEYLRETAYPLMRGAALFCLDWLI 467

Query: 508 EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA 567
           E  +G+L T+PSTSPE++F+  +G    VS  STMDMAIIR++F   I A+++LE++ D 
Sbjct: 468 EDGEGHLVTSPSTSPENQFLTEEGLPCSVSAGSTMDMAIIRDLFHNCIEASQLLEQD-DE 526

Query: 568 LVEKVLKSLPRLRPTKIAEDGSIMEW 593
           L E+   ++ RL P  I  +G +MEW
Sbjct: 527 LREEWKMAVERLLPYAIDNEGRLMEW 552


>gi|218263534|ref|ZP_03477615.1| hypothetical protein PRABACTJOHN_03302 [Parabacteroides johnsonii
           DSM 18315]
 gi|218222657|gb|EEC95307.1| hypothetical protein PRABACTJOHN_03302 [Parabacteroides johnsonii
           DSM 18315]
          Length = 811

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 232/598 (38%), Positives = 343/598 (57%), Gaps = 31/598 (5%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPK 70
           P  + F  PA  + +A+PIGNG++GAM++GGV  E ++LNE TLW+G P     NP+A K
Sbjct: 22  PKTLWFEQPANQWVEALPIGNGQIGAMIFGGVEEELIQLNEGTLWSGSPLKKNVNPEAYK 81

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            L+ VR  +    Y +AT    K+ G   + +  LGD++++ D  H K     Y+R L L
Sbjct: 82  FLAPVREALAKEDYQQATKLCKKMQGFFTENFLPLGDLKIKQDFGH-KARVVDYKRILQL 140

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           + A A +++ V  V +TR+ F+S PD V+V + +  +   L+ ++ L SLL +H   NG 
Sbjct: 141 DKAIASIEFVVDEVHYTRKMFTSAPDSVMVIQFTADKLRKLTLDIHLTSLLKHHVTANGK 200

Query: 191 NQIIMEGRCPG----------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
           +  ++ G+ P            R P      D  +G++F  +L  K   D GTI + ++K
Sbjct: 201 DLFVLSGQAPACVDPIYYERPGREPIVQVDKDGLQGMRFQTVL--KAIPDGGTIVS-DEK 257

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + V+ ++   LLL A++SF+G   +P    KD    S   +  I  + ++ L  RH+ D
Sbjct: 258 GIHVKDANSLTLLLSAATSFNGFNKHPDSEGKDEKVISCHRIDRIDKVDFAVLKKRHITD 317

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGRY 359
           ++  F RVS+ L        TDT +      +P+  R+K +   + DP L EL FQ+GRY
Sbjct: 318 FKSYFDRVSLHL--------TDTLNSTINKKLPTDFRLKLYSYGNYDPQLEELYFQYGRY 369

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LLIS+SRPG    NLQG+W+ ++ P W S   +NIN EMNYW +   NLSE  + L +F+
Sbjct: 370 LLISASRPGGSAINLQGLWSNEVRPPWASNYTININTEMNYWLAESTNLSEMHQSLLNFI 429

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLW 475
             LSI G  TA+  Y A GW+ HH +DIWA S++      G   WA W MGG WL  HLW
Sbjct: 430 KNLSITGEDTAKEYYHARGWMAHHNSDIWALSNSVGNCGDGNPSWASWYMGGNWLSLHLW 489

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           EHY YT D++FL+  AYP+++G A F  DWL+E  +GYL T+PSTSPE+ F   D  +  
Sbjct: 490 EHYCYTGDKEFLKNEAYPIMKGAALFCFDWLLE-KNGYLITSPSTSPENNFFV-DNNVYA 547

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           VS ++TMDMAII ++F+ +I A+E+L  ++    E V+K   RL P +I   G + EW
Sbjct: 548 VSEAATMDMAIIHDLFTNVIEASEILGIDKKFRSE-VIKKKERLFPYQIGSFGQLQEW 604


>gi|376260116|ref|YP_005146836.1| hypothetical protein [Clostridium sp. BNL1100]
 gi|373944110|gb|AEY65031.1| hypothetical protein Clo1100_0760 [Clostridium sp. BNL1100]
          Length = 775

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 225/589 (38%), Positives = 333/589 (56%), Gaps = 35/589 (5%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA+ + +A+PIGNGRLG MV GG+  E + LN DTLW+G+PG + N +    L  V+
Sbjct: 7   YKSPARIWEEALPIGNGRLGGMVHGGISQECIDLNNDTLWSGLPGQHINKNILPVLPKVQ 66

Query: 77  SLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
            LV+ G+  EA       +    +  Y  LG + L ++   L    + Y R L LNTA  
Sbjct: 67  RLVNQGKNYEAQKLIEENILTGYSQSYLPLGRLLLTYE---LSGDAKGYNRSLSLNTAVC 123

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
             +Y+ G V + RE   S PD V+   I+  +SG+L+FN++LDS L  +     NN +IM
Sbjct: 124 ETRYTSGGVNYCREVICSYPDDVMAVHITADKSGALTFNITLDSQL-RYQIAKMNNTLIM 182

Query: 196 EGRCPGKRIPPKANAN--------DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
            G CP   IP    A+        +  + I+FS  +   +   +G    ++  ++ V  +
Sbjct: 183 TGDCPSCMIPDYVEADKHLIYDHEEYSRSIRFSVGMRANV---KGGSLIVDADRISVTAA 239

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D  +L+L ++++F+G    P  S  DP ++ M  L +    S+++L +RH  D+  LF R
Sbjct: 240 DEVLLILSSTTNFEGFDKMPGSSGNDPLTKCMRILDNTVGYSWNELLSRHKADHAALFER 299

Query: 308 VSIQL-SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
           V + L ++SP               +P+ +R+ ++     DPSL  LLF +GRYLLI+ S
Sbjct: 300 VCLDLGTQSP---------------MPTDKRLAAYAAGHHDPSLDSLLFAYGRYLLIACS 344

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPGTQ ANLQGIWN++L+  W S    NIN EMNYW +   NL EC  PLFD L  +S  
Sbjct: 345 RPGTQAANLQGIWNKELTAPWSSNYTTNINTEMNYWPAETANLPECHIPLFDLLKDVSKA 404

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           GS+ + V+Y   G+V+HH TD+W  +S+  G+  W  WPMGGAWL  H+ EHY ++ D D
Sbjct: 405 GSEISLVHYGCRGFVLHHNTDLWRMASSVSGQARWGFWPMGGAWLSIHIMEHYRFSCDTD 464

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           FL+   Y + E    FLLD+L    +GY  TNPSTSPE+ FI  DG++  ++  STMD+A
Sbjct: 465 FLKDYYYIMREAVL-FLLDYLKPDDNGYFLTNPSTSPENAFIDADGRICSITKGSTMDLA 523

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           IIRE+F + I A  +L K +  L   + + L +L P +I   G ++EW+
Sbjct: 524 IIRELFESCIEAQSIL-KIDSYLSGLLAQRLCKLPPFQIGSKGQLLEWL 571


>gi|310644025|ref|YP_003948783.1| candidate alpha-l-fucosidase; glycoside hydrolase family 95
           [Paenibacillus polymyxa SC2]
 gi|309248975|gb|ADO58542.1| Candidate alpha-L-fucosidase; Glycoside hydrolase family 95
           [Paenibacillus polymyxa SC2]
          Length = 824

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 219/612 (35%), Positives = 343/612 (56%), Gaps = 46/612 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L++ +  PA+ + +A+P+GNGRLGAMV+GG+  E L+LNEDTLW+G P D    DA + L
Sbjct: 10  LRLWYRQPAEVWEEALPVGNGRLGAMVFGGIREEHLQLNEDTLWSGFPRDGVQYDALRYL 69

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDD-SHLKYAEETYRRELDL 130
              R L+  G+Y EA    +  + G   + YQ LGD+ +  ++   + +    Y RELD+
Sbjct: 70  EPARKLIADGKYKEAEQLITSNMLGRDTEAYQPLGDLWITQENLGEIAH----YERELDM 125

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS----------- 179
            T TA V +    V +TR+  +S PD VI+  ++ ++ G +  +V + +           
Sbjct: 126 QTGTAAVTFQSDGVRYTRKVIASAPDGVIMVSLTANKVGKIHASVRMTTPHSCDDEAGED 185

Query: 180 --LLDNHSYVNGNNQ--------IIMEGRCPGKRIP------PKANANDDPKGIQFSAIL 223
               D+  + + N+         I + GR P           P++   ++  G+ F+  +
Sbjct: 186 VHFSDSSQWASDNDPSEEPTRDFITLTGRAPSHVESNYHGDHPQSVVYENDLGMAFA--V 243

Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
           + ++  + GT++  +D  L +  +D   + L A++ F G    P+    +        L 
Sbjct: 244 QARVIPEGGTLTTRDDGALIISDADKITVYLAAATGFRGFQAMPNSDATESAEACKVILD 303

Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
              +L    +  RH  D++KLF RV+++L        +DT ++E++  +P+  R++ +Q 
Sbjct: 304 GAISLGSEQVRQRHEQDHRKLFDRVALELG-------SDTLTDESV--LPTDLRLERYQK 354

Query: 344 DE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
            + D  L  LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S    NIN +MNYW 
Sbjct: 355 GQADRGLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQMNYWP 414

Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
           +  CNL+EC EPL   +  +S  G + A ++Y A GW  HH  D+W  +    G   WA 
Sbjct: 415 AEVCNLAECHEPLLHMIGEVSRTGRRVASIHYGAQGWTAHHNIDVWRYAGPSAGHASWAF 474

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
           WP+GG WL  HLWE Y +T+D  +L ++AYPL++G A+F LDWL EG DG L T+PSTSP
Sbjct: 475 WPLGGVWLTAHLWERYLFTLDTTYLAEQAYPLMKGAAAFCLDWLAEGPDGRLATSPSTSP 534

Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
           E++FI P G+   +S  STMDM +IRE+ S  I AA++LE + D   ++  ++  RL P 
Sbjct: 535 ENKFITPGGEDCSISMGSTMDMTLIRELLSNCIQAADLLELD-DEFRKRCEETRERLVPY 593

Query: 583 KIAEDGSIMEWV 594
           +I   G + EW+
Sbjct: 594 QIGRHGQLQEWL 605


>gi|392304738|emb|CCI71101.1| Alpha-L-fucosidase 2 [Paenibacillus polymyxa M1]
          Length = 867

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 219/612 (35%), Positives = 343/612 (56%), Gaps = 46/612 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L++ +  PA+ + +A+P+GNGRLGAMV+GG+  E L+LNEDTLW+G P D    DA + L
Sbjct: 53  LRLWYRQPAEVWEEALPVGNGRLGAMVFGGIREEHLQLNEDTLWSGFPRDGVQYDALRYL 112

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDD-SHLKYAEETYRRELDL 130
              R L+  G+Y EA    +  + G   + YQ LGD+ +  ++   + +    Y RELD+
Sbjct: 113 EPARKLIADGKYKEAEQLITSNMLGRDTEAYQPLGDLWITQENLGEIAH----YERELDM 168

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS----------- 179
            T TA V +    V +TR+  +S PD VI+  ++ ++ G +  +V + +           
Sbjct: 169 QTGTAAVTFQSDGVRYTRKVIASAPDGVIMVSLTANKVGKIHASVRMTTPHSCDDEAGED 228

Query: 180 --LLDNHSYVNGNNQ--------IIMEGRCPGKRIP------PKANANDDPKGIQFSAIL 223
               D+  + + N+         I + GR P           P++   ++  G+ F+  +
Sbjct: 229 VHFSDSSQWASDNDPSEEPTRDFITLTGRAPSHVESNYHGDHPQSVVYENDLGMAFA--V 286

Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
           + ++  + GT++  +D  L +  +D   + L A++ F G    P+    +        L 
Sbjct: 287 QARVIPEGGTLTTRDDGALIISDADKITVYLAAATGFRGFQAMPNSDATESAEACKVILD 346

Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
              +L    +  RH  D++KLF RV+++L        +DT ++E++  +P+  R++ +Q 
Sbjct: 347 GAISLGSEQVRQRHEQDHRKLFDRVALELG-------SDTLTDESV--LPTDLRLERYQK 397

Query: 344 DE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
            + D  L  LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S    NIN +MNYW 
Sbjct: 398 GQADRGLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQMNYWP 457

Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
           +  CNL+EC EPL   +  +S  G + A ++Y A GW  HH  D+W  +    G   WA 
Sbjct: 458 AEVCNLAECHEPLLHMIGEVSRTGRRVASIHYGAQGWTAHHNIDVWRYAGPSAGHASWAF 517

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
           WP+GG WL  HLWE Y +T+D  +L ++AYPL++G A+F LDWL EG DG L T+PSTSP
Sbjct: 518 WPLGGVWLTAHLWERYLFTLDTTYLAEQAYPLMKGAAAFCLDWLAEGPDGRLATSPSTSP 577

Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
           E++FI P G+   +S  STMDM +IRE+ S  I AA++LE + D   ++  ++  RL P 
Sbjct: 578 ENKFITPGGEDCSISMGSTMDMTLIRELLSNCIQAADLLELD-DEFRKRCEETRERLVPY 636

Query: 583 KIAEDGSIMEWV 594
           +I   G + EW+
Sbjct: 637 QIGRHGQLQEWL 648


>gi|256424518|ref|YP_003125171.1| alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
 gi|256039426|gb|ACU62970.1| Alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
          Length = 841

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 236/603 (39%), Positives = 342/603 (56%), Gaps = 38/603 (6%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAP 69
           N LK+ +  PA  ++ A+P+GNGR+GAMV+GG   E ++LNE TLW+G P     NP A 
Sbjct: 38  NNLKLWYKEPAIEWSQALPLGNGRVGAMVFGGTSEELIQLNEATLWSGGPVSKQVNPAAA 97

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL--EFDDSHLKYAEETYRRE 127
             L  VR+ + S +Y EA +   K+ G  +  +  LGDI +  +  D+ +      Y R+
Sbjct: 98  SYLPAVRAALFSEKYHEADSLLRKMQGAFSQSFLPLGDIRIHQQLKDTLV----SQYSRD 153

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           LD+  A +  ++  G + +TRE F S PDQVIV ++  S+ G+L F     S L   + V
Sbjct: 154 LDIANAKSITRFVSGGITYTRELFISAPDQVIVIRLRSSKKGALQFKADPSSQLHYQNSV 213

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALE 238
            G  +I M G+ P +  P   N N +P         KG+++   L ++     GT++  +
Sbjct: 214 TGAKEIAMRGKAPSQVDPSYINYNAEPIQYEAAGSCKGMRYE--LRMRAISPDGTVTT-D 270

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
              + V+ +  A+LLL A++SF+G    P     D  + +   ++    LSY++L  RH 
Sbjct: 271 ATGITVKNATEAILLLTAATSFNGFDKCPDSEGLDEKAIAGGQMKKAAALSYANLLQRHE 330

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFG 357
            DY K F+RVS+ LS             ++    P+ ER++ +    +D +L  L FQFG
Sbjct: 331 QDYHKYFNRVSLNLS------------GDDQSAQPTDERLRRYTAGGKDQALESLYFQFG 378

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLIS SR  +  ANLQGIWN++L   W S   +NIN +MNYW +  CNL E Q+PL+ 
Sbjct: 379 RYLLISCSRTPSAPANLQGIWNKELRAPWSSNYTININTQMNYWPAEVCNLMEMQQPLYQ 438

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS--ADRGKV--VWALWPMGGAWLCTH 473
            L  LS+ G+ TA   Y   GWV HH TDIWA ++   D+GK    WA W MGG WLC  
Sbjct: 439 LLKELSVTGAATAGEFYNTRGWVAHHNTDIWAIANPVGDKGKGDPQWANWMMGGNWLCQF 498

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGK 532
           LW+HY YT D  FL   AYP+++  A F LD+L++    GYL T P+TSPE++F+  +G 
Sbjct: 499 LWQHYCYTGDEKFLRDTAYPIMKSAALFSLDFLVKDPASGYLVTAPATSPENKFLLANGT 558

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
              VS +STMDM IIRE+F+ +I A EVL K ++ L + +  +  RL P KI +DGS+ E
Sbjct: 559 QESVSIASTMDMTIIRELFNNVIKAGEVL-KVDNGLRDSLQVAADRLYPFKIGKDGSLQE 617

Query: 593 WVQ 595
           W +
Sbjct: 618 WYK 620


>gi|354582995|ref|ZP_09001895.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
 gi|353198412|gb|EHB63882.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
          Length = 758

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 228/591 (38%), Positives = 327/591 (55%), Gaps = 50/591 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + +A+PIGNGRLGAM++GG   E L+LNED++W G P D  N DA   L 
Sbjct: 12  RLWYRKPAAEWNEALPIGNGRLGAMIFGGTAEEKLQLNEDSVWYGGPRDRNNEDALPHLP 71

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
           ++R L+  G+  EA   A++ + G P     Y  LGD+ L F  SH       Y RELDL
Sbjct: 72  EIRKLIMEGRLREAEELAAMTMAGLPEAQRHYMPLGDLLLSF--SHHDLPAVDYVRELDL 129

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
               +RV Y +G + +TRE F+S PDQ IV +IS  + G++S     +    N  Y+   
Sbjct: 130 ENGISRVSYRIGEIRYTRELFASYPDQAIVIRISADKQGTVSLKARFNR--RNWRYLEKT 187

Query: 191 NQ-----IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
           ++     + M G C G+             G  FSA+L  K   D G    L  + L V+
Sbjct: 188 DKWKESGLAMRGDCGGE------------GGSSFSAVL--KAVPDGGVCRTL-GEYLLVD 232

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G+    LL+ A ++F  P         DP  +    L+ +  + Y++L  RH+ DY++L+
Sbjct: 233 GASSVTLLITAGTTFRHP---------DPELDGKRRLEMLSRVPYAELLARHVADYRELY 283

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISS 364
            RV ++L  SP   V           +P+ ER+  FQ   ED  L+   FQFGRYLLI+S
Sbjct: 284 GRVDLKLPESPDKTV-----------LPTDERLMQFQQGGEDHGLIATYFQFGRYLLIAS 332

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           SRPG+  ANLQGIWN++ +P WDS   +NIN +MNYW +  CNL+EC EPLF+ +  +  
Sbjct: 333 SRPGSLPANLQGIWNDNFTPPWDSKFTININAQMNYWHAENCNLAECHEPLFELIERMRE 392

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA V Y   G+  HH TDIWA ++     +  + WPMG AWLC HLWEHY +  DR
Sbjct: 393 PGRVTAHVMYGCRGFTAHHNTDIWADTAPQDTYLPASFWPMGAAWLCLHLWEHYRFGQDR 452

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
            FL  R Y  ++  A FLLD+LIE  +G L T PS SPE+ +  P+G+   +   + MD 
Sbjct: 453 YFL-ARVYETMKEAALFLLDYLIEDAEGRLVTCPSVSPENRYKLPNGETGVLCVGAAMDF 511

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            II  +F A I A+E++ ++E A  +++  +L RL   +I + G I EW++
Sbjct: 512 QIIEALFDACIRASEIIGRDE-AFRDELTGTLKRLPQPQIGKYGQIQEWME 561


>gi|308070789|ref|YP_003872394.1| hypothetical protein PPE_04076 [Paenibacillus polymyxa E681]
 gi|305860068|gb|ADM71856.1| Conserved hypothetical protein [Paenibacillus polymyxa E681]
          Length = 822

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 220/612 (35%), Positives = 336/612 (54%), Gaps = 47/612 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L++ +  PAK + +A+P+GNGRLGAMV+GG+  E L+LNEDTLW+G P D  + DA + L
Sbjct: 10  LRLWYRQPAKVWEEALPVGNGRLGAMVFGGIREEHLQLNEDTLWSGFPRDGVHYDALRYL 69

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
             VR  +  G+Y EA    +  + G   + YQ LGD+ +    +     E   Y RELDL
Sbjct: 70  QPVRKRIADGKYKEAEQLINTNMLGRDTEAYQPLGDLWV----TQEGLGEIVHYERELDL 125

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
            T TA V +    V +TRE  +S PD +++  ++ ++ G +  +V + S       V  +
Sbjct: 126 LTGTAAVTFQSDGVRYTREVIASAPDGIMMVSLTANKLGRIHASVRITSPHPCEDEVGED 185

Query: 191 NQ----------------------IIMEGRCPGKRIP------PKANANDDPKGIQFSAI 222
                                   I + GR P           P++   ++  G+ F+  
Sbjct: 186 AHFGDSSKWDSDNDDSSDESSGDFITLTGRAPSHVESNYHGDHPQSVVYENDLGMAFA-- 243

Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL 282
           ++ ++  + GT++   D  L + G+D   + L A++ F G    P+    +        L
Sbjct: 244 VQARVIPEGGTLTKGADGALIISGADKITVYLAAATGFQGFHAMPNSDATESVDACQVIL 303

Query: 283 QSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ 342
               +L    +  RH  D++KLF RV+++L         DT + E++  +P+ +R++ +Q
Sbjct: 304 DGAISLGSEQVRQRHEQDHRKLFDRVALELG-------GDTLTNESV--LPTDQRLELYQ 354

Query: 343 TDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
             + DP L  LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S    NIN +MNYW
Sbjct: 355 KGQADPGLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQMNYW 414

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
            +  CNL+EC EPL   +  ++  G + A ++Y A GW  HH  D+W  +    G   WA
Sbjct: 415 PAEVCNLAECHEPLLHMIGEVARTGRRVASIHYGAQGWAAHHNVDVWRYAGPSGGHASWA 474

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTS 521
            WP+GG WL  HLWE Y +T+D  +L ++AYPL++G A+F +DWL+EG  G L T+PSTS
Sbjct: 475 FWPLGGVWLTAHLWERYLFTLDTAYLAEQAYPLMKGAAAFCMDWLVEGPKGRLVTSPSTS 534

Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
           PE++F  PDG+   +S  STMDM +IRE+ S  I AA++LE ++D    +   +  RL P
Sbjct: 535 PENKFKTPDGEECSISMGSTMDMTLIRELLSNCIQAADLLELDDD-FRNRCEGTRARLMP 593

Query: 582 TKIAEDGSIMEW 593
            +I   G + EW
Sbjct: 594 YQIGRHGQLQEW 605


>gi|430742223|ref|YP_007201352.1| hypothetical protein Sinac_1268 [Singulisphaera acidiphila DSM
           18658]
 gi|430013943|gb|AGA25657.1| hypothetical protein Sinac_1268 [Singulisphaera acidiphila DSM
           18658]
          Length = 806

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 228/588 (38%), Positives = 342/588 (58%), Gaps = 35/588 (5%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + I +  PA+ +T+A+PIGNG+LGAMV+GG  SE + LNEDT+W G   D TNPDA K+L
Sbjct: 38  MVIHYRRPAEAWTEALPIGNGQLGAMVFGGTGSERIALNEDTVWAGERRDRTNPDALKSL 97

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
            ++R L+  G+  EA A A   +   P  +  YQ LGD+ + F         + YRRELD
Sbjct: 98  PEIRRLLRVGKPDEAEALAERTMIAVPKRLPPYQPLGDLRILFPGHD---QADDYRRELD 154

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L++A  RV Y VG+  F RE F+S  DQV+V +++    G L+F+ +LD   D  +    
Sbjct: 155 LDSAMVRVSYRVGDATFRREVFASAKDQVLVVRLTCDRPGRLAFSATLDRERDARAEAVA 214

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
            +++++ G      I       D+ K G++FSA L +     R      E  +++V  +D
Sbjct: 215 PDRVLLRGEA----IARDERHEDERKVGVKFSAFLRVVTEGGR---VFTEGDRVEVRDAD 267

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
            A L LVA++ F           KDP +    AL +  +  Y  L + H DD++  F RV
Sbjct: 268 AATLRLVAATDF---------RSKDPDAACERALAAA-DRPYEPLRSEHEDDHRSFFRRV 317

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
           S++ + +P D       +++   +P+  R+   +  E DP+L+   FQFGRYLLI+SSRP
Sbjct: 318 SLEFA-APGD-------KDDRAALPTDVRLARVRKGESDPALIAQYFQFGRYLLIASSRP 369

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           GT  ANLQGIWNE L+P W+S   +NIN +MNYW +   NL+E  +PLFD +  +  +G 
Sbjct: 370 GTMPANLQGIWNESLTPPWESKYTININTQMNYWPAEVANLAELHQPLFDLIEAMRPSGR 429

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           +TA+  Y A G++ HH TD+WA  +    KV   LWPMG AWL  HLW+HY++  DRDFL
Sbjct: 430 QTAKALYGARGFMAHHNTDLWAH-TVPVDKVGSGLWPMGAAWLSLHLWDHYDFGRDRDFL 488

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
            +RAYP+++  A FLLD+L++   G L   PS SPE+ +   DGK+A +    TMD+ I 
Sbjct: 489 AQRAYPVMKEAAEFLLDYLVDDGQGQLIPGPSISPENRYRTADGKVAKLCMGPTMDVEIA 548

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             +F  ++ A+E+L+ + D   ++V ++  RL   +I + G + EW++
Sbjct: 549 HALFGRVVEASELLDLDPD-FRKRVAEARRRLPSLRIGKHGQLQEWLE 595


>gi|326801540|ref|YP_004319359.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
 gi|326552304|gb|ADZ80689.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
          Length = 801

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 230/598 (38%), Positives = 330/598 (55%), Gaps = 33/598 (5%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--YTNPDA 68
           N LK+ ++ PA  F +A+P+GNGRLGAMV+GGV  E L LNE TLW+G P D    NP A
Sbjct: 26  NNLKLWYSKPAGKFEEALPLGNGRLGAMVYGGVQEERLSLNEATLWSGKPVDENKVNPQA 85

Query: 69  PKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
              L  V+  + +  Y  A +    + G  +  Y+ LG++ + F     +     +RREL
Sbjct: 86  KDHLPAVQEALFNEDYQTADSLIRFMQGAYSQSYEPLGNLLIHFKH---QGTPTHFRREL 142

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           D++ A ARV Y +    + RE F+S+PDQ+IV +++      L F    +SLL + S   
Sbjct: 143 DISQAIARVSYQLNGTSYRREIFASHPDQLIVIRLTAEGKDRLDFTCRFNSLLRSKS-KK 201

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISALEDKKL 242
            +  + M G  P    P   N   +P        ++F+++L++  +D +   ++ +D  L
Sbjct: 202 QSTSLWMHGWAPIHTEPNYRNKEKNPVVYDTLNSMRFASMLKVLKNDGQ---TSWQDSSL 258

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            +  +   VLLL  ++S+ G   NP  + K+    ++S L+     S++ L  +H+ DY+
Sbjct: 259 AISNAKEVVLLLSMATSYSGFDKNPGRAGKNELDLALSYLKEAEKQSFASLQAKHIQDYR 318

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLL 361
             F RVSI L    K              +P+ ER++ F + D D +LV L +Q+ RYLL
Sbjct: 319 HYFDRVSINLGHGEKA------------NLPTDERLERFAKGDGDNNLVALFYQYSRYLL 366

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           ISSSRPG Q  NLQ +WNE + P W S    NIN EMNYW +   NL E  +PLFDF+  
Sbjct: 367 ISSSRPGGQPTNLQALWNEIVRPPWSSNYTTNINTEMNYWGTEVANLPEMHQPLFDFIGR 426

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
           L+  G+ TA+  Y A GWV HH TDIWA +        G   WA W M G WL THLWEH
Sbjct: 427 LAQTGAITAKNYYNADGWVCHHNTDIWAMTHPVGHFGEGHPSWANWQMAGVWLSTHLWEH 486

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           + +T D DFL K+AYPL++G   F L +L    DGYL T PSTSPE+ +I   G    V 
Sbjct: 487 FAFTADADFLRKQAYPLMKGAVDFCLSFLTTNKDGYLVTAPSTSPENIYITDKGYKGAVL 546

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           Y ST D+A+IRE+F+  + AA +L+K++    E V  +L +L P KI   G++ EW  
Sbjct: 547 YGSTADIAMIRELFADYLKAAVILKKDKKT-QEAVTNALAKLPPYKIGRKGNLREWYH 603


>gi|333380580|ref|ZP_08472271.1| hypothetical protein HMPREF9455_00437 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826575|gb|EGJ99404.1| hypothetical protein HMPREF9455_00437 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 823

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 237/602 (39%), Positives = 348/602 (57%), Gaps = 35/602 (5%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-TNPDAP 69
           N L++ +  PA  +T+A+P+GNG +G M++GGV +E ++LNE +LW+G P     NP+A 
Sbjct: 22  NKLQLWYEKPAGKWTEALPVGNGFIGGMIFGGVDNELIQLNEGSLWSGGPQKKNVNPEAY 81

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELE---FDDSHLKYAEETYRR 126
           K L  +R  +    Y  AT    K+ G+  + +  LGD+ ++    D+  LK     YRR
Sbjct: 82  KYLQPIREALAKEDYKLATELCKKMQGYYGESFLPLGDLHIKQTYADNRRLK----NYRR 137

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            LDL  A A  ++ +  V++ RE F+S PD V+V  I+ S  G ++  VSL+S L     
Sbjct: 138 TLDLENAIATTEFEINGVKYIREIFTSAPDSVLVMHITASMPGMINLEVSLNSQLSGTLS 197

Query: 187 VNGNNQIIMEGRCPGK----------RIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
            +G N+I++ G+ P +          R P +    +   G++F  +++ + S D   IS 
Sbjct: 198 ADGKNRIVLRGKAPARVDPNYYNKPGRNPIEQTDAEGCNGMRFQTVVQAR-SKDGAIIS- 255

Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
            ++  + ++ +    LLL A++SF+G    P    KD    S S +  +++  Y DL T 
Sbjct: 256 -DNNGIYIKNATSVTLLLSAATSFNGFDKCPDSEGKDEKRISESYIAHVQDKGYYDLKTT 314

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQ 355
           H++DYQK F+RVS  L   P   +T   + +    +PS  R+K +   + DP L  L F 
Sbjct: 315 HINDYQKYFNRVSFSL---PNTTITRDVNRK----LPSDMRLKLYSYGNYDPELESLFFH 367

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           +GRYLLIS+SRPG   ANLQG+WN++  P W S   +NIN +MNYW +   NLSE  +PL
Sbjct: 368 YGRYLLISASRPGGSAANLQGLWNKEFRPPWSSNYTININTQMNYWPAEIANLSEMHQPL 427

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA--DR--GKVVWALWPMGGAWLC 471
             F+  LS  G+ TAQ  Y A GWV HH TDIW  S+A  DR  G   WA W MGG WLC
Sbjct: 428 LQFIQNLSKTGTITAQEYYRAKGWVAHHNTDIWGLSNAVGDRGDGDPNWANWYMGGNWLC 487

Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
            HLWEHY +T D+ FL+  AYP+++  A F  DWLIE  DGYL T+PSTSPE  F+  DG
Sbjct: 488 QHLWEHYQFTGDKGFLKDIAYPVMKEAALFCFDWLIE-KDGYLITSPSTSPEAAFVTADG 546

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
           K   V+ ++TMD+AIIR++F+ +I A++ L  ++    E+++K   +L P KI   G + 
Sbjct: 547 KRYSVTEAATMDIAIIRDLFTNLIEASQELNFDK-KFREQLIKKRDKLLPYKIGSQGQLQ 605

Query: 592 EW 593
           EW
Sbjct: 606 EW 607


>gi|410098957|ref|ZP_11293931.1| hypothetical protein HMPREF1076_03109 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409220088|gb|EKN13045.1| hypothetical protein HMPREF1076_03109 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 848

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 228/622 (36%), Positives = 336/622 (54%), Gaps = 52/622 (8%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN- 65
           T     L + +N P++++ +A+PIGNGR GAMV+GGV  E L+LNE+TL++G P      
Sbjct: 21  TQKKESLVLWYNEPSENWNEALPIGNGRAGAMVFGGVDKEQLQLNENTLYSGEPSTVFKD 80

Query: 66  -PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEET 123
               P+    V  L+ + +Y EA+    K   G     YQ  GD+ +E +    K  E +
Sbjct: 81  IKITPEMFDKVVGLMKAQKYDEASDLVCKHWLGRLHQYYQPFGDLFIENN----KPGEVS 136

Query: 124 -YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            Y+REL+++ A  R  +    V++ RE F+S+PD VI+  +  S    L  +++  S   
Sbjct: 137 GYKRELNISDAVTRTVFEQNGVQYEREIFASHPDDVIIVHLKSSTPDGLDLSLNFTSPHP 196

Query: 183 NHSYVNGNNQIIMEGRCPG----------------------------KRIPPKANAND-- 212
                 G +++++ G+ PG                            ++   +    D  
Sbjct: 197 TAKQSKGTDRLVLHGQAPGYVERRTFEQMEAWGDQYKHPELYDEKGNRKFDKRVLYGDEI 256

Query: 213 DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
           D KG+ F A  ++K    +G    + D  + V  ++    +L  ++SF+G   +PS    
Sbjct: 257 DNKGMFFEA--QLKPVLPKGGDYEITDAGVHVYNTNEVYFVLSMATSFNGFDKSPSREGV 314

Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
           DP++++   L       Y  L  RH+ DYQKLF RV +QL  SP+              +
Sbjct: 315 DPSAKAAGILDKALAYDYKQLKQRHMADYQKLFDRVDLQLPSSPEQ-----------KAM 363

Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
           P+ +R+  F+T  DP L  LLFQFGRYL+IS SRPG Q  NLQGIWN+D+ P W+S   +
Sbjct: 364 PTDQRIAQFETMGDPDLAALLFQFGRYLMISGSRPGGQPLNLQGIWNKDVVPAWNSGYTI 423

Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
           NIN EMNYW +   NLSEC EPLF  +  L+++G++TA+  Y   GWV HH T IW +S 
Sbjct: 424 NINTEMNYWPAEVTNLSECHEPLFRLIDELAVSGAETARNMYNRRGWVGHHNTSIWRESV 483

Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
            +      + WPM   WLC+HLWEHY YT D+DFL+ RAYPL++G A F  DWLI+  +G
Sbjct: 484 PNDNVPTASFWPMVQGWLCSHLWEHYLYTQDQDFLKNRAYPLMKGAAEFFADWLIDDGNG 543

Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
            L T    SPE+ FI  +GK   ++   TMDMAI+RE F+  + AAE+L  +E +L  ++
Sbjct: 544 RLVTPVGVSPENRFIMDNGKQGAMTMGPTMDMAIVRETFTRTLQAAEMLGLDE-SLQAEL 602

Query: 573 LKSLPRLRPTKIAEDGSIMEWV 594
              LPRL P +I   G + EW+
Sbjct: 603 KDKLPRLLPYQIGARGQLQEWM 624


>gi|340617674|ref|YP_004736127.1| alpha-L-fucosidase [Zobellia galactanivorans]
 gi|339732471|emb|CAZ95739.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
          Length = 807

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 226/593 (38%), Positives = 337/593 (56%), Gaps = 36/593 (6%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDV 75
           F+ PA+HF + + +GNG+ GA ++GGV ++++ LN+ TLW+G P D Y NP+A K L  +
Sbjct: 37  FDRPAEHFEETLVLGNGKAGASIFGGVATDSIYLNDATLWSGEPVDPYMNPEAYKNLPAI 96

Query: 76  RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
           R  + +  Y  A +   KL G  +  Y  LG + L F+    K   ++Y R+L+L  A +
Sbjct: 97  REALKNENYKLADSLQSKLQGSFSQSYMPLGTVYLNFEH---KNQPQSYHRQLELEKALS 153

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
            V Y V  V FTRE+F S+ DQ +V ++  S+ G+L+FN+  +SLL      NG   + +
Sbjct: 154 TVTYKVDGVTFTREYFISHADQAMVIRLKSSKKGALNFNIGFNSLLKYELATNGPT-LEV 212

Query: 196 EGRCPGKRIP------PKANANDDPKGIQFSAILEIKISDDR--GTISALEDKKLKVEGS 247
            G  P    P      P     D  +G +F+++  IK +D +  GT     D  + ++ +
Sbjct: 213 NGYAPYHVEPSYRGKMPNPVQFDPNRGTRFTSLFRIKHTDGKLIGT-----DNTVALKDA 267

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
             AV+ +  ++SF+G   NP+    D  + + S L    +  +  L+  HL D+QK F+R
Sbjct: 268 TEAVVYVSIATSFNGFDKNPATEGLDHKAMASSQLSKASSKPFDALFEAHLKDHQKYFNR 327

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSR 366
           V + L +S              + +P+ ER+K + + +ED +L  L FQ+GRYLLISSSR
Sbjct: 328 VHLDLGKS------------TAEDLPTDERLKRYAKGEEDKNLEVLYFQYGRYLLISSSR 375

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
                ANLQGIWN  + P W S   +NIN E NYW +   NLSE  +P+  F+  ++  G
Sbjct: 376 TPNVPANLQGIWNPYIRPPWSSNYTLNINAEENYWLAENANLSEMHQPMLGFIENIAQTG 435

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
             TA+  Y A GW   H +DIWA S+      +G + WA W MGG WL +HLWEHY ++ 
Sbjct: 436 KITAKTFYGAGGWAACHNSDIWAMSNPVGDFGQGGINWANWNMGGTWLSSHLWEHYTFSQ 495

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           D DFL+ RAYPLL+G A F L+WL+E  DG L T+P TSPE++FI PDG      Y ST 
Sbjct: 496 DLDFLKNRAYPLLKGAAEFCLEWLVEDKDGNLVTSPGTSPENKFITPDGYQGATLYGSTS 555

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           D+A+IRE F   I+A+E L K + A   ++ K+L +L P ++ + G++ EW  
Sbjct: 556 DLAMIRECFQQTIAASETL-KTDAAFRTQLEKALAKLYPYQVGKKGNLQEWYH 607


>gi|255035537|ref|YP_003086158.1| hypothetical protein Dfer_1752 [Dyadobacter fermentans DSM 18053]
 gi|254948293|gb|ACT92993.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 833

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 236/620 (38%), Positives = 340/620 (54%), Gaps = 37/620 (5%)

Query: 1   MMNAESTSTT----NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLW 56
           ++NA ST         LK+ ++ PA  + +A+P+GNG +GAMV+GGV  E ++LNE TLW
Sbjct: 12  LLNALSTDVIAQKGQDLKLWYSKPASRWVEALPVGNGHIGAMVFGGVEEELMQLNESTLW 71

Query: 57  TGVP-GDYTNPDAPKALSDVR-SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD 114
           +G P     NP +   L  VR +L++   Y +A     K+ G   + Y  + D+++  D 
Sbjct: 72  SGGPVKTNVNPASASYLPQVRKALLEEQDYQKANELLKKMQGLYTESYMPMADLKIVHD- 130

Query: 115 SHLK-YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
             LK      Y R+LD+  + A  ++S G V++ RE F+S PD ++V K+S S+  +L+F
Sbjct: 131 --LKGQPASAYYRDLDIAHSKATTRFSAGGVDYKREVFTSAPDNIMVIKVSASKPNALNF 188

Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA-------NDDPKGIQFSAILEIK 226
            VSL S L      +GN ++++ G+ P    P   N         DDP G   +      
Sbjct: 189 TVSLSSQLRYRLEASGNKELLVNGKAPSHVAPNYYNPPGQEPIIYDDPNGCNGTRFQIRT 248

Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR 286
            +  RG  + ++   + V+ +   V+ L A++SF+G    P    KD  + + + L    
Sbjct: 249 KAVSRGGTTVVDTAGIHVKNATEVVIFLSAATSFNGFDKCPDKDGKDEKALAKNYLDKAL 308

Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDE 345
              Y+ L T H  DY   F+RVS          VTDT +      +PS ER+ ++ + D 
Sbjct: 309 AKGYATLATSHQHDYHSYFNRVSFS--------VTDTLTRNPNTALPSDERLMAYAKGDY 360

Query: 346 DPSLVELLFQFGRYLLISSSR------PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
           DP L  L +QFGRYLLISSSR      P    ANLQGIWN+++ P W S   +NIN +MN
Sbjct: 361 DPGLETLYYQFGRYLLISSSRAALPGVPAGPPANLQGIWNKEMRPPWSSNYTININTQMN 420

Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS----ADR 455
           YW +   NLSE   PL  ++  LS  G+ TA+  Y A GWV HH  DIW  S+       
Sbjct: 421 YWPAEVANLSEMHRPLLSWIKDLSQTGAVTAKEFYDAKGWVAHHNADIWGMSNPVGNVGD 480

Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLE 515
           G  VWA W MG  WLC HLWEHY ++ D+ FL  + YPL++  A F LDWL+E  DGYL 
Sbjct: 481 GDPVWANWYMGANWLCQHLWEHYRFSGDKAFLRDKGYPLMKEAALFTLDWLVEDKDGYLV 540

Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
           T PSTSPE++F  P G  A VS ++TMD++II ++FS +I AAEVL  +ED   + +++ 
Sbjct: 541 TAPSTSPENKFKDPKGGEAAVSVATTMDISIIHDLFSNLIDAAEVLGTDED-FRKLLIEK 599

Query: 576 LPRLRPTKIAEDGSIMEWVQ 595
             +L P KI   G + EW +
Sbjct: 600 RAKLYPLKIDGRGRLQEWYK 619


>gi|423342630|ref|ZP_17320344.1| hypothetical protein HMPREF1077_01774 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409217547|gb|EKN10523.1| hypothetical protein HMPREF1077_01774 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 844

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 223/626 (35%), Positives = 335/626 (53%), Gaps = 50/626 (7%)

Query: 2   MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           M  E T    PL + ++ PA+++ +A+PIGNGR GAM++G   +E L+LNE+TL++G P 
Sbjct: 14  MACEETPQKEPLVLWYDSPARNWDEALPIGNGRSGAMIFGRTDNEQLQLNENTLYSGEPS 73

Query: 62  DYTN--PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLK 118
                    P+    V  L+ +G+Y EA+    K   G     YQ  GD+ ++   ++ +
Sbjct: 74  VVFKDVKITPEMFDKVVGLMKAGKYTEASDLVCKNWLGRLHQYYQPFGDLHIQ---NNKQ 130

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
                Y+R L+++ A A   Y  G   + RE F+S+PD VIV ++  +    +  +++  
Sbjct: 131 GEANRYKRTLNISDAVATTVYEQGGTHYEREVFASHPDNVIVMRLKSNTPDGIDISLNFT 190

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGK----------------RIPPKANAND---------- 212
           S          ++++I+ G+ PG                 + P   +AN           
Sbjct: 191 SPHPTALQKGRDDRLILHGQAPGYVERRTFEQIEQWGDPYKHPELYDANGKRKFNKRMLY 250

Query: 213 ----DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPS 268
               D KG+ F A L+     D      + D  + V  +D    +L  ++SF+G   +PS
Sbjct: 251 GEEIDGKGMFFEAQLKPVFPKD--GKCDITDSGIHVYDTDEVYFVLSMATSFNGFDKSPS 308

Query: 269 DSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEEN 328
               DP++++   L    + +Y  L  RH +DY+ LF+RV  +L+ SP+           
Sbjct: 309 REGIDPSAKAAGILDKALSYNYQTLKQRHTEDYRSLFNRVDFKLASSPEQ---------- 358

Query: 329 IDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
              +P+ +R++ F    DP L  LLFQFGRYL+IS SRPG Q  NLQG+WN+D  P W+ 
Sbjct: 359 -KAMPTDKRIEQFAQTADPELAALLFQFGRYLMISGSRPGGQPLNLQGMWNKDTIPAWNC 417

Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
              +NIN EMNYW +   NLSECQ+PLF  +  L+++G++TA+  Y   GWV HH T IW
Sbjct: 418 GYTININTEMNYWPAELTNLSECQQPLFRMIRELAVSGAETARNMYNRRGWVAHHNTSIW 477

Query: 449 AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE 508
            +S  +      + WPM   WLC+HLWEHY +T D  FL+  AYPL++G A F  DWLIE
Sbjct: 478 RESLPNDNVPTASFWPMVQGWLCSHLWEHYQFTQDETFLKNEAYPLMKGAAEFFADWLIE 537

Query: 509 GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL 568
             +GYL T    SPE+ FI  DG+ A +S   TMDMAIIRE F+  I A+E+   +E +L
Sbjct: 538 DENGYLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIEASEMFNLDE-SL 596

Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEWV 594
             ++   L RL+P +I E G + EW+
Sbjct: 597 RNELKNKLARLQPYQIGERGQLQEWI 622


>gi|218258383|ref|ZP_03474775.1| hypothetical protein PRABACTJOHN_00430 [Parabacteroides johnsonii
           DSM 18315]
 gi|218225510|gb|EEC98160.1| hypothetical protein PRABACTJOHN_00430 [Parabacteroides johnsonii
           DSM 18315]
          Length = 844

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 223/626 (35%), Positives = 335/626 (53%), Gaps = 50/626 (7%)

Query: 2   MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           M  E T    PL + ++ PA+++ +A+PIGNGR GAM++G   +E L+LNE+TL++G P 
Sbjct: 14  MACEETPQKKPLVLWYDSPARNWDEALPIGNGRSGAMIFGRTDNEQLQLNENTLYSGEPS 73

Query: 62  DYTN--PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLK 118
                    P+    V  L+ +G+Y EA+    K   G     YQ  GD+ ++   ++ +
Sbjct: 74  VVFKDVKITPEMFDKVVGLMKAGKYTEASDLVCKNWLGRLHQYYQPFGDLHIQ---NNKQ 130

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
                Y+R L+++ A A   Y  G   + RE F+S+PD VIV ++  +    +  +++  
Sbjct: 131 GEANRYKRTLNISDAVATTVYEQGGTHYEREVFASHPDNVIVMRLKSNTPDGIDISLNFT 190

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGK----------------RIPPKANAND---------- 212
           S          ++++I+ G+ PG                 + P   +AN           
Sbjct: 191 SPHPTALQKGRDDRLILHGQAPGYVERRTFEQIEQWGDPYKHPELYDANGKRKFNKRMLY 250

Query: 213 ----DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPS 268
               D KG+ F A L+     D      + D  + V  +D    +L  ++SF+G   +PS
Sbjct: 251 GEEIDGKGMFFEAQLKPVFPKD--GKCDITDSGIHVYDTDEVYFVLSMATSFNGFDKSPS 308

Query: 269 DSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEEN 328
               DP++++   L    + +Y  L  RH +DY+ LF+RV  +L+ SP+           
Sbjct: 309 REGIDPSAKAAGILDKALSYNYRTLKQRHTEDYRSLFNRVDFKLASSPEQ---------- 358

Query: 329 IDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
              +P+ +R++ F    DP L  LLFQFGRYL+IS SRPG Q  NLQG+WN+D  P W+ 
Sbjct: 359 -KAMPTDKRIEQFAQTADPELAALLFQFGRYLMISGSRPGGQPLNLQGMWNKDTIPAWNC 417

Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
              +NIN EMNYW +   NLSECQ+PLF  +  L+++G++TA+  Y   GWV HH T IW
Sbjct: 418 GYTININTEMNYWPAELTNLSECQQPLFRMIRELAVSGAETARNMYNRRGWVAHHNTSIW 477

Query: 449 AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE 508
            +S  +      + WPM   WLC+HLWEHY +T D  FL+  AYPL++G A F  DWLIE
Sbjct: 478 RESLPNDNVPTASFWPMVQGWLCSHLWEHYQFTQDETFLKNEAYPLMKGAAEFFADWLIE 537

Query: 509 GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL 568
             +GYL T    SPE+ FI  DG+ A +S   TMDMAIIRE F+  I A+E+   +E +L
Sbjct: 538 DENGYLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIEASEMFNLDE-SL 596

Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEWV 594
             ++   L RL+P +I E G + EW+
Sbjct: 597 RNELKNKLARLQPYQIGERGQLQEWI 622


>gi|332668180|ref|YP_004450968.1| alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
 gi|332336994|gb|AEE54095.1| Alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
          Length = 861

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 231/620 (37%), Positives = 342/620 (55%), Gaps = 52/620 (8%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNP 66
           +  N L++ ++ PA  +T+A+PIGNG +GAMV+G    E L+LNE TL++G P G +T+ 
Sbjct: 17  AQNNHLQLWYDQPASVWTEALPIGNGYMGAMVFGDPLQEHLQLNEGTLYSGDPKGTFTSI 76

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
           +  KA   V +L+++ +Y EA     K   G    +YQ +GD+ L  D  H K + + Y+
Sbjct: 77  NVRKAYPQVTALLEAKKYQEAQPLITKEWLGRNHQMYQPMGDLWL--DVEHDKSSIKAYK 134

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           R LDL TATA  +Y  G+  + R +F+S PD V+V K++ +  G +  N +L     + +
Sbjct: 135 RGLDLQTATAFTEYQSGSTTYRRTYFTSYPDHVLVMKMTATGPGKI--NCTLRQSTPHTA 192

Query: 186 ---YVNGNNQIIMEGRCPG---------------------------KRIPPKANANDDPK 215
              Y+   N + M+ R PG                           +R P  AN   D +
Sbjct: 193 PAKYLGQGNVLRMQSRAPGFALRRNFDLVEKLGDQHKYPELYEKTGERKPGAANFLYDQQ 252

Query: 216 --GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
             G+  +    +K+    GTIS + D K++V+ +   V++L A++S++G   +P+   KD
Sbjct: 253 IEGLGMAFESRLKVIHTGGTISNV-DGKIRVQNATELVIILSAATSYNGFDKSPAYEGKD 311

Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
           P     +  ++I N  +S LY RHL DYQ LF RV I L+           +E     +P
Sbjct: 312 PAKLLDTYFRAIDNKPFSTLYQRHLLDYQNLFKRVEINLA-----------AETEQSKLP 360

Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
           +  RV+ F   +DP+   L FQFGRYL+I+ SRPG Q  NLQGIWN+ L+P W+ A  +N
Sbjct: 361 TDRRVELFSNGQDPAFAALYFQFGRYLMIAGSRPGGQPLNLQGIWNDQLTPPWNGAYTIN 420

Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
           IN +MNYW +   NL+ECQEP F  +  L+ING +TA+  Y  +GWV HH  DIW + + 
Sbjct: 421 INAQMNYWPAEITNLAECQEPFFKAIKELAINGRETARNMYGNAGWVAHHNMDIW-RHAE 479

Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
                  + WPMGG WL +HLWEHY ++ D+ FL+   +PLL+G   F   WL++   GY
Sbjct: 480 PIDNCACSFWPMGGGWLVSHLWEHYLFSGDQQFLKNEVFPLLKGVVDFYQGWLVKNEAGY 539

Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
           L T    SPE  F+    K A  S   TMDMAI+RE F+  + AA+VL    D  V+ V 
Sbjct: 540 LVTPVGHSPEQNFVYEGNKQATYSPGPTMDMAIVREAFARYLEAAQVLGV-ADKSVDSVR 598

Query: 574 KSLPRLRPTKIAEDGSIMEW 593
           ++L +L P +I + G + EW
Sbjct: 599 QNLAKLLPYQIGKYGQLQEW 618


>gi|224539148|ref|ZP_03679687.1| hypothetical protein BACCELL_04050 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224519233|gb|EEF88338.1| hypothetical protein BACCELL_04050 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 822

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 236/599 (39%), Positives = 338/599 (56%), Gaps = 34/599 (5%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-DYTNPDAPKA 71
           L++ +  PA  + +A+P+GNG +GAMV+G V +E ++LNE TLWTGVP     NPDA   
Sbjct: 24  LRLWYEKPANTWVEALPLGNGYIGAMVYGKVENELIQLNEGTLWTGVPCVKSVNPDAYSY 83

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELE--FDDSHLKYAEETYRRELD 129
           LS++R  +    +A A   S K+ G+ +  +  LGD+E++  F D    Y    Y+RELD
Sbjct: 84  LSEMREALSRDDFAAAGTLSKKMQGYFSQSFLPLGDLEIKQSFGDRKAWYL--GYKRELD 141

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           LN A     +  G V++ RE F+S PD+V+V + + S+ G L+ + +  S L +     G
Sbjct: 142 LNEAILTTSFWEGGVQYVREMFTSAPDRVMVLRFTASQKGKLALDFTTKSRLSDAVEALG 201

Query: 190 NNQIIMEGRCPGKRIPPKANAN----------DDPKGIQFSAILEIKISDDRGTISALED 239
           +N + M+G  P +  P   N            +   G++F ++L  K     GT++  + 
Sbjct: 202 DNCLAMDGAAPARLDPAYYNRKGREPMMRVDENGCSGMRFRSLL--KAIPVGGTVTT-DK 258

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
           K + + G+D  +++  A++SF+G    P+   KD    +   L      S+ +L   H+ 
Sbjct: 259 KGIHINGADEILVIWTAATSFNGFDKCPACEGKDEKMLAGQYLAKASIKSFDELKDSHIR 318

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGR 358
           D+   F RVS+QL        TDT   +    +PS  R+K +   + DP L ELLFQ+GR
Sbjct: 319 DFASYFERVSLQL--------TDTVGSKVNAQLPSDFRLKLYSYGNYDPQLEELLFQYGR 370

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSSR G   ANLQGIWN+D  P W S   +NIN EMNYW +   NLSE   PL  +
Sbjct: 371 YLLISSSRLGGTAANLQGIWNKDFRPPWSSNYTININTEMNYWLAETTNLSEMHTPLLSW 430

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS----SADRGKVVWALWPMGGAWLCTHL 474
           +  LS  G  TA+  Y A GWV HH +DIW  S    +   G   WA W MGG WLC HL
Sbjct: 431 IKDLSKAGRATAKEFYHAKGWVAHHNSDIWGLSNPVGNKGDGSPEWANWTMGGNWLCQHL 490

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
           WEHY +T D+ FL   AYP+++  A F LDWL+E  D YL T+PS SPE+ F+  DGK  
Sbjct: 491 WEHYCFTGDKQFLADEAYPVMKEAALFCLDWLVERGD-YLITSPSVSPENLFVV-DGKKY 548

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            VS +STMDMAIIR++FS +I A+EVL  +     ++++ +  +L P +I   G + EW
Sbjct: 549 AVSEASTMDMAIIRDLFSNLIEASEVLNIDRK-FRKQLVTAKNKLFPYQIGAKGQLQEW 606


>gi|255530725|ref|YP_003091097.1| alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
 gi|255343709|gb|ACU03035.1| Alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
          Length = 786

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 221/589 (37%), Positives = 343/589 (58%), Gaps = 32/589 (5%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDV 75
           +N PA+ F + + +GNG+LGA V+GG+ S+ + LN+ TLW+G P + Y NP+A K +  +
Sbjct: 32  YNKPAQFFEETMVLGNGKLGAAVFGGIKSDKIFLNDATLWSGEPVNPYMNPEAYKQIPSI 91

Query: 76  RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
           R  + +  Y  A   + K+ G  +  Y  LG + ++F+ +    +   YRRELD++ + +
Sbjct: 92  REALKNENYKLANELNRKVQGAFSQSYAPLGTMHIKFNHTD---SASMYRRELDISKSLS 148

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
           ++ Y+V  V FTRE+F S P +V++ K++ S+ G+LSFNV  +SLL      N  N + +
Sbjct: 149 KITYNVSGVTFTREYFISKPARVMMIKLTSSKKGALSFNVDFESLLK-FEITNQGNTLRV 207

Query: 196 EGRCPGKRIPP-KAN-AN----DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           +G  P    P  + N AN    D+ +G +FS++  IK +D +  I   +   + ++    
Sbjct: 208 KGYAPYHAEPVYRGNIANSVKFDENRGTRFSSLFRIKNTDGQVII---QHGSIGLKNGTE 264

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A+L +   +SF+G   NP+   K     + S L+ +  ++Y  +   H++DYQ  F+RVS
Sbjct: 265 AILYIAIETSFNGFDKNPATEGKSDALLADSCLKKVVPVNYESVKHAHINDYQNYFNRVS 324

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPG 368
             L ++            N   +P+ ER+K + +  ED +L  L FQFGRYLLISSSR  
Sbjct: 325 FNLGKT------------NAPELPTDERLKRYAEGKEDKNLEILYFQFGRYLLISSSRTA 372

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
              ANLQGIWN  + P W S    NINL+ NYW +   NLSE  EPL  F+ +++  G  
Sbjct: 373 GVPANLQGIWNPYIRPPWSSNYTTNINLQENYWLAENTNLSELHEPLMKFIGHVAHTGKV 432

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           TA+  Y   GW + H +DIWA S+      +G  VWA W MGG WL THLWEHY +T+D+
Sbjct: 433 TAKTFYGVEGWALCHNSDIWAMSNPVGGFGQGDPVWANWNMGGTWLSTHLWEHYIFTLDK 492

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           +FL+++AYPL++G A F L+WL++   G L T+PSTSPE  FI  DG      Y  T D+
Sbjct: 493 NFLKQKAYPLMKGAARFCLNWLVKDKKGNLITSPSTSPEASFITADGSKGSTLYGGTADL 552

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A+IRE F   I A+++L   +    ++V  +L +L+P ++ ++G++ EW
Sbjct: 553 AMIRECFLQTIRASQIL-GTDITFRKEVESALRQLQPYQVGKNGNLQEW 600


>gi|336429038|ref|ZP_08609009.1| hypothetical protein HMPREF0994_05015 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336003732|gb|EGN33810.1| hypothetical protein HMPREF0994_05015 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 779

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 227/583 (38%), Positives = 323/583 (55%), Gaps = 35/583 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +++ +  PA ++ +A+P+GNGRLGAMVW G   E + LNED+LW+G P  +    A +  
Sbjct: 1   MELWYKEPASYWEEALPLGNGRLGAMVWSGTDQEKISLNEDSLWSGYPQSHDISGAAEYY 60

Query: 73  SDVRSLVDSGQYAEATAA-SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
              R L    +Y EA A     + G     Y  LG  EL  D +H +     Y+R L+L 
Sbjct: 61  LQARRLSMEKKYEEAQALLEQNVLGEYTQSYLPLG--ELTLDMAHPEGEIRNYKRALELE 118

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
            A +R++YS G+  +TRE F S PDQV+V  IS    G +S        L     +   N
Sbjct: 119 KALSRLEYSAGDTNYTREMFISAPDQVMVMHISADRPGMVSLKAGFSCQLRAEVSIE-EN 177

Query: 192 QIIMEGRCPGKRIPPKANAND--------DPKGIQFSAILEIKISDDRGTISALEDKKLK 243
           ++I++G  P +  P   ++ D        + KG+QF A+LEI +  + G +  L +  L+
Sbjct: 178 RMILDGIAPSQVDPSYIDSPDPVIYEDAPEKKGMQFCAVLEIDV--EGGEMKRLPEG-LE 234

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V  +D   L L A +SF+GPF +P    K       + LQ+ R + Y  L  RH+++YQ+
Sbjct: 235 VIHADSVTLFLAARTSFNGPFRHPFLEGKPYKEPCFAELQAAREMGYDRLLERHIEEYQQ 294

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
            F+RVS+ L    +++             P  ER+  +  D DP+   LLFQ+GRYLLIS
Sbjct: 295 YFNRVSMDLGPGREEL-------------PVPERLADWDKDVDPARFTLLFQYGRYLLIS 341

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSRPGTQ ANLQGIWN+ L   W S   VNIN EMNYW +   NL E  EPLFD +  L 
Sbjct: 342 SSRPGTQPANLQGIWNQHLRAPWSSNYTVNINTEMNYWGAETVNLPEMHEPLFDLIRNLR 401

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSS--ADRGK--VVWALWPMGGAWLCTHLWEHYN 479
           I+G  TA+++Y A G+V HH +DIW  S+   +RGK   V+A WP+   WL  H+++HY 
Sbjct: 402 ISGGNTARIHYNAGGFVSHHNSDIWCLSTPVGNRGKGTAVYAFWPLSAGWLSAHVYDHYL 461

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           ++ D DFL +  YP++   A F LD L E  DG L   PSTSPE++FI   GK+  VS +
Sbjct: 462 FSGDLDFLRQTGYPVIHDAARFFLDVLTENEDGELIFAPSTSPENQFIY-HGKVCAVSQT 520

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLR 580
           +TM MAI+REV     +   +L  +++ L   E+ L  LP  R
Sbjct: 521 TTMTMAIVREVLENAAACCRLLGIDQEFLAEAEEALGRLPSYR 563


>gi|354584579|ref|ZP_09003473.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
 gi|353194100|gb|EHB59603.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
          Length = 761

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 210/565 (37%), Positives = 321/565 (56%), Gaps = 23/565 (4%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
           MV+GG+  E ++ NEDTLW+G P D  N +A + L   R L+ S +YAEA      ++ G
Sbjct: 1   MVFGGIQEERIQWNEDTLWSGFPRDTNNYEALRYLQAARELIASEKYAEAEKLIEERMVG 60

Query: 97  HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVE-FTREHFSSNP 155
              + +  LGD+ +E   + +   +  YRRELDL    A V +  G  E F RE F S  
Sbjct: 61  RNTEAFLPLGDLLIE--QTGIDDWQSNYRRELDLGNGVASVVFRTGRGEHFQREMFISAA 118

Query: 156 DQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG------KRIPPKAN 209
           DQ+ V + +GS  GS+   + L S L   + +     + + G  P       +   P++ 
Sbjct: 119 DQIAVIRYTGSAEGSIHLKLKLQSPLRYETEITPGGVMRLFGHAPTHIADNYRGDHPQSV 178

Query: 210 ANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD 269
             ++  G+++   +++ +  D G I  +    L V G+    L + A++ F+G  + P  
Sbjct: 179 LYEEGSGLRYE--MQVAVRADGGRI-GINGDVLTVTGASAVTLHVAAATDFEGFDVMPGA 235

Query: 270 SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
              DP     + L++        L  RH +++  LF RV+++L         D      +
Sbjct: 236 KGSDPARLCSARLEAAAGYDDEALRLRHTEEHWALFGRVAVELG--------DAEHRARM 287

Query: 330 DTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
           + +P+ +R+ ++    EDPSL  L+FQ+GRYLL++SSRPGTQ A+LQG+WN  + P W+S
Sbjct: 288 EAIPTDQRLAAYAGGQEDPSLEALMFQYGRYLLMASSRPGTQPAHLQGLWNPHVQPPWNS 347

Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
               NIN EMNYW +   NLSEC EPL   +  L+++G++TA+++Y A GW  HH  D+W
Sbjct: 348 NYTTNINTEMNYWAAETGNLSECHEPLIQMVRELAVSGARTAKIHYNARGWAAHHNVDLW 407

Query: 449 AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE 508
             ++   G+ +WA WPM G WLC HLWEHY +  D ++L   AYPL+   A F LDWLIE
Sbjct: 408 RMANPSNGRAMWAFWPMAGPWLCRHLWEHYVFNPDPEYLRNTAYPLMREAALFCLDWLIE 467

Query: 509 GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL 568
             +G+L T+PSTSPE++F+  +G    VS  STMDMA+IRE+F   + A+E+LE + + L
Sbjct: 468 NGEGHLVTSPSTSPENQFLTKEGVPCSVSAGSTMDMALIRELFRHCLEASELLEIDRE-L 526

Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEW 593
            E++  +L RL P +I +DG +MEW
Sbjct: 527 QEELRSALERLLPYQIDDDGRLMEW 551


>gi|414868292|tpg|DAA46849.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
          Length = 457

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 214/404 (52%), Positives = 269/404 (66%), Gaps = 30/404 (7%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ F  PAK+FTDA PIGNGRLGAMVWG V SE L+LN DTLWTG PG+YTNP+AP  
Sbjct: 41  PLKVVFGSPAKYFTDAAPIGNGRLGAMVWGCVESERLQLNHDTLWTGGPGNYTNPNAPAV 100

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           LS VRSLV++G+Y EAT+A+  L G    V+Q LGDI+L F +  +KY    YRRELDL+
Sbjct: 101 LSKVRSLVENGKYPEATSAAYDLSGDQTQVFQPLGDIDLVFGED-IKYTN--YRRELDLH 157

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TAT  V Y+VG++ +TREHFSSNP QVIVTKIS ++ G++SF VSL S LD+   V   N
Sbjct: 158 TATVTVTYTVGDIVYTREHFSSNPHQVIVTKISANKPGNVSFTVSLTSPLDHKIRVTHAN 217

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           +IIMEG CPG+R      A D P GI+FSAIL ++I+    T+  L D  LK++ +D  V
Sbjct: 218 EIIMEGSCPGQRPEEIKTAADQPIGIKFSAILYLQINGANSTVEVLNDNMLKLDCADSVV 277

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           LLL A++SF   FI PS+SK DPT  + + L   R  SYS L   H+DDYQ LF RVS+Q
Sbjct: 278 LLLAATTSFQSAFIKPSESKLDPTVSAFTTLSIARRTSYSQLKAYHIDDYQTLFQRVSLQ 337

Query: 312 LS-------RSPKDIVTDTCSEENIDTV--------------------PSAERVKSFQTD 344
           LS       R  + + +   S +  +                      P+ ER+ +F+ +
Sbjct: 338 LSQGSNYDLRRSRLVQSAETSSQGANVSDYGFQISGCTRLTSLNSFVKPTVERIVTFKDN 397

Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
           EDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIW+ D SP WD+
Sbjct: 398 EDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWSNDTSPPWDT 441


>gi|21218886|ref|NP_624665.1| large hypothetical protein [Streptomyces coelicolor A3(2)]
 gi|5912520|emb|CAB56146.1| putative large secreted protein [Streptomyces coelicolor A3(2)]
          Length = 809

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 222/592 (37%), Positives = 326/592 (55%), Gaps = 47/592 (7%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           P+++ +  PA+ + +A+P+GNGRLGAMV+GG  +E L+LNED+LW G PGDY  PDA + 
Sbjct: 50  PMRLWYRAPAQEWLEALPVGNGRLGAMVFGGTDTERLQLNEDSLWAGGPGDYARPDAVRH 109

Query: 72  LSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRREL 128
           L+++R LV   ++  A      +  G P++   YQ+LGD+EL       +     Y REL
Sbjct: 110 LAEIRRLVVEEKWNRAQRLIDAEFLGSPSEQAAYQVLGDLELTLAG---EGEAADYEREL 166

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           DL TA AR  Y+ G V   RE F+S PDQV+V ++S    G++ F     S   +     
Sbjct: 167 DLETAVARTTYTRGGVRHVREVFASAPDQVLVVRLSADTPGAVGFTARFTSPQRSGGSAV 226

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI-----KISDDRGTISALEDKKLK 243
             + I ++G           +    P  ++F  +        ++S D GT        L 
Sbjct: 227 DAHTIALDG--------VGGDWYGRPGSVRFRGLARAESEGGRVSTDGGT--------LT 270

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           VEG+D A L++  ++S+     N  D   DP S + + L       Y+ L TRH+ D+++
Sbjct: 271 VEGADAATLVISLATSYR----NYLDVGADPASRARNHLAPAARKPYAHLRTRHVADHRR 326

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           LF RV++ L  S +              +P+ ER+  F   +DP L  L FQ+GRYLL S
Sbjct: 327 LFGRVALDLGPSERA------------ELPTDERIPLFADGKDPQLAALYFQYGRYLLAS 374

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
            SR   Q ANLQG+WN+ L+P W+S   VNIN EMNYW + P NL+EC +P    +  L+
Sbjct: 375 CSRSPGQPANLQGLWNDSLNPAWESKYTVNINFEMNYWPAGPGNLAECWDPAVRMVHELA 434

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
            +G++TA+  Y A GWV+HH TD W + +A      + +WP GGAWLC  LW+HY +T D
Sbjct: 435 ESGTRTAKALYDAPGWVLHHNTDGW-RGTAPVDAAQYGMWPTGGAWLCVMLWDHYRFTGD 493

Query: 484 RDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
              L  R YP+++G   F LD L ++   G+L TNPS SPE      +G+   +    TM
Sbjct: 494 TGAL-SRNYPVMKGAVEFFLDTLQVDAETGWLVTNPSQSPEVTHHQDEGESVSICAGPTM 552

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           DM ++R++F A   AAEVL+++   LV +V +   RL PT++   G I EW+
Sbjct: 553 DMQLLRDLFDAYRQAAEVLDRDSR-LVGRVTEVRDRLAPTRVGHLGQIQEWL 603


>gi|389792551|ref|ZP_10195739.1| alpha/beta hydrolase domain-containing protein [Rhodanobacter
           fulvus Jip2]
 gi|388436250|gb|EIL93122.1| alpha/beta hydrolase domain-containing protein [Rhodanobacter
           fulvus Jip2]
          Length = 791

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 227/590 (38%), Positives = 331/590 (56%), Gaps = 34/590 (5%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           + ++ PA  + +A+P+GNG +GAMV+GGVP E ++LN  TLW G P DY    A   L  
Sbjct: 25  LVYDKPASQWNEALPLGNGLMGAMVFGGVPDERVQLNLGTLWGGAPNDYIAQGAASRLKP 84

Query: 75  VRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
           ++ L+ SG+ A+A A S    G P  +  +Q  GD+ L  ++   K     Y+REL L+ 
Sbjct: 85  IQKLIFSGKVAQAEALSAGFMGDPKLLMPFQPFGDLHLHVEN---KGKVSDYQRELRLDD 141

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY-VNGNN 191
           A + V Y+V  V F RE F S PD+V+V  +S  +  + +F V+L S        + G +
Sbjct: 142 AISTVSYAVDGVHFRRETFMSYPDRVLVMHLSADQPAAQNFTVTLTSPQPGAKVALVGKD 201

Query: 192 QIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
            I + G+   +  P  +      K G+ ++  L IK     G+I    D  L+V G+D  
Sbjct: 202 TIALTGQIEPRTNPASSWTGSWSKPGMTYAGRLVIKTKG--GSIRQAGDH-LEVRGADAV 258

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            L+   ++SF     +  D   +  + + + L      SY  L   HL DY+ LF RV +
Sbjct: 259 TLVFSGATSFK----SYRDISGNAEAAARAPLDKAVQRSYEALKNAHLADYRALFDRVHL 314

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
           +L         D  S EN+ T    +R++ F+T +DPSLV L +Q+GRYLLISSSR G Q
Sbjct: 315 RLG--------DDASRENVAT---DKRIRDFKTHDDPSLVALYYQYGRYLLISSSRAGGQ 363

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWN+DL P W S    NINLEMNYW +    L E Q PL+D +  L + G+KTA
Sbjct: 364 PANLQGIWNQDLLPAWGSKWTTNINLEMNYWPAETGALWETQTPLWDLIDDLQVAGAKTA 423

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
           Q  Y A GWV+HH +D+W  ++   G   W LWPMGG WL   +W+HY ++ D  FL  R
Sbjct: 424 QRYYGAHGWVLHHNSDLWRATTPVDGP--WGLWPMGGVWLSNQMWDHYTFSGDETFLRNR 481

Query: 491 AYPLLEGCASFLLDWLIEGHD-----GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           AYP ++G A F+LD+L+E        G L TNPSTSPE+ ++   GK   ++Y+ TMD+ 
Sbjct: 482 AYPAMKGAAEFVLDFLVEAPKGSPVAGKLVTNPSTSPENRYLL-GGKPVGLTYAPTMDIE 540

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +I ++F+ + +AA  L  +  ALV ++  + PRL P +I   G + EW++
Sbjct: 541 LINDLFNHVRAAARHLGVDA-ALVSRIDAAQPRLPPLQIGHKGQLQEWIE 589


>gi|409198450|ref|ZP_11227113.1| alpha-L-fucosidase [Marinilabilia salmonicolor JCM 21150]
          Length = 767

 Score =  390 bits (1002), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 220/591 (37%), Positives = 330/591 (55%), Gaps = 50/591 (8%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           +PL + ++ PA  + +A+PIGNG +GAM++GG+  E ++LNE+T+WT        PD  K
Sbjct: 25  SPLTLWYDQPASQWEEALPIGNGHMGAMIFGGIDKERIQLNEETIWTKRDEFTDKPDGHK 84

Query: 71  ALSDVRSLVDSGQYAEATAASVK-----LFGHPADVYQLLGDIELEFDDSHLKYAE-ETY 124
            ++ +R+L+   QY EA     +        +  + YQ LGD+ L+F+    K+ +   Y
Sbjct: 85  YINKIRTLLFEEQYEEAEKLVRRHLLEDRMPNNTNTYQTLGDLHLDFE----KFEQISQY 140

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
           RR+L+L  ATA V +    V ++RE FSSNP      K+S  + G +SF  SL+   +  
Sbjct: 141 RRQLNLENATASVSFISDGVHYSRESFSSNPANATFMKLSADKPGRISFTASLNRPGEGE 200

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
           +     + IIM  +             D+  G+ +   ++I+     GT+ A +DK +K+
Sbjct: 201 NISVDGHTIIMNQKV------------DNKDGVTYETRIQIRAKG--GTLEA-KDKSIKI 245

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G+   VL+ VA++ + G         ++PT      L+ I   SY DL   H+ DYQ L
Sbjct: 246 SGAAEVVLIQVAATDYRG---------ENPTQSCKKYLKDIAEKSYDDLRKEHISDYQSL 296

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLIS 363
           F+RVS+ L  S  D +            P  ER+ + +   EDP+L  L +QFGRYLLIS
Sbjct: 297 FNRVSLDLGTS--DAIY----------FPVDERLTALRKGAEDPALFSLYYQFGRYLLIS 344

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSRPG+  ANLQG+W   L+P W++  H+NIN++MNYW ++  NL EC  P  +F+  L 
Sbjct: 345 SSRPGSLPANLQGLWESTLTPPWNADYHININIQMNYWPAVVTNLPECHLPFLNFIGQLR 404

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
            NG KTA   Y A G+  HH TD W  ++A +G+  WA+WPMG AW  TH+WEH+ +T D
Sbjct: 405 ENGRKTANTLYGARGFTAHHTTDAWHFTTA-QGQPQWAMWPMGAAWASTHIWEHFLFTRD 463

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
             FL    + +++  A FL D+L++  + G L + PS SPE+ F  P G  A V    +M
Sbjct: 464 TTFLRNYGFDVMKEAALFLSDFLVKDPETGRLVSGPSMSPENTFFTPRGNRASVVMGPSM 523

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           D  II  +FS++I AA+VL   ED    K+ + L +L P++I EDG I+EW
Sbjct: 524 DHQIIHHLFSSVIEAAKVLNA-EDHFTRKITRQLKQLTPSEIGEDGRILEW 573


>gi|295689298|ref|YP_003592991.1| alpha-L-fucosidase [Caulobacter segnis ATCC 21756]
 gi|295431201|gb|ADG10373.1| Alpha-L-fucosidase [Caulobacter segnis ATCC 21756]
          Length = 781

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 223/595 (37%), Positives = 333/595 (55%), Gaps = 48/595 (8%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
           T   +PL++ +  PAK + +A+P+G GRLGAMV+GGV  E L+LNEDTLW G P +  NP
Sbjct: 27  TPKASPLRLWYRQPAKTWVEALPVGTGRLGAMVFGGVDVERLQLNEDTLWAGGPYEPINP 86

Query: 67  DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE-E 122
           +A  AL ++R L+D+G YA+A   A  K  G P     YQ +GD++L+F       AE  
Sbjct: 87  EAGAALPEIRRLIDTGDYAKAAQLAETKFVGVPKQQMSYQTIGDLKLDFPG----LAEPA 142

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
           +Y REL+L+ A A  ++  G V+  RE  +S PD VI  +++ S  G++S ++   S L 
Sbjct: 143 SYVRELNLDGAIATTRFKAGGVDHVREVIASAPDGVIAVRLTASRRGAISVDLGFASPLK 202

Query: 183 NH--SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALED 239
           +   + V G + ++             A AND  +GI      E ++    +G   + + 
Sbjct: 203 SAPAARVEGRSLVL-------------AGANDSQQGIPAKLRFECRVDVRAKGGRVSGQG 249

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
           + L +  +D  +LL+ A++S+       +D   DPT+ + + L  + N  ++ +   H  
Sbjct: 250 ETLSIRDADEVILLIAAATSYR----RYNDVSGDPTALNKATLARLSNKPWAKILAGHQA 305

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           D+  LF RV +   R+  ++             P+ ER+K+    +DPSL  L +Q+GRY
Sbjct: 306 DHHALFRRVEVDFGRTRAELS------------PTDERIKASPMTDDPSLAALYYQYGRY 353

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LLI+ SRPGTQ ANLQG+WN+  S  W     +NIN EMNYW + P +L E  EPL   +
Sbjct: 354 LLIACSRPGTQPANLQGVWNDKPSAPWGGKYTININTEMNYWPAEPTSLPELVEPLIALV 413

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
             LS  G++TA+  Y A GWV HH TD+W +++A      W +WP GGAWLC HLW+HY+
Sbjct: 414 RDLSETGARTAKAMYGARGWVAHHNTDLW-RATAPVDGAPWGVWPTGGAWLCKHLWDHYD 472

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
           Y  DR +L  R YPL++G A F LD L ++   G L TNPS SPE++     G  A +  
Sbjct: 473 YGRDRAYL-ARVYPLMKGSARFFLDTLVVDPKFGVLVTNPSLSPENDH----GHGASIVA 527

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             TMD AIIR++F   + A  VL  ++   V ++  +  +L P K+ +DG + EW
Sbjct: 528 GPTMDQAIIRDLFDNCLKAEAVLGADQ-TFVAELKTARDKLAPYKVGKDGQLQEW 581


>gi|436835055|ref|YP_007320271.1| Alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
 gi|384066468|emb|CCG99678.1| Alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
          Length = 874

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 230/616 (37%), Positives = 328/616 (53%), Gaps = 54/616 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
           L + ++ PA  +T+A+PIGNG +GAM++GGV  E L+LNE TL++G P G +T  D  K 
Sbjct: 32  LTLWYDKPAAAWTEALPIGNGYMGAMLFGGVEQEHLQLNEGTLYSGDPSGTFTAIDVRKK 91

Query: 72  LSDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
              V SLV  G Y EA    +    G     YQ LGD+ + F  +        YRR LDL
Sbjct: 92  FKAVDSLVKQGNYKEAQNLVAADWLGRNHQDYQPLGDLWMAFTHTG---PVTKYRRSLDL 148

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKI--SGSES--GSLSFNVSLDSLLDNHSY 186
           +T  ++++Y+V N  + RE F+S PD+VIV ++   G E+  G + F+     L     Y
Sbjct: 149 STGISQIQYTVANTTYRREIFASYPDRVIVIRLLAEGKETINGEIRFSTPHKPLA---RY 205

Query: 187 VNGNNQIIMEGRCPG---------------KRIPPKANAND--------------DPKGI 217
               +Q+IM G+ PG               +   P+  A D              D  G 
Sbjct: 206 SASADQLIMAGKAPGFVLRRTVKLVQKLGDQHKYPEVFAKDGSVLPNASDVLYGADATGW 265

Query: 218 QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSE 277
                  ++ +   GT+ A  D+ +K+ G+   +L+L  ++SF+G   +P     +P + 
Sbjct: 266 GMGFEARLRATQQGGTLQA-TDQTIKISGAREVLLVLTCATSFNGFDKSPVTQGLNPAAS 324

Query: 278 SMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER 337
           +   L S+   SY DL   HL DYQ LF R  +Q+          T S+++  T  + +R
Sbjct: 325 TQKYLASVAGRSYDDLAKTHLSDYQHLFSRSQLQIG---------TVSDQSART--TDQR 373

Query: 338 VKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLE 397
           +  F   +D SLV LL+QFGRYL+I+ SRPG Q  NLQGIWN+ + P W+ A  VNIN +
Sbjct: 374 IALFANGKDQSLVGLLYQFGRYLMIAGSRPGGQPLNLQGIWNDKVIPPWNGAYTVNINAQ 433

Query: 398 MNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK 457
           MNYW +   NLSEC EP    +  L+ING+ TA+  Y  +GWV+HH TDIW + +     
Sbjct: 434 MNYWPAELTNLSECHEPFLTAVRELAINGAVTARAMYGNNGWVVHHNTDIW-RHTEPVDY 492

Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
              A WPM G WL +H WE Y +  D  FL    YPLL+G   F  DWLI   DGYL T 
Sbjct: 493 CNCAFWPMAGGWLTSHFWERYLFRGDTTFLRTDVYPLLKGVVLFYKDWLIPNKDGYLVTP 552

Query: 518 PSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP 577
              SPEH F+  +G+ + +S   TMDMAIIRE F+  I A++ L  +E  L +++   L 
Sbjct: 553 IGHSPEHAFVYGNGQTSTLSPGPTMDMAIIRESFTRFIEASDKLGTSEQPLYDEIKAKLA 612

Query: 578 RLRPTKIAEDGSIMEW 593
           +L P +I + G + EW
Sbjct: 613 KLLPYQIGKYGQLQEW 628


>gi|442803588|ref|YP_007371737.1| alpha-1,2-L-fucosidase [Clostridium stercorarium subsp.
           stercorarium DSM 8532]
 gi|442739438|gb|AGC67127.1| alpha-1,2-L-fucosidase [Clostridium stercorarium subsp.
           stercorarium DSM 8532]
          Length = 761

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 220/560 (39%), Positives = 321/560 (57%), Gaps = 45/560 (8%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A+P+GNGR+GAM++GGV +E ++LNED++W G P D  NP+A + L  +R L+  G+  E
Sbjct: 30  ALPLGNGRIGAMIYGGVENELIQLNEDSIWYGGPRDRNNPEAVRYLPTIRKLISEGRIRE 89

Query: 87  A-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
           A   A++ L G P     YQ LG++ L F++         YRRELD++ A ARV+Y + +
Sbjct: 90  AENLAAIALSGIPESQRHYQPLGELYLNFENHK---NPSYYRRELDIDNAVARVEYKIVD 146

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPG 201
             +TRE F S P QV+  KI    S S+SF   L      +    +N +N + M G C G
Sbjct: 147 TLYTREMFVSAPQQVLAIKIKAEGSKSISFRTKLRRSRYFEKVDALN-HNTLKMAGSCGG 205

Query: 202 KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFD 261
           +              I + A+L  +I  + G++ A+  + L V+ S   V+ L  +++F 
Sbjct: 206 E------------GAINYCALL--RIIPENGSVEAI-GEHLVVKNSKSVVIFLSVATTF- 249

Query: 262 GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVT 321
                     ++P  ES+  L+    L Y +L   H++DY+ LF RV +         +T
Sbjct: 250 --------RHEEPEKESLRILEEAEKLRYDELLQNHIEDYRSLFDRVDL--------YIT 293

Query: 322 DTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNE 380
           +  +++N+D++P+ ER++  +  ++DP LV L FQFGRYLLISSSRPGT  ANLQGIWN+
Sbjct: 294 NHSADKNVDSLPTDERLERVKAGNDDPGLVSLYFQFGRYLLISSSRPGTLPANLQGIWNK 353

Query: 381 DLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWV 440
           D  P WDS   +NIN +MNYW +  CNLSEC  PLFD +  +   G KTA+V Y   G+ 
Sbjct: 354 DYLPPWDSKYTININTQMNYWPAEVCNLSECHLPLFDLIERMREPGRKTARVMYGCRGFC 413

Query: 441 IHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCAS 500
            HH TDIWA ++          WPMG AWLC HLWEHY +T D++FL + AY  ++    
Sbjct: 414 AHHNTDIWADTAPQDIYFGATYWPMGAAWLCLHLWEHYEFTRDKEFLAQ-AYLTMKEAVE 472

Query: 501 FLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV 560
           FLLD+L E   G L T+PS SPE+ +I P+G+   +    +MD  II E+F   I A  +
Sbjct: 473 FLLDFLTEDDKGRLVTSPSVSPENTYILPNGESGRLCQGPSMDSQIIHELFGVCIKATSI 532

Query: 561 LEKNEDALVE--KVLKSLPR 578
           L  + +   E  KVL+ +P+
Sbjct: 533 LNIDGEFAAELGKVLERVPK 552


>gi|373958368|ref|ZP_09618328.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
 gi|373894968|gb|EHQ30865.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
          Length = 827

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 228/608 (37%), Positives = 345/608 (56%), Gaps = 37/608 (6%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-T 64
           S+   N  K+ ++ PAK +T+A+P+GNGRLGAM++G V  E ++LNE TLW+G P  +  
Sbjct: 18  SSFAQNSSKLWYSHPAKVWTEALPLGNGRLGAMIFGRVDQELIQLNEGTLWSGGPVKHNV 77

Query: 65  NPDAPKALSDVR-SLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL--EFDDSHLKYAE 121
           NPDA   L   R +L+    Y +A A + K+ G  ++ ++ LGD+ +  +F ++    + 
Sbjct: 78  NPDAYSYLLQTREALLKEENYVKAAALARKMQGVYSESFEPLGDVMISQKFKEA----SP 133

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             Y R+LD++ A +  ++++   +FTR+ F S PDQVIV ++  S+ G L+F VS  S L
Sbjct: 134 SAYYRDLDISDAVSTTRFTIDGTQFTRQMFISAPDQVIVIRLKASKPGQLNFKVSTKSQL 193

Query: 182 D-NHSYVNGNNQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDR 231
              +S +NG+ QI M G  P    P   N N  P         +G++++ +L+   +   
Sbjct: 194 KFGNSVINGS-QIAMLGHAPLHADPSYVNYNKTPVIYQDSTGKQGMRYALLLK---AVGN 249

Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
           GTI+  +   L V+     +L L A++SF+G   +P    +D    +   L +     + 
Sbjct: 250 GTITT-DTSGLSVKNGSDIILFLSAATSFNGFDKSPDKDGQDEVRIATQYLNTALKKDWQ 308

Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLV 350
            L+  HL DY + ++RV+  L+ +PKD             +P+ ER+  + +  +DP+L 
Sbjct: 309 SLFDAHLADYHRYYNRVTFNLA-APKDNTNAL--------LPTDERLIGYTRGTKDPALE 359

Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
            L + +GRYLLIS SRPG   ANLQGIWN  + P W S    NIN +MNYW S   NLSE
Sbjct: 360 TLYYNYGRYLLISCSRPGGAAANLQGIWNNIVRPPWSSNFTTNINTQMNYWPSEMTNLSE 419

Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGG 467
             EPLF+ + +L++ G  TA+  Y A GW +HH +DIWA S+     RG   WA W MG 
Sbjct: 420 LNEPLFEQIKHLAVTGKATAKEFYHAEGWAVHHNSDIWALSNPVGDKRGDPKWANWSMGS 479

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
            WL  HLW HY +T D+ FL+  AYPL++G A F L WL+E  DG L T PS SPE++FI
Sbjct: 480 PWLSQHLWTHYQFTGDKLFLKDTAYPLMKGAAQFCLSWLVENKDGLLVTAPSVSPENDFI 539

Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
              G    VS ++TMDM+II ++F+ +I A  VL  + D   + ++    +L P  I + 
Sbjct: 540 DDRGHEGSVSIATTMDMSIIWDLFTNVIEACNVLNTDRD-FRDLIIAKRAKLFPLHIGKK 598

Query: 588 GSIMEWVQ 595
           G++ EW +
Sbjct: 599 GNLQEWYK 606


>gi|408674119|ref|YP_006873867.1| hypothetical protein Emtol_2704 [Emticicia oligotrophica DSM 17448]
 gi|387855743|gb|AFK03840.1| hypothetical protein Emtol_2704 [Emticicia oligotrophica DSM 17448]
          Length = 785

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 230/603 (38%), Positives = 349/603 (57%), Gaps = 40/603 (6%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           A+ST+T     + +  PA++F + + +GNG+LGA V+GGV S+ + LN+ TLW+G P + 
Sbjct: 8   AQSTNT-----LWYKQPAQYFEETLVLGNGKLGATVFGGVESDKIYLNDATLWSGEPVNA 62

Query: 64  T-NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE 122
             NP+A K L  +R  + +  Y  A   + KL G  ++ Y  LG + L  +D    Y   
Sbjct: 63  NMNPEAYKHLPAIREALRNENYKLADQLNKKLQGKFSESYAPLGTMYLT-NDKATNYT-- 119

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            Y RELD++ A ++V Y V  V++TRE+F S PDQ++V K++ S+ G+LSF+V  +SLL 
Sbjct: 120 NYYRELDISKAISKVTYEVDGVKYTREYFVSYPDQIMVIKLTSSKKGALSFDVKFNSLLK 179

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISA 236
             + VN +  + + G  P     P    +D+P      KGI+F+ + +IK +D  G I +
Sbjct: 180 YKTIVN-DKTLKINGYAP-IHAEPNYRRSDNPVIFDENKGIRFTTLAKIKNTD--GAIVS 235

Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
             D  L ++ +  A++ +  ++SF+G   NP+    +  + + ++L      +Y  +   
Sbjct: 236 -TDTTLGIKNASEAIVYVSIATSFNGFDKNPATQGLNNQAIAATSLAKAYAKTYEQIRQS 294

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQ 355
           HL DYQK F+RVS+ L ++                +P+ +R++ + + +ED +L  L FQ
Sbjct: 295 HLLDYQKFFNRVSLDLGKT------------TAPNLPTDDRLRRYAKGEEDKNLEVLYFQ 342

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           +GRYLLISSSR     ANLQGIWN  + P W S    NIN E NYW +   NLSE   PL
Sbjct: 343 YGRYLLISSSRTMGVPANLQGIWNPYIRPPWSSNYTTNINAEENYWLAENTNLSEMHAPL 402

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLC 471
             F+  ++  G+ TA+  Y A+GWV+ H +DIWA S+       G   WA W MGG WL 
Sbjct: 403 LGFIKNVAKTGAITAKTFYGANGWVVAHNSDIWAMSNPVGAFGEGDPGWANWNMGGTWLS 462

Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
           THLWEHY +T D++FL+  AYPL+ G A F L+W++E  +G L T+PSTSPE+ +IAPDG
Sbjct: 463 THLWEHYIFTKDQNFLKNEAYPLMRGAAQFCLEWMVEDKNGKLITSPSTSPENIYIAPDG 522

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSI 590
                 Y  + D+A+IRE F   I A+++L  N DA    K+  +L +L P +I + G++
Sbjct: 523 YKGATMYGGSADLAMIRECFIQTIKASKIL--NTDANFRTKLETALAKLYPYQIGKKGNL 580

Query: 591 MEW 593
            EW
Sbjct: 581 QEW 583


>gi|423346384|ref|ZP_17324072.1| hypothetical protein HMPREF1060_01744 [Parabacteroides merdae
           CL03T12C32]
 gi|409220202|gb|EKN13158.1| hypothetical protein HMPREF1060_01744 [Parabacteroides merdae
           CL03T12C32]
          Length = 844

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 224/622 (36%), Positives = 330/622 (53%), Gaps = 50/622 (8%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
            T +  PL + ++ PA+++ +A+PIGNGR GAMV+GGV  E L+LNE+TL++G P     
Sbjct: 18  GTPSKAPLTLWYDKPAQNWDEALPIGNGRAGAMVFGGVEKEQLQLNENTLYSGEPSVVFK 77

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEE 122
                P+    V  L+ +G+Y  A+    K   G     YQ  GD+ ++ +         
Sbjct: 78  DVKITPEMFDKVVGLMKAGKYKTASDLVCKNWLGRLHQYYQPFGDLHIQNNKPG---DAA 134

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            Y+R L+++ A A   Y    V++ RE F+S+PD VIV  +       +  ++   S   
Sbjct: 135 GYKRALNISDAVATTVYKQNGVKYEREVFASHPDNVIVMHLKSDTPNGIDISLDFTSPHP 194

Query: 183 NHSYVNGNNQIIMEGRCPGK----------------RIPPKANAND-------------- 212
                  ++++I+ G+ PG                 + P   +AN               
Sbjct: 195 TALQKGTDDRLILHGQAPGYVERRTFEQIEQWGDQYKHPELYDANGKRKFDKRMLYGDEI 254

Query: 213 DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
           D KG+ F A L+     D      + D  + +  +D    +L  ++SF+G   +PS    
Sbjct: 255 DGKGMFFEAQLKPVFPKD--GKCEITDAGIHIYNTDEVYFILSMATSFNGFDKSPSRDGI 312

Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
           DP++++ S L+   +  Y  L  RH +DY  LF RV +QL  S         SE+    +
Sbjct: 313 DPSAKAASILEKALSYDYQTLKQRHTEDYHSLFDRVDLQLVSS---------SEQK--AM 361

Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
           P+ +R++ F    DP+L  LLFQFGRYL+IS SRPG Q  NLQGIWN+D  P W+    +
Sbjct: 362 PTDKRLEQFTQTADPALAALLFQFGRYLMISGSRPGGQPLNLQGIWNKDTIPAWNCGYTI 421

Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
           NIN EMNYW +   NLSECQEPLF  +  LS++G++TA+  Y   GWV HH T IW +S 
Sbjct: 422 NINTEMNYWPAELTNLSECQEPLFRMIRELSVSGAETARNMYNRRGWVAHHNTSIWRESL 481

Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
            +      + WPM   WLC+HLWEHY +T D  FL+  AYPL++G A F  DWLI+  +G
Sbjct: 482 PNDNVPTASFWPMVQGWLCSHLWEHYQFTQDEAFLKNEAYPLMKGAAEFFADWLIDDGNG 541

Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
           +L T    SPE+ FI  DG+ A +S   TMDMAIIRE F+  I+A+E+   +E +   ++
Sbjct: 542 HLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIAASEMFNLDE-SFRNEL 600

Query: 573 LKSLPRLRPTKIAEDGSIMEWV 594
              L RL P +I + G + EW+
Sbjct: 601 KDKLARLLPYQIGKRGQLQEWI 622


>gi|374605049|ref|ZP_09677992.1| alpha-L-fucosidase [Paenibacillus dendritiformis C454]
 gi|374389319|gb|EHQ60698.1| alpha-L-fucosidase [Paenibacillus dendritiformis C454]
          Length = 779

 Score =  387 bits (994), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 216/588 (36%), Positives = 331/588 (56%), Gaps = 44/588 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + +A+PIGNGRLGAM +GGV S+ L+LNED++W G P    NPDA   L 
Sbjct: 12  RLWYRQPAGQWVEALPIGNGRLGAMQFGGVDSDRLQLNEDSVWYGGPAARENPDAAAYLP 71

Query: 74  DVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET-YRRELD 129
            +R  +  G+  EA   AS+ L   P     YQ LG++++ F   H +  E + Y REL 
Sbjct: 72  VIRQYLLEGKPEEAERIASLALASVPKHFGPYQTLGELKMFF---HGEEGEVSGYSRELS 128

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVN 188
           L    ARV+Y+   + ++RE  SS PDQVI  +++ S +  LS ++ L+    ++ + V 
Sbjct: 129 LPDGLARVEYTRNGIAYSRELLSSVPDQVIALRLTASAAKRLSLSLYLNRRSFEDGTTVI 188

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
            ++ I M+G+C                G+++   L  K   D G ++A+ D  L ++ +D
Sbjct: 189 ASDTIAMQGQC-------------GAGGVRYCVAL--KALADNGEVTAIGDC-LSIDAAD 232

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
              L + A+++F          + +P    +  +++     Y  + + H+ D++ L+ RV
Sbjct: 233 AVTLYVAAATTF---------RESNPLQTCLRQVEAAAAKGYQQVRSDHVRDHRALYERV 283

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRP 367
           +++L            SE+++  +P+ ER+K   Q   DP L  L FQ+GRYLL+ SSRP
Sbjct: 284 ALRLG---------ATSEDSLCRLPTDERLKRVRQGQADPGLFALFFQYGRYLLMGSSRP 334

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           GT  ANLQGIWN  ++P W+S  H+NINL+MNYW +   NL+EC EP+FD L  L  NG 
Sbjct: 335 GTLPANLQGIWNPHMTPPWESDFHLNINLQMNYWPAEAANLAECHEPVFDLLDRLRTNGR 394

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
            TA V Y A G+V HH T++WA ++     V    WPMGGAWL  H WEHY Y  D  FL
Sbjct: 395 HTAAVMYGADGFVAHHATNLWADTAPVSDVVSATFWPMGGAWLALHAWEHYQYGGDETFL 454

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
            +RAYP+++  A FLL++L+E   G   T+PS SPE+ +  P+G+   +    +MD  I+
Sbjct: 455 RERAYPVMKDAALFLLNYLVENAQGEWVTSPSISPENRYRLPNGQQGTLCMGPSMDTQIM 514

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           R +F A + A+      EDA  E++  ++ RL P +I  DG ++EW +
Sbjct: 515 RALFQACLDAS-AGRTEEDAFRERLQAAMTRLPPHRIGRDGQLLEWAE 561


>gi|284036792|ref|YP_003386722.1| alpha-L-fucosidase [Spirosoma linguale DSM 74]
 gi|283816085|gb|ADB37923.1| Alpha-L-fucosidase [Spirosoma linguale DSM 74]
          Length = 825

 Score =  387 bits (993), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 222/598 (37%), Positives = 328/598 (54%), Gaps = 32/598 (5%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
           LK+ +  PA  +T+A+P+GNGR+GAM++G V  E ++LNE TLW+G P     NP++P  
Sbjct: 23  LKLWYTKPAAVWTEALPVGNGRIGAMIFGKVEDELIQLNESTLWSGGPVSGNVNPESPSY 82

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
           L  VR  ++   Y +A     K+ G     Y  LGD+ L+    +L  A  T Y R+LD+
Sbjct: 83  LPQVREALNREDYKQAVTLVKKMQGLYTQSYMPLGDLSLK---QNLNGATPTGYYRDLDI 139

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A  +++   V + RE F+S PD V+V +++ S+ G LSF+ S  S L   +    N
Sbjct: 140 QKALATTRFTANGVTYKREMFTSAPDGVMVIRLTASKPGQLSFDASTSSQLRAENMRGSN 199

Query: 191 NQIIMEGRCPGKRIPPKANANDDP----------KGIQFSAILEIKISDDRGTISALEDK 240
             ++M+G+ P +  P   N  D            KG++F   L +K  +  GT+   + +
Sbjct: 200 GDLVMKGKAPTQVDPNYYNPKDREHVIYEDATGCKGMRFQ--LRLKALNKGGTVQT-DKE 256

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + V  +   +L + A++SF+G    P    KD    +   ++     SY  L  RH  D
Sbjct: 257 GIHVRNASEVLLFVAATTSFNGYDKCPDKDGKDENKLAEELIRKATATSYQALLNRHTAD 316

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRY 359
           YQ  F+R S Q        +TDT S      +PS ER++ +     DP +  L  Q+GRY
Sbjct: 317 YQSYFNRFSFQ--------ITDTTSVNKNAALPSDERLEMYSKGVYDPGIETLYCQYGRY 368

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LLISSSR     ANLQGIWN++L   W S   +NIN +MNYW     NLSE   PL  F+
Sbjct: 369 LLISSSRVTNVPANLQGIWNKELRAPWSSNYTININTQMNYWPVEVTNLSELHRPLLSFI 428

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS--ADRGK--VVWALWPMGGAWLCTHLW 475
             L+  G+ TA+  Y  +GWV+HH TDIWA S+   D+G+    WA W  G  WL  HLW
Sbjct: 429 GELAKTGAVTAKEFYNMNGWVVHHNTDIWAISNPVGDKGQGDPKWANWNQGAGWLSQHLW 488

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           EHY +T D+ FL + AYP+++G A F LDWL+   DGYL  +PS SPE++FI   G+ A 
Sbjct: 489 EHYRFTGDKKFLRESAYPIMKGAAEFYLDWLVADKDGYLVVSPSVSPENDFIDAKGQPAS 548

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +S ++TMDM+I+ ++F+ +I A+ VL    D   + +++   +  P  I   G++ EW
Sbjct: 549 ISVATTMDMSIMWDLFTNLIDASTVLNIEPD-FRKMLIEKRSKFYPLHIGHKGNLQEW 605


>gi|399029093|ref|ZP_10730146.1| hypothetical protein PMI10_01974 [Flavobacterium sp. CF136]
 gi|398073115|gb|EJL64299.1| hypothetical protein PMI10_01974 [Flavobacterium sp. CF136]
          Length = 802

 Score =  387 bits (993), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 218/589 (37%), Positives = 348/589 (59%), Gaps = 33/589 (5%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKALSDV 75
           ++ PA+ F +++ +GNG+LGA V+GGV S+ + LN+ TLW+G P +   NP+A K +  V
Sbjct: 32  YDKPAEFFEESLVLGNGKLGATVFGGVNSDKIYLNDATLWSGEPVNANMNPEAYKNIPAV 91

Query: 76  RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
           R  + +  Y  A   + K+ G  ++ +  LG +E+   ++  K     Y RELD++ A +
Sbjct: 92  REALKNENYKLAEELNKKIQGKNSESFAPLGTLEI---NNSEKGKAVNYHRELDISNAVS 148

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
           +V Y +  +++TRE+F S PDQ+++ K++  + G+L+F+++L SLL ++  V  NN ++M
Sbjct: 149 KVSYEMAGIKYTREYFVSAPDQIMIIKLTSDQKGALNFDINLKSLLKSNVEVR-NNILVM 207

Query: 196 EGRCP-----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
            G  P     G  + PK   +   +G +F+ +++IK +D + T S    + L ++ +  A
Sbjct: 208 TGSAPIHENAGYAVLPKY-LDIKERGTRFTTLIQIKKTDGKITNSR---ESLTLKDATEA 263

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           ++ +  ++SF+G   NP+    D  + ++  +      S+  L   H+ DYQK ++RVS+
Sbjct: 264 IIYVSVATSFNGFDKNPATEGLDDVAIALQNMNKAFAKSFDKLKQSHITDYQKFYNRVSL 323

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGT 369
            L ++       T S      +P+ ER+  +   +ED +L  L FQ+GRYLLISSSR   
Sbjct: 324 DLGKT-------TAS-----NLPTDERLLRYADGNEDKNLEILYFQYGRYLLISSSRTLG 371

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
             ANLQGIWN  L+P W S   +NINLE NYW +   NLSE   PL  F+  LSI G  T
Sbjct: 372 VPANLQGIWNPYLNPPWSSNYTMNINLEENYWLAENTNLSEMHLPLLSFIKNLSITGKIT 431

Query: 430 AQVNY-LASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           A+  Y +  GW   H +DIWA ++      + + +WA WPM GAWL TH+WEHY +T D+
Sbjct: 432 AKTFYGVDKGWAAGHNSDIWAMTNPVGQFGKEEPMWACWPMAGAWLSTHIWEHYVFTQDK 491

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           ++L+K  YPL++G A F L W++   +G L T+PSTSPE+++IAPDG +    Y  T D+
Sbjct: 492 EYLKKEGYPLMKGAAEFCLGWMVTDKNGNLITSPSTSPENQYIAPDGFVGATMYGGTADL 551

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A+IRE F   I A++VL  + D    K+  +L +L P +I + G++ EW
Sbjct: 552 AMIRECFDKTIKASKVLNIDAD-FRAKLETALSKLHPYQIGKKGNLQEW 599


>gi|436838082|ref|YP_007323298.1| alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
 gi|384069495|emb|CCH02705.1| alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
          Length = 801

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 223/590 (37%), Positives = 328/590 (55%), Gaps = 32/590 (5%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDV 75
           +  PA +F + + +GNG  GA V+GGV S+ + LN+ TLW+G P D   NP+A K +  +
Sbjct: 29  YKQPAHYFEETLVLGNGTQGASVFGGVRSDKIYLNDATLWSGGPVDPNMNPEAYKNIPAI 88

Query: 76  RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
           R  + +  Y  A     KL G  ++ Y  LG +   F D+      + Y R+L+L  AT+
Sbjct: 89  REALQNENYQLADQFQKKLQGKFSESYAPLGTL---FIDTDAPADPQNYYRQLNLADATS 145

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
           +V+Y+V  V FTR++F S PDQ++V ++  S  G+L F V  +S L N     GN  +  
Sbjct: 146 QVRYTVNGVTFTRDYFISKPDQLMVIRLKSSRKGALGFTVRFNSQLRNQVSATGN-VLKA 204

Query: 196 EGRCPGKRIP------PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
            G  P K  P      P A   D  KG +F+ ++ IK  D  G   A  D  L ++G   
Sbjct: 205 TGYAPQKAEPNYRGNIPNAVVFDPAKGTRFTTLMGIKTQD--GGTVATTDTSLTLKGGTE 262

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A+L +  ++SF+G   +P+ +     + +   L    + SY+ L   H+ DYQ+LF+RVS
Sbjct: 263 ALLFVSIATSFNGFDKDPATNGLPHETIAAERLSRAMSKSYAQLLAAHVSDYQRLFNRVS 322

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPG 368
           ++L+           S E I  +P+ ER++ + +   D  L +L F FGRYLLISSSR  
Sbjct: 323 LRLT-----------SAETIPNLPTDERLQRYAEGKPDTDLEQLYFNFGRYLLISSSRTP 371

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
              ANLQGIWN  + P W S    NINL+ NYW +   NL E  EP+  F+  L+  G+ 
Sbjct: 372 GVPANLQGIWNPYMRPPWSSNYTTNINLQENYWPAETANLPEMHEPMLSFIGNLAKTGTI 431

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           TA+  Y A+GW + H +DIWA ++      +G  VWA W MGGAW+ THLWEH+ +  D+
Sbjct: 432 TARTFYGANGWTVAHNSDIWAMTNPVGDFGQGDPVWANWNMGGAWISTHLWEHFTFGQDK 491

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
            +L + AYPLL+G A F LDWL+    G L T+P TSPE++++ P G      +  T D+
Sbjct: 492 TYLRETAYPLLKGAAQFCLDWLVRDKAGKLVTSPGTSPENQYLTPSGYKGATLFGGTADL 551

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEW 593
           A++RE  S  + AA+VL  N DA  +  LK +L  L P +I + G++ EW
Sbjct: 552 AMVRECLSQTLQAAQVL--NTDADFQATLKQTLADLHPYQIGKAGNLQEW 599


>gi|182416090|ref|YP_001821156.1| alpha/beta hydrolase domain-containing protein [Opitutus terrae
           PB90-1]
 gi|177843304|gb|ACB77556.1| Alpha/beta hydrolase fold-3 domain protein [Opitutus terrae PB90-1]
          Length = 1094

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 234/606 (38%), Positives = 341/606 (56%), Gaps = 53/606 (8%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           +A   + T  LK+ +  PA  + +A+P+GNGRLGAMV+GG+  E L+LNEDTLW G P D
Sbjct: 337 SAPEEAATAALKLWYRQPAAQWVEALPVGNGRLGAMVFGGIQQERLQLNEDTLWAGGPYD 396

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKY 119
             +P+A  AL ++R L+ +G YA A   +  K  G P     YQ +GD+ +    S    
Sbjct: 397 PASPEARAALPEIRRLISAGNYAAAQQLTQGKFMGRPIVQMPYQTVGDLMITQAGSE--- 453

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-------GSLS 172
               YRRELDL+TA AR +Y +G V F RE F+S  DQVIV +++ S +       G LS
Sbjct: 454 QVANYRRELDLDTAIARTEYVLGGVTFVREVFASPVDQVIVIRLTASRNPPRPEWGGPLS 513

Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK--ISDD 230
           F ++  S     +  +G  ++++ G            +N D  GI+     E +  +  +
Sbjct: 514 FTLAFQSPQRATAAADGA-ELVLSG------------SNSDAAGIKGRLKFEARARLIVE 560

Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY 290
            G + A +   L+V+G+  A +LL A++S+        D   DP + + + L ++    Y
Sbjct: 561 GGAVVA-DGTDLQVQGAHAATILLAAATSYR----RYDDVSGDPAALNRATLAAVATKPY 615

Query: 291 SDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV 350
             +   H+ ++Q+LF RVS+       D+ T   ++     +P+ ERV+   T  DP+L 
Sbjct: 616 EAIRAAHVAEHQRLFRRVSL-------DLGTSYAAQ-----LPTDERVRLSTTSVDPALA 663

Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
            L FQ+ RYLLISSSRPG+Q ANLQG+WN+ ++P W S   +NIN EMNYW +   NL+E
Sbjct: 664 ALYFQYARYLLISSSRPGSQPANLQGLWNDHVTPPWGSKYTININTEMNYWPAEVANLAE 723

Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
           C EP+F  +  L+  G+K AQ  Y A GWV+HH TD+W +++A      W +WP GGAWL
Sbjct: 724 CTEPVFSMIRDLTETGTKMAQAQYGARGWVVHHNTDLW-RAAAPIDGAFWGMWPTGGAWL 782

Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAP 529
           C   WEHY Y+ DR+FL  R YP L+G A F LD L+ E    +L T+PS SPE+     
Sbjct: 783 CRTAWEHYLYSGDREFL-ARIYPWLKGAAEFFLDTLVEEPRHRWLVTSPSISPENAH--- 838

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
                 +S   TMD  IIR++FS +I+A+E L  + D   +KV  +  RL P +I   G 
Sbjct: 839 -HPGVTISAGPTMDEQIIRDLFSEVITASEQLGVDAD-FRQKVAAARARLAPNQIGAQGQ 896

Query: 590 IMEWVQ 595
           + EWV+
Sbjct: 897 LQEWVE 902


>gi|399073647|ref|ZP_10750601.1| hypothetical protein PMI01_01669 [Caulobacter sp. AP07]
 gi|398041300|gb|EJL34368.1| hypothetical protein PMI01_01669 [Caulobacter sp. AP07]
          Length = 783

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 220/586 (37%), Positives = 334/586 (56%), Gaps = 42/586 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PAK + +A+P+G GR+GAMV+GGV  E L+LN+DTLW G P D  NP A  AL 
Sbjct: 35  RLWYRQPAKEWVEALPVGTGRIGAMVFGGVAEERLQLNDDTLWAGGPYDPVNPQARAALP 94

Query: 74  DVRSLVDSGQYAEAT-AASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           ++R L+ +G  AEAT  A  +    P     YQ +GD+ L F    L    + Y R+LDL
Sbjct: 95  EIRRLIAAGDIAEATKVADARFLATPRYQMSYQTIGDLRLAF--PGLPETADDYVRDLDL 152

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH--SYVN 188
           + A A  ++S G   FTRE  +S PD+VI  +++  ++ +LS ++S  S L++   +   
Sbjct: 153 DGAIATTRFSAGATRFTREVIASAPDRVIAVRLTADKAKALSLDLSFASPLNSRPTARAE 212

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
           G + +++ G    +        N     ++F     +++ +  GT+ A +   L V G+D
Sbjct: 213 GADTLVLAGTGEAQ--------NGVEAALKFEC--RVRVLNKGGTVVA-DGAGLAVRGAD 261

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
             VLLL+AS++    F    D   DP + + +A+++     + DL  RH  D++KLF RV
Sbjct: 262 -EVLLLIASATSYRRF---DDVGGDPAAINRTAVEAASARPWRDLLARHQADHRKLFRRV 317

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
           ++ L  +   +             P+ ER+K+  T +DP+L  L +Q+GRYLLI+ SRPG
Sbjct: 318 AVDLGTTSAALK------------PTDERIKASPTTDDPALAALYYQYGRYLLIACSRPG 365

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
            Q ANLQG+WN+  +P W S   +NIN EMNYW + P  L+EC  PL + +  LS+ G++
Sbjct: 366 GQPANLQGLWNDQAAPPWGSKYTININTEMNYWPAEPTGLAECVAPLVEMVRDLSVTGAR 425

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           TAQ  Y A GWV HH TD+W +++A      + +WP GGAWLC HLW+HY+Y  D+ +L 
Sbjct: 426 TAQAMYGARGWVAHHNTDLW-RATAPIDGAKYGVWPTGGAWLCKHLWDHYDYGRDQAYLA 484

Query: 489 KRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
              YPL+ G A F +D L+ +   G + T+PS SPE++     G    +    TMD AII
Sbjct: 485 D-VYPLMRGAALFFVDTLVRDPRTGQVVTSPSISPENDH----GHGGSLVAGPTMDQAII 539

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           R++FS+ I+AA +L   +  L   +  +  RL P KI +DG + EW
Sbjct: 540 RDLFSSCIAAAAIL-GTDAPLAAILAAARDRLAPYKIGKDGQLQEW 584


>gi|430749774|ref|YP_007212682.1| hypothetical protein Theco_1545 [Thermobacillus composti KWC4]
 gi|430733739|gb|AGA57684.1| hypothetical protein Theco_1545 [Thermobacillus composti KWC4]
          Length = 845

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 228/639 (35%), Positives = 348/639 (54%), Gaps = 67/639 (10%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + +A+PIGNGRLGAM++GGV  + + LNEDTLW G P +  + +A + L+
Sbjct: 7   RLWYRRPAGVWEEALPIGNGRLGAMLFGGVRLDRILLNEDTLWAGYPRETVDCEARRHLA 66

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
             R L+ +G+  EA      ++ G     Y  LG++ +E+ D      +  Y R L +  
Sbjct: 67  RARELIFAGRLTEAQRLIESRMTGRNVQPYLPLGELAIEWLDGEDDAPD--YVRSLRIFD 124

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY-VNGNN 191
             A V+++ G +   R +++S PDQVIV +   +E G ++   +L S + +    ++   
Sbjct: 125 GVADVRFASGGLRMRRAYWASAPDQVIVVRYE-AEGGMMNLAAALSSPVRSSVSVMDDGR 183

Query: 192 QIIMEGRCPG------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
            +++ GR P       +   P+    ++ +G++F A   +++  D G + A E ++L V 
Sbjct: 184 TLVLAGRAPSHVADNWRGDHPEPVLYEEGRGMRFEA--RVRLETD-GVVEA-EGERLIVR 239

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G+      + A+++F   +  P D     ++   + L+      Y  L  RHL D++   
Sbjct: 240 GASRLTAYIAAATAFVD-WRTPPDESGAHSARCEAWLREAERSGYEALLERHLADHRAFM 298

Query: 306 HRVSIQLSR----------SP------KDIV-TDTCSEENIDT----------------- 331
            RVS++L+           SP      KD   +DT   + + +                 
Sbjct: 299 GRVSLRLAGGEAAGLPDADSPGSHAAGKDATGSDTAGSDAVGSAAATAESGQAGMDRSEA 358

Query: 332 ---------------VPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQ 375
                          +P+ ER+K++Q+ + DP+L  L FQ+GRYLL++SSRPGTQ ANLQ
Sbjct: 359 GWTASFGLNRVSMNDLPTDERLKAYQSGNPDPALEALYFQYGRYLLLASSRPGTQPANLQ 418

Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL 435
           GIWN  + P W S   +NIN EMNYW +  CNLSEC EPLF  L  L+ +G++TA+++Y 
Sbjct: 419 GIWNPHVQPPWFSDYTININTEMNYWPAEVCNLSECHEPLFAMLGELAESGTRTARIHYG 478

Query: 436 ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
             GW  HH  D+W  S+   G   WA WPMGGAWL THLWE Y +  D DFL   AYPL+
Sbjct: 479 CRGWTAHHNVDLWRMSTPSDGSASWAFWPMGGAWLATHLWERYLFEPDLDFLRGTAYPLM 538

Query: 496 EGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAII 555
            G A F LDWL+ G DG L TNPSTSPE+ F+ P+G+   V++ STMDMAIIRE+F+A I
Sbjct: 539 RGAAQFCLDWLVPGPDGTLVTNPSTSPENVFLTPEGEPCSVTWGSTMDMAIIRELFAACI 598

Query: 556 SAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            A+ +L  +E  L  ++  +L +L P +I   G + EW 
Sbjct: 599 EASRLLGTDE-PLRGELEAALAKLPPYRIGRHGQLQEWA 636


>gi|414868293|tpg|DAA46850.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
          Length = 579

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 175/241 (72%), Positives = 205/241 (85%)

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           QFGRYLLIS SRPGTQ++NLQGIW+ D SP WD+APH NINL+MNYW +LPCNLSECQEP
Sbjct: 129 QFGRYLLISCSRPGTQISNLQGIWSNDTSPPWDAAPHPNINLQMNYWPALPCNLSECQEP 188

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
           LFDF+  LSING+KTA+VNY ASGWV H  TD+WAK+S D G  VWALWPMGG WL THL
Sbjct: 189 LFDFIGSLSINGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPVWALWPMGGPWLATHL 248

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
           WEHY +T+D+ FLEK AYPLLEG A FLLDWLIEGH GYLETNPSTSPEH FIAPDGK A
Sbjct: 249 WEHYCFTLDKHFLEKTAYPLLEGSARFLLDWLIEGHRGYLETNPSTSPEHYFIAPDGKEA 308

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           CVSYS+TMD++IIREVFSA+I +A++L K++  +V+++ K+LP L P K+A DG+IMEW 
Sbjct: 309 CVSYSTTMDISIIREVFSALILSADILGKSDTNVVQRIKKALPNLPPMKVARDGTIMEWA 368

Query: 595 Q 595
           Q
Sbjct: 369 Q 369



 Score =  125 bits (315), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 59/85 (69%), Positives = 69/85 (81%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PLK+ F  PAK+FTDA PIGNGRLGAMVWG V SE L+LN DTLWTG PG+YTNP+AP  
Sbjct: 41  PLKVVFGSPAKYFTDAAPIGNGRLGAMVWGCVESERLQLNHDTLWTGGPGNYTNPNAPAV 100

Query: 72  LSDVRSLVDSGQYAEATAASVKLFG 96
           LS VRSLV++G+Y EAT+A+  L G
Sbjct: 101 LSKVRSLVENGKYPEATSAAYDLSG 125


>gi|357008575|ref|ZP_09073574.1| alpha-L-fucosidase [Paenibacillus elgii B69]
          Length = 765

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 215/587 (36%), Positives = 328/587 (55%), Gaps = 40/587 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + ++ PA+ + +A+PIG GRLG MV+G V  + ++LNED++W G P    NPDA   +
Sbjct: 8   LALWYSAPARRWEEALPIGGGRLGGMVFGTVGQDKIQLNEDSVWYGGPKKANNPDARANV 67

Query: 73  SDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
            ++R L+  G+  EA   A + L   P  +  YQ LGD+ L +   H K   + Y RELD
Sbjct: 68  PEIRRLLMEGKQQEAEHLARMALMSAPKYLHPYQPLGDLLL-YMLGHDK-PPQAYERELD 125

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLLDNHSYVN 188
           L  A  RV+Y +  V +TRE+FSS   QV+  +++ +  GSL+F+  +     D  S   
Sbjct: 126 LERALVRVRYDMDGVRYTREYFSSAVHQVLAVRLTAARPGSLTFSTHMMRRPFDMGSQKY 185

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
           G + +IM G C               +G++FS +L+     D  ++  + D  + VEG+D
Sbjct: 186 GEDTMIMYGEC-------------GTEGVRFSVVLKAVAEGD--SVKPIGDF-ISVEGAD 229

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
              LLL A ++F            DP +  +  +    +L Y +L   H +D+ + F RV
Sbjct: 230 AVTLLLAAGTTF---------RHDDPKAVCLEQIARAASLPYEELKRAHTEDHDRYFRRV 280

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
            ++L++   D      ++E +      ERVK  +  +DP LVE  FQFGRYLL+S SRPG
Sbjct: 281 GLELAKPEPDAAASLPTDERL------ERVK--EGHDDPGLVETFFQFGRYLLLSCSRPG 332

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           +  A LQGIWN++ +P W+S   +NIN +MNYW +  C+L EC EPLFD +  +  NG  
Sbjct: 333 SLAATLQGIWNDNYTPPWESKYTININTQMNYWPAEVCHLQECLEPLFDLIERMRENGRV 392

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           TA+  Y   G++ HH T++W  +  +   V  ++WPMG AWL  HLWEHY + +DR FL 
Sbjct: 393 TAREVYGCGGFMAHHNTNLWGDTHVEGIPVSASIWPMGAAWLSLHLWEHYRFGLDRSFLA 452

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
            RAYP+++  A FLLD+L+E   G L T PS SPE++F+  +G    +  + +MD  I  
Sbjct: 453 DRAYPVMKEAAQFLLDYLLEDEQGRLLTGPSISPENKFVLSNGVTGNLCMAPSMDSQIAF 512

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +F A   AA VL  +E A  +++ +++ +L   +I   G IMEW++
Sbjct: 513 TLFDACREAAAVLGLDE-AFRQRLAEAMAKLPQPQIGRHGQIMEWLE 558


>gi|312792729|ref|YP_004025652.1| alpha-L-fucosidase [Caldicellulosiruptor kristjanssonii 177R1B]
 gi|312179869|gb|ADQ40039.1| Alpha-L-fucosidase [Caldicellulosiruptor kristjanssonii 177R1B]
          Length = 752

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 226/590 (38%), Positives = 334/590 (56%), Gaps = 44/590 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ FN PA+ + +A+PIGNG LGAM++GGV  ET++LNE+++W+  P    NPDA K L
Sbjct: 6   LKVIFNKPARCWEEALPIGNGSLGAMIYGGVKYETIQLNEESIWSCGPRRRENPDAFKYL 65

Query: 73  SDVRSLVDSGQYAEATAASV-KLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
            ++R  +  G    A   SV  L G  H    Y+ LG +++ F++      +  Y R LD
Sbjct: 66  PEIRKTILEGNIKRAEELSVFALSGTPHSEGNYEPLGYLDIYFEEVESDKVK-NYTRYLD 124

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS----FNVSLDSLLDNHS 185
           ++ A  +V++ V N+ + + +FSS PD+VIV KI  S++G++S    F       +D   
Sbjct: 125 ISNAICKVEFDVDNIRYKKIYFSSYPDKVIVVKICSSKTGAVSLRAKFRREYQEDIDKCG 184

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
            V+ N++I  E  C             + +G+ FSA+L+  +S D G +  + D  L V+
Sbjct: 185 KVD-NDKIFFE--CLA----------GEGRGVSFSAVLK-AVSKD-GDVYTIGDN-LFVK 228

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            +   +LL+ +++S+          +KD  +  +  ++      + +LY RH +DY+ LF
Sbjct: 229 NATEVMLLITSTTSY---------KEKDYFNWCLKTVEQASKYVFENLYKRHTEDYKSLF 279

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            RV   +        T+  + E I+ +    +        D  L+ LLFQFGRYLLISSS
Sbjct: 280 SRVEFYIDTKDSSKCTELTTPERINLLREGYK--------DEELIVLLFQFGRYLLISSS 331

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPG    NLQGIWN+++ P W S   +NINL+MNYW +  CNLSEC  PLFD L  +  N
Sbjct: 332 RPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEVCNLSECHMPLFDLLEKMYEN 391

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G  TAQ  Y   G+  HH TDIW  ++     +    WPMG AWLC H+WEHY YT D +
Sbjct: 392 GKITAQRMYGCRGFCAHHNTDIWGDTAPQDIYLPATYWPMGAAWLCLHIWEHYEYTGDIN 451

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           FL KR Y L++  A FLLD+LIE  +GYL T PS SPE+ +   +G++  ++Y  TMD+ 
Sbjct: 452 FL-KRYYYLMKEAALFLLDYLIEDKNGYLVTCPSCSPENRY-KLNGEVYSLTYMPTMDIQ 509

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           II  +F  +  A  VL+ N D +VEK+  +L +L P KI + G I EW++
Sbjct: 510 IITALFEKVKKANNVLKLN-DEIVEKIEYALNKLPPIKIGKHGQIQEWIE 558


>gi|375145718|ref|YP_005008159.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
 gi|361059764|gb|AEV98755.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
          Length = 825

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 225/607 (37%), Positives = 343/607 (56%), Gaps = 35/607 (5%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NP 66
           S  + L + +N PA+ + +A+P+GNG +G M++G V  E ++LNE TL++G P   + NP
Sbjct: 23  SAQSGLSLWYNKPAEAWVEALPVGNGHIGGMIFGRVEEELIQLNESTLYSGGPVKQSINP 82

Query: 67  DAPKALSDVR-SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
           DA + L+ +R +L+    Y++A   + K+ G+  + Y  LGD+ L+   S        Y+
Sbjct: 83  DAFQYLAPIREALLKEQDYSKANELAKKMQGYFTESYLPLGDLLLK--QSFNGRTPSAYQ 140

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           R LDL TA A  +++V  VE+TRE F S P  V+V +I     G++  +V+L+S L    
Sbjct: 141 RRLDLQTAIATTRFTVDGVEYTREVFCSAPANVMVIRIRAGVPGAIDLSVALNSPLHYTI 200

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDP----------KGIQFSAILEIKISDDRGTIS 235
               NN++IM G+ P    P   N  D             G++F     +K     GT++
Sbjct: 201 SAKANNEVIMSGKAPAHVDPSYYNPKDRQPVIYEDTAGCNGMRFQC--RVKAITKTGTVT 258

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
           A +   L V+ +   VL++ A++SF+G    P    K+  + +   + +    SY+ L  
Sbjct: 259 A-DTLGLHVQHATELVLIVSAATSFNGFDKCPDKEGKNEQAIAAGLIDAAAKRSYTGLQQ 317

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENID-TVPSAERVKSFQTDE-DPSLVELL 353
            H++D+Q+ F+RVS         I+ DT +  N + T+P  +R++++     DP+L  L 
Sbjct: 318 DHVNDHQRYFNRVSF--------ILKDTGAASNTNSTLPVDKRLQAYSAGAYDPALETLY 369

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           +Q+GRYLLI++SRPG   ANLQGIWN++L   W S   +NIN +MNYW +   NLSE   
Sbjct: 370 YQYGRYLLIAASRPGGPPANLQGIWNKELRAPWSSNYTININTQMNYWPAESTNLSEMHL 429

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW--AKSSADRGK--VVWALWPMGGAW 469
           PL  +L  LS+ G++ A+  Y   GWV HH +DIW  A    DRG    VWA W MGG W
Sbjct: 430 PLLQWLKILSVTGARVAREFYHCDGWVAHHNSDIWGCANPVGDRGAGDPVWANWYMGGNW 489

Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
           LC HLWEHY +T D+ FL   AYP+++  A F L+WL++   GY  T PSTSPE++F   
Sbjct: 490 LCQHLWEHYAFTQDKKFLAT-AYPIMKQAAVFTLNWLVKDSSGYWVTAPSTSPENKFRDE 548

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDG 588
            G+   VS ++TMDM+IIR++F+ +I A+E L  N D L    L  + + L P +    G
Sbjct: 549 KGRAQAVSVATTMDMSIIRDLFTNVIEASEAL--NTDQLFRNRLTEVRKHLYPLRKGSKG 606

Query: 589 SIMEWVQ 595
            ++EW +
Sbjct: 607 ELLEWYK 613


>gi|332662390|ref|YP_004445178.1| alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
 gi|332331204|gb|AEE48305.1| Alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
          Length = 801

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 225/586 (38%), Positives = 333/586 (56%), Gaps = 31/586 (5%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-TNPDAPKALSDVRSL 78
           PAKHF +++ +GNGR+GA+V GGV S+ + LN+ TLW G P D   NP A   L  +R  
Sbjct: 34  PAKHFEESLVLGNGRIGAVVHGGVKSDKIFLNDATLWAGSPVDPDMNPAAHTHLPAIREA 93

Query: 79  VDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
           +    Y +A + + + L G  ++ Y  LG + +  D +H + A   YRR+LDL+TA +  
Sbjct: 94  LRQEDYRKADSLNRRHLQGKFSESYAPLGTMYI--DMAHTETASN-YRRQLDLSTAISTT 150

Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
            Y    V +TRE+F S+P QV++ +++ S+ G LSFN+  +SLL  H      N +   G
Sbjct: 151 SYQQAGVTYTREYFISHPQQVLLIRMTASQLGKLSFNLRFNSLL-RHQVNTSTNVLNASG 209

Query: 198 RCPGKRIP-----PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
           R P    P     P     DD K ++F ++++I  +D +       D  + V+G   A++
Sbjct: 210 RAPAHAEPSYRRVPDPIQYDDQKSMRFLSLVKIIKTDGK---IVRTDSTIGVQGGKEAII 266

Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
           ++  ++SF+G   NP+   KD  + +   L+  + +SY+ +   H+ D+Q+ F+RV  QL
Sbjct: 267 MVSIATSFNGFDQNPALHGKDEVTLANEWLKKAQIISYATIKAAHIKDHQQFFNRVQFQL 326

Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           +    +            ++P+ ER+K F +  +DP L  L F FGRYLLI+SSR     
Sbjct: 327 AGRSSNA-----------SLPTDERLKRFAEGAKDPDLELLYFNFGRYLLIASSRTPQVP 375

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
           ANLQGIWN  L P W S   +NIN EMNYW +   NLSE  +PL  FL  L+  G+ TA+
Sbjct: 376 ANLQGIWNHHLQPPWSSNYTININTEMNYWPAESGNLSELHQPLLGFLGNLAKTGAVTAK 435

Query: 432 VNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
             Y A GW   H TDIWA S+      +G   WA W MGGAWL THLWEH++YT D  +L
Sbjct: 436 TFYNAGGWCAAHNTDIWAMSNPVGHFGQGSPSWANWNMGGAWLATHLWEHFDYTRDTIWL 495

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
           +   Y L++G A F LD L++   G L T+PSTSPE+ FI P G      Y +T D+ +I
Sbjct: 496 KTYGYGLMKGAAQFCLDILVDDGKGNLVTSPSTSPENIFITPSGYKGATLYGATADLGMI 555

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           RE+F   I+AA+ L ++ D   +++  SL +L P +I++ G + EW
Sbjct: 556 RELFLQTIAAAKTLVQDAD-FQQQLEASLSKLYPYQISKKGHLQEW 600


>gi|154489941|ref|ZP_02030202.1| hypothetical protein PARMER_00170 [Parabacteroides merdae ATCC
           43184]
 gi|154089383|gb|EDN88427.1| hypothetical protein PARMER_00170 [Parabacteroides merdae ATCC
           43184]
          Length = 846

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 220/622 (35%), Positives = 327/622 (52%), Gaps = 50/622 (8%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
            T +  PL + ++ PA+++ +A+PIGNGR GAMV+GGV  E L+LNE+TL++G P     
Sbjct: 20  GTPSKAPLTLWYDKPAQNWDEALPIGNGRAGAMVFGGVEKEQLQLNENTLYSGEPSVVFK 79

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEE 122
                P+    V  L+ +G+Y  A+    K   G     YQ  GD+ ++ +         
Sbjct: 80  DVKITPEMFDKVVGLMKAGKYKTASDLVCKNWLGRLHQYYQPFGDLHIQNNKPG---DAA 136

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            Y+R L+++ A A   Y    V++ RE F+S+PD VIV  +       +  ++   S   
Sbjct: 137 GYKRALNISDAVATTVYKQNGVKYEREVFASHPDNVIVMHLKSDTPNGIDISLDFTSPHP 196

Query: 183 NHSYVNGNNQIIMEGRCPGK----------------RIPPKANANDD------------- 213
                  ++++I+ G+ PG                 + P   +AN               
Sbjct: 197 TALQKGTDDRLILHGQAPGYVERRTFEQIEQWGDQYKHPELYDANGKRKFDKRMLYGDEI 256

Query: 214 -PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
             KG+ F A L+     D      + D  + +  +D    +L  ++SF+G   +PS    
Sbjct: 257 GGKGMFFEAQLKPVFPKD--GKCEITDAGIHIYNTDEVYFILSMATSFNGFDKSPSRDGI 314

Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
           DP++++ S L+   +  Y  L  RH +DY+ LF RV  +L  SP+              +
Sbjct: 315 DPSAKAASILEKALSYDYQTLKQRHTEDYRSLFDRVDFELFSSPEQ-----------KAM 363

Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
           P+ +R++ F  + DP L  LLFQFGRYL+IS SRP  Q  NLQGIWN+D  P W+    +
Sbjct: 364 PTDKRLEQFAGNADPDLAALLFQFGRYLMISGSRPDGQPLNLQGIWNKDTIPAWNCGYTI 423

Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
           NIN EMNYW +   NLSECQEPLF  +  LS++G++TA+  Y   GWV HH T IW +S 
Sbjct: 424 NINTEMNYWPAELTNLSECQEPLFRMIRELSVSGAETARNMYNRRGWVAHHNTSIWRESL 483

Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
            +      + WPM   WLC+HLWEHY +T D  FL+  AYPL++G A F  DWLI+  +G
Sbjct: 484 PNDNVPTASFWPMVQGWLCSHLWEHYQFTQDETFLKNEAYPLMKGAAEFFADWLIDDGNG 543

Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
           +L T    SPE+ FI  DG+ A +S   TMDMAIIRE F+  I+A+E+   +E +   ++
Sbjct: 544 HLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIAASEMFNLDE-SFRNEL 602

Query: 573 LKSLPRLRPTKIAEDGSIMEWV 594
              L RL P +I + G + EW+
Sbjct: 603 KDKLARLLPYQIGKRGQLQEWI 624


>gi|315500396|ref|YP_004089199.1| alpha-l-fucosidase [Asticcacaulis excentricus CB 48]
 gi|315418408|gb|ADU15048.1| Alpha-L-fucosidase [Asticcacaulis excentricus CB 48]
          Length = 783

 Score =  384 bits (985), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 222/592 (37%), Positives = 327/592 (55%), Gaps = 45/592 (7%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
           + +N L + +  PA  +T+A+P+GNGRLGAMV+GG+  E L+LNEDTL+ G P    NPD
Sbjct: 32  TASNDLTLWYREPANEWTEALPLGNGRLGAMVFGGIARERLQLNEDTLYAGAPYQPANPD 91

Query: 68  APKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
            P AL ++R L+  G+Y EA A    K  G+P     YQ +G++ L F  S    A   Y
Sbjct: 92  GPAALPEIRKLIFEGKYLEAQALIQAKFMGNPMRQVSYQTIGEMTLTFGPSSNASA---Y 148

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
           RRELDL  A + V Y    V +TRE F S  DQV+V ++S  + G +SF +  ++     
Sbjct: 149 RRELDLTKALSTVTYRQDGVTYTRETFISPVDQVLVMRLSADKPGKVSFQLGFETPQLGA 208

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
             +    +I++ GR  G         N     ++F +   +++    G  S   D+ L V
Sbjct: 209 VTIESPQEIVLSGRNGGH--------NGKDGALRFES--RVRVVASGGQQSTGTDE-LVV 257

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G+D A++ + A++++     +  D   D T+ +   +    + S+  LY+ HLD ++ +
Sbjct: 258 SGADSALVFMAAATNYK----SFRDVSGDATAITKDQITRAASRSFGALYSAHLDAHKAV 313

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F RVS+   R+             +  +P+ ER+    T  DP+L  L FQ+GRYLLI+ 
Sbjct: 314 FDRVSVDFGRT------------EVADLPTNERIAKSLTLNDPALAALYFQYGRYLLIAC 361

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           SRPGTQ ANLQG+WNE L+  W     +NIN EMNYW + P  L E  EPL   +  +SI
Sbjct: 362 SRPGTQPANLQGLWNEKLNAPWGGKYTININTEMNYWPAEPTALPELTEPLIRMVREISI 421

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G++TA++ Y A GWV HH TD+W +++A      +  WP GGAWLC HLW+ Y+Y  D 
Sbjct: 422 TGAETAKIMYGARGWVAHHNTDLW-RATAPIDAAFYGTWPTGGAWLCLHLWDRYDYGRDP 480

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPE--HEFIAPDGKLACVSYSST 541
            +L +  YP+L+G + F LD L++    GY+ T PS SPE  H+F    G   C     T
Sbjct: 481 AYL-REIYPILKGASQFFLDTLVKDPASGYMVTAPSISPENQHKF----GTSICA--GPT 533

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           MDM IIR++F+    AAE+L K + +   +VL    +L P +I + G + EW
Sbjct: 534 MDMQIIRDLFANTARAAEIL-KTDKSFRAEVLAMRNKLVPNQIGKAGQLQEW 584


>gi|220928453|ref|YP_002505362.1| hypothetical protein Ccel_1020 [Clostridium cellulolyticum H10]
 gi|219998781|gb|ACL75382.1| conserved hypothetical protein [Clostridium cellulolyticum H10]
          Length = 759

 Score =  384 bits (985), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 220/585 (37%), Positives = 336/585 (57%), Gaps = 47/585 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +  PAK + +A+PIGNGRLGAMV+G V +E ++LNED++W G P D  NPDA   L+
Sbjct: 4   KLWYKSPAKEWNEALPIGNGRLGAMVYGCVKNENIQLNEDSIWYGDPIDRNNPDALANLA 63

Query: 74  DVRSLVDSGQYAEATAASV-KLFGHPADV--YQLLGDIELEF--DDSHLKYAEETYRREL 128
           ++R+ +  G+  EA   +V  L G P     YQ LG+++L F  D+S ++     Y REL
Sbjct: 64  EIRNFLSDGRIKEAEKLAVLSLSGVPESQRPYQTLGNLKLNFEIDESDIR----DYSREL 119

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF--NVSLDSLLDNHSY 186
           D+  A A VK+    V +TRE+F+S  DQVIV ++     G +SF  N+     LDN   
Sbjct: 120 DIENACASVKFVSKGVMYTREYFASAVDQVIVVRLFADAPGKISFTANMRRGRFLDNSGA 179

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           ++G            K I   A+   D KG++F ++  ++   + G ++ +  + L VE 
Sbjct: 180 IDG------------KTIGMFASCGSD-KGVRFCSM--VRAVSEGGKVNTI-GENLIVEE 223

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D   LL+  ++SF           K+  ++ +  L  +   +Y++L + H++DY +L+ 
Sbjct: 224 ADAVTLLISTATSF---------YHKEYETQCLKYLDGVEEKTYTELMSNHIEDYSQLYG 274

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
           RV +++  + +         + I ++ +AER++  ++ + D  L  L F FGRYLLIS S
Sbjct: 275 RVELEIGNAEE--------HDKIQSLDTAERLERLESGKPDHQLECLYFSFGRYLLISCS 326

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPG+  ANLQGIWN+D+ P WDS   +NIN EMNYW +  CNLSEC  PLFD +  +   
Sbjct: 327 RPGSLPANLQGIWNQDILPAWDSKYTININTEMNYWPAETCNLSECHFPLFDHIERMRAP 386

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G +TA+V Y  SG+V HH TDIW  ++     +    WPMG AWL  HLWEHY + +D++
Sbjct: 387 GRRTARVMYGCSGFVAHHNTDIWGDTAPQDIYIPATYWPMGAAWLSLHLWEHYEFGLDKE 446

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           FL K AYP+++  A F LD+LIE   G L T+PS SPE+ +I  +G+  C+    +MD  
Sbjct: 447 FL-KDAYPVMKEAAQFFLDFLIEDSKGRLVTSPSVSPENTYILENGEKGCLCIGPSMDSQ 505

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
           I+  +FS  I A+ +L+  + +  EK++K    L   +I   G I
Sbjct: 506 ILYALFSGCIEASNILD-TDISFAEKLIKVRDSLPKPQIGRYGQI 549


>gi|423722949|ref|ZP_17697102.1| hypothetical protein HMPREF1078_01162 [Parabacteroides merdae
           CL09T00C40]
 gi|409241779|gb|EKN34546.1| hypothetical protein HMPREF1078_01162 [Parabacteroides merdae
           CL09T00C40]
          Length = 864

 Score =  384 bits (985), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 220/622 (35%), Positives = 327/622 (52%), Gaps = 50/622 (8%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
            T +  PL + ++ PA+++ +A+PIGNGR GAMV+GGV  E L+LNE+TL++G P     
Sbjct: 38  GTPSKAPLTLWYDKPAQNWDEALPIGNGRAGAMVFGGVEKEQLQLNENTLYSGEPSVVFK 97

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEE 122
                P+    V  L+ +G+Y  A+    K   G     YQ  GD+ ++ +         
Sbjct: 98  DVKITPEMFDKVVGLMKAGKYKTASDLVCKNWLGRLHQYYQPFGDLHIQNNKPG---DAA 154

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            Y+R L+++ A A   Y    V++ RE F+S+PD VIV  +       +  ++   S   
Sbjct: 155 GYKRALNISDAVATTVYKQNGVKYEREVFASHPDNVIVMHLKSDTPNGIDISLDFTSPHP 214

Query: 183 NHSYVNGNNQIIMEGRCPGK----------------RIPPKANANDD------------- 213
                  ++++I+ G+ PG                 + P   +AN               
Sbjct: 215 TALQKGTDDRLILHGQAPGYVERRTFEQIEQWGDQYKHPELYDANGKRKFDKRMLYGDEI 274

Query: 214 -PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
             KG+ F A L+     D      + D  + +  +D    +L  ++SF+G   +PS    
Sbjct: 275 GGKGMFFEAQLKPVFPKD--GKCEITDAGIHIYNTDEVYFILSMATSFNGFDKSPSRDGI 332

Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
           DP++++ S L+   +  Y  L  RH +DY+ LF RV  +L  SP+              +
Sbjct: 333 DPSAKAASILEKALSYDYQTLKQRHTEDYRSLFDRVDFELFSSPEQ-----------KAM 381

Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
           P+ +R++ F  + DP L  LLFQFGRYL+IS SRP  Q  NLQGIWN+D  P W+    +
Sbjct: 382 PTDKRLEQFAGNADPDLAALLFQFGRYLMISGSRPDGQPLNLQGIWNKDTIPAWNCGYTI 441

Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
           NIN EMNYW +   NLSECQEPLF  +  LS++G++TA+  Y   GWV HH T IW +S 
Sbjct: 442 NINTEMNYWPAELTNLSECQEPLFRMIRELSVSGAETARNMYNRRGWVAHHNTSIWRESL 501

Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
            +      + WPM   WLC+HLWEHY +T D  FL+  AYPL++G A F  DWLI+  +G
Sbjct: 502 PNDNVPTASFWPMVQGWLCSHLWEHYQFTQDETFLKNEAYPLMKGAAEFFADWLIDDGNG 561

Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
           +L T    SPE+ FI  DG+ A +S   TMDMAIIRE F+  I+A+E+   +E +   ++
Sbjct: 562 HLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIAASEMFNLDE-SFRNEL 620

Query: 573 LKSLPRLRPTKIAEDGSIMEWV 594
              L RL P +I + G + EW+
Sbjct: 621 KDKLARLLPYQIGKRGQLQEWI 642


>gi|430751376|ref|YP_007214284.1| hypothetical protein Theco_3231 [Thermobacillus composti KWC4]
 gi|430735341|gb|AGA59286.1| hypothetical protein Theco_3231 [Thermobacillus composti KWC4]
          Length = 765

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 224/595 (37%), Positives = 331/595 (55%), Gaps = 39/595 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +  PA  + +A+PIGNGRLGAMV GG+  E L++NE+T W+G P DY  P A + L
Sbjct: 1   MKLWYAKPASDWLEALPIGNGRLGAMVHGGMERERLQINEETFWSGGPHDYRRPGASRYL 60

Query: 73  SDVRSLVDSGQYAEATAA-SVKLFGHPADVYQLL--GDIELEFDDSHLKYAEETYRRELD 129
             VR L+   +  EA      ++ G P  ++  L   D+ L F   H       Y RELD
Sbjct: 61  RQVRELIFQDKVEEAQQLFDERMKGDPELLHAFLPCCDMMLHFP-GHAD--GRDYYRELD 117

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVN 188
           L+ A A  +Y V  V +TRE F S PDQ I+ +IS    G +     L +   +      
Sbjct: 118 LDRAVATTRYRVNGVTYTREVFCSYPDQAIIMRISSDCPGKIDMAGELAAANGEQRVRFA 177

Query: 189 GNNQIIMEGRCPGKR--IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           G++ +++ G+  GKR   P + NA  D  G++F A   ++   + G +   E + L+V G
Sbjct: 178 GDDTLVLTGQA-GKREARPRRLNAGWDGPGVRFEA--RLRAFSEGGRVLRGE-QALEVRG 233

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D   L+  A++SF    +N      DP +++   ++ ++  +Y +L  RHL+DY  L+ 
Sbjct: 234 ADAVTLIFSAATSF----VNYRSIDGDPGAKAAGVIERLQGKTYGELLGRHLEDYTALYR 289

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV ++L     D              P+ ERV+ +   EDP L  L +Q+GRYLLI+SSR
Sbjct: 290 RVELELGDGAGD------------GTPTDERVRMYAETEDPGLAALFYQYGRYLLIASSR 337

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQGIWN+D  P W S    NIN++MNYW +   NL EC  PLFD +  L I G
Sbjct: 338 PGGQPANLQGIWNDDPWPLWGSKWTTNINVQMNYWPAESGNLRECHLPLFDLIDDLRITG 397

Query: 427 SKTAQVNYLASGWVIHHKTDIW-AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           ++TA+ +Y   G+V+HH TD+W A +  D      A+WPMGG WL  HLW+HY Y  D+ 
Sbjct: 398 AETAETHYGCRGFVVHHNTDLWRAATPVDYDA---AVWPMGGVWLVQHLWDHYEYCPDQA 454

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHD-----GYLETNPSTSPEHEFIAPDGKLACVSYSS 540
           FL  R YP L   A F+LD+L E  +     G L TNPS SPE+ +I   G+   ++ ++
Sbjct: 455 FLRNRVYPALREAALFVLDYLTEAPEGTRLAGKLVTNPSYSPENHYIDDKGRRRYLTCAA 514

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           TMD+ +IR++F   + AAE+L  +ED   E + +++ RL   +I + G + EW +
Sbjct: 515 TMDIQLIRDLFQRCMKAAEMLGVDEDFRGE-LEEAMARLPGMQIGKYGQLQEWAE 568


>gi|188991901|ref|YP_001903911.1| hypothetical protein xccb100_2506 [Xanthomonas campestris pv.
           campestris str. B100]
 gi|167733661|emb|CAP51866.1| conserved exported protein [Xanthomonas campestris pv. campestris]
          Length = 790

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 230/596 (38%), Positives = 327/596 (54%), Gaps = 46/596 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           + +  + L + +  PA  +  A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D TN
Sbjct: 38  AVAPDDALHLWYREPANEWVQALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDATN 97

Query: 66  PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
           P A  AL  VR+L+ +G+YAEA   A  KL   P     YQ LGD+ L+FD +       
Sbjct: 98  PQALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GIS 154

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            YRR+LDL+TA A   +  G     RE F S   Q IV ++S    G +S  V +DS   
Sbjct: 155 EYRRQLDLDTAVATTTFRSGGAVHRREVFVSAQSQCIVVRLSCDRPGGISLRVGIDSPQS 214

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDK 240
               V     ++  GR            N    GI  +    L +      G+++A+ D+
Sbjct: 215 GEVTVE-QGSLLFSGR------------NGSFAGIDGKLRFALRVLPQVKGGSVTAVRDR 261

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            L+++G+D  VLLL A++S+          + DP + + ++LQ    LSY+ L   HL D
Sbjct: 262 -LRIQGADEVVLLLTAATSYQ----RFDAVEGDPLALTAASLQKAGKLSYAALLRAHLAD 316

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           +Q+LF RV+I L  S               T+P+ ERV+ F    DP+L  L  Q+GRYL
Sbjct: 317 HQRLFRRVAIDLGSS------------EAATLPTDERVQRFAEGNDPALAALYHQYGRYL 364

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LI SSRPGTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L 
Sbjct: 365 LICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLF 424

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            L+  G+ TA+  Y A GWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y
Sbjct: 425 DLARTGAHTARALYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDY 483

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
             DR +L K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C    
Sbjct: 484 GRDRAYLAK-IYPLFKGAAEFFVATLVRDPGTGAMVTNPSMSPENQH--PFGAAVCA--G 538

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            TMD  ++R++F+  I+ +++L+ +  AL +++     +L P +I + G + EW Q
Sbjct: 539 PTMDAQLLRDLFAQCIAMSKLLDVDA-ALAQQLATLREQLPPNRIGKAGQLQEWQQ 593


>gi|374324082|ref|YP_005077211.1| alpha-L-fucosidase [Paenibacillus terrae HPL-003]
 gi|357203091|gb|AET60988.1| alpha-L-fucosidase [Paenibacillus terrae HPL-003]
          Length = 772

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 232/603 (38%), Positives = 338/603 (56%), Gaps = 49/603 (8%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           N   I FN PA+ + +AIPIGNG LG M++G    E ++LNED+LW G P D  NP + +
Sbjct: 2   NEKMIWFNQPAEKWEEAIPIGNGTLGGMIFGKTSIERIQLNEDSLWYGGPMDRNNPHSFE 61

Query: 71  ALSDVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            L ++RSL+ SGQ  +A   ASV L G P     Y+ LGD+ L   D   +  +  YRR+
Sbjct: 62  YLDEIRSLLFSGQIKQAEELASVALVGVPDGQRHYESLGDLYLNIGDGEEEIKD--YRRQ 119

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV------------ 175
           LDL+     V Y V  V + RE+FSS PDQV+V +++ SE G+LSF+             
Sbjct: 120 LDLDHGIVSVNYRVNQVNYCREYFSSFPDQVLVVRLNSSEYGALSFSALFGRGIVLEPTP 179

Query: 176 ---SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
               L   +  H+Y++      +E R P   I    +  ++  GI+F  +  I+I  + G
Sbjct: 180 WSDVLKHPVGLHAYLDR-----IETRSPADLIIRGRSGGEE--GIRFCCV--IRIVTEEG 230

Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
            IS   + +L ++  + A +L+ A + F  P       K+   +E +  L      SY  
Sbjct: 231 QIS-YSNGQLSLKDVNAATILVSACTDFRIP-------KEQMEAECICRLDRAAGKSYDQ 282

Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
           L T H++DYQ LF RV + L  +    V  T +   + T    ER+K+    ED  L+ L
Sbjct: 283 LRTGHIEDYQALFGRVELSLQGN----VDSTSTSSFLTTDQRLERIKN--GAEDNELISL 336

Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
            FQFGRYLLISSSRPG+  ANLQGIWN+D+ P WDS   +NIN +MNYW +  CNL+EC 
Sbjct: 337 YFQFGRYLLISSSRPGSLPANLQGIWNKDMLPIWDSKYTININTQMNYWPAEICNLAECH 396

Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
            PL DF+  +   G +TA++ Y   G+V HH +DIWA ++     +    W MG AWL  
Sbjct: 397 IPLIDFIDRMQERGKETARIMYRCRGFVAHHNSDIWADTAPQDVCITSTFWTMGAAWLSL 456

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK 532
           HLW+HY +  D  FL K AY  ++  A FLLD+LIE   G L  +PS+SPE+ ++ P+G+
Sbjct: 457 HLWDHYEFGQDASFL-KEAYDTMKEAAFFLLDYLIEDPYGNLVISPSSSPENRYVLPNGE 515

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSI 590
              + Y ++MD  IIRE+F   I +  +L+++++  A++ K LK +P+L    + + G I
Sbjct: 516 SGALCYGASMDSQIIRELFERCIKSTIILQEDQEFGAMLRKALKRIPKL---AVGKHGQI 572

Query: 591 MEW 593
            EW
Sbjct: 573 QEW 575


>gi|294146663|ref|YP_003559329.1| hypothetical protein SJA_C2-02340 [Sphingobium japonicum UT26S]
 gi|292677080|dbj|BAI98597.1| hypothetical protein SJA_C2-02340 [Sphingobium japonicum UT26S]
          Length = 777

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 226/590 (38%), Positives = 327/590 (55%), Gaps = 43/590 (7%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           +PL + +  PA  +T+A+PIGNGRLGAM++GGV  E L+LNE TLW G P D  NP+A  
Sbjct: 33  HPLTLWYRQPAAAWTEALPIGNGRLGAMLFGGVARERLQLNEGTLWAGQPYDPVNPEAKA 92

Query: 71  ALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
            L  VR L+ +G+ AEA A + K L   P     YQ LGD+ L+F       A   Y RE
Sbjct: 93  NLPQVRELIFAGRIAEAEALADKTLMAKPLAQMPYQTLGDLILDFPGVGQATA---YHRE 149

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLLDNHSY 186
           LDL++ATA  +++ G V   R+  +S  D VI   +S   +G L  ++SL  S +     
Sbjct: 150 LDLDSATATTRFTAGGVAHVRQAIASPADNVIAVHLS--STGRLDVDISLRSSQIGVQVA 207

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
            +G N +++ GR    R     + N     ++F+A L  ++     T SA  D  L + G
Sbjct: 208 ADGPNGLLLTGRNGASR---GIDGN-----LRFAARLAARVEGGHATHSA--DGSLSIRG 257

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    LLL  ++ F        D   DP + + + L   R+ S++ + T   D +++LF 
Sbjct: 258 AKSVTLLLAMATGFR----RFDDVGGDPVAGTAATLARARDRSFATIATDAADAHRRLFR 313

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV++ L  +P               +P+  R+   QT +DP+L  L F + RYLLI SSR
Sbjct: 314 RVTLDLGSTPAA------------QLPTDRRIADSQTSDDPALAALYFHYARYLLICSSR 361

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQG+WN+ L P W S   +NIN +MNYW + P  L EC  PL + +  L++ G
Sbjct: 362 PGGQPANLQGLWNDSLDPPWGSKYTININTQMNYWPAEPAALGECVAPLVEMVRDLAVTG 421

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
           ++TA+  Y A GWV HH TD+W +++A      + LWP GGAWLC HLW+HY+Y  DR +
Sbjct: 422 ARTARSMYGARGWVAHHNTDLW-RATAPIDGAQFGLWPTGGAWLCMHLWDHYDYHRDRAY 480

Query: 487 LEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L    YPL+ G A F LD L  +   G+L TNPS SPE+    P G    +    TMDMA
Sbjct: 481 LAS-VYPLMAGAARFFLDTLQRDPASGFLVTNPSMSPEN----PHGHGGTICAGPTMDMA 535

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           I+R++F+  + AA +L+++  +LV ++  +  RL P +I   G + EW Q
Sbjct: 536 ILRDLFTRTMEAAAILDRDA-SLVAEMRAARDRLAPYRIGRQGQLQEWQQ 584


>gi|384427644|ref|YP_005637003.1| hypothetical protein XCR_1996 [Xanthomonas campestris pv. raphani
           756C]
 gi|341936746|gb|AEL06885.1| expressed protein [Xanthomonas campestris pv. raphani 756C]
          Length = 764

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 231/594 (38%), Positives = 331/594 (55%), Gaps = 42/594 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           + +  + L + +  PA  +  A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D TN
Sbjct: 12  AVAPDDALHLWYREPANEWVQALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDATN 71

Query: 66  PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
           P A  AL  VR+L+ +G+YAEA   A  KL   P     YQ LGD+ L+FD +       
Sbjct: 72  PQALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GIS 128

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            YRR+LDL+TA A   +  G     RE F S   Q IV ++S    G +S  V +DS   
Sbjct: 129 EYRRQLDLDTAVATTTFRSGGAVQRREVFVSAQSQCIVVRLSCDRPGGISLRVGIDSPQS 188

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
               V     ++  GR         + A  D K ++F+  L +      G+++A+ D+ L
Sbjct: 189 GEVTVE-QGSLLFSGRN-------GSFAGIDGK-LRFA--LRVLPQVKGGSVTAVRDR-L 236

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
           +++G+D  VLLL A++S+          + DP + + ++LQ    LSY+ L   HL D+Q
Sbjct: 237 RIQGADEVVLLLTAATSYQ----RFDAVEGDPLALTAASLQKAGKLSYAALLRAHLADHQ 292

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           +LF RV+I L  S               T+P+ ERV+ F    DP+L  L  Q+GRYLLI
Sbjct: 293 RLFRRVAIDLGSS------------EAATLPTDERVQRFAEGNDPALAALYHQYGRYLLI 340

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
            SSRPGTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L  L
Sbjct: 341 CSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFDL 400

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           +  G+ TA+  Y A GWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y  
Sbjct: 401 ARTGAHTARAMYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYGR 459

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           DR +L K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C     T
Sbjct: 460 DRAYLAK-IYPLFKGAAEFFVATLVRDPGTGAMVTNPSMSPENQH--PFGAAVCA--GPT 514

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           MD  ++R++F+  I+ +++L+ +  AL +++     +L P +I + G + EW Q
Sbjct: 515 MDAQLLRDLFAQCIAMSKLLDVDA-ALAQQLATLREQLPPNRIGKAGQLQEWQQ 567


>gi|374991896|ref|YP_004967391.1| hypothetical protein SBI_09142 [Streptomyces bingchenggensis BCW-1]
 gi|297162548|gb|ADI12260.1| hypothetical protein SBI_09142 [Streptomyces bingchenggensis BCW-1]
          Length = 822

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 233/596 (39%), Positives = 333/596 (55%), Gaps = 45/596 (7%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           A  T+    L + +  PA  + +A+PIGNGRLGAMV+GG  +E L+LNEDT+W G P D 
Sbjct: 49  AGGTTLPGELTLWYPRPASEWLEALPIGNGRLGAMVFGGTDTERLQLNEDTVWAGGPYDP 108

Query: 64  TNPDAPKALSDVRSLVDSGQYAEATAASVKLF-GHPAD--VYQLLGDIELEFDDSHLKYA 120
            NP     L ++R  V +G++ +A A     F G+P     YQ +GD+ L F     +  
Sbjct: 109 ANPQGLSNLPEIRRRVFAGEWGDAQALIDSTFMGNPLSELPYQTVGDLRLTFSS---QGE 165

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              YRRELD+++AT  V+Y+   V + RE  +S+PDQVI  +++    GS+SF  + DS 
Sbjct: 166 VSDYRRELDIDSATTSVRYTQSGVTYRREIIASHPDQVIALRLTADTPGSISFTAAFDSP 225

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALED 239
                       I ++G                  G ++F A+   +   + GT+ + ED
Sbjct: 226 QSVTGSSPDRITIAIDG---------TGQTRSGITGQVRFRAL--ARACAEGGTVGS-ED 273

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
            KL V G+D A LL+   +S+   F NP+    D T+ + + L +  ++ ++ L  RH D
Sbjct: 274 GKLTVAGADSATLLVSIGTSYTD-FGNPT---GDHTARAAAPLNAASDVPFTTLRKRHTD 329

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DY++LF RV++ L  +            +   +P+ ERVK+F +  DP LV L +QFGRY
Sbjct: 330 DYRRLFRRVTLDLGST------------DAAKLPTDERVKNFASASDPQLVSLHYQFGRY 377

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LLIS SRPGTQ ANLQGIWN+ LSP W     +NIN EMNYW +   NL EC EP+FD L
Sbjct: 378 LLISCSRPGTQPANLQGIWNDLLSPPWSCRYTININTEMNYWPAPVTNLLECWEPVFDML 437

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
             LS++G++TA+  Y A GWV HH  D W + +A   +  +  WP GGAWL T +W+HY 
Sbjct: 438 ADLSVSGARTARTQYGARGWVAHHNVDGW-RGTAPCDQAFYGTWPTGGAWLATSIWDHYL 496

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
           +T D++ L KR YP+L G   F LD L+ +   G+L T PS SPEH    PD   A V  
Sbjct: 497 FTGDKEALRKR-YPVLRGAVLFFLDTLVTDPSSGHLVTCPSMSPEHAH-HPD---ASVCA 551

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEW 593
             TMD  I+R+VF   + A+E+L ++ D   E + ++   +L P KI   G + EW
Sbjct: 552 GPTMDNQILRDVFDGFVIASELLGEDADMRAEARTVRG--KLPPMKIGAQGQLQEW 605


>gi|337748975|ref|YP_004643137.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
 gi|379721944|ref|YP_005314075.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
 gi|386724687|ref|YP_006191013.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
 gi|336300164|gb|AEI43267.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
 gi|378570616|gb|AFC30926.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
 gi|384091812|gb|AFH63248.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
          Length = 786

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 214/586 (36%), Positives = 319/586 (54%), Gaps = 40/586 (6%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +  PA+ +TDA+P+GNGRLGAMV+G V  E L++NED++W G P +  NPD  K L 
Sbjct: 11  KLWYEKPARAWTDALPVGNGRLGAMVFGKVNQERLQINEDSVWYGGPLNGDNPDGRKYLP 70

Query: 74  DVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
           +VR L+  G+  EA  AA + L   P  +  YQ LGD+ +  D    K     Y R+LD+
Sbjct: 71  EVRRLLLKGKQLEAEEAAQMGLMSIPKSMRPYQPLGDLHIYHDGE--KKMISNYYRDLDI 128

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVNG 189
               A V Y +  V   RE FSS  D V+  +I+      L+  +++     D  +    
Sbjct: 129 EEGIAHVSYCLNEVPHVREVFSSAVDGVLAVRITCGPDAKLNLRMNVSRRPFDEGTQQLA 188

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           ++ I M G              +   G+ +   + +K   + G ++A  D  L V  ++ 
Sbjct: 189 HDTIAMCG-------------ENGKNGVTY--CMAVKAVPEGGWVNAFGDF-LAVRDANA 232

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
             + +   ++F            DP +E +  L+      Y  +   H+ D++ L+ RV+
Sbjct: 233 VTIYIAGGTTF---------RSDDPLAECVRQLEQAERKGYEAVRRDHVADHRSLYRRVN 283

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPG 368
           ++L   P        S  +  T+P+  R++ F +  EDP L  L FQ+GRYL+++SSRPG
Sbjct: 284 LELDPEP-------VSGPDPSTLPTDARLQRFREGGEDPGLFRLYFQYGRYLMMASSRPG 336

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           +  ANLQGIWNE  +P W+S   +NIN EMNYW +  CNL EC EPLFD +  +  NG K
Sbjct: 337 SNPANLQGIWNESFTPPWESKYTININTEMNYWPAESCNLPECHEPLFDLIDRMRPNGRK 396

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           TA+  Y   G+V HH TD+W  +  +   +  ++WPMG AWL  HLWEHY Y ++  FL 
Sbjct: 397 TAEQLYGCRGFVAHHNTDMWGSTQVEGNYMPGSIWPMGAAWLSLHLWEHYRYGLEETFLR 456

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
           +RAYP+++  A F LD+L E  +G L T PSTSPE++FI PDG +  ++   +MD+ I+ 
Sbjct: 457 ERAYPVMKEAAEFFLDYLFEDKEGRLVTGPSTSPENKFIMPDGSVGTLTIGPSMDIQIVY 516

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            + SA   AAE+L + +D L EK  + L RL P +I   G + EW 
Sbjct: 517 SLLSACTDAAEIL-RTDDLLREKWEEVLRRLPPPQIGRHGQLQEWT 561


>gi|344997079|ref|YP_004799422.1| alpha-L-fucosidase [Caldicellulosiruptor lactoaceticus 6A]
 gi|343965298|gb|AEM74445.1| alpha-L-fucosidase [Caldicellulosiruptor lactoaceticus 6A]
          Length = 752

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 224/590 (37%), Positives = 332/590 (56%), Gaps = 44/590 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ FN PA+ + +A+PIGNG LGAM++GGV  ET++LNE+++W+  P    NPDA K L
Sbjct: 6   LKVIFNKPARCWEEALPIGNGSLGAMIYGGVKYETIQLNEESIWSCGPRRRENPDAFKYL 65

Query: 73  SDVRSLVDSGQYAEATAASV-KLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
            ++R  +  G    A   SV  L G  H    Y+ LG +++ F++      +  Y R LD
Sbjct: 66  PEIRKTILEGNIKRAEELSVFALSGTPHSEGNYEPLGYLDIYFEEVESDKVK-NYTRYLD 124

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS----FNVSLDSLLDNHS 185
           ++ A  +V++ V N+ + + +FSS PD+VIV KI  S++G++S    F       +D   
Sbjct: 125 ISNAICKVEFDVDNIRYKKIYFSSYPDKVIVVKICSSKTGAVSLRAKFRREYQEDIDKCG 184

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
            V+ N++I  E  C             + +G+ FSA+L+  +S D G +  + D  L V+
Sbjct: 185 KVD-NDKIFFE--CLA----------GEGRGVSFSAVLK-AVSKD-GDVYTIGDN-LFVK 228

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            +   +LL+ +++S+          +KD  +  +  ++      + +LY RH +DY+ LF
Sbjct: 229 NATEVMLLITSTTSY---------KEKDYFNWCLKTVEQASKYVFENLYKRHTEDYKSLF 279

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            RV   +        T+  + E I+ +    +        D  L+ LLFQFGRYLLISSS
Sbjct: 280 SRVEFYIDTKDSSKCTELTTPERINLLREGYK--------DEELIVLLFQFGRYLLISSS 331

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPG    NLQGIWN+++ P W S   +NINL+MNYW +  CNLSEC  PLFD L  +  N
Sbjct: 332 RPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEVCNLSECHMPLFDLLEKMYEN 391

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G  TAQ  Y   G+  HH TDIW  ++     +    WPMG AWLC H+W+HY YT D +
Sbjct: 392 GKITAQRMYGCRGFCAHHNTDIWGDTAPQDIYIPATYWPMGAAWLCLHIWDHYEYTGDLE 451

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           FL K  Y L+   A FLLD+LIE  +GYL T PS SPE+ +   +G++  ++Y  TMD+ 
Sbjct: 452 FL-KEYYYLMREAALFLLDYLIEDRNGYLVTCPSCSPENRY-KLNGEVYSLTYMPTMDIQ 509

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           II  +F  +  A  VL+ N D +VEK+  +L +L P KI + G I EW++
Sbjct: 510 IITALFEKVKKANNVLKLN-DEIVEKIEYALNKLPPIKIGKHGQIQEWIE 558


>gi|21231206|ref|NP_637123.1| hypothetical protein XCC1756 [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66768787|ref|YP_243549.1| hypothetical protein XC_2479 [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|21112850|gb|AAM41047.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
 gi|66574119|gb|AAY49529.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. 8004]
          Length = 790

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 230/594 (38%), Positives = 331/594 (55%), Gaps = 42/594 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           + +  + L + +  PA  +  A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D TN
Sbjct: 38  AVAPDDALHLWYREPANEWVQALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDATN 97

Query: 66  PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
           P A  AL  VR+L+ +G+YAEA   A  KL   P     YQ LGD+ L+FD +       
Sbjct: 98  PQALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GIS 154

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            YRR+LDL+TA A   +  G     RE F S   Q IV ++S    G +S  V +DS   
Sbjct: 155 EYRRQLDLDTAVATTTFRSGGAVHRREVFVSAQSQCIVVRLSCDRPGGISLRVGIDSPQS 214

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
               V     ++  GR         + A  D K ++F+  L +      G+++A+ D+ L
Sbjct: 215 GEVTVE-QGSLLFSGRN-------GSFAGIDGK-LRFA--LRVLPQVKGGSVTAVRDR-L 262

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
           +++G+D  VLLL A++S+          + DP + ++++LQ    LSY+ L   HL D+Q
Sbjct: 263 RIQGADEVVLLLTAATSYQ----RFDAVEGDPLALTVASLQKAGKLSYAALLRAHLADHQ 318

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           +LF RV+I L  S                +P+ ERV+ F    DP+L  L  Q+GRYLLI
Sbjct: 319 RLFRRVAIDLGSS------------EAARLPTDERVQRFAEGNDPALAALYHQYGRYLLI 366

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
            SSRPGTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L  L
Sbjct: 367 CSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFDL 426

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           +  G+ TA+  Y A GWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y  
Sbjct: 427 ARTGAHTARALYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYGR 485

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           DR +L K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C     T
Sbjct: 486 DRAYLAK-IYPLFKGAAEFFVATLVRDPGTGAMVTNPSMSPENQH--PFGAAVCA--GPT 540

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           MD  ++R++F+  I+ +++L+ +  AL +++     +L P +I + G + EW Q
Sbjct: 541 MDAQLLRDLFAQCIAMSKLLDVDA-ALAQQLATLREQLPPNRIGKAGQLQEWQQ 593


>gi|240144516|ref|ZP_04743117.1| alpha-L-fucosidase 2 [Roseburia intestinalis L1-82]
 gi|257203465|gb|EEV01750.1| alpha-L-fucosidase 2 [Roseburia intestinalis L1-82]
          Length = 741

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 222/586 (37%), Positives = 320/586 (54%), Gaps = 54/586 (9%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PAK + +A+P+GNGR+GAM++GGV  E +++NE+++W G P D  NPDA   L ++R
Sbjct: 6   YKEPAKVWEEALPLGNGRIGAMIFGGVEQERIQVNEESIWYGGPVDRNNPDAKAHLEEIR 65

Query: 77  SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIEL---EFDDSHLKYAEETYRRELDL 130
             +  G+  EA    ++ + G P  +  YQ LGDI +     +D       E Y+R L+L
Sbjct: 66  QHIFEGRLKEAQRLMNLTMSGCPDSMHPYQTLGDINIYSSGIEDV------ENYKRSLNL 119

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A   V++   +V F RE F S P   +V + +  +S  +SF  +L        Y +G 
Sbjct: 120 EEAVCLVEFDSRSVHFKREMFLSYPKDCLVIRFTADKSSQISFQANLS----RGRYFDGI 175

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           N++   G C           N    G  F  ++ IK     G  SA+    L V+G+D  
Sbjct: 176 NKLGENGIC--------LYGNLGRGGSDF--VMGIKAWAKGGVASAV-GGNLCVQGADEV 224

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN----LSYSDLYTRHLDDYQKLFH 306
           +L   A+SSF           K    E +  ++   N    L+Y +L+  H +DY+ LF 
Sbjct: 225 LLTFCAASSF---------RNKKKCDELLREIEEKMNNAAMLTYEELFEEHKEDYRTLFA 275

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISSS 365
           RV  QL              E  D +P+ ER+ ++ +   D  L ++LF +GRYLLIS S
Sbjct: 276 RVEFQLD-----------GVEKFDVIPTNERIERAAKETPDIGLSKMLFDYGRYLLISCS 324

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPG   A LQGIWN+D +P W+S   +NIN EMNYW +  CNLSEC  PLFD L  +  N
Sbjct: 325 RPGGLPATLQGIWNQDFTPPWESKYTININTEMNYWLAESCNLSECHMPLFDLLERMVEN 384

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G +TA+  Y   G+V HH TDI   ++          W MG AWLCTHLW HY YT+DR+
Sbjct: 385 GRRTAEKMYGCRGFVAHHNTDIHGDTAPQDTWYPATYWVMGAAWLCTHLWTHYEYTLDRE 444

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           FLE R+YP++   A F +D+L+E  DGYL T PS SPE+ +  P+G++  VSY +TMD  
Sbjct: 445 FLE-RSYPIMCEAALFFIDFLVE-KDGYLVTCPSLSPENTYCLPNGEMGAVSYGATMDNQ 502

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
           I+R++FS  ++A ++L+    A +EK    L +L PT+I  DG IM
Sbjct: 503 ILRDLFSQCLAAGKILQATNSAFLEKAEYVLQKLLPTRIGSDGRIM 548


>gi|345013386|ref|YP_004815740.1| large hypothetical protein [Streptomyces violaceusniger Tu 4113]
 gi|344039735|gb|AEM85460.1| large secreted protein [Streptomyces violaceusniger Tu 4113]
          Length = 805

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 226/590 (38%), Positives = 316/590 (53%), Gaps = 41/590 (6%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
            PL + +  PA  +  A+P+GNGRLGAMV+G   +E L+LN DTLW G P  Y N     
Sbjct: 44  RPLALWYREPAADWLSALPLGNGRLGAMVFGATETERLQLNADTLWAGGPHSYDNHKGLA 103

Query: 71  ALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
           AL  +R LV  G++ EA T  +    G P     YQ +G + L         A   YRRE
Sbjct: 104 ALPRIRQLVFDGKWPEAETLINSDFLGVPGGQAQYQTVGSLLLSLPTGG---AVTGYRRE 160

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           LDL++A A   Y+   V FTRE F+S PD+VIV ++S S+ G+LSF  + +S L      
Sbjct: 161 LDLDSAVATTTYTRDGVTFTREAFASAPDRVIVVRLSASKKGALSFGATFESPLRTSLSS 220

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
                  ++G           +A     G + F A++ +         +      + V G
Sbjct: 221 PDPLTAALDG---------TGDATGGVDGAVGFRALVRVLAEG---GTTTSAGGTVTVRG 268

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D A +L+   +++    +N  ++  D   ++ + L    N  Y  L +RH+DD++ LF 
Sbjct: 269 ADAATVLVAIGTTY----VNWENANGDAAGQAAADLNPAANRPYGQLRSRHVDDHRALFR 324

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           R S+ +               +   +P+ ERV  F +  DP LVEL FQ+GRYLLI++SR
Sbjct: 325 RTSLDVGSG------------DAAALPTDERVSRFASGGDPQLVELHFQYGRYLLIAASR 372

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PGTQ A LQGIWN+  SP W S   +NIN EMNYW + P NL EC EP+F  L  L++ G
Sbjct: 373 PGTQPATLQGIWNDLTSPPWGSKYTININTEMNYWPAAPANLLECWEPVFALLDELAVAG 432

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
             TA+  Y A GWV HH TD+W + +A      W +WPMGGAW+   +WEHY YT D + 
Sbjct: 433 RSTARTQYGADGWVTHHNTDVW-RGTAPVDGAFWGMWPMGGAWMSMAIWEHYRYTRDTEK 491

Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L  R YP+L+G A F LD L+ +   G L T PS SPE+   +  G   C     TMDM 
Sbjct: 492 LRAR-YPVLKGAAQFFLDALVTDPATGALVTCPSVSPENAHHSGGGGSLCA--GPTMDMQ 548

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           ++R++F A+ SAA+ L   + AL ++VL +  RL P KI   G + EW Q
Sbjct: 549 LLRDLFGAVASAADTL-GTDAALRDQVLAARGRLAPMKIGAQGRLQEWQQ 597


>gi|237722004|ref|ZP_04552485.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|423291145|ref|ZP_17269993.1| hypothetical protein HMPREF1069_05036 [Bacteroides ovatus
           CL02T12C04]
 gi|229448873|gb|EEO54664.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|392664179|gb|EIY57721.1| hypothetical protein HMPREF1069_05036 [Bacteroides ovatus
           CL02T12C04]
          Length = 792

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 226/598 (37%), Positives = 327/598 (54%), Gaps = 50/598 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +N PA  + +A+PIGNGR+GAMV+G    E  +LNE+++W+G P D+ NP A  AL 
Sbjct: 27  KLWYNAPATVWEEALPIGNGRIGAMVYGNPLQEVYQLNEESIWSGYPQDWNNPKAANALP 86

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            VR  VD G YA+A+    K         + L    L  D      A   YR EL+++ A
Sbjct: 87  QVREAVDRGDYAKASEL-WKANAQGPYTARYLPMANLMLDQLTRGEARNLYR-ELNISNA 144

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            + V Y    V++ R  F S PDQV+V KI+     ++S ++ L+SLL       G   +
Sbjct: 145 LSTVTYEADGVKYRRTSFISYPDQVMVIKIAADRPQAVSLHIRLNSLLRYTVQTKGEKTL 204

Query: 194 IMEGRCPG----KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           I+ G+ P     +   P     DD +G QF   +++++  D G   A  D  L V  ++ 
Sbjct: 205 ILNGKAPAYVANRDYDPHQVVYDDKRGTQFK--VQVELLPDGGHCEA-NDSALTVRNANE 261

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
            VLLL A + F    +     K+                 Y +L  RH DD+Q+LF+R+ 
Sbjct: 262 VVLLLSAVTDFGNKKMTLKKCKR----------------PYQELLQRHTDDHQQLFNRLQ 305

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG 368
           + L        T+   +E    +P+ ER+KSF+ D  D  L EL +Q+GRYLLI+SSRPG
Sbjct: 306 LSLG-------TENLQKE---ALPTNERLKSFEQDPTDNGLTELYYQYGRYLLIASSRPG 355

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
              ANLQGIWN  + P W S    NIN EMNYW +   NL EC  PL DF+  L++NG++
Sbjct: 356 GLPANLQGIWNRHVQPPWGSNYTTNINTEMNYWPAEITNLPECFLPLSDFIGRLAVNGAQ 415

Query: 429 TAQVNY-LASGWVIHHKTDIWAKS-------SADRGKVVWALWPMGGAWLCTHLWEHYNY 480
           TA+VNY +  GW+ HH +D+WA++       S  +G   W+ WPM G WLC HLWEHY +
Sbjct: 416 TAKVNYGINRGWLAHHNSDVWAQTAPTGGYDSDPKGAPRWSCWPMAGVWLCQHLWEHYAF 475

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEF--IAPDGKL--AC 535
             D+ +L K AYPL++G A FLL WL +  + GY  TNPSTSPE+ F  I  +GK     
Sbjct: 476 GGDKKYLSKTAYPLMKGAAEFLLQWLQKDPETGYWITNPSTSPENRFRYIDKEGKKQNGE 535

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +S SS MD+ +  ++ +  I A+ VL+ ++ A  ++ +     L+P +I   G ++EW
Sbjct: 536 ISRSSGMDLGLAWDLLTNCIEASTVLDTDK-AFRQQCMDVRANLQPFRIGSKGQLLEW 592


>gi|333379822|ref|ZP_08471540.1| hypothetical protein HMPREF9456_03135 [Dysgonomonas mossii DSM
           22836]
 gi|332884726|gb|EGK04982.1| hypothetical protein HMPREF9456_03135 [Dysgonomonas mossii DSM
           22836]
          Length = 813

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 227/586 (38%), Positives = 335/586 (57%), Gaps = 42/586 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +  PAK + +A+P+GN RLGAMV+G    E L+LNE+T+W G P    +P+  K L 
Sbjct: 24  KLLYKRPAKEWVEALPLGNSRLGAMVFGNPAREQLQLNEETMWGGGPHRNDSPNMLKVLD 83

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           +VRSL+ +G+  EA A   K    P +   YQ +G + L+F   H KY+   Y R+LDL 
Sbjct: 84  EVRSLIFAGKEKEAEALLEKNMRTPHNGMPYQTIGSLYLDFA-GHNKYS--NYSRQLDLT 140

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TA A  KY+V  + +TRE FSS  D VI+ +I+  +  S+SF    DS + ++      +
Sbjct: 141 TAVATTKYTVDGINYTREVFSSFTDNVIIMRITADKPNSISFTAGYDSPVKDYKVQAKGD 200

Query: 192 QIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           ++I++G             ++  KG I+F    +IK     G    +E  KL V+ ++  
Sbjct: 201 KLILKGM---------GAEHEGIKGVIRFENQTQIKT---EGGSVKVESNKLSVKAANSV 248

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           V+ +  +++F    +N  D   + ++ +   L++  +  Y      H+  Y+K F RVS+
Sbjct: 249 VIYISIATNF----VNYQDVSANESTSATHFLKTAISKPYEKALADHIKYYKKQFDRVSL 304

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            L +S      D+  EE      +  RV++F+  +D SLV LLFQFGRYLLISSS+PG Q
Sbjct: 305 DLGKS------DSILEE------TDVRVRNFKEGKDQSLVTLLFQFGRYLLISSSQPGGQ 352

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWN+ L P WDS   +NIN EMNYW +   NLSE  +PLF  L  L++ G +TA
Sbjct: 353 PANLQGIWNDQLVPPWDSKYTININTEMNYWPAEVTNLSETHQPLFQMLKELAVTGQETA 412

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
           +V Y A+GWV HH TD+W  +    G     +WP GGAWL  H+W+HY YT D+ FL K 
Sbjct: 413 KVMYNANGWVAHHNTDLWRTTGPVDG-AFHGMWPNGGAWLSQHMWQHYLYTGDKSFL-KE 470

Query: 491 AYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
           AYP+L+G A F LD+L+E H  Y  + T+PSTSPE     P GK   ++  STMD  I+ 
Sbjct: 471 AYPVLKGAADFFLDFLVE-HPTYKWMVTSPSTSPEQ---GPPGKNTSITAGSTMDNQIVF 526

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +V +  + A++ L   ++A  +K+   + RL P +I +   + EW+
Sbjct: 527 DVLNNALEASKTLGVGDEAYNQKLEDMISRLAPMQIGKYNQLQEWL 572


>gi|340619498|ref|YP_004737951.1| alpha-L-fucosidase [Zobellia galactanivorans]
 gi|339734295|emb|CAZ97672.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
          Length = 792

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 228/600 (38%), Positives = 333/600 (55%), Gaps = 53/600 (8%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           N  K+ +  PA  + +A+PIGNG+LGAMV+GGV SE L+LNE+++W G P       A K
Sbjct: 34  NGNKLWYTQPAADWMEALPIGNGKLGAMVFGGVESERLQLNEESVWAGPPIPENRVGAFK 93

Query: 71  ALSDVRSLVDSGQYAEATAASV-KLFGH--PADVYQLLGDIELEFDDSHLKYAEETYRRE 127
           ++   R+L+  G Y EA       + G       YQ LG++ L F+   LK +   YRRE
Sbjct: 94  SIEKARALIFQGDYLEANKVMQDNVMGERIAPRSYQPLGNLILNFN---LKGSPTDYRRE 150

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           LDL  A A+  ++V  V +TRE+FSS  +  IV  ++ ++  ++S  + +D   D     
Sbjct: 151 LDLKRAIAKTDFTVNGVRYTREYFSSAIENTIVVVLTANQPKAISLELKMDRKADFEVAG 210

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI-SDDRGTISALEDKKLKVEG 246
            G N++ M G+                KG       E ++ +  +G   + E+  +K+  
Sbjct: 211 VGKNRLRMWGQA-------------SQKGKHLGVKYETQVMALPKGGKMSSENGNIKITA 257

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDP--------TSESMSALQSIRNLSYSDLYTRHL 298
           ++  VLL+ A + ++         KKDP        ++   S L+     S   L   H+
Sbjct: 258 ANSVVLLVSAKTDYN---------KKDPFSPFTENLSTACASVLKKTARKSVKKLKEEHI 308

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           DDYQ  F+RV + L   P +   D  + E ++ V +          +DP L+EL FQ+GR
Sbjct: 309 DDYQHYFNRVVLDLGSFPGE---DKPTNERLEAVINGA--------DDPGLMELYFQYGR 357

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSSRPG+  ANLQGIWN+ L+  W+S  H NIN++MNYW +   NLSEC EP F+F
Sbjct: 358 YLLISSSRPGSLPANLQGIWNDHLAAPWNSDYHTNINMQMNYWPAEVANLSECHEPFFEF 417

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +  L  +G KTA+  Y + G+V+HH TD+W  +S   GKV + +WPMGGAW   H  EHY
Sbjct: 418 IESLVPSGKKTAKEVYDSEGFVVHHTTDVWHWTSP-IGKVQYGMWPMGGAWCTRHFMEHY 476

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG--KLAC 535
           ++T D  FL ++AYP+++  A FLLDWL+ +   G L + PSTSPE++F  P    K A 
Sbjct: 477 SFTGDTTFLAEQAYPIMKESAKFLLDWLVTDPRSGKLVSGPSTSPENKFYTPKNGEKFAN 536

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           V   + MD  II + FS ++ AA++L K EDA V++V  +L  L   KI  DG +MEW Q
Sbjct: 537 VDMGNAMDQEIIWDNFSNVLEAAKIL-KIEDAFVDEVKAALSNLSLPKIGSDGRLMEWSQ 595


>gi|423214472|ref|ZP_17201000.1| hypothetical protein HMPREF1074_02532 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692887|gb|EIY86123.1| hypothetical protein HMPREF1074_02532 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 792

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 226/598 (37%), Positives = 327/598 (54%), Gaps = 50/598 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +N PA  + +A+PIGNGR+GAMV+G    E  +LNE+++W+G P D+ NP A  AL 
Sbjct: 27  KLWYNAPATVWEEALPIGNGRIGAMVYGNPLQEVYQLNEESIWSGYPQDWNNPKAANALP 86

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            VR  VD G YA+A+    K         + L    L  D      A   YR EL+++ A
Sbjct: 87  QVREAVDRGDYAKASEL-WKANAQGPYTARYLPMANLMLDQLTRGEARNLYR-ELNISNA 144

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            + V Y    V++ R  F S PDQV+V KI+     ++S ++ L+SLL       G   +
Sbjct: 145 LSTVTYEADGVKYRRTSFISYPDQVMVIKIAADRPQAVSLHIRLNSLLRYTVQTKGEKTL 204

Query: 194 IMEGRCPG----KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           I+ G+ P     +   P     DD +G QF   +++++  D G   A  D  L V  ++ 
Sbjct: 205 ILNGKAPAYVANRDYDPHQVVYDDKRGTQFK--VQVELLPDGGHCEA-NDSALTVRNANE 261

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
            VLLL A + F    +     K+                 Y +L  RH DD+Q+LF+R+ 
Sbjct: 262 VVLLLSAVTDFGNKKMTLKKCKR----------------PYQELLQRHTDDHQQLFNRLQ 305

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG 368
           + L        T+   +E    +P+ ER+KSF+ D  D  L EL +Q+GRYLLI+SSRPG
Sbjct: 306 LSLG-------TENLQKE---ALPTNERLKSFEQDPTDNGLTELYYQYGRYLLIASSRPG 355

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
              ANLQGIWN  + P W S    NIN EMNYW +   NL EC  PL DF+  L++NG++
Sbjct: 356 GLPANLQGIWNRHVQPPWGSNYTTNINTEMNYWPAEITNLPECFLPLSDFIGRLAVNGAQ 415

Query: 429 TAQVNY-LASGWVIHHKTDIWAKS-------SADRGKVVWALWPMGGAWLCTHLWEHYNY 480
           TA+VNY +  GW+ HH +D+WA++       S  +G   W+ WPM G WLC HLWEHY +
Sbjct: 416 TAKVNYGINRGWLAHHNSDVWAQTAPTGGYDSDPKGAPRWSCWPMAGVWLCQHLWEHYAF 475

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEF--IAPDGKL--AC 535
             D+ +L K AYPL++G A FLL WL +  + GY  TNPSTSPE+ F  I  +GK     
Sbjct: 476 GGDKKYLSKTAYPLMKGAAEFLLQWLQKDPETGYWITNPSTSPENRFRYIDKEGKKQNGE 535

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +S SS MD+ +  ++ +  I A+ VL+ ++ A  ++ +     L+P +I   G ++EW
Sbjct: 536 ISRSSGMDLGLAWDLLTNCIEASTVLDTDK-AFRQQCMDVRANLQPFRIGSKGQLLEW 592


>gi|390958737|ref|YP_006422494.1| hypothetical protein Terro_2924 [Terriglobus roseus DSM 18391]
 gi|390413655|gb|AFL89159.1| hypothetical protein Terro_2924 [Terriglobus roseus DSM 18391]
          Length = 824

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 225/593 (37%), Positives = 321/593 (54%), Gaps = 35/593 (5%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           P ++ F  PA  + DA+PIGNGRLG MV+GG   + + LNEDTLW+G P D  NP A   
Sbjct: 38  PYQLWFRTPAAEWIDALPIGNGRLGGMVFGGALEDHIALNEDTLWSGYPQDGNNPAAKSK 97

Query: 72  LSDVR-SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           L  VR +++ +  Y  A     ++ G  +  YQ LG + +     H +     YRR+L+L
Sbjct: 98  LPLVRQAVLKNKDYHLADTLCKEMQGPYSAAYQPLGGLHVTL---HQEGELADYRRDLNL 154

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           +TA A+  Y +G+V  +++ F S PD V+V  I  ++   ++  + LDS L +   V G+
Sbjct: 155 DTAIAKTTYRLGDVSVSKKAFVSFPDDVLVMLIETTKP--VTMEIRLDSKLRHEVSVAGH 212

Query: 191 NQIIMEGRCPGKRIP-------PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
             + ++G+ P    P       P   ++   KG+ F+A   I  SD    ++  +D  L+
Sbjct: 213 -ALQLKGKAPVVSRPNYVKSQDPIQYSDTPGKGMFFAAGASIH-SDG---VTNAKDGALQ 267

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           +  +   V+LL A + F G  + P     +        L +    + + L   H+  ++ 
Sbjct: 268 IANAKSVVILLAAGTGFRGHGLLPDKPMAEIMGRVQQTLANASRKTAAQLERVHIAAHRA 327

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           +F R  + L +  +D+   T           AER+  F    DPSL+ L FQFGRYLLIS
Sbjct: 328 VFRRTLLDLGK--QDLTRST-----------AERLSDFAAHPDPSLLALYFQFGRYLLIS 374

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSRPGTQ ANLQGIWN+DL   W      NIN++MNYW +  CNLS+   P FD L  LS
Sbjct: 375 SSRPGTQPANLQGIWNDDLRAPWSCNWTSNINIQMNYWLAETCNLSDFHAPFFDLLQSLS 434

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEHYNY 480
             G++TA+ NY   GWV HH  DIW+ SS      G   WA + M   WLC HLW+HY +
Sbjct: 435 ETGARTAKTNYGLPGWVSHHNIDIWSLSSPVGEGEGDPSWANFAMSAPWLCAHLWDHYCF 494

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
           T D++FL  RAYPL++G A F   WLI    G L T PS S E++F APDGK A VS   
Sbjct: 495 TQDQNFLRTRAYPLMKGAAQFCSSWLIPDDQGNLTTCPSVSTENQFTAPDGKRASVSAGC 554

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           TMD+A+IRE+FS    AA+VL  + D    ++ +   +L P  + + G + EW
Sbjct: 555 TMDIALIREIFSNCAEAAKVLNVDHD-WANQLQQQSAKLVPYAVGQYGQLQEW 606


>gi|399028921|ref|ZP_10730010.1| hypothetical protein PMI10_01838 [Flavobacterium sp. CF136]
 gi|398073242|gb|EJL64421.1| hypothetical protein PMI10_01838 [Flavobacterium sp. CF136]
          Length = 820

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 219/597 (36%), Positives = 340/597 (56%), Gaps = 32/597 (5%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
           +K+ ++ PA  + +A+P+GNGR+GAMV+G V  E ++LNE +LW+G P     NP A + 
Sbjct: 23  IKLWYDKPAAQWVEALPLGNGRIGAMVFGSVEDELIQLNEGSLWSGGPMKKNVNPKAYQY 82

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           L  +R  + +  + +A     K+ G+ ++ +  +GD+ +  D    K   + Y R+L L+
Sbjct: 83  LQPLREALYAEDFQKADELCRKMQGYFSESFLPMGDLVIHHDFGSDK--SQNYYRDLKLD 140

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
            A +   ++V  V+++RE F S P  +++ K+  S+ G+L+F+  L S+L N   V  ++
Sbjct: 141 QAVSTTNFTVKGVKYSREIFISAPANIMIVKMKASKKGALTFDAKLSSVLTNSVSVLADD 200

Query: 192 QIIMEGRCPGKRIPPKANA-NDDP---------KGIQFSAILEIKISDDRGTISALEDKK 241
           +++++G+ P +  P   N  N  P          G++F   L+  + D  G++   +   
Sbjct: 201 RLVLDGKAPARVDPSYYNKKNRQPIILEDTTGCNGMRFRMDLKASLKD--GSVKT-DANG 257

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           + V  +   +L   A++SF+G    P    K+    + S +++     Y  L   H+ DY
Sbjct: 258 IHVTNATEVILYFAAATSFNGFDKCPDSEGKNEKVITDSIIKNSTAQKYESLKKDHIADY 317

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYL 360
           QK F+RV++ L         +  + +N   +P  ER+K++    +DP L +  +Q+GRYL
Sbjct: 318 QKYFNRVNLDLE--------EENTNKNTSVLPWDERLKAYTAGGKDPILEQTFYQYGRYL 369

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSR G Q ANLQGIWN++L   W S   +NIN +MNYW +   NLSE  +PL D++ 
Sbjct: 370 LISSSRLGGQPANLQGIWNKELRAPWSSNYTININTQMNYWPAEQTNLSEMHQPLLDWIG 429

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWE 476
            LS  G   A   Y A+GWV HH +DIWA S+A      G   WA W MGG WLC HLWE
Sbjct: 430 NLSQTGRTAASEYYHANGWVAHHNSDIWALSNAVGNKGDGSPTWANWYMGGNWLCQHLWE 489

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           HY +T D++FL K AYP+++  A F  DWL E  DGYL T PS+SPE+E I  +GK   V
Sbjct: 490 HYIFTGDKEFLRKTAYPVMKEAALFSFDWLQE-KDGYLVTAPSSSPENE-IHINGKNYGV 547

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           + +STMDM+I R++F  +I A+E+L  +ED   E  +K   +L P KI   G ++EW
Sbjct: 548 TVASTMDMSICRDLFGNLIKASEILNIDEDFRKELEVKK-AKLFPLKIGSKGQLLEW 603


>gi|330467858|ref|YP_004405601.1| cellulose-binding family ii [Verrucosispora maris AB-18-032]
 gi|328810829|gb|AEB45001.1| cellulose-binding family ii [Verrucosispora maris AB-18-032]
          Length = 998

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 226/573 (39%), Positives = 310/573 (54%), Gaps = 43/573 (7%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A+PIGNGRLGAMV+G   +E L+LNEDT+W G P D +NP    +L+++R LV + Q+ +
Sbjct: 61  ALPIGNGRLGAMVFGNSDTERLQLNEDTVWAGGPHDSSNPRGQGSLAEIRRLVFANQWTQ 120

Query: 87  A-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
           A    +  + G+P     YQ +G++ L F  +        Y R+LDL TAT  V Y +  
Sbjct: 121 AQNLINQTMLGNPVGQLAYQTVGNLRLAFASAS---GTSQYNRQLDLTTATTSVSYVMNG 177

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
           V F RE F+S PDQVI  +++   S S++F  + DS             I ++G      
Sbjct: 178 VRFQREVFASAPDQVIAMRLTADRSASITFTATFDSPQRTTVSSPDGATIALDG------ 231

Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGP 263
                N       ++F   L +  +   G   +     L+V G+    LL+   SS+   
Sbjct: 232 --VSGNQEGVTGAVRF---LALAHATVSGGTVSSSGGTLRVTGATSVTLLVSIGSSY--- 283

Query: 264 FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT 323
            +N  +   D    +   L + R  SY  L  RH+ DYQ LF RVS+ L R+       +
Sbjct: 284 -VNFRNVGGDYQGIARRHLTAARASSYDQLRARHVADYQALFGRVSLDLGRT-------S 335

Query: 324 CSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
            +++     P+  R+    +  DP    LLFQ+GRYLLISSSRPGTQ ANLQGIWN+ L+
Sbjct: 336 AADQ-----PTDVRIAQHNSVNDPQFSTLLFQYGRYLLISSSRPGTQPANLQGIWNDSLT 390

Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
           P WDS   +N NL MNYW +   NLSEC +P+F  +  L+++G++TAQV Y A GWV HH
Sbjct: 391 PAWDSKYTINANLPMNYWPADTTNLSECYQPVFSMIQDLTVSGARTAQVQYGAGGWVTHH 450

Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
            TD W  SS   G   W +W  GGAWL T +W+HY +T D DFL    YP ++G A F L
Sbjct: 451 NTDAWRGSSVVDG-AFWGMWQTGGAWLATMIWDHYLFTGDLDFLRAN-YPAMKGAAQFFL 508

Query: 504 DWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
           D L+ E   GYL TNPS SPE    A     A V    TMD  I+R++F     A+E+L 
Sbjct: 509 DTLVTEPSLGYLVTNPSNSPEIGHHAD----ASVCAGPTMDNQILRDLFDGCARASEIL- 563

Query: 563 KNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWV 594
            N DA    +V  +  RL PT+I   G+IMEW+
Sbjct: 564 -NTDATFRAQVRATRDRLAPTRIGSRGNIMEWL 595


>gi|404484444|ref|ZP_11019648.1| hypothetical protein HMPREF9448_00050 [Barnesiella intestinihominis
           YIT 11860]
 gi|404339449|gb|EJZ65880.1| hypothetical protein HMPREF9448_00050 [Barnesiella intestinihominis
           YIT 11860]
          Length = 802

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 223/594 (37%), Positives = 339/594 (57%), Gaps = 31/594 (5%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
           +K+ ++ PA++F +A+ IGNG +GA ++GGV  + +  N+ TLWTG P  + ++PDA   
Sbjct: 25  MKLHYDRPAEYFEEALVIGNGTMGATLYGGVKKDKISFNDITLWTGEPESENSSPDAFNV 84

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           + ++R+L+D+  Y  A  A  K+ GH ++ YQ LG + +E+ D     ++  Y R LD+ 
Sbjct: 85  IPEIRALLDNEDYEGADKAQYKVQGHYSENYQPLGTLTIEYLDDTAGISD--YHRWLDIG 142

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
            ATAR +Y      FT ++F+S PD VIV ++       +   +S DS L + S V  +N
Sbjct: 143 NATARTQYLKDGKLFTSDYFASAPDSVIVIRLKSENKEGIHALLSFDSPLPHSSQV-ADN 201

Query: 192 QIIMEGRCPGKRIPPKANAND----DP-KGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           +I +EG       P    A D    DP +GI F  ++ + +S D    +   D +++++G
Sbjct: 202 EISVEGYAAYHSFPVYYKAEDKHRYDPERGIHFKTLVRV-LSVDGSVKNRYSDSRIEIDG 260

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           S   ++L+   +SF+G   +P    ++  S     ++     +Y  L   H+ DY+  F 
Sbjct: 261 STEVLILIANVTSFNGFDKDPVKEGRNYRSHVEKRMKCAIGKTYDALREAHIRDYKYYFD 320

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD---EDPSLVELLFQFGRYLLIS 363
           RV + L  +  DI            +P+ +++  F TD   ++P L EL FQFGRYLLIS
Sbjct: 321 RVKLDLGNTDDDIAA----------LPTDKQL-LFYTDCKQQNPDLEELYFQFGRYLLIS 369

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSR     ANLQG+WNE + P W S   VNINLE NYW S   NL E Q PL +F+  LS
Sbjct: 370 SSRTPGVPANLQGLWNESVLPPWSSNYTVNINLEENYWASGTTNLIEMQYPLIEFIANLS 429

Query: 424 INGSKTAQVNY-LASGWVIHHKTDIWAKS---SADRGKVVWALWPMGGAWLCTHLWEHYN 479
             G KTA+  Y +  GW + H +D+WA +     + G   WA W MGG WL TH+WEHY 
Sbjct: 430 KTGRKTAKDYYGVERGWCLGHNSDVWAMTCPVGLNEGDPSWACWTMGGTWLSTHIWEHYL 489

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           +T+D+ FL K  YP+L+G A F +DWL+E  DG L T+P TSPE+++I PDG +   SY 
Sbjct: 490 FTLDKGFLCK-FYPVLKGAAEFCMDWLVE-KDGKLVTSPGTSPENKYITPDGYVGATSYG 547

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +T D+A+IRE       A++VL  ++ +  +++ K+L RL P +I  DG++ EW
Sbjct: 548 NTSDLAMIRECLIDAAEASKVLGVDK-SFRKRIKKTLSRLYPYQIGTDGNLQEW 600


>gi|302872475|ref|YP_003841111.1| alpha-L-fucosidase [Caldicellulosiruptor obsidiansis OB47]
 gi|302575334|gb|ADL43125.1| Alpha-L-fucosidase [Caldicellulosiruptor obsidiansis OB47]
          Length = 753

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 223/590 (37%), Positives = 326/590 (55%), Gaps = 44/590 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LKI FN PA  + +A+PIGNG LGAM++GGV  ET++LNE+++W+  P    NPDA + L
Sbjct: 6   LKILFNHPANCWEEALPIGNGSLGAMIYGGVEYETIQLNEESIWSCGPRRRENPDALRYL 65

Query: 73  SDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
            ++R  +  G    A   SV       H    Y+ LG +++ F+    K   E Y R LD
Sbjct: 66  QEIRKSILEGNIKRAEELSVFALSGTPHSEGNYEPLGYLDIYFEGIE-KDKIENYCRYLD 124

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS----FNVSLDSLLDNHS 185
           ++ A  +V++SVG   + + +FSS PD+VIV KIS SE   ++    F       +D   
Sbjct: 125 ISNAICKVEFSVGKARYDKLYFSSFPDKVIVIKISCSEKCGVTLRAKFRREFQEDIDRCG 184

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
            + GN++I  E      R            G+ FSA+L+  +S D G +  + D  L ++
Sbjct: 185 KI-GNDKIFFECTAGSGR------------GVSFSAMLK-AVSKD-GDVYTIGDN-LFIK 228

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            +   +LL+ +++S+          +KD  +  +  L+ +    + +LY RH +DY+ LF
Sbjct: 229 NATEVMLLITSTTSY---------KEKDYFNWCLKTLEQVSKHDFEELYKRHTEDYKSLF 279

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            RV   +  +  +      + E I+ +    R        D  L+ LLFQFGRYLLISSS
Sbjct: 280 DRVEFYIDTANTNDRIGLTTPERINLLKKGYR--------DEELIVLLFQFGRYLLISSS 331

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPG    NLQGIWN+++ P W S   +NINL+MNYW +  CNLSEC  PLF  L  +  N
Sbjct: 332 RPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEICNLSECHLPLFTLLERMYEN 391

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G  TAQ  Y   G+  HH TDIW  ++     +    WPMG AWLC H+WEHY YT D D
Sbjct: 392 GKITAQKMYNCRGFCAHHNTDIWGDTAPQDIYIPATYWPMGAAWLCLHIWEHYEYTGDLD 451

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           FL K+ Y L+   A FLLD+LIE  +GYL T PS SPE+ +   +G +  ++Y  T+D+ 
Sbjct: 452 FL-KKYYYLMREAALFLLDYLIEDKNGYLVTCPSCSPENSY-KLNGNVYSLTYMPTIDIQ 509

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           II  +F  +  A ++L+ N D ++EK+  +L +L P KI + G I EW++
Sbjct: 510 IISVLFEKVKKANDILKLN-DEIIEKIDYALEKLPPIKIGKYGQIQEWIE 558


>gi|430751368|ref|YP_007214276.1| hypothetical protein Theco_3223 [Thermobacillus composti KWC4]
 gi|430735333|gb|AGA59278.1| hypothetical protein Theco_3223 [Thermobacillus composti KWC4]
          Length = 768

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 220/597 (36%), Positives = 328/597 (54%), Gaps = 53/597 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + ++ PA  + +A+PIGNGR+GAMV+G   SE L+LNED+LW G P D  NPDA K L
Sbjct: 1   MVMKYDRPAAEWNEALPIGNGRMGAMVFGHPVSERLQLNEDSLWYGGPRDRNNPDAAKVL 60

Query: 73  SDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
            ++R L+  G+  EA   +V  L G P     Y+ LG + L F+      A E Y+R LD
Sbjct: 61  PEIRRLIFEGKPREAERLAVTGLSGIPETQRHYEPLGQLLLHFEGIDPD-AVEQYQRSLD 119

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN- 188
           L  A A V++    V   RE+++S PDQ I+ + +    G +S    L+       YV+ 
Sbjct: 120 LERAVASVEFLHRGVRHRREYYASCPDQAIIVRATADRPGQISLTARLERA--RWRYVDA 177

Query: 189 ----GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
               G + I M G            A+   +G+ F+A +  +     G++ A+  + L V
Sbjct: 178 TGRSGTDAIYMTG------------ASGGAEGVSFAAAVTARTEG--GSLDAI-GEHLVV 222

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
           E +D   L++ A++SF          +K+P +  ++  +++      + Y RH+ DY++L
Sbjct: 223 EHADSVTLVISAATSF---------REKEPLAHCLAHARTVCAAPDDERYARHVRDYREL 273

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLIS 363
           F RVS+ L             +E    +P  ER++  +  +EDP+L  L FQ+GRYLLI+
Sbjct: 274 FGRVSLALG-----------GDEERSVLPVPERLERLRKGEEDPALAALYFQYGRYLLIA 322

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSRPG+  ANLQGIWN+   P WDS   +NIN +MNYW +  C L EC EPLFD +  L 
Sbjct: 323 SSRPGSLPANLQGIWNDHFLPPWDSKYTININAQMNYWPAESCALPECHEPLFDLIERLR 382

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
             G +TA+V Y   G+  HH TDIWA ++     +  + WP+G AWLC HLWEHY +T D
Sbjct: 383 EPGRRTARVMYGCRGFAAHHNTDIWADTAPQDTYIPASYWPLGAAWLCLHLWEHYRFTQD 442

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
             FLE R+   ++  A F++D+L+EG  G L T PS SPE+ ++ P+G+   +    TMD
Sbjct: 443 LPFLE-RSLETMKEAARFVMDYLVEGPSGELVTCPSVSPENSYVLPNGETGVLCAGPTMD 501

Query: 544 MAIIREVFSAIISAAEVL-----EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             IIR + SA + A  VL     + +++A + +    L RL   KI + G+I EW +
Sbjct: 502 TQIIRALLSACVEAERVLSDRTGKASDEAFIREAELVLKRLPKEKIGKLGTIQEWYE 558


>gi|312621675|ref|YP_004023288.1| alpha-L-fucosidase [Caldicellulosiruptor kronotskyensis 2002]
 gi|312202142|gb|ADQ45469.1| Alpha-L-fucosidase [Caldicellulosiruptor kronotskyensis 2002]
          Length = 752

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 233/601 (38%), Positives = 335/601 (55%), Gaps = 50/601 (8%)

Query: 2   MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           MN++S      LKI F+ PA  + +A+PIGNG LGAM++GGV  ET++LNE+++W+  P 
Sbjct: 1   MNSQS------LKIIFDKPASCWEEALPIGNGSLGAMIYGGVEYETIQLNEESIWSCGPR 54

Query: 62  DYTNPDAPKALSDVRSLVDSGQYAEATAASV-KLFG--HPADVYQLLGDIELEFDDSHLK 118
              NPDA K L ++R  +  G    A   SV  L G  H    Y+ LG +++ F+     
Sbjct: 55  RRENPDAIKYLPEIRKSILEGNIKRAEELSVFALSGTPHSQGNYEPLGYLDIYFEGIEAD 114

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL----SFN 174
             E  Y R LD++ AT +V++ V ++ + + +FSS PD+VIV KI  ++ G+L     F 
Sbjct: 115 KVER-YTRYLDISNATCKVEFDVDDIRYEKIYFSSYPDKVIVVKICCNKKGALFLRAKFR 173

Query: 175 VSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
                 +D    V+ N++I +E      R            G+ FSA+L+  +S D G +
Sbjct: 174 REYQEDIDRCGRVD-NDKIFIECSAGSGR------------GVSFSAVLK-AVSKD-GDV 218

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
             + D  L V+ +   VLL+ +++S+           KD  +  +  L+      + +LY
Sbjct: 219 YTIGDN-LFVKDATEVVLLITSTTSYKA---------KDYFNWCVKTLEQASKHDFEELY 268

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
            RH +DY+ LF RV   +     +  T+  + E I+ +   ER K      D  L+ LLF
Sbjct: 269 KRHTEDYKSLFDRVEFYIDTENTNKRTELTTPERINLL--KERYK------DEELIVLLF 320

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           QFGRYLLISSSRPG    NLQGIWN+++ P W S   +NINL+MNYW +  CNLSEC  P
Sbjct: 321 QFGRYLLISSSRPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEVCNLSECHMP 380

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
           LFD L  +  NG  TAQ  Y   G+  HH TDIW  ++     +    WPMG AWLC H+
Sbjct: 381 LFDLLEKMYENGKITAQRMYGCRGFCAHHNTDIWGDTAPQDIYIPATYWPMGAAWLCLHI 440

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
            +HY YT D DFL K+ Y L+   A FLLD+LIE  +GYL T PS SPE+ +   +G + 
Sbjct: 441 LDHYEYTGDLDFL-KKYYYLMREAALFLLDYLIEDKNGYLVTCPSCSPENSY-KLNGDVY 498

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            ++Y  TMD+ II  +F  I  A +VL+ N D +VEK+  +L +L P KI + G I EW+
Sbjct: 499 SMTYMPTMDIQIITALFDKIKKANDVLKLN-DEIVEKIEYALNKLPPLKIGKYGQIQEWI 557

Query: 595 Q 595
           +
Sbjct: 558 E 558


>gi|312135763|ref|YP_004003101.1| alpha-L-fucosidase [Caldicellulosiruptor owensensis OL]
 gi|311775814|gb|ADQ05301.1| Alpha-L-fucosidase [Caldicellulosiruptor owensensis OL]
          Length = 752

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 225/595 (37%), Positives = 333/595 (55%), Gaps = 46/595 (7%)

Query: 9   TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
           ++  LKI F+ PA  + +A+PIGNG LGAM++GGV  ETL+LNE+++W+  P    NPDA
Sbjct: 2   SSQNLKIIFDKPASCWEEALPIGNGSLGAMIYGGVEYETLQLNEESIWSCGPRRRENPDA 61

Query: 69  PKALSDVRSLVDSGQYAEATAASV-KLFG--HPADVYQLLGDIELEFDDSHLKYAEETYR 125
            K L  +R  +  G    A   SV  L G  H    Y+ LG +++ F+       E+ Y 
Sbjct: 62  LKYLQVIRKSILEGNIKRAEELSVFALSGTPHSEGNYEPLGYLDIYFEGVKTDKVEK-YT 120

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL----SFNVSLDSLL 181
           R LD++ AT +V+++V ++ + + +FSS PD+VIV KI  S+ G++     F       +
Sbjct: 121 RYLDISNATCKVEFNVDDIRYEKTYFSSYPDKVIVVKICCSKKGAIFLRAKFRREYQEDI 180

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
           D    V+ N++I  E      R            G+ FSA+L+  +S D G +  + D  
Sbjct: 181 DRCGRVD-NDKIFFECSAGSGR------------GVSFSAVLK-AVSKD-GDVYTIGDN- 224

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L V+ +   +LL+ +++S+          +KD  +  +  L+ +    + +LY RH +DY
Sbjct: 225 LFVKNATEVMLLITSTTSY---------KEKDYFNWCLKTLEQVSKHDFEELYKRHTEDY 275

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYL 360
           + LF RV   +         DT +  N   + + ER+   +   +D  L+ LLFQFGRYL
Sbjct: 276 KSLFDRVEFYI---------DTANTNNRIELTTPERINLLKEGYKDEELIVLLFQFGRYL 326

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSRPG    NLQGIWN+++ P W S   +NINL+MNYW +  CNLSEC   LFD L 
Sbjct: 327 LISSSRPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEVCNLSECHMSLFDLLE 386

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            +  NG  TAQ  Y   G+  HH TDIW  ++     +    WPMG AWLC H+W+HY Y
Sbjct: 387 KMYENGKITAQRMYGCRGFCAHHNTDIWGDTAPQDIYIPATYWPMGAAWLCLHIWDHYEY 446

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
           T D DFL K+ Y L+   A FLLD+LIE  +GYL T PS SPE+ +   +G +  ++Y  
Sbjct: 447 TGDLDFL-KKYYYLMREAALFLLDYLIEDENGYLVTCPSCSPENSY-KLNGDVYSLTYMP 504

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           TMD+ +I  +F  +  A ++L+ N D +VEK+  +L +  P KI + G I EW++
Sbjct: 505 TMDIQVISALFEKVKKANDILKLN-DEIVEKIEYALNKFPPIKIGKYGQIQEWIE 558


>gi|392965675|ref|ZP_10331094.1| alpha-L-fucosidase [Fibrisoma limi BUZ 3]
 gi|387844739|emb|CCH53140.1| alpha-L-fucosidase [Fibrisoma limi BUZ 3]
          Length = 846

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 222/600 (37%), Positives = 325/600 (54%), Gaps = 31/600 (5%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
           PL I +  PA+++ +A+P+GNGRLGAMV+G V  E ++LNE +LW+G P +   NP A  
Sbjct: 22  PLTIWYRQPARNWNEALPVGNGRLGAMVFGRVNDELIQLNEASLWSGGPVNLNPNPGAAT 81

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            L  VR  +    Y EA      + G   + YQ LGD+ +      L      Y R L++
Sbjct: 82  YLPQVREALFREDYKEADKLVRNMQGLYTEAYQPLGDLTIR---QILTGEPADYYRNLNI 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A+A  ++  G V +TRE F S PDQVIV ++   + G L+  +   S       V   
Sbjct: 139 TEASATTRFKSGGVGYTREIFVSAPDQVIVIRLRADQKGKLNVTLGTRSPHPISKVVVSR 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKK 241
           +++ M G+ P    P   N N  P         +G +F   L++K +D +    A +   
Sbjct: 199 DELAMRGKSPAHADPNYVNYNKVPVRYTDSSGCRGTRFDLRLKVKSTDGQ---VATDTAG 255

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           +++  +  AV+ L A++SF+G    P    K+    + S L      S   +   H+ DY
Sbjct: 256 IRITNATEAVVYLSAATSFNGFDKCPDKDGKNEIQLAQSYLNKALAKSPDAIRKAHVADY 315

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYL 360
           Q+  +RVS  L+        D  +  N  ++P  ER+  +   E DP+L  L FQFGRYL
Sbjct: 316 QRYLNRVSFTLN--------DAQTPGNPASLPMDERLMRYAGGEPDPALETLYFQFGRYL 367

Query: 361 LISSSRPGTQVA-NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LISSSRPGT +A NLQGIWN  + P W S    NIN +MNYW +   NLSE   PL D +
Sbjct: 368 LISSSRPGTGIAANLQGIWNPMVRPPWSSNYTTNINAQMNYWPAEMTNLSEFHRPLIDQI 427

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLW 475
            + ++ G  TA+  Y A GW +HH +DIWA S+      +G  +WA W MGGAWL  HLW
Sbjct: 428 KHAAVTGKATAKNFYGAGGWTVHHNSDIWAASNPVGDLGKGGPMWANWSMGGAWLAQHLW 487

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           EHY +T DR +L++ AYPL++  A F +DWL+E   G+L T P+TSPE+ F+   G    
Sbjct: 488 EHYAFTGDRTYLKQTAYPLMKDAAQFCVDWLVEDKQGHLVTAPATSPENVFVTEKGDKES 547

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           VS ++TMDM +I ++FS +I A+E L  + D   + + +   +L P +I   G++ EW +
Sbjct: 548 VSVATTMDMGLIWDLFSNVIEASEHLGIDVD-FRKMLTEKKSKLFPLQIGRKGNLQEWYK 606


>gi|295132871|ref|YP_003583547.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
 gi|294980886|gb|ADF51351.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
          Length = 819

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 215/597 (36%), Positives = 333/597 (55%), Gaps = 43/597 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKALSDV 75
           ++ PA+ + +A+P+GNG++GAMV+G V  E ++LNE +L++G P     NPDA   L  +
Sbjct: 28  YDAPAREWVEALPLGNGKIGAMVFGRVTDELIQLNESSLYSGGPVPQRINPDAASYLQPL 87

Query: 76  RSLVDSGQYAEATAASVKLFGHPADVYQLLGDI----ELEFDDSHLKYAEETYRRELDLN 131
           R  +    YA+AT  + K+ G+    Y  +GD+    +L+ D  H       Y+R L++ 
Sbjct: 88  REAIFDKDYAQATLLAKKMQGYYTQSYMPMGDLLLHQDLQNDSVH------AYKRSLNIE 141

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
            A     +    V +TRE F+S PD V+V K++   + +L+ N+S +S L     V  N 
Sbjct: 142 NAITTTSFESDGVNYTREFFTSAPDNVLVMKLTADSAKALNLNLSAESQLRAEVSVTKNQ 201

Query: 192 QIIMEGRCPGKRIPPKANAN-------DDPKG---IQFSAILEIKISDDRGTISALEDKK 241
           ++++ G+ P    P   N         DDP+G   ++F   +++  +D + T    +D  
Sbjct: 202 ELVVSGKAPANVNPNYYNPEGVEPITYDDPEGCDGMRFQYRIKVLKTDGKLTT---QDTS 258

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L +  +   V+LL A++SF+G    P     D    +   +Q+    SY+ L + H+ D+
Sbjct: 259 LAIADASEVVILLTAATSFNGFDKCPDKDGLDEAKLASEFMQAASAKSYAQLKSDHIADF 318

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYL 360
                RV++ L ++PKD +            P+  R+K++ +   DP L  L FQ+GRYL
Sbjct: 319 STYMQRVALDLGKTPKDQLDQ----------PTDSRLKAYSEGANDPELEALYFQYGRYL 368

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           L+S+SRPG   ANLQGIWN+++ P W S    NIN EMNYW +   NLSE  +P   ++ 
Sbjct: 369 LVSASRPGGIAANLQGIWNKEMRPPWSSNYTTNINAEMNYWPAETTNLSEMHQPFLAYIQ 428

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSS--ADR--GKVVWALWPMGGAWLCTHLWE 476
             ++ G + A+  Y A GWV+HH +DIWA ++   DR  G  +WA W MGG WL  HLWE
Sbjct: 429 NAAVTGGRVAKEFYDAPGWVVHHNSDIWATANPVGDRGDGDPLWANWYMGGNWLTLHLWE 488

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           HY +T D  +L  + YP+++  A F LDWL+E HDG L T PSTSPE+ F+  +GK   V
Sbjct: 489 HYAFTQDTSYL-AQVYPVMKEAAVFTLDWLVE-HDGKLITAPSTSPENLFLV-NGKGYAV 545

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +  +TMD+AIIRE+F+  I A+++L K  D    ++  +  RL P +I   G + EW
Sbjct: 546 TEGATMDIAIIRELFNNTIKASKILGKEAD-FRHELSAAQDRLIPYQIGAKGQLQEW 601


>gi|325915867|ref|ZP_08178165.1| hypothetical protein XVE_2098 [Xanthomonas vesicatoria ATCC 35937]
 gi|325537923|gb|EGD09621.1| hypothetical protein XVE_2098 [Xanthomonas vesicatoria ATCC 35937]
          Length = 776

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 223/596 (37%), Positives = 330/596 (55%), Gaps = 46/596 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           + + T+ L++ +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+
Sbjct: 24  AVAPTDALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTS 83

Query: 66  PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
           P+   AL  VR+L+  G+YAEA   A  KL   P     YQ LGD+ L+FD +       
Sbjct: 84  PEGLAALPQVRALIFGGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GIS 140

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            YRR+LDL+TA A   +  G     R+ F     Q IV ++S     ++S  V +DS   
Sbjct: 141 EYRRQLDLDTAVATTSFRSGGALHQRDVFVCAQSQCIVVRLSCDRPRAISLRVGIDSPQS 200

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD--DRGTISALEDK 240
               V     ++  GR            N    GI+      +++      G ++AL D+
Sbjct: 201 GEVTVE-QGGLLFTGR------------NGSFAGIEGKLRFALRVVPRVKGGAVTALRDR 247

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            L++EG+D  VLLL A++S+     +  D   DP + + ++L+  + L Y+ L   HL D
Sbjct: 248 -LRIEGADEVVLLLTAATSYR--RFDAVDG--DPLALAAASLRKAQALDYAALLRAHLAD 302

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           +Q+LF RV+I L  S            +   +P+ +RV+ F    DP+L  L  Q+GRYL
Sbjct: 303 HQRLFRRVAIDLGTS------------DAAALPTDQRVRQFAGGNDPALAALYHQYGRYL 350

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LI SSRPGTQ ANLQGIWN+ + P W+S   +N+N EMNYW S    L EC EPL   + 
Sbjct: 351 LICSSRPGTQPANLQGIWNDLMQPPWESKYTINVNTEMNYWPSEANALHECVEPLESMVF 410

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            L+I G+ TA+  Y A GWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y
Sbjct: 411 DLAITGAHTARALYGAPGWVVHNNTDLWRQAGPIDG-AKWSLWPMGGVWLLQQLWDRWDY 469

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
             DR +L K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C    
Sbjct: 470 GRDRAYLSK-IYPLFKGAAEFFVATLVRDPQTGAMVTNPSISPENQH--PFGAAICA--G 524

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            TMD  ++R++F+  I+ +++L+ +  AL +++     +L P +I + G + EW Q
Sbjct: 525 PTMDAQLLRDLFAQCIAMSKLLDVDA-ALAQQLATLREQLPPNRIGKAGQLQEWQQ 579


>gi|334137826|ref|ZP_08511252.1| hypothetical protein HMPREF9413_0062 [Paenibacillus sp. HGF7]
 gi|333604667|gb|EGL16055.1| hypothetical protein HMPREF9413_0062 [Paenibacillus sp. HGF7]
          Length = 852

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 214/559 (38%), Positives = 313/559 (55%), Gaps = 42/559 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +  PA+ +T+A+P+GNGRLGAM++G V  E + LNE++LW G P D TNP+A  AL 
Sbjct: 5   KLWYIKPAQAWTEALPVGNGRLGAMIFGRVEEELISLNEESLWYGGPKDRTNPEAAAALL 64

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           ++R L+  G+  EA   A + L   P  A  YQ LGD+ + F +        TYRRELDL
Sbjct: 65  EIRRLLLEGRVTEAQELAHMGLTPIPKYAGPYQPLGDLRIWFAEHEPDAG--TYRRELDL 122

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVNG 189
            T   RV+Y+      TRE F+S P  V+  +++ +    L+F   L     D  +  +G
Sbjct: 123 ATGLCRVEYAWQGASCTRELFASAPAGVLACRLTTAHPEGLTFRFHLGRRPFDEGAAPDG 182

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
            + ++M+GRC              P G++++A+    +S + GT+  + D  + V G+  
Sbjct: 183 PHAVLMQGRC-------------GPDGVRYAAL--ASVSPEGGTVRTIGDF-VHVAGAAE 226

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A + + A +SF           +DP +     ++  R   Y  +   H  DY  LF R+S
Sbjct: 227 ATIYVAAQTSF---------RHEDPAAACRRQVEEARRKGYEAVKAEHGADYMPLFARMS 277

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
           ++L     DI            +P+ ER+ +  +  EDP L+ L FQ+GRYLL++SSRPG
Sbjct: 278 LELGTPGADI----------RLLPTDERLDRVREGGEDPELLALFFQYGRYLLLASSRPG 327

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           T  ANLQGIWN D  P W+    +NINL+MNYW +  CNL EC EPLFDF+  L  NG +
Sbjct: 328 TLPANLQGIWNADYQPPWECNYTLNINLQMNYWPAEVCNLRECHEPLFDFIDRLVANGRE 387

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           TA+  Y   G+V HH +++WA+S  +      A+WPMGG WL  HLWEHY +  DR FL+
Sbjct: 388 TARKLYGCRGFVAHHNSNLWAESGINGMLPRAAVWPMGGVWLALHLWEHYRFGGDRHFLD 447

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
           +RAYP+++  A FLLD++ E   G L T PS SPE++++ P GK   +  +  MD+ + R
Sbjct: 448 RRAYPVMKEAALFLLDYMTEDGKGGLLTGPSVSPENKYVLPGGKSGYLCMAPAMDIQLAR 507

Query: 549 EVFSAIISAAEVLEKNEDA 567
            +F A+  AA VL     A
Sbjct: 508 TLFGAVREAAAVLACERGA 526


>gi|261406666|ref|YP_003242907.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
 gi|261283129|gb|ACX65100.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
          Length = 775

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 222/590 (37%), Positives = 317/590 (53%), Gaps = 48/590 (8%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           I F+ PA+++ +A+PIGNGRLG MV+G    E ++ NED++W G P D  NPDA + L  
Sbjct: 9   IWFDQPAQNWNEALPIGNGRLGGMVFGCAQQEKIQFNEDSVWYGGPRDRNNPDALRHLPL 68

Query: 75  VRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           +R L+  G+  EA   S   F G P     Y   GD  ++ D  H +     YRRELDL 
Sbjct: 69  IRKLLFEGRLKEAHRLSETAFSGTPRSQRPYLTAGDFCIQVD--HPQGELSHYRRELDLE 126

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS---YVN 188
            A A   Y  G V FTRE F S PDQV+V ++     G L+     +     H    + +
Sbjct: 127 KAIAVTSYQYGGVTFTREVFCSYPDQVMVIRLEADRPGVLTLTARFERQKGKHMDAVHRH 186

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
           G + ++M   C GK             G+ +SA  +   +   GT+  +  + L V+ +D
Sbjct: 187 GTDTVVMTNDCGGK------------DGLTYSAAAKAITAG--GTVRVV-GEHLLVDQAD 231

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
             V++L A+S+F            DP       L+   N  Y+ L  RH+ DYQ LF RV
Sbjct: 232 EVVIILAAASTF---------RVDDPKLRCAELLEHAANQGYAALKKRHIADYQPLFERV 282

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED-PSLVELLFQFGRYLLISSSRP 367
            + L R+P D        +    +P+ +R++  +  ED   L  L F FGRYLLI+ SRP
Sbjct: 283 KLDL-RAPAD--------QERHLLPTPKRLERVRAGEDDAGLYTLYFHFGRYLLIACSRP 333

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           G+  ANLQGIWN+ ++P WDS   +NIN +MNYW +  CNLSEC EPLF+ +  +  NG 
Sbjct: 334 GSLPANLQGIWNDSMAPPWDSKFTININTQMNYWPAESCNLSECHEPLFELIERMRDNGR 393

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
            TA+  Y   G+V HH TDIWA ++          W MG AWL  HLWEHY +  + DFL
Sbjct: 394 VTARTMYGCRGFVAHHNTDIWADTAPQDIYPPATQWVMGAAWLTLHLWEHYKFNPNPDFL 453

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
            KRAY  ++  A F  D+L+E  +GYL TNPS SPE+ ++  +G+   + Y  +MD  II
Sbjct: 454 -KRAYETMKEAALFFTDFLVESPEGYLVTNPSVSPENRYLLRNGESGTLCYGPSMDTQII 512

Query: 548 REVFSAIISAAEVLEKNEDALVE--KVLKSLPRLRPTKIAEDGSIMEWVQ 595
            E++SA I A+  L+ +E+A  E   ++  LP +   K+   G + EW++
Sbjct: 513 SELYSACIQASLELDIDENARQEWAAIMDRLPEM---KVGRHGQLQEWLE 559


>gi|325923835|ref|ZP_08185445.1| hypothetical protein XGA_4498 [Xanthomonas gardneri ATCC 19865]
 gi|325545691|gb|EGD16935.1| hypothetical protein XGA_4498 [Xanthomonas gardneri ATCC 19865]
          Length = 795

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 229/594 (38%), Positives = 330/594 (55%), Gaps = 42/594 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           + +  + L++ +  PA  +  A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+
Sbjct: 43  AAAAGDALQLWYREPANEWVQALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDATS 102

Query: 66  PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
           PDA  AL  VR+L+ +G+YAEA A A  K+   P     YQ LGD+ L+FD +       
Sbjct: 103 PDALAALPQVRALIFAGRYAEAEALADAKMLSRPLKQMPYQPLGDLLLDFDRAD---GIS 159

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            YRR+LDL+T      +  G     RE F S   Q IV ++S     ++S  V +DS   
Sbjct: 160 EYRRQLDLDTGVVTTTFRSGGAVHKREVFVSAQSQCIVVRLSCDRPRAISLRVGIDSPQT 219

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
               V     ++  GR         + A  D K ++F+  +  +I    GT+S L D+ L
Sbjct: 220 GEVTVE-QGGLLFSGRN-------GSFAGIDGK-LRFALRVLPQIKG--GTVSDLRDR-L 267

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
           ++EG+D  VLLL A++S+     +  D   DP + + ++L+    L Y+ L   HL D+Q
Sbjct: 268 RIEGADEVVLLLTAATSYQ--RFDAVDG--DPLALTAASLKKAGKLDYTALLRAHLADHQ 323

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           +LF RV+I L  S                +P+ ERV++F    DP+L  L  QFGRYLLI
Sbjct: 324 RLFRRVAIDLGTS------------EAAKLPTDERVQAFAKGNDPALAALYHQFGRYLLI 371

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
            SSRPG+Q ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L  L
Sbjct: 372 CSSRPGSQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLESMLFDL 431

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           +  G+ TA+  Y A GWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y  
Sbjct: 432 AKTGAHTARAMYDAPGWVVHNNTDLWRQAGPIDG-AKWSLWPMGGVWLLQQLWDRWDYGR 490

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           DR +L K  YPL +G A F +  L++    G + TNPS SPE++   P     C     T
Sbjct: 491 DRAYLGK-IYPLFKGAAEFFVATLVKDPQTGAMVTNPSISPENQH--PFNAALCA--GPT 545

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           MD  ++R++F+  I+ +++L K +DA  + +     +L P +I + G + EW Q
Sbjct: 546 MDAQLLRDLFAQCIAMSKLL-KVDDAFAQHLSTLREQLPPNRIGKAGQLQEWQQ 598


>gi|338214785|ref|YP_004658848.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
 gi|336308614|gb|AEI51716.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
          Length = 835

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 223/597 (37%), Positives = 327/597 (54%), Gaps = 35/597 (5%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKALS 73
           I +  PA+++ +A+P+GNGRLG M +G V  E L+LNE+TLW+G P +   NPDA K L 
Sbjct: 24  IHYKQPARNWNEALPVGNGRLGVMTFGRVNEELLQLNEETLWSGGPVEKNPNPDALKHLP 83

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            VR  ++   Y  A+    K+ G   + YQ LGD+ ++      +     Y R+LDL  A
Sbjct: 84  AVREALNREDYEMASKELQKIQGLYTEAYQPLGDVLIK---QPFEAQPTAYFRDLDLQNA 140

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
           TA  ++++  V ++RE F S PDQVIV +++ S+ G L+F+ S  S       + G N++
Sbjct: 141 TAHTQFTIEGVTYSRELFVSAPDQVIVLRLTASQKGKLNFSASTRSPHPFLKQITGKNEL 200

Query: 194 IMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLKV 244
            M G+ P    P   N N  P         KG++F   ++++ +D  G ++A +   + +
Sbjct: 201 SMRGKAPAHADPNYVNYNAKPVYYEDPSGCKGMRFDWRVKVQTTD--GKVTA-DTSGISI 257

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
             +  A+LL+ A++SF+G    P    +D  +   + L+     S   +   H+ DY+K 
Sbjct: 258 SNATEAILLVTAATSFNGFDKCPDSQGRDEKALVEAYLKRASAKSMDLIRKAHIADYRKY 317

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLIS 363
           F RV + L +S +              +P   R+  + Q   DP L  L F FGRYLLIS
Sbjct: 318 FDRVKLTLGQSGEAA-----------HLPMDARLARYAQLGNDPELEALYFDFGRYLLIS 366

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSRPG   ANLQGIWN    P W S    NIN EMNYW +   NLSE      D++   +
Sbjct: 367 SSRPGGIPANLQGIWNPMTRPPWSSNYTTNINAEMNYWPAEVANLSELHTTFTDWIAGAA 426

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSS--ADRGK--VVWALWPMGGAWLCTHLWEHYN 479
             G +TA+  Y   GW +HH +DIW  S+   D+GK    WA W MGGAWL  HLWEHY 
Sbjct: 427 ATGRETAKNFYGMKGWTVHHNSDIWGASNPVGDKGKGSPSWANWAMGGAWLSQHLWEHYV 486

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           Y+ D  +L+  AYPL+   A F LDWL++   G   T+PSTSPE+ FI   G    VS +
Sbjct: 487 YSGDEKYLKNYAYPLMRDAAQFCLDWLVKDAGGNWITSPSTSPENVFITEKGITQAVSVA 546

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEWVQ 595
           +TMDMA++ +VF+ +I A+E L+   DA + K L+  +  L P +I + G++ EW +
Sbjct: 547 TTMDMALVYDVFTNVIHASEHLKV--DAELRKTLEDRVQHLFPLQIGKKGNLQEWYK 601


>gi|268608709|ref|ZP_06142436.1| hypothetical protein RflaF_04322 [Ruminococcus flavefaciens FD-1]
          Length = 772

 Score =  374 bits (959), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 219/597 (36%), Positives = 333/597 (55%), Gaps = 55/597 (9%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           + +N PA +F +A+P+GNGR+GAM++G    E + LNED++W+G      NPDA + L +
Sbjct: 7   LRYNDPAANFNEALPLGNGRIGAMIYGDAAFEKIPLNEDSVWSGGLRHRVNPDAAEGLEE 66

Query: 75  VRSLVDSGQYAEATAASV-KLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           VR L+  G   EA   +  KL G   ++  Y  LGD+ ++ +   L      Y R LD+ 
Sbjct: 67  VRRLIKEGNIPEAERIAFDKLQGVTPNMRRYMPLGDLHIDLE---LSGRARNYNRRLDIG 123

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
            A A V ++V +V + +E+F S PD+V+  +IS +E G ++ +          +Y++G  
Sbjct: 124 NAVADVTFTVNDVLYRKEYFISAPDEVMAVRISCAERGMINLS----------AYIDGRE 173

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
               + R  GK +      +    GI F+A+L  K     G+I  L   ++ VE +D  +
Sbjct: 174 DYYDDNRPCGKNMILFTGGSGSRDGIFFAAVLGAKARG--GSIRTL-GGRIAVEKADEVI 230

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           L+    +SF G      + +K    ++  AL++     Y +L   H++DY+ +F RV   
Sbjct: 231 LIFSVRTSFYG-----DNYEKSALIDAEMALKT----EYDELRLHHVNDYKDMFDRVDFS 281

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-----------DPSLVELLFQFGRYL 360
           L  +         +EEN+D + +AER+K  + DE           D  L+EL F FGRYL
Sbjct: 282 LCDN---------TEENLDRLDTAERIKRLKGDELDNKDCERLIHDNKLIELYFNFGRYL 332

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           +IS+SRPGTQ  NLQGIWNE++   W S   VNIN EMNYW +  CNLSEC  PLFD L 
Sbjct: 333 MISASRPGTQPMNLQGIWNEEMIAPWGSRYAVNINTEMNYWPAESCNLSECHLPLFDLLE 392

Query: 421 YLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
            +  NG  TA+  Y +  G+V HH TDIW  ++     V   LWP GGAWL  H++EHY 
Sbjct: 393 RVCENGHITAREMYGVNKGFVCHHNTDIWGDTAPQDMWVPGTLWPTGGAWLALHIFEHYE 452

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           YT+D++FL ++ Y +L+  A F  ++LIE   G L T PS SPE+ +  PDG   C+   
Sbjct: 453 YTLDKEFLAEK-YHILKQAAEFFTEFLIEDESGMLVTCPSVSPENTYKLPDGTKGCLCMG 511

Query: 540 STMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            +MD  II  +F+ +I AAE+L+K++   A ++++LK +P+    ++ + G I EW+
Sbjct: 512 PSMDSQIITVLFTDVIRAAEILDKDKTFAAKLKRMLKKIPQ---PEVGKYGQIKEWL 565


>gi|374296937|ref|YP_005047128.1| hypothetical protein [Clostridium clariflavum DSM 19732]
 gi|359826431|gb|AEV69204.1| hypothetical protein Clocl_2638 [Clostridium clariflavum DSM 19732]
          Length = 742

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 223/588 (37%), Positives = 324/588 (55%), Gaps = 48/588 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +  PA  + +A+PIGNGR+GAM++G + +E ++LNED++W G   D  NPDA K L 
Sbjct: 3   KLWYTKPAGCWEEALPIGNGRMGAMIFGSIETEHIQLNEDSVWYGAFVDRNNPDALKNLP 62

Query: 74  DVRSLVDSGQYAEATAASV-KLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
            +R L+  GQ  EA    V  L G P     YQ LGD+ + F    ++  +  Y R L L
Sbjct: 63  KIRELIIKGQIPEAEELMVYALSGIPQSQRPYQSLGDLTIRFKG--MEGDKSGYIRCLSL 120

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVN 188
           + A   VK  V    + RE F S  D V+V +I+      +SF+  L  +   D    V 
Sbjct: 121 DDAIHTVKVKVAENTYKRETFLSAADDVLVMRITSDGDKKISFSALLTRERFYDRVIKV- 179

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
           G + ++++G             N    G+ F  ++ +K   + G+   +  + L V  +D
Sbjct: 180 GQDAVMLDG-------------NLGKGGLDF--VMMLKAVAEGGSCDVV-GEHLIVNDAD 223

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
              LL  A ++F   F N  +  K         L    N SY DL  RH++DY  L++RV
Sbjct: 224 AVTLLFTAGTTFR--FQNLKEQLK-------KILNDAANRSYDDLRKRHVEDYMSLYNRV 274

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
           S +L+ +           E  + + + ER+K  +  E D  L +L F FGRYLLIS SR 
Sbjct: 275 SFELNGT-----------EKYEELTTEERLKKAKEGEVDKGLAKLYFDFGRYLLISCSRE 323

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           G+  ANLQG+WN+D++P WDS   +NIN +MNYW +  CNLSEC +PLFD +  +  NG 
Sbjct: 324 GSLPANLQGVWNKDMNPAWDSKYTININTQMNYWPAEVCNLSECHKPLFDLIKRMVPNGQ 383

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           KTA+  Y   G+V HH TDIW  ++     +  + W MG AWLCTHLW HY YT D+DFL
Sbjct: 384 KTARTMYNCRGFVAHHNTDIWGDTAVQDHWIPASYWVMGAAWLCTHLWMHYEYTQDKDFL 443

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
            K A+P++     F LD+LIE   GYL+T PS SPE+ +I P+G    V+  +TMD  I+
Sbjct: 444 -KEAFPIMREAVLFFLDFLIE-DKGYLKTCPSVSPENTYILPNGVQGSVTIGATMDNQIL 501

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           R++FS  I AAE+L +  D +   + +++ +L PT+I   G+IMEW +
Sbjct: 502 RDLFSQCIKAAEIL-RVCDQMNRDIEETVKKLEPTRIGSRGNIMEWTE 548


>gi|198277528|ref|ZP_03210059.1| hypothetical protein BACPLE_03750 [Bacteroides plebeius DSM 17135]
 gi|198270026|gb|EDY94296.1| hypothetical protein BACPLE_03750 [Bacteroides plebeius DSM 17135]
          Length = 809

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 214/596 (35%), Positives = 327/596 (54%), Gaps = 31/596 (5%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-DYTNPDAPKA 71
           L + +N PA+ F +A+ IGNG +GA+++GG   + L LN+ TLWTG P    T P+A KA
Sbjct: 32  LVLHYNRPAEFFEEALVIGNGTMGAILYGGTDKDVLSLNDITLWTGEPDRKVTTPNAYKA 91

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           + ++R+L+D   Y  A  A  K+ GH ++ YQ LG + + +     K +   Y+R LD++
Sbjct: 92  IPEIRALLDKEDYRGADRAQRKVQGHYSENYQPLGQLSITYSAEPAKVSH--YQRTLDIS 149

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
            A AR  Y     +F  ++F+S PD VIV ++    +  L   +S +SLL + +  NGN 
Sbjct: 150 RAMARTAYQRNGADFACDYFASAPDSVIVLRLQTESTEGLQATLSFNSLLPHATTANGN- 208

Query: 192 QIIMEGRCPGKRIP------PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
           +I  EG       P         +  D  +G  F  +  I++   +  + +    +LKV+
Sbjct: 209 EISAEGYAAYHSYPVYFDGVNNKHLYDPERGTHFRTL--IRVIAPQSEVKSFPSGELKVK 266

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G   A++L+   +SF+G   +P    +D  +     ++     ++ +L   H+ DY+  F
Sbjct: 267 GGKEALILIANVTSFNGFDKDPMKEGRDYRNLVTRRMERAAQKTFEELENAHVADYKSFF 326

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF--QTDEDPSLVELLFQFGRYLLIS 363
            RV + L ++          ++ I  +P+ E++  +  ++  +P L  L FQ+GRYLLIS
Sbjct: 327 DRVELHLGKT----------DQAIAALPTDEQLLQYTDKSQRNPELEALYFQYGRYLLIS 376

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSR     ANLQG+WNE L P W      NINLE NYW +   NLSE   PL DF+  L 
Sbjct: 377 SSRTPGVPANLQGLWNERLLPPWSCNYTSNINLEENYWAAETANLSEMHRPLMDFIANLQ 436

Query: 424 INGSKTAQVNY-LASGWVIHHKTDIWAKS---SADRGKVVWALWPMGGAWLCTHLWEHYN 479
             G ++A+  Y +  GW +   TDIWA +     + G   WA W MGGAWL TH+WE Y 
Sbjct: 437 HTGEESAKAYYGVQKGWCLGQNTDIWAMTCPVGLNVGDPSWACWTMGGAWLSTHIWERYT 496

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           +T D++FL+K  YP+L+G A F L+WLIE  DG L T+P TSPE++F+ PDG     SY 
Sbjct: 497 FTQDKEFLQKY-YPVLKGAAEFCLNWLIE-KDGKLITSPGTSPENKFLTPDGYAGATSYG 554

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            T D+A+ RE       AAE L  ++D   +++ K+LPRL P ++ + G++ EW  
Sbjct: 555 CTSDLAMTRECLIDAAKAAEALGTDKD-FRKQIEKTLPRLLPYQVGKKGNLQEWFH 609


>gi|372210566|ref|ZP_09498368.1| alpha-L-fucosidase [Flavobacteriaceae bacterium S85]
          Length = 793

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 218/589 (37%), Positives = 325/589 (55%), Gaps = 39/589 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + +N P+  + DA+P+GNGRLGAMV+GG   E ++ NE+TLW+G P DY N  A K+L
Sbjct: 30  LTLWYNQPSNTWNDALPVGNGRLGAMVYGGKTKEVIQFNEETLWSGQPHDYVNRRAFKSL 89

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELD 129
           + +++ +  G+  EA   A+ K   +P +   YQ   ++ ++F + H    +  Y+R LD
Sbjct: 90  AKIKNSLWDGKRKEAEEIANKKFMSNPINQSSYQSFANVLIDFKN-HSNVTD--YKRSLD 146

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L  A A   Y +      RE F+S+PDQVIV  ++ S  G L+F+++LDS   ++     
Sbjct: 147 LERAIASTVYKLDKAVIKREVFASHPDQVIVVHLTSSVKGILNFDITLDSNHSDYKVSIE 206

Query: 190 NNQIIMEGRCPGKRIPPKANANDDP-KGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
            N+I+++G+    +     N N  P   I+F A L++     +G     ++ K+ ++ + 
Sbjct: 207 ENEIVIKGKADNFKRDLDINKNKFPLSKIKFEARLKLV---QKGGELISKNNKVTIKNAT 263

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
                LV +++F    +N  D   +P        + + N  Y+ +   H+ D+QK F+R+
Sbjct: 264 EVTCYLVGATNF----VNFKDISGNPHKRCKEYFKKLNNKPYNLVKENHIKDFQKYFNRL 319

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
            I L             E  I   P+ ER+ SF  D DP+LV LL+Q+GRYLLISSSR G
Sbjct: 320 HIDLG------------ETKISRRPTNERLMSFSQDMDPNLVALLYQYGRYLLISSSRKG 367

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           TQ ANLQGIWN+ +SP W S   +NINLEMNYW +   NLSE  EPL   +  LS  G K
Sbjct: 368 TQPANLQGIWNDRISPPWGSKYTLNINLEMNYWITEVTNLSELSEPLIKLIDDLSNTGEK 427

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
            A+ +Y   GWV HH TDIW + +A   +    +WP GGAWL  HLW HY +T ++DFL+
Sbjct: 428 IAKEHYNMPGWVAHHNTDIW-RGAAPINRSNHGIWPTGGAWLSQHLWWHYEFTQNKDFLK 486

Query: 489 KRAYPLLEGCASFLLDWLIEGHDG--YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           K AYP+L+  + F  ++L+E  D    L + PS SPEH           +    TMD  I
Sbjct: 487 KMAYPILKKASLFFSNYLLEFPDNKELLISGPSNSPEH---------GGLVMGPTMDHQI 537

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           IR +F   I A+++L  +      K+ K + R+ P KI + G + EWV+
Sbjct: 538 IRNLFRVTIEASKILNVDR-GFRMKLEKKMNRIMPNKIGKHGQLQEWVK 585


>gi|313149824|ref|ZP_07812017.1| glycoside hydrolase family 95 [Bacteroides fragilis 3_1_12]
 gi|313138591|gb|EFR55951.1| glycoside hydrolase family 95 [Bacteroides fragilis 3_1_12]
          Length = 824

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 228/620 (36%), Positives = 330/620 (53%), Gaps = 58/620 (9%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAP 69
           N L + +  PA ++ +A+P+GNG LGAMV+G    E L+LNE TL++G P      P   
Sbjct: 25  NNLSLWYRQPAANWNEALPLGNGYLGAMVFGDAGREHLQLNESTLYSGEPFSGVGVPSIG 84

Query: 70  KALSDVRSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
              ++V +L++ G YA A     + + G  +  YQ L D+ L FD   ++   E Y REL
Sbjct: 85  SVYNEVLALLNHGDYAGAHRLITRNWQGRLSQSYQPLADLFLSFD---VQGKVENYVREL 141

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           +L  A   ++Y  G + +TRE+F SNPD+V+V +IS S    ++  VS  S         
Sbjct: 142 NLQDAVHTIRYQAGGIRYTREYFISNPDRVMVIRISASRRSPVNVAVSYTSEHPTAKVDG 201

Query: 189 GNNQIIMEGRCPG---------------------------KRIPPKANANDDP---KGIQ 218
              ++I+ G+ PG                           +R   K     D    KG+ 
Sbjct: 202 TGEELILSGQAPGCVERRTLDFLEKNRLTDRHPELFDSHGRRKTDKQVLYADEVGGKGMF 261

Query: 219 FSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSES 278
           F +   +K+     T   L+D +LKV G    +LL+ A++S++G   +PS    D  ++ 
Sbjct: 262 FQS--RVKVLKGNAT---LQDNQLKVSGEGEIILLVAAATSYNGFDRSPSQDGSDYQAKL 316

Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
            + L     L Y DL  RHL DYQ+LF RV++ L            SE++   +P+  R+
Sbjct: 317 DTILSVAGQLPYEDLKKRHLADYQRLFGRVALTLK-----------SEKDYSGLPTDRRI 365

Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
             F+ + D +L  LLFQ+GRYLLI+SSR G Q ANLQGIWN+D+ P W S+  +NIN EM
Sbjct: 366 IGFRDNPDNALAALLFQYGRYLLIASSREGGQPANLQGIWNKDVVPAWSSSYTININTEM 425

Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKV 458
           NYW +    L EC EPLF  +  L++NGS TA   Y   GW  HH T IW +S    G+ 
Sbjct: 426 NYWPAETTGLPECSEPLFRLIRELAVNGSATAAKMYNLPGWTSHHITSIWRESGPADGEP 485

Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
            W +W M   WLC HLW+HY ++ D+ FL + AYPL+   A F   WL+E  DG  +T  
Sbjct: 486 TWFMWNMSAGWLCRHLWDHYLFSGDKKFLRETAYPLMRDAARFYNAWLVE-KDGMWQTPL 544

Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE-----DALVEKVL 573
             SPE++F+ P+ K + V+ +  MDMAIIRE+FS    AA +L  +      D L+  V+
Sbjct: 545 GVSPENQFLTPEKKTSAVAPAPAMDMAIIRELFSNTAEAAAILAADSILPPADTLLLHVM 604

Query: 574 KSLPRLRPTKIAEDGSIMEW 593
            +  +L P +I + G IMEW
Sbjct: 605 GA-KQLVPYRIGKRGQIMEW 623


>gi|146300857|ref|YP_001195448.1| hypothetical protein Fjoh_3112 [Flavobacterium johnsoniae UW101]
 gi|146155275|gb|ABQ06129.1| Candidate alpha-L-fucosidase; Glycoside hydrolase family 95
           [Flavobacterium johnsoniae UW101]
          Length = 822

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 220/598 (36%), Positives = 334/598 (55%), Gaps = 33/598 (5%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
           LK+ +N PA  +T+A+PIGNG LGAMV+G V SE ++LNE TLW+G P     NP+A + 
Sbjct: 26  LKLQYNQPAVEWTEALPIGNGTLGAMVFGRVDSELIQLNEATLWSGGPVQKNVNPNAFQN 85

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           L+ +R  + +  + +A   +  + G  ++ +  LGD+ L  D    K   + Y R LD+ 
Sbjct: 86  LALIREALKAEDFDKAYNLTKNMQGAYSESFMPLGDLLLTQDLGSKK--TDFYNRSLDIQ 143

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           T  A   +    V + RE F+S P + IV K+S  +   LS ++   SLL N   +  N 
Sbjct: 144 TGLAVTNFKADGVNYKREIFASAPAKCIVMKLSADQLKKLSVSIDASSLLKNQKEIQ-NQ 202

Query: 192 QIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKL 242
            ++++G+ P    P   + N +P         +G++F  I++  + D  GT+S  E  K+
Sbjct: 203 SLVLKGKAPSHADPNYIDYNKEPVIYDDPAGCRGMRFELIVKPIVKD--GTVS-YEGNKI 259

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            ++ +   VL + A++SF+G    P    KD  + + + ++      Y  L   HL D+Q
Sbjct: 260 VIKNASEIVLFISAATSFNGFDKCPDSQGKDEHAFAENPIKKASVKKYDILVKEHLQDFQ 319

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLL 361
           K F+RVS+QL+            E +   +P+  R++ +   E D  L  L FQ+GRYLL
Sbjct: 320 KFFNRVSLQLNEK----------ETHKSNLPTDIRLEQYAKGEKDAGLEALFFQYGRYLL 369

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           ISSSR     ANLQGIWN  L   W S    NINL+MNYW     +LSE   PL DF+  
Sbjct: 370 ISSSRTHNAPANLQGIWNNKLRAPWSSNYTTNINLQMNYWPVESASLSELFFPLDDFVKN 429

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
           +S+ G++TA+  Y A+GWV+HH +DIWA ++      +G  +WA W MG  WL  HLWEH
Sbjct: 430 VSVTGAETAKSYYHANGWVLHHNSDIWATTNPVGDFGKGDPMWANWYMGANWLSRHLWEH 489

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y YT D ++L K+ YP+++G A F LDWL +  +GYL T PSTSPE+++     K   V+
Sbjct: 490 YQYTGDTEYL-KKVYPIIKGAAEFSLDWLQQDKNGYLVTMPSTSPENKYFYDGKKGGVVT 548

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +STMD+ II+++F     A+++L  + D   +KV K+  +L P +I   G + EW +
Sbjct: 549 TASTMDIGIIKDLFENTSQASKILNIDAD-FRQKVDKAANQLLPFQIGAKGQLQEWYK 605


>gi|227538538|ref|ZP_03968587.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227241457|gb|EEI91472.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 826

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 228/595 (38%), Positives = 330/595 (55%), Gaps = 49/595 (8%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           N LK+ ++ PA ++ +A+PIGNGRLGAMV+G    E ++LNE+T+W G PG+  + +A  
Sbjct: 28  NSLKLEYDKPAGNWNEALPIGNGRLGAMVFGQPDLEQIQLNEETIWAGGPGNNVSKNAYD 87

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEET 123
            +  +R L+  G+  EA   S   F  PA         YQ  GD+ + F D H +Y+  +
Sbjct: 88  KIQQIRRLLFEGKAKEAQDLSNATFPRPAPTGIDYGMPYQTFGDLRISFPD-HKQYS--S 144

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           Y RELD+  A  R +Y  G V +TRE F+S  D V++ K+S     SLSF++ L S  DN
Sbjct: 145 YSRELDIQDAITRTRYKAGAVNYTREVFASLKDDVVIIKLSADTKKSLSFSIGLTSPHDN 204

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
                 N Q+ + G          + +++   G IQF+ I+   +   +G     +D +L
Sbjct: 205 THITVENKQLTLSG---------ISGSHEGKTGQIQFTGIVRPIL---KGGKLIQKDNQL 252

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
           +V  +D  +L +   ++F     N +D   + T+++++ L       Y      H+  YQ
Sbjct: 253 EVTHADEVILYISIGTNFK----NYNDITGNATAKALNILNKASGNKYGKAKADHIQKYQ 308

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           + F+RVS+ L  SP+       S++  D      R++ F   +DP LV L FQFGRYLLI
Sbjct: 309 QYFNRVSLYLGESPQ-------SKKMTDI-----RIREFGGADDPELVTLYFQFGRYLLI 356

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           SSS+PG Q A LQGIWN+ LSP WDS   VNIN EMNYW +   NL E  EPLF  L  L
Sbjct: 357 SSSQPGGQPATLQGIWNDKLSPPWDSKYTVNINTEMNYWPAEVTNLKELHEPLFAMLKDL 416

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           ++ G ++A+  Y A GW IHH TD+W  S    G   + +WPMGGAWL  HLW+H+ Y+ 
Sbjct: 417 AVTGQESAKELYHARGWNIHHNTDLWRISGVVDGG-FYGMWPMGGAWLSQHLWQHFLYSG 475

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
           DR FL K  Y +L+G A F LD L E   H  +L   PS SPE+ ++   G    VS  +
Sbjct: 476 DRSFL-KEYYHVLKGKALFYLDVLQEEPTHQ-WLVVAPSMSPENSYLPGVG----VSAGT 529

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           TMD  ++ +VF   I A+ VL+++ D L + V  +L RL P +I +   + EW+Q
Sbjct: 530 TMDNQLVFDVFHNFIQASAVLKQDAD-LRDSVQVALDRLPPMQIGQHNQLQEWLQ 583


>gi|257053761|ref|YP_003131594.1| alpha/beta hydrolase domain-containing protein [Halorhabdus
           utahensis DSM 12940]
 gi|256692524|gb|ACV12861.1| alpha/beta hydrolase domain-containing protein [Halorhabdus
           utahensis DSM 12940]
          Length = 784

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 216/593 (36%), Positives = 326/593 (54%), Gaps = 45/593 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA  + +A+PIGNGRLG M++G    E ++ N DTLW G   D TNPDA + + +VR
Sbjct: 13  YDEPASAWLEALPIGNGRLGGMIFGRPGCERVQFNADTLWAGGHEDRTNPDAREHVEEVR 72

Query: 77  SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            L+  G+   A A A  KL G P  +  YQ  GD+ ++        A   YRRELDL+  
Sbjct: 73  RLLFDGEVQRAQALADEKLMGDPIRLRPYQTFGDLSIDVGHD----AVTDYRRELDLSAG 128

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            ARV+Y      + RE+F+S PD  IV +++  E G+++  V LD   D    V  +  +
Sbjct: 129 VARVRYDHEGTTYVREYFASAPDDAIVIRLTAEEPGAVTATVGLDREQDADDSVR-DGTL 187

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS-----D 248
            + GR        +       +G+ F A     ++ D G +  +       E S     +
Sbjct: 188 QLRGRVVDDPDDDRGAGG---EGMAFEA--RASVTADAGNVQRVTGADAPEESSVGFRAE 242

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
            A  + +  + F G         +DP +   S L ++ + SY DL   H+ D+++LF RV
Sbjct: 243 AADAMTIVLTGFTG------HETEDPGAACESVLDAVADQSYDDLRDTHVADHRELFDRV 296

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
            + L   P D  TD    E +D V + E         DP+L  L  QFGRYLLI+SSRPG
Sbjct: 297 ELDLG-EPLDRPTD----ERLDRVATGE--------ADPNLTALYAQFGRYLLIASSRPG 343

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           T+ ANLQG+WN++  P W+S   +NINLEMNYW +L  NL+EC  PL+DF+  L   G +
Sbjct: 344 TEPANLQGVWNQEFDPPWNSGYTLNINLEMNYWPALQTNLAECAAPLYDFVDDLREPGRR 403

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
            A+ +Y  +G+ +HH +D+W +++A      W LWPMG AWL   +++HY +T D D L 
Sbjct: 404 VAETHYDCAGFAVHHNSDLW-RNAAPVDGAHWGLWPMGAAWLSRLVFDHYAFTRDEDHLR 462

Query: 489 KRAYPLLEGCASFLLDWLIE--GHDG----YLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           + A P+L   A+F+ D+L+E    +G    +L T PS SPE+ ++  DG+ A V+Y+ TM
Sbjct: 463 ETAEPILREAAAFVADFLVEHPAEEGEAEDWLVTAPSNSPENAYVTDDGQEATVTYAPTM 522

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           D+ + R++F   I+AAE+LE  ED   + +  +L RL P ++ E G + EW++
Sbjct: 523 DVQLTRDLFEHTIAAAEILEV-EDEFHDDLRAALDRLPPMQVGEHGQLQEWIE 574


>gi|393773725|ref|ZP_10362119.1| twin-arginine translocation pathway signal protein [Novosphingobium
           sp. Rr 2-17]
 gi|392720900|gb|EIZ78371.1| twin-arginine translocation pathway signal protein [Novosphingobium
           sp. Rr 2-17]
          Length = 852

 Score =  370 bits (950), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 219/591 (37%), Positives = 310/591 (52%), Gaps = 45/591 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           +I  N PA  +    P+GNGRLGAM+ G V  + + LN DTLWTG P  + + D    L+
Sbjct: 56  RIADNSPATEWLLGHPVGNGRLGAMMGGSVRRDVISLNHDTLWTGQPSPHPDHDGRATLA 115

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            VR  V +G YA A   S  L G  +  +  + D+ LE D +    A   YRRELDL+ A
Sbjct: 116 AVRKAVFAGDYAAADLLSRPLQGTFSQSFAPMADMTLELDHTQ---AVTAYRRELDLDRA 172

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A V Y  G+V F RE F+S PD VIV ++S S + ++S  + L + L   +   GN   
Sbjct: 173 IASVAYHCGDVAFRRELFASYPDNVIVLRLSASRAAAISGRIGLATSLLGSTRAAGNTLR 232

Query: 194 IMEGRCPGKRIP-------PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           +M G+ P +  P       P A +    +G+ F+ +L +++    G + A  D  L V G
Sbjct: 233 LM-GKAPTRCEPNYREVPDPVAYSEQPGQGMAFATVLGVEVQG--GEVVASGDA-LSVRG 288

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D  V+ + A++ F    + P  + ++  + +   L      SY  L  RHL D+Q L+ 
Sbjct: 289 ADVVVIRIAAATGFRRFDLLPDIAAEEVAAVAERNLAIAHQNSYGSLLKRHLADHQALYR 348

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           R SI+L  +  D VT           P AER               LF  GRYLLI+SSR
Sbjct: 349 RASIELQGAGDDQVT-----------PKAER---------------LFNLGRYLLIASSR 382

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           P T  ANLQG+WN  + P W +    NINL+MNYW +  CNL+EC  PL D +  L++NG
Sbjct: 383 PDTMPANLQGLWNAQVRPPWSANYTTNINLQMNYWSAETCNLAECHLPLMDHIERLALNG 442

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
           +K A+  Y   GW +HH +D+WA ++   A  G   WA WPM G WL  H+WEHY ++ D
Sbjct: 443 AKVARDLYGMPGWSVHHNSDVWAMANPVGAGDGDPNWANWPMAGPWLAQHVWEHYRFSGD 502

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
             FL KR + L+  CA F   WL+     + L T PS SPE+ F+ P GK + +S   TM
Sbjct: 503 IAFLAKRGFALMRDCAEFCAAWLVRDPSSHRLTTAPSISPENLFLGPHGKPSAISSGCTM 562

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           D+A+ RE+F   I+AA ++  +   L   +   L  L P +I   G + EW
Sbjct: 563 DLALTRELFENCIAAANLV-GDRSGLAVHLKGLLQELEPYRIGRYGQLQEW 612


>gi|395804709|ref|ZP_10483944.1| hypothetical protein FF52_22604 [Flavobacterium sp. F52]
 gi|395433097|gb|EJF99055.1| hypothetical protein FF52_22604 [Flavobacterium sp. F52]
          Length = 823

 Score =  370 bits (950), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 223/605 (36%), Positives = 334/605 (55%), Gaps = 33/605 (5%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYT 64
           S S    LK+ +  PA  +T+A+P+GNG LGAMV+G V +E ++LNE TLW+G P     
Sbjct: 20  SASAQKDLKLQYKQPAVEWTEALPVGNGTLGAMVFGRVEAEFIQLNEATLWSGGPVHKNV 79

Query: 65  NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETY 124
           NPDA K L+ +R  + +  + +A   +  + G  ++ +  LGD+ L+ D    K A  +Y
Sbjct: 80  NPDAFKNLALIREALKNEDFEKANVLTKNMQGPYSESFMPLGDLILKQDFGGQKAA--SY 137

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
            R LD+ T  A   ++ G V + RE F+S P Q IV K+S  +   LS  +   SLL N 
Sbjct: 138 DRSLDIQTGLAVTSFNAGGVNYKREIFASAPAQCIVIKLSADQLKKLSVTIDAASLLKNQ 197

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTIS 235
             V  N  ++++G+ P    P   + N +P         +G++F  I++  + D  G IS
Sbjct: 198 KAVQ-NQTLVLKGKAPSHADPNYIDYNKEPVIYEDVTGCRGMRFELIIKPVVKD--GQIS 254

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
           + E  KL ++ +   +L + A++SF+G    P    KD    + + ++ +    Y  L  
Sbjct: 255 S-EGDKLVIKNASEILLFVSAATSFNGFDKCPDSQGKDEHKFAEAPIKKVAGKKYDSLLK 313

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLF 354
            H+ D+QK F+RVS+ L+            E +   +P+  R++ +   E D  L  L F
Sbjct: 314 EHIADFQKFFNRVSLMLNEK----------ETSKSDLPTDIRLEQYAKGEKDAGLEALFF 363

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           QFGRYLLISSSR     ANLQGIWN  L   W S    NINL+MNYW     +LSE    
Sbjct: 364 QFGRYLLISSSRTHNAPANLQGIWNNKLRAPWSSNYTTNINLQMNYWPVESGSLSELFFS 423

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWL 470
           L +F+   S  G++TA+  Y A+GWV+HH +DIWA ++      +G  +WA W MG  WL
Sbjct: 424 LDEFIKNASATGAETAKSYYHANGWVLHHNSDIWAMTNPVGDFGKGDPMWANWYMGANWL 483

Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
             HLWEHY YT D+++L K+ YP+++G A F LDWL +  +G+L T PSTSPE+ F    
Sbjct: 484 SRHLWEHYQYTGDKNYL-KKVYPIIKGAAEFSLDWLQKDKNGHLVTMPSTSPENIFYYDG 542

Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
            K   V+ +STMD+AII+++F   I A++VL  + +   +KV  +   L P +I   G +
Sbjct: 543 KKQGTVTTASTMDIAIIKDLFENTIEASKVLYADLE-FRQKVNSAREELLPFQIGSKGQL 601

Query: 591 MEWVQ 595
            EW +
Sbjct: 602 QEWYK 606


>gi|395803591|ref|ZP_10482835.1| hypothetical protein FF52_16993 [Flavobacterium sp. F52]
 gi|395434145|gb|EJG00095.1| hypothetical protein FF52_16993 [Flavobacterium sp. F52]
          Length = 816

 Score =  370 bits (950), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 220/595 (36%), Positives = 332/595 (55%), Gaps = 44/595 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           + +  N LK+ ++ PA  + +A+P+GNGRLGAMV+G    E L+LNE+T+W G P    +
Sbjct: 18  TATAQNDLKLWYDKPASIWNEALPLGNGRLGAMVFGDPAVERLQLNEETIWAGSPNSNAH 77

Query: 66  PDAPKALSDVRSLVDSGQYAEATAASVKLF------GHPADVYQLLGDIELEFDDSHLKY 119
             + +AL  VR L+  G++ EA   + K        G P   YQ  G + + F+  H KY
Sbjct: 78  TKSIEALPKVRQLIFEGKFDEAQDLATKDIMSQTNDGMP---YQTFGSVYISFN-GHQKY 133

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
            +  Y R+LD++ ATA+VKY V  VEFTRE  ++  DQVIV K+S S+ G ++ NV ++S
Sbjct: 134 TD--YYRDLDISNATAKVKYKVNGVEFTREILTAFSDQVIVMKLSASKPGQITCNVFMNS 191

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
            +D        NQII+ G           N  +    ++F   L  K  +  G I A  +
Sbjct: 192 PIDKTVTSTEGNQIILSGTG--------TNFENVKGKVKFQGRLTAK--NKGGEIDA-SN 240

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
             L +  +D  +L +  +++F     N  D   D  ++S   L       + ++   H+D
Sbjct: 241 GVLSINKADEVILYISIATNFK----NYKDISGDEIAKSKDYLAKAEIKDFENIKKAHVD 296

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
            YQK F+RV++ L            S E +   P+ ER++ F    DP L  L FQFGRY
Sbjct: 297 YYQKFFNRVALDLG-----------SNELVKK-PTNERIRDFSKQFDPQLASLYFQFGRY 344

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LLISSS+PG Q ANLQGIWN+ ++P WDS    NIN EMNYW +   NL E  EP     
Sbjct: 345 LLISSSQPGGQPANLQGIWNDMVTPPWDSKYTTNINAEMNYWPAQVTNLQELHEPFVQMA 404

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
             L+I G++TA++ Y A+GWV+HH TDIW + +A        +WP GGAW+C  LWE Y 
Sbjct: 405 KELAITGAETARMMYNANGWVLHHNTDIW-RVTAPVDSAASGMWPTGGAWVCQDLWERYL 463

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
           YT D+ +L +  YP+++G A F LD++I + + GYL   PS+SPE+      GK + ++ 
Sbjct: 464 YTGDKKYLAE-IYPIMKGAADFFLDFMIVDPNTGYLVVVPSSSPENTHAGGTGK-STIAS 521

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            +TMD  +I ++F+ ++ A+ ++  +  A V+KV ++L ++ P KI +   + EW
Sbjct: 522 GTTMDNQLIFDLFTHVMEASALISPDA-AYVKKVSEALAKMPPMKIGKHSQLQEW 575


>gi|302548581|ref|ZP_07300923.1| alpha-L-fucosidase 2 [Streptomyces hygroscopicus ATCC 53653]
 gi|302466199|gb|EFL29292.1| alpha-L-fucosidase 2 [Streptomyces himastatinicus ATCC 53653]
          Length = 809

 Score =  370 bits (950), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 225/590 (38%), Positives = 328/590 (55%), Gaps = 49/590 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + +   A  + +A+PIGNGRLGAMV+GG  SE L+LNEDT+W G P +  +P A  +L
Sbjct: 49  LALWYPRAASTWLEALPIGNGRLGAMVFGGAESELLQLNEDTVWAGGPYEPASPKALASL 108

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEET--YRRE 127
            ++R  V +G++  A +       G P    +YQ +G++ L FD      A E   YRR 
Sbjct: 109 PEIRRRVFAGEWEAAQSLIDSDFLGTPKGELMYQPVGNLRLAFD-----AAGEVGDYRRT 163

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           LDL++A A V+Y+ G V + RE F+S+PDQVIV +++    G++SF  + DS        
Sbjct: 164 LDLDSAVASVRYAQGGVTYDRECFASHPDQVIVMRLTADRPGAVSFTAAFDS-------- 215

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDKKLKVE 245
               Q ++    P +        ++  +G+  Q       +   D GT+S+ E+  L V 
Sbjct: 216 ---PQTVIAS-SPDRITVAIDGTSETREGVTGQVRFRALARARADGGTVSS-ENGTLTVT 270

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G+D   LL+   +S+   + NP+    D  + + + L +  ++ Y+ L  RH+ DY+ LF
Sbjct: 271 GADSVTLLVSVGTSYTD-YRNPT---GDHAARATAPLNAASDVPYARLRKRHVADYRGLF 326

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            RV + L        TD  +      +P+ ERV +F +  DP LV L FQ+GRYLLISSS
Sbjct: 327 RRVGLDLG------TTDAAA------LPTDERVANFASATDPQLVALHFQYGRYLLISSS 374

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPGTQ ANLQGIWN+ LSP+WDS   +NIN EMNYW +   NL EC EP+FD L  LS+ 
Sbjct: 375 RPGTQPANLQGIWNDSLSPSWDSKYTININTEMNYWPAPVTNLLECWEPVFDLLADLSVA 434

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G+ TA+  Y A GWV HH TD W + +A   +    +W  GGAWL T +W+HY +T D+ 
Sbjct: 435 GATTAKRQYGAGGWVTHHNTDAW-RGTAPVDRAFPGMWQTGGAWLSTGIWDHYLFTGDKK 493

Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
            L +R YP+L G   F LD L+ +   G+  T P+ SPE+           V    TMD 
Sbjct: 494 ALRRR-YPVLRGSVRFFLDTLVTDPATGHFVTCPANSPENAHHTN----VSVCAGPTMDN 548

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEW 593
            I+R++F   + A+E+L ++ DA +   ++ + R L P KI   G + EW
Sbjct: 549 QILRDLFDGFVKASELLGEDADAGMRAEVRRVRRKLPPMKIGAQGQLREW 598


>gi|334144837|ref|YP_004538046.1| alpha/beta hydrolase domain-containing protein [Novosphingobium sp.
           PP1Y]
 gi|333936720|emb|CCA90079.1| alpha/beta hydrolase domain-containing protein [Novosphingobium sp.
           PP1Y]
          Length = 806

 Score =  370 bits (949), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 220/589 (37%), Positives = 322/589 (54%), Gaps = 46/589 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA+ +T+A+P+GNGR+GAMV+GG   E L+LNEDTLWTG P +  NP A +AL 
Sbjct: 63  RLWYCQPAREWTEALPVGNGRIGAMVFGGTGLERLQLNEDTLWTGGPYNPVNPSAREALP 122

Query: 74  DVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEE-TYRRELD 129
            +R L++ G + +A T A  +L   P     YQ  GD+ +     HL   E+ +Y RELD
Sbjct: 123 QIRRLIEQGHFTQAQTLADARLMARPLSQMAYQTFGDLTIAM--PHLGTIEQGSYLRELD 180

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L+ A A   +    V ++R+  +S   QVI   +S    G +   V L +  D    ++G
Sbjct: 181 LDAALAATTFKADGVSWSRKVIASPDHQVIAVHLSADRPGRMHCLVGLGAPHDGVLSIDG 240

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK--ISDDRGTISALEDKKLKVEGS 247
              +I  GR            N+   G++ +   E +  +    G IS + D KL VEG+
Sbjct: 241 GT-LIFGGR------------NNAAHGVEGALRFEARARVLPQGGRIS-VSDNKLAVEGA 286

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D   +L+  ++S+        D   DP+  + S +++    S++ +       +++L+ R
Sbjct: 287 DAVTILIAMATSYR----QFDDVGGDPSQITRSQIEAASRHSFARIAADTAASHRRLYRR 342

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           VS+ L  +P                P+ ER+++ +T +D +L  L FQ+GRYLLI SSRP
Sbjct: 343 VSLDLGETPAA------------HRPTDERIRTSETSQDSALAALYFQYGRYLLICSSRP 390

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           G+Q ANLQGIWN+   P W S   +NIN EMNYW + P  L EC  PL   +  L+  G+
Sbjct: 391 GSQPANLQGIWNDSDDPPWGSKYTININTEMNYWPAEPTALGECVAPLVALVRDLAQTGA 450

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
            TA+  Y A GWV HH TD+W +++A      W LWPMGGAWLCTHLW+HY+Y  D  FL
Sbjct: 451 STAREMYGARGWVAHHNTDLW-RATAPIDGAAWGLWPMGGAWLCTHLWDHYDYHRDTAFL 509

Query: 488 EKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
            +  YPLL G A F LD L  +   GYL TNPS SPE+E   P G   C   S  +D  I
Sbjct: 510 -RSVYPLLRGAALFFLDTLQRDPASGYLVTNPSISPENEH--PGGASVCAGPS--VDRQI 564

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +R++F+    AA +L  ++D L  ++L +  RL P +I   G + EW++
Sbjct: 565 LRDLFAQTARAATILGLDDD-LSAQILDTSRRLAPDEIGAQGQLQEWLE 612


>gi|424794811|ref|ZP_18220740.1| exported protein [Xanthomonas translucens pv. graminis ART-Xtg29]
 gi|422795776|gb|EKU24406.1| exported protein [Xanthomonas translucens pv. graminis ART-Xtg29]
          Length = 775

 Score =  370 bits (949), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 221/587 (37%), Positives = 321/587 (54%), Gaps = 42/587 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+P+A  AL
Sbjct: 30  LTLWYPRPATQWVEALPLGNGRLGAMVWGGIAHERLQLNEDTLYAGQPYDATSPEALAAL 89

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             VR+L+ +G+Y EA A A  KL   P     YQ L D+ L++D +      + YRRELD
Sbjct: 90  PQVRALIFAGRYVEAEALADAKLLSRPRKQMPYQPLADLLLDYDRAD---GIDGYRRELD 146

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L+TA A  ++        RE F S  +Q I+ ++S    G ++  + +DS        + 
Sbjct: 147 LDTALASTRFVSDGATHLREVFVSATEQCILVRLSCDHPGRIALRIGIDSP-QAGEVTHE 205

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
              ++  GR         A       G++F+  +  + S   G  + +E  +++++G+D 
Sbjct: 206 QGALLFAGR--------NAGFAGIEGGLRFALRVLPRAS---GGSTRIERGRIRIDGADE 254

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
            VLLL A++S+        D   DP + S + L++   LSY+ L  RHL ++++LF RV+
Sbjct: 255 VVLLLTAATSYR----RYDDVGGDPLALSAAQLRTAAALSYAQLRERHLAEHRRLFRRVA 310

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           I L  S                +P+ ERV+ +    DP+L  L  Q+GRYLLISSSRPG+
Sbjct: 311 IDLGSSAAA------------QLPTDERVRRYADGNDPALAALYHQYGRYLLISSSRPGS 358

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQG+WNE + P W S   VNIN EMNYW S    L EC EPL   L  L+  G+ T
Sbjct: 359 QPANLQGVWNELMQPPWQSKYTVNINTEMNYWPSEANALHECVEPLEAMLFDLAETGAHT 418

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           AQ  Y A GWV+H+ TD+W ++    G V W+LWPMGG WL   LW+ ++Y  DR +L +
Sbjct: 419 AQAMYAAPGWVVHNNTDLWRQAGPVDG-VKWSLWPMGGVWLLQQLWDRWDYGRDRAYL-R 476

Query: 490 RAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
           R YPL +G A F +  L+ +   G + TNPS SPE+    P G   C      MD  ++R
Sbjct: 477 RIYPLFKGAAEFFVATLVRDPQSGAMVTNPSLSPENRH--PFGAALCA--GPAMDAQLLR 532

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           ++F+  I    +L  +  A  E++     +L P +I   G + EW Q
Sbjct: 533 DLFAQCIKMGALLGVDA-AFGERLATLRTQLPPDRIGRAGQLQEWQQ 578


>gi|326800280|ref|YP_004318099.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
 gi|326551044|gb|ADZ79429.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
          Length = 826

 Score =  370 bits (949), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 222/607 (36%), Positives = 340/607 (56%), Gaps = 53/607 (8%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           + +A + + ++  K+ ++ PA H+ +A+PIGNGRLGAM++GGV  + L+LNE+T+W+G P
Sbjct: 21  IYSAVNATGSDSYKLWYDKPAAHWNEALPIGNGRLGAMLFGGVKQDHLQLNEETIWSGGP 80

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFD 113
           G+ ++ D    + ++R L+ +G+Y EA   S K      +        YQ  GD+ ++F 
Sbjct: 81  GNNSSKDLYSTMQEIRRLLFAGKYKEAQDLSNKEMPREPEANNNYGMSYQPAGDLWIDF- 139

Query: 114 DSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
              L   E   YRRELD+  A + V Y VG V + RE+ ++  DQVI+ +++   +GS+S
Sbjct: 140 ---LHEGETVAYRRELDIADALSTVTYRVGEVTYKREYLATAHDQVIMMRVTADRAGSIS 196

Query: 173 FNVSLDS--LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISD 229
            N+ L++  L+    ++   N+I + G    K+         + KG ++FS  +E K+  
Sbjct: 197 CNLKLNTPHLIHQQPFIG--NRIYVNGTSGDKQ---------NKKGQVKFSIAVEPKV-- 243

Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
            +G     E + L+V  +D   + +   ++F+    N  D   D    +   L +    S
Sbjct: 244 -KGGALQAEGEMLRVRQADELTVYIAIGTNFN----NYHDLGGDARERADDYLNTALKKS 298

Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL 349
           Y  + ++H++DY++ F RVS+ L ++   +  +  +++         RV  F    DP L
Sbjct: 299 YRKIKSKHVEDYRRYFDRVSLDLGQT---VAMNKATDQ---------RVADFHLGNDPQL 346

Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
           V L FQFGRYLLISSSRPGTQ ANLQGIWN+ LSP W S   VNIN EMNYW +   NLS
Sbjct: 347 VSLYFQFGRYLLISSSRPGTQPANLQGIWNDKLSPPWSSKYTVNINTEMNYWPAEVTNLS 406

Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
           E  EPLF  L  LS+ G ++A   Y A GW +HH TDIW  +    G   + +WPMGGAW
Sbjct: 407 EMHEPLFAMLEDLSVTGKESAWNYYRARGWNMHHNTDIWRVTGIIDGG-FYGMWPMGGAW 465

Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIA 528
           L  H+W+HY +  D  FL K  YP+L+G   F +D L E     +L   PS SPE+ + +
Sbjct: 466 LSQHIWQHYLFNGDNAFLAKY-YPILKGVTQFYVDVLQEEPKHKWLVVAPSMSPENSYQS 524

Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
             G    +S  +TMD  ++ +VFS  + AA VL+ +ED  ++ V   L RL P +I + G
Sbjct: 525 GVG----ISAGTTMDNQLVFDVFSNFLEAAHVLQVDED-FMDTVASKLKRLPPMQIGKLG 579

Query: 589 SIMEWVQ 595
            + EW++
Sbjct: 580 QLQEWME 586


>gi|319786653|ref|YP_004146128.1| alpha-L-fucosidase [Pseudoxanthomonas suwonensis 11-1]
 gi|317465165|gb|ADV26897.1| Alpha-L-fucosidase [Pseudoxanthomonas suwonensis 11-1]
          Length = 805

 Score =  369 bits (948), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 231/597 (38%), Positives = 326/597 (54%), Gaps = 43/597 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           + +   PL++ +  PA  + +A+P+GNGRLGAMVWGG  SE L+LNEDTL+ G P D   
Sbjct: 47  TAAPGRPLRLWYPRPATRWVEALPLGNGRLGAMVWGGGRSERLQLNEDTLYAGRPYDPVP 106

Query: 66  PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDD-SHLKYAE 121
             A +AL +VR L+ +G++AEA A A   + G P     YQ LGD+ L+F + S L    
Sbjct: 107 DGALEALPEVRRLLFAGRHAEAEALADATMMGAPRKQMPYQPLGDLCLDFVEVSDL---- 162

Query: 122 ETYRRELDLNTATARVKYSVG-NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
           + YRRELDL+ A A   +  G  +E TRE F S  DQ +  ++  S+ G +   + LDS 
Sbjct: 163 DDYRRELDLDRAVATTSFGSGWKLEHTREAFVSAEDQCLAVRLRTSQPGRVRVRIGLDSD 222

Query: 181 LDNHSYV-NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
                 V +G+  +++ GR          +A     G++F+A L +++   RG       
Sbjct: 223 HAQAEVVPDGDAGLLLRGR--------NGDAFGIEGGLRFAARLGVQV---RGGTLRRRG 271

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
            +++VEG+D  VLLL A++SF        D   DP + + + L++    S+  L   H  
Sbjct: 272 DRIEVEGADEVVLLLTAATSFR----RYDDIGGDPEATTRTQLEAAARRSWDALLAAHEA 327

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
            +Q+LF RV+I L RS           E +  +P  ERV  F    DP L  L  QFGRY
Sbjct: 328 AHQRLFRRVAIDLGRS----------AEEVAALPIDERVARFAEGHDPELAALYHQFGRY 377

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LL+ SSRPGTQ ANLQGIWN+ L+P W+S   +NIN EMNYW +    L EC EPL   +
Sbjct: 378 LLVCSSRPGTQPANLQGIWNDLLAPPWESKYTININTEMNYWPAEANALPECVEPLERMV 437

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
             L+  G+  A+  Y A GWV+HH TD+W +++   G   W LWP+GGAWL  HLW+ ++
Sbjct: 438 AELAQTGADVARRMYGAPGWVVHHNTDLWRQAAPIDG-AKWGLWPLGGAWLLQHLWDRWD 496

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSY 538
           Y  +  +LEK  +PL  G A F    L+E    G + T PS SPE+E   P G   C   
Sbjct: 497 YGREPGYLEK-VWPLFRGAAEFFAATLVEDPTTGAMVTAPSISPENEH--PHGAALCAGP 553

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  MD  I+R++F   I  A +L  + D L  ++ +   RL P +I   G + EW Q
Sbjct: 554 S--MDAQILRDLFGQCIEIAGLLGVDAD-LAARLARLRERLPPHRIGRAGQLQEWQQ 607


>gi|424665666|ref|ZP_18102702.1| hypothetical protein HMPREF1205_01541 [Bacteroides fragilis HMW
           616]
 gi|404573919|gb|EKA78670.1| hypothetical protein HMPREF1205_01541 [Bacteroides fragilis HMW
           616]
          Length = 821

 Score =  369 bits (947), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 227/620 (36%), Positives = 329/620 (53%), Gaps = 58/620 (9%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAP 69
           N L + +  PA ++ +A+P+GNG LGAMV+G    E L+LNE TL++G P      P   
Sbjct: 22  NNLSLWYRQPAANWNEALPLGNGYLGAMVFGDAGREHLQLNESTLYSGEPFSGVGVPSIG 81

Query: 70  KALSDVRSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
              ++V +L++ G YA A     + + G  +  YQ L D+ L FD   ++   E Y REL
Sbjct: 82  SVYNEVLALLNHGDYAGAHRLITRNWQGRLSQSYQPLADLFLSFD---VQGKVENYVREL 138

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           +L  A   ++Y    + +TRE+F SNPD+V+V +IS S    ++  VS  S         
Sbjct: 139 NLQDAVHTIRYQAEGIRYTREYFISNPDRVMVIRISASRRSPVNVAVSYTSEHPTAKVDG 198

Query: 189 GNNQIIMEGRCPG---------------------------KRIPPKANANDDP---KGIQ 218
              ++I+ G+ PG                           +R   K     D    KG+ 
Sbjct: 199 TGEELILSGQAPGCVERRTLDFLEKNRLTDRHPELFDSHGRRKTDKQVLYADEVGGKGMF 258

Query: 219 FSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSES 278
           F +   +K+     T   L+D +LKV G    +LL+ A++S++G   +PS    D  ++ 
Sbjct: 259 FQS--RVKVLKGNAT---LQDNQLKVSGEGEIILLVAAATSYNGFDRSPSQDGSDYQAKL 313

Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
            + L     L Y DL  RHL DYQ+LF RV++ L            SE++   +P+  R+
Sbjct: 314 DTILSVAGQLPYEDLKKRHLADYQRLFGRVALTLK-----------SEKDYSGLPTDRRI 362

Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
             F+ + D +L  LLFQ+GRYLLI+SSR G Q ANLQGIWN+D+ P W S+  +NIN EM
Sbjct: 363 IGFRDNPDNALAALLFQYGRYLLIASSREGGQPANLQGIWNKDVVPAWSSSYTININTEM 422

Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKV 458
           NYW +    L EC EPLF  +  L++NGS TA   Y   GW  HH T IW +S    G+ 
Sbjct: 423 NYWPAETTGLPECSEPLFRLIRELAVNGSVTAAKMYNLPGWTSHHITSIWRESGPADGEP 482

Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
            W +W M   WLC HLW+HY ++ D+ FL + AYPL+   A F   WL+E  DG  +T  
Sbjct: 483 TWFMWNMSAGWLCRHLWDHYLFSEDKKFLRETAYPLMRDAARFYNAWLVE-KDGMWQTPL 541

Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE-----DALVEKVL 573
             SPE++F+ P+ K + V+ +  MDMAIIRE+FS    AA +L  +      D L+  V+
Sbjct: 542 GVSPENQFLTPEKKTSAVAPAPAMDMAIIRELFSNTAEAAAILAADSILPPADTLLLHVM 601

Query: 574 KSLPRLRPTKIAEDGSIMEW 593
            +  +L P +I + G IMEW
Sbjct: 602 GA-KQLVPYRIGKRGQIMEW 620


>gi|392964290|ref|ZP_10329711.1| Alpha-L-fucosidase [Fibrisoma limi BUZ 3]
 gi|387847185|emb|CCH51755.1| Alpha-L-fucosidase [Fibrisoma limi BUZ 3]
          Length = 821

 Score =  369 bits (946), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 222/593 (37%), Positives = 325/593 (54%), Gaps = 48/593 (8%)

Query: 13  LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
            K+ +N PA + + +A+PIGNGRLGAMV+G V  ET++LNE T+W+G P    NPDA  A
Sbjct: 25  FKLWYNQPAGQTWENALPIGNGRLGAMVYGNVARETIQLNEHTVWSGGPNRNDNPDALAA 84

Query: 72  LSDVRSLVDSGQYAEATAASVKLF----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
           L ++R+L+  G+  EA   + K       H   ++Q +G++ L F+  H  Y    Y R+
Sbjct: 85  LPEIRTLIFDGKQKEAEKLANKAIITKKAH-GQMFQPVGNLHLTFN-GHDNYTN--YYRD 140

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           LD+  A A+  Y+V  V +TRE F+S PDQVIV  ++ S+ G + F  S  +        
Sbjct: 141 LDIERAIAKTTYTVDGVAYTREVFTSFPDQVIVVHLTASKPGRIDFTASYST-------- 192

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDP--KG-IQFSAILEIKISDDRGTISALEDKKLKV 244
               Q       P K +      +D    KG ++F  I  IK   ++GT+++  D  L V
Sbjct: 193 ---QQKADRKTTPAKDLTIAGTTSDHEGVKGMVRFKGITRIKT--EKGTLAS-TDTTLTV 246

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
           +G++ A + +  +++F+    +  D   D  + + S L      SY+ + T H+  YQ  
Sbjct: 247 KGANAATIYISIATNFN----SYKDVSGDENARAESYLNKAYPKSYAAMLTPHVAAYQNY 302

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV + L  +P +             +P+ ER+K+F+T  DP    L +Q+GRYLLISS
Sbjct: 303 FNRVRLDLGSTPTEAAK----------LPTDERLKNFRTATDPEFATLYYQYGRYLLISS 352

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+PG Q ANLQGIWN  + P WDS   +NIN +MNYW +   NL+E  EP    +  LS 
Sbjct: 353 SQPGGQPANLQGIWNHRMRPPWDSKYTININAQMNYWPAEKTNLAELHEPFLRMVNELSE 412

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G +TA+V Y A GW+ HH TDIW  + A  G   W +W  GG W   HLWEHY Y  D+
Sbjct: 413 AGQETARVMYGARGWMAHHNTDIWRTTGAIDG-ATWGMWIAGGGWTAQHLWEHYLYNGDK 471

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
            +L    YP+L+G A F +D+LIE H  Y  L  NP TSPE+   A  G  + +   +TM
Sbjct: 472 AYLAS-VYPILKGAAQFYVDYLIE-HPKYHWLVVNPGTSPENAPKAHGG--SSLDAGTTM 527

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           D  I  +VFS  I AAE+L K + A V+ + +   +L P  + + G + EW++
Sbjct: 528 DNQIAFDVFSTAIRAAEIL-KTDVAFVDTLKQKRSQLPPMHVGQHGQLQEWLE 579


>gi|326204164|ref|ZP_08194024.1| Alpha-L-fucosidase [Clostridium papyrosolvens DSM 2782]
 gi|325985675|gb|EGD46511.1| Alpha-L-fucosidase [Clostridium papyrosolvens DSM 2782]
          Length = 775

 Score =  368 bits (945), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 218/587 (37%), Positives = 317/587 (54%), Gaps = 33/587 (5%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA+ + +A+P+GNG LG MV GG+  E + LN DTLW+G+PG   N +    L +V+
Sbjct: 7   YKSPARIWEEALPVGNGGLGGMVHGGISHECIDLNNDTLWSGLPGQLINKNILPLLPEVQ 66

Query: 77  SLVDSGQ-YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
            LVD G  Y         +    +  Y  LG + L  +   L      Y R L LNTA  
Sbjct: 67  CLVDEGNNYDAQKLIEENILTGYSQSYLPLGRLLLTCE---LSGEINNYSRSLSLNTAVC 123

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
             +Y+ G V   RE   S PD V+   ++  +S S +   +LDS L       G   +IM
Sbjct: 124 ETRYTCGGVNHCREVICSYPDNVMAVHMTADKSESFTLTATLDSQLRYQVNKKGRT-LIM 182

Query: 196 EGRCPGKRIPPKANAN--------DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
            G CP   IP    A         +  + I FS  +   I   +G    +E+  + +  +
Sbjct: 183 TGDCPSCMIPDYVEAGKHIVYDSEEHSRSIGFSVGMRAYI---KGGSVIVEENGISINAA 239

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D  +L+L +S++F+G  I P  S  DP S+ +  L      S+++L +RH DD+  LF R
Sbjct: 240 DEVLLVLSSSTNFEGFDIMPGSSGVDPLSKCIRTLDKAAGYSWNELLSRHKDDHSSLFKR 299

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSR 366
           V + L    +              +P+ ER+ ++   + DPSL  L+F +GRYLLI+ SR
Sbjct: 300 VCLDLGTQSQ--------------LPTDERLAAYAKGQYDPSLDSLMFAYGRYLLIACSR 345

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PGTQ ANLQGIWN+DL+  W S    NINLEMNYW +   NLSEC +PLFD L  +S  G
Sbjct: 346 PGTQAANLQGIWNKDLAAPWSSNYTTNINLEMNYWPAETANLSECHKPLFDLLKDVSKAG 405

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
           S+ ++ NY   G+V+HH TD+W  +SA  G+  W  WPMGGAWL  H+ EHY ++ D  F
Sbjct: 406 SEISRENYGCRGFVLHHNTDLWRMASAVSGQARWGFWPMGGAWLSLHIMEHYRFSCDVVF 465

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           L+   Y + E    F LD++     GY  TNPSTSPE+ FI  +G++  ++  STMD+ I
Sbjct: 466 LQNHYYIMREA-VLFFLDYMKPDKKGYYITNPSTSPENAFIDKEGRICSITKGSTMDLFI 524

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           IRE+F + + A  +L K +  L   +++ L +L P +I + G ++EW
Sbjct: 525 IRELFESCVEAQSIL-KIDSELSGLLVQRLCKLPPFRIGKKGQLLEW 570


>gi|346225024|ref|ZP_08846166.1| alpha-L-fucosidase [Anaerophaga thermohalophila DSM 12881]
          Length = 828

 Score =  368 bits (945), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 227/602 (37%), Positives = 330/602 (54%), Gaps = 53/602 (8%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           A+  +    LK+ ++ PA  + +A+PIGNGRLGAMV+G   +E ++LNE+T W+G P   
Sbjct: 20  AKEMAQKTDLKLWYDKPANVWNEALPIGNGRLGAMVFGDPANEKIQLNEETFWSGGPSHN 79

Query: 64  TNPDAPKALSDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHL 117
            NP A KAL  VR L+  G+Y EA      +  + +L G    +YQ +G++ L FD  H 
Sbjct: 80  DNPKALKALPKVRQLIFEGKYYEAEKMVNESMVAEQLHG---SMYQTIGNLNLSFD-GHE 135

Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
            Y    Y RELD+  A     Y+V +V F RE F+S P+Q+I  K+S  + GSLSF  SL
Sbjct: 136 NYT--NYYRELDIENALFSTTYTVNDVNFKREVFASFPNQIIAVKLSSDQHGSLSFTASL 193

Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISA 236
           +  L  ++ V   N + M G          +++++  +G ++F+     KI +D G I  
Sbjct: 194 NGPLAKNTQVLDTNILEMTG---------ISSSHEGVEGQVKFNT--RAKILNDGGKIKT 242

Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
            +  K+ V  +D  V+L+  +++F    ++      +   +    L      S+++L   
Sbjct: 243 -DGNKITVTKADEVVILISMATNF----VDYKTLSANENEQCQKFLSEASQKSFAELKNA 297

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           H+ DY+K F R S+ L  +P        SE      P+  R+K+F    DP+LV L +QF
Sbjct: 298 HIKDYRKYFTRSSLNLGTTP-------ASE-----YPTDVRIKNFSQTNDPALVALYYQF 345

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLISSSRPG Q ANLQGIWN    P WDS   +NIN EMNYW +  CNL+E  EPL 
Sbjct: 346 GRYLLISSSRPGGQPANLQGIWNNSTHPAWDSKYTININTEMNYWPAEKCNLTELHEPLI 405

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
             +  LS  GS TAQ  Y   GWV HH TDIW       G   W +WPMGGAWL  HLWE
Sbjct: 406 QMVRELSETGSHTAQTMYGCDGWVTHHNTDIWRICGVVDG-AFWGMWPMGGAWLSQHLWE 464

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPSTSPEHEFIAPDGKLAC 535
            + Y  D  +L    Y +++    F  ++LIE   +G+L  +PS SPE+   AP G+   
Sbjct: 465 KFLYNGDMKYLAS-VYSIMKSACRFYQNFLIEEPVNGWLVVSPSVSPEN---APAGR-PS 519

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEW 593
           ++  +TMD  I+ ++FS  I AA +L ++E+ +     +L SLP   P +I + G + EW
Sbjct: 520 ITAGATMDNQILFDLFSKTIKAATLLNQDENLISDFRNILDSLP---PMQIGQYGQLQEW 576

Query: 594 VQ 595
           ++
Sbjct: 577 ME 578


>gi|189465240|ref|ZP_03014025.1| hypothetical protein BACINT_01585 [Bacteroides intestinalis DSM
           17393]
 gi|189437514|gb|EDV06499.1| hypothetical protein BACINT_01585 [Bacteroides intestinalis DSM
           17393]
          Length = 826

 Score =  368 bits (944), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 219/601 (36%), Positives = 343/601 (57%), Gaps = 44/601 (7%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           +++ ++   T   ++ ++ PA+ + +A+PIGNGR+GAMV+GG+  E ++LNE+T+WTG P
Sbjct: 20  LLSCQNNPDTTIWRLWYDQPAEKWEEALPIGNGRIGAMVFGGITKEKIQLNEETVWTGEP 79

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATA---ASVKLFGHPADVYQLLGDIELEFDDSHL 117
              +NPDA  A+ D+R L+  G+Y EA       V    +   +YQ +GD+ L F     
Sbjct: 80  NSNSNPDALNAIPDIRKLIFQGKYKEAQKLVDEKVISKTNHGMIYQPVGDLNLTFPGHE- 138

Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
               + Y RELD+ +A A+ +Y+V +VE+ RE F+S  DQVIV  ++ S  G + F+  L
Sbjct: 139 --TAKNYYRELDIESAIAKTRYTVNDVEYQREIFTSFTDQVIVIHLTASRKGKIVFSAEL 196

Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISA 236
           +S   + + +   N + ++G   G         ++  +G I FS +  +KI  ++G +  
Sbjct: 197 NSPQKSQT-ITLENGLSLQGSTEG---------HEGLEGKISFSTL--VKIVPEKGQMKT 244

Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
            E  ++ V  +D AV + V+ ++    F+N ++   +P  +  S LQ      Y+ L T 
Sbjct: 245 -EASRITVSNAD-AVTIYVSIAT---NFVNYANLSGNPDQKVKSYLQHATQKDYAKLKTD 299

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           H+D Y+  F+RV  +L       VT+   +       +  R+  F   +DP+L  L FQF
Sbjct: 300 HMDYYRDYFNRVKFKLD------VTEAIQKT------TDVRIAEFAQGKDPNLAALYFQF 347

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLIS S+PGTQ ANLQGIWNE + P WDS    NINLEMNYW +   NLSE  EPL 
Sbjct: 348 GRYLLISCSQPGTQPANLQGIWNERMKPAWDSKYTTNINLEMNYWPTEITNLSELHEPLI 407

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
             +  L++ G  TA++ Y A GW++HH TD+W  + A DR      +WP  GAWL  HLW
Sbjct: 408 QMIKELAVTGGHTAKIMYGARGWMLHHNTDLWRTTGAVDRSGP--GMWPTCGAWLSRHLW 465

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLA 534
           EH+ Y+ D+ +LE+  YP+++G A FLLD+ +E  + + L   PS+SPE+ F   + KL 
Sbjct: 466 EHFLYSGDKTYLEE-VYPIMKGAALFLLDFAVEEPEHHWLVIAPSSSPENTFDKKN-KLT 523

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             +   TMD  ++ E+FS +ISA E+LE+++    + + +   R+ P +I     + EW+
Sbjct: 524 NTA-GVTMDNQLMFELFSNLISATEILERDQH-FADTLRQIRTRIPPMQIGRYSQLQEWM 581

Query: 595 Q 595
            
Sbjct: 582 H 582


>gi|300770084|ref|ZP_07079963.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300762560|gb|EFK59377.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 826

 Score =  367 bits (943), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 225/601 (37%), Positives = 326/601 (54%), Gaps = 47/601 (7%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           A      N LK+ ++ PA ++ +A+PIGNGRLGAMV+G    E ++LNE+T+W G PG+ 
Sbjct: 21  ATCLQAQNSLKLQYDKPAGNWNEALPIGNGRLGAMVFGQPDQEQIQLNEETIWAGGPGNN 80

Query: 64  TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSH 116
            + +A   +  +R L+  G+  EA   S   F  PA         YQ  GD+ + F   H
Sbjct: 81  VSKNAYDKIQQIRRLLFEGKAKEAQDLSNATFPRPAPSGIDYGMPYQTFGDLRISFP-GH 139

Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
            +Y   +Y RELD+  A  R +Y  G V +TRE F+S  D V++ K+S     SLSF++ 
Sbjct: 140 KQYT--SYSRELDIQDAITRTRYKAGAVTYTREVFASLKDDVVIIKLSADTKKSLSFSIG 197

Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTIS 235
           L S  DN      N Q+ + G          + +++   G IQFS I+   +   +G   
Sbjct: 198 LTSPHDNTHITVENKQLTLSG---------ISGSHEGKTGRIQFSGIVRPVL---KGGTL 245

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
             +D +L++  +D  +L +   ++F       +D   +  ++++  L       Y     
Sbjct: 246 IQKDNQLEITNADEVILYISIGTNFK----KYNDITSNAAAKALDILNKATARKYEKAKA 301

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
            H+  YQ+ F+RVS+ L  SP+       S++  D      R++ F   +DP LV L FQ
Sbjct: 302 DHIQKYQQYFNRVSLYLGESPQ-------SKKMTDI-----RIREFGGADDPELVTLYFQ 349

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRYLLISSS+PG+Q A LQGIWN+ LSP WDS   VNIN EMNYW +   NL E  EPL
Sbjct: 350 FGRYLLISSSQPGSQPATLQGIWNDKLSPPWDSKYTVNINTEMNYWPAEVTNLKELHEPL 409

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
           F  L  L++ G ++A+  Y A GW IHH TD+W  S    G   + +WPMGGAWL  HLW
Sbjct: 410 FAMLKDLAVTGQESAKELYHARGWNIHHNTDLWRISGVVDGG-FYGIWPMGGAWLSQHLW 468

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLA 534
           +H+ Y+ DR FL K  Y +L+G A F LD L E     +L   PS SPE+ +    G   
Sbjct: 469 QHFLYSGDRSFL-KEYYHVLKGKALFYLDVLQEEPTHKWLVVAPSMSPENSYQPGVG--- 524

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            VS  +TMD  ++ +VF   I A+E+L+++ D L + V  +L RL P +I +   + EW+
Sbjct: 525 -VSAGTTMDNQLVFDVFHNFIQASEILKEDAD-LRDSVQVALHRLPPMQIGQHNQLQEWL 582

Query: 595 Q 595
           Q
Sbjct: 583 Q 583


>gi|261406479|ref|YP_003242720.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
 gi|261282942|gb|ACX64913.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
          Length = 783

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 214/591 (36%), Positives = 321/591 (54%), Gaps = 38/591 (6%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+    PA+ +T+A P+GNGRLGAMV+GGV +E + LNED++W G P  + NP+A + L 
Sbjct: 7   KLVERRPAQVWTEAFPVGNGRLGAMVFGGVSTERIGLNEDSVWYGGPKQHDNPEAIEKLD 66

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDDSHLKYAEETYRRELDL 130
           D+RSL+  G+  EA   ++  F +       YQ LGD+ L+F     +     YRREL+L
Sbjct: 67  DIRSLLRCGELREAEQLALTHFTNAPPYFGPYQPLGDLLLQFKSGTSEVNH--YRRELNL 124

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVNG 189
            T  A V +    + + RE F+S   QV+V +IS SE  ++  +  L     D +     
Sbjct: 125 RTGVASVSWEENGILYEREVFASAVHQVLVIRISSSEPAAIHLSARLSRRPFDGNIKREN 184

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
              + MEG C              P G+ ++ +L+   +   G         L ++ +D 
Sbjct: 185 ERTLAMEGIC-------------GPDGVTYATVLQ---AHTIGGKCHTVGNYLDIQSADA 228

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
             LLL A +SF            DP  E++   +S   L Y+ L   H+ D+  L  RVS
Sbjct: 229 VTLLLAAQTSF---------RCDDPYREALRQAESAVLLPYASLLEEHITDHCALLERVS 279

Query: 310 IQL-----SRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLIS 363
           +++     S +P    + + +E      P++ER++ + Q   DP L  L +Q+GRYL+++
Sbjct: 280 LEIEAADTSIAPVSEESASEAEAVAVDRPTSERLQLYRQGGNDPGLEALFYQYGRYLMMA 339

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSRPG+  ANLQGIWNE  +P W+S  H+NINL+MNYW +   NL EC EPLFDF+  L 
Sbjct: 340 SSRPGSLPANLQGIWNESFTPPWESDYHLNINLQMNYWIAETGNLPECHEPLFDFIDRLV 399

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
           ING KTA   Y A G+  H  +++WA+S           WPMGGAWL  HLWEHY Y + 
Sbjct: 400 INGRKTAASLYGARGFTAHASSNLWAESGLFGAWTPAIFWPMGGAWLALHLWEHYRYNLS 459

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
             FL +RAYP+L+  + F LD+L+   +G L T+PS SPE+ +I   G++  +S   +MD
Sbjct: 460 ESFLSERAYPVLKEASLFFLDFLVFDENGSLVTSPSLSPENSYINEKGQIGSLSSGPSMD 519

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             +I  + +A I AAE+L  +++    + + +  +L   +I   G +MEW 
Sbjct: 520 SQMIYALLTACIEAAEILGLDKE-WSRQWMDTRAKLPQPQIGRYGQVMEWA 569


>gi|346724703|ref|YP_004851372.1| hypothetical protein XACM_1797 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346649450|gb|AEO42074.1| hypothetical protein XACM_1797 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 790

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 222/595 (37%), Positives = 321/595 (53%), Gaps = 46/595 (7%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
            +    L++ +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+P
Sbjct: 39  VAAAQALQLWYREPANQWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98

Query: 67  DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
           DA  AL  VR+L+ +G+YAEA   A  KL   P     YQ LGD+ L+FD +        
Sbjct: 99  DALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YRR+LDL+TA A   +  G     RE F S   Q IV ++S    G +S  V +DS    
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCDRPGGISVRVGIDSPQTG 215

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKK 241
                    ++  GR            N    GI+      +++      G +S + D +
Sbjct: 216 EVTAE-QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVRGGKLSQVRD-R 261

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L+++ +D  VLLL A++S+     +  D   DP + + + L+    L +  L   HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLASTAACLRKAAKLDFPALLRAHLADH 317

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q+LF RV+I L  S                +P+ ERV+ F    DP+L  L  Q+GRYLL
Sbjct: 318 QRLFRRVAIDLGSSAA------------TQLPTDERVQRFAEGNDPALAALYHQYGRYLL 365

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I SSRPGTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L  
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECAEPLEAMLFD 425

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           L+  G+ TA+  Y A GWV+H+ TD+W ++    G   W+LWP+GG WL   LW+ ++Y 
Sbjct: 426 LAQTGAHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPLGGVWLLQQLWDRWDYG 484

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
            DR +L K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C   S 
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            MD  ++R++F+  I+ +++L  + + L +++     +L P +I + G + EW Q
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAE-LAQQLAALREQLPPNRIGKAGQLQEWQQ 593


>gi|304404820|ref|ZP_07386481.1| Alpha-L-fucosidase [Paenibacillus curdlanolyticus YK9]
 gi|304346627|gb|EFM12460.1| Alpha-L-fucosidase [Paenibacillus curdlanolyticus YK9]
          Length = 769

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 206/588 (35%), Positives = 332/588 (56%), Gaps = 39/588 (6%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           N   + +  PA+ + +A PIGNG+LGAMV+G    E ++LNE+++W G P    N +A  
Sbjct: 2   NNTTLRYKKPAQEWVEAFPIGNGKLGAMVFGRPFEERIQLNEESVWHGGPLQRDNVEALP 61

Query: 71  ALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
            L ++R L+ +GQ  EA   + + +   P D+  YQ LG++ ++FD    +     Y RE
Sbjct: 62  NLPEIRRLLFAGQPDEAEKLAFQTMISTPEDLGPYQTLGELAIQFDRED-QGEPSDYVRE 120

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           LDL T    V Y  G V F R+ F+S PD VIV ++S      L F  +L       S +
Sbjct: 121 LDLATGVVSVHYEAGGVRFRRDSFASGPDGVIVYRLSADRQRRLFFTSTLSREEGTVSPL 180

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
            G++ ++++G+C              P+G+Q++A+L  +I  + G +SA E   + +  +
Sbjct: 181 -GSDTLVLQGQC-------------GPEGVQYAAVL--RIVCEGGRLSA-EGNTIMISDA 223

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D A + + A+++F          + D  + S   L +     + ++   H+ +++ LF R
Sbjct: 224 DTATIYIAAATTF---------READLLAVSEQKLNAAIAKGFEEVRRSHIAEHRGLFDR 274

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSR 366
           V+++L ++      D  +E   +++P+ ER+  F+  D +  L+EL F FGRYLL+SSSR
Sbjct: 275 VALELRKA-----GDHPAEH--ESLPTDERLARFRNGDRESGLIELFFHFGRYLLLSSSR 327

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
            G+  ANLQGIWN+ ++P W+S  H NIN++MNYW +   NL+EC EPLFD++  L +NG
Sbjct: 328 RGSLPANLQGIWNDSMTPPWESDFHTNINIQMNYWPAEVTNLAECHEPLFDYIDQLRVNG 387

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            +TAQ  Y A G+ +HH +++WA +S     +    WPMGGAWL  H+WEHY Y  D  F
Sbjct: 388 RRTAQAMYGARGFCVHHTSNLWADASITSRWLPAMFWPMGGAWLTLHMWEHYLYGGDIAF 447

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           L  RAYP +   A F LD++++   G   T PS SPE+ +  P+G    +    +MD  +
Sbjct: 448 LRDRAYPAMRESALFFLDFMVQDPQGRWVTAPSVSPENSYRLPNGNEGALCAGPSMDTQM 507

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           IR +F A ++A E+LE++ D +  ++ + L  +    IA +G++MEW 
Sbjct: 508 IRMLFEACLTALELLEES-DEIASELRERLAGMPEQGIASNGTLMEWA 554


>gi|291545123|emb|CBL18232.1| hypothetical protein RUM_22260 [Ruminococcus champanellensis 18P13]
          Length = 776

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 213/589 (36%), Positives = 315/589 (53%), Gaps = 44/589 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA++F  A+P+GNGR+GAMV+GGV +E LKLNED++W+G   +  NPDA + +  +R
Sbjct: 9   YTKPAENFDQALPVGNGRMGAMVFGGVETEHLKLNEDSIWSGGLRNRNNPDAYQGMQQIR 68

Query: 77  SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            L+   + +EA   + + + G P +   Y  LGD+++ F   H +     YRR LDL++ 
Sbjct: 69  MLLQQEKISEAEELAFQTMQGCPENSRHYMPLGDLDVVF---HKESHSTAYRRTLDLSSG 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A  +Y++  V++ R  F S PD V+V  +S  + G +SF  S            G +  
Sbjct: 126 IALTEYTLDGVQYQRSVFVSEPDNVLVLHVSADQPGQVSFAASF----------GGRDDY 175

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
             E R  G+            +GIQF+ ++   +   R         +L VEG+D A LL
Sbjct: 176 YDENRPDGEASICVTGGQGGQQGIQFAVVMTAAVQGGRAFTRG---NQLCVEGADEATLL 232

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSA-LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
           L   +SF          K +   E+     +   + S+ +L  RH+DDY+ LF RV ++L
Sbjct: 233 LAVQTSF---------YKGEGYLEAAQLDAEYAADCSFHELMVRHVDDYRALFDRVKLEL 283

Query: 313 -------SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
                  ++ P D         + D   +A  +       D  L EL F +GRYL+IS S
Sbjct: 284 EDNSGEGAQLPTDARLSRLRGNDFDGKDAAGLIL------DNKLTELYFNYGRYLMISGS 337

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPG+Q  NLQGIWN+D+ P W S   VNIN EMNYW +  CNLSEC  PLFD +  +  N
Sbjct: 338 RPGSQPLNLQGIWNQDMWPAWGSRFTVNINTEMNYWCAESCNLSECHLPLFDLIRRMRPN 397

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G +TA+  Y   G+V HH TD+W   +     +   +WPMG AWLC H++EHY YT+DRD
Sbjct: 398 GEQTARDMYHCGGFVCHHNTDLWGDCAPQDRWMPATIWPMGAAWLCLHIFEHYQYTLDRD 457

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           FL ++ +  L G A F  +++ E   G L T PS SPE+ ++   G    +    +MD  
Sbjct: 458 FLAQQ-FDTLCGAAQFFTEYMFENSAGQLVTGPSVSPENTYLTASGAKGSLCIGPSMDSQ 516

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           II  +F+ ++ AA +LE+ E  L+EK+ + LPRL   +I + G I EW 
Sbjct: 517 IITLLFTDVLEAARILER-ESPLLEKIRQMLPRLPMPEIGKYGQIKEWA 564


>gi|294675358|ref|YP_003575974.1| hypothetical protein PRU_2729 [Prevotella ruminicola 23]
 gi|294473191|gb|ADE82580.1| conserved hypothetical protein [Prevotella ruminicola 23]
          Length = 821

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 217/595 (36%), Positives = 330/595 (55%), Gaps = 52/595 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA  + +A+PIGN  LGAMV+GG+ +E ++LNE+T W+G P +  NPDA  A+ 
Sbjct: 23  KLWYSKPAAQWLEALPIGNSHLGAMVYGGIGTEQIQLNEETFWSGSPHNNNNPDAKVAMK 82

Query: 74  DVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
           DVR L+  G+  EA A   K F  G     Y  LGD+ L FD  +   AE + YRREL+L
Sbjct: 83  DVRRLIFEGKEKEAEALIDKTFFKGPHGQKYLPLGDLMLSFD--YQNGAEPSNYRRELNL 140

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A     + V +V++ R  F+S  D  I+ +++ S+  +L+F VS              
Sbjct: 141 GDALCTTSFDVADVKYIRTAFASQADNAIIIQLTASKKKALNFGVSYQ-----------R 189

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDKKLKVEGSD 248
           NQ  +EG    K        N + +GI  +  A + +K+  D GT++ +    ++V  + 
Sbjct: 190 NQQAVEGGAVAKNEHAYIINNVEHEGIAGKLQAEVRVKVVAD-GTVTDM-GSDMQVRNAT 247

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
            A + + A++++    +N      DP +++   +Q ++  +Y  L  RHLD YQ  + RV
Sbjct: 248 NATIFITAATNY----VNYQTINGDPVAKNNLTMQLLKGKNYKQLLKRHLDKYQDQYDRV 303

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGRYLLISSSRP 367
           S+ L++S +              +P+ ER+ +F  TD D  +V L+ Q+GRYLLISSS+P
Sbjct: 304 SLSLAKSAQS------------ELPTDERLAAFDGTDLD--MVSLMMQYGRYLLISSSQP 349

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           G Q ANLQG+WN  + P WDS   +NIN EMNYW +   NL+E QEPLF  +  LS+ G+
Sbjct: 350 GGQPANLQGVWNHKMDPAWDSKYTININAEMNYWPANVGNLAETQEPLFSMIRDLSVTGA 409

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           KTA+  Y   GWV HH TD+W  +    G   W ++P GGAWL THLW++Y YT D+ FL
Sbjct: 410 KTARTMYNCPGWVAHHNTDLWRIAGPVDG-TSWGMFPTGGAWLTTHLWQYYLYTGDKRFL 468

Query: 488 EKRAYPLLEGCASFLLDWL--------IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           +   YP+L+G + FLL ++        ++   G+L T P+ SPEH    P GK   V+  
Sbjct: 469 DA-CYPILKGASDFLLSYMQEYPKNGEVKQAAGWLVTVPTVSPEH---GPVGKNTTVTAG 524

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           STMD  I+ +V S+ + A ++L  N       +  ++ +L P +I   G + EW+
Sbjct: 525 STMDNQIVFDVLSSTLRAHQILGYNNVVYTTMLSNAIAKLPPMQIGRYGQLQEWL 579


>gi|284039852|ref|YP_003389782.1| alpha-L-fucosidase [Spirosoma linguale DSM 74]
 gi|283819145|gb|ADB40983.1| Alpha-L-fucosidase [Spirosoma linguale DSM 74]
          Length = 864

 Score =  366 bits (940), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 217/613 (35%), Positives = 318/613 (51%), Gaps = 48/613 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKA 71
           L + +N PA  +++A+P+GNG +GAMV+G    E L+LNE TL++G P   +   +  K 
Sbjct: 25  LTLWYNKPATVWSEALPLGNGYMGAMVFGDPAKEHLQLNEGTLYSGDPASTFKAINVRKD 84

Query: 72  LSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
              V +L+ + QY EA +   K   G    +YQ +GD  ++ D  H   A   YRR+ D+
Sbjct: 85  FKQVSALLAAKQYQEAQSLIAKEWLGRNHQLYQPMGDFWIDVD--HKNEAITDYRRQFDI 142

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS-YVNG 189
            TATA  +Y VGN  +TR +F+S PD VIV K++ +  G ++    L +  ++ + Y   
Sbjct: 143 ATATATTRYKVGNTTYTRTYFASYPDHVIVVKLTANGPGKINCTFHLSTPHESTARYAAQ 202

Query: 190 NNQIIMEGRCPG---------------------------KRIPPKANANDDPK--GIQFS 220
            N + M G+ PG                           +R P   N   D +  G+  +
Sbjct: 203 GNTLTMRGKVPGFGLRRTFEQIEKAGDQYKYPEVYEKNGQRKPGIDNMLYDRQINGLGMA 262

Query: 221 AILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMS 280
               +K+    G I   ++  L V+ +   V +L A++S++G   +P+    DP      
Sbjct: 263 FETRVKVQHTGGRIRQ-DNNALTVQDASEVVFVLSAATSYNGFDKSPAYEGVDPKPILDQ 321

Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
             ++I   SY+ LY  HL DY+KLF RV IQL+           +E      P+ +RV+ 
Sbjct: 322 RFKAIEKKSYAALYQTHLADYKKLFDRVDIQLA-----------AETEQSQRPTDQRVEL 370

Query: 341 FQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
           F    DPS   L FQ+GRYL+I+ SRPG Q  NLQG+WN+ + P W+    +NIN +MNY
Sbjct: 371 FSNGLDPSFAALYFQYGRYLMIAGSRPGGQPLNLQGMWNDLMVPPWNGGYTININAQMNY 430

Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
           W +   NLSECQEP F  +  L+ING +TA+  Y   GWV HH  DIW + +        
Sbjct: 431 WPAELTNLSECQEPFFKAVKELAINGHETARSMYGNDGWVAHHNMDIW-RHAEPVDLCNC 489

Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
           + WPM   WL +H WE Y ++ D  FL+K  +PLL+G   F   WL++   GYL T    
Sbjct: 490 SFWPMAAGWLTSHFWERYLFSGDPIFLKKEVFPLLKGAVQFYQGWLVKNEQGYLVTPVGH 549

Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
           SPE  F+  D K A  S   TMDMAI+RE FS  + A + L   +D     V ++L +L 
Sbjct: 550 SPEQNFLYDDKKQATFSPGPTMDMAIVRESFSRYLEACKTLGITDD-FTAGVKQNLSQLL 608

Query: 581 PTKIAEDGSIMEW 593
           P +I + G + EW
Sbjct: 609 PYQIGKYGQLQEW 621


>gi|261416181|ref|YP_003249864.1| alpha-L-fucosidase [Fibrobacter succinogenes subsp. succinogenes
           S85]
 gi|385791048|ref|YP_005822171.1| carbohydrate binding protein, CMB family 6 [Fibrobacter
           succinogenes subsp. succinogenes S85]
 gi|261372637|gb|ACX75382.1| Alpha-L-fucosidase [Fibrobacter succinogenes subsp. succinogenes
           S85]
 gi|302326443|gb|ADL25644.1| carbohydrate binding protein, CMB family 6 [Fibrobacter
           succinogenes subsp. succinogenes S85]
          Length = 999

 Score =  366 bits (940), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 226/595 (37%), Positives = 328/595 (55%), Gaps = 53/595 (8%)

Query: 8   STTNPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
           +T NPL + +N  A   FT+A+PIGNG +G +++GGV  + + LNE T+W+G PGD    
Sbjct: 30  TTDNPLTLWYNSDAGTEFTNALPIGNGYMGGLIYGGVEKDYIGLNESTVWSGGPGDNNKQ 89

Query: 67  DAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
            A   L D R  +  G Y  A +  S  + G     +Q +GD  L    SH       YR
Sbjct: 90  GAASHLKDARDALWRGDYRTAESIVSQYMIGPGPASFQPVGD--LVISTSH--KGSSNYR 145

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           RELDL TA A+  Y+VG V+ TRE+F+S PD VIV  +S  + GS+SF  ++ +   N+ 
Sbjct: 146 RELDLKTAIAKTTYTVGGVKHTREYFASYPDHVIVVHLSADKDGSVSFGATMTTPHRNNR 205

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
             +  N +I +                    I+F     + +  D GT+S + +  + V+
Sbjct: 206 MTSSGNTLIYDVTV---------------NSIKFQN--RLTVVADGGTVS-VSNGNINVQ 247

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G++ A L+L  +++F     + +D   DP + +   +  +   SY DL   HL DYQ +F
Sbjct: 248 GANSATLILTTATNFK----SYNDVSGDPGAIASEIMSKVAKKSYEDLLAAHLKDYQTIF 303

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
           +RV + L  + K       S  +I    ++ RVK+F +  DPSLVEL +Q+GRYLLI+SS
Sbjct: 304 NRVKLDLGTADK-------SAGDI----TSTRVKNFNSTNDPSLVELHYQYGRYLLIASS 352

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           R G Q ANLQGIWN+D +P W S    NINLEMNYW +   NL EC  PL D +  +   
Sbjct: 353 RKGGQPANLQGIWNKDTNPIWGSKYTTNINLEMNYWPAESGNLEECVWPLIDKIKSMVPQ 412

Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT-MD 483
           G KTA+V++ +  GWV HH TD+W +S+   G   W LWP G  WL THLWEH+ Y   D
Sbjct: 413 GEKTAKVHWGVDEGWVEHHNTDLWNRSAPIDG--AWGLWPTGAGWLTTHLWEHFLYNPTD 470

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
           + +L+   Y  ++G A F ++ L+E     + YL T PS SPE++     G   C  +  
Sbjct: 471 KAYLQD-VYSTMKGAALFFVNSLVEEPTTGNKYLVTAPSDSPENDH---GGYNVC--FGP 524

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           TMD  IIR+V +  I A+++L  +ED +  K+  ++ RL PTK  + G I EW+Q
Sbjct: 525 TMDNQIIRDVLNYTIEASKILGVDED-VRAKMEATVKRLPPTKTGKYGQITEWLQ 578


>gi|255035049|ref|YP_003085670.1| hypothetical protein Dfer_1256 [Dyadobacter fermentans DSM 18053]
 gi|254947805|gb|ACT92505.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 768

 Score =  366 bits (940), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 214/601 (35%), Positives = 323/601 (53%), Gaps = 79/601 (13%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           PL + +  PA+ + +A+PIGNG L AM++GGV +E ++ NE+TLWTG P  Y +  A   
Sbjct: 25  PLTLWYEQPARQWEEALPIGNGALAAMIFGGVETEQIQFNEETLWTGEPRSYAHKGASAY 84

Query: 72  LSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRREL 128
           L  +R L++ G+  EA A A+ +    P     YQ  GD+ L+F   H+++    Y REL
Sbjct: 85  LEQIRRLLNEGKQKEAEALANEQFMSQPMRQMAYQAFGDVYLDFP-GHVQH--RAYHREL 141

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           DL  AT +  Y  G V +TRE F+S P + I   I+ S+   L F V + ++        
Sbjct: 142 DLRAATVKSSYESGGVRYTREAFASYPAKAIYYHINSSQKSKLDFTVRMSTI-------- 193

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE---------- 238
                            PK NA  +         +E+++  + G +  L           
Sbjct: 194 --------------HAKPKVNAEKN--------TIELEVQVENGALHGLARLKLLTDGKL 231

Query: 239 ---DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
              D K++V G+  A ++L A++++    IN  +   DP ++  +ALQ+  +  Y    +
Sbjct: 232 KTADGKIEVTGATSATIVLSAATNY----INYKNVNGDPRAKVTAALQNAPD-DYKKAAS 286

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLF 354
            HL DYQKLF+R ++ L  S                +P+ +R+  F+ + +DP+L+ L  
Sbjct: 287 GHLADYQKLFNRFALDLPASKGS------------ALPTDQRLSQFKHNPDDPALLALYV 334

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           QF RYLLI+SSRPGT  ANLQG WN  L+P+WDS   VNIN EMNYW +   NLSEC +P
Sbjct: 335 QFARYLLITSSRPGTHPANLQGKWNHKLNPSWDSKYTVNINTEMNYWPAELTNLSECHQP 394

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
           LF  +  +S  G++ A+ +Y A+GWV+HH TD+W + +A        +W  GGAWL  HL
Sbjct: 395 LFQMVKEVSETGAEVAKEHYNANGWVLHHNTDVW-RGAAPINASNHGIWVTGGAWLSLHL 453

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKL 533
           WEHY +T D+ FL+  AYPL++G A F LD+L++    G+L ++PS SPE      +G L
Sbjct: 454 WEHYRFTEDKAFLQNTAYPLMKGAAQFFLDFLVKDPKTGHLVSSPSNSPE------NGGL 507

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
                  TMD  IIR +F A    A +L K +    +K+ ++  ++ P +I   G + EW
Sbjct: 508 VA---GPTMDHQIIRALFKACAETAGIL-KTDAVFAQKLTETAKQIAPNQIGRHGQLQEW 563

Query: 594 V 594
           +
Sbjct: 564 M 564


>gi|325927089|ref|ZP_08188358.1| hypothetical protein XPE_2359 [Xanthomonas perforans 91-118]
 gi|325542534|gb|EGD14007.1| hypothetical protein XPE_2359 [Xanthomonas perforans 91-118]
          Length = 790

 Score =  366 bits (939), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 225/596 (37%), Positives = 322/596 (54%), Gaps = 48/596 (8%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
            +    L++ +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+P
Sbjct: 39  VAAAQALQLWYREPANQWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98

Query: 67  DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
           DA  AL  VR+L+ +G+YAEA   A  KL   P     YQ LGD+ L+FD +        
Sbjct: 99  DALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YRR+LDL+TA A   +  G     RE F S   Q IV ++S    G +S  V +DS    
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCDRPGGISVRVGIDSPQTG 215

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKK 241
                    ++  GR            N    GI+      +++      G +S + D +
Sbjct: 216 EVTAE-QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVRGGKLSQVRD-R 261

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L+++ +D  VLLL A++S+     +  D   DP + + + L+    L +  L   HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLASTAACLRKAAKLDFPALLRAHLADH 317

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q+LF RV+I L  S                +P+ ERV+ F    DP+L  L  Q+GRYLL
Sbjct: 318 QRLFRRVAIDLGSSAA------------TQLPTDERVQRFAEGNDPALAALYHQYGRYLL 365

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I SSRPGTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L  
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           L+  G++TA+  Y A GWV+H+ TD+W ++    G   W+LWP+GG WL   LW+ ++Y 
Sbjct: 426 LAQTGARTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPLGGVWLLQQLWDRWDYG 484

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
            DR +L K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C   S 
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWVQ 595
            MD  ++R++F+  I+ +++L    DA   + L +L  +L P +I + G + EW Q
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLL--GIDAEFAQQLAALREQLPPNRIGKAGQLQEWQQ 593


>gi|146298534|ref|YP_001193125.1| hypothetical protein Fjoh_0772 [Flavobacterium johnsoniae UW101]
 gi|146152952|gb|ABQ03806.1| Candidate alpha-L-fucosidase; Glycoside hydrolase family 95
           [Flavobacterium johnsoniae UW101]
          Length = 802

 Score =  366 bits (939), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 213/590 (36%), Positives = 337/590 (57%), Gaps = 35/590 (5%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKALSDV 75
           +  PA+ F +++ +GNG++G+ V+GGV S+ + LN+ TLW+G P +   NP+A K +  +
Sbjct: 32  YKQPAEFFEESLVLGNGKMGSTVFGGVNSDKIYLNDITLWSGEPVNANMNPEAYKNIPAI 91

Query: 76  RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
           R  + +  Y  A   + K+ G  ++ Y  LG +E+   ++  K     YRRELD++ A +
Sbjct: 92  RETLQNENYKLAEELNKKVQGKNSESYAPLGTLEI---NNSEKGKAVNYRRELDISNAVS 148

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
           +V Y +  +++TRE+F S  DQ+++ K++  + G+L+F+++L SLL ++  V  NN ++M
Sbjct: 149 KVSYEMAGIKYTREYFVSAQDQIMIIKLTADQKGALNFDINLKSLLKSNVEVR-NNILVM 207

Query: 196 EGRCP-----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
            G  P     G  + PK  A  D +G +F+ +++IK +D + T S    + L ++ +  A
Sbjct: 208 TGSAPIHENAGYNVLPKYLALKD-RGTRFTGLVQIKKTDGKITSSR---ETLTLKDATEA 263

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           ++ +  ++SF+G   NP+    D  + +   L       +  +   H+ DYQK ++RV +
Sbjct: 264 IIYVSVATSFNGFDKNPASEGLDDIAIAAQNLNKAFEKPFDKIKESHIADYQKFYNRVDL 323

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGT 369
            L ++                +P+ ER+  +   +ED +L  L F +GRYLLISSSR   
Sbjct: 324 NLGKT------------TAPDLPTDERLLRYADGNEDKNLEILYFNYGRYLLISSSRTLG 371

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
             ANLQG+WN  LSP W S   +NINLE NYW +   NLSE  + L  F+  LS+ G  T
Sbjct: 372 VPANLQGLWNLHLSPPWSSNYTMNINLEENYWLAENTNLSEMHKSLLSFIKNLSVTGKVT 431

Query: 430 AQVNY-LASGWVIHHKTDIWAKSS--ADRGKV--VWALWPMGGAWLCTHLWEHYNYTMDR 484
           A+  Y +  GW   H +DIWA ++     GK   +WA WPM GAWL TH+WEHY +T D 
Sbjct: 432 AKTFYGVDKGWAAAHNSDIWAMTNPVGQFGKEDPMWACWPMAGAWLSTHIWEHYIFTQDE 491

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
            +L+K  YPL++G A F L WL+    G L T+PSTSPE+++   DG +    Y  T D+
Sbjct: 492 TYLKKEGYPLMKGAAEFCLGWLVTDKKGNLITSPSTSPENQYKLEDGFVGATFYGGTADL 551

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
           A+IRE F   I A++VL  N DA     L++ L +L P +I + G++ EW
Sbjct: 552 AMIRECFDKTIKASKVL--NTDASFRVKLETVLSKLHPYQIGKKGNLQEW 599


>gi|408671641|ref|YP_006870551.1| alpha-L-fucosidase [Emticicia oligotrophica DSM 17448]
 gi|387857648|gb|AFK05740.1| alpha-L-fucosidase [Emticicia oligotrophica DSM 17448]
          Length = 868

 Score =  366 bits (939), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 218/615 (35%), Positives = 319/615 (51%), Gaps = 60/615 (9%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKALSDV 75
           ++ PA  +T+A+PIGN  +GAM++G    E ++LNE TL++G P   + N    K    V
Sbjct: 31  YDKPASVWTEALPIGNSYMGAMIFGDSRQEHIQLNESTLYSGEPDATFKNISVRKYYQQV 90

Query: 76  RSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
             L+ +G+Y EA A   K L G    VYQ LGD    F+      A   Y+R LD+++AT
Sbjct: 91  TELLKAGKYQEADAIVAKELLGRNHQVYQPLGDFWANFEHGQ---AVSAYKRWLDISSAT 147

Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSYVNGNNQI 193
           A  +Y VGN +F R++F+S PD +IV K S   +  ++  +   +  +    Y    N +
Sbjct: 148 AYTEYVVGNTKFKRQYFASYPDHIIVVKFSTEGTDKINCTLRFTTPHISTAKYEANGNML 207

Query: 194 IMEGRCP---------------------------GKRIPPKANAND-------DPKGIQF 219
            M G+ P                           G R   KANA +         +GI F
Sbjct: 208 KMMGKAPYFVQRREFEQVESVGDQYKYPELYENDGTR---KANAKNILYDSTKGGRGISF 264

Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM 279
            +  + KI +  G +    D  +KVE +   V++L A++S++G   +PS   K+ +    
Sbjct: 265 ES--QAKILNLGGKLIRTGD-SIKVENASEIVVVLTAATSYNGFDKSPSKQGKNSSFLVN 321

Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
           S L+SI    ++ LY+ HL DY+KLF RV  +L+            E     +P+ +RV 
Sbjct: 322 SYLKSIEKKIFTQLYSTHLTDYKKLFDRVDFELAE-----------ETEQSKLPTDQRVS 370

Query: 340 SFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
            F   +DPS   L FQ+ RYL+I+ SRP  Q  NLQGIWN+ + P W+     NIN EMN
Sbjct: 371 LFSNGKDPSFPSLYFQYSRYLMIAGSRPNGQPLNLQGIWNDQIVPPWNGGYTTNINTEMN 430

Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV 459
           YW +   NLSEC EPLF  +  L++NG  TA+  Y   GW  HH  DIW +++    + +
Sbjct: 431 YWIAESTNLSECHEPLFKAIKELAVNGKNTAKFMYGNEGWTSHHNMDIW-RNAEPIDRCL 489

Query: 460 WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNP 518
            + WPMG  WL +H WE Y +T D+ FL+   YP+L+G   F   WL+ +   GYL T  
Sbjct: 490 CSFWPMGAGWLTSHFWERYLHTGDKVFLKNEVYPVLKGVVEFYQGWLVKDAKTGYLITPI 549

Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
             SPE  F+  D K A +S   TMDM I+RE F+  +   + L  N D LV+ + + LP+
Sbjct: 550 GHSPESYFLYEDNKRATISQGPTMDMGIVREAFARYVEMCQTLGIN-DELVKNIKQQLPQ 608

Query: 579 LRPTKIAEDGSIMEW 593
           L P +I + G + EW
Sbjct: 609 LLPYQIGKYGQLQEW 623


>gi|336414990|ref|ZP_08595333.1| hypothetical protein HMPREF1017_02441 [Bacteroides ovatus
           3_8_47FAA]
 gi|335941851|gb|EGN03702.1| hypothetical protein HMPREF1017_02441 [Bacteroides ovatus
           3_8_47FAA]
          Length = 815

 Score =  366 bits (939), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 229/591 (38%), Positives = 320/591 (54%), Gaps = 46/591 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA  +T+A+P+GN RLG MV+GG  SE L+LNE+T+W G P    NP A  AL
Sbjct: 25  LKLWYSRPATVWTEALPLGNSRLGVMVYGGAGSEELQLNEETVWGGGPHRNDNPKALAAL 84

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             +R LV  G+Y EA     + F  P +   YQ +G + L+F   H K  +  Y R+LD+
Sbjct: 85  PQIRQLVFEGRYREAQEMVAQNFETPRNGMPYQTIGSLMLDFP-GHEKATD--YYRDLDI 141

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A  +Y VG V + RE F+S  D VI+ +++ ++ G+LSF  S  S L +       
Sbjct: 142 ERAIATTRYKVGEVTYNREVFTSFVDNVIIVRLTANKQGTLSFTASYKSPLQH------- 194

Query: 191 NQIIMEGRCPGKRIPPKANANDD---PKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
                E R  GKR+       +    P  I+     E+K   + G    +  + ++V G+
Sbjct: 195 -----EVRKSGKRLVLIGKGTEHEGVPGAIRVETQTEVK---NEGGHVVVTGENIQVNGA 246

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D   L + A+++F    +N  D   D   +S S L   R   Y      H+  YQ  F+R
Sbjct: 247 DAVTLYISAATNF----VNYKDVSGDAHRKSKSYLDIARKKKYEQAREAHIAYYQNQFNR 302

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           V + L          T  E   +T     RVK F   +D SL  L+FQ+GRYLLISSS+P
Sbjct: 303 VKLDLG---------TSEEAKRET---HLRVKHFNKGKDVSLATLMFQYGRYLLISSSQP 350

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           G Q ANLQGIWN++L   WD    VNINLEMNYW S   NLSE   PL   L  LS  G 
Sbjct: 351 GGQPANLQGIWNDNLLAPWDGKYTVNINLEMNYWPSEVTNLSETHLPLMQMLKELSETGR 410

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           +TA+  Y   GWV+HH TDIW + +    K  W +WP GGAWLC HLW+HY +T D+ FL
Sbjct: 411 ETARTMYGCDGWVLHHNTDIW-RCTGLVDKAFWGMWPNGGAWLCQHLWQHYLFTGDKAFL 469

Query: 488 EKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMA 545
            K+AYP+++G + F L +L+E    G++ T PS SPEH     + K A  + +  TMD  
Sbjct: 470 -KKAYPIMKGASDFFLHFLVEHPKYGWMVTCPSNSPEHGPEGDEKKNAPSTVAGCTMDNQ 528

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRLRPTKIAEDGSIMEWVQ 595
           I+ ++FS  + A ++L   EDA+  K L K + RL P +I     + EW++
Sbjct: 529 IVFDLFSNTLQACKILM--EDAVYAKHLQKMIDRLPPMQIGRYNQLQEWLE 577


>gi|329925668|ref|ZP_08280486.1| hypothetical protein HMPREF9412_3835 [Paenibacillus sp. HGF5]
 gi|328939695|gb|EGG36038.1| hypothetical protein HMPREF9412_3835 [Paenibacillus sp. HGF5]
          Length = 767

 Score =  365 bits (938), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 222/590 (37%), Positives = 315/590 (53%), Gaps = 48/590 (8%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           I F+ PA+++ +A+PIGNGRLG MV+G V  E ++ NED++W G P D  NPDA   L  
Sbjct: 9   IWFDQPAQNWNEALPIGNGRLGGMVFGSVMQEKIQFNEDSVWYGGPRDRNNPDALLHLPL 68

Query: 75  VRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           +R L+  G+  EA   S   F G P     Y   GD  ++ D  H +     YRRELDL 
Sbjct: 69  IRKLLFEGRLKEAHRLSETAFSGTPRSQRPYMTAGDFCIQVD--HPQGELSHYRRELDLE 126

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS---YVN 188
            A     Y  G V FTRE F S PDQV+V ++     G+L+     +     H    +  
Sbjct: 127 KAITVTSYQYGGVTFTREVFCSYPDQVMVIRLEADRPGALTLTSRFERQKGKHMDAVHRA 186

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE-IKISDDRGTISALEDKKLKVEGS 247
           G + ++M   C GK             G+ +SA  + I +    GT+  +  + L V+ +
Sbjct: 187 GTDTVVMTNDCGGK------------DGLTYSAAAKAIAVG---GTVRVV-GEHLLVDQA 230

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D  V++L A+S+F       +D  K   +E    L+   N  Y+ L  RH+ DYQ LF R
Sbjct: 231 DEVVIILAAASTFR------ADDSKLRCNE---LLEHAANQGYAALKKRHIADYQPLFDR 281

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSR 366
           V + L            ++     VP+ +R++  +  D+D  L  L F FGRYLLI+ SR
Sbjct: 282 VKLDLG---------AAADREHHLVPTPKRLERVRAGDDDAGLYTLYFHFGRYLLIACSR 332

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG+  ANLQGIWN+ ++P WDS   +NIN +MNYW +  CNL EC EPLF+ +  +  NG
Sbjct: 333 PGSLPANLQGIWNDSMAPPWDSKFTININTQMNYWPAESCNLPECHEPLFELIERMKDNG 392

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
             TA+  Y   G+V HH TDIWA ++          W MG AWL  HLWEHY +  + DF
Sbjct: 393 RVTARKMYGCRGFVAHHNTDIWADTAPQDIYPPATQWVMGAAWLTLHLWEHYKFNPNPDF 452

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           L +RAY  ++  A F  D+L+E  +GYL TNPS SPE+ ++  +G+   + Y  +MD  I
Sbjct: 453 L-RRAYETMKEAALFFTDFLVESPEGYLVTNPSVSPENRYMLRNGESGTLCYGPSMDTQI 511

Query: 547 IREVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWVQ 595
           I E+FSA I A+  L+ +E A  E   +K   RL   K+   G + EW++
Sbjct: 512 ISELFSACIEASLELDTDESARREWAAIKD--RLPEMKVGRHGQLQEWLE 559


>gi|335437953|ref|ZP_08560710.1| alpha/beta hydrolase domain-containing protein [Halorhabdus
           tiamatea SARL4B]
 gi|334893557|gb|EGM31768.1| alpha/beta hydrolase domain-containing protein [Halorhabdus
           tiamatea SARL4B]
          Length = 784

 Score =  365 bits (938), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 213/596 (35%), Positives = 325/596 (54%), Gaps = 51/596 (8%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA  + +A+PIGNGRLGAM++G   +E ++ N DTLW G   D TNPDA + + +VR
Sbjct: 13  YDAPASAWLEAVPIGNGRLGAMLFGRPGTERVQFNADTLWAGGHEDSTNPDAREHVEEVR 72

Query: 77  SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            L+  G+   A A A   L G P  +  YQ  GD+ ++        A   YRRELDL+  
Sbjct: 73  RLLFDGEVERAQALADEHLMGDPFRLRPYQSFGDLSIDVGHD----AVTDYRRELDLSAG 128

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
             RV+Y      + RE+F+S PD  IV +++    GS++  V LD   D  +   G+  +
Sbjct: 129 VTRVRYDHDGTTYVREYFASAPDDAIVIRLATDSPGSVTATVGLDRERDARADARGDT-L 187

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS--------ALEDKKLKVE 245
            + G        P  +     +G+ F A    +++ D G +         A     L+ E
Sbjct: 188 TLRGTVVDD---PDDDRGAGGEGMAFEA--RARVTADGGDVQRVTGADAPAGSSVGLRTE 242

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            +D   + L   ++ +           DP     + L ++ +  Y DL   H+ D+++LF
Sbjct: 243 AADAVTIALTGFTTHE---------TDDPGEACEAVLDALADRPYHDLRETHVADHRELF 293

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            RV + L   P D  TD    E +D V + E        EDP L  L  QFGRYLLI+SS
Sbjct: 294 DRVELDLG-DPVDRPTD----ERLDRVAAGE--------EDPHLAALYAQFGRYLLIASS 340

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPGT+ ANLQG+WN++  P W+S   +N+NLEMNYW +L  NL+EC  PL+DF+  L   
Sbjct: 341 RPGTEPANLQGVWNQEFDPPWNSGYTLNVNLEMNYWPALQTNLAECAAPLYDFVDDLREP 400

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G + A+ +Y   G+ +HH +D+W +++A      W LWPMG AWL   +++HY +T D  
Sbjct: 401 GRRVAEAHYDCDGFAVHHNSDLW-RNAAPVDGARWGLWPMGAAWLSRLVFDHYAFTKDET 459

Query: 486 FLEKRAYPLLEGCASFLLDWLIE--GHDG----YLETNPSTSPEHEFIAPDGKLACVSYS 539
           FL + AYP+L   A+F+LD+L+E    +G    +L T PS SPE+ ++  DG+ A V+Y+
Sbjct: 460 FLRETAYPILREAAAFVLDFLVEHPAEEGEAEDWLVTAPSISPENAYVTDDGEEATVTYA 519

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            TMD+ + R++F   I AAE+L+  E A  +++  +L RL P ++   G + EW++
Sbjct: 520 PTMDVQLTRDLFEHTIDAAEILDV-ESAFHDELRAALDRLPPMQVGAHGQLQEWIE 574


>gi|340616355|ref|YP_004734808.1| alpha-L-fucosidase [Zobellia galactanivorans]
 gi|339731152|emb|CAZ94416.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
          Length = 791

 Score =  365 bits (937), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 220/595 (36%), Positives = 337/595 (56%), Gaps = 44/595 (7%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           + T     LK+ ++ PA+ + +A+P+GNG LGAMV+G    E ++ NEDT W G P   +
Sbjct: 29  KETGGKAELKLWYDRPAEIWEEALPVGNGSLGAMVFGRPVMERIQFNEDTFWAGGPITPS 88

Query: 65  NPDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELE---FDDSHLKYA 120
            P+    L +VR LV  G+Y EA A   K + G     Y  +GD+ +E    DD      
Sbjct: 89  KPETKSYLPEVRKLVFDGKYKEADALINKHIIGPKMMPYLPMGDVVIEMKGLDDI----- 143

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              +RRELDL TA ++V +S   + + RE FS+  +  IV ++  S+  SL+F+++LD+ 
Sbjct: 144 -TDFRRELDLRTAISKVGFSSKGIAYKREVFSAVEENAIVIRLEASKEKSLNFSIALDNQ 202

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
           +   S V   N + + G  P +     AN   +   ++F + L I  +D    I+   D 
Sbjct: 203 IGATSQVLDANNLELSGTAPDR-----ANRKSE---LRFVSRLNIGENDGHTIIN---DS 251

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + V G+    LLL A+++F     N  D   +P  +  + L  +   S+  +  +H+ +
Sbjct: 252 TITVSGASKVTLLLFAATNFK----NYKDVSGNPDFKCKTLLDLVHLKSFEQIREQHITN 307

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           +Q+LF R+         D+ T++ S      +P+ ER++ FQ + DPSLV L +QFGRYL
Sbjct: 308 HQRLFERLDF-------DMPTNSNS-----GLPTNERLEKFQEETDPSLVALYYQFGRYL 355

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           L+SSSR  +Q ANLQGIWN++ +P WDS    NINLEMNYW +   NL+EC  PLF  + 
Sbjct: 356 LMSSSRGNSQPANLQGIWNQNPTPPWDSKYTTNINLEMNYWPAEASNLAECAIPLFTSIR 415

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            L+  G+ TA+ NY A GWV+HH TDIW  ++   G   W +WP GGAWL THLWEHY +
Sbjct: 416 QLAEAGAVTAKNNYGADGWVLHHNTDIWKTTTPLDG-AAWGIWPTGGAWLTTHLWEHYLF 474

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYS 539
           + D  FL +  YP+++G A F ++ L+   + GYL TNPS SPE+  +  +G ++ V   
Sbjct: 475 SEDEAFL-RLHYPVIKGAAEFFVNTLVAHPEYGYLVTNPSISPENRHM--EGNIS-VCAG 530

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             MD  +IR++F+  I A+E+L  + D   E ++++  +L P KI  +G + EW+
Sbjct: 531 PAMDTQLIRDLFAQCIKASEILNVDSD-FRELLVETRSKLAPDKIGSEGQLQEWL 584


>gi|159127378|gb|EDP52493.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
          Length = 745

 Score =  365 bits (937), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 215/586 (36%), Positives = 324/586 (55%), Gaps = 47/586 (8%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA ++ +A+P+GNGRLGAMV+G   +E L+LNED++W G P +    DA + L  +R
Sbjct: 7   YQQPAGNWEEALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQNRVPHDAFECLPRLR 66

Query: 77  SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           SL+  G +AEA     +  F HP     Y+ LG + L+F   HL    + YRR LD+  A
Sbjct: 67  SLIREGNHAEAEKLVRLAFFSHPISQRHYEPLGTLFLDF--GHLPECTQNYRRSLDIERA 124

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL--DNHSYVNGNN 191
           T RV+Y    V+  RE  +SNPD VI  ++  S+    +  ++  S L  + + Y++   
Sbjct: 125 TTRVEYEHKGVKVRREVIASNPDSVIAIRVQASQKTDFTLRLTRMSELQYETNEYLD--- 181

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
            +  E R     I P  +     K  +   +++++ ++D+ +++ + +K L V   D A+
Sbjct: 182 DVTTEDRTITMHITPGGH-----KSNRACCMVKVRTAEDQDSVTQIGNKLL-VNAQD-AL 234

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           +L+ A +++        D  K  +S+  +AL      S  +++ RH++DY+ L+ R+ + 
Sbjct: 235 ILISAQTTY-----RCDDIDKKASSDLETALLH----STDEIWERHVNDYRSLYGRMELH 285

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           LS S  D+ TD                K  +   DP L+ L   + RYLLIS SR G +V
Sbjct: 286 LSPSNCDMPTD----------------KRIKNSRDPGLIALYHNYCRYLLISCSRNGDKV 329

Query: 372 --ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
             A LQGIWN    P W     +NINL+MNYW +  CNLS+C+ PLF  L  ++ +G +T
Sbjct: 330 LPATLQGIWNPSFHPAWGCKYTININLQMNYWPANICNLSDCEMPLFSLLERVAKSGEET 389

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           AQ  Y   GWV HH TDIWA +S     +   LWP+GGAWLC H+W+H+ +T D++FLE 
Sbjct: 390 AQKMYGCRGWVAHHCTDIWADTSPGDTWMPATLWPLGGAWLCVHIWDHFRFTRDKEFLE- 448

Query: 490 RAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
           R +P+L+GC  FLLD+L+E   G YL TNPS SPE+ F   +G+   +   ST+D+ I+ 
Sbjct: 449 RMFPILQGCVQFLLDFLVEDASGEYLVTNPSLSPENTFYEKNGERGVLCEGSTIDIQIVN 508

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            V SA + + E LE   D L    L +L RL P +I   G + EW 
Sbjct: 509 AVLSAYLKSVEELEI-VDKLAPAALDALHRLPPLRIGSFGQLQEWA 553


>gi|386819251|ref|ZP_10106467.1| hypothetical protein JoomaDRAFT_1168 [Joostella marina DSM 19592]
 gi|386424357|gb|EIJ38187.1| hypothetical protein JoomaDRAFT_1168 [Joostella marina DSM 19592]
          Length = 818

 Score =  365 bits (936), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 215/590 (36%), Positives = 333/590 (56%), Gaps = 51/590 (8%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA ++ +A+PIGNGR+GAM++GG   + ++LNE+T+W G PG+    D  + +  +R
Sbjct: 27  YDEPADNWNEALPIGNGRIGAMLYGGEKVDQIQLNEETVWAGSPGNNIAKDYYQDVESIR 86

Query: 77  SLVDSGQYAEATAASVKLF----------GHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
            L+ +G+Y EA   ++++F          G P   YQ +G+I+L F + H K +   +RR
Sbjct: 87  ELLFNGKYTEAQQKALEVFPKNTPDNTNYGMP---YQTVGNIKLAFKN-HNKIS--NFRR 140

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
           EL++  A A+V Y    V++ R++F S PDQV+   +  ++S  L+F++ + S    H  
Sbjct: 141 ELNIENAVAKVSYLADGVQYNRQYFVSYPDQVMAIHLQANKSEKLNFDIEIQSA-QKHVA 199

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
              NN + ++G    +         + P  ++FS ++  KI  +   +S   + KL VE 
Sbjct: 200 SIENNILHLKGVSETRE--------NKPGKVKFSTLIYPKIIGEGKIVS--REGKLSVEK 249

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +   +L +   ++F       +D        ++  L +++N S   L   H++DYQ LF 
Sbjct: 250 AQEVLLFISIGTNFK----KYNDLSNAEDEVALKFLNNVKNKSIEALLESHIEDYQDLFK 305

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV ++L +            EN+  + + ER+K+F  + D SL+ L FQFGRYLLISSSR
Sbjct: 306 RVDLKLGK------------ENLSNLTTDERLKTFSKNHDLSLISLYFQFGRYLLISSSR 353

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
            G Q ANLQGIWN  LSP WDS   VNIN EMNYW +   NLSE   PLF  L  LS  G
Sbjct: 354 EGGQPANLQGIWNNKLSPPWDSKYTVNINTEMNYWPAEVTNLSELHAPLFSMLEDLSETG 413

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            ++A   Y A GW +HH TDIW  S    G   +  WPMGGAWL  HLW+H+ +T D +F
Sbjct: 414 KESAHKMYHARGWNMHHNTDIWRISGIVDGG-FYGFWPMGGAWLSQHLWQHFLFTGDINF 472

Query: 487 LEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L K+ YP+L+  A F +D L  E  +G+L   PS SPE+++I  DG    V+Y +TMD  
Sbjct: 473 L-KKYYPILKETALFYVDVLQKEPKNGWLVVTPSISPENKYI--DG--VGVTYGTTMDNQ 527

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           ++ +VF+ +I+AA+ L  + D  ++ V +   +L P +I +   + EW++
Sbjct: 528 LVFDVFNNVITAAKTLNIDAD-FIKVVEEKKSKLPPMQIGKHAQLQEWIE 576


>gi|409196602|ref|ZP_11225265.1| alpha-L-fucosidase [Marinilabilia salmonicolor JCM 21150]
          Length = 823

 Score =  365 bits (936), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 223/596 (37%), Positives = 328/596 (55%), Gaps = 59/596 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PAK + +A+PIGNGRLGAMV+G    E ++LNE+T W+G P    NP A +AL
Sbjct: 30  LKLWYDKPAKVWNEALPIGNGRLGAMVFGDPTLENIQLNEETFWSGSPSRNDNPKAIEAL 89

Query: 73  SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
            +VR+L+  G+Y EA         + +L G    +YQ +G++ L F+  H  Y+   Y R
Sbjct: 90  PEVRNLIFEGKYHEAEKIVNENMVAEQLHG---SMYQTIGNLNLTFE-GHENYS--NYSR 143

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
           ELD+  A     Y+V +V F RE F+S PDQVIV K+S  +  SLSF  +L   L  ++ 
Sbjct: 144 ELDIEKALHTTSYTVDDVNFKREIFASFPDQVIVVKLSADQPESLSFTANLIGPLAKNTK 203

Query: 187 VNGNNQIIMEG------RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
               + + M G      R  GK              ++F+ + +I  +D  G  SA  DK
Sbjct: 204 AVDASTLEMTGISGNHERVEGK--------------VEFNTLAKILNTD--GATSADGDK 247

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
               + S+  +L+ +A++     F++      D   +    L + +   YS++   H+ D
Sbjct: 248 ITVKDASEVVILISMATN-----FVDYKTLTADENEKCRKFLTAAQTKEYSEIKEAHIRD 302

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           Y+K F R S+ L  +P                P+  R+K+F    DP+LV L +QFGRYL
Sbjct: 303 YRKYFTRSSLDLGTTPAS------------QRPTDVRIKNFSHTNDPALVSLYYQFGRYL 350

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSRPG Q ANLQGIWN   +P WDS   +NIN EMNYW +   NL E  EPL + + 
Sbjct: 351 LISSSRPGGQPANLQGIWNNSTNPAWDSKYTININTEMNYWPAEKTNLPELHEPLIEMVK 410

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            LS  GS+TA+  Y  +GWV HH TDIW  +    G   W +WPMGGAWL  HLW+ Y Y
Sbjct: 411 DLSEAGSQTARNMYGCNGWVTHHNTDIWRITGVVDG-AFWGMWPMGGAWLTQHLWDKYLY 469

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           + +R++L    YP+++    F  D+L+E   +G+L  NPS SPE+   AP G+   V+  
Sbjct: 470 SGNREYLAS-VYPIMKSACKFYQDFLVEEPSNGWLVVNPSNSPEN---APVGR-PSVTAG 524

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +TMD  I+ ++F+    AA +L ++E  L+    + + RL P +I + G + EW++
Sbjct: 525 ATMDNQILFDLFTKTKKAATLLNEDE-KLINDFQRIIDRLPPMQIGQHGQLQEWME 579


>gi|375145023|ref|YP_005007464.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
 gi|361059069|gb|AEV98060.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
          Length = 834

 Score =  365 bits (936), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 216/592 (36%), Positives = 330/592 (55%), Gaps = 50/592 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L++ +  PA+ + +A+P+GNG+LG MV+GG   E + ++EDTLWTG P       AP+ L
Sbjct: 46  LELWYQKPAEKWLEALPVGNGKLGGMVFGGPVQERISISEDTLWTGGPYQPAVEVAPETL 105

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEET--YRREL 128
           + +R L   G++AEA     +L G P     YQ +G+++L F D       ET  YRR L
Sbjct: 106 ASIRKLSFEGKFAEAQELVKQLQGKPHRQAAYQTVGEVQLNFSD-----ITETSDYRRSL 160

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL-DNHSYV 187
           +L    A V+++     +  + F+S PD VIVT+I+  +   +   ++  SL  D    +
Sbjct: 161 NLQNGVAGVQFTANGTFYKHKTFASYPDHVIVTRITAGKP--IHLTITCTSLHPDKKLTI 218

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
            GNN +IM+G+     +         P  + +   + ++I   RG +    D  ++V G+
Sbjct: 219 AGNNTLIMDGKNGDLVVEGDGTI---PAALTWQCRVLVQI---RGGVQTAVDNGIQVIGA 272

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D  ++L  A++S+    +  +D    P     + ++     SY  L+  HL DYQ LF++
Sbjct: 273 DEVLILTTAATSY----VRYNDVSGKPDQLCAAVIKKCIAKSYDILFEAHLKDYQPLFNK 328

Query: 308 VSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           V ++L+  +P ++             P+ ER+K+F T  DPSL  L FQ+GRYLL++SSR
Sbjct: 329 VKLKLTNLAPSNL-------------PTTERIKNFATGNDPSLAALYFQYGRYLLLTSSR 375

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG+Q ANLQG WN+ LS +W     VNIN EMNYW +   NL+ C+ PL + +  L+I G
Sbjct: 376 PGSQPANLQGRWNDSLSASWGGKYTVNINTEMNYWPAQKTNLASCELPLLELVKDLAITG 435

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
             TAQ  Y A GWV HH TD+W +S+A      +  WP GGAWLC HL++HY Y+ D  +
Sbjct: 436 QITAQKTYHARGWVCHHNTDLW-RSTAPIDSAFFGQWPTGGAWLCNHLYQHYLYSGDTAY 494

Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS--STMD 543
           L++  YPL++G A F  D L+ E   G+  T+PS SPE      +G+   VS S   TMD
Sbjct: 495 LQE-LYPLMKGSARFFFDTLVQEPKHGWYVTSPSMSPE------NGRAKGVSNSPGPTMD 547

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWV 594
           M I+RE+F+   +AA VL+K+ D   +K    +  +L P +I + G + EW+
Sbjct: 548 MQILRELFTHCATAAAVLKKDAD--FQKACNDMVFKLAPDQIGKGGQLQEWL 597


>gi|289669688|ref|ZP_06490763.1| hypothetical protein XcampmN_14597 [Xanthomonas campestris pv.
           musacearum NCPPB 4381]
          Length = 790

 Score =  364 bits (935), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 221/595 (37%), Positives = 324/595 (54%), Gaps = 46/595 (7%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
            +    L++ +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+P
Sbjct: 39  VAAAQALQLWYREPANQWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98

Query: 67  DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
            A  AL  VR+L+ +G+YAEA   A   L   P     YQ LGD+ L+FD +        
Sbjct: 99  GALAALPQVRALIFAGRYAEAEQLADATLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YRR+LDL+TA A   +  G     RE F S   Q IV ++S    G +S  V +DS   +
Sbjct: 156 YRRQLDLDTAVAITTFRSGEAVHRREVFVSAQAQCIVVRLSCDHPGGISLRVGIDSP-QS 214

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKK 241
                    ++  GR            N    GI+      +++      G +S + D+ 
Sbjct: 215 GDVTAEQGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVTGGKLSQVRDR- 261

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L++E +D  VLLL A++S+     +  D   DP + + ++L+   +L +  L   HL D+
Sbjct: 262 LRIEAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAASLRKAASLDFPALLHAHLADH 317

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q+LF RV+I L  S            +   +P+ ERV+ F    DP+L  L  Q+GRYLL
Sbjct: 318 QRLFRRVAIDLGSS------------DAAQLPTDERVQRFAEGNDPALAALYHQYGRYLL 365

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I SSRPGTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    + EC EPL   +  
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANAMHECVEPLESMVFD 425

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           L+  G+ TA+  Y ASGWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y 
Sbjct: 426 LAKTGAHTAKAIYDASGWVVHNNTDLWRQAGPIDG-AKWSLWPMGGVWLLQQLWDRWDYG 484

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
            DR +L K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C     
Sbjct: 485 RDRAYLSK-IYPLFKGAAEFFVATLLRDPQTGAMVTNPSMSPENQH--PFGAAVCA--GP 539

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           TMD  ++R++F+  I+ +++L  + + L +++     +L P +I + G + EW Q
Sbjct: 540 TMDAQLLRDLFAQCIAMSKLLGVDAE-LAQQLATLREQLPPNRIGKAGQLQEWQQ 593


>gi|294666331|ref|ZP_06731579.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292603880|gb|EFF47283.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 830

 Score =  364 bits (934), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 226/591 (38%), Positives = 321/591 (54%), Gaps = 50/591 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L++ +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+PDA  AL
Sbjct: 85  LQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSPDALAAL 144

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             VR+L+ +G+YAEA   A  KL   P     YQ LGD+ L+FD +        YRR+LD
Sbjct: 145 PQVRALIFAGRYAEAEKLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISDYRRQLD 201

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L+TA A   +  G     RE F S   Q IV ++S +  G +S  V +DS   N      
Sbjct: 202 LDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCNRPGGISLRVGIDSP-QNGEVTAE 260

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKKLKVEGS 247
              ++  GR            N    GI+      +++      G +S + D+ L++E +
Sbjct: 261 QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVSGGKLSQVRDR-LRIEAA 307

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D  VLLL A++S+     +  D   DP + + ++L+    L +  L   HL D+Q+LF R
Sbjct: 308 DEVVLLLSAATSYQ--RFDAVDG--DPLALTAASLRRAAKLDFPALSRAHLADHQRLFRR 363

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           V+I L  S      D          P+ ERV+ F    DP+L  L  Q+GRYLLI SSRP
Sbjct: 364 VAIDLGSS------DALQR------PTDERVQRFAEGNDPALAALYHQYGRYLLICSSRP 411

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           GTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L  L+  G+
Sbjct: 412 GTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFDLAQTGA 471

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
            TA+  Y A GWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y  DR +L
Sbjct: 472 HTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYGRDRAYL 530

Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
            K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C   S  MD  +
Sbjct: 531 SK-IYPLFKGAAEFFVATLMRDPQTGAMVTNPSISPENQH--PFGAAVCAGPS--MDAQL 585

Query: 547 IREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +R++F+  I+ +++L  +      +  + + LP   P +I + G + EW Q
Sbjct: 586 LRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQ 633


>gi|294626600|ref|ZP_06705197.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292599020|gb|EFF43160.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 830

 Score =  364 bits (934), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 226/591 (38%), Positives = 320/591 (54%), Gaps = 50/591 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L++ +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+PDA  AL
Sbjct: 85  LQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSPDALAAL 144

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             VR+L+ +G+YAEA   A  KL   P     YQ LGD+ L+FD +        YRR+LD
Sbjct: 145 PQVRALIFAGRYAEAEKLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISDYRRQLD 201

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L+TA A   +  G     RE F S   Q IV ++S    G +S  V +DS   N      
Sbjct: 202 LDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCDRPGGISLRVGIDSP-QNGEVTAE 260

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKKLKVEGS 247
              ++  GR            N    GI+      +++      G +S + D+ L++E +
Sbjct: 261 QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVSGGKLSQVRDR-LRIEAA 307

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D  VLLL A++S+     +  D   DP + + ++L+    L +  L   HL D+Q+LF R
Sbjct: 308 DEVVLLLSAATSYQ--RFDAVDG--DPLALTAASLRRAAKLDFPALSRAHLADHQRLFRR 363

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           V+I L  S      D          P+ ERV+ F    DP+L  L  Q+GRYLLI SSRP
Sbjct: 364 VAIDLGSS------DALQR------PTDERVQRFAEGNDPALAALYHQYGRYLLICSSRP 411

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           GTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L  L+  G+
Sbjct: 412 GTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFDLAKTGA 471

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
            TA+  Y A GWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y  DR +L
Sbjct: 472 HTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYGRDRAYL 530

Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
            K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C   S  MD  +
Sbjct: 531 SK-IYPLFKGAAEFFVATLVRDPQTGAMVTNPSISPENQH--PFGAAVCAGPS--MDAQL 585

Query: 547 IREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +R++F+  I+ +++L  +      +  + + LP   P +I + G + EW Q
Sbjct: 586 LRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQ 633


>gi|192359217|ref|YP_001984046.1| alpha-L-fucosidase [Cellvibrio japonicus Ueda107]
 gi|190685382|gb|ACE83060.1| alpha-L-fucosidase, putative, afc95B [Cellvibrio japonicus Ueda107]
          Length = 839

 Score =  364 bits (934), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 220/599 (36%), Positives = 330/599 (55%), Gaps = 44/599 (7%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           ++ S++T   L++ +N PA  +  A+PIGNGRLGAMV+G    E L+LNEDT+W G P +
Sbjct: 37  SSHSSATKQDLRLWYNTPASDWNQALPIGNGRLGAMVFGQPAQEQLQLNEDTIWAGGPNN 96

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHL 117
             NP A + +  V  L+  GQ+ +A   + +       G P   YQ LG++ L+F   H 
Sbjct: 97  NVNPAAAQTIEQVTRLLLQGQHQQAQTLADQQIRSLNNGMP---YQTLGNLRLDFA-GHG 152

Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
           +   + Y R+LDL  A ARV Y    V FTRE FSS  DQVIV ++S S+ G ++  +  
Sbjct: 153 QV--DDYYRDLDLANAIARVSYVKAGVTFTRELFSSLSDQVIVVRLSASKPGQINTRIGF 210

Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
           DS + +   V+    + ++GR         ++   D K I+F+A++  ++   RG     
Sbjct: 211 DSPMQHQLSVH-ERWLQVDGRG-------GSHEGLDGK-IRFTALIAPEL---RGGTLRR 258

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
           +DK L++EG+D  ++ + A+++F    +  +D   D  + + + L +     ++ L   H
Sbjct: 259 DDKALRIEGADEVLIRIAAATNF----VRYNDLGGDSLARAQAYLSAAEGKGFAQLQQAH 314

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
           +  YQ  F+RVS+ L  S                 P+ +R+  F   +DP L  L FQ+G
Sbjct: 315 VAAYQAQFNRVSLDLGTSAAM------------ARPTDQRIAEFAHSQDPHLAMLYFQYG 362

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLISSS+PGTQ ANLQGIWN   SP WDS   VNIN EMNYW +    L E  +PLF 
Sbjct: 363 RYLLISSSQPGTQPANLQGIWNPHTSPPWDSKYTVNINTEMNYWPAEVTQLPELHQPLFA 422

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            L  L++ G  +AQ  Y A GW++HH TD+W + +    K  +  W  GGAWLC H+W H
Sbjct: 423 MLEDLALTGRASAQQLYGARGWMMHHNTDLW-RITGQVDKAFYGQWQTGGAWLCQHIWYH 481

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           Y ++ DRDFL+ R YP+L   + F +D L +E + G L   PS SPE+ +    G    +
Sbjct: 482 YLHSGDRDFLQ-RYYPVLREASRFFVDSLTLEPNSGALVVVPSNSPENTY-ERAGYPTSI 539

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +TMD  ++ ++FS  I AA +L  + D L  ++ +   RL P +I   G + EW++
Sbjct: 540 SAGTTMDNQLVFDLFSITIDAAHILGVDSD-LAAQLRQKRERLAPMRIGHFGQLQEWLE 597


>gi|427384395|ref|ZP_18880900.1| hypothetical protein HMPREF9447_01933 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727656|gb|EKU90515.1| hypothetical protein HMPREF9447_01933 [Bacteroides oleiciplenus YIT
           12058]
          Length = 809

 Score =  364 bits (934), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 210/585 (35%), Positives = 316/585 (54%), Gaps = 41/585 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PAK +T+A+P+GN RLGAM++GGV +E ++LNE+T+W G P    +P A   L
Sbjct: 23  LKLWYSQPAKVWTEALPLGNSRLGAMLYGGVVNEQIQLNEETVWGGGPHRNDSPKALGVL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+ +G+  EA       F  G     +Q +G + LEFD  H  Y++  YRRELDL
Sbjct: 83  PQVRELLFTGREKEAEKMIADNFFTGQHGMPFQTIGSLMLEFD-GHADYSD--YRRELDL 139

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A V+Y +G V +TR  F+S  D  ++ +I   + G++SF     +    ++     
Sbjct: 140 EKAIASVRYKIGEVNYTRTVFTSLADNALIVRIEADKPGAVSFTTRYSTPYKEYAVKKSG 199

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
             +++ G        P A        I+F    +IK   ++G +S   D  ++V+G+D A
Sbjct: 200 KSLLLSGHGSAHEGIPGA--------IRFETRTQIKA--EKGKVSVTNDC-IEVKGADAA 248

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           V+ + A+++F    +N  D   + T  +   L       Y+   + H + YQKLF RVS+
Sbjct: 249 VIYVTAATNF----VNYKDVSANETRRATEFLAKAMKRPYAQALSAHEEAYQKLFGRVSL 304

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            +  S K+               ++ R+K F   +DP LV L+FQFGRYLLISSS+PG Q
Sbjct: 305 NVGASAKE--------------ETSYRIKHFNEGKDPGLVALMFQFGRYLLISSSQPGGQ 350

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            A LQGIWN +L   WD    +NIN EMNYW +   NL+E  EPLF  +  LS +   TA
Sbjct: 351 PAGLQGIWNHELFAPWDGKYTININTEMNYWPAEVTNLTEMHEPLFQMVKELSESAQGTA 410

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
              Y   GW +HH TD+W  +    G     +WP+GGAWL  HLW+HY YT D+ FL+  
Sbjct: 411 HTLYDCRGWTVHHNTDLWRMAGPVDGASY--VWPLGGAWLSQHLWQHYLYTGDQAFLQT- 467

Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
           AYP L+G A F LD+L+E    G++   PS SPE     P G    ++   TMD  I+ +
Sbjct: 468 AYPALKGAADFFLDFLVEHPKYGWMVCAPSMSPEQ---GPPGTGTMLTAGCTMDTQIVLD 524

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             ++++SA ++L  +  +  + +   + RL P +I +   + EW+
Sbjct: 525 ALTSVLSATKLLYPDHTSYCDSLQSMIKRLPPMQIGKHNQLQEWL 569


>gi|322512626|gb|ADX05719.1| putative carbohydrate-active enzyme [uncultured organism]
          Length = 999

 Score =  363 bits (933), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 224/596 (37%), Positives = 326/596 (54%), Gaps = 55/596 (9%)

Query: 8   STTNPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
           +T NPL + +N  A   FT+A+PIGNG +G +++GGV  + + LNE T+W+G PGD    
Sbjct: 30  TTDNPLTLWYNSDAGSEFTNALPIGNGYMGGLIYGGVTKDFIGLNESTVWSGGPGDNNKQ 89

Query: 67  DAPKALSDVRSLVDSGQY--AEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETY 124
            A   L D R  +  G Y  AE+      +   PA  +Q +GD+ +    S        Y
Sbjct: 90  GAASHLKDARDALFRGDYRAAESIVNQYMIGPGPAS-FQPVGDLIISTSHS----GASDY 144

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
           RRELDL TA A+  Y+   V+ TRE+F+S PD VIV  +S  +SGS+SF  ++ +  ++ 
Sbjct: 145 RRELDLKTAIAKTTYTHSGVKHTREYFASYPDHVIVVYLSADKSGSVSFGATMTTPHNSK 204

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
              N  N +I +                    I+F   L +     + ++S   +  + V
Sbjct: 205 RMSNDGNTLIYDVTV---------------NSIKFQNRLTVVTDGGKASVS---NGNINV 246

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
           EG++ A L+L  +++F       +D   DP + +   +  +   SY DL   HL DYQ +
Sbjct: 247 EGANSATLILTTATNFKAY----NDVSGDPGAIAAEIMSKVAKKSYEDLLAAHLKDYQTI 302

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV + L  + K       S  +I    ++ RVK+F +  DPSLVEL +Q+GRYLLI+S
Sbjct: 303 FNRVKLDLGTADK-------SAGDI----TSTRVKNFNSTNDPSLVELHYQYGRYLLIAS 351

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           SR G Q ANLQGIWN+D +P W S    NINLEMNYW +   NL EC  PL D +  +  
Sbjct: 352 SRKGGQPANLQGIWNKDTNPIWGSKYTTNINLEMNYWPAESGNLEECVWPLIDKIKSMVP 411

Query: 425 NGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT-M 482
            G KTA+V++ +  GWV HH TD+W +S+   G   W LWP G  WL THLWEH+ Y   
Sbjct: 412 QGEKTAKVHWGVDEGWVEHHNTDLWNRSAPIDG--AWGLWPSGAGWLSTHLWEHFLYNPT 469

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI---EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           D+ +L+   YP ++G A F ++ L+   E  + YL T PS SPE++     G   C  + 
Sbjct: 470 DKAYLQD-VYPTMKGAALFFVNSLVEEPETGNKYLVTAPSDSPENDH---GGYNVC--FG 523

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            TMD  IIR+V +  I A+++L  +ED +  K+  ++ RL PTK  + G I EW+Q
Sbjct: 524 PTMDNQIIRDVLNYTIEASKILGVDED-VRAKMEATVKRLPPTKTGKYGQITEWLQ 578


>gi|78047362|ref|YP_363537.1| hypothetical protein XCV1806 [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
 gi|78035792|emb|CAJ23483.1| conserved hypothetical protein [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
          Length = 856

 Score =  363 bits (932), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 224/596 (37%), Positives = 320/596 (53%), Gaps = 48/596 (8%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
            +    L++ +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D  +P
Sbjct: 105 VAAAQALQLWYREPANQWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSNSP 164

Query: 67  DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
           DA  AL  VR+L+ +G+YAEA   A  KL   P     YQ LGD+ L+FD +        
Sbjct: 165 DALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 221

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YRR+LDL+TA A   +  G     RE F S   Q IV ++S    G +S  V +DS    
Sbjct: 222 YRRQLDLDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCDRPGGISVRVGIDSPQTG 281

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKK 241
                    ++  GR            N    GI+      +++      G +S + D +
Sbjct: 282 EVTAE-QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVRGGKLSQVRD-R 327

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L+++ +D  VLLL A++S+     +  D   DP + + + L+    L +  L   HL D+
Sbjct: 328 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLASTAACLRKAAKLDFPALLRAHLADH 383

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q+LF RV+I L  S                +P+ ERV+ F    DP+L  L  Q+GRYLL
Sbjct: 384 QRLFRRVAIDLGSSAA------------TQLPTDERVQRFAEGNDPALAALYHQYGRYLL 431

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I SSRPGTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L  
Sbjct: 432 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 491

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           L+  G+ TA+  Y A GWV+H+ TD+W ++    G   W+LWP+GG WL   LW+ ++Y 
Sbjct: 492 LAQAGAHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPLGGVWLLQQLWDRWDYG 550

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
            DR +L K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C   S 
Sbjct: 551 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 606

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWVQ 595
            MD  ++R++F+  I+ +++L    DA   + L +L  +L P +I + G + EW Q
Sbjct: 607 -MDAQLLRDLFAQCIAMSKLL--GIDAEFAQQLAALREQLPPNRIGKAGQLQEWQQ 659


>gi|70999286|ref|XP_754362.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
 gi|66851999|gb|EAL92324.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
          Length = 745

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 214/586 (36%), Positives = 323/586 (55%), Gaps = 47/586 (8%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA ++ +A+P+GNGRLGAMV+G   +E L+LNED++W G P +    DA + L  +R
Sbjct: 7   YQQPAGNWEEALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQNRVPHDAFECLPRLR 66

Query: 77  SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           SL+  G +AEA     +  F HP     Y+ LG + L+F   HL    + YRR LD+  A
Sbjct: 67  SLIREGNHAEAEKLVRLAFFSHPISQRHYEPLGTLFLDF--GHLPECTQNYRRSLDIERA 124

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL--DNHSYVNGNN 191
           T RV+Y    V+  RE  +SNPD VI  ++  S+    +  ++  S L  + + Y++   
Sbjct: 125 TTRVEYEHKGVKVRREVIASNPDSVIAIRVQASQKTDFTLRLTRMSELQYETNEYLD--- 181

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
            +  E R     I P  +     K  +   +++++ ++D+ +++ + +K L V   D A+
Sbjct: 182 DVTTEDRTITMHITPGGH-----KSNRACCMVKVRTAEDQDSVTQIGNKLL-VNAQD-AL 234

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           +L+ A +++        D  K  +S+  +AL      S  +++ RH++DY+ L+ R+ + 
Sbjct: 235 ILISAQTTY-----RCDDIDKKASSDLETALLH----STDEIWERHVNDYRSLYGRMELH 285

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           LS S  D+ TD                K  +   DP L+ L   + RYLLIS SR G + 
Sbjct: 286 LSPSNCDMPTD----------------KRIKNSRDPGLIALYHNYCRYLLISCSRNGDKA 329

Query: 372 --ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
             A LQGIWN    P W     +NINL+MNYW +  CNLS+C+ PLF  L  ++ +G +T
Sbjct: 330 LPATLQGIWNPSFHPAWGCKYTININLQMNYWPANICNLSDCEMPLFSLLERVAKSGEET 389

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           AQ  Y   GWV HH TDIWA +S     +   LWP+GGAWLC H+W+H+ +T D++FLE 
Sbjct: 390 AQKMYGCRGWVAHHCTDIWADTSPGDTWMPATLWPLGGAWLCVHIWDHFRFTRDKEFLE- 448

Query: 490 RAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
           R +P+L+GC  FLLD+L+E   G YL TNPS SPE+ F   +G+   +   ST+D+ I+ 
Sbjct: 449 RMFPILQGCVQFLLDFLVEDASGEYLVTNPSLSPENTFYEKNGERGVLCEGSTIDIQIVN 508

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            V SA + + E LE   D L    L +L RL P +I   G + EW 
Sbjct: 509 AVLSAYLKSVEELEI-VDKLAPAALDALHRLPPLRIGSFGQLQEWA 553


>gi|384098831|ref|ZP_09999943.1| hypothetical protein W5A_09224 [Imtechella halotolerans K1]
 gi|383834974|gb|EID74405.1| hypothetical protein W5A_09224 [Imtechella halotolerans K1]
          Length = 786

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 218/605 (36%), Positives = 333/605 (55%), Gaps = 43/605 (7%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           A + ++ +  ++ +  PA  + +A+P+GNGRLGAM++G   +E ++LNED++W G P   
Sbjct: 17  ANAQNSQSKERLWYKEPATKWMEALPVGNGRLGAMIFGQPINERIQLNEDSMWPGGPDWG 76

Query: 64  TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAE 121
            +   P+ L  +R L+  GQY +A    V  F +   V  +Q +GD+ ++F    +    
Sbjct: 77  DSKGTPEDLVYIRQLLKEGQYHKADEEIVTRFSNKGVVRSHQTMGDLYIDFSTKKVA--- 133

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             Y RELD+ TA A   Y+     +T+E F+S P  V++ + + +    +   + ++   
Sbjct: 134 -NYYRELDIETAVATTSYNSEGYNYTQEVFASAPHNVLIIRYTTTNPKGMDATLRMNRPK 192

Query: 182 D---NHSYVN--GNNQIIMEGRCP--GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
           D   N   V+    NQI M+G     G R+  +A   D   G++F   L +K   + G I
Sbjct: 193 DEGFNTVQVSSPAPNQIQMKGMVTQNGGRLNSEAKPLD--YGVKFDTRLVVK---NNGGI 247

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
              +D  L+++  + AVLLLV S+SF            +  S +   L  ++ LSY+++ 
Sbjct: 248 VVSKDGILELKNVNEAVLLLVGSTSFY--------HGNNYESYNEQLLGQVQELSYNEML 299

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELL 353
           + H+ DYQ L+ RV++ L  +              + +P+ ER+K  +    D +L  LL
Sbjct: 300 SAHVADYQSLYKRVTLDLGGN------------EFNKIPTDERLKKIKDGGTDKALSALL 347

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQ+GRYLLISSSRPGT  ANLQGIWNE +   W++  H+N+NL+MNYW +   NLSEC  
Sbjct: 348 FQYGRYLLISSSRPGTNPANLQGIWNEHIRAPWNADYHLNVNLQMNYWPAEVTNLSECHS 407

Query: 414 PLFDFLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
           PLFD+   L   G  TA+  Y +  G VIHH +DIWA +     +  W  W  GG WL  
Sbjct: 408 PLFDYTDRLINRGRITAKDQYGIHRGAVIHHTSDIWAPAWMHAERAYWGAWIHGGGWLAQ 467

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYL-ETNPSTSPEHEFIAPDG 531
           H WEHY+YT D DFL+ RA+P ++  A F LDWLI   D     ++P TSPE+ ++APDG
Sbjct: 468 HYWEHYSYTNDIDFLKNRAWPAMKALAEFYLDWLIYDQDSKTWVSSPETSPENSYMAPDG 527

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSI 590
             A VS+ + M   II EVF+  + AA +L+ N+D  V++V   L ++ P   +  DG I
Sbjct: 528 TPAAVSHGAAMGHQIIGEVFNNTLKAASILKINDD-FVQEVKSKLKKIHPGVVLGPDGRI 586

Query: 591 MEWVQ 595
           +EW +
Sbjct: 587 LEWTK 591


>gi|289664854|ref|ZP_06486435.1| hypothetical protein XcampvN_17740 [Xanthomonas campestris pv.
           vasculorum NCPPB 702]
          Length = 792

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 221/589 (37%), Positives = 323/589 (54%), Gaps = 46/589 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L++ +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+P A  AL
Sbjct: 47  LQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSPGALAAL 106

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             VR+L+ +G+YAEA   A   L   P     YQ LGD+ L+FD +        YRR+LD
Sbjct: 107 PQVRALIFAGRYAEAEQLADATLLSRPLKQMPYQPLGDLLLDFDRAD---GISDYRRQLD 163

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L+TA A   +  G     RE F S   Q IV ++S    G +S  V +DS   +      
Sbjct: 164 LDTAVAITTFRSGEAVHRREVFVSAQAQCIVVRLSCDHPGGISLRVGIDSP-QSGDVTAE 222

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKKLKVEGS 247
              ++  GR            N    GI+      +++      G +S + D+ L++E +
Sbjct: 223 QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVTGGKLSQVRDR-LRIEAA 269

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D  VLLL A++S+     +  D   DP + + ++L+   +L +  L   HL D+Q+LF R
Sbjct: 270 DEVVLLLSAATSYQ--RFDAVDG--DPLALTAASLRKAASLDFPALLHAHLADHQRLFRR 325

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           V+I L  S            +   +P+ ERV+ F    DP+L  L  Q+GRYLLI SSRP
Sbjct: 326 VAIDLGSS------------DAAQLPTDERVQRFAEGNDPALAALYHQYGRYLLICSSRP 373

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           GTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    + EC EPL   +  L+  G+
Sbjct: 374 GTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANAMHECVEPLESMVFDLAKTGA 433

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
            TA+  Y ASGWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y  DR +L
Sbjct: 434 HTAKAIYDASGWVVHNNTDLWRQAGPIDG-AKWSLWPMGGVWLLQQLWDRWDYGRDRAYL 492

Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
            K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C     TMD  +
Sbjct: 493 SK-IYPLFKGAAEFFVATLLRDPQTGAMVTNPSMSPENQH--PFGAAVCA--GPTMDAQL 547

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +R++F+  I+ +++L  + + L +++     +L P +I + G + EW Q
Sbjct: 548 LRDLFAQCIAMSKLLGVDAE-LAQQLATLREQLPPNRIGKAGQLQEWQQ 595


>gi|380694480|ref|ZP_09859339.1| alpha-L-fucosidase [Bacteroides faecis MAJ27]
          Length = 804

 Score =  363 bits (931), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 217/620 (35%), Positives = 327/620 (52%), Gaps = 54/620 (8%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
            T+  N L + +  PAK + +A+P+GNGRLGAM++G    E ++ NE+TL++G P    N
Sbjct: 10  GTNAQNHLTLWYKSPAKAWEEALPVGNGRLGAMIFGDTQKERIQFNENTLYSGEPETPKN 69

Query: 66  PDAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETY 124
            +    L+ +R L+  G+ AEA T    K  G   + YQ  GD+ ++FD    K A   Y
Sbjct: 70  INIVPDLAHIRQLLGEGKNAEAGTIMQEKWIGRLNEAYQPFGDLYIDFDS---KEAVTDY 126

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
              LD+  A     Y    V+ +RE F+S P Q IV  +  S+   L+F   L S   + 
Sbjct: 127 MHSLDMENAVVTTSYKQNGVDISREVFASYPAQAIVIHLKSSKP-VLNFTAYLAS--PHP 183

Query: 185 SYVNGNNQII-MEGRCPG---------------KRIPPK--------------ANAND-D 213
                ++Q++ ++G+ P                +R+ P+                 N+ D
Sbjct: 184 VTKESDSQVVYLKGQAPAHAQRRDTDHMKRFNTQRLHPEYFDASGHIIQKKQVIYGNEMD 243

Query: 214 PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
            KG  F A L   +   +G   ++ D ++         L+L A++S++GP  +PS   K+
Sbjct: 244 GKGTFFEACL---LPTHKGGQLSISDNQITARNCSEVTLMLYAATSYNGPRKSPSKEGKN 300

Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
           P    M+  +     +Y +L  +H  DYQ LF+RVS  L  + +              +P
Sbjct: 301 PHQAIMNYRRISEGETYKELKRQHTTDYQALFNRVSFDLPANKQQ-----------KELP 349

Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
           + ER+K F+ +ED +L+  LFQFGRYL+I+ SR   Q  NLQG+WN+ + P W+S   +N
Sbjct: 350 TDERLKRFKDEEDQALIAQLFQFGRYLMIAGSRGEGQPLNLQGLWNDQILPPWNSGYTLN 409

Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
           INLEMNYW +   NLSEC +PLF  +  ++  G   A+  Y  +GW IHH   IW ++  
Sbjct: 410 INLEMNYWPAEVTNLSECHQPLFKLIEEIADKGKDLARDMYGLNGWAIHHNISIWREAYP 469

Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
             G V W  W M G WLC HLWEHY +T D +FL K+ YP+L+G A+F  +WL++   G 
Sbjct: 470 SDGFVYWFFWNMSGPWLCNHLWEHYLFTKDANFL-KKYYPILKGAATFCSEWLVKNSKGE 528

Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
           L T  STSPE+ ++  D   A V   STMD+AIIR +FS  I AAE+L+ + D   E ++
Sbjct: 529 LVTPVSTSPENAYLMGDHTPASVCEGSTMDIAIIRSLFSNTIQAAEILQTDMDFRSE-LI 587

Query: 574 KSLPRLRPTKIAEDGSIMEW 593
           K   +L+  +I   G ++EW
Sbjct: 588 KKRNKLKKYQIGSKGQLLEW 607


>gi|313203234|ref|YP_004041891.1| alpha-L-fucosidase [Paludibacter propionicigenes WB4]
 gi|312442550|gb|ADQ78906.1| Alpha-L-fucosidase [Paludibacter propionicigenes WB4]
          Length = 822

 Score =  363 bits (931), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 214/583 (36%), Positives = 317/583 (54%), Gaps = 48/583 (8%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYA 85
           +A+PIGNG LGAMV+G V  E ++LNE TLW+G P D  NP A +ALS +R+ +  G+Y 
Sbjct: 55  NALPIGNGFLGAMVYGNVNQELIQLNEKTLWSGSPDDNNNPQAAEALSQIRNFLFEGKYK 114

Query: 86  EATAASVK-------------LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
           EA   + K                 P   YQ LG++  +F  +      E Y RELDLN 
Sbjct: 115 EANELTNKTQICKGVGSGTGSGTNVPYGSYQTLGNLFFDFGKTA---PFENYVRELDLNR 171

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
               V YS   V + RE F+S PD+ ++  ++  + G+LSF   L       + V  N+ 
Sbjct: 172 GVVTVSYSQNGVRYKREIFASYPDRALIIHLTADKKGALSFTTELTRPERFETRVE-NDH 230

Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
           ++M G     +            G++++A L+   +  RG     ++ +++VEG+D  ++
Sbjct: 231 LLMTGALTNGQ---------GGDGMKYAARLK---ATTRGGKLNYKNNEIRVEGADEVIM 278

Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
           +L AS+++   +  PS    DP   + + L    +  Y  L   H  DY  LF +VS+ L
Sbjct: 279 ILTASTNYKQEY--PSFVGDDPRLTTQNQLSKASSKPYPTLLKNHTVDYAALFGKVSLNL 336

Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKS-FQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           S            + + DT+P+  R+++  +  +D  L E+ FQFGRYLLISSSR G+  
Sbjct: 337 S------------DNDPDTIPTDRRLRNQTKNPDDLHLQEVYFQFGRYLLISSSREGSLP 384

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
           ANLQGIW   +   W+   H NIN++MNYW +   NLSEC  PL   +  L   G  +A 
Sbjct: 385 ANLQGIWCNKIQAPWNCDYHSNINVQMNYWGADIVNLSECFSPLSRLIESLVKPGEISAA 444

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
           V Y ASGW +   T++W  +S   G + W L+  GG WLC HLW+HY +T+DR++L+ R 
Sbjct: 445 VQYNASGWCVQPITNVWGYTSPGEG-INWGLYVAGGGWLCRHLWDHYTFTLDRNYLQ-RV 502

Query: 492 YPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
           YP++   A F LDWL+ +   G L + PSTSPE+ FIAPDG    +    + D  II E+
Sbjct: 503 YPVMLNAARFYLDWLVTDPKTGKLVSGPSTSPENSFIAPDGSRGSICMGPSHDQEIIHEL 562

Query: 551 FSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           F+ +++A++VL KN D L+ K+  +L  L   KI  DG +MEW
Sbjct: 563 FTNVLTASKVL-KNTDPLLAKIDIALRNLATPKIGSDGRLMEW 604


>gi|146301819|ref|YP_001196410.1| hypothetical protein Fjoh_4083 [Flavobacterium johnsoniae UW101]
 gi|146156237|gb|ABQ07091.1| Candidate alpha-L-fucosidase; Glycoside hydrolase family 95
           [Flavobacterium johnsoniae UW101]
          Length = 816

 Score =  362 bits (930), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 220/585 (37%), Positives = 323/585 (55%), Gaps = 38/585 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA  + +A+P+GNGRLGAMV+G    E L+LNE+T+W G P    +  + KAL
Sbjct: 25  LKLWYDKPASIWNEALPLGNGRLGAMVFGDPAVERLQLNEETIWAGSPNGNAHNKSIKAL 84

Query: 73  SDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELD 129
             VR L+  G++ EA   A+  +     D   YQ  G + + F   H KYA+  Y R+LD
Sbjct: 85  PIVRQLIFDGKFDEAQDLATQDIMSQTNDGMPYQTFGSVYISFA-GHQKYAD--YYRDLD 141

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           ++ ATA+VKY V  VEFTRE  ++  DQVIV K+S S+ G ++ NV ++S +D       
Sbjct: 142 ISNATAKVKYKVNGVEFTREILTAFSDQVIVVKLSASQPGQITCNVFMNSPIDKTVASTE 201

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
            NQII+ G           N       ++F   L  K  +  G I A  +  L +  +D 
Sbjct: 202 GNQIILSGVG--------TNFEGVKGKVKFQGRLTAK--NKGGEIDA-SNGVLSINKADE 250

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
             L +  +++F     N  D   D  ++S   L       +  +   H+D YQK F+RVS
Sbjct: 251 VTLYISIATNFK----NYQDISGDEIAKSKDYLAKAEVKDFETIKKAHVDYYQKFFNRVS 306

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L  +  D+V            P+ ER++ F    DP L  L FQFGRYLLISSS+PG 
Sbjct: 307 LNLGSN--DLVKK----------PTNERIRDFSKQFDPQLASLYFQFGRYLLISSSQPGG 354

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN+ ++P WDS    NIN EMNYW +   NL E  EP       L++ G++T
Sbjct: 355 QPANLQGIWNDMVTPPWDSKYTTNINAEMNYWPAQVTNLQEMHEPFVQMAKELAVTGAET 414

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+  Y ASGWV+HH TDIW + +A        +WP GGAW+C  LWE Y YT D+ +L +
Sbjct: 415 AKTMYNASGWVLHHNTDIW-RVTAPVDSAASGMWPTGGAWVCQDLWERYLYTGDKKYLVE 473

Query: 490 RAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
             YP+++G A F LD++ I+ +  YL   PS+SPE+      GK A ++  +TMD  ++ 
Sbjct: 474 -IYPIMKGAADFFLDFMVIDPNTKYLVVVPSSSPENTHAGGTGK-ATIASGTTMDNQLVF 531

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           ++F+ +I A+ ++  +  A  +KV  +L ++ P KI +   + EW
Sbjct: 532 DLFTHVIEASALVSPDV-AYAKKVSDALAKMPPMKIGKYNQLQEW 575


>gi|325105420|ref|YP_004275074.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
 gi|324974268|gb|ADY53252.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
          Length = 768

 Score =  362 bits (930), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 218/586 (37%), Positives = 318/586 (54%), Gaps = 45/586 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +N PA  + +A+P+GNG LGAM++G   +E L+LNE ++W G   D+ NP A  +L
Sbjct: 28  LKLWYNKPALDWNEALPVGNGSLGAMIFGNTFNEVLQLNESSVWAGKDEDFVNPRAKASL 87

Query: 73  SDVRSLVDSGQYAEAT-AASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEETYRRELD 129
             VR+L+   +Y EA   A   L G       YQ LG++ L+F  S+   +   Y REL+
Sbjct: 88  KKVRNLLFQEKYTEAQDLADSSLMGDKKIWSSYQELGNLRLDFKKSNRSVS--NYNRELN 145

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           +  A A   ++V    F RE FSS     +  K+S +++  +S  + +D   +       
Sbjct: 146 IENAIATTTFNVDGTLFEREVFSSAVANTVFIKLSSNKTKQISLTIGMDRAGNLAKISAS 205

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           ++QI +                ++  G+   +I  I     R ++S   + K+ VE +D 
Sbjct: 206 DHQIYLTEHV------------NNGVGVILHSIANIANKGGRLSVS---NNKIIVENADE 250

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
            V+ L A+++F+    NP ++ K   SES++        +Y      H+ DYQ+ F+RV 
Sbjct: 251 VVITLAAATNFN--HTNPLETVKSRISESLAK-------AYQQHKEEHIKDYQQYFNRVK 301

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPG 368
           + L  +            N    P+  R+ + +    DPSL+ L +Q+GRYLLISSSRPG
Sbjct: 302 LNLGNN------------NSSLFPTDARLSALKNGNFDPSLITLFYQYGRYLLISSSRPG 349

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
              ANLQGIW E L   W+   H+NIN +MNYW +   NLSE   P  D+LT L  +G K
Sbjct: 350 GLPANLQGIWAEGLQVPWNGDYHININAQMNYWLAENTNLSEMHMPFLDYLTNLGKDGKK 409

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           TA+  Y  SG V H  +DI+  +    GK  WA+WP G AW   H WEHY YT D+ FLE
Sbjct: 410 TAKDMYGLSGEVAHFASDIFYYTEP-WGKPKWAMWPTGLAWCSQHAWEHYLYTQDKAFLE 468

Query: 489 KRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
           K+ Y +L+  + F LDWL++    G L + PS SPE+ F  PDGK+A V     MD  II
Sbjct: 469 KQGYEILKQSSIFFLDWLVKNPKTGLLVSGPSISPENTFKTPDGKIATVIMGPAMDHMII 528

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           RE+F   ISAA++L K++  LV K+ K+L +L PT+I  DG I+EW
Sbjct: 529 RELFGNTISAAQILGKDKK-LVTKLQKALKQLTPTQIGSDGRILEW 573


>gi|189462578|ref|ZP_03011363.1| hypothetical protein BACCOP_03268 [Bacteroides coprocola DSM 17136]
 gi|189430739|gb|EDU99723.1| intein C-terminal splicing region [Bacteroides coprocola DSM 17136]
          Length = 866

 Score =  362 bits (929), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 220/590 (37%), Positives = 324/590 (54%), Gaps = 42/590 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PAK + +A+P+GN  +GAMV+GG   E L+LNE+TLW G P    NP A ++L
Sbjct: 68  LKLWYQQPAKTWVEALPVGNSSMGAMVYGGTSREELQLNEETLWGGGPYRNDNPKALESL 127

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           ++VR+L+ SG+  +A     + F  G     YQ +G + +E    H K   + Y R+L+L
Sbjct: 128 AEVRNLIFSGKTMDAQNLIDQTFYTGRNGMPYQTIGSLIIE-APGHEK--AKNYYRDLNL 184

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A  +Y V  V F RE F+S PD+VI+ + +  + G L+F VS DS L +     G 
Sbjct: 185 ERAVATTRYQVDGVNFQREVFASFPDRVIIVRFTTDKPGELNFKVSYDSPLQSTVRKQGK 244

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD---RGTISALEDKKLKVEGS 247
            ++++ G+              D +G++   ++E++        G   +L DK + VE +
Sbjct: 245 -KLVLRGK------------GGDHEGVK--GVIEVETQSQVIAEGGKVSLTDKYISVEHA 289

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
             A L + A+++F    +N  + K + + ++ + L       YS+    H D YQ  F+R
Sbjct: 290 TAATLYIAAATNF----VNYHNVKGNESKKASALLAGAMKKEYSEALKAHTDYYQSQFNR 345

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           VS+ L        T T  +E +      +R+  F    DP+L  L+FQ+GRYLLISSS+P
Sbjct: 346 VSLSLGGEN----TKTARQETV------KRIAGFSQGNDPALAALMFQYGRYLLISSSQP 395

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           G Q ANLQGIWN  L+  WD    +NIN EMNYW +   NLSE  EPLF  +  LS+ G 
Sbjct: 396 GGQPANLQGIWNHQLNAPWDGKYTININTEMNYWPAEVTNLSETHEPLFGLVQDLSVTGR 455

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           +TA+  Y  +GWV HH TDIW + +    K  +  WP+GGAWL THLW+HY YT D+DFL
Sbjct: 456 ETARTMYGCNGWVAHHNTDIW-RVTGPVDKAFYGTWPVGGAWLTTHLWQHYLYTGDKDFL 514

Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMA 545
            K +YP ++G A F L ++I     G+  T PS SPEH     D K A    S  TMD  
Sbjct: 515 RK-SYPAMKGAADFFLGYMIPHPKYGWKVTAPSMSPEHGPKGEDTKKASTIVSGCTMDNQ 573

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           II +V S  ++A+E+LE +  A  + +   L  + P +I     + EW++
Sbjct: 574 IIFDVLSNTLAASEILELSA-AYRDSLRTLLSEMAPMQIGRYNQLQEWLE 622


>gi|300785873|ref|YP_003766164.1| large protein [Amycolatopsis mediterranei U32]
 gi|384149183|ref|YP_005531999.1| large protein [Amycolatopsis mediterranei S699]
 gi|399537756|ref|YP_006550418.1| large protein [Amycolatopsis mediterranei S699]
 gi|299795387|gb|ADJ45762.1| large secreted protein [Amycolatopsis mediterranei U32]
 gi|340527337|gb|AEK42542.1| large protein [Amycolatopsis mediterranei S699]
 gi|398318526|gb|AFO77473.1| large protein [Amycolatopsis mediterranei S699]
          Length = 949

 Score =  362 bits (929), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 231/591 (39%), Positives = 319/591 (53%), Gaps = 46/591 (7%)

Query: 11  NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           N L + ++ PA   +  A+PIGNGRLGAMV G   +E L+LNEDT+W G P DY+N    
Sbjct: 39  NDLALWYDKPAGTEWLRALPIGNGRLGAMVSGNTDTERLQLNEDTVWAGGPHDYSNAQGA 98

Query: 70  KALSDVRSLVDSGQYAEATA-ASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEETYRR 126
            ALS +R LV + Q+ +A +    K+ G PA    YQ +G + L    +       +Y+R
Sbjct: 99  GALSQIRQLVFANQWTQAQSLIDQKMLGTPAAQQPYQPVGTLSLALPGNS---GVSSYQR 155

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            LDL TAT  V Y   NV + RE F+S  DQVIV +++    GS+SF+ SL +     + 
Sbjct: 156 WLDLTTATTVVTYVANNVRYRREVFASAADQVIVLRLTAETPGSISFSASLGTPQRATTS 215

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA-ILEIKISDDRGTISALEDKKLKVE 245
                 I ++G             + D +GI  S   L +  +   G  ++     L+V 
Sbjct: 216 SPNGTTIALDG------------ISGDSRGIAGSVRFLALAGATAEGGSTSSSGGTLRVS 263

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G+D   LL+   +S+    ++      D    + S L + + L +  L  RHL DYQKLF
Sbjct: 264 GADAVTLLISIGTSY----VDYRTVNGDYQGIARSRLAAAQALPHDTLRGRHLADYQKLF 319

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            R ++ L R        T + +     P+  R+    +  DP    LLFQFGRYLLISSS
Sbjct: 320 GRTTLDLGR--------TAAADQ----PTDVRIAQHNSVNDPQFAALLFQFGRYLLISSS 367

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPGTQ ANLQGIWN+ L+P+W+S   +N NL MNYW +   NL+EC EP+F  +  L++ 
Sbjct: 368 RPGTQPANLQGIWNDQLNPSWESKYTLNANLPMNYWPADVTNLAECYEPVFAMIGDLAVT 427

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           G++TAQV Y A GWV HH TD W  SS  D  +    +W  GGAWL T +W+HY +T D 
Sbjct: 428 GARTAQVEYGARGWVTHHNTDGWRGSSIVDFAQA--GMWQTGGAWLATMIWDHYRFTGDV 485

Query: 485 DFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
           +FL  R YPLL+G A F LD L+ E   GYL TNP+ SPE    A     A V    TMD
Sbjct: 486 EFLRAR-YPLLKGAAQFFLDTLVTEPSLGYLVTNPANSPELNHHAN----ASVCAGPTMD 540

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           M I+R++F     A +VL  +     ++V  +  RL P K+   G+I EW+
Sbjct: 541 MQILRDLFDGCAGACQVLGVDA-TFADQVTAARQRLAPMKVGSRGNIQEWL 590


>gi|395776471|ref|ZP_10456986.1| large protein [Streptomyces acidiscabies 84-104]
          Length = 802

 Score =  362 bits (928), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 226/577 (39%), Positives = 316/577 (54%), Gaps = 49/577 (8%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A+P+GNGRLGAMV+G   +E L+LNEDTLW G P +Y NP    AL  +R LV + Q+ +
Sbjct: 46  ALPVGNGRLGAMVFGNTDTERLQLNEDTLWAGGPHNYDNPRGAAALGRIRQLVFADQWGQ 105

Query: 87  AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
           A    +  + G PA    YQ +GD+ L F       A   Y R LDL TAT  V Y+  N
Sbjct: 106 AQDLINQTMLGDPAAQLAYQPVGDLRLTFPAGS---AVSAYERLLDLTTATTAVTYTANN 162

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
           V + RE F+S PDQVIV +++    GS++F+ +  S             I ++G      
Sbjct: 163 VSYRREVFASAPDQVIVMRLTARTPGSITFSATFASPQRTSLSSPDGTTIALDG------ 216

Query: 204 IPPKANANDDPKGI----QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSS 259
                  + D +GI    +F A+   K   + G++++     L+V G+D   LL+   +S
Sbjct: 217 ------VSGDMRGIAGTVRFLAL--AKAVAEGGSVTS-SGGTLRVTGADSVTLLVSIGTS 267

Query: 260 FDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
           +    ++      D    + + L + + ++Y  L  RH+ DYQ LF RVS+ + R+P   
Sbjct: 268 Y----VDYRTVDGDYQGIARTHLDAAQGVAYDTLRARHVADYQALFGRVSLDVGRTP--- 320

Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
                +++     P+  R+    + +DP    LLFQ+GRYLLISSSRPGTQ ANLQGIWN
Sbjct: 321 ----AADQ-----PTDVRIAQHGSADDPQFSALLFQYGRYLLISSSRPGTQPANLQGIWN 371

Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
           + L+P+WDS   +N NL MNYW +   NL+EC  P+F  +  L+  G++TAQ  Y A GW
Sbjct: 372 DQLTPSWDSKYTINANLPMNYWPADTTNLAECLAPVFAMIDDLTATGARTAQAQYGARGW 431

Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
           V HH TD W  +S   G  VW +W  GGAWL + +W+HY +T D +FL +R YP L+G A
Sbjct: 432 VTHHNTDAWRGTSVVDG-AVWGMWQTGGAWLASLIWDHYRFTGDVEFL-RRNYPALKGAA 489

Query: 500 SFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
            F LD L+     G+L TNPS SPE     PD     V    TMDM I+R +F    SA+
Sbjct: 490 RFFLDTLVPHPGLGHLVTNPSNSPELTH-HPD---VSVCAGPTMDMQILRSLFDGCASAS 545

Query: 559 EVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           EVL  +  A   +V  +  RL P KI   G+I EW+ 
Sbjct: 546 EVLGVDA-AFRAQVRSARRRLAPMKIGSRGNIQEWLH 581


>gi|372220893|ref|ZP_09499314.1| alpha-L-fucosidase [Mesoflavibacter zeaxanthinifaciens S86]
          Length = 805

 Score =  362 bits (928), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 203/592 (34%), Positives = 338/592 (57%), Gaps = 32/592 (5%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKAL 72
           +I F+ PA +F + + +GNG++GA ++GG+ +E + LN+ TLW+G P ++ N P+A K L
Sbjct: 33  EIWFDKPATYFEETLVLGNGKMGASIFGGIQTEKIFLNDITLWSGEPMNHNNNPEAYKNL 92

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
            ++R+ + +  Y  A + + KL G  +  Y  LG + L F +   +     Y+R LDL T
Sbjct: 93  PEIRAALKAENYKLADSLNKKLQGQFSQSYAPLGTLWLHFKN---ETNITNYKRSLDLTT 149

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           A A V Y    V++ RE+F SNP +V+V +++     ++SF++  +S L        +++
Sbjct: 150 AIADVSYESNGVKYKREYFISNPKKVMVVRLTSDRKKAISFDLKFESQL-RFKIKELDSK 208

Query: 193 IIMEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           +I  G  P    P    +  +P      KG +F++   IK +D  GT+  ++D  L V+ 
Sbjct: 209 LIATGYAPVHVEPSYRGSIKNPIVFDADKGTRFTSAFSIKQTD--GTVK-IQDSVLSVQN 265

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    LL+  ++SF+G   NP+    +  + ++  ++S +  +Y++L   H+ DY +L++
Sbjct: 266 ATEVELLVAVATSFNGFDKNPATEGLNHENIALEQIKSSKKETYANLKKEHVADYSELYN 325

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL-FQFGRYLLISSS 365
           RV  +LS             + +  VP+ +R+  ++T  +   +E+L F +GRYLLI+SS
Sbjct: 326 RVDFKLSH------------KELPNVPTDQRLLRYETGANDQNLEILYFNYGRYLLIASS 373

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           R     ANLQG+WN  + P W S   +NINL+ NYW +   NLSE  +PL  F+  LS  
Sbjct: 374 RTKEVPANLQGLWNPHIRPPWSSNYTININLQENYWLAETANLSELHQPLLSFIGNLSKT 433

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           G+ TA+  Y  +GW   H +DIWA ++      +G   WA W MGG WL +HLWEHY YT
Sbjct: 434 GAITAKTYYGTNGWAAGHNSDIWALTNPVGDFGQGNPNWANWNMGGVWLTSHLWEHYLYT 493

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
            D  +L++ AYP+++G A+F  +WLI+   G   ++PSTSPE+ +  P+G +    Y +T
Sbjct: 494 KDTTYLKEYAYPIIKGAATFASEWLIKDQHGQFISSPSTSPENLYKTPEGYVGATLYGAT 553

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            DMA+I+E+F + ++A++ L   +D    K+  +L  L P KI + G++ EW
Sbjct: 554 ADMAMIKELFYSYLNASKTLAIQDD-FTRKIKFNLENLSPYKIGQKGNLQEW 604


>gi|410096950|ref|ZP_11291934.1| hypothetical protein HMPREF1076_01112 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409224744|gb|EKN17668.1| hypothetical protein HMPREF1076_01112 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 804

 Score =  362 bits (928), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 220/606 (36%), Positives = 327/606 (53%), Gaps = 51/606 (8%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           A++T+T NP K  ++  A+ +  A+P+GNG LGAMV+G V  E ++LNE+T+W+G   D 
Sbjct: 39  ADATATDNPNK-GYDDDAE-WLKALPLGNGSLGAMVFGDVHKERIQLNEETMWSGSIQDS 96

Query: 64  TNPDAPKALSDVRSLVDSGQYAEAT-------AASVKLFGH------PADVYQLLGDIEL 110
            NP+A K + +++ L+  G+Y EAT         + K  GH      P   YQ +GD+ +
Sbjct: 97  DNPEAAKHIEEIKQLLFDGKYKEATDLTNRTQICTGKGSGHGQGSNAPFGCYQTMGDLWI 156

Query: 111 EFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS 170
           +FD+   K     YRREL+L+ ATAR+ Y  G+V F RE F S+PDQ +V +IS  +   
Sbjct: 157 DFDN---KSPYTDYRRELNLDDATARISYKQGDVNFKREIFISHPDQSMVMRISADKKQQ 213

Query: 171 LSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD 230
           LSF   ++   + +S    N Q+IM G             +D   G     +  +K    
Sbjct: 214 LSFTCRMNRP-ERYSTYTENEQLIMAGAL-----------SDGKGGDGLQYMTRLKAVPM 261

Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY 290
            G+++   D  L V+ +D  +L L AS+ +   +  P    +D +S + ++L    N SY
Sbjct: 262 NGSVT-YSDSTLTVKDADEVLLFLTASTDYKLEY--PIYKGRDFSSITEASLNKAINKSY 318

Query: 291 SDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSL 349
           + LY  H+ +Y   F R ++QL+ +P             DT+P+  +V + +    DP L
Sbjct: 319 NQLYETHVKEYTDYFQRANLQLTNTP-------------DTIPTDIKVMNARKGMIDPHL 365

Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
            E +FQ+GRYLLISSSRPGT  ANLQGIW   L   W+   H ++N+EMNYW +   NLS
Sbjct: 366 YEQMFQYGRYLLISSSRPGTMPANLQGIWANKLQTAWNGDYHTDVNIEMNYWPAEVTNLS 425

Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
           E   P+FD +  L   GSKTAQ+ Y   GWV+H  T++W  +S       W +     AW
Sbjct: 426 EMHLPMFDLIASLVEPGSKTAQIQYNKKGWVVHPITNVWGYTSPGEA-ASWGMHTGAPAW 484

Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIA 528
           +C H+ EHY +T D+DFL ++ YP+L+G   F +DWL E      L + P+ SPE+ F+A
Sbjct: 485 ICQHIGEHYRFTGDKDFL-RKTYPVLKGAIEFYMDWLTENPKTKELVSGPAVSPENTFVA 543

Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
           PDG  + +S     D   I ++F      +  L  ++D    +V  +  RL  TKI  DG
Sbjct: 544 PDGSHSQISMGPAHDQQTIWQLFDDFAMISSELSIDDD-FTRQVADAKDRLADTKIGSDG 602

Query: 589 SIMEWV 594
            IMEW 
Sbjct: 603 RIMEWA 608


>gi|390989152|ref|ZP_10259452.1| hypothetical protein XAPC_114 [Xanthomonas axonopodis pv. punicae
           str. LMG 859]
 gi|372556186|emb|CCF66427.1| hypothetical protein XAPC_114 [Xanthomonas axonopodis pv. punicae
           str. LMG 859]
          Length = 790

 Score =  361 bits (927), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 224/597 (37%), Positives = 321/597 (53%), Gaps = 50/597 (8%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
            +    L++ +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+P
Sbjct: 39  VAAAEALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98

Query: 67  DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
           DA  AL  VR+L+ +G+YAEA   A  KL   P     YQ LGD+ L+FD +        
Sbjct: 99  DALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YRR+LDL+TA A   +  G     RE F     Q IV ++S    G +S  V +DS    
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVCAQAQCIVVRLSCDRPGGISLRVGIDSPQTG 215

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKK 241
                    ++  GR            N    GI+      +++      G +S + D+ 
Sbjct: 216 EITAEPGG-LLFSGR------------NGSFAGIEGRLRFALRVLPQVSGGKLSQVRDR- 261

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L+++ +D  VLLL A++S+     +  D   DP + + + L+   NL +  L   HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAARLRKAANLDFPALLRAHLADH 317

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q+LF RV+I           D  S E +  +P+ ERV+ F    DP+L  L  Q+GRYLL
Sbjct: 318 QRLFRRVAI-----------DLGSSEAVQ-LPTNERVQRFAEGNDPALAALYHQYGRYLL 365

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I SSRPGTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L  
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           L+  G+ TA+  Y A GWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y 
Sbjct: 426 LAQTGTHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYG 484

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
            DR +L K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C   S 
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540

Query: 541 TMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            MD  ++R++F+  I+ +++L  +      +  + + LP   P +I + G + EW Q
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQ 593


>gi|408671718|ref|YP_006875526.1| alpha-L-fucosidase [Emticicia oligotrophica DSM 17448]
 gi|387857567|gb|AFK05662.1| alpha-L-fucosidase [Emticicia oligotrophica DSM 17448]
          Length = 818

 Score =  361 bits (927), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 218/593 (36%), Positives = 326/593 (54%), Gaps = 48/593 (8%)

Query: 12  PLKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           PLK+ +  P+ + + +A+PIGNGRLGAM++G V  E ++LNE T+W+G P    NP A +
Sbjct: 22  PLKLWYKQPSGNTWENAMPIGNGRLGAMIYGNVEQEIIQLNEHTVWSGSPNRNDNPLALE 81

Query: 71  ALSDVRSLVDSGQYAEA----TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
            L+++R L+  G + EA      A +    H    ++ +G++ L F         + Y R
Sbjct: 82  KLAEIRKLIFEGNHKEAEKLANQAIISKTSH-GQKFEPVGNLNLVFAGQE---NYKNYYR 137

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
           ELD+  A ++  Y VG+V +TRE F+S  D+VI+ KIS +++G++SFN ++ S     + 
Sbjct: 138 ELDIERAISKTTYQVGDVTYTREAFASLADRVIIMKISANKAGNVSFNANISSPQKRKTI 197

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDP--KG-IQFSAILEIKISDDRGTISALEDKKLK 243
                        P K +      +D    KG + F  I  IK+  + G++ +  D  L 
Sbjct: 198 AT----------TPNKDLTLSGITSDHETVKGMVAFKGISRIKL--EGGSLQS-TDTSLV 244

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V+G++ A++ +  +++F+    N  D   D    +   L +    +Y+ L + H+  YQK
Sbjct: 245 VKGANSAIIFISIATNFN----NYQDLSGDENKRANDYLNNAFAKTYTTLLSSHILAYQK 300

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           LF+RV I L             E +   +P+ ER+++F+   DP +V L +QFGRYLLIS
Sbjct: 301 LFNRVKIDLG------------ETDAAKLPTDERLRNFRNINDPQMVALYYQFGRYLLIS 348

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SS+PG Q ANLQGIWN  ++P WDS   +NIN EMNYW +   NLSE  EP    +  LS
Sbjct: 349 SSQPGGQPANLQGIWNNRINPPWDSKYTININAEMNYWPAEKTNLSELHEPFLKMVKELS 408

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
           I G KTA+  Y A GW+ HH TDIW  + A  G   W +W  GG W+  HLWEHY YT D
Sbjct: 409 ITGQKTAKDMYGARGWMAHHNTDIWRATGAIDG-AFWGMWTAGGGWVSQHLWEHYLYTGD 467

Query: 484 RDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           + FL   AYP L G A F  D+L+     + +L  NP  SPE+   A DG  + +    T
Sbjct: 468 KAFLAS-AYPALRGAAQFYADFLVPHPNKNNWLVVNPGNSPENAPAAHDG--SSLDAGVT 524

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           MD  I+ +VF+  ISAAE+L+ + +  V+ + K   +L P  I +   + EW+
Sbjct: 525 MDNQIVFDVFNKAISAAEILKIDAN-FVDSLKKLRAKLPPMHIGQHNQLQEWL 576


>gi|333381508|ref|ZP_08473190.1| hypothetical protein HMPREF9455_01356 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332830478|gb|EGK03106.1| hypothetical protein HMPREF9455_01356 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 813

 Score =  361 bits (926), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 217/588 (36%), Positives = 332/588 (56%), Gaps = 44/588 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +  PAK + +A+P+GN RLGAMV+G    E L+LNE+T+W G P    +P    +L
Sbjct: 23  IKLQYKRPAKEWVEALPLGNSRLGAMVFGSPVRERLQLNEETMWGGGPHRNDSPALLGSL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
           ++VRSL+ +G+  EA A   K    P +   YQ +G++ L+F   H  Y++  Y R LDL
Sbjct: 83  NEVRSLIFAGKEKEAEALLDKTMRTPHNGMPYQTIGNLYLDFT-GHDNYSD--YSRNLDL 139

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
            TA A  +Y+V  V +TRE F+S  D VI+ +I+  ++ S++F+ S DS +  +S     
Sbjct: 140 KTAVATTRYAVDGVTYTREVFTSFTDNVIIMRITADKANSINFSASYDSQVKGYSVSVKG 199

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKLKVEGSD 248
           N+++++G               D +GI+     E   +I  + GT+ A +D  +    + 
Sbjct: 200 NRLVLKG------------TGSDHEGIKGVVRFENQTEIKTEGGTVKAGKDNIVVKNANT 247

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
             + + +A++  D   ++ ++++K  T      L+S     Y    T H+  YQK F+RV
Sbjct: 248 ATIYISIATNFIDYKNVSGNEARKAET-----ILKSALTKPYQTALTDHIKYYQKQFNRV 302

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
            + L            SE   D   S  RV++F+  +D +LV LLFQFGRYLLISSS+PG
Sbjct: 303 ELDLG----------TSERMNDETDS--RVRNFKDGKDQNLVTLLFQFGRYLLISSSQPG 350

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
            Q + LQGIWN+ L P WDS   +NIN EMNYW +   NLSE   PLF+ +  ++  G +
Sbjct: 351 GQPSTLQGIWNDQLVPPWDSKYTININTEMNYWPAEVTNLSETHFPLFEMVKEIAETGKE 410

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           TA+V Y A+GWV HH TDIW  +    G   + +WP GGAWL  H+W+HY YT D+ FL 
Sbjct: 411 TAKVMYNANGWVTHHNTDIWRTTGPVDG-AFYGMWPDGGAWLSRHMWQHYLYTGDKAFLS 469

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           +  YP+L+G A F LD+L+E H  Y  + + PSTSPE     P G    ++  STMD  I
Sbjct: 470 E-VYPVLKGAADFFLDFLVE-HPKYKWMVSAPSTSPEQ---GPPGTGTSITAGSTMDNQI 524

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           + +V S  ++A+  L+  ++A  +++   + RL P +I +   + EW+
Sbjct: 525 VFDVLSDALNASRALQLADNAYEKRLEDMISRLAPMQIGKYNQLQEWL 572


>gi|198275212|ref|ZP_03207743.1| hypothetical protein BACPLE_01371 [Bacteroides plebeius DSM 17135]
 gi|198271795|gb|EDY96065.1| hypothetical protein BACPLE_01371 [Bacteroides plebeius DSM 17135]
          Length = 800

 Score =  361 bits (926), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 212/583 (36%), Positives = 308/583 (52%), Gaps = 47/583 (8%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
            +P+GNG LGA+V+G V  E ++LNE+T+W+G P +  NPDAP+ L  +R L+  G+Y E
Sbjct: 56  GLPLGNGSLGAVVFGDVAMERIQLNEETMWSGSPQECDNPDAPQYLDKIRQLLLEGKYKE 115

Query: 87  ATAASVK-------------LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           AT  + +                 P   +Q +GD+ ++F +   K A   YRREL+L  A
Sbjct: 116 ATELTNRTQVCTGKGSGGGNGSTVPFGCFQTMGDLWIDFAN---KEAYSDYRRELNLEDA 172

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
           TA V Y+ G+V F RE F S+PDQV+V ++S  +   +SF   +       ++   + Q+
Sbjct: 173 TATVTYTQGDVHFKREIFISHPDQVMVIRLSADKQQQMSFTCRMTRPEYFFTHTE-DGQL 231

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
           IM G     +            G+Q+ A L+   +  +G      D  L V G+D  +LL
Sbjct: 232 IMSGALSDGK---------GGDGLQYMARLK---AVTKGGEVICTDSTLTVSGADEVMLL 279

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
           L AS+ +      P    +D  S +  ++      ++  LY  H  +Y   F R S QL+
Sbjct: 280 LAASTDYQ--LTYPHYKGRDYLSLTRESIAKAEKKTFESLYQAHQKEYAAYFDRASFQLA 337

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
            SP  + TD    E       A ++       +P L EL+FQ+GRYLLISSSRPGT  AN
Sbjct: 338 ESPDTLATDVLVAE-----AKAGKI-------NPHLYELMFQYGRYLLISSSRPGTMPAN 385

Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
           LQGIW   L   W+   H ++N+EMNYW +   NLSE   P+FD +  L   G+KTAQ  
Sbjct: 386 LQGIWANKLQTPWNGDYHTDVNIEMNYWPAEVTNLSEMHLPMFDLIASLVAPGTKTAQTQ 445

Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
           Y   GWV+H  T++W  +S       W +     AW+C H+ EHY +T D+DFL K+ YP
Sbjct: 446 YQKKGWVVHPITNVWGYTSPGE-SASWGMHTGAPAWICQHIGEHYRFTGDKDFL-KKMYP 503

Query: 494 LLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFS 552
           +L+G   F +DWL+ +   G L + P+ SPE+ F+APDG    +S   T D   I ++F 
Sbjct: 504 VLKGAVEFYMDWLVTDPKTGKLVSGPAVSPENTFVAPDGSQCQISMGPTHDQQTIWQLFD 563

Query: 553 AIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
               A+E L+ N DA  + V  +  +L  T+I  DG IMEW Q
Sbjct: 564 DFEMASEALQIN-DAFTQAVGDAKGKLLETRIGSDGRIMEWAQ 605


>gi|329849976|ref|ZP_08264822.1| alpha-fucosidase [Asticcacaulis biprosthecum C19]
 gi|328841887|gb|EGF91457.1| alpha-fucosidase [Asticcacaulis biprosthecum C19]
          Length = 806

 Score =  360 bits (925), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 223/620 (35%), Positives = 324/620 (52%), Gaps = 55/620 (8%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           A++ S  + L + +  PA  + +A+P+GNGRLGAMV+G V  E L+LNEDTLW G P D 
Sbjct: 25  AQAKSRPSDLTLWYAQPAGPWVEALPVGNGRLGAMVFGRVAQERLQLNEDTLWAGSPYDP 84

Query: 64  TNPDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHLKYA 120
            NP   + L+  R+L+D+ ++ +A+   +  +   P     Y   GD+ L+F   H    
Sbjct: 85  NNPGCLENLAKCRALIDAEKFKDASDLVNASMMAQPKTQMPYGAAGDLLLDF---HGLAQ 141

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---- 176
              YRR LDL+TA A   + +G   +TRE FSS  DQV+V +++    G L F++     
Sbjct: 142 PSDYRRSLDLDTAVATTTFKIGATTYTREVFSSAVDQVLVVRLTAKGKGRLDFDLGYRHP 201

Query: 177 -------------LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKAN------ANDDPKGI 217
                        +   L   +  +    +  E R         +N      AN    GI
Sbjct: 202 DQVDYGAPVYDGKVTDTLSQGAAWDKREGLSRERRPQSLAFAASSNELLVTGANIASAGI 261

Query: 218 QFSAILEIKI-SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
                  ++I +   G I+A  D  L V G+    LL+ A++SF    +   D+  DP +
Sbjct: 262 PAGLTYAVRIRAIGDGNITAAGDS-LTVRGATTVTLLIAAATSF----VRFDDTGGDPIA 316

Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
            + +AL +     Y+ L   H+  ++ LF R++I L  +     +  C+  +I       
Sbjct: 317 RT-AALNTAAAKPYAALKADHIAAHRALFRRMTIDLGNT-----SAACAATDI------- 363

Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINL 396
           R+      +DP L  L  QF RYL+ISSSRPGTQ ANLQGIWNE ++P W S   +NIN 
Sbjct: 364 RIGKSLASDDPQLAALYVQFARYLMISSSRPGTQPANLQGIWNEGVNPPWGSKYTININT 423

Query: 397 EMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG 456
           EMNYW   P N+  C EPL   +  LS+ G+KTA+V Y ASGW+ HH TD+W ++SA   
Sbjct: 424 EMNYWLVEPANIGVCVEPLVRMVEDLSMTGAKTAKVMYGASGWMAHHNTDLW-RASAPID 482

Query: 457 KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LE 515
              W +WP GGAWLC  LW+HY+Y  D +FL KR YPLL+G + F  D L+E   G  L 
Sbjct: 483 GAWWGMWPTGGAWLCKTLWDHYDYNRDPEFL-KRIYPLLKGASQFFADTLVEDPKGRGLV 541

Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
           T+PS SPE+E +   G   C      MD  IIR++F++ I+A ++L   +D    K+   
Sbjct: 542 TSPSISPENEHM--KGVATCA--GPAMDSQIIRDLFASTIAAQKLLANGDDGFTAKLAAM 597

Query: 576 LPRLRPTKIAEDGSIMEWVQ 595
             RL   +I   G + EW++
Sbjct: 598 HARLPADRIGAQGQLQEWLE 617


>gi|332662485|ref|YP_004445273.1| alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
 gi|332331299|gb|AEE48400.1| Alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
          Length = 819

 Score =  360 bits (924), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 219/589 (37%), Positives = 323/589 (54%), Gaps = 43/589 (7%)

Query: 13  LKITFN-GPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           LK+ +N      + +A+PIGNGRLGAMV+G V  ET++LNE T+W+G P    NP A  +
Sbjct: 24  LKLWYNQSSGTKWENALPIGNGRLGAMVYGNVDKETIQLNEHTVWSGSPNRNDNPAALDS 83

Query: 72  LSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
           L+++R L+  G++  A   + ++         ++Q +G + L F   H  Y+   Y REL
Sbjct: 84  LAEIRKLIFEGKHKAAERLANRVIITKKSHGQMFQPVGSLHLSFP-GHENYSN--YYREL 140

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           D+  A A+  Y+V  V +TRE  +S PD+VIV +++ S++GSLSF+ +  S      +  
Sbjct: 141 DIEKAVAKTSYTVDGVTYTREALASFPDRVIVVRLTASKAGSLSFSANYSSPQRKKVFAT 200

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGS 247
              + +         I    + ++  KG ++F  I  IK+  D G++S+  D  L V+G+
Sbjct: 201 TATKDLT--------ISGTTSDHEGVKGMVEFKGITRIKL--DGGSLSS-NDTSLTVKGA 249

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           + A L +  +++F+    N  D   D    +   L      +Y+ + T H+  YQK F R
Sbjct: 250 NSATLFISIATNFN----NYKDVSGDEEKRAADYLNKAYPKAYATILTGHIAAYQKYFKR 305

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           V + L  +P               +P  ER+K+F +  DP LV L +QFGRYLLISSS+P
Sbjct: 306 VKLDLGTTPAA------------NLPIDERLKNFSSSNDPHLVSLYYQFGRYLLISSSQP 353

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           G Q ANLQGIWN  L+P WDS   +NIN EMNYW +   NL+E   PL + +  LSI G 
Sbjct: 354 GGQPANLQGIWNNRLNPPWDSKYTININTEMNYWPAERTNLAELHRPLLEMVKELSITGQ 413

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           +TA+  Y   GW+ HH TDIW  + A  G   W +W  GGAWL  HLWEHY Y  D+ +L
Sbjct: 414 ETARTMYGTRGWMAHHNTDIWRMNGAIDG-AFWGMWTAGGAWLTQHLWEHYLYNGDKTYL 472

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
               YP L+G A F +D+LIE H  Y  L  +P  SPE+   A  G  + +   +TMD  
Sbjct: 473 AS-VYPALKGAALFYVDFLIE-HPQYKWLVVSPGNSPENAPKAHGG--SSLDAGTTMDNQ 528

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           I+ +VFS+ I  A++L K+  A V+ + +   RL P  I +   + EW+
Sbjct: 529 IVYDVFSSTIRTAQLLGKDA-AFVDTLKQLRSRLAPMHIGQHNQLQEWL 576


>gi|311746497|ref|ZP_07720282.1| alpha-L-fucosidase 2 [Algoriphagus sp. PR1]
 gi|126575394|gb|EAZ79726.1| alpha-L-fucosidase 2 [Algoriphagus sp. PR1]
          Length = 826

 Score =  360 bits (924), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 212/590 (35%), Positives = 337/590 (57%), Gaps = 43/590 (7%)

Query: 12  PLKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           PL + +  PA   +T+A+PIGNG+LGAMV+G V +E ++LNE T+W+G P    NPDA  
Sbjct: 32  PLTLWYEQPAGEVWTNALPIGNGKLGAMVYGNVENELIQLNEHTVWSGGPNRNDNPDALA 91

Query: 71  ALSDVRSLVDSGQYAEA---TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
           AL ++R L+  G+  EA    + +++        +Q +GD+ + F+  H  +    YRRE
Sbjct: 92  ALPEIRRLIFEGKQKEAEELASKTIQTKKSNGQKFQPVGDLNIAFE-GHTTFT--NYRRE 148

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY- 186
           LD+  A ++V Y V  V +TRE  +S  + VI   ++ S+ G +SF  S+ +   N S  
Sbjct: 149 LDIERAVSKVTYEVDGVVYTREAIASFAENVIAVHLTASKPGMISFIASMTTPQPNASIA 208

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVE 245
           +N +N++ + G             ++  KG I+F ++ +IK    + T +      + V+
Sbjct: 209 LNSDNELAISGTT---------TDHEGVKGKIKFKSLTKIKNIGGKLTSTG---TSIAVK 256

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            +D A + +  +++F+    N  D + D  S +   L +    S++DL   +L DYQ  F
Sbjct: 257 NADEATIYIAIATNFN----NYLDLEGDENSRAKGFLVNATTQSFNDLLKTNLVDYQNYF 312

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
           +RVS+ L             E +   +P+ ER+++F+T  DPSLV L +Q+GRYLLISSS
Sbjct: 313 NRVSLSLG------------ETDASKLPTDERLRNFRTGNDPSLVSLYYQYGRYLLISSS 360

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           +PG Q ANLQGIWN+++SP WDS   +NIN +MNYW +   NL+E  EP    ++ ++  
Sbjct: 361 QPGGQPANLQGIWNKEMSPPWDSKYTININAQMNYWPAEKTNLAELHEPFLKMVSEMAEA 420

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G +TA+V Y A GW+ HH TDIW + +     + W +W  GGAW   HLW+H+ Y+ D +
Sbjct: 421 GEETARVMYGARGWMAHHNTDIW-RITGPVDAIFWGIWSGGGAWTSQHLWDHFQYSGDME 479

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           +L K  YP+L+G A F +D+L+E  D  +L  NP TSPE+   A DG  + +   +TMD 
Sbjct: 480 YL-KSIYPILKGAAMFYVDFLVEHPDKPWLVVNPGTSPENAPAAHDG--SSLDAGTTMDN 536

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            ++ + FS +I A+E+L K + A  + +     +L P +I + G + EW+
Sbjct: 537 QLVFDAFSTVIQASELL-KIDQAFADTLQLMRDQLPPMQIGKHGQLQEWL 585


>gi|443622308|ref|ZP_21106841.1| putative Large secreted protein [Streptomyces viridochromogenes
           Tue57]
 gi|443344193|gb|ELS58302.1| putative Large secreted protein [Streptomyces viridochromogenes
           Tue57]
          Length = 973

 Score =  360 bits (923), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 219/576 (38%), Positives = 309/576 (53%), Gaps = 49/576 (8%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D  N      ++++R  V + Q+  
Sbjct: 60  ALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANIAEIRRRVFADQWGP 119

Query: 87  AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
           A    +  + G PA    YQ +G++ L F  +        Y R LDL TATA   Y +  
Sbjct: 120 AQDLINQTMLGSPAGQLAYQPVGNLRLSFGSA---TGASQYNRTLDLTTATAITTYVLNG 176

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
           V + RE F+  PDQVIV +++   + S++F  + DS             I ++G      
Sbjct: 177 VRYQREVFAGAPDQVIVVRLTADRANSIAFIATFDSPQRTTVSSPDGATIALDG------ 230

Query: 204 IPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
               + A +   G ++F A+    ++   GT+S+     L+V G+    +L+   SS+  
Sbjct: 231 ---ISGAMEGIAGRVRFLALANAAVTG--GTVSS-SGGTLRVSGATSVTMLVSIGSSY-- 282

Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
             +N   +  D    + S L + R++    L +RHL DYQ LF+RVS+ L R        
Sbjct: 283 --VNFRKADGDYQGIARSHLNAARDVGIDVLRSRHLADYQALFNRVSVDLGR-------- 332

Query: 323 TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDL 382
           T + +     P+  R+       DP    LLFQFGRYLLISSSRPGTQ ANLQGIWN+ +
Sbjct: 333 TAAADQ----PTDVRIAQHAQVNDPQFSALLFQFGRYLLISSSRPGTQPANLQGIWNDQM 388

Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIH 442
           +P+WDS   +N NL MNYW +   NLSEC  P+FD +  L++ G++ AQ  Y A GWV H
Sbjct: 389 APSWDSKFTINANLPMNYWPADTTNLSECFRPVFDMINDLTVTGARVAQAQYGAGGWVTH 448

Query: 443 HKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
           H TD W  +S   G   W +W  GGAWL T +W+HY +T D DFL    YP L+G A F 
Sbjct: 449 HNTDAWRGASVVDG-AQWGMWQTGGAWLATLIWDHYLFTGDIDFLRSN-YPALKGAAQFF 506

Query: 503 LDWLIEGHD--GYLETNPSTSPE--HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
           LD L+  H   G+L TNPS SPE  H         A V    TMD  I+R++F+++  A 
Sbjct: 507 LDTLVA-HPALGHLVTNPSNSPELAHH------TNATVCAGPTMDNQILRDLFNSVARAG 559

Query: 559 EVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           E+L  +      + L +  RL PT++   G+I EW+
Sbjct: 560 EILGADA-TFRAQALAARDRLPPTRVGSRGNIQEWL 594


>gi|418518724|ref|ZP_13084861.1| hypothetical protein MOU_18224 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|418519757|ref|ZP_13085808.1| hypothetical protein WS7_01835 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410702673|gb|EKQ61175.1| hypothetical protein MOU_18224 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|410704417|gb|EKQ62899.1| hypothetical protein WS7_01835 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 790

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 223/597 (37%), Positives = 320/597 (53%), Gaps = 50/597 (8%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
            +    L++ +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+P
Sbjct: 39  VAAAEALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98

Query: 67  DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
           DA  AL  VR+L+ +G+YAEA   A  KL   P     YQ LGD+ L+FD +        
Sbjct: 99  DALAALPQVRALIFAGRYAEAEKLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YRR+LDL+TA A   +  G     RE F     Q IV ++S    G +S  V +DS    
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVCAQAQCIVVRLSCDRPGGISLRVGIDSPQTG 215

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKK 241
                    ++  GR            N    GI+      +++      G +S + D+ 
Sbjct: 216 EVTAEPGG-LLFSGR------------NGSFAGIEGRLRFALRVLPQVSGGKLSQVRDR- 261

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L+++ +D  VLLL A++S+     +  D   DP + + + L+    L +  L   HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAARLRKAAKLDFPALLRAHLADH 317

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q+LF RV+I           D  S E +  +P+ ERV+ F    DP+L  L  Q+GRYLL
Sbjct: 318 QRLFRRVAI-----------DLGSSEAVQ-LPTDERVQRFAEGNDPALAALYHQYGRYLL 365

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I SSRPGTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L  
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           L+  G+ TA+  Y A GWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y 
Sbjct: 426 LAQTGAHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYG 484

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
            DR +L K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C   S 
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540

Query: 541 TMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            MD  ++R++F+  I+ +++L  +      +  + + LP   P +I + G + EW Q
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQ 593


>gi|21242520|ref|NP_642102.1| hypothetical protein XAC1774 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21107972|gb|AAM36638.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 790

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 223/597 (37%), Positives = 320/597 (53%), Gaps = 50/597 (8%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
            +    L++ +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+P
Sbjct: 39  VAAAEALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98

Query: 67  DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
           DA  AL  VR+L+ +G+YAEA   A  KL   P     YQ LGD+ L+FD +        
Sbjct: 99  DALAALPQVRALIFAGRYAEAEKLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YRR+LDL+TA A   +  G     RE F     Q IV ++S    G +S  V +DS    
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVCAQAQCIVVRLSCDRPGGISLRVGIDSPQTG 215

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKK 241
                    ++  GR            N    GI+      +++      G +S + D+ 
Sbjct: 216 EVTAEPGG-LLFSGR------------NGSFAGIEGRLRFALRVLPQVSGGKLSQVRDR- 261

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L+++ +D  VLLL A++S+     +  D   DP + + + L+    L +  L   HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAARLRKAAKLDFPALLRAHLADH 317

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q+LF RV+I           D  S E +  +P+ ERV+ F    DP+L  L  Q+GRYLL
Sbjct: 318 QRLFRRVAI-----------DLGSSEAVQ-LPTDERVQRFAEGNDPALAALYHQYGRYLL 365

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I SSRPGTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L  
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           L+  G+ TA+  Y A GWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y 
Sbjct: 426 LAQTGTHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYG 484

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
            DR +L K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C   S 
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540

Query: 541 TMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            MD  ++R++F+  I+ +++L  +      +  + + LP   P +I + G + EW Q
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQ 593


>gi|381169519|ref|ZP_09878684.1| hypothetical protein XMIN_121 [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380690109|emb|CCG35171.1| hypothetical protein XMIN_121 [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 790

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 223/597 (37%), Positives = 320/597 (53%), Gaps = 50/597 (8%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
            +    L++ +  PA  + +A+P+GNGRLGAMVWGG+  E L+LNEDTL+ G P D T+P
Sbjct: 39  VAAAEALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98

Query: 67  DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
           DA  AL  VR+L+ +G+YAEA   A  KL   P     YQ LGD+ L+FD +        
Sbjct: 99  DALAALPQVRALIFAGRYAEAEKLADAKLLSRPLKKMPYQPLGDLLLDFDRAD---GISD 155

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YRR+LDL+TA A   +  G     RE F     Q IV ++S    G +S  V +DS    
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVCAQAQCIVVRLSCDRPGGISLRVGIDSPQTG 215

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKK 241
                    ++  GR            N    GI+      +++      G +S + D+ 
Sbjct: 216 EVTAEPGG-LLFSGR------------NGSFAGIEGRLRFALRVLPQVSGGKLSQVRDR- 261

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L+++ +D  VLLL A++S+     +  D   DP + + + L+    L +  L   HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAARLRKAAKLDFPALLRAHLADH 317

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q+LF RV+I           D  S E +  +P+ ERV+ F    DP+L  L  Q+GRYLL
Sbjct: 318 QRLFRRVAI-----------DLGSSEAVQ-LPTDERVQRFAEGNDPALAALYHQYGRYLL 365

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I SSRPGTQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL   L  
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           L+  G+ TA+  Y A GWV+H+ TD+W ++    G   W+LWPMGG WL   LW+ ++Y 
Sbjct: 426 LAQTGTHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYG 484

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
            DR +L K  YPL +G A F +  L+ +   G + TNPS SPE++   P G   C   S 
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540

Query: 541 TMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            MD  ++R++F+  I+ +++L  +      +  + + LP   P +I + G + EW Q
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQ 593


>gi|393719778|ref|ZP_10339705.1| alpha/beta hydrolase domain-containing protein [Sphingomonas
           echinoides ATCC 14820]
          Length = 811

 Score =  359 bits (921), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 224/631 (35%), Positives = 336/631 (53%), Gaps = 73/631 (11%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           ++ A+++  ++ L++ +  PA  +T+A+P+GNGRLGAMV+G V  E L+LNEDTLW G P
Sbjct: 28  LLAAKASDASSDLRLWYRQPAGAWTEALPVGNGRLGAMVFGRVAQERLQLNEDTLWAGAP 87

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHL 117
            D  NP+A  AL +VR+L+ +G+Y +AT  AS K+ G P     Y  LGD+ L F  +H+
Sbjct: 88  YDPDNPEALAALPEVRALLAAGRYKDATDLASAKMMGKPPAQMPYGTLGDVLLTFASAHV 147

Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
                 YRRELDL +  A  ++   +  + RE  +S PDQVIV ++  +E+G+L F+++ 
Sbjct: 148 P---TVYRRELDLASGIATTEFETADGRYRREVLASAPDQVIVMRLE-AEAGTLDFDLAY 203

Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDD------------------------ 213
            +       ++       EG  P    P +    +D                        
Sbjct: 204 RA----PRAISTPRAQFSEGATPQTTRPTEWMQREDAERPGPDVTIAADGAHALLVTGSN 259

Query: 214 ------PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP 267
                 P G++++  L ++   D G I A   K + V G+    +L+ A++S+     + 
Sbjct: 260 EAALGVPAGLRYA--LRVQAVGD-GVIIA-NQKGITVSGARSVTVLITAATSYR----SY 311

Query: 268 SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEE 327
           SD+  DP     +A ++     Y  L   H+ D+  LF  V I L  SP           
Sbjct: 312 SDTGGDPVGAVRAAGRAAERKGYPALRRSHVADHAALFGGVKIDLGTSPAA--------- 362

Query: 328 NIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWD 387
               +P+  R+ +  T  DP+L  L  Q+GRYLLI+SSRPG+Q + LQGIWNE  +P W 
Sbjct: 363 ---ALPTDARIAAGATAVDPALAALYLQYGRYLLIASSRPGSQPSTLQGIWNEGTTPPWG 419

Query: 388 SAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDI 447
           S   +NIN EMNYW + P  L  C EPL   +  LS+ G++TA+  Y A GWV HH TD+
Sbjct: 420 SKYTININTEMNYWAADPGGLGLCVEPLVRMVEDLSVTGARTARTMYGARGWVAHHNTDL 479

Query: 448 WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI 507
           W +++A     +W LWP GGAWLC  L+ H+++  D   L  R YPLL+G A F +D LI
Sbjct: 480 W-RATAPIDGPLWGLWPCGGAWLCNTLFTHWDFARDPALL-ARLYPLLKGAAHFFVDTLI 537

Query: 508 EGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED 566
           E   G  L T+PS SPE+E   P G   CV     MD  I+R++F+  + A   L ++ +
Sbjct: 538 EDPKGRGLVTSPSLSPENEH--PFGSSLCV--GPAMDRQIVRDLFTNTVVAGRTLGRDGE 593

Query: 567 --ALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             A++E+V     R+ P +I   G + EW++
Sbjct: 594 WLAMLEQVGA---RIAPDRIGAGGQLQEWLE 621


>gi|116622997|ref|YP_825153.1| hypothetical protein Acid_3901 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116226159|gb|ABJ84868.1| conserved hypothetical protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 759

 Score =  359 bits (921), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 215/592 (36%), Positives = 312/592 (52%), Gaps = 73/592 (12%)

Query: 9   TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
           + +PL + +  PA  +TDA+P+GNGR+GAMV+GG   E ++ NE T+WTG P DY +  A
Sbjct: 15  SQSPLTLWYTHPADIWTDALPVGNGRMGAMVFGGAAHERIQFNEQTVWTGEPHDYAHKGA 74

Query: 69  PKALSDVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYR 125
            K+L  +R L+ +G+  EA A A  +    P     YQ LGD+ +E   +    A   Y+
Sbjct: 75  SKSLQQIRELLWAGKQKEAEALAMTEFMSEPLHQKAYQALGDLIIETPGAETPTA---YK 131

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           R LDL+T  A  +++   + + RE F+S+P   IV  ++ S+    S      +L   H+
Sbjct: 132 RSLDLDTGIAVTEFTANGITYRREVFASHPASAIVVHLTSSQPAEFS-----ATLKCAHA 186

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
              G     M G+              +   I+F + LE  I                  
Sbjct: 187 ACKGG--ATMSGQV-------------ENSAIRFDSRLEKHIDSPTS------------- 218

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
               A LLL A+++F        D   DP   +++ L +I N SY  L   H+ D+Q LF
Sbjct: 219 ----ATLLLTAATNFK----TYQDVTADPVQRNLATLVAIGNKSYDALRAEHIRDHQSLF 270

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            RV++ L  +                +P+ ER+ +F    DP+L+ LLFQFGRYL+I SS
Sbjct: 271 RRVTLDLGATAAS------------QLPTDERIAAFAKGSDPALITLLFQFGRYLMIGSS 318

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPG Q ANLQG+WNE  +P WDS    NIN EMNYW     NLSEC  PLFD L  L+ +
Sbjct: 319 RPGGQPANLQGLWNESNTPAWDSKYTDNINTEMNYWPVEETNLSECHLPLFDALKDLAQS 378

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G+ TA+  Y A GWV+HH  D+W + +A        +W  GGAWL THLWEHY +T DR+
Sbjct: 379 GAITAREQYNARGWVLHHNFDLW-RGTAPINASNHGIWQTGGAWLSTHLWEHYLFTGDRE 437

Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           FL   AYPL++G ++F +D L++    G+L T PS SPE            +    TMD 
Sbjct: 438 FLRAAAYPLMKGASTFFIDALVKDPKTGFLYTGPSNSPEQ---------GGLVMGPTMDR 488

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
            I+R +F   I+AA++L  N D  +++ L +L + + P +I + G + EW++
Sbjct: 489 EIVRSLFGETIAAAKIL--NLDPALQEQLATLRKQIAPLQIGKYGQLQEWME 538


>gi|254472686|ref|ZP_05086085.1| large secreted protein [Pseudovibrio sp. JE062]
 gi|211958150|gb|EEA93351.1| large secreted protein [Pseudovibrio sp. JE062]
          Length = 835

 Score =  358 bits (920), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 219/623 (35%), Positives = 322/623 (51%), Gaps = 64/623 (10%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDV 75
           ++ PA H+ +A+P+GNGRLGAMV+G   S  + LNEDTL++G P   Y  P+    +  V
Sbjct: 17  YDTPAAHWNEALPLGNGRLGAMVYGNPLSARIHLNEDTLYSGEPTRIYPVPEIAHQIDHV 76

Query: 76  RSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEF-DDSHLKYAEETYRRELDLNTA 133
            +L+  G+  EA     K + G     YQ +G++ +   DDS +      YRR LD+  +
Sbjct: 77  EALLRDGKLFEAQEFVRKNWTGRQGQAYQPVGNLFITMADDSPVS----NYRRALDIRHS 132

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD--NHSYVNGNN 191
                Y     +F R  F+S PD VIV +++  +  +LSFN+  DS       ++   N 
Sbjct: 133 LHHESYEQNGTKFERTSFASFPDNVIVVRLTADKPCALSFNLRYDSPHPTCRTTHEGENT 192

Query: 192 QIIMEGRCP---------------------------GKRIPPKANANDDPKG-------- 216
           ++ + G+ P                           GK  P   N  D  +G        
Sbjct: 193 RLHLRGQAPAFTSSRVIERIEHDLEQHRNPEIFGADGKLRPFAENEQDGHRGGIVYSEDG 252

Query: 217 ----IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
                 F A L +++   R      E  +L +EG+    L +  ++SF+GP  +PS   K
Sbjct: 253 LGEGTYFEAGLSVELEGGR---IRPERGELHIEGATAVTLRIAMATSFNGPDKSPSREGK 309

Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
           DP     S L +  ++SY+D+  +H DD  +LF R+S++L     D ++D         +
Sbjct: 310 DPAPIVKSILNAAGSVSYADMLQKHSDDVLRLFDRISLKLG---NDAISD---------L 357

Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
           P++ R++ FQ   DP+L  L FQ+GRYLLI+SSR G+Q  NLQGIWN    P W S   +
Sbjct: 358 PTSTRLEQFQEKGDPALAALQFQYGRYLLIASSRAGSQPPNLQGIWNNLRRPQWSSNYTM 417

Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
           NINLEMNYW +    LS+  EPLF  +  L+++G++TA+  + A GW   H T IW  S 
Sbjct: 418 NINLEMNYWPAEITGLSDLHEPLFMLIEELAVSGARTAKKMFNAPGWCAFHNTTIWRDSV 477

Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
                   A WPM   WL +H+WEH+ YT D++FL+ RAYPL++  A F   WL E  DG
Sbjct: 478 PSPCDPASAFWPMAAGWLLSHMWEHFLYTGDKEFLKNRAYPLMKSAAEFYEWWLCENKDG 537

Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
           YL    STSPE+ ++  DG +  V   STMD AIIRE F+   +AA++L  + + L   +
Sbjct: 538 YLVPKVSTSPENRYLDEDGHVITVDQGSTMDCAIIRETFANTATAAKLLGLDAE-LANTL 596

Query: 573 LKSLPRLRPTKIAEDGSIMEWVQ 595
            +   RL P +I   G + EW Q
Sbjct: 597 EEKAARLLPYQIGAQGQVQEWSQ 619


>gi|436835731|ref|YP_007320947.1| alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
 gi|384067144|emb|CCH00354.1| alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
          Length = 821

 Score =  358 bits (919), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 223/596 (37%), Positives = 327/596 (54%), Gaps = 52/596 (8%)

Query: 13  LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           LK+ +N P+ + + +A+PIGNGRLGAMV+G VP ET++LNE TLW+G P    NP+A  +
Sbjct: 24  LKLWYNTPSGQTWENALPIGNGRLGAMVYGNVPRETIQLNEHTLWSGGPNRNDNPEALAS 83

Query: 72  LSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
           L ++R L+ + +  EA A + K          ++Q +G + L FD  H  Y    Y REL
Sbjct: 84  LPEIRQLIFTNKQKEAEALANKTIITKKSHGQMFQPVGSLHLTFD-GHENYTN--YYREL 140

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY-V 187
           D+  A A+  Y+V  V +TRE  +S PDQV+V +++ S+ G L+F  S  +         
Sbjct: 141 DIERAVAKTTYTVDGVTYTREILASLPDQVLVMQLTASKPGRLAFRASYATPQAKPVIKT 200

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
           N  N++ + G          A+ +D  KG +++  I  IK     G++SA +D  L V+G
Sbjct: 201 NSTNELTIAG---------TASDHDGVKGLVRYKGIARIKTQG--GSVSA-DDSTLTVKG 248

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +  A + L  +++F    I  +D   D  + + + L +    +Y+ + T H+  YQ+ F 
Sbjct: 249 ATTATIYLSVATNF----IKYNDVSGDENARAATYLNNAFPKTYAAILTPHVAAYQRYFK 304

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RVS  L  +                +P+ ER+K+F+T  DP LV L +Q+GRYLLISSS+
Sbjct: 305 RVSFDLGST------------EAANLPTDERLKNFRTANDPQLVTLYYQYGRYLLISSSQ 352

Query: 367 PGT-----QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           PG      Q ANLQGIWN  + P WDS   +NIN +MNYW +   NL+E  EP    +  
Sbjct: 353 PGRDGVMGQPANLQGIWNNKMRPPWDSKYTININAQMNYWPAEKTNLAELHEPFLQMVRD 412

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           LS  G +TA+V Y A GW+ HH TDIW  + A  G   W +W  GG W   HLWEHY Y+
Sbjct: 413 LSETGQETARVMYGARGWMAHHNTDIWRATGAIDG-AFWGMWIAGGGWTSQHLWEHYLYS 471

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYS 539
            D+ +L    YP+L+G A F  D+L+E H  Y  L  NP +SPE+   A  G  + +   
Sbjct: 472 GDKTYLAS-VYPILKGAALFYADFLVE-HPTYHWLVANPGSSPENAPKAHGG--SSLDAG 527

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWV 594
           +TMD  I  +VF+  I AA++L+   DA     LK L  +L P  + + G + EW+
Sbjct: 528 TTMDNQIAFDVFTTTIRAADILKT--DAAFADTLKQLRSKLPPMHVGQYGQLQEWL 581


>gi|399031123|ref|ZP_10731262.1| hypothetical protein PMI10_03140 [Flavobacterium sp. CF136]
 gi|398070592|gb|EJL61884.1| hypothetical protein PMI10_03140 [Flavobacterium sp. CF136]
          Length = 821

 Score =  357 bits (916), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 216/594 (36%), Positives = 331/594 (55%), Gaps = 38/594 (6%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           A +  + + LK+ +N PA  + +A+P+GNGRLGAMV+G    E L+LNE+T+W G P   
Sbjct: 18  ASTAQSKSELKLWYNKPATIWNEALPLGNGRLGAMVFGDPAVERLQLNEETIWAGSPNSN 77

Query: 64  TNPDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYA 120
            +  + +AL  VR LV  G++ EA   A+  +     D   YQ  G   + F   H KY 
Sbjct: 78  AHTKSIEALPKVRKLVFEGKFDEAQDLATRDIMSQTNDGMPYQTFGSAYISFP-GHQKYT 136

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              Y R+LD+  A+A+VKY+V  +EFTRE  +S  DQVIV K+S S+ G ++ NV ++S 
Sbjct: 137 --NYYRDLDIENASAKVKYTVNGIEFTREILTSFSDQVIVVKLSASQPGQITANVFMNSP 194

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
           +D        NQII+ G           N       ++F   +E K  +  G +SA  + 
Sbjct: 195 IDKTVPSTEGNQIILSGVG--------TNFEGVKGKVKFQGRIEAK--NKGGEVSA-SNG 243

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            L +  +D   L +  +++F     N  D  +D  ++S   L+   +  +  +   H+  
Sbjct: 244 ILIINKADEVTLYISIATNFK----NYQDITEDEVAKSKVYLEKAISKDFETIKKAHVAY 299

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           YQK F+RV++ L  +      D   +      P+ ER++ F+ + DP L  L FQFGRYL
Sbjct: 300 YQKFFNRVALDLGSN------DAIKK------PTNERIRDFKKEFDPQLASLYFQFGRYL 347

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSS+PG Q ANLQGIWN+ ++P WDS    NIN EMNYW +   NL+E  EP      
Sbjct: 348 LISSSQPGGQPANLQGIWNDMVTPPWDSKYTTNINAEMNYWPAEVTNLTEMHEPFIQMAK 407

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            LS+ G++TA+  Y A+GWV+HH TDIW + +A        +W  GGAW+   LWE Y Y
Sbjct: 408 ELSVAGAETAKTMYNANGWVLHHNTDIW-RVTAPVDSAASGMWMTGGAWVSQDLWERYLY 466

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           T D ++L K  YP+++G A F LD++I + + GYL   PS+SPE+      GK + ++  
Sbjct: 467 TGDINYL-KEIYPVIKGAADFFLDFMITDPNTGYLVVVPSSSPENTHAGGTGK-STIASG 524

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +TMD  ++ ++FS +I A++++  +E+   +K+  +L ++ P KI +   + EW
Sbjct: 525 TTMDNQLVFDLFSNVIKASKLVAPDEN-YTKKLSDALAKMPPMKIGKHSQLQEW 577


>gi|87200424|ref|YP_497681.1| twin-arginine translocation pathway signal protein [Novosphingobium
           aromaticivorans DSM 12444]
 gi|87136105|gb|ABD26847.1| Twin-arginine translocation pathway signal [Novosphingobium
           aromaticivorans DSM 12444]
          Length = 824

 Score =  357 bits (915), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 218/593 (36%), Positives = 315/593 (53%), Gaps = 34/593 (5%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ F+ PA+ + +A+P+GNGRLGAM+ G +  E L LNEDTLW+G P       A   L 
Sbjct: 45  RLVFDSPAREWIEALPVGNGRLGAMMHGLLDGERLSLNEDTLWSGQP-SVGGAAADGLLE 103

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            +R L+ +G Y  A   + ++ GH ++ Y  L D+ ++ D +    A    RR LDL  A
Sbjct: 104 QMRDLIFAGDYPGADRLARRMQGHFSEAYLPLADLHVDLDQAGPARA---IRRTLDLREA 160

Query: 134 TARVKYSV-GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           TA V+    G +E  R  F S P Q++V +I    +     +V LD  L +        +
Sbjct: 161 TAGVEIDRDGGIE-RRTLFVSAPAQLVVFRIEREGAARFGASVRLDCQLRSSIRAVSPRR 219

Query: 193 IIMEGRCPGKRIPPKANANDDPK-------GIQFSAILEIKISDDRGTISALEDKKLKVE 245
           +++ G+ P    P   N  D  +       G+ F+AI EI   D  G++   E   L+VE
Sbjct: 220 LVLAGKAPTVCEPDYRNVPDPVRYSDRAGYGMAFAAIAEI---DTDGSVRKGE-GALRVE 275

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            + W  + L A++ + GP + P        + + + L+  R   ++ L   H  D++ L+
Sbjct: 276 NAGWLEIRLAAATGYRGPHVLPDLDPGAVEALAAAPLRRARGKPHTRLLADHRRDHRALY 335

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            R ++ L         DT      D +P+  R  +     DP+L  LL+ +GRYLLI+SS
Sbjct: 336 ERSALALGGG------DTARRH--DGLPTDARRAA--DPGDPALAALLYNYGRYLLIASS 385

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           RPGT+ ANLQGIWN  L   W      NIN+ MNYW +   NL++C  PL DF   L+ N
Sbjct: 386 RPGTRPANLQGIWNAQLRAPWSCNYTTNINVPMNYWMAETANLADCHRPLVDFAEALARN 445

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           G  TA+  Y   GW +HH TD+WA S+   A  G   WA WPMG  W+  HLWEHY ++ 
Sbjct: 446 GGDTARDYYRMPGWCLHHNTDLWAMSNPVGAGEGDPNWANWPMGAPWIAQHLWEHYRFSG 505

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           D  FL  RA+P++ G A F + WL+ +   G L T PS SPE+ F+  DG+ A +S   T
Sbjct: 506 DLAFLRDRAWPVMRGAADFCVGWLVRDPASGQLTTAPSISPENLFVTADGRTAAISAGCT 565

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEW 593
           MD+A+IRE+F   I+AA VL   EDA   KVL++L   L P +I   G + EW
Sbjct: 566 MDIAMIRELFGNCIAAAAVL--GEDAAFAKVLRNLSEELPPYRIGRHGQLQEW 616


>gi|365118140|ref|ZP_09336940.1| autotransporter-associated beta strand [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363651034|gb|EHL90117.1| autotransporter-associated beta strand [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1402

 Score =  355 bits (912), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 218/609 (35%), Positives = 338/609 (55%), Gaps = 57/609 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA ++ +A+P+GNGRL AMV+G +  +T+++NEDT W+G P +  NP+A   L
Sbjct: 26  LKLWYDRPADYWVEALPLGNGRLAAMVYGTILQDTIQINEDTYWSGSPYNNANPNAKTHL 85

Query: 73  SDVRSLVDSGQYAEATA-------ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
           + +R  ++ G+YAEA         A   + GH   +Y+ +G++ L+F +SH       Y 
Sbjct: 86  NQIREYINDGEYAEAQKIALANIIADRNITGHGM-IYESIGNLLLDFPESH--KTPTNYY 142

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH- 184
           RELDL+ A A+V Y+V  V++TRE F+S  D +I+ KIS S+ G ++FN S    L ++ 
Sbjct: 143 RELDLSNAIAKVTYTVDGVDYTREAFTSFTDDLIIIKISASKQGMVNFNTSFVGPLKSNR 202

Query: 185 -----SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA-LE 238
                  V+G N  I     PGK       A ++   +       I++  + GT SA   
Sbjct: 203 VKASTEIVSGTNNTIRVKNTPGKT------AEENIPNL-LRPTTYIRVVAEGGTQSADSS 255

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           +K LKV  +D A + + ++++    FIN  D   D  ++++S L    +  Y      H+
Sbjct: 256 NKILKVSDADVAYIYISSATN----FINYKDISGDSDAKALSYLNKF-DKDYEQAKNDHI 310

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
             YQ+ F RVS+       D+  ++  E+     P+ +R++ F    DPSL  L FQFGR
Sbjct: 311 TRYQEQFGRVSL-------DLGNNSVQEKK----PTDKRIEEFSNTNDPSLASLYFQFGR 359

Query: 359 YLLISSSRPGTQVANLQGIWNEDLS--PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSS+PG+Q ANLQGIWN +    P WDS    NIN+EMNYW +   NLSEC +P  
Sbjct: 360 YLLISSSQPGSQPANLQGIWNPNAGQYPAWDSKYTTNINVEMNYWPAEVTNLSECHQPFL 419

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
           + +  +S+ G ++A+  Y   GW +HH TD+W +S+    K    +WP   AW C+HLWE
Sbjct: 420 EMVKDVSVTGQESAETMYGCRGWTLHHNTDLW-RSTGAVDKSACGIWPTCNAWFCSHLWE 478

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE-----FIAPD 530
           HY +T D++FL +  YP+L+    F  D+LI +   GY   +PS SPE+      ++   
Sbjct: 479 HYLFTGDKEFLSE-VYPILKSACEFYQDFLITDPKTGYKVVSPSNSPENHPGLFSYVDDS 537

Query: 531 GKLACVSYSS--TMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAE 586
           G    V+  S  TMD  ++ ++    I AAE+L K+ D  A ++K+   LP   P  + +
Sbjct: 538 GNKQNVALFSGVTMDNQMVFDLLKNTIDAAEILGKDADFAADLKKLKDQLP---PMHVGK 594

Query: 587 DGSIMEWVQ 595
            G + EW++
Sbjct: 595 YGQLQEWLE 603


>gi|393788377|ref|ZP_10376507.1| hypothetical protein HMPREF1068_02787 [Bacteroides nordii
           CL02T12C05]
 gi|392656050|gb|EIY49691.1| hypothetical protein HMPREF1068_02787 [Bacteroides nordii
           CL02T12C05]
          Length = 809

 Score =  355 bits (911), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 221/618 (35%), Positives = 314/618 (50%), Gaps = 54/618 (8%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
            +  N L + +  PA+ + +A+P+GNGRLGAMV+G    E ++ NE+TL++G P      
Sbjct: 17  VNAQNDLTLWYTTPARVWEEALPLGNGRLGAMVFGDTQKERIQFNENTLYSGEPAALNRS 76

Query: 67  DA--PKALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
               P+    VR L+  G+ AEA      +  G   +VYQ  GD+  +F    +K     
Sbjct: 77  TCILPQ-YEKVRDLLKQGKNAEAEKIMQYEWIGRLNEVYQPFGDVCFDFK---MKGEVTE 132

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           Y   LD+  A    +Y  G  E  RE F+S P Q IV  +  +E   L F + L SL   
Sbjct: 133 YVHSLDMEQAVVTTRYKQGGTEILREVFASFPGQAIVIHLK-AEKPVLHFEMQLASLHPV 191

Query: 184 HSYVNGNNQIIMEGRCP---------------------------GKRIPPKANANDDPKG 216
           H    G  ++ MEGR P                           GK I  +     +  G
Sbjct: 192 HLSCEGE-RLQMEGRAPAHVQRRTIEGMRKYNTERLHPEYFDEKGKVIRTEQVIYAEDAG 250

Query: 217 IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
           + F A + + +  D G I+  +D +L V+ +     LL A++S++G   +PS + K+   
Sbjct: 251 MAFEAYV-VPLKKD-GVIT-FKDNRLVVKDASEITFLLYAATSYNGFDKSPSKAGKNIAK 307

Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
           E  +  + +    Y  +   H+ DYQ LF RV + L  SP           N    P+  
Sbjct: 308 ELQAQRKKLAGKEYQQIRNEHVADYQSLFKRVDLALPSSP-----------NQKDKPTDI 356

Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINL 396
           R+K FQT  D SL+  LFQ+GRYL+IS SRPG Q  NLQG+WN+ + P W+S    NINL
Sbjct: 357 RLKEFQTKTDLSLIAQLFQYGRYLMISGSRPGGQPLNLQGLWNDKIIPPWNSGYTTNINL 416

Query: 397 EMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG 456
           +MNYWQ+   NLSEC +PLF F+  ++ +G + A   Y  +GW+ HH   IW ++    G
Sbjct: 417 QMNYWQAEVTNLSECHQPLFTFIEEIAQSGKEAAHNMYGRNGWIAHHNMSIWREAYPADG 476

Query: 457 KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLET 516
            V W  W M G WLC+H+WEHY YT D  FL +  Y +L+  A F  +WL++   G   T
Sbjct: 477 FVHWFFWNMSGPWLCSHIWEHYLYTKDVAFL-REYYSILKESARFCSEWLVQNTKGEWVT 535

Query: 517 NPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
             STSPE+ F  PDG+ A V   STMDMAIIR +F   I AAE+L    D    K+L+  
Sbjct: 536 PVSTSPENAFRMPDGREAAVCEGSTMDMAIIRNLFGNTIHAAELL--GVDVEFRKMLEQK 593

Query: 577 PR-LRPTKIAEDGSIMEW 593
            + L   +I   G ++EW
Sbjct: 594 SKYLAGYRIGSHGQLLEW 611


>gi|182413173|ref|YP_001818239.1| hypothetical protein Oter_1354 [Opitutus terrae PB90-1]
 gi|177840387|gb|ACB74639.1| hypothetical protein Oter_1354 [Opitutus terrae PB90-1]
          Length = 1139

 Score =  355 bits (910), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 221/608 (36%), Positives = 314/608 (51%), Gaps = 47/608 (7%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           + F+ PA+HFT A P+GNGRLG M +GGV  E + LNE  +W+G P D   P+A  AL +
Sbjct: 321 VRFDAPARHFTAATPLGNGRLGLMPFGGVDEERVVLNEAGMWSGSPQDADRPNAAAALPE 380

Query: 75  VRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLKYAE 121
           +R L+ +GQ AEA     + F               P   YQ+LG++ L F  S      
Sbjct: 381 IRRLLLAGQNAEAEKVVAENFTCAGAGSGRGRGANVPYGSYQVLGELRLAFASSASGTEV 440

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             Y RELDL  A +RV Y    V F RE F S PD+V V +++ ++ G++SF ++L+   
Sbjct: 441 TNYARELDLADAVSRVSYERDGVRFEREAFVSAPDEVAVIRLTANKRGAISFELALERPE 500

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
              + V    +++M GR    R           + + F+ I  I    +RG      D  
Sbjct: 501 RATTRVLEGGRLLMSGRLSDGR---------GGENVGFATIARIV---NRGGSVESGDGV 548

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNL--SYSDLYTRHLD 299
           L+V  +D  ++L+ A++      I     +K   + + +     R+   S+  L   HL 
Sbjct: 549 LRVRAADEVLVLVTAATD-----IKSFAGRKVEDAAATAMADMDRSAQKSFGALRAAHLA 603

Query: 300 DYQKLFHRVSIQLSR----------SPKDIVTD-TCSEENIDTVPSAERVKSFQTDEDPS 348
            Y+ LF RV ++LS           SP  + TD   +E N      A  V       DP 
Sbjct: 604 HYRGLFDRVLLRLSEDGTEGGRRVPSPPQMTTDDRGAERNPRPTTQARLVAQAAGANDPG 663

Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
           L +L F FGRYLLISS+RP     NLQGIW + +   W+   H+NIN++MN+W +  C L
Sbjct: 664 LAQLYFDFGRYLLISSTRPDGFPPNLQGIWADGVQTPWNGDWHLNINVQMNFWPAEICGL 723

Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGA 468
            E  + LF F   L+  G++TA+  Y A GWV H   + W  +S   G   W     G A
Sbjct: 724 PELHDSLFSFTQSLTEPGARTARAYYGARGWVAHVLANPWGFTSPGEG-ASWGATTTGSA 782

Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFI 527
           WLC HLW+HY +T DR FLE RAYP+++G A F LD LI E   G+L T P+ SPE+EF+
Sbjct: 783 WLCQHLWDHYLFTGDRAFLE-RAYPMMKGSAEFYLDMLIEEPTHGWLVTAPANSPENEFV 841

Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
             DG  A V    T D  I+R +F+A   AA VL+ + + L  ++     RL PT+IA D
Sbjct: 842 LADGTKAHVCLGPTFDNQILRSLFTATAEAARVLDVDAE-LQRELGAKTARLPPTRIAPD 900

Query: 588 GSIMEWVQ 595
           G +MEW++
Sbjct: 901 GRVMEWLE 908


>gi|332882277|ref|ZP_08449905.1| hypothetical protein HMPREF9074_05703 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|357046479|ref|ZP_09108106.1| hypothetical protein HMPREF9441_02131 [Paraprevotella clara YIT
           11840]
 gi|332679661|gb|EGJ52630.1| hypothetical protein HMPREF9074_05703 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|355530718|gb|EHH00124.1| hypothetical protein HMPREF9441_02131 [Paraprevotella clara YIT
           11840]
          Length = 807

 Score =  355 bits (910), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 226/599 (37%), Positives = 320/599 (53%), Gaps = 59/599 (9%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           S +    LK+ ++ PAK +T+A+P+GN RLGAMV+GG   E L+LNE+T W G P D  N
Sbjct: 15  SVAWAGELKLWYSKPAKDWTEALPVGNSRLGAMVYGGTGREELQLNEETFWAGGPYDNNN 74

Query: 66  PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEET 123
            +A   L  VR+L+  G+  EA          H   + Y  +G + L+F   H +  E  
Sbjct: 75  TNALYVLPVVRNLIFQGKTREAQQLVDANFLAHKDGMSYLTMGSLFLDFP-GHEEATE-- 131

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           + R+L++  ATA  +Y V  V +TR  F+S  D VIV ++   ++G+L+F VS D+ L +
Sbjct: 132 FYRDLNIEDATATTRYKVDGVTYTRRVFASFTDSVIVVRLQADKAGALAFTVSYDAPLKH 191

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ--FSAILEIKISDDRGTISALEDKK 241
                G+   I    C GK          D +G++    A   +K+  D  TI+  E K 
Sbjct: 192 EVSAEGDLLTIT---CEGK----------DQEGVKAALRAECRVKVVSDGQTIT--EGKN 236

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           LKV G+  A L L A++++    +N  D   D  + +   LQ    + Y      H+  Y
Sbjct: 237 LKVTGATEATLYLSAATNY----VNYHDVSGDAAARADCCLQRAVQIPYKKALENHVAYY 292

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           +KLF RV + L       VT   S+E      +  R++ F    DPSL  LLFQ+GRYLL
Sbjct: 293 RKLFGRVQLDLG------VTAASSKE------TTLRIRDFSQGNDPSLATLLFQYGRYLL 340

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           ISSS+PG Q ANLQGIWN   +  WDS   +NIN EMNYW +   NLSE  +PLF  L  
Sbjct: 341 ISSSQPGGQPANLQGIWNRSTNAPWDSKYTININTEMNYWLAEVANLSEMHQPLFSMLED 400

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHY 478
           LS+ G+KTA+  Y   GWV HH TD+W       G V +A   +WP GGAWL  HLW+HY
Sbjct: 401 LSVTGAKTAREMYGCGGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHLWQHY 456

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACV 536
            +T D+DFL K  YP+L+G A F LD+L+E H  Y      PS SPEH           V
Sbjct: 457 LFTADKDFL-KTYYPVLKGTARFFLDFLVE-HPSYKWWVVAPSVSPEH---------GPV 505

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +   TMD  I+ +     + A+E++  ++ A  + + + L +L P ++   G + EW+Q
Sbjct: 506 TAGCTMDNQIVFDALRNTLLASEIV-GDDAAFRDSLAQMLDKLPPMQVGRHGQLQEWLQ 563


>gi|423223718|ref|ZP_17210187.1| hypothetical protein HMPREF1062_02373 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638093|gb|EIY31946.1| hypothetical protein HMPREF1062_02373 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 809

 Score =  355 bits (910), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 208/585 (35%), Positives = 311/585 (53%), Gaps = 41/585 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PAK +T+A+P+GN RLGAMV+GGV +E ++LNE+T+W G P    +P A   L
Sbjct: 23  LKLWYQQPAKVWTEALPLGNSRLGAMVYGGVVNEQIQLNEETVWGGGPHRNDSPKAFGVL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+ +G+  EA       F  G     +Q +G + LEFD  H  Y+   YRR+LDL
Sbjct: 83  PKVRELIFAGREKEAEKVMADNFFTGQHGMPFQTIGSLMLEFD-GHADYS--NYRRDLDL 139

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A V+Y +G V +TR  F+S  D  ++ +I   + G+++F     +    +      
Sbjct: 140 ERAVASVRYKIGEVNYTRTIFTSLVDNALIIRIETDKPGAVNFTTRYSTPYKEYEIKKNG 199

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
             +++ G        P A        I+F    +IK   ++G ++   D  ++V+G+D A
Sbjct: 200 KSLLLSGHGSAHEGIPGA--------IRFETRTQIKA--EKGKVNVTNDC-IEVKGADAA 248

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           V+ + A+++F    +N  D   + T  +   L       Y+   T H + YQKLF RVS+
Sbjct: 249 VIYVTAATNF----VNYKDVSANETRRATEFLAKAMKRPYAQALTAHEEAYQKLFGRVSL 304

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            +  S ++               ++ R+K F   +D  LV L+FQFGRYLLISSS+PG Q
Sbjct: 305 NIGPSSQE--------------ETSYRIKHFNERKDLGLVALMFQFGRYLLISSSQPGGQ 350

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            A LQGIWN +L   WD    +NIN EMNYW +   NL E  EPLF  +  LS +   TA
Sbjct: 351 PAGLQGIWNHELLAPWDGKYTININTEMNYWPAEVTNLPEMHEPLFQMVKELSESAQGTA 410

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
           +  Y   GW +HH TD+W  +    G     +WP+GGAWL  HLW+HY YT D+ FL K 
Sbjct: 411 RTLYECRGWTVHHNTDLWRMAGPVDGASY--VWPLGGAWLSQHLWQHYLYTGDQAFL-KT 467

Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
           AYP L+G A F LD+L+E    G++   PS SPE     P G    ++   TMD  I+ +
Sbjct: 468 AYPALKGAADFFLDFLVEHPKYGWMVCTPSMSPEQ---GPPGTGTMITAGCTMDTQIVLD 524

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             ++++SA ++L     +  + +   + RL P +I +   + EW+
Sbjct: 525 ALTSVLSATQLLYPANTSYRDSLQSMIKRLPPMQIGKHNQLQEWL 569


>gi|115391619|ref|XP_001213314.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114194238|gb|EAU35938.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 749

 Score =  354 bits (909), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 218/592 (36%), Positives = 317/592 (53%), Gaps = 51/592 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + +A+P+GNGRLGAMV G   +E L+LNED++W G PGD T   A + L 
Sbjct: 3   ELWYRSPAATWDEALPVGNGRLGAMVHGRTTTELLQLNEDSVWYGGPGDRTPVGASRYLQ 62

Query: 74  DVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
            +R  +  G +AEA     ++F  HP     Y+ LG + L+F   HL+     YRR LDL
Sbjct: 63  QLRQYIRKGAHAEAEELVRRVFFAHPISQRHYEPLGTLFLDF--GHLESEVTEYRRSLDL 120

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
                RV+Y    V F RE  +S+PD VI  ++  SE   + F V L  + D     N  
Sbjct: 121 QRGITRVQYMHTGVHFEREVLASHPDAVIAIRVRASEP--VEFVVRLTRMSDLEYETNEY 178

Query: 191 -NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD-DRGTISALEDKKLKVEGSD 248
            + + ++  C    + P    ++     +    + I+  D D  TI+ +  +KL V   +
Sbjct: 179 LDDVAVDDNCVTMHVTPGGRNSN-----RACCKVAIRCDDPDGATIARVGGRKLMVRARE 233

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS--DLYTRHLDDYQKLFH 306
              LLLVA+ +          + +    +  +AL     L +S  ++++RH++DYQ+L+ 
Sbjct: 234 --TLLLVAAQT----------TYRYQDIDGRAALDVADALRWSTEEIWSRHIEDYQQLYA 281

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           R+++ +S     I TD             ER+K      DP LV L   FGRYLLI+SSR
Sbjct: 282 RMTLAMSPDASHIPTD-------------ERIKH---SRDPGLVSLYHNFGRYLLIASSR 325

Query: 367 PGTQ----VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
            G       ANLQGIWN    P W S   +NINL+MNYW +  CNL+EC+ PLFD L  +
Sbjct: 326 EGNGNKVLPANLQGIWNPSFHPAWGSKYTLNINLQMNYWPANVCNLAECEMPLFDLLERI 385

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           +  G KTA   Y   GW +HH TDIWA ++     +   LWP+GGAWLC H+WE + ++ 
Sbjct: 386 ASAGQKTAHEVYGCRGWAVHHCTDIWADTAPVDQWMPATLWPLGGAWLCFHVWERFLFSK 445

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSST 541
           D  FL +R +P+L GC  FLLD+L+E   G YL T+PS SPE+ F   +G+   +   ST
Sbjct: 446 DEMFL-RRMFPVLRGCVEFLLDFLVEDATGQYLVTSPSLSPENLFYDAEGRQGVLCEGST 504

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +DM ++  VF A I +  +L  N+D LV +V  +  RL P +I   G + EW
Sbjct: 505 IDMQLVDAVFHAFIQSVNILNLNDD-LVSRVNHASERLPPARIGSFGQLQEW 555


>gi|383641029|ref|ZP_09953435.1| hypothetical protein SchaN1_11878 [Streptomyces chartreusis NRRL
           12338]
          Length = 953

 Score =  354 bits (909), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 224/590 (37%), Positives = 314/590 (53%), Gaps = 44/590 (7%)

Query: 11  NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           N   + ++ PA   +  A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D  NP   
Sbjct: 23  NDFALWYDKPAGTEWLRALPIGNGRLGAMVFGNVDNERLQLNEDTVWAGGPYDSANPRGA 82

Query: 70  KALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRR 126
             ++++R  V + Q+  A    +  + G PA    YQ +G++ L    +        Y R
Sbjct: 83  ANIAEIRRRVFADQWGPAQDLINQTMLGSPAGQLAYQPVGNLLLSLGSA---TGASQYNR 139

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            LDL TATA   Y +G V + RE F+S PDQVIV +++   + S++FN + DS       
Sbjct: 140 TLDLTTATAVTTYVLGGVRYQREVFASAPDQVIVVRLTADRANSIAFNATFDSPQRTTVS 199

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
                 I ++G                   ++F A+    ++   GT+S+     L+V G
Sbjct: 200 SPDGATIALDGVS--------GTMEGITGRVRFLALAHAAVTG--GTVSS-SGGTLRVSG 248

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +  +V +LV   S    +++      D    +   L + R++    L  RHL DYQ LF+
Sbjct: 249 AT-SVTVLV---SIGSGYVDFRRVDGDYQGIARRHLNAARDIGIDQLRKRHLADYQALFN 304

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RVS+ L R        T + +     P+  R+       DP L  LLFQFGRYLLISSSR
Sbjct: 305 RVSVDLGR--------TAAADQ----PTDVRIAQHAQANDPQLSALLFQFGRYLLISSSR 352

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PGTQ ANLQGIWN+ ++P+WDS   +N NL MNYW +   NLSEC  P+FD +  L++ G
Sbjct: 353 PGTQPANLQGIWNDQMAPSWDSKFTINANLPMNYWPADTTNLSECFLPVFDMIDDLTVTG 412

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           ++ AQ  Y A GWV HH TD W  +S  D  +  W +W  GGAWL T +W+HY +T D D
Sbjct: 413 ARVAQAQYGAGGWVTHHNTDAWRGASVVDEAR--WGMWQTGGAWLATLIWDHYLFTGDTD 470

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           FL    YP L+G A F LD L+     GYL TNPS SPE    A     A V    TMD 
Sbjct: 471 FLRSN-YPALKGAAQFFLDTLVAHPSLGYLVTNPSNSPELAHHAN----ATVCAGPTMDN 525

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            I+R++F+++  A EVL  +      + L +  RL PTK+   G++ EW+
Sbjct: 526 QILRDLFNSVARAGEVLGVDA-GFRAQALAARDRLAPTKVGSRGNVQEWL 574


>gi|431797172|ref|YP_007224076.1| hypothetical protein Echvi_1807 [Echinicola vietnamensis DSM 17526]
 gi|430787937|gb|AGA78066.1| hypothetical protein Echvi_1807 [Echinicola vietnamensis DSM 17526]
          Length = 792

 Score =  354 bits (908), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 205/588 (34%), Positives = 323/588 (54%), Gaps = 39/588 (6%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA  + +A+P+GNGRLGAMV+G   +E ++LNED++W G      +  +P  L+ +R
Sbjct: 37  YEQPAGSWEEALPVGNGRLGAMVFGQTSTERIQLNEDSMWPGAADWGDSKGSPADLASLR 96

Query: 77  SLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
           +LV SG+  EA    +  F +   V  +Q +GD+ ++F D       + YRR+L L+ A 
Sbjct: 97  ALVKSGRVHEADKEIIDKFSYRGIVRSHQTMGDLFIDFGDER---EIQHYRRQLSLDDAL 153

Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN-HSYVNGN--- 190
             V+Y  G  ++T E F+S  D  +V +++ ++   ++F + L    D+ H  VN N   
Sbjct: 154 VSVRYQSGGEQYTEEVFASAVDDALVIRLTTTDEAGMNFKLRLGRPKDDGHPTVNVNAPA 213

Query: 191 -NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
            ++++M+G     +   +        G++F   L++  S   G  S+ E+ +L++EG   
Sbjct: 214 ADELVMDGEVTQYKAAKEGQPTPLDYGVKFQTKLKVVTS---GGASSAENGELRLEGVKE 270

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           AV+ LV ++S+          + D  S++   LQ +    + +L   H +D+ + + RVS
Sbjct: 271 AVIYLVCNTSY---------YEDDYASKNEKTLQKLGTKGFDELLLAHQEDFDEYYSRVS 321

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG 368
           + L                +DT+P+ +R+K  Q   +D  L   LFQ+GRYLLISSSRPG
Sbjct: 322 LDLGGHA------------LDTLPTDKRLKRVQDGRKDEGLAAALFQYGRYLLISSSRPG 369

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           T  ANLQGIWN+D+   W++  H+NINL+MNYW + P +L E   PLFD++  L   G  
Sbjct: 370 TNPANLQGIWNKDIEAPWNADYHLNINLQMNYWPAGPTHLPEMHLPLFDYVDQLIQRGKI 429

Query: 429 TAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           TA+  Y +  G V+HH +D+WA       +  W  W  GG W+  H WE++ +T D  FL
Sbjct: 430 TAKEQYGVERGSVVHHASDLWAAPWMRANRAYWGAWIHGGGWISRHYWEYFQFTGDTTFL 489

Query: 488 EKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           ++R YP L+  A+F +DWL  +   G   + P TSPE+ ++A DG+ A +SY + M   I
Sbjct: 490 KERGYPALKEFAAFYMDWLQKDDQTGLYVSYPETSPENSYLAADGQPAAISYGAAMGHQI 549

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
           I +VF   +SAA+VL   ED   E+V   L +L P   I  DG I+EW
Sbjct: 550 ISDVFQNTLSAAKVLSI-EDDFTEEVSGKLAKLYPGVGIGPDGRILEW 596


>gi|189464329|ref|ZP_03013114.1| hypothetical protein BACINT_00670 [Bacteroides intestinalis DSM
           17393]
 gi|189438119|gb|EDV07104.1| hypothetical protein BACINT_00670 [Bacteroides intestinalis DSM
           17393]
          Length = 794

 Score =  354 bits (908), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 207/585 (35%), Positives = 312/585 (53%), Gaps = 41/585 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PAK +T+A+P+GN RLGAMV+GGV +E ++LNE+T+W G P    +P A   L
Sbjct: 8   LKLWYKQPAKVWTEALPLGNSRLGAMVYGGVVNEQIQLNEETVWGGGPHRNDSPKALGVL 67

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+ SG+  EA       F  G     +Q +G + LEF+  H  Y++  YRRELDL
Sbjct: 68  PTVRELLFSGREKEAEKVIADNFFTGQHGMPFQTIGSLMLEFE-GHADYSD--YRRELDL 124

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A V+Y +G V +TR  F+S  D  ++ +I   + G+++F     +    +      
Sbjct: 125 EKAIASVRYKIGEVNYTRTVFTSLADNALIVRIEADKPGAVNFTTRYSTPYKEYEIKKNG 184

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
             +++ G        P A        I+F    +IK   ++G ++   D  ++V+G+D A
Sbjct: 185 KSLLLSGHGSAHEGIPGA--------IRFETRTQIKA--EKGKVNVTNDC-IEVKGADAA 233

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           V+ + A+++F    +N  D   + T  +   L       Y+     H + YQKLF RVS+
Sbjct: 234 VIYVTAATNF----VNYKDVSANETRRATEFLSQAMKRPYAQALAAHEEAYQKLFGRVSL 289

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            +  S K+               ++ R+K F   +D  LV L+FQFGRYLLISSS+PG Q
Sbjct: 290 NVGASSKE--------------ETSYRIKHFNEGKDLGLVALMFQFGRYLLISSSQPGGQ 335

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            A LQGIWN +L   WD    +NIN EMNYW +   NL E  +PLF  +  LS +   TA
Sbjct: 336 PAGLQGIWNHELLAPWDGKYTININTEMNYWPAEVTNLPEMHQPLFQMVKELSESAQGTA 395

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
           +  Y   GW +HH TD+W  +    G     +WP+GGAWL  HLW+HY YT D+ FL+  
Sbjct: 396 RTLYDCRGWTVHHNTDLWRMAGPVDGASY--VWPLGGAWLSQHLWQHYLYTGDQAFLQT- 452

Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
           AYP L+G A F LD+L+E    G++   PS SPE     P G    ++   TMD  I+ +
Sbjct: 453 AYPALKGAADFFLDFLVEHPKYGWMVCAPSMSPEQ---GPPGTGTMLTAGCTMDTQIVLD 509

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             ++++SA ++L  +  +  + +   + RL P +I +   + EW+
Sbjct: 510 ALTSVLSATKLLYPDHTSYCDSLQGMIKRLPPMQIGKHNQLQEWL 554


>gi|237710563|ref|ZP_04541044.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
 gi|265750338|ref|ZP_06086401.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
 gi|345516324|ref|ZP_08795817.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
 gi|423232070|ref|ZP_17218472.1| hypothetical protein HMPREF1063_04292 [Bacteroides dorei
           CL02T00C15]
 gi|423238857|ref|ZP_17219973.1| hypothetical protein HMPREF1065_00596 [Bacteroides dorei
           CL03T12C01]
 gi|423246621|ref|ZP_17227674.1| hypothetical protein HMPREF1064_03880 [Bacteroides dorei
           CL02T12C06]
 gi|229433914|gb|EEO43991.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
 gi|229455285|gb|EEO61006.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
 gi|263237234|gb|EEZ22684.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
 gi|392625607|gb|EIY19671.1| hypothetical protein HMPREF1063_04292 [Bacteroides dorei
           CL02T00C15]
 gi|392635319|gb|EIY29221.1| hypothetical protein HMPREF1064_03880 [Bacteroides dorei
           CL02T12C06]
 gi|392647735|gb|EIY41433.1| hypothetical protein HMPREF1065_00596 [Bacteroides dorei
           CL03T12C01]
          Length = 819

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 222/589 (37%), Positives = 324/589 (55%), Gaps = 42/589 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PA  + +A+P+GN R+GAMV+GG   E L+LN++T+W G P     P+A K+L
Sbjct: 23  LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDRPEALKSL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+ +G+  EA       F  G     YQ +G + +E    H K  +  Y R+LDL
Sbjct: 83  PQVRELIFAGKNMEAQNLIQDNFYAGKHGMPYQTIGSLIIE-TPGHEKVTD--YYRDLDL 139

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A  +Y V  V F RE F+S PD+VIV +++    G L+F V   S L+ H      
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVIVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
            ++++ G+              D +G++    +E +   D  G    ++D+ + VEG+D 
Sbjct: 199 KKLVLTGK------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           +V L V+S +    FIN  D   + + ++   L       YS +   H+  Y++ F RV 
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L          T     ++TV   +R++ F   +D SL  LLFQ+GRYLLISSS+PG 
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN  L+  WD    +NIN EMNYW +   NLSE  +PLF+ +  LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+  Y  +GWV HH TDIW +++    K  +  WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469

Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA-CVSYSSTMDMAII 547
            AYP L+G A F LD+LIE  + G++ T PS SPEH     D K A  +    TMD  II
Sbjct: 470 -AYPALKGAADFYLDYLIEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVAGCTMDNQII 528

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
            +V S  + A+ +L+ +  A  +  L+S L RL P +I +   + EW++
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLE 575


>gi|212695001|ref|ZP_03303129.1| hypothetical protein BACDOR_04539 [Bacteroides dorei DSM 17855]
 gi|212662454|gb|EEB23028.1| hypothetical protein BACDOR_04539 [Bacteroides dorei DSM 17855]
          Length = 819

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 222/589 (37%), Positives = 324/589 (55%), Gaps = 42/589 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PA  + +A+P+GN R+GAMV+GG   E L+LN++T+W G P     P+A K+L
Sbjct: 23  LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDRPEALKSL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+ +G+  EA       F  G     YQ +G + +E    H K  +  Y R+LDL
Sbjct: 83  PQVRELIFAGKNMEAQNLIQDNFYAGKHGMPYQTIGSLIIE-APGHEKVTD--YYRDLDL 139

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A  +Y V  V F RE F+S PD+VIV +++    G L+F V   S L+ H      
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVIVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
            ++++ G+              D +G++    +E +   D  G    ++D+ + VEG+D 
Sbjct: 199 KKLVLTGK------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           +V L V+S +    FIN  D   + + ++   L       YS +   H+  Y++ F RV 
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L          T     ++TV   +R++ F   +D SL  LLFQ+GRYLLISSS+PG 
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN  L+  WD    +NIN EMNYW +   NLSE  +PLF+ +  LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+  Y  +GWV HH TDIW +++    K  +  WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469

Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA-CVSYSSTMDMAII 547
            AYP L+G A F LD+LIE  + G++ T PS SPEH     D K A  +    TMD  II
Sbjct: 470 -AYPALKGAADFYLDYLIEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVAGCTMDNQII 528

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
            +V S  + A+ +L+ +  A  +  L+S L RL P +I +   + EW++
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLE 575


>gi|224536380|ref|ZP_03676919.1| hypothetical protein BACCELL_01254 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522018|gb|EEF91123.1| hypothetical protein BACCELL_01254 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 793

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 207/585 (35%), Positives = 312/585 (53%), Gaps = 41/585 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PAK +T+A+P+GN RLGAMV+GGV +E ++LNE+T+W G P    +P A   L
Sbjct: 7   LKLWYQQPAKVWTEALPLGNSRLGAMVYGGVVNEQIQLNEETVWGGGPHRNDSPKAFGVL 66

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+ +G+  EA       F  G     +Q +G + LEFD  H  Y+   YRR+LDL
Sbjct: 67  PKVRELIFAGREKEAEKVMADNFFTGQHGMPFQTIGSLMLEFD-GHADYS--NYRRDLDL 123

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A V+Y +G V +TR  F+S  D  ++ +I   + G+++F     +    +      
Sbjct: 124 ERAVASVRYKIGEVNYTRTIFTSLVDNALIIRIEADKPGAVNFTTRYSTPYKEYEIKKNG 183

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
             +++ G        P A        I+F    +IK   ++G ++ + +  ++V+G+D A
Sbjct: 184 KSLLLSGHGSAHEGIPGA--------IRFETRTQIKA--EKGKVN-VTNNCIEVKGADAA 232

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           V+ + A+++F    +N  D   + T  +   L       Y+   T H + YQKLF RVS+
Sbjct: 233 VIYVTAATNF----VNYKDVSANETRRATEFLVKAMKRPYAQALTAHEEAYQKLFGRVSL 288

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            +  S ++               ++ R+K F   +D  LV L+FQFGRYLLISSS+PG Q
Sbjct: 289 NIGPSSQE--------------ETSYRIKHFNERKDLGLVALMFQFGRYLLISSSQPGGQ 334

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            A LQGIWN +L   WD    +NIN EMNYW +   NL E  EPLF  +  LS +   TA
Sbjct: 335 PAGLQGIWNHELLAPWDGKYTININTEMNYWPAEVTNLPEMHEPLFQMVKELSESAQGTA 394

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
           +  Y   GW +HH TD+W  +    G     +WP+GGAWL  HLW+HY YT D+ FL K 
Sbjct: 395 RTLYECRGWTVHHNTDLWRMAGPVDGASY--VWPLGGAWLSQHLWQHYLYTGDQAFL-KT 451

Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
           AYP L+G A F LD+L+E    G++   PS SPE     P G    ++   TMD  I+ +
Sbjct: 452 AYPALKGAADFFLDFLVEHPKYGWMVCAPSMSPEQ---GPPGTGTMITAGCTMDTQIVLD 508

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             ++++SA ++L     +  + +   + RL P +I +   + EW+
Sbjct: 509 ALTSVLSATQLLYPANTSYRDSLQSMIKRLPPMQIGKHNQLQEWL 553


>gi|260910947|ref|ZP_05917588.1| fibronectin type III domain protein [Prevotella sp. oral taxon 472
           str. F0295]
 gi|260634938|gb|EEX52987.1| fibronectin type III domain protein [Prevotella sp. oral taxon 472
           str. F0295]
          Length = 792

 Score =  353 bits (906), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 216/593 (36%), Positives = 329/593 (55%), Gaps = 36/593 (6%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP-- 69
           P K+ ++ PA  F +A+PIGNG+LGAMV+G V ++ L LN+ TLW+G P D  N DA   
Sbjct: 24  PQKLWYDKPATFFEEALPIGNGKLGAMVYGDVWNDNLFLNDLTLWSGQPID-PNEDAGAH 82

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSH--LKYAEETYRRE 127
           K + ++R  +    Y  A +  +++ GH +  YQ L  + ++  +S    + + + YRRE
Sbjct: 83  KWIPEIRKALFEENYKLADSLQLRVQGHNSAWYQPLSIVSIQPINSQGSSQASIKNYRRE 142

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           LDL++A A+V Y +  V + RE+ +++PD+ I+ +++ S+  +L+  +SL S+L +    
Sbjct: 143 LDLDSALAKVSYEIDGVTYRREYLATHPDRAILLRLTASKPRALNLRLSLTSILSH---- 198

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
                   + R  G  I    +A   P   + F  +L+ K +D  GTI+A +D  L +  
Sbjct: 199 --------QLRAEGDLIRLTGHAMGHPDSTVHFCNLLQAKATD--GTITA-QDTTLLINN 247

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +   VL LV  +S++G   +P          + + L+S+++ S+  L   HLDDYQ LF 
Sbjct: 248 ATQVVLYLVNETSYNGFDKHPVTQGAPYVQLAEADLKSLQDCSFEQLKQNHLDDYQALFG 307

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RVS+QL  +  D    T  ++ +D     E         +P L  L FQFGRYLLISSSR
Sbjct: 308 RVSLQLGGAQFD-TNRTTEQQLLDYTDKCE--------ANPYLEALYFQFGRYLLISSSR 358

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
                ANLQG+WN  L   W S   VNINLE NYW +   NL+E   PL   +  LS+NG
Sbjct: 359 TPGVPANLQGLWNPHLKAQWRSNYTVNINLEENYWPAQVANLAEMTMPLTGMVKALSVNG 418

Query: 427 SKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
              A+  Y +  GW   H TD+WA ++     R    WA W +GGAWL ++LWE Y++T 
Sbjct: 419 RYAARNYYGINEGWCSSHNTDLWAMTNPVGEKRESPEWADWNLGGAWLLSNLWEQYDFTR 478

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
           DR++L +  +PL++G   F+L WLI      G L T PSTSPE+E++ P+G      Y  
Sbjct: 479 DRNYLRETLFPLMKGACDFMLQWLIGNPKKPGELITAPSTSPENEYVTPEGYHGTTMYGG 538

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           T D+AI+RE+F+   +A E L     A  +K+ +++ RL P  I ++G + EW
Sbjct: 539 TADLAILRELFANTATADETLNGRPTAYSKKLRQTIARLHPYTIGKEGDLNEW 591


>gi|354584080|ref|ZP_09002977.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
 gi|353197342|gb|EHB62835.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
          Length = 844

 Score =  353 bits (905), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 213/615 (34%), Positives = 329/615 (53%), Gaps = 55/615 (8%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           S +   PL++ +  PA  + +A+PIGNGRLG MV+G    E ++LNED+LW G PG   N
Sbjct: 31  SGAVERPLRLWYTSPAAEWNEALPIGNGRLGGMVFGRTGLERVQLNEDSLWYGGPGRGGN 90

Query: 66  PDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEE 122
           P+A   L D+R L+  G+ AEA   A + +   P     YQ LGD+ L+F ++       
Sbjct: 91  PNAIPYLGDIRQLLQDGRQAEAEHLARMAMTSSPKYEQPYQPLGDLLLKFLNAEAPATH- 149

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLL 181
            Y RELDL  + A V Y+ G + + R++F+S PD V+V +++    GSL+F  +L     
Sbjct: 150 -YERELDLQRSMAAVTYTSGGITYRRQYFASAPDGVLVIRLTADRPGSLTFAANLMRRPF 208

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
           D  +   GN+ + M+G         +A A+    G+ F A L  + + + G I  + D  
Sbjct: 209 DCGTRSIGNDTLTMKG---------EAGAD----GVSFCASL--RGAAEGGNIRIIGDF- 252

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           + VEG+D   LLL A ++F           + P    +  L    ++ Y  L++RH+++Y
Sbjct: 253 MSVEGADAVTLLLSAQTTF---------RCRKPEEMCLQQLDHASSIPYERLFSRHVEEY 303

Query: 302 QKLFHRVSIQL---------SRSPKDI----------VTDTCSEENIDTVPSAERVKSFQ 342
           ++ F R S++L         +  P D           V+++ +    ++    E      
Sbjct: 304 REKFGRFSLKLEVDAGARDYASLPTDQRLNLLKERVRVSNSGANPEGNSGADPEGNSGAY 363

Query: 343 TDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
            D+DP L+EL  Q+GRYLL+SSSRPG+  ANLQGIWN+  +P W+S   +N N++MNYW 
Sbjct: 364 PDDDPGLIELYVQYGRYLLLSSSRPGSLAANLQGIWNDSFTPPWESKYTINANIQMNYWP 423

Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
           +    L EC EPLFD +  +  NG KTA   Y   G+  HH T++W ++  +   +   +
Sbjct: 424 AELLGLPECHEPLFDLIHRMLPNGRKTAGEMYGCRGFAAHHNTNVWGETRPEGILMTCTV 483

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
           WPMG AWLC HLWEH  +  D DFL  RAYP+++  A FLLD++    +G   T PS SP
Sbjct: 484 WPMGAAWLCLHLWEHVRFGGDADFLRDRAYPVMKEAAIFLLDYMTIDGEGRRITGPSVSP 543

Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLR 580
           E+ F+ PDG +  +    +MD  I   +  A + A  +L ++   L  +E  ++++P   
Sbjct: 544 ENRFVLPDGAVGSLCMGPSMDSQIAHALLQACLEAGRLLGEDTRFLDELEAAIRNIP--- 600

Query: 581 PTKIAEDGSIMEWVQ 595
             +I   G IMEW++
Sbjct: 601 APQIGRHGGIMEWLE 615


>gi|119491166|ref|XP_001263205.1| hypothetical protein NFIA_064720 [Neosartorya fischeri NRRL 181]
 gi|119411365|gb|EAW21308.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 744

 Score =  352 bits (904), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 211/589 (35%), Positives = 322/589 (54%), Gaps = 47/589 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA ++ +A+P+GNGRLGAMV+G   +E L+LNED++W G P +    DA + L 
Sbjct: 3   ELWYQQPAGNWEEALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQNRVPRDAFECLP 62

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
            +RSL+  G +AEA     +  F HP     Y+ LG + L+F   H     + YRR LD+
Sbjct: 63  RLRSLIREGNHAEAEKLVRLAFFSHPISQRHYEPLGTLFLDF--GHAPEYMQNYRRSLDI 120

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD--NHSYVN 188
             AT+RV+Y    V+  RE  +SNPD VI  +I  S+    +  ++  S L+   + Y++
Sbjct: 121 ERATSRVEYEHKGVKVRREVIASNPDGVIAIRIQASQKTEFALRLTRMSELEYETNEYLD 180

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
               +  E R     I P  +     K  +   + +++ +DD+ +++ + +K L V   D
Sbjct: 181 ---DVTAEDRTITMHITPGGH-----KSNRACCMAKVRTADDQDSVTQIGNKLL-VNAQD 231

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
            A++L+ A +++        D  K+ +S+  +AL      S  +++ RH++DY+ L+ R+
Sbjct: 232 -ALVLISAQTTY-----RCDDIDKEASSDLETALLH----STDEIWERHVNDYRSLYGRM 281

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
            + LS +  D+ TD                K  +   DP L+ L   + RYLLIS SR  
Sbjct: 282 ELHLSPNNCDMPTD----------------KRIKNSRDPGLIALYHNYCRYLLISCSRNE 325

Query: 369 TQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
            +   A LQGIWN    P W     +NINL+MNYW +  CNLS+C+ PLF  L  ++ +G
Sbjct: 326 DKALPATLQGIWNPSFHPAWGCKYTININLQMNYWPANICNLSDCEMPLFSLLERVAKSG 385

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            + AQ  Y   GWV HH TDIWA +S     +   LWP+GGAWLC H+W+H+ +T D+ F
Sbjct: 386 EEAAQTMYGCRGWVAHHCTDIWADTSPVDTWMPATLWPLGGAWLCVHIWDHFRFTRDKGF 445

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L+ R +P+L+GC  FLLD+L+E   G YL TNPS SPE+ F   +G+   +   ST+D+ 
Sbjct: 446 LQ-RMFPILQGCVQFLLDFLVEDASGEYLVTNPSLSPENTFYDKNGERGVLCEGSTIDIQ 504

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           I+  V SA + + E LE  E  L    L +L RL P +I   G + EW 
Sbjct: 505 IVNAVLSAYLKSVEELEI-EAKLAPAALDALHRLPPLRIGSYGQLQEWA 552


>gi|390943730|ref|YP_006407491.1| hypothetical protein Belba_2169 [Belliella baltica DSM 15883]
 gi|390417158|gb|AFL84736.1| hypothetical protein Belba_2169 [Belliella baltica DSM 15883]
          Length = 836

 Score =  352 bits (904), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 209/587 (35%), Positives = 326/587 (55%), Gaps = 42/587 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ ++ PA+ + +A+PIGNGRLGAMV+G    E ++LNE+T + G P    NP+A KAL
Sbjct: 45  MKLWYDRPAQQWVEALPIGNGRLGAMVFGNPQEEVIQLNENTFYAGHPYRNDNPNALKAL 104

Query: 73  SDVRSLVDSGQYAEAT-AASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+  G+Y +A        FG P  + YQ +G+++L++ D       E Y RELDL
Sbjct: 105 EGVRKLIFDGEYVQAQDTIDQNFFGGPHGMPYQTIGNLKLKYQDES---EVENYYRELDL 161

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A    ++    V F+ +  SS PDQVIV KI+  +  S+SF+ ++D          G 
Sbjct: 162 EYAVVSNRFKKSGVNFSTKIISSFPDQVIVAKITADKPKSISFSATMDRPGPFEITTTGE 221

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKLKVEGSD 248
           +Q+IM G             + D +GI+ +   +  +K  +  G+I + E+K++ +  +D
Sbjct: 222 DQLIMSG------------ISSDHEGIKGAVKFQANVKFVNKNGSIKS-ENKEIIISEAD 268

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
              + +  +++F    +N  D   D + +S S L+      +  +Y +H+ DY+ LF RV
Sbjct: 269 EVTIYISIATNF----VNYKDISADASEKSTSLLEKAIENDFERIYKKHVTDYRNLFDRV 324

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
            + L +S  D V           +P+ +R+  F    D  L  L FQFGRYLLI++SRPG
Sbjct: 325 QLDLGKS--DAVN----------LPTDKRIAQFAEGNDAHLAALYFQFGRYLLIAASRPG 372

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
            Q ANLQGIWN  ++P WDS   VNIN EMNYW +   NLSE  EP       LS +G +
Sbjct: 373 GQPANLQGIWNHQMNPAWDSKYTVNINAEMNYWPAEITNLSELHEPFIQMAKDLSESGQQ 432

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           TA+  Y A GWV+HH TD+W + +         +WP+GGAW+  HL+E Y+++ D  +L 
Sbjct: 433 TARNMYGARGWVLHHNTDLW-RVTGPIDFAAAGMWPLGGAWVSQHLFEKYDFSGDEKYL- 490

Query: 489 KRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
           K  YP+ +  A+F LD+L++    G+   +PS SPE+  I      + V+  +TMD  ++
Sbjct: 491 KSVYPVAKEAATFFLDFLVKDPQTGFWVVSPSVSPEN--IPYQFHNSAVAAGNTMDNQLV 548

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            ++F+  I AAE+L  +ED L+ ++ + L  L P +I + G + EW+
Sbjct: 549 FDLFTKTIRAAEIL-GDEDDLINEMKEKLSMLPPMQIGKWGQLQEWM 594


>gi|383114822|ref|ZP_09935584.1| hypothetical protein BSGG_1004 [Bacteroides sp. D2]
 gi|313693469|gb|EFS30304.1| hypothetical protein BSGG_1004 [Bacteroides sp. D2]
          Length = 812

 Score =  352 bits (903), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 210/612 (34%), Positives = 322/612 (52%), Gaps = 52/612 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + +  PAK + +A+P+GNGRLGAM++G    E ++ NE+TL++G P    + +    L
Sbjct: 24  LTLWYKSPAKVWEEALPVGNGRLGAMIFGEPQKERIQFNENTLYSGEPETPKDINVASDL 83

Query: 73  SDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
             +R L++ G+  EA      K  G   + YQ  GD+ +EF     K A   Y   LD+N
Sbjct: 84  GHIRQLLNEGKNTEAGNIIQQKWIGRLNEAYQPFGDLYIEFAS---KGAITDYIHSLDMN 140

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
            +     Y    +   RE F+S P Q I+  +S S+   L+F   L+S    H     ++
Sbjct: 141 NSIVTTSYKQNGIAIRREVFASYPAQAIIIHLSASKP-VLNFTAHLES---PHPVTQDSD 196

Query: 192 Q--IIMEGRCPG---------------KRIPPKANANDDPKGIQFSAILEIKISDDRGT- 233
              I ++G+ P                +R+ P+   +     IQ   ++       +GT 
Sbjct: 197 SQAIYLKGQAPAHAQRRDIEHMKRFNTQRLHPEY-FDQTGHVIQKKQVIYGNELGGKGTF 255

Query: 234 -----ISALEDKKLKVEGSDW-------AVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
                +S+ +D KL +E + +         L+L A++S++G   +PS   K+P  E  + 
Sbjct: 256 FEACLLSSHKDGKLVIENNQFIAQDCSEVTLVLYAATSYNGLHKSPSKEGKNPHQEINNY 315

Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
            +     SY  L   H+ DYQ LF RVS  L            + + +   P+ +R+K F
Sbjct: 316 RKISEKHSYKKLKEEHITDYQSLFKRVSFNLH-----------TNKQLKKTPTDQRLKLF 364

Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
           +  ED +++  LFQFGRYL+I+ SR   Q  NLQG+WN ++ P W+S   +NINLEMNYW
Sbjct: 365 KKKEDQTIITQLFQFGRYLMIAGSRGEGQPLNLQGLWNNEVLPPWNSGYTLNINLEMNYW 424

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
            +   NLSEC +PLF  +  ++  G   A+  Y  +GW IHH   IW ++    G V W 
Sbjct: 425 PAEVTNLSECHQPLFKLIEEIADKGKNLARDMYGLNGWAIHHNISIWREAYPSDGFVYWF 484

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTS 521
            W M G WLC H+WEHY YT D DFL K+ YP+L+G A+F  +WL+E  +G L T  STS
Sbjct: 485 FWNMSGPWLCNHIWEHYLYTKDIDFL-KKYYPILKGSATFCSEWLVENSEGELVTPVSTS 543

Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
           PE+ ++ PDG  A V   STMD+AIIR +FS  I+A++VL+  +     ++ + + +L+ 
Sbjct: 544 PENAYLMPDGISASVCEGSTMDIAIIRSLFSNTINASKVLQ-TDSLFCAELTQKVNKLKK 602

Query: 582 TKIAEDGSIMEW 593
            +I   G ++EW
Sbjct: 603 YQIGSKGQLLEW 614


>gi|373952811|ref|ZP_09612771.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
 gi|373889411|gb|EHQ25308.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
          Length = 833

 Score =  352 bits (903), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 216/602 (35%), Positives = 325/602 (53%), Gaps = 50/602 (8%)

Query: 6   STSTTNP-----LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
           + + TNP     L++ +N P+ K + +A+PIGNGRLGAM++G V  ET++LNE TLW+G 
Sbjct: 26  AKAQTNPKDQTTLRLWYNKPSGKVWENALPIGNGRLGAMIYGNVGVETIQLNEHTLWSGG 85

Query: 60  PGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSH 116
           P    NP A  +L+ +R L+ +G+  +A   + K+         +++  G++ L F++  
Sbjct: 86  PNRNDNPLALDSLAAIRKLIFNGKQKQAEQLANKVIISKKSQGQIFEPAGELYLAFNNQE 145

Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
                  Y RELD+  A ++  Y VG+V FTRE F+S PD+VIV  ++ S+ GS+SF   
Sbjct: 146 ---NYTNYYRELDIEKAISKTSYQVGDVSFTREAFASIPDRVIVMHLTASKPGSISFTAF 202

Query: 177 LDSLLDNHSYVNGN-NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTI 234
             S   + +       QI   G             ++  KG +++  I E K +   GT 
Sbjct: 203 YSSPQHDVAVATFQARQITFAGTTID---------HEGVKGMVRYKGIAEFKTNG--GTK 251

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
           SA  D  + + G++   + +  +++F+    N  D   + T  + + L      SY++L 
Sbjct: 252 SA-TDTSVTIYGANDVTIYISIATNFN----NYHDLGGNETERAANYLNKASGKSYTELQ 306

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+  YQK F+RV   L  +            +I  +P+ ER+K+F   +DP    L F
Sbjct: 307 KTHIAAYQKYFNRVRFSLGAA------------DISKLPTDERLKNFNQGQDPQFAALYF 354

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           Q+GRYLLISSS+PG Q ANLQGIWN  L P WDS   +NIN EMNYW +   NL E  EP
Sbjct: 355 QYGRYLLISSSQPGGQPANLQGIWNNKLYPAWDSKYTININAEMNYWPAEKTNLPEIHEP 414

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
               +  L++NG +TA+V Y A GW+ HH TDIW  + A  G   W +W  GG W   HL
Sbjct: 415 FLQMVKELAVNGEQTAKVMYGARGWMAHHNTDIWRATGAVDG-AFWGIWNQGGGWTSEHL 473

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGK 532
           WEHY Y  D+D+L +  Y +L G A F +D+L+E   H  +L  NP  SPE+   A  G 
Sbjct: 474 WEHYLYNGDKDYL-RSVYGVLRGAALFYVDFLVEQPVHH-WLVINPDMSPENAPAAHQG- 530

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            + +   +TM   I+ +VFS+ I AAE+L  ++   V+ + +   +L P  I + G + E
Sbjct: 531 -SSLDAGTTMSNQIVFDVFSSTIRAAEILNIDK-PFVDTLKQMRSKLSPMHIGQFGQLQE 588

Query: 593 WV 594
           W+
Sbjct: 589 WL 590


>gi|315607320|ref|ZP_07882320.1| possible alpha-L-fucosidase [Prevotella buccae ATCC 33574]
 gi|315251023|gb|EFU31012.1| possible alpha-L-fucosidase [Prevotella buccae ATCC 33574]
          Length = 787

 Score =  352 bits (903), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 210/604 (34%), Positives = 327/604 (54%), Gaps = 46/604 (7%)

Query: 2   MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           M A   S  +P K+ +  PA+ +TDA+P+GNGRLGAMV+G   +E ++LNE+T+W G P 
Sbjct: 16  MLASLFSQAHPYKLWYREPAQVWTDALPLGNGRLGAMVYGIPATERIQLNEETIWAGQPN 75

Query: 62  DYTNPDAPKALSDVRSLVDSGQYAEAT------AASVKLFGHPADVYQLLGDIELEFDDS 115
              N  A KA+  ++ L+  G+Y +A         S   +G P   YQ  G++ +     
Sbjct: 76  GNANAKALKAIPVIQDLIWKGEYKKAQDLATSDVMSATNWGMP---YQAFGNVYISMPGM 132

Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
                   Y REL L++A A  +++   V + RE  +S  D V+  + +  + G ++FN 
Sbjct: 133 G---NYTNYYRELSLDSARAITRWTANGVTYRREVITSLADNVVTVRFTADQRGCITFNA 189

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSA-ILEIKISDDRGT 233
              +  D+         I+++       +    + ++  KG ++F   +  +      G 
Sbjct: 190 YFTTPHDD---------IMIKSEGDEATLFGVTSKHEGLKGKVRFMGRMAAVAKGKGEGA 240

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
           ++  +D  + V+G+D AVL +  +++F+    N  D   D    S   L++     Y+  
Sbjct: 241 VTHSKDGIVSVKGADEAVLYISIATNFN----NYKDISGDEAVRSEQILRAAMARDYAQS 296

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+  +++L HRV++ L             E+    +P+ ER+  F   +D  LV   
Sbjct: 297 KAEHISRFRQLMHRVTLNLG------------EDQYKDLPTDERIIRFADRDDNYLVATY 344

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P WDS    NINLEMNYW + P  L+E  E
Sbjct: 345 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPAWDSKYTTNINLEMNYWPAEPTQLTELTE 404

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCT 472
           PLF  +  +S  G+KTA+  Y  SGWV+HH TDIW  +   D  +    +W  GGAWLC 
Sbjct: 405 PLFRLIREVSETGAKTARTMYGKSGWVLHHNTDIWCVTGGIDHAQS--GMWMTGGAWLCR 462

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
           HLWEHY YTMD+DFL +R YP+++G A FL   LI E   G+L  +PS SPE+   + DG
Sbjct: 463 HLWEHYLYTMDKDFL-RRYYPVMKGAAEFLDQMLIPEPQHGWLVISPSVSPENSHPSKDG 521

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
           K+A +S  +TMD+ ++ E+F  +++A++VL ++  AL     + L  + P ++ + G + 
Sbjct: 522 KVA-ISAGTTMDVQLVNELFREVMAASKVLGEDA-ALAAHYAERLKLMPPMQVGKWGQLQ 579

Query: 592 EWVQ 595
           EW++
Sbjct: 580 EWME 583


>gi|423311596|ref|ZP_17289533.1| hypothetical protein HMPREF1058_00145 [Bacteroides vulgatus
           CL09T03C04]
 gi|392690241|gb|EIY83511.1| hypothetical protein HMPREF1058_00145 [Bacteroides vulgatus
           CL09T03C04]
          Length = 819

 Score =  352 bits (903), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 220/589 (37%), Positives = 324/589 (55%), Gaps = 42/589 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PA  + +A+P+GN R+GAMV+GG   E L+LN++T+W G P     P+A ++L
Sbjct: 23  LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDKPEALESL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+ +G+  EA     + F  G     YQ +G + +E    H K  +  Y R+LDL
Sbjct: 83  PQVRELIFAGKNMEAQNLIQENFYAGKHGMPYQTIGSLIIE-TPGHEKVTD--YYRDLDL 139

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A  +Y V  V F RE F+S PD+V+V +++    G L+F V   S L+ H      
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVVVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
            ++++ GR              D +G++    +E +   D  G    ++D+ + VEG+D 
Sbjct: 199 KKLVLTGR------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           +V L V+S +    FIN  D   + + ++   L       YS +   H+  Y++ F RV 
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L          T     ++TV   +R++ F   +D SL  LLFQ+GRYLLISSS+PG 
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN  L+  WD    +NIN EMNYW +   NLSE  +PLF+ +  LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+  Y  +GWV HH TDIW +++    K  +  WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469

Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA-CVSYSSTMDMAII 547
            AYP L+G A F LD+L E  + G++ T PS SPEH     D K A  +    TMD  II
Sbjct: 470 -AYPALKGAADFYLDYLTEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVAGCTMDNQII 528

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
            +V S  + A+ +L+ +  A  +  L+S L RL P +I +   + EW++
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLE 575


>gi|319640719|ref|ZP_07995432.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
 gi|345517731|ref|ZP_08797196.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
 gi|254836837|gb|EET17146.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
 gi|317387531|gb|EFV68397.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
          Length = 819

 Score =  352 bits (903), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 220/589 (37%), Positives = 324/589 (55%), Gaps = 42/589 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PA  + +A+P+GN R+GAMV+GG   E L+LN++T+W G P     P+A ++L
Sbjct: 23  LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDKPEALESL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+ +G+  EA     + F  G     YQ +G + +E    H K  +  Y R+LDL
Sbjct: 83  PQVRELIFAGKNMEAQDLIQENFYAGKHGMPYQTIGSLIIE-TPGHEKVTD--YYRDLDL 139

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A  +Y V  V F RE F+S PD+V+V +++    G L+F V   S L+ H      
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVVVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
            ++++ GR              D +G++    +E +   D  G    ++D+ + VEG+D 
Sbjct: 199 KKLVLTGR------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           +V L V+S +    FIN  D   + + ++   L       YS +   H+  Y++ F RV 
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L          T     ++TV   +R++ F   +D SL  LLFQ+GRYLLISSS+PG 
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN  L+  WD    +NIN EMNYW +   NLSE  +PLF+ +  LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+  Y  +GWV HH TDIW +++    K  +  WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469

Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA-CVSYSSTMDMAII 547
            AYP L+G A F LD+L E  + G++ T PS SPEH     D K A  +    TMD  II
Sbjct: 470 -AYPALKGAADFYLDYLTEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVAGCTMDNQII 528

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
            +V S  + A+ +L+ +  A  +  L+S L RL P +I +   + EW++
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLE 575


>gi|251798253|ref|YP_003012984.1| alpha-L-fucosidase [Paenibacillus sp. JDR-2]
 gi|247545879|gb|ACT02898.1| Alpha-L-fucosidase [Paenibacillus sp. JDR-2]
          Length = 767

 Score =  351 bits (901), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 211/586 (36%), Positives = 319/586 (54%), Gaps = 44/586 (7%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           + ++ PAK + +A+PIGNGRLGAM++G   +E ++LNED+LW G P D  NPDA   L++
Sbjct: 12  LLYHSPAKQWEEALPIGNGRLGAMIFGDPRAERVQLNEDSLWYGGPRDRHNPDALPNLAE 71

Query: 75  VRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAE-ETYRRELDL 130
           +R L+  G+  EA   AS+ L   P     Y  LGD+ L F+ +    AE   Y R LDL
Sbjct: 72  IRKLIFEGKLQEAERLASLALTAIPESQRHYVPLGDLFLRFEHA----AEIRNYERRLDL 127

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           + A   V Y+ G  +F RE F+S PD+ IV +++    G +SF   +    +   YV+  
Sbjct: 128 SEAIVHVSYTAGETKFAREIFASYPDRAIVLRLTADSPGQISFTARMGR--ERFRYVD-- 183

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
                E R    RI    N+     G+++  +L      + G++  +  + L V  +D  
Sbjct: 184 -----EIRAEEGRIVMCGNSGG---GVRYCGVL--ACVPEGGSMRTI-GEHLVVSNADAV 232

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           +L++ AS+ F          + DP + ++     +   +YS+L   H+ DY+ L+ R  +
Sbjct: 233 LLVVTASTDF---------READPEAAALGDAGRVAAAAYSELKASHISDYRSLYDRTRL 283

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
            +    +  +    SE       ++ER+ + +   EDP L  L F +GRYLLI+SSRPG+
Sbjct: 284 WIG--AESGLKPEISE-------TSERLVNVKAGREDPGLTALYFHYGRYLLIASSRPGS 334

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
             ANLQGIWN+D+ P WDS   +NIN +MNYW +  C L EC  PLF+ +  +  NG  T
Sbjct: 335 LPANLQGIWNKDMLPAWDSKFTININTQMNYWPAESCYLPECHLPLFELIERMIPNGRHT 394

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+  Y   G   HH TDIWA ++          WP+G AWL  HLWEHY Y  D  FLE 
Sbjct: 395 ARSMYGCRGSAAHHNTDIWADTAPQDLWPSSTYWPLGLAWLSLHLWEHYRYGGDTAFLE- 453

Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
           R YP+++  A FLLD+L+E   G   T+PS SPE+ +  P+G+   + Y  +MD  I RE
Sbjct: 454 RVYPMMKEAAVFLLDYLVELPSGEWVTSPSVSPENTYRLPNGETGVLCYGPSMDSQIARE 513

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +F A  +A E +  N D L+ ++ +++ +L P +I   G ++EW +
Sbjct: 514 LFQACAAAGERIGSN-DELLGELRQAIDKLPPPRIGRYGQLLEWYE 558


>gi|408371030|ref|ZP_11168802.1| alpha-L-fucosidase [Galbibacter sp. ck-I2-15]
 gi|407743587|gb|EKF55162.1| alpha-L-fucosidase [Galbibacter sp. ck-I2-15]
          Length = 821

 Score =  351 bits (901), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 221/601 (36%), Positives = 323/601 (53%), Gaps = 60/601 (9%)

Query: 10  TNPLKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
           T+PLK+ ++ P+   + +A+P+GNG +GAMV+G V  E  +LNE T+W+G P    NP A
Sbjct: 21  TDPLKLWYDEPSGDVWENALPLGNGNIGAMVYGNVSKEIFQLNESTVWSGSPNRNDNPAA 80

Query: 69  PKALSDVRSLVDSGQYAEAT-AASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEETYR 125
            +AL  +R L+   QY  A   A+ K+    +   ++Q +G++EL F+  H  +    Y 
Sbjct: 81  LEALPKIRQLIFDKQYKAAEDLANEKIITKKSHGQMFQPVGNLELTFE-GHQDF--HNYS 137

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           REL++  A ++  Y+V  V +TRE F+S  D+V+V KIS  + G +SF     +      
Sbjct: 138 RELNIGNAVSKTTYTVDGVTYTREAFTSLTDKVLVIKISADQPGKISFKADFTTPHKKQK 197

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGI----QFSAILEIK-----ISDDRGTISA 236
               +N + + G               D +G+    +F A+L IK     I+  R TI  
Sbjct: 198 IAIMDNNLSLWG------------VTSDHEGVLGKVEFQALLRIKTLNGDITQGRNTI-- 243

Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
                 +V  +D A L +  +S+F     N  D   D T  + + L      +Y +L   
Sbjct: 244 ------EVTNADSATLYISIASNFK----NYDDLSADETLRAKNDLDKAFIENYENLKDA 293

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           H+  YQ  F+RVS+QL          T    N    P+ ER+++F+ ++DPS V L FQ+
Sbjct: 294 HIKAYQNYFNRVSLQLG---------TIEASN---QPTDERLENFRKNQDPSFVSLYFQY 341

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLISSS+PG Q ANLQGIWN+ L+P WDS   +NIN +MNYW +   NLSE  EP  
Sbjct: 342 GRYLLISSSKPGGQAANLQGIWNKSLTPPWDSKYTININAQMNYWPAEKTNLSELHEPFL 401

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
           + +  LS  G KTA   Y A GW+ HH TDIW  + A  G   W +W  GGAWL  H+WE
Sbjct: 402 NMVQELSQTGKKTANDMYGARGWMAHHNTDIWRVTGAIDG-AFWGIWNGGGAWLSQHIWE 460

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLAC 535
           HY YT D +FL +  Y LL+G A F +D+L +  D  YL   P  SPE+      G    
Sbjct: 461 HYLYTGDTEFL-RENYDLLKGAALFYVDFLAQHPDHPYLVVAPGNSPENAAQGRQG--TS 517

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWV 594
           ++  STMD  ++ ++F+A+ISA+E L  N D      LK +  +L P +I +   + EW+
Sbjct: 518 ITAGSTMDNQLVEDIFNAVISASEAL--NTDTAFTDSLKVIKNKLPPMQIGKHNQLQEWL 575

Query: 595 Q 595
           +
Sbjct: 576 E 576


>gi|150005495|ref|YP_001300239.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
 gi|294778696|ref|ZP_06744115.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
 gi|149933919|gb|ABR40617.1| glycoside hydrolase family 95 [Bacteroides vulgatus ATCC 8482]
 gi|294447352|gb|EFG15933.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
          Length = 819

 Score =  351 bits (901), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 220/589 (37%), Positives = 324/589 (55%), Gaps = 42/589 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PA  + +A+P+GN R+GAMV+GG   E L+LN++T+W G P     P+A ++L
Sbjct: 23  LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDKPEALESL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+ +G+  EA     + F  G     YQ +G + +E    H K  +  Y R+LDL
Sbjct: 83  PQVRELIFAGKNMEAQNLIQENFYAGKHGMPYQTIGSLIIE-TPGHEKVTD--YYRDLDL 139

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A  +Y V  V F RE F+S PD+V+V +++    G L+F V   S L+ H      
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVVVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
            ++++ G+              D +G++    +E +   D  G    ++D+ + VEG+D 
Sbjct: 199 KKLVLTGK------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           +V L V+S +    FIN  D   + + ++   L       YS +   H+  Y++ F RV 
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L          T     ++TV   +R++ F   +D SL  LLFQ+GRYLLISSS+PG 
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN  L+  WD    +NIN EMNYW +   NLSE  +PLF+ +  LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+  Y  +GWV HH TDIW +++    K  +  WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469

Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMAII 547
            AYP L+G A F LD+L E  + G++ T PS SPEH     D K A    S  TMD  II
Sbjct: 470 -AYPALKGAADFYLDYLTEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVSGCTMDNQII 528

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
            +V S  + A+ +L+ +  A  +  L+S L RL P +I +   + EW++
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLE 575


>gi|237720803|ref|ZP_04551284.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
 gi|229449638|gb|EEO55429.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
          Length = 822

 Score =  351 bits (900), Expect = 7e-94,   Method: Compositional matrix adjust.
 Identities = 220/603 (36%), Positives = 338/603 (56%), Gaps = 55/603 (9%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E  S+    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+++W G P +  
Sbjct: 23  EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NPDA + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 83  NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y+   Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
           S           +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G 
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
             A  D  L VEG+D AV+ +  +++F+    N  D   + T  + + L       + + 
Sbjct: 242 KIACADGVLSVEGADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+D Y++   RVS+ L R            +    V + +RV++F+   D  LV   
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLF  +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDG 531
           LWE Y YT D +FL +  YP+L+    F  + +++   H+ +L   PS SPE+     +G
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKDPVHN-WLVVCPSNSPENVHSGSNG 522

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
           K A  +   TMD  +I ++++AIISA+++L+ +++     + + L  + P ++   G + 
Sbjct: 523 K-ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQHLKEMAPMQVGHWGQLQ 580

Query: 592 EWV 594
           EW+
Sbjct: 581 EWM 583


>gi|336402504|ref|ZP_08583239.1| hypothetical protein HMPREF0127_00552 [Bacteroides sp. 1_1_30]
 gi|335948353|gb|EGN10067.1| hypothetical protein HMPREF0127_00552 [Bacteroides sp. 1_1_30]
          Length = 822

 Score =  351 bits (900), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 220/603 (36%), Positives = 338/603 (56%), Gaps = 55/603 (9%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E  S+    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+++W G P +  
Sbjct: 23  EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NPDA + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 83  NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y+   Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
           S           +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G 
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
             A  D  L VEG+D AV+ +  +++F+    N  D   + T  + + L       + + 
Sbjct: 242 KIACADGVLSVEGADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+D Y++   RVS+ L R            +    V + +RV++F+   D  LV   
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLF  +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDG 531
           LWE Y YT D +FL +  YP+L+    F  + +++   H+ +L   PS SPE+     +G
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKDPVHN-WLVVCPSNSPENVHSGSNG 522

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
           K A  +   TMD  +I ++++AIISA+++L+ +++     + + L  + P ++   G + 
Sbjct: 523 K-ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQ 580

Query: 592 EWV 594
           EW+
Sbjct: 581 EWM 583


>gi|338213645|ref|YP_004657700.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
 gi|336307466|gb|AEI50568.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
          Length = 829

 Score =  351 bits (900), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 215/615 (34%), Positives = 323/615 (52%), Gaps = 70/615 (11%)

Query: 11  NPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           NP  ++ +N PAK + DA+P+GNGRLGAMV+G    E ++LNE+T W+G P         
Sbjct: 47  NPSTVSWYNAPAKKWEDALPVGNGRLGAMVFGRSGEERIQLNEETYWSGGPYSTVVKGGY 106

Query: 70  KALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
           K L +++ LV   +Y  A       L G+P +   YQ L ++ L F +     +   Y+R
Sbjct: 107 KVLPEIQKLVFEEKYLAAHNLFGRHLMGYPVEQQKYQSLANLHLFFQNQD---STTEYKR 163

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            L+L +    V Y    + + R+ F+S PDQVIV +++  +SGS+SF  +L  +  N ++
Sbjct: 164 WLNLESGITSVSYKSNGITYQRDVFASAPDQVIVIRLTADKSGSISFKANLRGV-RNQAH 222

Query: 187 VN-----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI-SDDRGTI 234
            N           G++ +I+ G+              D  G+      E +I +   G  
Sbjct: 223 SNYATDYFRMDPYGSDGLILTGKSA------------DYMGVAGKLKYEARIKAIPEGGR 270

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
              +   L +E ++   L   A+++F    +N  D + +P          I++ SY+ + 
Sbjct: 271 MKTDGVDLIIENANTVTLYFAAATNF----VNYKDVRANPHQRVEDYFARIKSKSYTSIL 326

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
              L DY+  F RVS+QL  +    +            P  ER++  Q+  DPSL  L +
Sbjct: 327 EAALADYKHFFDRVSLQLPTTENSFL------------PLPERIQKIQSSPDPSLSALSY 374

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
            FGRYL+I+SSRPGT+ ANLQGIWN++++P WDS    NIN +MNYW     NLSEC EP
Sbjct: 375 NFGRYLMIASSRPGTEPANLQGIWNDNMNPDWDSKYTTNINTQMNYWPVESSNLSECAEP 434

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
           L  F+  L+  G++ A+ +Y A GWV H  TD+W + +A      W  + +GGAWLCTHL
Sbjct: 435 LVRFIKELTDQGTQVAREHYGAKGWVFHQNTDLW-RVAAPMDGPTWGTFTVGGAWLCTHL 493

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDG-- 531
           WEHY YTMD  FL K  YPL++G   F +D+L    +G +L TNPSTSPE+    PDG  
Sbjct: 494 WEHYQYTMDAAFL-KETYPLMKGSVQFFMDFLKPHPNGKWLVTNPSTSPEN---FPDGGG 549

Query: 532 -------------KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
                        +   +   S++DM I+ ++F   I A+ +L  N  A V++V  +  +
Sbjct: 550 NKPYFDEVTAGFREGTTICAGSSIDMQILFDLFGYFIEASAILGDN-SAFVQQVKVAREK 608

Query: 579 LRPTKIAEDGSIMEW 593
           L P +I  DGS+ EW
Sbjct: 609 LVPPQIGRDGSLQEW 623


>gi|288925248|ref|ZP_06419183.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
           (Alpha-1,2-fucosidase 2) [Prevotella buccae D17]
 gi|288338013|gb|EFC76364.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
           (Alpha-1,2-fucosidase 2) [Prevotella buccae D17]
          Length = 787

 Score =  350 bits (899), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 209/604 (34%), Positives = 327/604 (54%), Gaps = 46/604 (7%)

Query: 2   MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           M A   S  +P K+ +  PA+ +TDA+P+GNGRLGAMV+G   +E ++LNE+T+W G P 
Sbjct: 16  MLASLFSQAHPYKLWYREPAQVWTDALPLGNGRLGAMVYGIPATERIQLNEETIWAGQPN 75

Query: 62  DYTNPDAPKALSDVRSLVDSGQYAEAT------AASVKLFGHPADVYQLLGDIELEFDDS 115
              N  A KA+  ++ L+  G+Y +A         S   +G P   YQ  G++ +     
Sbjct: 76  GNANAKALKAIPVIQDLIWKGEYKKAQDLATSDVMSATNWGMP---YQAFGNVYISMPGM 132

Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
                   Y REL L++A A  +++   V + RE  +S  D V+  + +  + G ++FN 
Sbjct: 133 G---NYTNYYRELSLDSARAITRWTADGVTYRREVITSLADNVVTVRFTADQRGCITFNA 189

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSA-ILEIKISDDRGT 233
              +  D+         II++       +    + ++  KG ++F   +  +      G 
Sbjct: 190 YFTTPHDD---------IIIKSEGDEATLFGVTSKHEGLKGKVRFMGRMAAVAKGKGEGA 240

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
           ++  +D  + V+G+D AVL +  +++F+    N  D   D    S   L++     Y+  
Sbjct: 241 VTHSKDGIVSVKGADEAVLYISIATNFN----NYKDISGDEAVRSEQILRAAMARDYAQS 296

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+  +++L HRV++ L             E+    +P+ ER+  F   +D  LV   
Sbjct: 297 KAEHISRFRQLMHRVTLNLG------------EDQYKDLPTDERIIRFADHDDNYLVATY 344

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P WDS    NINLEMNYW + P  L+E  E
Sbjct: 345 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPAWDSKYTTNINLEMNYWPAEPTQLTELNE 404

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCT 472
           PLF  +  +S  G++TA+  Y  SGWV+HH TDIW  +   D  +    +W  GGAWLC 
Sbjct: 405 PLFRLIREVSETGAETARTMYGKSGWVLHHNTDIWRVTGGIDHAQS--GMWMTGGAWLCR 462

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
           HLWEHY YTMD+DFL +R YP+++G A FL   LI E   G+L  +PS SPE+   + DG
Sbjct: 463 HLWEHYLYTMDKDFL-RRYYPVMKGAAEFLDQMLIPEPQHGWLVISPSVSPENSHPSKDG 521

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
           K+A ++  +TMD+ ++ E+F  +++A++VL ++  AL     + L  + P ++ + G + 
Sbjct: 522 KMA-IAAGTTMDVQLVNELFREVMAASKVLGEDA-ALAAHYAERLKLMPPMQVGKWGQLQ 579

Query: 592 EWVQ 595
           EW++
Sbjct: 580 EWME 583


>gi|399078665|ref|ZP_10752953.1| hypothetical protein PMI01_04050 [Caulobacter sp. AP07]
 gi|398033293|gb|EJL26598.1| hypothetical protein PMI01_04050 [Caulobacter sp. AP07]
          Length = 786

 Score =  350 bits (899), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 212/584 (36%), Positives = 316/584 (54%), Gaps = 44/584 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PAK + +A+PIGNGRLGAM++G V +E L+LNE+TLW+G P D  NP A + L  VR
Sbjct: 39  YDQPAKEWVEALPIGNGRLGAMIFGDVWAERLQLNENTLWSGGPYDPVNPRAREGLEPVR 98

Query: 77  SLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           +L+ +G++AEA   A+  L   P     YQ  GD+ L +  +  + A   YRR LD++ A
Sbjct: 99  ALIAAGRFAEAEQRANETLVATPPREMAYQPFGDLGLRW--AGARGAVSGYRRSLDIDNA 156

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   + +  V + R   +S  DQVI  +++ S  G+L F+++L       +      +I
Sbjct: 157 VAETTFEIDGVRYRRRAVASPVDQVIALELTASRPGALDFDLTL-------APAQTVREI 209

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
           ++E R    +I  + N  +       +     ++    G++    D ++ V G+  A + 
Sbjct: 210 VVE-RPDTLKISGRNNDGEGGVSGALTYCGRARVVTQGGSVKG-ADGQIAVRGASRATIY 267

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
           L  ++S+        D   DP + +   +      S+  L       ++ LF RVS+ L 
Sbjct: 268 LAMATSYR----RYDDVGGDPDAITRGQIDKAAAKSFDQLARAATAAHRALFDRVSLDLG 323

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
                       +++I   P+  R+   +T +DP LVEL FQ+ RYLLI+ SRPG Q AN
Sbjct: 324 -----------GKDDIG-APTDIRIARNETTDDPGLVELYFQYARYLLIACSRPGGQPAN 371

Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
           LQG+WN+ + P W S   +NIN +MNYW +    L+EC EPLFDF+  L+  G+ TA+  
Sbjct: 372 LQGLWNDQVKPPWGSNYTININTQMNYWPAEAGGLAECAEPLFDFIAELAERGAVTAREM 431

Query: 434 YLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
           Y A GWV HH +D+W  ++  D  K    LWP GGAWLC HLW+HY+Y  D+ FL  RAY
Sbjct: 432 YGARGWVAHHNSDLWRGTAPFDHAKA--GLWPTGGAWLCVHLWDHYDYGRDKRFL-ARAY 488

Query: 493 PLLEGCASFLLDWL-IEGHDGYLETNPSTSPE--HEFIAPDGKLACVSYSSTMDMAIIRE 549
           PL++G + F LD L  +   G+L T+PS SPE  H F    G   C     TMDM I+R+
Sbjct: 489 PLMKGASQFFLDTLQTDAATGWLVTSPSVSPENRHGF----GSTLCA--GPTMDMQILRD 542

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +F     A  +L  + D   E + ++  RL PT+I   G +MEW
Sbjct: 543 LFDHTREAGRILGLDPD-FGEDLARARDRLAPTRIGAGGQLMEW 585


>gi|423213429|ref|ZP_17199958.1| hypothetical protein HMPREF1074_01490 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693889|gb|EIY87119.1| hypothetical protein HMPREF1074_01490 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 822

 Score =  350 bits (897), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 219/602 (36%), Positives = 335/602 (55%), Gaps = 53/602 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E  S+    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+++W G P +  
Sbjct: 23  EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NPDA + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 83  NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y+   Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
           S           +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G 
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
             A  D  L VEG+D A + +  +++F+    N  D   + T  + + L       + + 
Sbjct: 242 KIACADGVLSVEGADEATVYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+D Y++   RVS+ L R            +    V + +RV++F+   D  LV   
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLF  +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
           LWE Y YT D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            A  +   TMD  +I ++++AIISA+++L+ +++     + + L  + P ++   G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581

Query: 593 WV 594
           W+
Sbjct: 582 WM 583


>gi|393781509|ref|ZP_10369704.1| hypothetical protein HMPREF1071_00572 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676572|gb|EIY70004.1| hypothetical protein HMPREF1071_00572 [Bacteroides salyersiae
           CL02T12C01]
          Length = 827

 Score =  349 bits (896), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 220/596 (36%), Positives = 331/596 (55%), Gaps = 60/596 (10%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G    E L+LNE+TLW G P +  NP+  K + 
Sbjct: 38  KLWYDRPAQVWTEALPLGNGRLGAMVFGNPAVEQLQLNEETLWAGRPNNNANPEGLKYIP 97

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +Y +  Y RE
Sbjct: 98  KVRELVFAGKYLEAQTLATEKVMSKTNSGMP---YQSFGDLRISFP-GHTRYRD--YYRE 151

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-----DSLLD 182
           L+L++A  +V Y V +V + RE F+S  DQVI+ +++    G ++FN  L     D+L+D
Sbjct: 152 LNLDSACVKVGYRVDDVNYLREMFTSFTDQVIMVRLTADRPGKITFNAVLTTPHQDALVD 211

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKK 241
                        +G C    +   ++ ++  KG ++F   L  ++   +G   +  D  
Sbjct: 212 T------------DGEC--VTLSGVSSWHEGLKGKVEFQGRLATRV---QGGAVSCRDGV 254

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L VEG+D AV+ +  +++F    IN  D   D    +   L+     +Y++    H+D +
Sbjct: 255 LTVEGADEAVVYVSLATNF----INYKDISADQVERARQYLEKAMQKNYTEAKQSHVDFF 310

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           +    RVS+ L          T S E +   P+ +RV+ F+T  D  LV   FQFGRYLL
Sbjct: 311 KAYMDRVSLNLG---------TGSTEQL---PTDKRVEKFKTTHDAGLVATYFQFGRYLL 358

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I SS+PG Q ANLQGIWN+ L P+WDS    NIN+EMNYW +   NLSE  EPLF     
Sbjct: 359 ICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLFRMTRE 418

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           +S  G +TA++ Y A GWV+HH TDIW + +    K    +WP GGAWLC HLWE Y YT
Sbjct: 419 VSETGKETAEIMYGAKGWVLHHNTDIW-RITGPLDKAPSGMWPSGGAWLCRHLWERYLYT 477

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
            D +FL + AYP+++    F  + ++ E    +L   PS SPE+      GK A  +   
Sbjct: 478 GDVEFL-RSAYPIMKEAGRFFDETMVKEPLHNWLVVCPSNSPENTHAGSGGK-ATTAAGC 535

Query: 541 TMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           TMD  ++ +++++II+ A +L  + +  + +E+ LK +P   P +I   G + EW+
Sbjct: 536 TMDNQLVFDLWTSIIATARLLGVDTEYASHLEERLKEMP---PMQIGRWGQLQEWM 588


>gi|374311601|ref|YP_005058031.1| alpha-L-fucosidase [Granulicella mallensis MP5ACTX8]
 gi|358753611|gb|AEU37001.1| Alpha-L-fucosidase [Granulicella mallensis MP5ACTX8]
          Length = 790

 Score =  349 bits (896), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 210/607 (34%), Positives = 334/607 (55%), Gaps = 47/607 (7%)

Query: 1   MMNAESTST-TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
           +M+AE  S+ ++  ++ ++ PA  + +A+PIGNGR+G M++GG   E+  L E T W+G 
Sbjct: 14  LMHAEGQSSPSHKTELWYSRPATRWMEAVPIGNGRIGGMIYGGTSIESFALTESTTWSGA 73

Query: 60  PGDY-TNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEF-DD 114
           P D    P A   L  +R L+ +G+YAE        L G+P     +  +  +EL F +D
Sbjct: 74  PNDKNVKPTALANLGKIRELMFAGKYAEGGELCKEHLLGNPGSFGTHLPMATLELAFPED 133

Query: 115 SHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
            H     + YRR L+L+   A V YS G + F RE F+SNPD  ++  IS ++  S+S +
Sbjct: 134 EH----PQNYRRSLNLDEGIAYVDYSRGGLSFHREVFASNPDNALIAHISCNQPKSVSCS 189

Query: 175 VSLDSL-LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
           +S   L L       GN+ ++++G         +   ++  +G+ F     +++S   G 
Sbjct: 190 ISFPKLTLPGEVTTEGNDTLVLKGNAF------EHLHSNGKQGVAFET--RVRVSAKGGE 241

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
           ++A E   L ++G+D   L +V +++F G          + ++ ++  LQ +R  +++ L
Sbjct: 242 VTAHEGA-LHLKGADAVTLHVVIATNFRG---------ANASTRNVQTLQVLRPKTFAQL 291

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVEL 352
              H+ D+Q LF RV+I       D+ T++ +E      P+ ER K+ +   +DP L  L
Sbjct: 292 RAAHVADHQSLFRRVAI-------DLGTNSSAESK----PTDERRKAVEAGADDPGLASL 340

Query: 353 LFQFGRYLLISSSRPGTQVA-NLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLS 409
            FQ+GRYL I+ SR  + +   LQGIWN+ L+ +  W    H++IN E NYW +  CNLS
Sbjct: 341 FFQYGRYLTIAGSRVNSPLPLALQGIWNDGLASSMGWTDDFHLDINTEQNYWAAEVCNLS 400

Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
           ECQ PLFDF+  LSI G  TA+  Y A GWV H  T+ W  ++A  G + W ++  GG W
Sbjct: 401 ECQSPLFDFVEGLSIAGRSTARDMYGAPGWVAHVVTNPWGFTAAGWG-LGWGIFSTGGVW 459

Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIA 528
           L   LWEHY +T D+ FL++R YP+ +G A F L ++++    G+L T PS SPE+ FIA
Sbjct: 460 LALQLWEHYRFTGDKQFLQQRLYPVYKGAAEFFLAYMVKHPQHGWLVTGPSVSPENWFIA 519

Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
           PDGK    S   T+D   +  + S  I A+  L  +E+    K  ++L +L P +I + G
Sbjct: 520 PDGKQCSESMGPTVDRVFVHSLLSGCIEASTTLGIDEE-FRAKATEALKQLPPFQIGKHG 578

Query: 589 SIMEWVQ 595
            + EW++
Sbjct: 579 QLQEWLE 585


>gi|254445766|ref|ZP_05059242.1| hypothetical protein VDG1235_4013 [Verrucomicrobiae bacterium
           DG1235]
 gi|198260074|gb|EDY84382.1| hypothetical protein VDG1235_4013 [Verrucomicrobiae bacterium
           DG1235]
          Length = 784

 Score =  349 bits (896), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 223/590 (37%), Positives = 312/590 (52%), Gaps = 39/590 (6%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
           S+ + LK    G  +  ++ +PIGNG LGA+V G    E + LN DTLW G P D + P+
Sbjct: 24  SSASILKYDEPGQFEPLSEGLPIGNGSLGALVMGRTAEERIVLNHDTLWAGGPYDPSYPE 83

Query: 68  APKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
           A + L ++RSL+   ++ EA A         P     YQ + D+ L     H +   + Y
Sbjct: 84  AAEVLPEIRSLIFQDKHREAQALVQSSFMSKPMRQMSYQAMADLLL-LVPGHERV--DDY 140

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
            R LDL+ A A V Y V  V +TREH +S  D V+  +I   + GS+   + LDSL    
Sbjct: 141 ERSLDLDKAIATVSYEVDGVRYTREHIASAVDGVVAIRIRADKPGSVDLTLQLDSL---- 196

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
                + Q   E    G RI  +  A++   G      +E+ +  D G  S   D  LKV
Sbjct: 197 -----HEQTRSEYWPEGMRISGRNGASEGIAG-ALDWSVEVAVQLD-GGWSMPGDGYLKV 249

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
             +D   LL+ A +S+    +N +D   +P  ++   + +     +S+L  RHL+D+Q L
Sbjct: 250 READSVTLLVAADTSY----VNWNDVSGNPRQKNAKTIVAASEFDFSELNERHLEDFQSL 305

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           + RV ++L+ S  ++      E N D      R+ SF  D+DP + EL F F RYL+IS 
Sbjct: 306 YGRVDLELNTSRPEL-----GERNTDA-----RIASFSKDQDPKMAELYFNFARYLIISC 355

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           SRPG+Q ANLQG+WN+ L   W S   +NIN EMNYW +    L EC EPL   L  LSI
Sbjct: 356 SRPGSQSANLQGLWNDKLFAPWGSKYTININTEMNYWPTQVVQLGECMEPLAAMLQDLSI 415

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           +G +TA+  Y ASGWV HH TD+W  +    G   W +WPMGGAWL   LWE Y +T D 
Sbjct: 416 SGQRTAKNFYGASGWVTHHNTDLWRATGPIDG-AFWGMWPMGGAWLSLFLWERYEFTGDV 474

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
           D LE   Y +L+G A F LD L+E    GYL T PS SPE+   A     A      TMD
Sbjct: 475 DQLETD-YAILKGSAQFFLDTLVEDPRTGYLVTAPSNSPENAHHAGVSNAA----GPTMD 529

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            AI+R++F+A   A+ +L   + A  E VL++  +L P K+ + G + EW
Sbjct: 530 NAILRDLFAATAEASRIL-GVDSAFRESVLQTSNQLPPFKVGKAGQLQEW 578


>gi|294808085|ref|ZP_06766858.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
 gi|294444726|gb|EFG13420.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
          Length = 698

 Score =  349 bits (895), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 219/602 (36%), Positives = 335/602 (55%), Gaps = 53/602 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E  S+    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+++W G P +  
Sbjct: 23  EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NPDA + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 83  NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y+   Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
           S           +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G 
Sbjct: 197 S----------PHQDVMIASEEGNCV--TLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
             A  D  L VE +D AV+ +  +++F+    N  D   + T  + + L       + + 
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+D Y++   RVS+ L R            +    V + +RV++F+   D  LV   
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLF  +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
           LWE Y YT D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            A  +   TMD  +I ++++AIISA+++L+ +++     + + L  + P ++   G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581

Query: 593 WV 594
           W+
Sbjct: 582 WM 583


>gi|443289925|ref|ZP_21029019.1| Extracellular cellulase [Micromonospora lupini str. Lupac 08]
 gi|385886837|emb|CCH17093.1| Extracellular cellulase [Micromonospora lupini str. Lupac 08]
          Length = 947

 Score =  348 bits (894), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 220/574 (38%), Positives = 309/574 (53%), Gaps = 45/574 (7%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D  N      L+++R  V + Q+ +
Sbjct: 61  ALPIGNGRLGAMVFGNVDTERLQLNEDTIWAGGPYDSANTRGAANLAEIRRRVFADQWTQ 120

Query: 87  AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
           A    +  + G+P     YQ +G++ L F  +        Y R LDL TAT    Y +  
Sbjct: 121 AQDLINQTMMGNPGGQLAYQPVGNLRLAFGSAS---GASQYNRTLDLTTATVTTTYVLNG 177

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
           V + RE F+S PDQVIV +++   +GS++FN + DS             I ++G      
Sbjct: 178 VRYQRESFASAPDQVIVIRLTADRAGSITFNATFDSPQRTTVSSPDAATIGVDG------ 231

Query: 204 IPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
               + A +   G ++F A+     +   GT+S+     L+V G+    +L+   SS+  
Sbjct: 232 ---ISGAMEGVNGSVRFLALAHAVATG--GTVSS-SGGTLRVSGATSVTVLISIGSSY-- 283

Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
             +N      D    + + L + R +++  L +RHL DYQ LF+RV+I L R        
Sbjct: 284 --VNFRTVNGDYQGIARTRLNAARGVAFDQLRSRHLADYQALFNRVTIDLGR-------- 333

Query: 323 TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDL 382
           T + +     P+  R+    +  DP    LLFQFGRYLLISSSRPGTQ ANLQGIWN+ +
Sbjct: 334 TAAADQ----PTDVRIAQHASTNDPQFSALLFQFGRYLLISSSRPGTQPANLQGIWNDSM 389

Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIH 442
           +P WDS   +N NL MNYW +   NL EC  P+FD +  L++ G++ AQ  Y A GWV H
Sbjct: 390 TPPWDSKYTINANLPMNYWPADTTNLPECFLPVFDMIKDLTVTGARVAQAQYGAGGWVTH 449

Query: 443 HKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
           H TD W  +S   G  +W +W  GGAWL T +WEHY +T D  FL    YP L+G A F 
Sbjct: 450 HNTDGWRGASVVDG-ALWGMWQTGGAWLSTLIWEHYLFTGDVGFLSAN-YPALKGAAQFF 507

Query: 503 LDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV 560
           LD L+  H   GYL TNPS SPE     P    A V    TMD  I+R++F A+  A EV
Sbjct: 508 LDTLVA-HPTLGYLVTNPSNSPE----LPHHSNASVCAGPTMDNQILRDLFDAVAQAGEV 562

Query: 561 LEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           L  +      +V  +  RL P+++   G++ EW+
Sbjct: 563 LGVDA-TFRSQVRTARDRLAPSRVGSRGNVQEWL 595


>gi|294648173|ref|ZP_06725715.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
 gi|292636492|gb|EFF54968.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
          Length = 822

 Score =  348 bits (894), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 219/602 (36%), Positives = 335/602 (55%), Gaps = 53/602 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E  S+    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+++W G P +  
Sbjct: 23  EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NPDA + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 83  NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y+   Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
           S           +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G 
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
             A  D  L VE +D AV+ +  +++F+    N  D   + T  + + L       + + 
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+D Y++   RVS+ L R            +    V + +RV++F+   D  LV   
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLF  +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
           LWE Y YT D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            A  +   TMD  +I ++++AIISA+++L+ +++     + + L  + P ++   G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581

Query: 593 WV 594
           W+
Sbjct: 582 WM 583


>gi|302873491|ref|YP_003842124.1| alpha-L-fucosidase [Clostridium cellulovorans 743B]
 gi|307688330|ref|ZP_07630776.1| Alpha-L-fucosidase [Clostridium cellulovorans 743B]
 gi|302576348|gb|ADL50360.1| Alpha-L-fucosidase [Clostridium cellulovorans 743B]
          Length = 769

 Score =  348 bits (894), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 207/591 (35%), Positives = 317/591 (53%), Gaps = 45/591 (7%)

Query: 13  LKITFNGPAK--HFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
            K+ ++ PA+  ++  A+P+GNG+LGAMV+G V  E ++LNE++LW+G   D  NPDA  
Sbjct: 13  FKLWYDEPAEVWNWDQALPVGNGKLGAMVFGHVHKEQIQLNEESLWSGGYLDRNNPDALA 72

Query: 71  ALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
            L  VR L+  G+  EA    ++ + G P     Y+ LGD+ ++F   H     + YRRE
Sbjct: 73  QLPKVRQLLFDGKLKEAERLCAIAMMGTPEHQRHYETLGDLFIDF--YHDSDEVKNYRRE 130

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN--VSLDSLLDNHS 185
           LD+N A   V+Y +  V F RE  SS  D  IV +I+  +  ++SF   V  +  +D  +
Sbjct: 131 LDINKAMVTVQYEIDGVNFKREILSSAVDDAIVIRITADKKEAISFRGFVGRELFMDTRT 190

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
            +N ++ + + G C G            P  I +S IL  K + + G +  +    + VE
Sbjct: 191 ALN-DSTVALRGGCGG------------PDSINYSIIL--KGTSEGGNLYTM-GGNIVVE 234

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            +D   L L + +S+            D  + ++S  +++   +Y  +   H+ +YQ  F
Sbjct: 235 NADAVTLYLTSKTSY---------LSNDFDAVAISTAEAVSKRTYESILQDHIAEYQSYF 285

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISS 364
            R+++QL    + +         +  +P+ ER++  +  + D  L+ L F FGRYLLIS 
Sbjct: 286 SRMTLQLGNKQEAL--------ELSKIPTDERLERVKEGKLDDGLISLYFHFGRYLLISC 337

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           SRPGT  ANLQGIWN+  +  W     +NIN EMNYW +  CNLS+C  PLFD +  +  
Sbjct: 338 SRPGTLPANLQGIWNKHHTSPWGCKFTININTEMNYWPAETCNLSDCHTPLFDLIEKMRE 397

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+V Y   G+V HH  D+W  ++     +   +WPMG AWLC HLWEHY +T D 
Sbjct: 398 PGRHTAKVMYDCGGFVAHHNVDLWGDTAPQDHWMPATVWPMGAAWLCLHLWEHYEFTCDL 457

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
            FL K+AY  L+  A F +D+LIE  +GYL T PS SPE+ +    G+   +    +MD 
Sbjct: 458 KFL-KKAYETLKESAEFFVDYLIEDRNGYLVTCPSVSPENTYRLESGETGSLCIGPSMDS 516

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            II  +FS+ I A+E+L  +++   E ++    RL    I + G IMEW +
Sbjct: 517 QIIYALFSSCIEASELLNTDKE-FAETLISLRERLPKPSIGKYGQIMEWAE 566


>gi|295084327|emb|CBK65850.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
          Length = 822

 Score =  348 bits (894), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 219/602 (36%), Positives = 335/602 (55%), Gaps = 53/602 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E  S+    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+++W G P +  
Sbjct: 23  EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NPDA + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 83  NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y+   Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
           S           +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G 
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
             A  D  L VE +D AV+ +  +++F+    N  D   + T  + + L       + + 
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+D Y++   RVS+ L R            +    V + +RV++F+   D  LV   
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLF  +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
           LWE Y YT D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            A  +   TMD  +I ++++AIISA+++L+ +++     + + L  + P ++   G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581

Query: 593 WV 594
           W+
Sbjct: 582 WM 583


>gi|298481330|ref|ZP_06999523.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
 gi|298272534|gb|EFI14102.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
          Length = 822

 Score =  348 bits (894), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 219/602 (36%), Positives = 335/602 (55%), Gaps = 53/602 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E  S+    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+++W G P +  
Sbjct: 23  EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NPDA + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 83  NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y+   Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
           S           +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G 
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
             A  D  L VE +D AV+ +  +++F+    N  D   + T  + + L       + + 
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+D Y++   RVS+ L R            +    V + +RV++F+   D  LV   
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLF  +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
           LWE Y YT D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            A  +   TMD  +I ++++AIISA+++L+ +++     + + L  + P ++   G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581

Query: 593 WV 594
           W+
Sbjct: 582 WM 583


>gi|302549607|ref|ZP_07301949.1| large secreted protein [Streptomyces viridochromogenes DSM 40736]
 gi|302467225|gb|EFL30318.1| large secreted protein [Streptomyces viridochromogenes DSM 40736]
          Length = 953

 Score =  348 bits (894), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 220/590 (37%), Positives = 312/590 (52%), Gaps = 44/590 (7%)

Query: 11  NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           N L + ++ PA   +  A+PIGNGRLGAMV+G   +E L+LNEDT+W G P D  NP   
Sbjct: 23  NDLALWYDKPAGADWLRALPIGNGRLGAMVFGNADTERLQLNEDTVWAGGPYDSANPRGA 82

Query: 70  KALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRR 126
             ++++R  V + Q+  A    +  + G PA    YQ +G++ L F  +        Y R
Sbjct: 83  ANIAEIRRRVFADQWGPAQDLINQTMLGSPAGQLAYQPVGNLLLSFGSA---TGVSQYNR 139

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            LDL TATA   Y +  V + RE F+S PDQVIV +++   + S++FN + DS       
Sbjct: 140 TLDLTTATAVTTYVLNGVRYQREVFASAPDQVIVVRLTADRANSIAFNATFDSPQRTTVS 199

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
                 I ++G                   ++F A+    ++   GT+S+     L+V G
Sbjct: 200 SPDGATIALDGVS--------GTMEGITGRVRFLALANAAVTG--GTVSS-SGGTLRVSG 248

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    +L+   SS+    ++      D    +   L + R++    L  RHL DYQ LF+
Sbjct: 249 ATSVTVLVAIGSSY----VDFRRVDGDYQGIARRHLNAARDIGIDQLRRRHLADYQALFN 304

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RVS+ L R+       T +++     P+  R+       DP    LLFQFGRYLLISSSR
Sbjct: 305 RVSVDLGRT-------TAADQ-----PTDVRIAQHAQANDPQFSALLFQFGRYLLISSSR 352

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PGTQ ANLQGIWN+ ++P+WDS   VN NL MNYW +   NLSEC  P+FD +  L++ G
Sbjct: 353 PGTQPANLQGIWNDQMAPSWDSKFTVNANLPMNYWPADTTNLSECFLPVFDMIDDLTVTG 412

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           ++ AQ  Y A GWV HH TD W  +S  D  +  W +W  GGAWL T +W+HY +T D D
Sbjct: 413 ARVAQAQYGAGGWVTHHNTDAWRGASVVDEAR--WGMWQTGGAWLATLIWDHYLFTGDID 470

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           FL    YP L+G A F LD L+     G+L TNPS SPE    A     A V    TMD 
Sbjct: 471 FLRSN-YPALKGAAQFFLDTLVAHPSLGHLVTNPSNSPELAHHAD----ATVCAGPTMDN 525

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            I+R++F ++  A E+L+ +     +       RL PTK+   G++ EW+
Sbjct: 526 QILRDLFHSVARAGEILDVDAAFRAQAKAAR-ERLAPTKVGSRGNVQEWL 574


>gi|238062935|ref|ZP_04607644.1| large secreted protein [Micromonospora sp. ATCC 39149]
 gi|237884746|gb|EEP73574.1| large secreted protein [Micromonospora sp. ATCC 39149]
          Length = 932

 Score =  348 bits (894), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 215/573 (37%), Positives = 308/573 (53%), Gaps = 43/573 (7%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D  N      L+++R  V + Q+ +
Sbjct: 42  ALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANLAEIRRRVFADQWTQ 101

Query: 87  AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
           A    +  + G+PA    YQ +G++ L F  +        Y R LDL TATA   Y +  
Sbjct: 102 AQDLINQTMVGNPAGQLAYQPVGNLRLAFGSAS---GASQYNRALDLTTATATTTYVLNG 158

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
           V + RE F+S PDQVIV +++   + S++FN + DS          +  I ++G      
Sbjct: 159 VRYQREVFASAPDQVIVIRLTADRANSITFNATFDSPQRTTVSSPDSATIGLDG------ 212

Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGP 263
               AN +     ++F A+    ++   GT+S+     L+V G+    +L+   +S+   
Sbjct: 213 --ISANMDGVTGQVRFLALANASVTG--GTVSS-SGGTLRVSGATSVTVLVSIGTSY--- 264

Query: 264 FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT 323
            +N      D    + + L + R   +  L  RHL DYQ LF+RV+I L R+        
Sbjct: 265 -VNYRTVNGDYQGIARTRLNAARTAGFDQLRARHLADYQALFNRVTIDLGRT-------A 316

Query: 324 CSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
            +++  D      R+       DP    LLFQFGRYLLISSSRPGTQ ANLQGIWN+ ++
Sbjct: 317 AADQTTDV-----RIAQHANTNDPQFSALLFQFGRYLLISSSRPGTQPANLQGIWNDQMA 371

Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
           P+WDS   +N NL MNYW +   NLSEC  P+FD +  L++ G++ AQ  Y A GWV HH
Sbjct: 372 PSWDSKYTINANLPMNYWPADTTNLSECFLPVFDMIKDLTVTGARVAQAQYGAGGWVTHH 431

Query: 444 KTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
            TD W  +S  D  +    +W  GGAWL T +W+HY +T D +FL    YP ++G A F 
Sbjct: 432 NTDAWRGASVVDYAQS--GMWQTGGAWLATMIWDHYLFTGDLEFLRAN-YPAMKGAAQFF 488

Query: 503 LDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL 561
           LD L+      YL TNPS SPE    +     A V    TMD  I+R++F+ +  A+EVL
Sbjct: 489 LDTLVAHPTLSYLVTNPSNSPELSHHSN----AFVCAGPTMDNQILRDLFNGVALASEVL 544

Query: 562 EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             +      +V  +  RL PTK+   G++ EW+
Sbjct: 545 GVDA-TFRTQVRTAKDRLPPTKVGSRGNVQEWL 576


>gi|325103216|ref|YP_004272870.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
 gi|324972064|gb|ADY51048.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
          Length = 822

 Score =  348 bits (893), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 205/594 (34%), Positives = 320/594 (53%), Gaps = 48/594 (8%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           NP+++ +N PA ++ +A+PIGNG L  MV+GGV  + ++LNE+T+W G PG+   P+   
Sbjct: 27  NPMELWYNQPAANWNEALPIGNGFLAGMVFGGVQKDRIQLNEETIWAGEPGNNIIPNVYP 86

Query: 71  ALSDVRSLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFDDSHLKYAEET 123
           A++++R L+  G+Y EA   S K F       G+    YQ  G++ L+F           
Sbjct: 87  AIAEIRKLLVEGKYKEAQDLSNKAFPRQAPKGGNYGMQYQTAGNLFLDFGHGGFI----N 142

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YRR LD+  ATA + Y    +++ RE+ +  P +VI  +++ S++ S+SF + +D+    
Sbjct: 143 YRRNLDIEKATASISYQANGIDYKREYIALIPKKVIAIRLTASKTKSISFTIDMDAPFKE 202

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
              +   ++++++           +++ D  KG ++F   +  K+  + GT+  ++D KL
Sbjct: 203 FQKIALTDRLLLKAV---------SSSVDGKKGRVKFETQVVPKL--EGGTLE-IKDNKL 250

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            V+ ++   L +   ++F+    N  D   +        L  +   SY  L   H+  YQ
Sbjct: 251 VVKEANAVTLFISIGTNFN----NYQDISANENIRVKQRLAEVTGQSYKKLKANHIKSYQ 306

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           + F+RV + L       VT    +      P+ +RV  F+   DP+LV L FQFGRYLLI
Sbjct: 307 QYFNRVKLDLG------VTSVMDK------PTNQRVIDFKEGNDPALVSLYFQFGRYLLI 354

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
            SS PG+Q ANLQG WNE LSP WDS   VNIN EMNYW +   NL E  +PLF  L  L
Sbjct: 355 CSSFPGSQPANLQGKWNEKLSPPWDSKYTVNINTEMNYWPAEVTNLPEMHQPLFKMLKEL 414

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           S  G ++A   Y A GW +HH TD+W  +    G   + +WPMGGAWL  H+W+HY Y  
Sbjct: 415 SETGKESAGQMYKARGWNLHHNTDLWRITGPVDGG-FYGMWPMGGAWLSQHIWQHYLYNG 473

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           D DFL +  Y +L+G A F +D L E     +L   PS SPE+ ++   G    V   +T
Sbjct: 474 DNDFL-REYYDVLKGAAMFYVDVLQEEPKHKWLVVAPSMSPENTYLPSVG----VGAGTT 528

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           MD  ++ +VF+  I  +E+L K + +  + V   + RL P ++ +   + EW+Q
Sbjct: 529 MDNQLVFDVFANFIRTSEIL-KQDQSFADTVRNMINRLPPMQVGQHAQLQEWLQ 581


>gi|262408009|ref|ZP_06084557.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
 gi|345511517|ref|ZP_08791057.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
 gi|229444055|gb|EEO49846.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
 gi|262354817|gb|EEZ03909.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
          Length = 822

 Score =  348 bits (892), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 219/603 (36%), Positives = 337/603 (55%), Gaps = 55/603 (9%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E  S+    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+++W G P +  
Sbjct: 23  EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NPDA + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 83  NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y+   Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
           S           +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G 
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
             A  D  L VE +D AV+ +  +++F+    N  D   + T  + + L       + + 
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+D Y++   RVS+ L R            +    V + +RV++F+   D  LV   
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLF  +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDG 531
           LWE Y YT D +FL +  YP+L+    F  + +++   H+ +L   PS SPE+     +G
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKDPVHN-WLVVCPSNSPENVHSGSNG 522

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
           K A  +   TMD  +I ++++AIISA+++L+ +++     + + L  + P ++   G + 
Sbjct: 523 K-ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQ 580

Query: 592 EWV 594
           EW+
Sbjct: 581 EWM 583


>gi|443292342|ref|ZP_21031436.1| Alpha-L-fucosidase [Micromonospora lupini str. Lupac 08]
 gi|385884621|emb|CCH19587.1| Alpha-L-fucosidase [Micromonospora lupini str. Lupac 08]
          Length = 1000

 Score =  348 bits (892), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 219/582 (37%), Positives = 308/582 (52%), Gaps = 44/582 (7%)

Query: 19  GPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSL 78
           G    +  A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D +N     AL+++R L
Sbjct: 53  GAGTDWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPHDPSNTRGAAALAEIRRL 112

Query: 79  VDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
           V++ Q+ +A    +  + G+P     YQ +G++ L F  +        + R LDL TAT 
Sbjct: 113 VNANQWTQAQDLINQTMMGNPGGQLAYQTVGNLRLAFGSAS---GASQHNRTLDLTTATT 169

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
              Y +  + + RE F+S PDQVI  +++   S S+SF  + DS             I +
Sbjct: 170 TTSYVLNGIRYQREVFASAPDQVIAMRLTADRSNSISFTATFDSPQRTTVSSPDGATIGL 229

Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
           +G           N       ++F   L +  +   G   +     L+V  +    +L+ 
Sbjct: 230 DG--------VSGNMEGVTGQVRF---LALANATVSGGTVSSSGGTLRVTNATSVTVLVS 278

Query: 256 ASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR- 314
             SS+    +N  +   D    +   L + R  SY  L +RH+ DYQ LF RV++ L R 
Sbjct: 279 IGSSY----VNYRNVGGDYGGIARQRLSAARASSYDQLRSRHVADYQALFGRVTLDLGRT 334

Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANL 374
           S  D  TD              R+    +  DP    LLFQFGRYLLISSSRPGTQ ANL
Sbjct: 335 SAADQTTDV-------------RIAQHNSVNDPQFSALLFQFGRYLLISSSRPGTQPANL 381

Query: 375 QGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY 434
           QGIWN+ L+P+WDS   +N NL MNYW +   NL+EC  P+FD +  L++ G++TAQV Y
Sbjct: 382 QGIWNDSLAPSWDSKYTINANLPMNYWPANTTNLAECHNPVFDLVRDLAVTGTRTAQVQY 441

Query: 435 -LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
             ASGWV HH TD W +++A      W +W  GGAWL T +W+HY +  D +FL    YP
Sbjct: 442 GAASGWVTHHNTDAW-RATAVVDGAFWGMWQTGGAWLSTLIWDHYLFNGDIEFLRTN-YP 499

Query: 494 LLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFS 552
            ++G A F L+ L+ E   GYL TNPS SPE    A     A V    TMD  I+R++F 
Sbjct: 500 AMKGAAQFFLNTLVTEPTLGYLVTNPSNSPELSHHAN----ASVCAGPTMDNQILRDLFD 555

Query: 553 AIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           A   A+E+L+  +     +V  +  RL P K+   G+IMEW+
Sbjct: 556 ACARASEILDV-DSTFRAQVRATRDRLPPMKVGSRGNIMEWL 596


>gi|402306106|ref|ZP_10825157.1| hypothetical protein HMPREF1146_1457 [Prevotella sp. MSX73]
 gi|400379873|gb|EJP32702.1| hypothetical protein HMPREF1146_1457 [Prevotella sp. MSX73]
          Length = 785

 Score =  348 bits (892), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 209/604 (34%), Positives = 326/604 (53%), Gaps = 46/604 (7%)

Query: 2   MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           M A   S  +P K+ +  PA+ +TDA+P+GNGRLGAMV+G   +E ++LNE+T+W G P 
Sbjct: 14  MLASLFSQAHPYKLWYREPAQVWTDALPLGNGRLGAMVYGIPATERIQLNEETIWAGQPN 73

Query: 62  DYTNPDAPKALSDVRSLVDSGQYAEAT------AASVKLFGHPADVYQLLGDIELEFDDS 115
              N  A KA+  ++ L+  G+Y +A         S   +G P   YQ  G++ +     
Sbjct: 74  GNANAKALKAIPVIQDLIWKGEYKKAQDLATSDVMSATNWGMP---YQAFGNVYISMPGM 130

Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
                   Y REL L++A A  +++   V + RE  +S  D V+  + +  + G ++FN 
Sbjct: 131 G---NYTNYYRELSLDSARAITRWTADGVTYRREVITSLADNVVTVRFTADQRGCITFNA 187

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSA-ILEIKISDDRGT 233
              +  D+         II++       +    + ++  KG ++F   +  +      G 
Sbjct: 188 YFTTPHDD---------IIIKSEGDEATLFGVTSKHEGLKGKVRFMGRMAAVAKGKGEGA 238

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
           ++  +D  + V+G+D AVL +  +++F+    N  D   D    S   L++     Y+  
Sbjct: 239 VTHSKDGIVSVKGADEAVLYISIATNFN----NYKDISGDEAVRSEQILRAAMARDYAQS 294

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+  +++L HRV++ L             E+    +P+ ER+  F   +D  LV   
Sbjct: 295 KAEHISRFRQLMHRVTLNLG------------EDQYKDLPTDERIIRFAAHDDNYLVATY 342

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P WDS    NINLEMNYW +    L+E  E
Sbjct: 343 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPAWDSKYTTNINLEMNYWPAELTQLTELNE 402

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCT 472
           PLF  +  +S  G++TA+  Y  SGWV+HH TDIW  +   D  +    +W  GGAWLC 
Sbjct: 403 PLFRLIREVSETGAETARTMYGKSGWVLHHNTDIWRVTGGIDHAQS--GMWMTGGAWLCR 460

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
           HLWEHY YTMD+DFL +R YP+++G A FL   LI E   G+L  +PS SPE+   + DG
Sbjct: 461 HLWEHYLYTMDKDFL-RRYYPVMKGAAEFLDQMLIPEPQHGWLVISPSVSPENSHPSKDG 519

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
           K+A +S  +TMD+ ++ E+F  +++A++VL ++  AL     + L  + P ++ + G + 
Sbjct: 520 KVA-ISAGTTMDVQLVNELFREVMAASKVLGEDA-ALAAHYAERLKLMPPMQVGKWGQLQ 577

Query: 592 EWVQ 595
           EW++
Sbjct: 578 EWME 581


>gi|383113013|ref|ZP_09933793.1| hypothetical protein BSGG_0144 [Bacteroides sp. D2]
 gi|382948895|gb|EFS29444.2| hypothetical protein BSGG_0144 [Bacteroides sp. D2]
          Length = 822

 Score =  347 bits (891), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 216/602 (35%), Positives = 335/602 (55%), Gaps = 53/602 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E+  +    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+T+W G P +  
Sbjct: 23  ETNVSAQEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NPDA + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F  SH +
Sbjct: 83  NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-SHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y+   Y REL L++A   V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 139 YS--NYYRELSLDSARVIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
           S           +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G 
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
             A  D  L VE +D A++ +  +++F+    N  D   +    + + L+      + + 
Sbjct: 242 EIACADGILSVEKADEAIVYVSIATNFN----NYQDITGNQIERAKNYLEKAMVHPFIES 297

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+D Y++   RVS+ L +            +    VP+ +RV++F+   D  LV   
Sbjct: 298 KKNHIDFYRQYLTRVSLDLGK------------DQYSNVPTDKRVENFKNTNDAHLVATY 345

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLF  +  +S  G +TA+V Y A+GWV+HH TDIW  + A   K    +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKVMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
           LWE Y YT D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK
Sbjct: 465 LWERYLYTGDIEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            A  +   TMD  ++ ++++ IISA+++L+ +++     + + L  + P ++   G + E
Sbjct: 524 -ATTAAGCTMDNQLVFDLWTTIISASQILDTDQE-FATHLAQRLKEMAPMQVGHWGQLQE 581

Query: 593 WV 594
           W+
Sbjct: 582 WM 583


>gi|315506426|ref|YP_004085313.1| cellulose-binding family II [Micromonospora sp. L5]
 gi|315413045|gb|ADU11162.1| cellulose-binding family II [Micromonospora sp. L5]
          Length = 936

 Score =  347 bits (891), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 220/590 (37%), Positives = 313/590 (53%), Gaps = 44/590 (7%)

Query: 11  NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           N L + ++ PA   +  A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D  N    
Sbjct: 44  NDLALWYDEPAGTDWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGA 103

Query: 70  KALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRR 126
             L+++R  V + Q+  A    +  + G P     YQ +GD+ L F  +        Y R
Sbjct: 104 ANLAEIRRRVFADQWTSAQDLINQTMMGSPGGQLAYQTVGDLRLAFGSAS---GATQYNR 160

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            LDL TAT    Y  G V + RE F+S PDQV+V +++   + +++F+ + DS       
Sbjct: 161 TLDLTTATITTTYVQGGVRYQREMFASAPDQVMVLRLTADRANAITFSAAFDSPQRTTVS 220

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
                 I ++G           +       ++F A+    ++   GT+S+     L+V G
Sbjct: 221 SPDGATIALDG--------VSGSMEGVTGSVRFLALANAAVTG--GTVSS-SGGTLRVSG 269

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    +L+   +S+    +N      D    + + L + ++++   L TRH  DYQ LF+
Sbjct: 270 ATSVTVLVSIGTSY----VNYRTVNGDYQGIARNRLNAAKSVAVDQLRTRHRADYQALFN 325

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV+I L R        T + +     P+  R+    +  DP    LLFQFGRYLLISSSR
Sbjct: 326 RVTIDLGR--------TAAADQ----PTDVRIAQHASTNDPQFAALLFQFGRYLLISSSR 373

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PGTQ ANLQGIWN+ L+P+WDS   VN NL MNYW +   NLSEC  P+FD +  L++ G
Sbjct: 374 PGTQPANLQGIWNDSLTPSWDSKYTVNANLPMNYWPADTTNLSECFLPVFDMVKDLTVTG 433

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
           ++ AQ  Y A GWV HH TD W  +S   G   W +W  GGAWL T +W+HY +T D  F
Sbjct: 434 ARVAQAQYGAGGWVTHHNTDAWRGASVVDG-AFWGMWQTGGAWLSTLIWDHYLFTGDSGF 492

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L+   YP L+G A F LD L+  H   GYL TNPS SPE    A     A V    TMD 
Sbjct: 493 LQAN-YPALKGAAQFFLDTLVA-HPTLGYLVTNPSNSPELAHHAN----ASVCAGPTMDN 546

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            I+R++F A   A+EVL   +     +V  +  RL P+++   G++ EW+
Sbjct: 547 QILRDLFDAAARASEVLGV-DTTFRSQVRTARDRLPPSRVGSRGNVQEWL 595


>gi|423223594|ref|ZP_17210063.1| hypothetical protein HMPREF1062_02249 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638219|gb|EIY32066.1| hypothetical protein HMPREF1062_02249 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 823

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 209/588 (35%), Positives = 316/588 (53%), Gaps = 49/588 (8%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA+++ +A+P+GNGRLGAMV+G   +E ++LNE+T+  G P    NP+    LS++R
Sbjct: 31  YDSPAEYWEEALPLGNGRLGAMVYGNPVNEEIQLNEETISAGAPYKNYNPETKNYLSEIR 90

Query: 77  SLVDSGQYAEA-TAASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            L+  G+Y EA T A  +L     FG P   YQ  G + L F D         +RRELDL
Sbjct: 91  QLIFEGKYPEAQTLAGERLLSKNGFGMP---YQTAGSLRLRFQDQE---GYTNFRRELDL 144

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A   Y+V  V++ RE F+S  DQ+++ +++ S+ G L+F  +L    D     +G 
Sbjct: 145 EKAVASTTYTVDGVDYKREVFTSFADQLVIIRLTASQPGKLTFTTALTCPQDVDVTTSGK 204

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           + + MEG   G      A        ++F   L++ +   +G  ++  D  L V  ++ A
Sbjct: 205 DAMTMEGVTKGNEFVEGA--------VRFRTDLKLNV---QGGKTSANDSTLIVTRANSA 253

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            + L  S++F    IN  D   DP   +   L++    +Y+     H+ +YQK ++RVS+
Sbjct: 254 TIYLAISTNF----INYKDISGDPVKRNKVYLKNAGK-NYTKALQAHISEYQKYYNRVSL 308

Query: 311 QLSRSPK-DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
            L R+ + D  TD              RVK F T  DP LV L FQFGRYLLISSS+PG 
Sbjct: 309 NLGRTAQADKPTDI-------------RVKEFATANDPHLVALYFQFGRYLLISSSQPGG 355

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN+ L+P W      NIN EMNYW +   NL E  EP    +  L  NG + 
Sbjct: 356 QPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLPEMHEPFLQMIKELYENGQEA 415

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+  Y   GW++HH TD+W  + A   K     WP   AWLC HLW+ Y Y+ D+DFL +
Sbjct: 416 AREMYGCRGWMLHHNTDLWRMNGA-VDKAYCGPWPTCNAWLCHHLWDRYLYSGDKDFLAQ 474

Query: 490 RAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMAII 547
            AYP+++  + F +D+L++  + GY+   PS SPE+    P  +     ++  TMD  ++
Sbjct: 475 -AYPIMKSASEFFVDFLVKDPNTGYMVVTPSNSPENS--PPQWRTKANLFAGITMDNQLV 531

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            ++F+    AA +LEK+E    + +L    +L P ++ + G + EW +
Sbjct: 532 FDLFTNTERAARLLEKDE-LFCDTILSLRKQLPPMQVGQYGQLQEWFE 578


>gi|56962910|ref|YP_174637.1| hypothetical protein ABC1138 [Bacillus clausii KSM-K16]
 gi|56909149|dbj|BAD63676.1| conserved hypothetical protein [Bacillus clausii KSM-K16]
          Length = 782

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 200/596 (33%), Positives = 320/596 (53%), Gaps = 42/596 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +++T   PA+ +T+A PIGNGR+GAMV+GGV  E + LN D+LW+G P           +
Sbjct: 1   MQLTEQQPAQTWTEAYPIGNGRIGAMVYGGVEHEKIALNVDSLWSGPPAKRKQAPVKGTV 60

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
           +D+R+ + +  +  A+  +  + G     Y  LGD+ + F      ++   Y R L L T
Sbjct: 61  ADMRAAIAARDFQAASRYAKDMQGPYTQSYLPLGDLHILF--PLCTHSSTRYERTLQLET 118

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           AT  V+  +    + R  F+S PD+ I+ ++       LSF+  L S L    + +  + 
Sbjct: 119 ATVTVEDGL----YKRSVFASKPDEAIILRLEAVAELPLSFSAWLTSPLRTIGWPD-QDH 173

Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
           + + G CP + + P    + +P           I+F++ +++  +D     +A+++ KL 
Sbjct: 174 VGLAGWCP-EYVAPNYVPSSEPIRYTSYETSSAIRFASAVQLLETDGN---AAVKNNKLV 229

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           VE + +A +L+   +SF       +   K+P +     L      +Y  L +RHL DYQ 
Sbjct: 230 VEDARYATVLVHMETSFASA---QAPQGKEPITLIRKRLSETVTSTYETLQSRHLQDYQS 286

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           LF R++  L+ + ++ ++            ++ER+  +  + D  LVELLFQ GRYLLI+
Sbjct: 287 LFQRMTFTLNETEREKLS------------TSERLAKYGAN-DGKLVELLFQMGRYLLIA 333

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSR GT+ ANLQGIWNE + P W S   +NIN +MNYW +    L EC +P   F+  LS
Sbjct: 334 SSREGTEAANLQGIWNEHIRPPWSSNYTLNINAQMNYWPAETAALPECHQPFLTFIEELS 393

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYN 479
             G   AQ  Y   GW  HH +DIW ++        G  VWA WPM   WL  HLWEHY 
Sbjct: 394 EQGKAVAQNYYQCRGWTAHHNSDIWRQAEPVGGFGGGDPVWAFWPMAAPWLTRHLWEHYL 453

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           ++ DR +L +RAYP+++G   F LDWL++   G + T+PSTSPEH F+   G+   VS  
Sbjct: 454 FSADRAYLTERAYPVMKGAILFCLDWLVQDESGAVYTSPSTSPEHRFLY-KGQPYPVSEG 512

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           + MD+A++ +VF   ++A E++  ++  L   V  +L +L+   ++ +G++ EW  
Sbjct: 513 AVMDLALLEDVFHLFLAANELVGGDQQ-LATDVKDALNQLKKPPLSAEGALQEWTH 567


>gi|224536536|ref|ZP_03677075.1| hypothetical protein BACCELL_01411 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521792|gb|EEF90897.1| hypothetical protein BACCELL_01411 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 811

 Score =  347 bits (889), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 209/588 (35%), Positives = 316/588 (53%), Gaps = 49/588 (8%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA+++ +A+P+GNGRLGAMV+G   +E ++LNE+T+  G P    NP+    LS++R
Sbjct: 19  YDSPAEYWEEALPLGNGRLGAMVYGNPVNEEIQLNEETISAGAPYKNYNPETKNYLSEIR 78

Query: 77  SLVDSGQYAEA-TAASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            L+  G+Y EA T A  +L     FG P   YQ  G + L F D         +RRELDL
Sbjct: 79  QLIFEGKYPEAQTLAGERLLSKNGFGMP---YQTAGSLRLRFQDQE---GYTNFRRELDL 132

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A   Y+V  V++ RE F+S  DQ+++ +++ S+ G L+F  +L    D     +G 
Sbjct: 133 EKAVASTTYTVDGVDYKREVFTSFADQLVIIRLTASQPGKLTFTTALTCPQDVDVTTSGK 192

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           + + MEG   G      A        ++F   L++ +   +G  ++  D  L V  ++ A
Sbjct: 193 DAMTMEGVTKGNEFVEGA--------VRFRTDLKLNV---QGGKTSANDSTLVVTRANSA 241

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            + L  S++F    IN  D   DP   +   L++    +Y+     H+ +YQK ++RVS+
Sbjct: 242 TIYLAISTNF----INYKDISGDPVKRNKVYLKNAGK-NYTKALQAHISEYQKYYNRVSL 296

Query: 311 QLSRSPK-DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
            L R+ + D  TD              RVK F T  DP LV L FQFGRYLLISSS+PG 
Sbjct: 297 DLGRTAQADKPTDI-------------RVKEFATANDPHLVALYFQFGRYLLISSSQPGG 343

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN+ L+P W      NIN EMNYW +   NL E  EP    +  L  NG + 
Sbjct: 344 QPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLPEMHEPFLQMIKELYENGQEA 403

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+  Y   GW++HH TD+W  + A   K     WP   AWLC HLW+ Y Y+ D+DFL +
Sbjct: 404 AREMYGCRGWMLHHNTDLWRMNGA-VDKAYCGPWPTCNAWLCHHLWDRYLYSGDKDFLAQ 462

Query: 490 RAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMAII 547
            AYP+++  + F +D+L++  + GY+   PS SPE+    P  +     ++  TMD  ++
Sbjct: 463 -AYPIMKSASEFFVDFLVKDPNTGYMVVTPSNSPENS--PPQWRTKANLFAGITMDNQLV 519

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            ++F+    AA +LEK+E    + +L    +L P ++ + G + EW +
Sbjct: 520 FDLFTNTERAARLLEKDE-LFCDTILSLRKQLPPMQVGQYGQLQEWFE 566


>gi|345513950|ref|ZP_08793465.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
 gi|423230895|ref|ZP_17217299.1| hypothetical protein HMPREF1063_03119 [Bacteroides dorei
           CL02T00C15]
 gi|423244606|ref|ZP_17225681.1| hypothetical protein HMPREF1064_01887 [Bacteroides dorei
           CL02T12C06]
 gi|229435764|gb|EEO45841.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
 gi|392630015|gb|EIY24017.1| hypothetical protein HMPREF1063_03119 [Bacteroides dorei
           CL02T00C15]
 gi|392641455|gb|EIY35231.1| hypothetical protein HMPREF1064_01887 [Bacteroides dorei
           CL02T12C06]
          Length = 824

 Score =  347 bits (889), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 215/589 (36%), Positives = 325/589 (55%), Gaps = 51/589 (8%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA+++ +A+P+GNGRLGAMV+G   +E ++LNE+T+  G P    NP+A  AL+ +R
Sbjct: 31  YDKPARYWEEALPLGNGRLGAMVYGNPVTEEIQLNEETVSAGSPYKNYNPEAKGALATIR 90

Query: 77  SLVDSGQYAEATA-ASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            L+ +G+Y EA A A  K+     FG P   YQ +G + L+F  SH  Y    +RRELDL
Sbjct: 91  QLIFAGRYPEAQALAGEKILSKNGFGMP---YQTVGSLRLDFP-SHENYT--NFRRELDL 144

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A   Y+V  +++ RE F+S  DQ+++ +++ S+ G L+F+ SL         V+G 
Sbjct: 145 EKAVATTAYTVNGIDYKREVFTSFVDQLVIVRLTASQPGKLTFSASLTCPQKVDVTVSGK 204

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           N +I+EG   G         +D  KG I F A L++   D +G  S   D  L V  ++ 
Sbjct: 205 NALILEGTTKG---------DDFTKGSICFRADLKL---DLQGGKSVAGDTLLSVTNANS 252

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A + +  +++F    +N  D   +P+  +  ++++    +Y+     H+  YQK ++RVS
Sbjct: 253 ATIYIAMATNF----VNYKDISGNPSGRNKVSMKNAGK-NYARALQAHISAYQKYYNRVS 307

Query: 310 IQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
           + L R S  D  TD              R+K F   +DP LV L FQFGRYLLISSS+PG
Sbjct: 308 LNLGRTSQADKPTDV-------------RIKEFAISDDPHLVALYFQFGRYLLISSSQPG 354

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
            Q ANLQGIWN+ L+P W      NIN EMNYW +   NL E  EP    +  L  NG +
Sbjct: 355 GQPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLREMHEPFLQMVKELYENGQE 414

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
            A+  Y   GWV+HH TD+W  + A DR       WP   AWLC HLW+ Y Y+ D+++L
Sbjct: 415 AAREMYGCRGWVLHHNTDLWRMNGAVDRAYC--GPWPTCNAWLCQHLWDRYLYSGDKEYL 472

Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
               YP+L+  + F +D+L+ + + GYL   PS SPE+      GK A +    TMD  +
Sbjct: 473 AS-VYPILKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPSIWKGK-ANLFAGITMDNQL 530

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           + ++FS   SAA++L  ++    + +L    +L P ++ + G + EW +
Sbjct: 531 VSDLFSNTRSAAQILNLDKQ-FCDTILSLKRQLPPMQVGQYGQLQEWFE 578


>gi|293369104|ref|ZP_06615699.1| putative lipoprotein [Bacteroides ovatus SD CMC 3f]
 gi|292635816|gb|EFF54313.1| putative lipoprotein [Bacteroides ovatus SD CMC 3f]
          Length = 822

 Score =  346 bits (888), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 219/602 (36%), Positives = 334/602 (55%), Gaps = 53/602 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E  S+    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+++W G P +  
Sbjct: 23  EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NPDA + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 83  NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y+   Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
           S           +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G 
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
             A  D  L VE +D AV+ +  +++F+    N  D   + T  + + L       + + 
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+D Y++   RVS+ L R            +    V + +RV++F+   D  LV   
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLF  +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
           LWE Y YT D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            A  +   TMD  +I ++++AIISA+++L+ + +     + + L  + P ++   G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDLE-FASHLTQRLKEMAPMQVGHWGQLQE 581

Query: 593 WV 594
           W+
Sbjct: 582 WM 583


>gi|284036403|ref|YP_003386333.1| alpha-L-fucosidase [Spirosoma linguale DSM 74]
 gi|283815696|gb|ADB37534.1| Alpha-L-fucosidase [Spirosoma linguale DSM 74]
          Length = 842

 Score =  346 bits (888), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 207/594 (34%), Positives = 322/594 (54%), Gaps = 49/594 (8%)

Query: 13  LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           LK+ +N PA K +T A+P+GNGRLGAMV+G    E +KLNE T+W+G P    NPDA  A
Sbjct: 37  LKLWYNQPAGKVWTSALPVGNGRLGAMVYGNPEQELIKLNEATVWSGGPNRNDNPDALAA 96

Query: 72  LSDVRSLVDSGQYAEA---TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
           L ++R L+ +G+ AEA    AA+++   +    YQ +G+++L F       +   Y REL
Sbjct: 97  LPEIRRLIFAGKQAEAQKLAAANIETKKNNGMKYQPVGNLQLSFTGHQ---SVTNYYREL 153

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           D+  A A   Y+V  V + R+  +S PDQVI  +++  + G LSF   L+S       V 
Sbjct: 154 DIEKAIATTMYTVDGVRYMRQVIASVPDQVIAVRLTADKPGKLSFTAFLNSPQKVQRSVE 213

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGS 247
              +++M G           + ++  KG + F+A + +     + T +   D  + + G+
Sbjct: 214 ETTKLVMTGTT---------SDHEGVKGQVNFNAHVRVVAEGGQTTKT---DTSVVISGA 261

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           +   L +  +++     ++      DP + + S L      S++ +   H+  YQ+ F R
Sbjct: 262 NATTLYVSMATNV----VDYKTLTADPKTRADSYLTPAAKRSFNAVLAAHVAAYQRYFKR 317

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           V++ L  S            +   +P+ ER++ F +  DP LV L FQFGRYLLIS+S+P
Sbjct: 318 VNLDLGTS------------DAAKLPTDERIRQFASGNDPQLVSLYFQFGRYLLISASQP 365

Query: 368 GT-----QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
                  QVA LQG+WN+ + P WDS   +NIN EMNYW +   NL+E  EPL   +  L
Sbjct: 366 SRNGVVGQVATLQGLWNDRMDPPWDSKYTININTEMNYWPAEVTNLTELHEPLVQMVKEL 425

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           S  G +TA+V Y ASGW+ HH TD+W + +     + +++WPMGGAWL  HLWE Y Y+ 
Sbjct: 426 SQTGQETARVMYGASGWLAHHNTDLW-RITGPVDPIYYSMWPMGGAWLSQHLWEKYQYSG 484

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLAC-VSYSS 540
           D+ +L K  YP ++G A F +D+L+E  +  YL   P  SPE+   AP  +    +    
Sbjct: 485 DKAYL-KSVYPAMKGAAQFFVDYLVEDPNHHYLVVCPGMSPEN---APSTRPGVSIDAGV 540

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           TMD  ++ ++F+  I AA+ L  + D  V+ V   L +L P ++ + G + EW+
Sbjct: 541 TMDNQLVFDIFTNTIRAAQALGTDAD-FVKIVASKLAQLPPMQVGKHGQLQEWI 593


>gi|443288639|ref|ZP_21027733.1| Secreted Ricin B-related lectin [Micromonospora lupini str. Lupac
           08]
 gi|385888040|emb|CCH15807.1| Secreted Ricin B-related lectin [Micromonospora lupini str. Lupac
           08]
          Length = 952

 Score =  346 bits (888), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 220/585 (37%), Positives = 307/585 (52%), Gaps = 67/585 (11%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D  N      L+++R  V + Q+ +
Sbjct: 61  ALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANLAEIRRRVFADQWTQ 120

Query: 87  AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
           A    +  + G P     YQ +GD+ L F  +        Y+R LDL TAT    Y +  
Sbjct: 121 AQDLINQTMLGSPVGQLAYQPVGDLRLAFGSAS---GASQYQRTLDLTTATTTTSYVLNG 177

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-----------DSLLDNHSYVNGNNQ 192
           V F RE F+S PDQVIV +++   + +++F  +            D+       V+G+  
Sbjct: 178 VRFQREMFASAPDQVIVIRLTADRANAITFTATFSSPQRTTVSSPDAATIGLDGVSGS-- 235

Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
             MEG     R    ANA+                        +     L+V G+    L
Sbjct: 236 --MEGITGQVRFLALANASVSGG------------------TVSSSGGTLRVSGATSVTL 275

Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
           L+   SS+    +N      D    +   L + R + +  L  RH+ DYQ LF+RVSI L
Sbjct: 276 LVSIGSSY----VNYRTVNGDYQGIARRHLDAARAIGFDQLRGRHVADYQALFNRVSIDL 331

Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVA 372
            R+       T +++  D      R+    +  DP    LLFQ+GRYLLISSSRPG+Q A
Sbjct: 332 GRT-------TAADQTTDV-----RIAQHASVNDPQFSALLFQYGRYLLISSSRPGSQPA 379

Query: 373 NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQV 432
           NLQGIWN+ ++P+WDS   +N NL MNYW +   NL+EC  P+FD +  L++ G++TAQV
Sbjct: 380 NLQGIWNDQMAPSWDSKFTINANLPMNYWPADTTNLAECYLPVFDMIKDLTVTGARTAQV 439

Query: 433 NYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
            Y A GWV HH TD W  SS    + +W +W  GGAWL T +W+HY +T D +FL    Y
Sbjct: 440 QYGAGGWVTHHNTDAWRGSSV-VDEALWGMWQTGGAWLATMIWDHYQFTGDIEFLRAN-Y 497

Query: 493 PLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
           P ++G A F LD L+  H   GYL TNPS SPE          A V    TMD  I+R++
Sbjct: 498 PAMKGAAQFFLDTLVS-HPTLGYLVTNPSNSPELRHHTN----ASVCAGPTMDNQILRDL 552

Query: 551 FSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWV 594
           F+ +  A+EVL  N DA    +VL +  RL PT++   G++ EW+
Sbjct: 553 FNGVARASEVL--NVDATYRAQVLTARDRLPPTRVGSRGNVQEWL 595


>gi|300725824|ref|ZP_07059290.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
 gi|299776871|gb|EFI73415.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
          Length = 802

 Score =  346 bits (887), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 215/603 (35%), Positives = 323/603 (53%), Gaps = 46/603 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-TNPDAPKA 71
           +++ ++ PA +F +++PIGNG+LG +V+G    +T+ LN+ TLWTG P D      A   
Sbjct: 23  MQLLYHEPAHYFEESLPIGNGKLGGLVYGNPKHDTIYLNDITLWTGKPVDLDEGKGASLW 82

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL-EFDDSHLKYAEETYRRELDL 130
           L ++R  + +  Y +A +  + L G  +  YQ LG ++L    D   +Y++  Y+R+LDL
Sbjct: 83  LPEIRKALFAENYRKADSLQLHLQGKNSAFYQPLGTLQLTSLTDE--RYSD--YQRQLDL 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN-- 188
           +++  ++ Y  G V + RE+F+ NPD ++  +ISG + GS+S ++S+ SLL      +  
Sbjct: 139 DSSLVKISYRQGGVLYQREYFADNPDNMLAIRISGDKKGSVSMDISIGSLLPVQVKASLT 198

Query: 189 -------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
                     Q+ M G   G             +   F  +L+ +     GT+  +  K 
Sbjct: 199 RSLQANTAQGQLTMLGHAQGV----------SSESTHFCTMLQARAQG--GTVQVIHGK- 245

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L+VE +D  ++ +V  +SF G   +P        ++    L  ++N SY +L +RH+ DY
Sbjct: 246 LRVEHADTLIIYIVNETSFAGADKHPVQDGAPYLAQVTDDLWHLQNYSYDELRSRHVADY 305

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYL 360
           QK ++RV ++L        T   + + +DT    +   K+ Q   D  L  L FQ+GRYL
Sbjct: 306 QKFYNRVKLRLG-------TVDHAPQTVDTWSLLKNYGKNHQAYLDRYLETLYFQYGRYL 358

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LIS SR     ANLQG+WN  L   W     VNINLE NYW +   NLSE +EP+ DF+ 
Sbjct: 359 LISCSRTSGVPANLQGLWNHYLEAPWRGNYTVNINLEENYWPAEVANLSEMEEPIHDFMA 418

Query: 421 YLSINGSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWE 476
            L+ NG  TA   Y +  GW   H +DIWAK++     R    W+ W MGGAWL + LWE
Sbjct: 419 SLAQNGHFTAHHFYGIDRGWCSSHNSDIWAKTAPVGEGRESPEWSNWNMGGAWLSSTLWE 478

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLA 534
           HY YT D DFL + AYP+L G + F+L WL++     G L T PSTSPE+E++   G   
Sbjct: 479 HYLYTQDLDFLRRTAYPILNGASQFVLRWLVDNPQKSGELITAPSTSPENEYVTDKGYHG 538

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL---EKNEDAL-VEKVLKSLPRLRPTKIAEDGSI 590
              Y  T D+AIIRE+    + A +VL   EK ED      V ++L RL P  + +DG +
Sbjct: 539 TTCYGGTADLAIIRELLLNTLHARQVLGLKEKKEDQKGYPTVSEALARLHPYTVGKDGDL 598

Query: 591 MEW 593
            EW
Sbjct: 599 NEW 601


>gi|408378982|ref|ZP_11176577.1| alpha-L-fucosidase [Agrobacterium albertimagni AOL15]
 gi|407747109|gb|EKF58630.1| alpha-L-fucosidase [Agrobacterium albertimagni AOL15]
          Length = 805

 Score =  346 bits (887), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 215/604 (35%), Positives = 316/604 (52%), Gaps = 60/604 (9%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  P ++F +A+P+GNG LGAM+ GG   + + LN+D  W G          P  L 
Sbjct: 27  RLWYTAPGRNFNEALPLGNGSLGAMIRGGTAEDLVCLNDDRFWAGRDAPAPVATGPLVLE 86

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
           +VR  + +G  A A A    KL       Y    D+ +++D      A E Y R+LDLNT
Sbjct: 87  EVRRRLFAGDVAGAEALVEQKLLTDFNQPYLTAADLVIQWDHD----AVERYTRQLDLNT 142

Query: 133 ATARVKY---SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           A A V Y    VG V   R  FSS PDQV V     ++       +SL S   + S ++ 
Sbjct: 143 AVAEVNYVASRVGGVR--RRAFSSFPDQVFVLDAGFADPSQARTVLSLSSKTRHVSRMSA 200

Query: 190 NNQIIM-------EGRCPGKRIPPKANA--NDDP--KGIQFSAILEIKISDDRGTISALE 238
            + I++       + R    RI    N     DP  + +  + +L   +S        + 
Sbjct: 201 RDLIVVADAPSMVDWRGIDDRIRDGENIFYEVDPPRRCLTVACVLAASVS--------VH 252

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
            + L V G D+ VL+  +  S  G  +           + ++ L++  +  +S L  RH+
Sbjct: 253 GEGLVV-GGDFTVLVATSVGSDVGLLLE----------DCLARLEAAESRGFSALLERHV 301

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFG 357
             ++ L+ R ++ L RSP            +  +P+ ER+ +      DP+L  LLF +G
Sbjct: 302 AAHRALYDRAALTL-RSPV----------GLSALPTDERLHRQASKMRDPALEALLFNYG 350

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYL+I+SSRPG++  NLQGIWN+ + P W S   +NINL+MNYW + PCNL+EC EPLFD
Sbjct: 351 RYLMIASSRPGSRAINLQGIWNDKVQPPWWSNYTININLQMNYWPAEPCNLAECHEPLFD 410

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG--------KVVWALWPMGGAW 469
           F+  LS+ G++TA V Y   GWV HH+ D   +++A            + + LW MGGAW
Sbjct: 411 FVKNLSLAGARTASVQYGMRGWVAHHQVDGRFQTTAIGALNGRAYDFPIRYGLWTMGGAW 470

Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
           LC H W+HY +  D  FL + A+P+L   A F LDW++E  DG L T PSTSPE+ ++ P
Sbjct: 471 LCQHFWQHYLFNGDTKFLRETAWPILRNAAEFYLDWVVELPDGSLTTAPSTSPENSYLLP 530

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
           DG    +S  +TMD+AI+RE FS I+ AA VL   +D +      +LPRL    IA DG 
Sbjct: 531 DGTRHALSIGATMDIAILREFFSTIVDAASVLGIPDDPIAISASAALPRLPGYGIAADGQ 590

Query: 590 IMEW 593
           ++EW
Sbjct: 591 LLEW 594


>gi|330996330|ref|ZP_08320214.1| hypothetical protein HMPREF9442_01297 [Paraprevotella xylaniphila
           YIT 11841]
 gi|329573380|gb|EGG54991.1| hypothetical protein HMPREF9442_01297 [Paraprevotella xylaniphila
           YIT 11841]
          Length = 809

 Score =  345 bits (886), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 216/590 (36%), Positives = 314/590 (53%), Gaps = 55/590 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PAK +T+A+P+GN +LGAMV+GG   E L+LNE+T W G P D  NP+A   L
Sbjct: 22  LKLWYGKPAKDWTEALPVGNSKLGAMVYGGTGREELQLNEETFWAGGPYDNNNPNALYVL 81

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR+L+  G+  EA       F    D   Y  +G + L+F   H K  +  + R+LD+
Sbjct: 82  PVVRNLIFQGKTREAQRLVDANFFTRKDGMSYLTMGSLFLDFP-GHDKATD--FYRDLDI 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             ATA  +Y V  V + R  F+S  D VIV ++   ++G+L+F V  D+ L +    +G+
Sbjct: 139 GNATATTRYKVDGVAYARTVFASFTDSVIVVRLQADKAGALAFTVGYDAPLKHEVSADGD 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
              ++   C GK          D +G++ +   E ++       +  + KKL+V G+  A
Sbjct: 199 ---MLSIACEGK----------DQEGVKAALCAECRVKVVSDGKTTADGKKLEVVGATKA 245

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            L L A++++    ++  D   D  + +   LQ    + Y     +H+  Y+ LF RV +
Sbjct: 246 TLYLSAATNY----VDYHDVSGDAAARADRCLQRAVQIPYKKALEKHVAYYRNLFGRVEL 301

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            L        T+  + E      +  R++ F    DPSL  LLFQ+GRYLLISSS+PG Q
Sbjct: 302 DLGE------TEAAARE------TPLRIRDFSQGGDPSLAALLFQYGRYLLISSSQPGGQ 349

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWN   +  WDS   +NIN EMNYW +   NLSE  +PLF  L  LS+ G+KTA
Sbjct: 350 PANLQGIWNRSTNAPWDSKYTININTEMNYWLAEVANLSEMHQPLFSMLEDLSVTGAKTA 409

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDFL 487
           +  Y   GWV HH TD+W  S    G V +A   +WP GGAWL  HLW+HY +T D+ FL
Sbjct: 410 RDMYNCGGWVAHHNTDLWRIS----GVVDFAAAGMWPSGGAWLAQHLWQHYLFTADKKFL 465

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
            K  YP+L+G A F LD+L E H  Y      PS SPEH           V+   TMD  
Sbjct: 466 -KAYYPVLKGTARFFLDFLTE-HPSYKWWVVAPSVSPEH---------GPVTAGCTMDNQ 514

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           I+ +     + A+E++  ++ A  + + + L RL P ++   G + EW+Q
Sbjct: 515 IVFDALYNTLQASEIV-GDDAAFRDSLAQMLDRLPPMQVGRHGQLQEWLQ 563


>gi|456392980|gb|EMF58323.1| hypothetical protein SBD_0995 [Streptomyces bottropensis ATCC
           25435]
          Length = 974

 Score =  345 bits (886), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 219/573 (38%), Positives = 307/573 (53%), Gaps = 43/573 (7%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D  N      ++++R  V + Q+  
Sbjct: 61  ALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANIAEIRRRVFADQWGP 120

Query: 87  AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
           A       + G PA    YQ +G++ L F  +        Y R LDL TATA   Y +  
Sbjct: 121 AQDLIDQTMLGSPAGQLAYQPVGNLLLSFGGA---TGVSQYNRTLDLTTATALTTYVLNG 177

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
           V + RE F+S PD+VIV +++   + SL+FN + DS             I ++G      
Sbjct: 178 VRYQREVFASAPDRVIVVRLTADRANSLTFNATFDSPQRTTVSSPDGATIALDGTS---- 233

Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGP 263
               A        ++F A+    ++   GT+S+     L+V G+    +L+   SS+   
Sbjct: 234 ----ATMEGIAGRVRFLALANAAVTG--GTVSS-SGGTLRVSGATSVTVLVSIGSSY--- 283

Query: 264 FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT 323
            +N  +   D    + S L + R++    L +RHL DYQ LF+RVS+ L R+       T
Sbjct: 284 -VNFRNVAGDYQGTARSRLNAARDVGIDALRSRHLADYQALFNRVSVDLGRT-------T 335

Query: 324 CSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
            +++     P+  R+       DP    LLFQFGRYLLISSSRPGTQ ANLQGIWN+ ++
Sbjct: 336 AADQ-----PTDVRIAQHAQVNDPQFSALLFQFGRYLLISSSRPGTQPANLQGIWNDQMA 390

Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
           P+WDS   +N NL MNYW +   NLSEC  P+FD +  L++ G++ AQ  Y A GWV HH
Sbjct: 391 PSWDSKFTINANLPMNYWPADTTNLSECFLPVFDMINDLTVTGARVAQAQYGAGGWVTHH 450

Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
            TD W  +S   G   W +W  GGAWL T +W+HY +T D DFL    YP L+G A F L
Sbjct: 451 NTDAWRGASVVDG-AQWGMWQTGGAWLATLIWDHYLFTGDIDFLRSN-YPALKGAAQFFL 508

Query: 504 DWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL 561
           D L+  H   GYL TNPS SPE     P    A V    TMD  I+R++F+++  A E+L
Sbjct: 509 DTLVA-HPTLGYLVTNPSNSPE----LPHHANATVCAGPTMDNQILRDLFNSVARAGELL 563

Query: 562 EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             +     + V     RL P ++   G++ EW+
Sbjct: 564 GVDAAFRAQAVAAR-DRLAPMRVGSRGNVQEWL 595


>gi|388259826|ref|ZP_10136995.1| hypothetical protein O59_004218 [Cellvibrio sp. BR]
 gi|387936552|gb|EIK43114.1| hypothetical protein O59_004218 [Cellvibrio sp. BR]
          Length = 836

 Score =  345 bits (886), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 210/603 (34%), Positives = 330/603 (54%), Gaps = 55/603 (9%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           S+ + +P  + +   A+H+ +A+P+GNGRLGAMV+GGV  + +++NE+T W G P +  N
Sbjct: 29  SSPSVSPHTLWYEQAAQHWEEALPLGNGRLGAMVYGGVTRDNIQINENTFWAGGPHNNVN 88

Query: 66  PDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEE 122
           P A ++L ++R L+ +G+Y  A A + K     G     YQ  G++ LEF  +H +++  
Sbjct: 89  PKALESLPEIRRLITAGEYLAAEALAEKTITSQGSNGMPYQTAGNLHLEFP-AHKQFSH- 146

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            Y R+LD+  A A  +Y VG+V +TRE FSS  DQV+V K+S S+ G LSF   L     
Sbjct: 147 -YYRDLDIGKAIATTRYQVGDVVYTREVFSSFVDQVVVVKLSASKPGQLSFTAHLSHPAT 205

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDK 240
                  N+ ++M+G             + D +GI+    L   + ++   G++S   + 
Sbjct: 206 MQFAQENNHTLLMQG------------MSKDHEGIKGQVKLATLVDVNTSGGSLSQ-NNN 252

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR---- 296
           ++ V  +D A++L+  +++F    +N  D   D  + + + L S +N    + YT     
Sbjct: 253 RIAVSNADSALILISMATNF----VNYKDISGDALARARNYLASAKNQFTHNQYTARKHV 308

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           H + Y++ F RV++QL +S         ++E     P+ +R++ F +  DP L  L FQF
Sbjct: 309 HSNFYKQYFDRVALQLGKS-------EFAQE-----PTDQRIRLFASRHDPELASLYFQF 356

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLIS S+PG Q  NLQGIWN  + P WDS   +NIN EMNYW S    L+E  EP  
Sbjct: 357 GRYLLISGSQPGGQPTNLQGIWNHRMDPPWDSKYTLNINAEMNYWPSEVTQLNELNEPFI 416

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
             +  L+  G +TA+  Y A GW+ HH TDIW  +   D+    W  WP   AWL  HLW
Sbjct: 417 QMVKELAQTGQQTAKEMYGARGWMAHHNTDIWRITGGIDK---TWGSWPTSNAWLSQHLW 473

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA 534
           E Y Y+ D+ +L    YP+++   +F  D+LIE  D  +L  +PS SPE+   AP     
Sbjct: 474 EKYLYSGDKTYLAD-VYPVMKSAVTFFEDFLIESPDKKWLIVSPSMSPEN---APTATGV 529

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            ++   TMD  ++ ++ S  I+AAE+L  +K +  + +K+L  LP   P +I +   + E
Sbjct: 530 KIAAGVTMDNQLLFDLLSNTIAAAEILGQDKTQIPVWKKILSRLP---PMQIGKHHQLQE 586

Query: 593 WVQ 595
           W++
Sbjct: 587 WLE 589


>gi|423241477|ref|ZP_17222590.1| hypothetical protein HMPREF1065_03213 [Bacteroides dorei
           CL03T12C01]
 gi|392641370|gb|EIY35147.1| hypothetical protein HMPREF1065_03213 [Bacteroides dorei
           CL03T12C01]
          Length = 824

 Score =  345 bits (885), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 215/589 (36%), Positives = 325/589 (55%), Gaps = 51/589 (8%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA+++ +A+P+GNGRLGAMV+G   +E ++LNE+T+  G P    NP+A  AL+ +R
Sbjct: 31  YDKPARYWEEALPLGNGRLGAMVYGNPVTEEIQLNEETVSAGSPYKNYNPEAKGALATIR 90

Query: 77  SLVDSGQYAEATA-ASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            L+ + +Y EA A A  K+     FG P   YQ +G + L+F  SH  Y    +RRELDL
Sbjct: 91  QLIFADRYPEAQALAGEKILSKNGFGMP---YQTVGSLRLDFP-SHENYT--NFRRELDL 144

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A   Y+V  V++ RE F+S  DQ+++ +++ S+ G L+F+ SL         V+G 
Sbjct: 145 EKAVATTAYTVNGVDYKREVFTSFVDQLVIVRLTASQPGKLTFSASLTCPQKVDVTVSGK 204

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           N +I+EG   G         +D  KG I+F A L++   D +G  S   D  L V  ++ 
Sbjct: 205 NALILEGTTKG---------DDFTKGSIRFRADLKL---DLQGGKSVAGDTLLSVTNANS 252

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A + +  +++F    +N  D   +P+  +  ++++    +Y+     H+  YQK ++RVS
Sbjct: 253 ATIYIAMATNF----VNYKDISGNPSGRNKVSMKNAGK-NYARALQAHISAYQKYYNRVS 307

Query: 310 IQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
           + L R S  D  TD              R+K F   +DP LV L FQFGRYLLISSS+PG
Sbjct: 308 LNLRRTSQADKPTDV-------------RIKEFAISDDPHLVALYFQFGRYLLISSSQPG 354

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
            Q ANLQGIWN+ L+P W      NIN EMNYW +   NL E  EP    +  L  NG +
Sbjct: 355 GQPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLREMHEPFLQMVKELYENGQE 414

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
            A+  Y   GWV+HH TD+W  + A DR       WP   AWLC HLW+ Y Y+ D+++L
Sbjct: 415 AAREMYGCRGWVLHHNTDLWRMNGAVDRAYC--GPWPTCNAWLCQHLWDRYLYSGDKEYL 472

Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
               YP+L+  + F +D+L+ + + GYL   PS SPE+      GK A +    TMD  +
Sbjct: 473 AS-VYPILKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPSIWKGK-ANLFAGITMDNQL 530

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           + ++FS   SAA++L  ++    + +L    +L P ++ + G + EW +
Sbjct: 531 VSDLFSNTRSAAQILNLDKQ-FCDTILSLKRQLPPMQVGQYGQLQEWFE 578


>gi|423289667|ref|ZP_17268517.1| hypothetical protein HMPREF1069_03560 [Bacteroides ovatus
           CL02T12C04]
 gi|423298161|ref|ZP_17276220.1| hypothetical protein HMPREF1070_04885 [Bacteroides ovatus
           CL03T12C18]
 gi|392663702|gb|EIY57249.1| hypothetical protein HMPREF1070_04885 [Bacteroides ovatus
           CL03T12C18]
 gi|392667378|gb|EIY60888.1| hypothetical protein HMPREF1069_03560 [Bacteroides ovatus
           CL02T12C04]
          Length = 810

 Score =  344 bits (882), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 222/600 (37%), Positives = 325/600 (54%), Gaps = 55/600 (9%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
           T     LK+ ++ PA+ + +A+P+GN RLGAM++G    E ++LNE+T+W G P    NP
Sbjct: 16  TVRAEELKLWYSHPAEEWVEALPLGNSRLGAMIYGNPFEEEIQLNEETVWGGSPYRNDNP 75

Query: 67  DAPKALSDVRSLVDSGQYAEATA-------ASVKLFGHPADVYQLLGDIELEFDDSHLKY 119
           +A   LS+VR L+ +G+  E TA       A  K  G P   YQ +G ++L F   H KY
Sbjct: 76  EAYGVLSEVRKLIFAGR--EITAEKLWKEHAFTKQNGMP---YQTVGSLKLHFP-GHEKY 129

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
            +  Y R+L++  A A V Y VG+V +TR  F+S  D  ++  +      S++F  S  +
Sbjct: 130 TD--YYRDLNIENAVATVSYKVGDVTYTRTLFTSLADNALIIHLEADRPHSIAFEASYST 187

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDD-PKGIQFSAILEIKISDDRGTISALE 238
             +  + +   N++ +           KA+A+++ P  I+  +   IK S   G + + +
Sbjct: 188 PFEESAVIASKNRLTLSA---------KASAHEEVPAAIRLESQARIKTSG--GKVES-D 235

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           + KL V  +D   + + A+++F    +N  D   + +      L  +   SY  L   H+
Sbjct: 236 NGKLIVTEADVVTIYVSAATNF----VNYQDVSANESKRVDVILNQVGKKSYRQLLDSHI 291

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
             YQ+ F RV + L  S         S++         R+K F+  +DP+LV L+FQFGR
Sbjct: 292 GKYQQQFGRVKLDLGHS-------LASQKETPV-----RLKEFREGKDPALVTLMFQFGR 339

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSS+PG Q ANLQGIWN+ L   WD    +NIN EMNYW +   NL E  EPLF  
Sbjct: 340 YLLISSSQPGGQPANLQGIWNQHLLAPWDGKYTININTEMNYWPAEITNLPETHEPLFRL 399

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +  L+  G KTAQ  Y  +GWV HH TDIW  +    G   +  WP GGAWL  HLW+HY
Sbjct: 400 VNELAETGKKTAQTMYHCNGWVAHHNTDIWRATGPVDGP-FYGTWPNGGAWLSQHLWQHY 458

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACV 536
            YT D+DFL K  YP+L+G A F +D+L+E H  Y  L T PS SPE    AP GK   +
Sbjct: 459 LYTGDKDFLIKN-YPVLKGAADFYMDFLVE-HPQYHWLVTIPSISPEQG--AP-GKETSL 513

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +   TMD  I+ +V S  + AA+++   ED + + +V K L RL P +I +   + EW++
Sbjct: 514 TAGCTMDNQIVFDVLSNTLQAAKIV--GEDIVYQDRVKKVLDRLPPMQIGKYNQLQEWLE 571


>gi|424878767|ref|ZP_18302405.1| hypothetical protein Rleg8DRAFT_5620 [Rhizobium leguminosarum bv.
           trifolii WU95]
 gi|392520277|gb|EIW45007.1| hypothetical protein Rleg8DRAFT_5620 [Rhizobium leguminosarum bv.
           trifolii WU95]
          Length = 747

 Score =  343 bits (881), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 215/586 (36%), Positives = 306/586 (52%), Gaps = 46/586 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA+ +TDA+P+GNGRLGAMV+G    E L++NE T W G P    NPDA   L  VR
Sbjct: 8   YDAPAQLWTDALPLGNGRLGAMVFGDPLREHLQINESTFWAGGPYQPVNPDAFGHLGTVR 67

Query: 77  SLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEET--YRRELDLN 131
            L+  G YA+A A A  +L   P     YQ +GD+ LEF     K+AE    YRR LDL+
Sbjct: 68  QLIFDGHYADAEALAEKRLMARPIKQMSYQPIGDLRLEF-----KFAESVSGYRRALDLD 122

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TA A   Y+   + + RE F S  D V+V ++S     ++S  +S+DS       +   +
Sbjct: 123 TAIATSSYTANGIAYLREAFVSPVDGVLVLRLSADRKRAISCRISIDSPQQGEMSIGEGS 182

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
           Q+   G+  GK     A A      ++F+    +++ +  GT+ A     L VEG+D  +
Sbjct: 183 QLSFSGK--GKAESGIAAA------LRFA--FGVRLINSGGTVKA-SGGGLSVEGADEVL 231

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           + L A++SF        D    P  + +  L+   +  +  L   H+ ++++LF   +I 
Sbjct: 232 VFLDAATSFR----RYDDVLGHPERDIVDRLERAASRDFVSLRDDHIAEHRRLFSAFAID 287

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           L  +P              ++P+ +R+  F   +DP+L  L  QFGRYL+I+SSRPGTQ 
Sbjct: 288 LGSTPAA------------SLPTDQRIAGFAGGDDPALAALYVQFGRYLMIASSRPGTQP 335

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
           ANLQGIWN    P W S    NINL+MNYW   P NL EC EPL +    L+  G   A 
Sbjct: 336 ANLQGIWNAQTDPPWGSKYTANINLQMNYWLPAPANLRECLEPLVEMAEELAETGKAMAH 395

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
           V+Y ASGWV+HH TD+W  +    G   W LWPMGG WL   L +  +Y  D + + +R 
Sbjct: 396 VHYRASGWVMHHNTDLWRATGPIDG-AKWGLWPMGGIWLMAQLLDACDYLDDAEAMRRRL 454

Query: 492 YPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
           +P+    A FL D L+   G D YL TNPS SPE+    P G   C      MD  +IR+
Sbjct: 455 FPIAREAAHFLFDVLVPFPGTD-YLVTNPSLSPENAH--PYGASICA--GPAMDSQLIRD 509

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            F  ++    V    E  LV  + + L RL P +I  +G + EW++
Sbjct: 510 -FLGLLRPLAVSIGGEPELVADIDRVLSRLAPDRIGANGQLQEWLE 554


>gi|408371866|ref|ZP_11169623.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
 gi|407742715|gb|EKF54305.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
          Length = 803

 Score =  343 bits (881), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 213/601 (35%), Positives = 330/601 (54%), Gaps = 60/601 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PA+ +  +IP+GNGRLGAM  GGV  E + LN+ TLW+G P D  +P+A K L
Sbjct: 26  LKLWYKQPAELWEGSIPLGNGRLGAMPDGGVSQENIVLNDITLWSGGPQDADDPNAIKYL 85

Query: 73  SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
            ++R L+  G+ ++A A   K F         G+ ADV    YQ+LG++   +   HL  
Sbjct: 86  PEIRRLLFEGKNSQAEALMYKTFVSKGPGSGKGNGADVPYGSYQILGNLHFNY---HLPN 142

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
             + Y+RELD+  ATA   +SV  VE+TRE+F+S  D VIV K++ S++  +SF++ +D 
Sbjct: 143 KAQDYKRELDITNATATTTFSVDGVEYTREYFTSFSDDVIVFKLTASKAAQISFDLGVDR 202

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
             +  +      +++M+G+          N   D  G++++  L +++  + GT+ A +D
Sbjct: 203 P-ERFTTTTQGEELLMQGQL---------NNGTDGNGMKYA--LRVRVIPEGGTLKA-KD 249

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM--SALQSIRNLSYSDLYTRH 297
             L+V G++ AV+L+ A++ +   F+        P  E    + L       Y+ L   H
Sbjct: 250 GTLQVNGANSAVILISAATDY---FV--------PNVEQWVETQLDKAEKKPYNTLKETH 298

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQF 356
           +D Y+ +F R SI+L            SE   + +P+ ER+K F+ T +DP L EL FQ+
Sbjct: 299 IDFYKNMFDRASIELG-----------SETQAEALPTDERLKRFEITKDDPGLAELYFQY 347

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYL ISS+RPG    NLQG+W   +   W+   H+NINL+MN+W     NL    +P +
Sbjct: 348 GRYLAISSTRPGLLPPNLQGLWANTVQTPWNGDYHLNINLQMNHWPIDVVNLPMLNQPYY 407

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
             +  L   G KTA+  Y   GWV H  T+IW  +S       W     G  W+C  LW 
Sbjct: 408 KLIKGLVEPGEKTAKTYYGGDGWVAHVITNIWGYTSPGE-HPSWGSTNSGSGWMCQMLWR 466

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLAC 535
           HY +  D D+L K+ YP+L+G A F    L+E  D  +L T PS SPE+ F   +G+ A 
Sbjct: 467 HYAFNQDMDYL-KKIYPILKGSAQFYNSTLVEHPDRDWLVTAPSNSPENAFFLTNGEKAN 525

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEWV 594
           V+ + T+D  IIR +F  +I A+++L+   D    K LK  + +L P +IA++G +MEW+
Sbjct: 526 VAIAPTIDNQIIRSLFQNVIEASQLLDV--DKQFRKQLKHRITKLPPNQIAKNGRLMEWI 583

Query: 595 Q 595
           +
Sbjct: 584 K 584


>gi|298482732|ref|ZP_07000916.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
 gi|298271195|gb|EFI12772.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
          Length = 823

 Score =  343 bits (880), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 218/590 (36%), Positives = 324/590 (54%), Gaps = 53/590 (8%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA+++ +A+P+GNGRLGAMV+G   +E ++LNE+T+  G P    NP+A  AL+ +R
Sbjct: 32  YDKPARYWEEALPLGNGRLGAMVYGNPVAEEIQLNEETVSAGSPYKNYNPEAKGALATIR 91

Query: 77  SLVDSGQYAEATA-ASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            L+ +G+Y EA   A  K+     FG P   YQ +G + L+F  SH  Y    +RRELDL
Sbjct: 92  QLIFAGRYPEAQELAGEKILSKNGFGMP---YQTVGSLCLDFP-SHENYT--NFRRELDL 145

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A   Y+V  V++ RE F+S  DQ+++ +++ S+ G L+F+ SL         V+G 
Sbjct: 146 EKAVATTAYTVNGVDYKREVFTSFVDQLVIVRLTASQPGKLTFSASLTCPQKVDVTVSGK 205

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           N + +EG   G         +D  KG I+F A L++   D +G  S   D  L V  ++ 
Sbjct: 206 NALTLEGTTKG---------DDFTKGSIRFRADLKL---DLQGGKSVAGDTLLSVTNANS 253

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A + +  +++F    +N  D   +P+  +  ++++    +Y      H+  YQK ++RVS
Sbjct: 254 ATIYIAMATNF----VNYKDISGNPSGRNKVSMKNAGK-NYVRALQAHISAYQKYYNRVS 308

Query: 310 IQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
           + L R S  D  TD              R+K F   +DP LV L FQFGRYLLISSS+PG
Sbjct: 309 LNLGRTSQADKPTDV-------------RIKEFAISDDPHLVALYFQFGRYLLISSSQPG 355

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
            Q ANLQGIWN+ L+P W      NIN EMNYW +   NL E  EP    +  L  NG +
Sbjct: 356 GQPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLREMHEPFLQMVKELYENGQE 415

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
            A+  Y   GWV+HH TD+W  + A DR       WP   AWLC HLW+ Y Y+ D+++L
Sbjct: 416 AAREMYGCRGWVLHHNTDLWRMNGAVDRAYC--GPWPTCNAWLCQHLWDRYLYSGDKEYL 473

Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
               YP+L+  + F +D+L+ + + GYL   PS SPE+      GK A +    TMD  +
Sbjct: 474 AS-VYPILKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPSIWKGK-ANLFAGITMDNQL 531

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
           + ++FS   SAA++L  N+D      + SL R L P ++ + G + EW +
Sbjct: 532 VSDLFSNTRSAAQIL--NQDKQFCDTILSLKRQLPPMQVGQYGQLQEWFE 579


>gi|241518404|ref|YP_002979032.1| Alpha-L-fucosidase [Rhizobium leguminosarum bv. trifolii WSM1325]
 gi|240862817|gb|ACS60481.1| Alpha-L-fucosidase [Rhizobium leguminosarum bv. trifolii WSM1325]
          Length = 747

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 215/586 (36%), Positives = 307/586 (52%), Gaps = 46/586 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA+ +TDA+P+GNGRLGAMV+G    E L++NE T W G P    NPDA   L  VR
Sbjct: 8   YDAPAQLWTDALPLGNGRLGAMVFGDPLREHLQINESTFWAGGPYQPVNPDAFGHLGTVR 67

Query: 77  SLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEET--YRRELDLN 131
            L+  G YA+A A A  +L   P     YQ +GD+ LEF     K+AE    YRR LDL+
Sbjct: 68  QLIFDGHYADAEALAEKRLMARPIKQMSYQPIGDLRLEF-----KFAESVSGYRRALDLD 122

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TA A   Y+   + + RE F S  D V+V ++S     ++S  +S+DS       +   +
Sbjct: 123 TAIATSSYTANGIAYLREAFVSPVDGVLVLRLSADRKRAISCRISIDSPQQGEMSIGERS 182

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
            +   G+  GK     A A      ++F+    +++ +  GT++A     L VEG+D  +
Sbjct: 183 LLSFSGK--GKAESGIAAA------LRFA--FGVRLINSGGTVNA-SGGGLSVEGADEVL 231

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           + L A++SF        D    P  + +  L+   +  +  L   H++++++LF   +I 
Sbjct: 232 VFLDAATSFR----RYDDILGHPERDIIDRLERAASRDFVSLRDDHIEEHRRLFSAFAID 287

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           L  +P              ++P+ +R+  F   +DP+L  L  QFGRYL+I+SSRPGTQ 
Sbjct: 288 LGSTPAA------------SLPTDQRIAGFAGGDDPALAALYVQFGRYLMIASSRPGTQP 335

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
           ANLQGIWN    P W S    NINL+MNYW   P NL EC EPL +    L+  G   A 
Sbjct: 336 ANLQGIWNAQTDPPWGSKYTANINLQMNYWLPAPANLRECLEPLVEMAEELAETGKVMAH 395

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
           V+Y A GWV+HH TD+W  +    G   W LWPMGG WL   L E  +Y  D + + +R 
Sbjct: 396 VHYRARGWVMHHNTDLWRATGPIDG-AKWGLWPMGGIWLMAQLLEACDYLDDAEAMRRRL 454

Query: 492 YPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
           +P+    A FL D L+   G D YL TNPS SPE+    P G   C      MD  +IR+
Sbjct: 455 FPIALEAAHFLFDVLVPFPGTD-YLVTNPSLSPENAH--PYGASICA--GPAMDSQLIRD 509

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            F  ++    V    E  LV  + + LPRL P +I  +G + EW++
Sbjct: 510 -FLGLLRPLAVSIGGEPELVADIDRVLPRLAPDRIGANGQLQEWLE 554


>gi|302867165|ref|YP_003835802.1| cellulose-binding family II protein [Micromonospora aurantiaca ATCC
           27029]
 gi|302570024|gb|ADL46226.1| cellulose-binding family II [Micromonospora aurantiaca ATCC 27029]
          Length = 936

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 219/590 (37%), Positives = 312/590 (52%), Gaps = 44/590 (7%)

Query: 11  NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           N L + ++ PA   +  A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D  N    
Sbjct: 44  NDLALWYDEPAGTDWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGA 103

Query: 70  KALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRR 126
             L+++R  V + Q+  A    +  + G P     YQ +GD+ L F  +        Y R
Sbjct: 104 ANLAEIRRRVFADQWTLAQDLINQTMMGSPGGQLAYQTVGDLRLAFGSAS---GATQYNR 160

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            LDL TAT    Y  G V + RE F+S PDQV+V +++   + +++F+ + DS       
Sbjct: 161 TLDLTTATVTTTYVQGGVRYQREVFASAPDQVMVLRLTADRANAITFSAAFDSPQRTTVS 220

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
                 + ++G           +       ++F A+    ++   GT+S+     L+V G
Sbjct: 221 SPDGATVALDG--------VSGSMEGVTGSVRFLALANAAVTG--GTVSS-SGGTLRVSG 269

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    +L+   SS+    +N      D    + + L + ++++   L TRH  DYQ LF 
Sbjct: 270 ATSVTVLVSIGSSY----VNYRTVNGDYQGIARNRLNAAKSVAVDQLRTRHRADYQALFD 325

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV+I L R        T + +     P+  R+    +  DP    LLFQFGRYLLISSSR
Sbjct: 326 RVTIDLGR--------TAAADQ----PTDVRIAQHASTNDPQFAALLFQFGRYLLISSSR 373

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PGTQ ANLQGIW++ L+P+WDS   VN NL MNYW +   NLSEC  P+FD +  L++ G
Sbjct: 374 PGTQPANLQGIWSDSLTPSWDSKYTVNANLPMNYWPADTTNLSECFLPVFDMVKDLTVTG 433

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
           ++ AQ  Y A GWV HH TD W  +S   G   W +W  GGAWL T +W+HY +T D  F
Sbjct: 434 ARVAQAQYGAGGWVTHHNTDAWRGASVVDG-AFWGMWQTGGAWLSTLIWDHYLFTGDSGF 492

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L+   YP L+G A F LD L+  H   GYL TNPS SPE    A     A V    TMD 
Sbjct: 493 LQAN-YPALKGAAQFFLDTLVA-HPTLGYLVTNPSNSPELAHHAN----ASVCAGPTMDN 546

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            I+R++F A   A+EVL   +     +V  +  RL P+++   G++ EW+
Sbjct: 547 QILRDLFDAAARASEVLGV-DTTFRSQVRTARDRLPPSRVGSRGNVQEWL 595


>gi|383642312|ref|ZP_09954718.1| alpha-l-fucosidase [Sphingomonas elodea ATCC 31461]
          Length = 788

 Score =  343 bits (879), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 209/597 (35%), Positives = 321/597 (53%), Gaps = 42/597 (7%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
                S  +PL + +  PA+ + +A+P+GNGRLGAMV+GG  +E  +LNEDT + G P D
Sbjct: 33  GGAGASPRDPLTLWYRQPAQEWVEALPLGNGRLGAMVFGGTTTERFQLNEDTFFAGSPYD 92

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKY 119
            TNP A  A+  +R LV  G+  EA A + K + G PA    YQ +GD+ L F       
Sbjct: 93  ATNPAAGPAIRRIRQLVFEGKGKEAQALADKDVIGRPAGQMPYQPIGDLLLLFPGLE--- 149

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS-GSESGSLSFNVSLD 178
               Y R LDL+ A A  ++  G+    RE  +S  DQVI  +++ G   G ++  ++L 
Sbjct: 150 GIRGYERSLDLDGAIATTRFRTGSTTHVREAIASAVDQVIAIRLTAGQGRGGVTTTLALT 209

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
           S   + S+V G + +++ G  PG R          P GI+F   + +  +D  G ++A +
Sbjct: 210 SPQQSESFVEGGDTLVLRGIGPGAR--------GVPGGIRFETRVRMIATD--GIVTAGK 259

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
              L VE +   VLLLVA+++    +    D   DP++   + + +     ++ L   H 
Sbjct: 260 -SDLSVEQAS-EVLLLVATAT---SYRRWDDIGGDPSAIVRAQIDAAAGKGWARLLADHQ 314

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
            D+++LF R+++ L R+P               +P+ ER++     +DP+L  L  QFGR
Sbjct: 315 ADHRRLFRRMTLDLGRTPAA------------ALPTDERIRRSTELDDPALATLYHQFGR 362

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI++SRPGTQ ANLQGIWNE + P+WDS   +NIN EMNYW +    L E  EPL   
Sbjct: 363 YLLIAASRPGTQPANLQGIWNERVHPSWDSKWTLNINAEMNYWPADMTGLGELTEPLLRL 422

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +  LS+ G +TA+ ++ A GW+ +H  D++  ++   G  VW LWPM GAWL + LW+H+
Sbjct: 423 VKELSVAGQRTARNDWGARGWMSYHNVDLFRNTALIDG-AVWGLWPMAGAWLLSSLWDHW 481

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           +Y+ DR FL +  YPL+ G   F LD L+     G L  NPS SPE++  A       V+
Sbjct: 482 DYSRDRTFLAE-LYPLMAGACDFYLDALVPHPTTGELVMNPSNSPENQHHAG----ISVT 536

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             + MD  ++R++F     AA +L ++E      +       +  +I + G + EW+
Sbjct: 537 AGAAMDSQLLRDLFGRTAEAARLLGRDESRARAVLAARARLPK-DRIGKAGQLQEWL 592


>gi|365122610|ref|ZP_09339511.1| hypothetical protein HMPREF1033_02857 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363642358|gb|EHL81716.1| hypothetical protein HMPREF1033_02857 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 852

 Score =  343 bits (879), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 214/587 (36%), Positives = 312/587 (53%), Gaps = 43/587 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PA  + +A+P+GN RLGAMV+G   +E ++LNE+T+W G P    NP+A   L
Sbjct: 64  LKLWYKQPATQWVEALPLGNSRLGAMVYGIPDNEEIQLNEETVWGGGPHRNDNPEAKDIL 123

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            +VR L+  G+  EA     K F  P +   YQ +G ++L FD  H  Y +  Y R+LDL
Sbjct: 124 PEVRRLIFEGKSKEAKPIMEKKFRTPRNGMPYQTIGSLKLHFD-GHENYTD--YYRDLDL 180

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A  +Y V  V +TRE F+S  D V++ +I+  + G+L+F     S L  H+     
Sbjct: 181 TRAVATTRYKVNGVTYTRELFTSFADNVVIMQITSDKQGALNFTADYVSPL-KHTVSTKK 239

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
            ++I+ G+         A+    P  I+      IK +D +   S   D K+ V  +  A
Sbjct: 240 GKLILSGKG--------ADHEGVPGVIRLENQTFIKTTDGKVKTS---DNKISVSDATTA 288

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            + + A+++F    +N +D   +    + + +++     Y      H+  Y+KLF RV++
Sbjct: 289 TIYISAATNF----VNYNDVSANEHKRADAYMKAALKKPYEKALADHIAYYKKLFDRVTL 344

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            L  S +        EE      +  RVK+F+   D SL  L+FQFGRYLLISSS+PG Q
Sbjct: 345 DLGTSKE------AQEE------THLRVKNFKNGNDVSLAVLMFQFGRYLLISSSQPGGQ 392

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWNE L   WD    +NIN EMNYW +   NLSE  EPL   +  LS++G +TA
Sbjct: 393 PANLQGIWNEKLQAPWDGKYTININTEMNYWPAEVTNLSETHEPLIQMVKELSVSGQETA 452

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
           +  Y  +GWV HH TD+W       G     +WP GGAWL  H+W+HY YT D+++L+  
Sbjct: 453 KEMYGCNGWVTHHNTDLWRSCGPVDGADY--VWPNGGAWLSQHVWQHYLYTGDKEYLQD- 509

Query: 491 AYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
            YP L+G A F LD+L E H  Y  + T PS+SPEH    P G    +    TMD  I  
Sbjct: 510 VYPALKGVADFFLDFLTE-HPTYKWMVTVPSSSPEH---GPRGNGNSIVAGCTMDNQIAF 565

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +  S  + A ++L  + D    K+   + RL P +I +   + EW+Q
Sbjct: 566 DALSNALQATKILNGDAD-YCNKLQNMIDRLAPMQIGQYNQLQEWLQ 611


>gi|281421059|ref|ZP_06252058.1| putative large secreted protein [Prevotella copri DSM 18205]
 gi|281404977|gb|EFB35657.1| putative large secreted protein [Prevotella copri DSM 18205]
          Length = 790

 Score =  342 bits (878), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 208/606 (34%), Positives = 326/606 (53%), Gaps = 48/606 (7%)

Query: 2   MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           + A   S   PLK+ +N PA  F +++PIGNG+LGA+++GG  ++++ LN+ TLWTG P 
Sbjct: 17  LQAVPKSNIPPLKLWYNKPATAFEESLPIGNGKLGALIYGGANNDSIYLNDITLWTGKPV 76

Query: 62  DYT-NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
           +     DA K +  +R  +    Y  A +  + + GH ++ YQ L  I ++ D +  +++
Sbjct: 77  NREEGGDAYKWIPKIREALFKEDYKAADSLQLHVQGHNSEYYQPLAIINIK-DANKGQFS 135

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              Y+REL L+ ATA + Y+ G +++ RE+F+S+PD++I   ++ ++  +++ ++SL SL
Sbjct: 136 --NYKRELSLDNATAALSYTRGGIQYQREYFASHPDKMIAIHLTATQKKAINCDISLTSL 193

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
           +  H     N Q+ + G   GK              I F +IL IK  D  GTI+A  D 
Sbjct: 194 IP-HQVKASNKQLTITGHAMGK----------PENSIHFCSILSIKNQD--GTITA-SDS 239

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR-------NLSYSDL 293
            L ++G   AV+ LV  +S++G         K P  E    ++ +        N +Y +L
Sbjct: 240 ILHLQGVSEAVIYLVNETSYNG-------FDKHPVKEGAPYIEKVNDNAWHLVNYTYPEL 292

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
             RH+ DYQ +F+R    L  +  D    T  ++  D     E        ++P L  L 
Sbjct: 293 KQRHITDYQNIFNRAKFALKGAKFD-NKRTTDQQLFDYTEKEE--------QNPYLEMLY 343

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQ+GRYLLIS SR     ANLQG+W       W     +NINLE NYW +   N+SE   
Sbjct: 344 FQYGRYLLISCSRTPGIPANLQGLWAPARKSPWRGNYTININLEENYWPAEVTNMSELVM 403

Query: 414 PLFDFLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAW 469
           P+   +  +S+ G  TA+  Y + +GW   H TD WA ++     +    W+ W MGGAW
Sbjct: 404 PVDGLVKAMSVTGKYTAKHYYGIENGWCGGHNTDAWAMTNPVGTKKESPKWSNWNMGGAW 463

Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFI 527
           L   LW+HY+YT D+++L + AYPL++G A F+LDW+IE     G L T P TSPE E+I
Sbjct: 464 LVQTLWDHYDYTRDKEYLRQTAYPLMKGAADFMLDWIIENPKKPGELLTAPCTSPEAEYI 523

Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
              G   C  Y  T D+ I+RE+F   +  A++L+ ++ A   K+  ++ RL P +I + 
Sbjct: 524 TDKGYQGCSFYGGTADLTILRELFKNTLKGAQILDIDQ-AYQAKLQDAINRLHPYQIGKR 582

Query: 588 GSIMEW 593
           G++ EW
Sbjct: 583 GNLQEW 588


>gi|329928902|ref|ZP_08282716.1| hypothetical protein HMPREF9412_5464 [Paenibacillus sp. HGF5]
 gi|328937273|gb|EGG33698.1| hypothetical protein HMPREF9412_5464 [Paenibacillus sp. HGF5]
          Length = 874

 Score =  342 bits (878), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 208/626 (33%), Positives = 318/626 (50%), Gaps = 67/626 (10%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
           S    L++ ++ PA  + +A+PIGNGRLG MV+G    E ++LNED+LW G PG   NP+
Sbjct: 52  SANRRLRLWYDSPAAEWNEALPIGNGRLGGMVFGKPSLERVQLNEDSLWYGGPGRGGNPN 111

Query: 68  APKALSDVRSLVDSGQYAEAT-AASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEETY 124
           A + LS++R ++  G+ AEA   A + +   P     YQ LGD+ L+F D   +   E Y
Sbjct: 112 ASRYLSEIRQMLFDGRQAEAEHLARMAMTSSPKYEQPYQPLGDLLLKFLDG--EETVEHY 169

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLLDN 183
            RELDL  +   V YS   + F R++F++ PD V+V ++S    G+L+F  +L     D 
Sbjct: 170 ERELDLERSMVTVSYSSRGIRFRRQYFATAPDGVLVIRLSADRPGALTFAANLMRRPFDG 229

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
            +    ++ ++MEG C                GI F   + ++ +   G +  + D  L 
Sbjct: 230 GTASLRHDTLLMEGEC-------------GADGISFG--MALRAAAVGGIVQTIGDF-LS 273

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           VEG+D   LLL A +SF           + P    +  L     +SY  L  RH  +Y++
Sbjct: 274 VEGADSVTLLLSAQTSF---------RCRQPVQVCLEQLDRAAGMSYEQLVNRHQAEYRE 324

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENI------DTVPSAERVK----------SFQTDE-- 345
            F R S+ L           C +         + + +++RV+          S  TD   
Sbjct: 325 KFERFSLTLGTGKNGAGRTECVDSGTSFSNGTEVIRASDRVEYPNGIEDDQPSLPTDRRL 384

Query: 346 -----------------DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
                            DP L+ L  Q+GRYLLIS SRP +  ANLQGIWN+  +P W+S
Sbjct: 385 NLLKDRVKTEGASAENSDPELIALYVQYGRYLLISCSRPESLAANLQGIWNDSFTPPWES 444

Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
              +N+N++MNYW +    L+EC EPLFD +  +  NG  TA+  Y   G+  HH T++W
Sbjct: 445 KYTINVNIQMNYWPAELLGLAECHEPLFDLIDRMLPNGRDTAREMYGCRGFAAHHNTNLW 504

Query: 449 AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE 508
            ++  +   +   +WPMG AWLC HLWEHY +  D DFL +RAYP+++  A FLLD++  
Sbjct: 505 GETRPEGILMTCTVWPMGAAWLCLHLWEHYRFGGDADFLRERAYPVMKEAAEFLLDYMTV 564

Query: 509 GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL 568
             +G   T PS SPE+ F+  +G +  +     MD  I   +F A + A  ++  +E A 
Sbjct: 565 DEEGRRMTGPSVSPENRFVLSNGAVGSLCMGPAMDGQIATALFRACLEAGHLV-GDEPAF 623

Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEWV 594
           + ++  +L  +   +I   G IMEW+
Sbjct: 624 LGELQTALEEIPAPQIGRHGGIMEWL 649


>gi|260066219|gb|ACX30659.1| Fuc19 [Sphingobacterium sp. TN19]
          Length = 821

 Score =  342 bits (877), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 216/593 (36%), Positives = 324/593 (54%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPA--KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           LK+ ++ P   + +  A+PIGNGRLGAMV+G    E L+LNE+T++ G P    NP+A  
Sbjct: 33  LKLWYDQPVVDQIWEQALPIGNGRLGAMVYGIPEREELQLNEETIYAGGPYRNDNPNALN 92

Query: 71  ALSDVRSLVDSGQYAEATAAS-----VKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
           AL  ++ L+ +G+  EA   +      K  G P   YQ  G + L F D H  Y  + Y 
Sbjct: 93  ALPQIQQLIFAGKTEEADRLTNQSFFTKTHGMP---YQTAGSVILNFPD-HKHY--QHYY 146

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           RELDL  A  R +Y+V  V +TR+ FSS  D VIV +I+ S+ G+L+F++   +  +   
Sbjct: 147 RELDLEKAVVRSRYTVEGVTYTRQVFSSFADDVIVMEITASKKGALNFDLEYANPSECKV 206

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKV 244
           Y +G + +I+EG            +++  +G I++     +K  D R T   L D KL V
Sbjct: 207 YKSGQS-LILEG---------SGTSHEGIEGKIRYQKHTAVKNKDGRVT---LTDNKLTV 253

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G+   V+ +  +++F    +N     ++   ++ S L   +  ++     +H+  Y K 
Sbjct: 254 SGATSVVIYMAVATNF----VNYKTVDQNAGVKAASTLALAQKKAFQTALKQHIAMYSKQ 309

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F R  + L +        T  +EN+ T    +R++SF+T +DP+LV LL QFGRYLLI S
Sbjct: 310 FARFKLDLGQ--------TAGQENLTTT---KRIESFKTTQDPALVALLVQFGRYLLICS 358

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+PG Q ANLQGIWN  ++P WDS   VNIN EMNYW +   NLSE  EPLF  +  LS 
Sbjct: 359 SQPGGQPANLQGIWNRSMNPPWDSKYTVNINTEMNYWPAEVTNLSETHEPLFQLIKELSE 418

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           +G +TA+V Y A GWV HH TD+W  +S         +WP GG WL  HLWEHY YT D+
Sbjct: 419 SGRETARVLYGADGWVTHHNTDLWRVTSPIDFAAA-GMWPTGGTWLTQHLWEHYLYTGDQ 477

Query: 485 DFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
            FL +  YP+++G A F+L  LI    H  +L   PS SPEH           +S   TM
Sbjct: 478 KFLTE-VYPVMKGAADFILSILIAHPKHKDWLVIAPSISPEH---------GPISTGITM 527

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           D  +  ++ +    A+E+++++  A   K++K+  +L P ++     + EW++
Sbjct: 528 DNQLAFDILTRTALASEIVDQDA-AYKAKLIKTARKLPPMQVGRYAQLQEWLE 579


>gi|295132887|ref|YP_003583563.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
 gi|294980902|gb|ADF51367.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
          Length = 820

 Score =  342 bits (877), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 207/590 (35%), Positives = 330/590 (55%), Gaps = 46/590 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + ++ PAK + +A+P+GNGRLGAMV+G    ET++LNE+T+W G PG+     + + L
Sbjct: 27  MTLNYDEPAKVWEEALPVGNGRLGAMVFGRTGMETIQLNEETVWAGEPGNNVVTLSEEQL 86

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPAD----VYQLLGDIELEFDDSHLKYAEETYRREL 128
            ++R  +   +Y +A   + K      +     YQ +G++ L F +S+   A   Y+REL
Sbjct: 87  EEIRKAIFQEEYQKAQQLADKYLSKKDNNSGMSYQTVGNLILNFPNSN---AVRDYKREL 143

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           D++ A + V Y  G V + R   SS PD VI+ +++ ++ GS+SF + L S   +H    
Sbjct: 144 DISKAVSTVTYKTGGVAYKRRIISSFPDDVIMVELTANKPGSISFEMGLKSPHKSHDIQI 203

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGS 247
            N+++ + G          ++  ++ KG ++F  I + KI  + G I   E++ LK+ G+
Sbjct: 204 KNDEVWLSGT---------SSDQENKKGKVKFLVIAKPKI--EGGRIETTENR-LKITGA 251

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           + AV+ +  +S+F     N  D  +D  S++++ L ++    +      H+ +YQ+ F+R
Sbjct: 252 NRAVIYISIASNFK----NYKDLSEDAESKAIALLNAVYIKEFGKCLDAHIAEYQQYFNR 307

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           V +       D+ T     +  D      R++ F   +DP L+ L FQFGRYLLISSS P
Sbjct: 308 VQL-------DLGTSNAINKTTDI-----RLEEFNDSDDPQLIALYFQFGRYLLISSSMP 355

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           GTQ ANLQGIWN++++  WDS   VNIN EMNYW +   NLSE  +PLF  +  +S  G 
Sbjct: 356 GTQPANLQGIWNKEINAPWDSKYTVNINTEMNYWPAEVANLSEMHKPLFGLIKDISETGK 415

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           ++A+  Y A GW +HH TDIW + S       + LWP GG WL  HLW+HY +T D  FL
Sbjct: 416 ESAEKMYHARGWNMHHNTDIW-RISGVVDPPFYGLWPHGGGWLSQHLWQHYLFTGDTKFL 474

Query: 488 EKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
            K  YP+L+G A F  D L  E  + ++  NPS SPE+         + ++  +TM   I
Sbjct: 475 -KEVYPILKGTALFYKDILQQEPENKWMVVNPSNSPENGHTGG----SSLAAGTTMGNQI 529

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWVQ 595
           +++VFS  + A+++L  NED      +K++ P L P +I + G + EW++
Sbjct: 530 VQDVFSNFLEASQIL--NEDKKFSDSIKNVTPNLAPMQIGKWGQLQEWMK 577


>gi|345562260|gb|EGX45329.1| hypothetical protein AOL_s00170g36 [Arthrobotrys oligospora ATCC
           24927]
          Length = 826

 Score =  342 bits (876), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 210/598 (35%), Positives = 322/598 (53%), Gaps = 49/598 (8%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
           S ++PL+I       +F D+  IGNGR+GA + GG  SE +++NED+LW+G      NPD
Sbjct: 30  SASHPLRIWTTSAGSYFNDSYLIGNGRIGAALPGGAASEVIRVNEDSLWSGGKLSRVNPD 89

Query: 68  APKALSDVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETY 124
           A   + D++SL+   +  EA   A     G P  A  Y+ LGD++L  + S    +   Y
Sbjct: 90  ANGKMRDIQSLLTQQRNPEAARLAGFAYAGTPVSARHYEPLGDLQLVMNHSS---STTGY 146

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-----S 179
            R LDL  ++  V Y+VG V + RE+ +SNPD +I   I+ S+  S+SFN+ L      +
Sbjct: 147 ERWLDLFDSSVGVYYTVGGVSYRREYIASNPDNIIAIHITASKPASVSFNIHLRKGQSLN 206

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
             ++++Y  G++  +M G   GK             G++FSA    K+    G +  L D
Sbjct: 207 RWEDYTYKVGSDTTVMGGESQGK------------DGVKFSA--GTKVVASGGKVYTLGD 252

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
             +  + +D A +   A +++          ++DP ++ +S L SI   SYSD+   H+ 
Sbjct: 253 YVI-CDNADEATIFFTAWTAY---------RQQDPINKVLSDLSSISVKSYSDIRATHVA 302

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DYQK F RVS+ L            S +    + + +R+ +  +  DP LV L FQFGRY
Sbjct: 303 DYQKYFGRVSLSLG----------SSSDTQKALSTPKRLAAIASTFDPELVALYFQFGRY 352

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           L ISSSR  T   NLQGIWN+++ P W S   VNINL+MNYW SL  N+ E   PL+D +
Sbjct: 353 LFISSSRVNTLPPNLQGIWNQEMDPQWGSKYTVNINLQMNYWPSLVTNMIELTTPLYDLI 412

Query: 420 TYLSINGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
             L  +G KTAQ  Y  S GWV HH TDIWA ++          WP G AWL  H+ E Y
Sbjct: 413 ARLHSSGKKTAQSMYGNSQGWVCHHNTDIWADTAPQDNYASSTWWPAGSAWLVHHIIEEY 472

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK-LACVS 537
            +T D++FL+K  Y  ++  A F  ++L   + G+  TNP+ SPE+ F     K    ++
Sbjct: 473 RFTRDKEFLQKY-YNTIKDAALFFTEFLTN-YKGWKVTNPTLSPENTFYLLGTKTTTAIT 530

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             ST+D ++I E+F +++   ++L K+++++   +     +L P +I + G IMEW++
Sbjct: 531 LGSTLDNSLIWELFGSLLEIMDILGKHDNSMKSTLHDLRAKLPPLRINKWGGIMEWIE 588


>gi|340619504|ref|YP_004737957.1| alpha-L-fucosidase [Zobellia galactanivorans]
 gi|339734301|emb|CAZ97678.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
          Length = 817

 Score =  342 bits (876), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 209/600 (34%), Positives = 324/600 (54%), Gaps = 51/600 (8%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA  + +A+P+GNGR+GAMV+G    E ++ NE+T W+G P         K L +++
Sbjct: 42  YDKPASMWEEALPVGNGRIGAMVYGKSGEEKIQFNEETYWSGGPYSQVVKGGYKKLPEIQ 101

Query: 77  SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
             + +G+  +A     + L G+P +   YQ L ++ L F    +    + YRR LDL T 
Sbjct: 102 KYIFNGEPIKAHKLFGRALMGYPVEQQKYQSLANLHLFFGQDSV----DNYRRSLDLKTG 157

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
              V+Y+ G V +T+E F+S  DQ I  +I+  + GS++F+  L  + ++       +  
Sbjct: 158 VVTVEYTYGGVNYTKEVFASAVDQTIAIRITADKPGSINFDAELRGVRNSAHSNYATDYF 217

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKLKVEGSDWAV 251
            M+G   GK        + D  G++     E  IK   + GT+S ++   L ++ +D A 
Sbjct: 218 RMDGL--GKDQLKLTGKSADYMGVEGKLRYEARIKAVPEGGTMS-IDGTMLSIKNADAAT 274

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           L  VA+++F    +N  D   D        L  ++  S+  +    L DY++ F RVS+ 
Sbjct: 275 LYFVAATNF----VNYKDVSADENKRVEDMLAKVQQSSFDAIKKSALADYKEYFDRVSLT 330

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           L  +    +            P+ +R+   Q+  DP L  L + FGRYLLISSSRPGTQ 
Sbjct: 331 LPTTDNSFL------------PTDKRMVEIQSSPDPQLSTLCYNFGRYLLISSSRPGTQP 378

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
           ANLQGIWN D++P WDS    NIN EMNYW     NLSE  EPL   +  L+  G+K A+
Sbjct: 379 ANLQGIWNNDMNPAWDSKYTTNINTEMNYWAVESANLSELSEPLTTMVKELTDQGAKVAK 438

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
            +Y A GWV H  TD+W + +A      W  + +GGAWL THLWEHY +T D+++L K  
Sbjct: 439 EHYGADGWVFHQNTDLW-RVAAPMDGPTWGTFTVGGAWLTTHLWEHYLFTQDKEYL-KDI 496

Query: 492 YPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDGK--------------LAC 535
           YP+++G   F +D+L+E  G D +L TNPS SPE+    P+GK                 
Sbjct: 497 YPVMKGSVEFFMDFLVEYPGTD-WLVTNPSNSPEN---PPEGKGYKYFYDEITGMYYFTT 552

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +   ST+DM I++++FS   SA+E+L+ + + L ++V  +  RL P++I +DG++ EW +
Sbjct: 553 IVAGSTIDMQILKDLFSYYDSASEILDVDPE-LRKQVSIARSRLVPSQIGKDGTLQEWTE 611


>gi|260642325|ref|ZP_05415419.2| alpha-L-fucosidase 2 [Bacteroides finegoldii DSM 17565]
 gi|260622630|gb|EEX45501.1| hypothetical protein BACFIN_06792 [Bacteroides finegoldii DSM
           17565]
          Length = 824

 Score =  342 bits (876), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 227/600 (37%), Positives = 342/600 (57%), Gaps = 49/600 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E+ ++T   K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+T+W G P +  
Sbjct: 25  ETNASTQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGIPGTEQIQLNEETIWAGRPNNNA 84

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NP+A + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 85  NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y++  Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ S  G ++FN  L 
Sbjct: 141 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTASRPGQITFNAQLT 198

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
           S    H  V  +++   EG C    +   ++ ++  KG ++F   L  +   +RG   A 
Sbjct: 199 S---PHQDVMISSE---EGNC--VTLSGVSSWHEGLKGKVEFQGRLTAR---NRGGKIAC 247

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            D  L VEG+D A++ +  +++F+    N  D   +    +   L       + +    H
Sbjct: 248 ADGILSVEGADEAIIYVSIATNFN----NYLDITGNQIERTKDYLSKAMKHPFPEAKKNH 303

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
            D Y++   RVS+ L ++           ENI T    +RV++F+   D  LV   FQFG
Sbjct: 304 TDFYRRYLTRVSLNLGKN---------RYENITT---DKRVENFKDTNDAHLVATYFQFG 351

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  EPLF 
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFR 411

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC HLWE 
Sbjct: 412 LIKEVSETGKETAKIMYGANGWVLHHNTDIWRVTGAI-DKAPSGMWPSGGAWLCRHLWER 470

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           Y YT D DFL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK A  
Sbjct: 471 YLYTGDTDFL-RSIYPILKESGRFFDEIMVKEPIHNWLVVCPSNSPENVHSGNNGK-ATT 528

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +   TMD  +I ++++AIISA+E+L+ ++D    +++ LK +P   P +I   G + EW+
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASEILDTDKDFATHLKQRLKEMP---PMQIGHWGQLQEWM 585


>gi|222106243|ref|YP_002547034.1| hypothetical protein Avi_5141 [Agrobacterium vitis S4]
 gi|221737422|gb|ACM38318.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 741

 Score =  342 bits (876), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 214/585 (36%), Positives = 312/585 (53%), Gaps = 41/585 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ ++  A  +T+A+P+GNGRLGAMV+G   +E L++NE T W+G P    NPDA  AL 
Sbjct: 5   ELWYDRAASVWTEALPVGNGRLGAMVFGDAWNERLQINESTFWSGGPYQPINPDARAALP 64

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           +VR+L+ + +Y EA   + +      D    YQ +GD+ L   D H       YRR LDL
Sbjct: 65  EVRNLILAERYQEADRKAYEGAMAKPDRQTSYQPIGDVWL---DLHHDMTVTNYRRSLDL 121

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
            TA A  +Y    V F R+ F+S    VIV KIS  + G+LS  V L S  +       +
Sbjct: 122 ETAVAVTQYDCHGVHFRRDVFASAIQDVIVCKISVDQPGALSMTVMLSSPQNGDPIDIAD 181

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
             +  +GR            N     ++F+    +++  + G +  + ++ ++V  +   
Sbjct: 182 ATLGYDGR--------NRRQNGIDSALRFA--FRVRVLAEGGFVD-IGEETIRVREASSV 230

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           +LL+ A +SF     N      DP ++  + L +   LSY  L   H+ ++++LF+R+ I
Sbjct: 231 MLLIDAGTSFQ----NYRTVDGDPQAQIKARLDAAAMLSYEALLEAHVTEHRRLFNRMQI 286

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            L   P            + T+P+ +RV ++   +DPSL  L  Q+GRYL IS SRPGTQ
Sbjct: 287 ALGDKP------------VPTLPTDKRVAAYAEGDDPSLAALYLQYGRYLAISCSRPGTQ 334

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWNED+ P W S   VNINLEMNYW +   NLSE   PL + +  ++  G + A
Sbjct: 335 AANLQGIWNEDILPAWGSKYTVNINLEMNYWLADVANLSETFLPLVELVEDVAETGREMA 394

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
           + +Y A GWV+HH TDIW  +    G   W LWPMGGAWLC  L++HY +  DR  LE R
Sbjct: 395 KAHYGARGWVLHHNTDIWRATGPIDGP-HWGLWPMGGAWLCAQLYDHYRFNPDRAVLE-R 452

Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
            YPL++G   F LD L+   D  YL T PS SPE+    P G   C   +  MD  I+R+
Sbjct: 453 IYPLIKGAVEFALDTLVALPDSNYLGTCPSLSPENSH--PFGSSLCA--APAMDNQILRD 508

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +F A   A+  L ++ +   E    +  RL   +I + G + EW+
Sbjct: 509 LFEAFADASATLGRDGELRTEAA-ATRARLPEDRIGKGGQLQEWM 552


>gi|336251922|ref|YP_004585890.1| alpha-L-fucosidase [Halopiger xanaduensis SH-6]
 gi|335339846|gb|AEH39084.1| Alpha-L-fucosidase [Halopiger xanaduensis SH-6]
          Length = 786

 Score =  341 bits (874), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 210/595 (35%), Positives = 320/595 (53%), Gaps = 47/595 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA  + DA+P+GNGRLGAM +GG+  E ++ NE+TLW G   +     A +   ++R
Sbjct: 11  YDEPADEWIDALPLGNGRLGAMAYGGLERERIQCNEETLWAGGHEEKVVEGASEHGEEIR 70

Query: 77  SLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
            L   G+Y EA    +  L G P  +   L   +L  +      A   YRRELDL     
Sbjct: 71  QLCFEGEYEEAQRRCNEHLQGEPPGIRPYLPFCDLLIEQPGHDEAT-AYRRELDLADGCY 129

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
           RV+Y +    +TRE+F S PD V+V ++      S+  ++ LD      + V+  N++++
Sbjct: 130 RVEYDLEGTTYTREYFVSAPDDVLVVRLECDGPRSIDASIRLDRDRCARAGVDEENRLLL 189

Query: 196 EGRCPGKRIPPKANANDDPKG--IQF---------SAILEIKISDDRGTISALEDKKLKV 244
            G+     +P  A+      G  ++F          A +E  + DD G   +     + V
Sbjct: 190 RGQV--IDVPNTADMYQGSGGWGLRFEGRAAVRTSGASVEPNVDDDWGQSPS----AVTV 243

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G+D   ++  A++ FDG          DP+  + + L++  +  Y +L  RH+DD++ L
Sbjct: 244 TGADAVTVVFAAATDFDG---------DDPSDATTATLEAAADRRYEELKRRHVDDHRAL 294

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F RVS++L   P D   D    E +  V +  R        DP LV+L FQ+GRYLL++S
Sbjct: 295 FDRVSLELG-DPVDAPID----ERLAAVRNGSR--------DPHLVQLYFQYGRYLLLAS 341

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           SRPGT  ANLQGIWNE+  P W S   +++NLEMNYW +   NL+EC EPL  F+  +  
Sbjct: 342 SRPGTLPANLQGIWNEEYDPPWHSCYTLDVNLEMNYWHAEVANLAECAEPLVAFVDSMRE 401

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           +G +TA+  Y   G+  H  TD+W +++       W  WPM  AWLC +LW+HY ++ DR
Sbjct: 402 SGRRTAREYYDCDGFAAHVDTDLW-RTTVQTVDARWGHWPMAPAWLCRNLWDHYAFSGDR 460

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
             LE   YP+L+  A FLLD+L+E  D G+L T PS SPE++F  PDG+ A V    TMD
Sbjct: 461 TDLET-IYPILKDAARFLLDFLVEHPDRGWLVTAPSASPENQFRTPDGQEATVCEGPTMD 519

Query: 544 MAIIREVFSAIISAAE---VLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           + +  ++F+  I AA    V +  +++ V  +  +L RL P +I E G + EW++
Sbjct: 520 VQLATDLFTHCIEAATELGVADGADESFVADLSDALERLPPMQIGEHGQLQEWLE 574


>gi|238060476|ref|ZP_04605185.1| large secreted protein [Micromonospora sp. ATCC 39149]
 gi|237882287|gb|EEP71115.1| large secreted protein [Micromonospora sp. ATCC 39149]
          Length = 826

 Score =  340 bits (873), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 217/580 (37%), Positives = 305/580 (52%), Gaps = 41/580 (7%)

Query: 19  GPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSL 78
           G    +  A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D  N      L+++R  
Sbjct: 53  GAGTDWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANLAEIRRR 112

Query: 79  VDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
           V + Q++ A    +  + G P     YQ +G++ L F  +        Y R LDL TAT 
Sbjct: 113 VFADQWSSAQDLINQTMMGTPGGQLAYQTVGNLRLAFGSAS---GASQYNRTLDLTTATV 169

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
              Y +  V + RE F+S PDQVIV +++   + S++F+ + DS           N I  
Sbjct: 170 TTTYVLNGVRYQREVFASAPDQVIVLRLTADRASSITFSATFDSPQRTTMSSPDANTIAA 229

Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
           +G           +       ++F A+     +   GT+S+     L+V G+    +L+ 
Sbjct: 230 DG--------ISGSMEGINGSVRFLALAHAVATG--GTVSS-SGGTLRVSGATSVTVLIS 278

Query: 256 ASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
            +SS+    +N      D    + + L + R +S   L +RH+ DYQ LF+RV+I L R 
Sbjct: 279 IASSY----VNYRTVNGDYQGIARTRLNAARTVSIDQLRSRHIADYQALFNRVTINLGR- 333

Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQ 375
                  T + +     P+  R+    +  DP    LLFQFGRYLLISSSRPGTQ ANLQ
Sbjct: 334 -------TAAADQ----PTDVRIAQHASSNDPQFSALLFQFGRYLLISSSRPGTQPANLQ 382

Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL 435
           GIWN+ L+P+WDS   +N NL MNYW +   NLSEC  P+FD +  L++ G++ AQ  Y 
Sbjct: 383 GIWNDSLAPSWDSKYTINANLPMNYWPADTTNLSECFLPVFDMIKDLTVTGARVAQAQYG 442

Query: 436 ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
           A GWV HH TD W  +S   G  +W +W  GGAWL T +WEHY +T D  FL+   YP L
Sbjct: 443 AGGWVTHHNTDAWRGASVVDG-ALWGMWQTGGAWLATLIWEHYLFTGDVGFLQAN-YPAL 500

Query: 496 EGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAI 554
           +G A F LD L +     YL TNPS SPE     P      V    TMD  I+R++F A 
Sbjct: 501 KGAAQFFLDTLVVHPTLNYLVTNPSNSPE----LPHHSNVSVCAGPTMDNQILRDLFDAA 556

Query: 555 ISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             A+E L   +     +V  +  RL P+++   G+I EW+
Sbjct: 557 ARASETLGV-DTTFRSQVRTAKDRLPPSRVGSRGNIQEWL 595


>gi|149197929|ref|ZP_01874977.1| hypothetical protein LNTAR_21070 [Lentisphaera araneosa HTCC2155]
 gi|149138841|gb|EDM27246.1| hypothetical protein LNTAR_21070 [Lentisphaera araneosa HTCC2155]
          Length = 765

 Score =  340 bits (872), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 210/600 (35%), Positives = 312/600 (52%), Gaps = 75/600 (12%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A P GNGRLGAMV+G +  E + LN+DTL+ G   D  NPD    L  +R L+  G+ +E
Sbjct: 19  AFPAGNGRLGAMVFGDIDEERIALNDDTLYNGGQRDRFNPDCLPNLDCIRQLIFDGKLSE 78

Query: 87  ATAASVK-LFGHPADV--YQLLGDIEL---------------EFDDSHLKYAE------E 122
           A A + + + G P  +  Y+ L D+ +                FD   L Y +       
Sbjct: 79  AEALTQEAVTGLPPIMRNYEPLADLLISQKYSKEAYKQVDPNNFDPMDLAYGKIYQAAFS 138

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---- 178
            YR+ LDL  +    ++ V  +++ RE  SS PD +I  ++S SE  S++  + ++    
Sbjct: 139 DYRKSLDLENSIITTEFEVAGIKYKRELISSFPDDLIYLRLSASEKKSINVKLRIERGDA 198

Query: 179 SLLDNHSYVNGN----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
           ++     Y   +    N + +EGR                +GI F A L  ++   +G  
Sbjct: 199 AMYSTRHYDKVSSPVENSLFIEGRTGSN------------EGIDFVAGLRTQV---QGGS 243

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
                + L ++ +D  V+ +   +S           +  P +    +L+  +N  + ++Y
Sbjct: 244 CEKIGESLIIKDADEVVIAICGHTSV---------RQNSPMTSLKKSLE--KNFDWQEVY 292

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELL 353
            RH +DYQKL+ RV ++++            +EN+   P+ ER++  Q ++ D  L +L 
Sbjct: 293 LRHREDYQKLYKRVKLEIAHQ---------DDENL---PTDERLRKAQNNQSDVVLDQLY 340

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           F FGRYLLIS SRPG+  ANLQGIWN+  SP+W S   +NIN++MNYW +  CNLSEC E
Sbjct: 341 FNFGRYLLISCSRPGSMTANLQGIWNDSFSPSWGSKYTININIQMNYWPAEVCNLSECHE 400

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLFD L  L ING +TA+  Y   G+V HH TD    +      V  + WPMGGAWL  H
Sbjct: 401 PLFDMLEKLHINGQETAKKMYNCRGFVCHHNTDNTYDTYPTDRNVTASYWPMGGAWLALH 460

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           LWEHY +T DRDFL K  Y ++   A F +D+L E   G L T+PS SPE+ ++ P+G+ 
Sbjct: 461 LWEHYKFTQDRDFLSK-YYQIIHDAALFFVDFLCENPKGQLVTSPSVSPENTYLLPNGEY 519

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             +    TMD +IIRE+  A   A+ +L K  D   + +L  LP   P +I + G IMEW
Sbjct: 520 GTICAGPTMDNSIIREIILATQEASRLLNKTLDQDYDGILAKLP---PLEIGKHGQIMEW 576


>gi|224538524|ref|ZP_03679063.1| hypothetical protein BACCELL_03418 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224519862|gb|EEF88967.1| hypothetical protein BACCELL_03418 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 1061

 Score =  340 bits (872), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 219/595 (36%), Positives = 318/595 (53%), Gaps = 50/595 (8%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
           TS  N +K+ +N PA+ + +A+P+GN RLGAMV+GG   E L+LNE+T W G P +  NP
Sbjct: 265 TSAQN-MKLWYNRPAQDWLEALPLGNSRLGAMVFGGTAREELQLNEETFWAGGPYNNNNP 323

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
              + L ++R L+  G+  EA     + +  P     Y  LG + L F   H   +E  Y
Sbjct: 324 KGLQVLPEIRRLIFEGKTLEAQKLIDENYMTPQHGMRYLTLGSLFLNFP-GHENPSE--Y 380

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
            R+L+L  ATA  +Y V  V+F R  F+S  D VI+ +I   ++ +L+F VS  S L + 
Sbjct: 381 YRDLNLENATATTRYEVDGVKFVRTAFASLSDDVIIVRIQADKAKALNFAVSYSSPLKSD 440

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
             V G   II    C G      A     P  ++  A  ++++  D G +S  E+  L V
Sbjct: 441 VQVKGGKLII---SCQG------AEHEGIPAAMR--AECQVQVRTD-GKVSK-EESTLAV 487

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G+  A L + A+++F    +N  D   + +  + + LQ    + Y      H+  Y+K 
Sbjct: 488 NGATEATLYISAATNF----VNYHDVSANESKRAATYLQKATRIPYEQALKSHIASYRKQ 543

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           + RVS+ L  +             +  + +  RV+ F    D ++  L+FQ+GRYLLISS
Sbjct: 544 YDRVSLTLEST------------GVSALETPVRVQRFMEGNDMAMAALMFQYGRYLLISS 591

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+PG Q ANLQGIWN      WDS   VNIN EMNYW +   NLSE  EPLFD +T L++
Sbjct: 592 SQPGGQPANLQGIWNHSPYAPWDSKYTVNINAEMNYWPAEVTNLSETHEPLFDMVTDLAV 651

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            GS+TA+V Y A GWV HH TDIW ++        + +WP GGAWL  HLW+HY +T D+
Sbjct: 652 TGSETAKVLYDAKGWVAHHNTDIW-RACGPVDAAYFGMWPNGGAWLAQHLWQHYLFTGDK 710

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           +FL K  YPLL+G A F L  L+E H  Y  + T PS SPEH +    G    ++   TM
Sbjct: 711 EFLRKY-YPLLKGTADFYLSHLVE-HPKYKWMVTVPSMSPEHGY---RGSQTTITAGCTM 765

Query: 543 DMAIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           D  I  +     + A+ +L   ++ ED+L + +L  LP   P +I +   + EW+
Sbjct: 766 DNQIAFDALYNTLQASRILGGDKQYEDSL-QVMLSKLP---PMQIGKHNQLQEWL 816


>gi|224538426|ref|ZP_03678965.1| hypothetical protein BACCELL_03320 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224519961|gb|EEF89066.1| hypothetical protein BACCELL_03320 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 828

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 209/603 (34%), Positives = 321/603 (53%), Gaps = 48/603 (7%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           M N    +  +P+ + ++ PA+++ +A+P+GNGRLGAMV+G    E ++LNE+T+  G P
Sbjct: 22  MGNVNVYAQKHPI-LWYDKPAQYWEEALPLGNGRLGAMVYGNPVHEEIQLNEETVSAGSP 80

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKL-----FGHPADVYQLLGDIELEFDD 114
            +  NP+A  ALS +R L+  G+Y EA A A  K+     FG P   YQ +G + L+F  
Sbjct: 81  YNNYNPEAKNALSTIRQLIFDGKYPEAQALAETKILSKNGFGMP---YQTVGSLRLDFQG 137

Query: 115 SHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
               Y+   +RRELDL  A     YSV  V++ RE F+S  DQ+I+ +++ S++G L+F+
Sbjct: 138 QE-NYS--NFRRELDLERAVTTTTYSVDGVKYKREVFASLTDQLIIIRLTASQAGKLTFS 194

Query: 175 VSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
            +L           G N++IMEG   G    P A        + F A +E+   D +G  
Sbjct: 195 AALTCPQKVDVSTLGKNRLIMEGTTKGDGFTPGA--------VCFRADVEL---DLQGGK 243

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
           S   D  L +  +  A + +  +++F    IN  D   +P   +   L++ R   Y+   
Sbjct: 244 SVANDTLLSITNATSATIYIAMATNF----INYKDISGNPVERNKVYLKNARK-PYTKAL 298

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H++ YQK + RV++ L  +P+               P+  RVK F T  DP LV L F
Sbjct: 299 QAHVNMYQKYYRRVALDLGYTPQA------------DKPTDIRVKEFATSNDPHLVALYF 346

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           Q+GRYLLIS S+PG Q ANLQGIWN   +P W      NIN EMNYW +   NL E  EP
Sbjct: 347 QYGRYLLISCSQPGGQPANLQGIWNHKTNPAWRCRYTTNINAEMNYWPAEVTNLREMHEP 406

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTH 473
               +  L  NG + A+  Y   GW++HH TD+W  + A DR       WP   AWLC H
Sbjct: 407 FLQMIRELYENGQEAAREMYGCRGWMLHHNTDLWRMNGAVDRPYC--GPWPTCNAWLCQH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGK 532
           LW+ Y Y+ D+++L    YP+++  + F +D+L++  + GY+   PS SPE+      GK
Sbjct: 465 LWDRYLYSGDKEYLNS-IYPIMKSASEFFVDFLVKDPNTGYMVVTPSNSPENSPKLWKGK 523

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
               +   TMD  ++ ++FS   +AA++L +++    + +L    RL P ++ + G + E
Sbjct: 524 SNLFA-GVTMDNQLVFDLFSNTNAAAQILNRDKQ-FCDTILSLKKRLPPMQVGQYGQLQE 581

Query: 593 WVQ 595
           W +
Sbjct: 582 WFE 584


>gi|406858935|gb|EKD12015.1| alpha-l-fucosidase [Marssonina brunnea f. sp. 'multigermtubi'
           MB_m1]
          Length = 835

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 218/600 (36%), Positives = 320/600 (53%), Gaps = 52/600 (8%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
           S + PL++  + PA  F+D+  IGNGR+GA + G    E L LNED+LW+G P D  NPD
Sbjct: 33  SASVPLRLWDSAPAGGFSDSYLIGNGRIGAALSGSAQKEYLGLNEDSLWSGGPIDRVNPD 92

Query: 68  APKALSDVRSLVDSGQYAEA-TAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETY 124
           A   + +++S V  G++ E  T AS    G+P  A  Y  LG+++L  +          Y
Sbjct: 93  ASAYMGNIQSSVSKGRFQEGQTTASFAYVGNPVSARHYDYLGELQLVMNHGT---KVTGY 149

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV------SLD 178
            R LDL  +TA ++YSV  V F RE+ +SNP  V+  KIS  ++G++ FN+      +L+
Sbjct: 150 ERWLDLQDSTAGLQYSVDGVTFQREYLASNPAGVMAIKISADKAGAVDFNILLRRGGTLN 209

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
             +D +S   GN+ I+M G   G             K + F+A   +  S  R  +  + 
Sbjct: 210 RWVD-YSVKVGNDTIVMGGGSGGV------------KPVVFAAGASVVASGGR--VYTIG 254

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D  +KVEG+D A +   A + F          K+DP +   S L+S+++ SY  +   H+
Sbjct: 255 DY-VKVEGADEAWIYFSAWTDF---------RKEDPRAAVESDLKSVKSQSYKSIREAHV 304

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ L  RVSI L  S      D  S           RV       DP +V L FQFGR
Sbjct: 305 EDYQSLASRVSIDLGTSSAKQKKDATSA----------RVAGLGAAFDPEIVALAFQFGR 354

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           Y+LISS+R GT    LQGIWN+D +P W S   +NIN +MN+W +L  NL+E  EPLF  
Sbjct: 355 YMLISSARQGTLAPTLQGIWNKDPNPQWGSRYTININTQMNHWLALVTNLAELNEPLFSL 414

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +  +   G +TAQ  Y A+G V HH TDIW  S+      +   WP G  WL TH+ + Y
Sbjct: 415 IENVRQTGLQTAQKMYGAAGAVCHHNTDIWGDSAPVDNWALSTWWPTGLVWLVTHIHDTY 474

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD--GKLACV 536
            +T +   LEK+ Y  L   A+F LD  I  + G++ TNPS SPE+ +  P+  G  A +
Sbjct: 475 LFTGNATLLEKK-YDTLVDAAAFFLD-FITPYKGWMVTNPSVSPENVYRIPNGGGGTAAM 532

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED-GSIMEWVQ 595
           +   TMD +++R +FS ++ A  VL K + AL +++  +   L P  +++  G I EW++
Sbjct: 533 TAGPTMDNSLLRALFSIVLEAQSVLGKKDTALADRLEAARASLPPLMVSKRYGGIQEWIE 592


>gi|424876717|ref|ZP_18300376.1| hypothetical protein Rleg5DRAFT_1090 [Rhizobium leguminosarum bv.
           viciae WSM1455]
 gi|393164320|gb|EJC64373.1| hypothetical protein Rleg5DRAFT_1090 [Rhizobium leguminosarum bv.
           viciae WSM1455]
          Length = 747

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 210/584 (35%), Positives = 308/584 (52%), Gaps = 42/584 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA+ +TDA+P+GNGRLGAMV+G    E L++NE T W G P    NPDA   L  VR
Sbjct: 8   YDTPAQLWTDALPLGNGRLGAMVFGDPLREHLQINEATFWAGGPYQPVNPDAFGHLETVR 67

Query: 77  SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            L+   +YA+A A + K L   P     YQ +GD+ LEFD    + +   YRR LDL+TA
Sbjct: 68  QLIFDCRYADAEALAEKHLMARPIKQMSYQPIGDLHLEFDH---RESVSGYRRALDLDTA 124

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y+   + + RE F S  D V+V ++S     ++S  +S+DS       +   +Q+
Sbjct: 125 IATSSYTADGIAYLREAFVSPVDGVLVLRLSADRKRAISCRISIDSPQQGEMRIGQGSQL 184

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
              G+  GK     A A      ++F+    +++ +  GT++A     L VEG+D  ++ 
Sbjct: 185 SFSGK--GKAESGIAAA------LRFT--FGVRMVNSGGTVNA-SRGALSVEGADEVLVF 233

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
           L A++SF        D    P  + +  L+   +  ++ L   H++++++LF   +I L 
Sbjct: 234 LDAATSFR----RYDDVLGHPERDIVDRLERAASRDFASLRDDHIEEHRRLFSAFAIDLG 289

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
            +P              ++P+ +R+  F   +DP+L  L  QFGRYL+I+SSRPGTQ AN
Sbjct: 290 STPAA------------SLPTDQRIAGFAGGDDPALAALYVQFGRYLMIASSRPGTQPAN 337

Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
           LQGIWN +  P W S    NINL+MNYW   P NL EC EPL +    L+  G   A ++
Sbjct: 338 LQGIWNAETDPPWGSKYTANINLQMNYWLPAPANLPECLEPLVEMAEELAETGKAMAHIH 397

Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
           Y A GWV+HH TD+W  +    G   W LWP GG WL   L +  +Y  D + + +R +P
Sbjct: 398 YRARGWVMHHNTDLWRATGPIDG-AKWGLWPTGGIWLMAQLLDACDYLDDAEAMRRRLFP 456

Query: 494 LLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
           +    A FL D L+   G D YL TNPS SPE+    P G   C      MD  +IR+ F
Sbjct: 457 VAREAAHFLFDVLVPFPGTD-YLVTNPSLSPENAH--PHGASICA--GPAMDSQLIRD-F 510

Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             ++    V    E  LV  + + LPRL P +I  +G + EW++
Sbjct: 511 LGLLRPLAVSIGGEPDLVADIDRVLPRLAPDRIGANGQLQEWLE 554


>gi|423221840|ref|ZP_17208310.1| hypothetical protein HMPREF1062_00496 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392645258|gb|EIY38987.1| hypothetical protein HMPREF1062_00496 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 1074

 Score =  339 bits (870), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 217/595 (36%), Positives = 319/595 (53%), Gaps = 50/595 (8%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
           TS  N +K+ +N PA+ + +A+P+GN RLGAMV+GG   E L+LNE+T W G P +  NP
Sbjct: 278 TSAQN-MKLWYNRPAQDWLEALPLGNSRLGAMVFGGTAREELQLNEETFWAGGPYNNNNP 336

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
              + L ++R L+  G+  EA     + +  P     Y  LG + L F   H   +E  Y
Sbjct: 337 KGLQVLPEIRRLIFEGKTLEAQKLIDENYMTPQHGMRYLTLGSLFLNFP-GHENPSE--Y 393

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
            R+L+L  ATA  +Y V  V+F R  F+S  D VI+ +I   ++ +L+F VS  S L + 
Sbjct: 394 YRDLNLENATATTRYEVDGVKFVRTAFASLSDDVIIVRIQADKAKALNFAVSYSSPLKSD 453

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
             V G   II    C G      A     P  ++  A  ++++  D G +S  E+  L V
Sbjct: 454 VQVKGGKLII---SCQG------AEHEGIPAAMR--AECQVQVRTD-GKVSK-EESTLAV 500

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G+  A L + A+++F    +N  D   + +  + + LQ    + Y      H+  Y+K 
Sbjct: 501 NGATEATLYISAATNF----VNYHDVSANESKRAATYLQKATRIPYEQALKSHIASYRKQ 556

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           + RV++ L  +             +  + +  RV+ F    D ++  L+FQ+GRYLLISS
Sbjct: 557 YDRVALTLEST------------GVSALETPVRVQRFMEGNDMAMAALMFQYGRYLLISS 604

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+PG Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE  EPLFD +T L++
Sbjct: 605 SQPGGQPANLQGIWNHSPYAPWDSKYTININAEMNYWPAEVTNLSETHEPLFDMVTDLAV 664

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            GS+TA+V Y A GWV HH TDIW ++        + +WP GGAWL  HLW+HY +T D+
Sbjct: 665 TGSETAKVLYDAKGWVAHHNTDIW-RACGPVDAAYFGMWPNGGAWLAQHLWQHYLFTGDK 723

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           +FL K+ YPLL+G A F L  L+E H  Y  + T PS SPEH +    G    ++   TM
Sbjct: 724 EFL-KKYYPLLKGTADFYLSHLVE-HPKYKWMVTVPSMSPEHGY---RGSQTTITAGCTM 778

Query: 543 DMAIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           D  I  +     + A+ +L   ++ ED+L + +L  LP   P +I +   + EW+
Sbjct: 779 DNQIAFDALYNTLQASRILGGDKQYEDSL-QVMLSKLP---PMQIGKHNQLQEWL 829


>gi|255532590|ref|YP_003092962.1| alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
 gi|255345574|gb|ACU04900.1| Alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
          Length = 825

 Score =  339 bits (869), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 206/598 (34%), Positives = 328/598 (54%), Gaps = 44/598 (7%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
            ++  T   LK+ ++ PA ++ +A+PIGNGRLGAMV+G    E L+LNE+T+W+G P   
Sbjct: 21  GQAKKTDGTLKLWYDRPAANWNEALPIGNGRLGAMVFGNPAKEQLQLNEETVWSGGPNSN 80

Query: 64  TNPDAPKALSDVRSLVDSGQYAEATA-ASVKLF--GHPADVYQLLGDIELEFDDSHLKYA 120
               +  A+  +R L+  G++ EA A A V++F   +   +YQ +G++ LEF+ +     
Sbjct: 81  VTAASGAAIPALRKLIFEGKFEEAQALADVEMFPKKNSGMIYQPVGNLFLEFEGTE---K 137

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              Y R+L++  A A V Y  G + + RE FSS  DQV++ +++  + G ++F   +D+ 
Sbjct: 138 ARNYYRDLNIEKALATVTYEAGGIRYKREIFSSFTDQVLIVRLTADKPGKITFRALMDTE 197

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
                 +   +++++ G          A+   +   I+F++  ++K+  + G  S L++ 
Sbjct: 198 QKGGLRME-KDRLLLSGLT--------ADHEGEQGKIRFAS--QVKVVAEGGKAS-LQNN 245

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
              V+ ++ A + +  +++F     N  D   D   ++ S L      +Y++    H+  
Sbjct: 246 AWIVKAANSATVYVSIATNFK----NYHDVSADAGLKAASFLDRAVKKNYAEALAAHIKF 301

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           YQ+ F+RV   +       +TD  ++      P+ ER+ +F    DP L  L FQFGRYL
Sbjct: 302 YQQYFNRVKFDIG------ITDAVNK------PTDERIAAFARSNDPHLTALYFQFGRYL 349

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSS+PG Q   LQGIWN+ +   WDS   +NIN EMNYW +   NLSE  +PLF  L 
Sbjct: 350 LISSSQPGNQPPTLQGIWNDKMLAPWDSKYTININTEMNYWPAEVTNLSELHDPLFKMLK 409

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYN 479
            LS+ G +TA++ Y A GWV HH TD+W  +   DR      LWPMGG WL  HLW+HY 
Sbjct: 410 DLSVTGRETAKLMYGAKGWVTHHNTDLWRITGPVDRPYA--GLWPMGGNWLSQHLWDHYM 467

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSY 538
           +T D+ FL K  YP+L+G + F LD L E     +L  +PS SPE+ ++   GK   ++ 
Sbjct: 468 FTGDKQFL-KEYYPVLKGASEFYLDVLQEEPTHKWLVVSPSNSPENTYVP--GKRVSIAA 524

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
            +TMD  ++ ++F+    AAE+L    DA    +LK+ L RL P +I +   + EW+ 
Sbjct: 525 GTTMDNQLLFDLFTRTGKAAELL--GMDAEFRGLLKTALGRLAPMQIGKYSQLQEWMH 580


>gi|431796298|ref|YP_007223202.1| hypothetical protein Echvi_0919 [Echinicola vietnamensis DSM 17526]
 gi|430787063|gb|AGA77192.1| hypothetical protein Echvi_0919 [Echinicola vietnamensis DSM 17526]
          Length = 813

 Score =  339 bits (869), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 207/590 (35%), Positives = 331/590 (56%), Gaps = 49/590 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA ++ +A+P+GNGRLGAMV+G    E L+LNE+T+W G P    +  A +A+ 
Sbjct: 26  KLWYDQPASNWNEALPLGNGRLGAMVFGVPAMERLQLNEETIWAGSPNSNAHTSAKEAIP 85

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            VR L+  G Y  A   A+ K+     D   Y+  G++ + F   H  Y  + Y R+L+L
Sbjct: 86  YVRRLIFDGDYQAAQELANEKIMSQTNDGMPYETFGNVYISFP-GHQDY--QDYYRDLNL 142

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             AT+ V+YSV  V++TRE  S+  D VI+ K++    GS++ NV + S  DN       
Sbjct: 143 EDATSTVRYSVDGVQYTREVLSAFEDDVIMVKLTADRPGSITCNVHMTSPHDNAEARVRG 202

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           +Q+ + G          +  +D  +G ++F     IK ++  G + A++D  + V+G+D 
Sbjct: 203 DQLTLSG---------VSQTHDHQRGGVKFQG--RIKATNKGGQL-AVKDGLISVDGADE 250

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
             L +  +++F     N +D   +   ++ + L +     ++ +   H++ YQ+ + RV+
Sbjct: 251 VTLYISIATNFK----NYNDLSVEYERKAEALLDAALQKDFAAIKREHIEHYQQFYDRVA 306

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           I       D+ +   +E+     P+ +R++ F    DP L  L FQF RYLLIS S+PG 
Sbjct: 307 I-------DLGSTEAAEK-----PTDQRIQQFSEVHDPQLAALYFQFARYLLISCSQPGG 354

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN+ L P W+S   VNIN EMNYW +   NLSE  EP    +  +S  G +T
Sbjct: 355 QPANLQGIWNDMLFPPWESKYTVNINAEMNYWPAELTNLSEMHEPFLQMVREVSETGQQT 414

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
           A++ Y A GWV+HH TDIW  +    G + +A   +WP GGAWL  HLWE Y Y+ D DF
Sbjct: 415 AKMMYGARGWVLHHNTDIWRIT----GPIDYAASGMWPSGGAWLSQHLWERYLYSGDEDF 470

Query: 487 LEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L K AYP+++G A F LD LIE   +G+L  +PS+SPE+  +      A ++   TMD  
Sbjct: 471 L-KEAYPIMKGAAQFFLDVLIEEPVNGWLVVSPSSSPENSHVHG----ATIAAGVTMDNQ 525

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           ++ ++FS +I ++E+L +++ A  + +  +  +L P ++ + G + EW+ 
Sbjct: 526 LLFDLFSNLIRSSEILGEDQ-AFADTLKATRSKLAPMQVGQYGQLQEWMH 574


>gi|384419108|ref|YP_005628468.1| hypothetical protein XOC_2159 [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353462021|gb|AEQ96300.1| hypothetical protein XOC_2159 [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 776

 Score =  338 bits (867), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 220/595 (36%), Positives = 312/595 (52%), Gaps = 47/595 (7%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
            +    L++ +  PA  + +A+P+GNGRLGAMVWGG     L+LNEDTL+ G P D T+P
Sbjct: 41  VAAAEALQLWYPQPANEWVEALPVGNGRLGAMVWGGSAHAHLQLNEDTLYAGGPYDATSP 100

Query: 67  DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
           DA  AL  VR+L+ +G YAE    A  KL   P     YQ LGD+ L+FD +        
Sbjct: 101 DALAALPQVRALIFAGGYAEVEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GMSD 157

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YRR+LDL+TA A   +  G     RE F S   Q +V ++S    G +S  V +DS   N
Sbjct: 158 YRRQLDLDTAVATTTFRSGGAVHRREVFVSAHAQCVVVRLSCDHPGGISLRVGIDSP-QN 216

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ--FSAILEIKISDDRGTISALEDKK 241
                    ++  GR            N    GI+      L +      G  S + D+ 
Sbjct: 217 GEVTAEQGGLLFSGR------------NGSCAGIEGKLRFALPVLPQVTGGKRSQVRDR- 263

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L+++ +D  VLLL A++S     ++  D   DP + + ++L+    L ++ L   HL D+
Sbjct: 264 LRIDAADEVVLLLSAATSDQ--RVDTVDG--DPLALTAASLRKAAKLEFAALLRAHLADH 319

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q+LF RV+I L  S  D V           + + ERV+ F   +DP+L  L  Q+GRYLL
Sbjct: 320 QRLFRRVAINLGSS--DAVQ----------LSTNERVQRFAEGDDPALAALYHQYGRYLL 367

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I SSRP TQ ANLQGIWN+ + P W+S   +NIN EMNYW S    L EC EPL      
Sbjct: 368 ICSSRPCTQPANLQGIWNDLMQPPWESKYTININAEMNYWPSEANALHECVEPLEAMWFD 427

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           L+  G+ TA+  Y A  WV+H+ TD+W ++    G   W LWPMGG W    LW  ++Y 
Sbjct: 428 LAKTGAHTAKAMYDAPAWVVHNNTDLWRQAGPIDG-AKWRLWPMGGVWQ-QQLWHRWDYG 485

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
            DR  L    YPL +G A F +  L+ +   G + TNPS SPE+++  P G   C     
Sbjct: 486 RDRADLST-IYPLFKGAAEFFVATLLRDPQTGAMVTNPSMSPENQY--PFGAALCA--VP 540

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           TMD  ++R++F+  I+  ++L  + D L +++     RL P +I + G + EW Q
Sbjct: 541 TMDAQLLRDLFAQCIAMRKLLCIDAD-LAQQLAALRERLPPNRIGKAGQLQEWQQ 594


>gi|423299820|ref|ZP_17277845.1| hypothetical protein HMPREF1057_00986 [Bacteroides finegoldii
           CL09T03C10]
 gi|408473629|gb|EKJ92151.1| hypothetical protein HMPREF1057_00986 [Bacteroides finegoldii
           CL09T03C10]
          Length = 824

 Score =  338 bits (866), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 226/600 (37%), Positives = 340/600 (56%), Gaps = 49/600 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E+ ++T   K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+T+W G P +  
Sbjct: 25  ETNASTQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNA 84

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NP+A + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 85  NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y++  Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ S  G ++FN  L 
Sbjct: 141 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTASRPGQITFNAQLT 198

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
           S    H  V  +++   EG C    +   ++ ++  KG ++F   L  +   +RG   A 
Sbjct: 199 S---PHQDVMISSE---EGNC--VTLSGVSSWHEGLKGKVEFQGRLTAR---NRGGKIAC 247

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            D  L VEG+D AV+ +  +++F+    N  D   +    +   L       + +    H
Sbjct: 248 ADGILSVEGADEAVIYVSIATNFN----NYLDITGNQIERAKDYLSKAMKHPFPEAKKNH 303

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
              Y++   RVS+ L ++           ENI T    +RV++F+   D  LV   FQFG
Sbjct: 304 TGFYRRYLTRVSLNLGKN---------RYENITT---DKRVENFKDTNDAHLVATYFQFG 351

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  EPLF 
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVSNLSELNEPLFR 411

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    +W  GGAWLC HLWE 
Sbjct: 412 LIKEVSETGKETARIMYGANGWVLHHNTDIWRVTGAI-DKAPSGMWSSGGAWLCRHLWER 470

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           Y YT D DFL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK A  
Sbjct: 471 YLYTGDTDFL-RSIYPILKESGRFFDEIMVKEPIHNWLVVCPSNSPENVHSGSNGK-ATT 528

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +   TMD  +I ++++AIISA+E+L+ ++D    +++ LK +P   P +I   G + EW+
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASEILDTDKDFATHLKQRLKEMP---PMQIGHWGQLQEWM 585


>gi|256426140|ref|YP_003126793.1| alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
 gi|256041048|gb|ACU64592.1| Alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
          Length = 811

 Score =  337 bits (865), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 212/592 (35%), Positives = 314/592 (53%), Gaps = 50/592 (8%)

Query: 13  LKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           LK+ +  PA + +T A+P+GNGR+  MV+G    E L+LNE T+WTG P    NP+A  A
Sbjct: 22  LKLWYKQPAGNVWTAALPVGNGRIAGMVFGNPAEELLQLNEATVWTGSPNRNENPEALAA 81

Query: 72  LSDVRSLVDSGQYAEAT-----AASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
           L  +R L+  G+  EA          KL G    +YQ +G + L F   H  Y  + Y R
Sbjct: 82  LPQIRQLIFDGKQKEAQDLAGEKIQTKLSG--GQMYQPVGTLHLAFP-GHEHY--DNYYR 136

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
           ELD+  A A   Y V  V++TRE F+S P Q I+ ++S S+ G+L F+  L +   N   
Sbjct: 137 ELDIEKAVATTTYMVDGVKYTREVFASVPAQTIIVRLSSSKPGTLGFSAYLTTPQKNAVV 196

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVE 245
                 + + G            +++  +G ++F+ I  +  S   G   A  D  + ++
Sbjct: 197 KASGKDLTVNGIT---------GSHEGVEGKVKFNGITRVIAS---GGSVATSDTAVTIK 244

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            ++ A+L +  ++++    +N  D   D   ++ + L +     Y+ L   H+  YQ+ F
Sbjct: 245 NANSALLFISMATNY----VNYQDLSADEVKKASAYLNAAVKQPYATLLKEHIAAYQRYF 300

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
           +RV I L  S  D+  D          P+  R+ +F    DP  + L FQFGRYLLIS S
Sbjct: 301 NRVKIDLGTS--DVAKD----------PTDVRLVNFSKTYDPQFISLYFQFGRYLLISCS 348

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           +PG Q A LQG+WN ++SP WDS   +NIN EMNYW +   NL E  EPL   +  LS+ 
Sbjct: 349 QPGGQPATLQGLWNSEMSPPWDSKYTININTEMNYWPAEKDNLPEMHEPLVQMVKELSVT 408

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G  TA++ Y A GWV HH TD+W + +    ++ + +W MGGAWL  HLW+ Y Y  DR 
Sbjct: 409 GQGTARILYGARGWVAHHNTDLW-RITGPVDRIFYGIWSMGGAWLAQHLWDRYLYNGDRR 467

Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSS--TM 542
           +L    YP ++G A F +D L+E     YL  NP TSPE+   AP  +   VS+ +  TM
Sbjct: 468 YLAD-VYPAIKGAALFFVDDLVEDPKRKYLVVNPGTSPEN---APSTR-PNVSFDAGCTM 522

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           D  I+ +  SA I+AAE+L K+  ALV+       RL P ++ + G + EW+
Sbjct: 523 DNQIVFDALSAAINAAEILGKDA-ALVDTFKTVRRRLPPMQVGQYGQLQEWI 573


>gi|300726579|ref|ZP_07060021.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
 gi|299776147|gb|EFI72715.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
          Length = 803

 Score =  337 bits (865), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 209/593 (35%), Positives = 325/593 (54%), Gaps = 48/593 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +N PA+ +TDA+P+GNGRLGAMV+G   +E ++LNE+T+WTG P    N  A  A+ 
Sbjct: 6   KLWYNEPAQVWTDALPLGNGRLGAMVYGIPSTEHIQLNEETIWTGQPNHNANKKALNAIP 65

Query: 74  DVRSLVDSGQY--AEATAASVKLFG-HPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            ++ L+  G+Y  A+  A    + G +    YQ  GD+ +   ++ L+Y    YRREL L
Sbjct: 66  KIQQLLFEGRYHTADKMANDNVMSGTNWGMAYQTFGDVYITTPNA-LRYT--NYRRELSL 122

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           ++A A   Y+V  V + RE  +S    VI   ++ S+ G L+F     +  +     +  
Sbjct: 123 DSAIAVTTYTVDGVTYRREVITSFDSNVITIHLTASKPGKLTFGAHYSTPQEEILIRSEK 182

Query: 191 NQIIMEG------RCPGK-RIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
           N+ I+EG       C GK R   +        G++  A              +  D ++ 
Sbjct: 183 NEAILEGVSGKLEGCKGKVRFMGRMLCETMKNGVRQEA--------------SSRDGEIT 228

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           VE +D A + +  +++F    +N  D   D  ++S   L+     +Y      H+  +Q 
Sbjct: 229 VENADEATIYISIATNF----VNYKDISGDEVAKSEQILRQAIAKNYEQSKKTHIAKFQS 284

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
             +RVS+ L    KD+  +          P+ +R+ +F   +D  L+   F FGRYLLI 
Sbjct: 285 FMNRVSLSLG---KDLYQNE---------PTDQRIINFAHRDDNGLIATYFNFGRYLLIC 332

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SS+PG Q ANLQGIWN  + P+WDS    NINLEMNYW S   NLS+  EPLF  +  +S
Sbjct: 333 SSQPGGQAANLQGIWNHRVWPSWDSKYTTNINLEMNYWPSEIANLSDLNEPLFRLIREVS 392

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
            +GS +A++ Y   GWV+HH TDIW + +         +W +GGAWLC HLW+HY YT D
Sbjct: 393 ESGSISAKMMYGKDGWVLHHNTDIW-RVTGGIDHASSGMWMLGGAWLCAHLWQHYLYTGD 451

Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           ++FL K+AYPL++G A FL + LI E   G+L  +PS SPE+   + DGK+A ++Y +TM
Sbjct: 452 KEFL-KKAYPLMKGAAIFLDEMLIPEPEHGWLVISPSVSPENYHPSKDGKIA-ITYGTTM 509

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           D  ++ E+F+++  A+++L   +D L     + L ++ P +I + G + EW++
Sbjct: 510 DNTLLHELFNSVSVASQILGV-DDTLKSYYAERLKKMAPMQIGKWGQLQEWLK 561


>gi|116248791|ref|YP_764632.1| hypothetical protein pRL120117 [Rhizobium leguminosarum bv. viciae
           3841]
 gi|115253441|emb|CAK11831.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
           3841]
          Length = 747

 Score =  337 bits (865), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 210/584 (35%), Positives = 308/584 (52%), Gaps = 42/584 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA+ +TDA+P+GNGRLGAMV+G    E L++NE T W G P    NPDA   L  VR
Sbjct: 8   YDTPAQLWTDALPLGNGRLGAMVFGDPLREHLQINEATFWAGGPYQPVNPDAFGHLETVR 67

Query: 77  SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            L+   +YA+A A + K L   P     YQ +GD+ LEFD    + +   YRR LDL+TA
Sbjct: 68  QLIFDCRYADAEALAEKHLMARPIKQMSYQPIGDLHLEFDH---RESVSGYRRALDLDTA 124

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y+   + + RE F S  D V+V ++S     +++  +S+DS       +   +Q+
Sbjct: 125 IATSSYTADGIAYLREAFVSPVDGVLVLRLSADRKRAINCRISIDSPQQGEMRIGQGSQL 184

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
              G+  GK     A A      ++F+    +++ +  GT++A     L VEG+D  ++ 
Sbjct: 185 SFSGK--GKAESGIAAA------LRFA--FGVRLINSGGTVNA-SGGALSVEGADEVLVF 233

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
           L A++SF        D    P  + +  L+S  +  +  L   H++++++LF   +I L 
Sbjct: 234 LDAATSFR----RYDDVLGHPERDIVDRLESAVSRDFVSLRDDHIEEHRRLFSAFAIDLR 289

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
            +P              ++P+ +R+  F   +DP+L  L  QFGRYL+I+SSRPGTQ AN
Sbjct: 290 STPAA------------SLPTDQRIAGFAGGDDPALAALYVQFGRYLMIASSRPGTQPAN 337

Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
           LQGIWN +  P W S    NINL+MNYW   P NL EC EPL +    L+  G   A V+
Sbjct: 338 LQGIWNAETDPPWGSKYTANINLQMNYWLPAPANLPECLEPLVEMAEELAETGKAMAHVH 397

Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
           Y A GWV+HH TD+W  +    G   W LWP GG WL   L +  +Y  D + + +R +P
Sbjct: 398 YRARGWVMHHNTDLWRATGPIDG-AKWGLWPTGGIWLMAQLLDACDYLDDAEAMRRRLFP 456

Query: 494 LLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
           +    A FL D L+   G D +L TNPS SPE+    P G   C      MD  +IR+ F
Sbjct: 457 IAREAAHFLFDVLVPFPGTD-HLVTNPSLSPENAH--PHGASICA--GPAMDSQLIRD-F 510

Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             ++    V    E  LV  + + LPRL P +I  +G + EW++
Sbjct: 511 LGLLRPLAVSIGGEPDLVADIDRVLPRLAPDRIGANGQLQEWLE 554


>gi|427383711|ref|ZP_18880431.1| hypothetical protein HMPREF9447_01464 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728416|gb|EKU91274.1| hypothetical protein HMPREF9447_01464 [Bacteroides oleiciplenus YIT
           12058]
          Length = 1074

 Score =  337 bits (864), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 211/595 (35%), Positives = 319/595 (53%), Gaps = 50/595 (8%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
           TS  N +K+ +  PA+ + +A+P+GN RLGAMV+GG   E L+LNE+T W G P +  NP
Sbjct: 278 TSAQN-MKLWYGRPAQDWLEALPLGNSRLGAMVFGGTAREELQLNEETFWAGGPYNNNNP 336

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
              + L ++R L+  G+  EA     + +  P     Y  +G + L F   H   +E  Y
Sbjct: 337 RGLQVLPEIRRLIFEGKTLEAQKLIDENYMTPQHGMRYLTMGSLFLNFP-GHENPSE--Y 393

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
            R+L+L  ATA ++Y V  V+F R  F+S  D VI+ +I   ++ +L+F +S +S L ++
Sbjct: 394 YRDLNLENATATIRYEVDGVKFVRTAFASLSDDVIIVRIQADKAKALNFAISYNSPLKSN 453

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
             V G   II    C G      A     P  ++    +++K     G +S  E+  L V
Sbjct: 454 VQVKGGKLII---SCQG------AEHEGVPAAMRAECQVQVKTD---GKVSK-EESSLAV 500

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G+  A L + A+++F    +N  D   + +  + + LQ    + Y      H+  Y+K 
Sbjct: 501 NGATEATLYISAATNF----VNYHDVSANESKRAATYLQKATRIPYEQALKSHIASYRKQ 556

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           + RV++ L  +             +  + +  RV+ F    D ++  L+FQ+GRYLLISS
Sbjct: 557 YDRVALTLEST------------KVSALETPVRVQRFMEGNDMAMAALMFQYGRYLLISS 604

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+PG Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE  EPLFD +  L++
Sbjct: 605 SQPGGQPANLQGIWNHSPYAPWDSKYTININAEMNYWPAEVTNLSETHEPLFDMVADLAV 664

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            GS+TA+V Y A GWV HH TDIW ++        + +WP GGAWL  HLW+HY +T D+
Sbjct: 665 AGSETAKVLYDAKGWVAHHNTDIW-RACGPVDAAYFGMWPNGGAWLAQHLWQHYLFTGDK 723

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           +FL K+ YP+L+G A F L  L+E H  Y  + T PS SPEH +    G    ++   TM
Sbjct: 724 EFL-KKYYPVLKGTADFYLSHLVE-HPKYKWMVTVPSMSPEHGY---RGSQTTITAGCTM 778

Query: 543 DMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           D  I  +   + + A+ +L+ +   ED+L + +L  LP   P +I +   + EW+
Sbjct: 779 DNQIAFDALYSTLQASRILDGDKQYEDSL-QTMLDKLP---PMQIGKHNQLQEWL 829


>gi|406867099|gb|EKD20138.1| hypothetical protein MBM_02090 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 743

 Score =  337 bits (863), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 209/593 (35%), Positives = 311/593 (52%), Gaps = 58/593 (9%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  ++ ++PIGNGRLGAMV+G   +E L+LNED++W G P D    DA K L 
Sbjct: 4   RLHYTTPATEWSQSLPIGNGRLGAMVYGRTTTELLQLNEDSVWYGGPQDRIPRDALKNLP 63

Query: 74  DVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            +R L+ + Q++EA     K F    H    Y+ LG   LEF   H       Y+RELDL
Sbjct: 64  RLRELIRAEQHSEAEDLVRKAFFATPHSKRHYEPLGTFTLEF--GHEDSEVTDYKRELDL 121

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES--GSLSFNVSLDSLLDNHSYVN 188
            TA A V+Y    V++ R+ F+S PD VIV ++  SE    +L      +   + + Y++
Sbjct: 122 ETAIASVQYRYRGVDYKRKVFASGPDNVIVLQLKSSERVRATLRLTRVSEREYETNEYLD 181

Query: 189 G----NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
                N+  I+    PG R         +P       ++++K  +D GT+ A+    L +
Sbjct: 182 SVTASNDGSIVMRATPGGR-------GSNP----LCCVVKVKC-EDGGTLEAV-GGCLVI 228

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
           E S   ++++ A + F  P         DP S ++    + R L+   L  RH+++Y+ L
Sbjct: 229 E-SKATMIVISAQTKFRSP---------DPESAALE--DATRALTRGGLRGRHVENYRSL 276

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           + R+ +QL     ++ TD                K      DP LV L   +GRYLL++S
Sbjct: 277 YARMKLQLGSPASELSTD----------------KRLLRSVDPGLVALYHNYGRYLLVAS 320

Query: 365 SRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           SRPG +   A LQGIWN    P W S   +NIN +MNYW +  CNL+EC+ PLFD L  +
Sbjct: 321 SRPGPRALPATLQGIWNPSFQPAWGSRYTININTQMNYWPANLCNLAECEMPLFDLLERM 380

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           +I G +TAQ  Y   GW  HH TDIWA +      V   +WP+ GAWLC H+WE+Y +  
Sbjct: 381 AIRGKQTAQEMYGCRGWCAHHNTDIWADTDPQDRWVPATVWPLAGAWLCFHIWENYLFNG 440

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDG--YLETNPSTSPEHEFIAPDGKLACVSYSS 540
               LE R +P+L+G   F+LD+L+E      YL TNPS SPE+ F++ + +   +   S
Sbjct: 441 STTLLE-RMFPILKGSVQFILDFLVEDATSGQYLVTNPSLSPENTFLSANNREGVLCEGS 499

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           T+D+ II  +F A I A   L++ +D L+  V+ +  RL P  +   G + EW
Sbjct: 500 TIDIQIINALFGAFIDALGELDRTDD-LLPAVIHARDRLPPMAVGSLGQLQEW 551


>gi|380695292|ref|ZP_09860151.1| hypothetical protein BfaeM_15197 [Bacteroides faecis MAJ27]
          Length = 824

 Score =  337 bits (863), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 221/600 (36%), Positives = 341/600 (56%), Gaps = 49/600 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E   +    K+ ++ PA+ +T+A+P+GNGRLGAMV+G    E ++LNE+T+W G P +  
Sbjct: 25  EKKVSAQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGIEQIQLNEETIWAGRPNNNA 84

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NP+A + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 85  NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y++  Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 141 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 198

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
           S    H  V  N++   +G C    +   ++ ++  KG ++F   L ++   ++G   A 
Sbjct: 199 S---PHQDVMINSE---KGNC--VILSGVSSLHEGLKGKVEFQGRLTVR---NQGGKIAC 247

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            D  L VEG+D A + +  +++F+    N  D   + T  + S L       +++    H
Sbjct: 248 TDGVLSVEGADEATIYVSIATNFN----NYLDITGNQTERAKSYLSEALVHPFAEAKKNH 303

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
           ++ Y++   RVS+ L             E+    V + +RV++F+   D  LV   FQFG
Sbjct: 304 VEFYRRYLTRVSLDLG------------EDQYKNVTTDKRVENFKDTHDAHLVATYFQFG 351

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLS+  EPLF 
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSDLNEPLFR 411

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            +  +S +G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC HLWE 
Sbjct: 412 LIKEVSESGKETARIMYGANGWVLHHNTDIWRITGA-LDKAPSGMWPSGGAWLCRHLWER 470

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           Y YT D +FL +  YP+L+G   F  + ++ E    +L   PS SPE+     DGK A  
Sbjct: 471 YLYTGDTEFL-RSVYPILKGSGLFFDEIMVKEPVHNWLVVCPSNSPENVHSGSDGK-ATT 528

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +   TMD  +I ++++AIISA+ +L+ +++  A +E+ LK +    P ++   G + EW+
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASRILDTDKEFAAHLEQRLKEMA---PMQVGHWGQLQEWM 585


>gi|338209373|ref|YP_004646344.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
 gi|336308836|gb|AEI51937.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
          Length = 849

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 218/598 (36%), Positives = 335/598 (56%), Gaps = 47/598 (7%)

Query: 6   STSTTNPLKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           S+     LK+ +  P+ + + +A+PIGNG+LGAMV+G V  ET++LNE T+W+G P    
Sbjct: 47  SSQEVKSLKLWYTKPSGNTWENALPIGNGQLGAMVYGNVEKETIQLNEHTVWSGSPNRND 106

Query: 65  NPDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAE 121
           NP+A  AL ++R L+  G+  +A   + K+         ++Q +G++ L FD  H  Y +
Sbjct: 107 NPEALAALPEIRQLIFDGKQKDAERLANKVIITKKSHGQMFQPVGNLHLTFD-GHGNYTD 165

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             Y RELDL  A A+  Y+V  V++TRE  +S PD+VIV  ++  +  SLSF  S  +  
Sbjct: 166 --YYRELDLERAVAKTAYTVNGVKYTREILASFPDRVIVMHLTADKPNSLSFVASYATQH 223

Query: 182 DNHSYVN--GNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALE 238
              + +N   +N++ + G           + ++  KG + F  +  IK   + GT++A  
Sbjct: 224 KKRA-INPTASNELSLSGTT---------SDHEGVKGMVNFKGVTRIKT--EGGTVAA-N 270

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D  + V+G+  A L +  +++F+    +  D   D  + + + L      SY+ + T H+
Sbjct: 271 DSSIAVKGATTATLYVSIATNFN----SYKDISGDENARATAYLNKAYPKSYAAILTPHM 326

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
             YQK F+RV         D+ T   ++     +P+ ER+K+F+T  DP +V L +QFGR
Sbjct: 327 AAYQKYFNRVQF-------DLGTTEAAK-----LPTDERLKNFRTVNDPHMVTLYYQFGR 374

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSS+PG+Q ANLQGIWN  ++P WDS   +NIN +MNYW +   NLSE   P    
Sbjct: 375 YLLISSSQPGSQPANLQGIWNHRMNPPWDSKYTININAQMNYWPAEKTNLSELHAPFLKM 434

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +  LS  G +TA+V Y A GW+ HH TDIW  + A  G     +W  GG W   HLWEHY
Sbjct: 435 VKELSETGQETARVMYGAKGWMAHHNTDIWRATGAIDGAFW-GMWTGGGGWTAQHLWEHY 493

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACV 536
            Y+ D+ FL +  YP+L+G A+F  D+L+E H  Y  L  NP +SPE+   A  G  + +
Sbjct: 494 LYSGDKAFLTE-IYPILKGAAAFYADFLVE-HPKYHWLVINPGSSPENAPKAHAG--SSL 549

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
              +TMD  I+ + FS  I AAE+L+K + A V+ + +   +L P  + + G + EW+
Sbjct: 550 DAGTTMDNQIVFDAFSTAIRAAELLKK-DAAFVDTLRQLRNKLAPMHVGQHGQLQEWL 606


>gi|298385755|ref|ZP_06995313.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
 gi|298261896|gb|EFI04762.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
          Length = 824

 Score =  336 bits (861), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 221/600 (36%), Positives = 340/600 (56%), Gaps = 49/600 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E   +    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+T+W G P +  
Sbjct: 25  EKKVSVQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNA 84

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NP+A + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 85  NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y++  Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 141 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 198

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
           S    H  V  N++   EG C    +   ++ ++  KG ++F   L  +   ++G   A 
Sbjct: 199 S---PHQDVMINSE---EGNC--VTLSGVSSLHEGLKGKVEFQGRLTAR---NQGGKIAC 247

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            D  L VEG+D A + +  +++F+    N  D   + T  + S L       +++    H
Sbjct: 248 TDGVLSVEGADEATIYVSIATNFN----NYLDITGNQTERAKSYLSEALVRPFAEAKKNH 303

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
           ++ Y++   RVS+ L             E+    V + +RV++F+   D  LV   FQFG
Sbjct: 304 VEFYRRYLTRVSLDLG------------EDQYKNVTTDKRVENFKDTHDAHLVATYFQFG 351

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLS+  EPLF 
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSDLNEPLFR 411

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            +  +S +G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC HLWE 
Sbjct: 412 LIKEVSESGKETAKIMYGANGWVLHHNTDIWRITGA-LDKAPSGMWPSGGAWLCRHLWER 470

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           Y YT D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     DGK A  
Sbjct: 471 YLYTGDTEFL-RSVYPILKESGLFFDEIMVKEPVHNWLVVCPSNSPENVHSGSDGK-ATT 528

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +   TMD  +I ++++AIISA+ +L+ +++  A +E+ LK +    P ++   G + EW+
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASRILDTDKEFAAHLEQRLKEMA---PMQVGHWGQLQEWM 585


>gi|399025527|ref|ZP_10727523.1| hypothetical protein PMI13_03496 [Chryseobacterium sp. CF314]
 gi|398077904|gb|EJL68851.1| hypothetical protein PMI13_03496 [Chryseobacterium sp. CF314]
          Length = 820

 Score =  336 bits (861), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 209/595 (35%), Positives = 316/595 (53%), Gaps = 57/595 (9%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ + +A+PIGNGRL AMV+G    E L+LNE T W+G P    NPD PK L 
Sbjct: 27  KLWYDKPARQWVEALPIGNGRLAAMVFGDPFKEKLQLNESTFWSGGPSRNDNPDGPKVLD 86

Query: 74  DVRSLVDSGQYAEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            +R  + +  Y +A   + K           +Q +GD+ LEF++       E Y RELD+
Sbjct: 87  SIRYYLFNENYKKAEILANKGLTAKTLHGSAFQNIGDLNLEFNNPG---DIENYYRELDI 143

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A     +S   + + RE F+S PD VI+ K+S  +  +L+FN   +S L  +      
Sbjct: 144 EKALITTTFSSNGIHYKREAFASVPDNVIIIKLSSDKKNALNFNAGFNSELKKNVKTIDA 203

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           N + M+G          ++  D  +G ++F+ + +      +G  +++ D ++ V  +D 
Sbjct: 204 NTLQMDGI---------SSTLDGVQGQVKFNVLAKFIT---KGGTNSVSDNRISVANADE 251

Query: 250 AVLLLVASSSF-DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
            ++L+  +++F D   +N      D  S+S   +      +++ L+  HL+ YQK F R+
Sbjct: 252 VLILISIATNFTDYKTLN-----TDEVSKSKKYISQSETKNFNTLFKNHLNAYQKYFKRI 306

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
              L  SP                P+  RVK+F +  DP L+ L +QFGRYLLISSS+PG
Sbjct: 307 DFSLGTSPAA------------QFPTDLRVKNFASGYDPELISLYYQFGRYLLISSSQPG 354

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
            Q ANLQGIWN    P WDS   +NIN EMNYW +   NL+E  EPL   +  LS+ G +
Sbjct: 355 GQPANLQGIWNNSNKPAWDSKYTININTEMNYWPAEKTNLAEMHEPLVQLVKDLSVTGVE 414

Query: 429 TAQVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           TA++ Y + GWV HH TDIW  +     A+ G+     WPMGGAWL  HLWE Y Y  D+
Sbjct: 415 TARIMYKSRGWVAHHNTDIWRITGVVDFANAGQ-----WPMGGAWLSQHLWEKYLYGGDK 469

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           ++L K  Y +L+  A F  D+LIE   H  +L  +PS SPE+  I    + + +S  +TM
Sbjct: 470 NYL-KSIYTVLKSAALFYEDFLIEEPVHQ-WLVVSPSISPEN--IPKRNRGSALSAGNTM 525

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALV--EKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           D  +I ++FS    AA++L  + D +     ++  LP   P KI   G + EW++
Sbjct: 526 DNQLIFDLFSKTKKAAQILNVDSDKIPVWNTIISKLP---PMKIGRYGQLQEWME 577


>gi|189467819|ref|ZP_03016604.1| hypothetical protein BACINT_04211 [Bacteroides intestinalis DSM
           17393]
 gi|189436083|gb|EDV05068.1| hypothetical protein BACINT_04211 [Bacteroides intestinalis DSM
           17393]
          Length = 1061

 Score =  335 bits (859), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 212/593 (35%), Positives = 314/593 (52%), Gaps = 46/593 (7%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
           TS  N +K+ +  PA+ + +A+P+GN RLGAMV+GG   E L+LNE+T W G P +  NP
Sbjct: 265 TSAQN-MKLWYARPAQDWLEALPLGNSRLGAMVFGGTAREELQLNEETFWAGGPYNNNNP 323

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
              + L ++R L+  G+  EA     + +  P     Y  +G + L F   H   +E  Y
Sbjct: 324 KGLQVLPEIRRLIFEGKTLEAQKLIDENYMTPQHGMRYLTMGSLFLNFP-GHENPSE--Y 380

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
            R+L+L  ATA  +Y V  V+F R  F+S  D VI+ +I   ++ +L+F VS  S L + 
Sbjct: 381 YRDLNLENATATTRYEVDGVKFVRTAFASLSDDVIIVRIQADKAKALNFAVSYSSPLKSD 440

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
             V G   II    C G      A     P  ++    +++K     G +S  E   L V
Sbjct: 441 VQVKGGKLII---SCQG------AEHEGIPAAMRAECQVQVKTD---GKVSKAESA-LAV 487

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G+    L + A+++F    +N  D   + +  + + LQ    + Y      H+  Y+K 
Sbjct: 488 NGATEVTLYISAATNF----VNYHDVSANESKRAATYLQKATRIPYEQALKSHIASYRKQ 543

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           + RV++ L  +             +  + +  RV+ F    D ++  L+FQ+GRYLLISS
Sbjct: 544 YDRVALTLEST------------GVSALETPVRVQRFIEGNDMAMAALMFQYGRYLLISS 591

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+PG Q ANLQGIWN  L   WDS   +NIN EMNYW +   NLSE  EPLFD +T L++
Sbjct: 592 SQPGGQPANLQGIWNHSLYAPWDSKYTININAEMNYWPAEVTNLSETHEPLFDMVTDLAV 651

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            GS+TA+V Y A GWV HH TDIW ++        + +WP GGAW+  HLW+HY +T D+
Sbjct: 652 TGSETAKVLYDAKGWVAHHNTDIW-RACGPVDAASFGMWPNGGAWVAQHLWQHYLFTGDK 710

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           +FL K+ YP+L+G A F L  L+E H  Y  + T PS SPEH +    G    ++   TM
Sbjct: 711 EFL-KKYYPILKGTADFYLSHLVE-HPKYKWMVTVPSMSPEHGY---RGSQTTITAGCTM 765

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWV 594
           D  I  +   + + A+ +L    D L E  L++ L +L P +I +   + EW+
Sbjct: 766 DNQIAFDALYSTLLASRIL--GGDKLYEDSLQAMLDKLPPMQIGKHNQLQEWL 816


>gi|224535714|ref|ZP_03676253.1| hypothetical protein BACCELL_00578 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522669|gb|EEF91774.1| hypothetical protein BACCELL_00578 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 822

 Score =  335 bits (858), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 217/598 (36%), Positives = 333/598 (55%), Gaps = 45/598 (7%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           ES  +    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+T+W G P +  
Sbjct: 23  ESRLSAQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPATEQIQLNEETIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NP+A + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 83  NPNALEYIPRVRDLVFAGKYLEAQTLATEKVMAKSNSGMP---YQSFGDLRIAFP-GHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y    Y REL L++A   V+Y V  V++ RE  +S  DQVI+ +++ +  G ++FN  L 
Sbjct: 139 YT--NYYRELSLDSARTLVRYEVDGVQYRRETITSFTDQVIMVRLTANRPGRITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
           S    H  V   ++   EG C    +   ++ ++  KG ++F   L  + +  R T +  
Sbjct: 197 S---PHQDVVITSE---EGNC--VTLSGVSSLHEGLKGKVEFQGRLTARNTGGRMTCA-- 246

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            D  L VEG+D A++ +  +++F+    N  D   +P   +   L      S+++    H
Sbjct: 247 -DGVLSVEGADEAIVYVSIATNFN----NYQDITGNPAERAKDYLVRAMTHSFTEARKNH 301

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
            D Y++   RVS+ L             +   + V + +RV++F+   D  LV   FQFG
Sbjct: 302 TDFYRRYLTRVSLDLG------------DNRYEHVTTDKRVENFKQTNDAHLVATYFQFG 349

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  EPLF 
Sbjct: 350 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFR 409

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    LWP GGAWLC HLWE 
Sbjct: 410 LIREVSETGKETARIMYGANGWVLHHNTDIWRITGA-VDKAPSGLWPSGGAWLCRHLWER 468

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           Y YT D +FL +  YP+L     F  + ++ E    +L   PS SPE+     +GK +  
Sbjct: 469 YLYTGDTEFL-RSVYPILRESGRFFDEIMVKEPAHNWLVVCPSNSPENVHSGSNGK-STT 526

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +   T+D  +I ++++AII+A+++L+ +  A   ++ + L  + P ++   G + EW+
Sbjct: 527 AAGCTLDNQLIFDLWTAIIAASDILDTDR-AFAARLSQRLREMAPMQVGRWGQLQEWM 583


>gi|383122650|ref|ZP_09943342.1| hypothetical protein BSIG_0605 [Bacteroides sp. 1_1_6]
 gi|382984352|gb|EES70332.2| hypothetical protein BSIG_0605 [Bacteroides sp. 1_1_6]
          Length = 822

 Score =  334 bits (857), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 217/598 (36%), Positives = 336/598 (56%), Gaps = 45/598 (7%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E   +    K+ ++ PA+ +T+A+P+GNGRLGAMV+G    E ++LNE+T+W G P +  
Sbjct: 23  EKKVSAQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGIEQIQLNEETIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NP+A + +  VR L+ +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 83  NPNALEYIPKVRELIFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y++  Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 139 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
           S    H     N++   EG C    +   ++ ++  KG ++F   L  +   ++G   A 
Sbjct: 197 S---PHQDAMINSE---EGNC--VTLSGVSSLHEGLKGKVEFQGRLTAR---NQGGKIAC 245

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            D  L VEG+D A + +  +++F+    N  D   + T  + S L       +++    H
Sbjct: 246 TDGVLSVEGADEATIYVSIATNFN----NYLDITGNQTERAKSYLSEALVHPFAEAKKNH 301

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
           ++ Y++   RVS+ L             E+    V + +RV++F+   D  LV   FQFG
Sbjct: 302 VEFYRQYLTRVSLDLG------------EDQYKNVTTDKRVENFKDTHDAHLVATYFQFG 349

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLS+  EPLF 
Sbjct: 350 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSDLNEPLFR 409

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            +  +S +G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC HLWE 
Sbjct: 410 LIKEVSESGKETARIMYGANGWVLHHNTDIWRITGA-LDKAPSGMWPSGGAWLCRHLWER 468

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           Y YT D +FL +  YP+L+G   F  + ++ E    +L   PS SPE+     DGK A  
Sbjct: 469 YLYTGDTEFL-RSVYPILKGSGLFFDEIMVKEPVHNWLVVCPSNSPENVHSGNDGK-ATT 526

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +   TMD  +I ++++AIISA+ +L+ +++     + + L  + P ++   G + EW+
Sbjct: 527 AAGCTMDNQLIFDLWTAIISASRILDTDKE-FAAHLEQRLKEMAPMQVGHWGQLQEWM 583


>gi|408370425|ref|ZP_11168202.1| hypothetical protein I215_05947 [Galbibacter sp. ck-I2-15]
 gi|407744183|gb|EKF55753.1| hypothetical protein I215_05947 [Galbibacter sp. ck-I2-15]
          Length = 792

 Score =  334 bits (856), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 205/588 (34%), Positives = 311/588 (52%), Gaps = 39/588 (6%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +   A+ +  A+P+GNGRLGAM++G    E L+LNED++W G P    +    + L  +R
Sbjct: 35  YEQAAEDWMQALPVGNGRLGAMIFGNPDIEHLQLNEDSMWPGGPTLGDSKGTVEDLVALR 94

Query: 77  SLVDSGQYAEATAASVKLFGH--PADVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           +L+D G+  +A    V  F H      +Q  GD+ L+F     +  E T Y R LDL+ A
Sbjct: 95  ALIDQGKVHQADKFIVDKFSHLEVTRSHQTAGDLFLDFK----RKGEVTDYYRGLDLDKA 150

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS-----YVN 188
            A V Y V   +FT +  +SN D  ++  +  +    L F++ L   +D  +       +
Sbjct: 151 VATVSYKVDGDQFTEKIIASNVDDALIISLETTAEKGLDFDIQLSRPMDKSAPTVLVTTH 210

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
            ++++IM+G    +    +       +G++F     ++ + + GTI    D  L++ G  
Sbjct: 211 NSDELIMDGMVTQRGGVVENKPYPMQEGVEFQT--RLRATTEGGTIEP-SDGILELRGVR 267

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
            AV+ LV  +SF           +D  +++   L  + + S+ +L  RH  D+ + + RV
Sbjct: 268 KAVIYLVTKTSF---------YHQDFKAKAQENLNEVASKSFDELLRRHSQDFGEFYDRV 318

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
           +  L  S            ++D++P+ +R++ ++  + D  L   LF +GRYLLISSSR 
Sbjct: 319 NFSLGSS------------DLDSLPTDKRLQRYKDGQVDLDLQTKLFDYGRYLLISSSRE 366

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           GT  ANLQGIWN  +S  W++  H+NINL+MNYW S+  NLSE Q+PLFDF   L   G 
Sbjct: 367 GTNPANLQGIWNNHISAPWNADYHLNINLQMNYWPSMVANLSELQQPLFDFSDRLLQRGK 426

Query: 428 KTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
           KTA+  Y +  G V+HH TD+WA +     +  W  W  GG WL  H W+HY +T D DF
Sbjct: 427 KTAKEQYGIQRGAVMHHTTDLWAPAFMFSSQPYWGSWIHGGGWLAQHYWDHYRFTQDADF 486

Query: 487 LEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           LE RAYP ++  A F +DWL  +   G   + P TSPE+ ++A DGK A VS  + M   
Sbjct: 487 LENRAYPFMKEIALFYMDWLQKDATTGKWVSYPETSPENSYLAADGKPAAVSKGAAMGHQ 546

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           II EVF   +SAA+VL  N++   E   K         + EDG I+EW
Sbjct: 547 IIAEVFDNALSAAKVLNINDEFTQELKAKRADLTPGIVLGEDGRILEW 594


>gi|325105288|ref|YP_004274942.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
 gi|324974136|gb|ADY53120.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
          Length = 826

 Score =  334 bits (856), Expect = 9e-89,   Method: Compositional matrix adjust.
 Identities = 217/595 (36%), Positives = 314/595 (52%), Gaps = 60/595 (10%)

Query: 13  LKITFNGPA--KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           LK+ +N P     +  A+PIGNGRLGAMV+G    E L+LNE+T+W G P    N  A +
Sbjct: 39  LKLWYNKPVIDNVWEQALPIGNGRLGAMVYGIPQREQLQLNEETIWGGGPYRNDNNKALE 98

Query: 71  ALSDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYR 125
            L  V+ +V  GQ  EA     + F     G P   +Q  G + L F   H +Y  E Y 
Sbjct: 99  VLPLVQKMVFDGQTQEADKLINQSFFTQTHGMP---FQTAGSLILNFP-GHNQY--ENYY 152

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           RELDLN A  +  Y+V  V++TRE FSS  D VI+ +++ SE G L+F++   +    H+
Sbjct: 153 RELDLNKAVVKTTYTVNGVKYTREVFSSFTDDVIIMQLTSSEKGGLNFDIGYVNP-SQHT 211

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK--ISDDRGTISALEDKKLK 243
               +N +++EGR              D +GI+     +I   +S   G + A+ D K+ 
Sbjct: 212 VSKKDNSLVLEGR------------GSDHEGIEGKIRYQIHTLVSHADGHV-AVSDHKIN 258

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           +  +  A + +   ++F     N      +P   + S L   +  ++     +H   Y K
Sbjct: 259 ITEASSATIYISIGTNF----TNYKSVDANPAERAASKLAVAKKKNFKSALQQHSATYYK 314

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
            F R  + L         D   EE     P+  R+++F+  +DP+LV LL QFGRYLLIS
Sbjct: 315 QFGRFKLNLGSQ------DISKEE-----PTDVRIRNFKETQDPALVTLLTQFGRYLLIS 363

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SS+PG Q +NLQGIW   + P WDS   +NIN EMNYW +   NLS+  EPLF  L  LS
Sbjct: 364 SSQPGGQPSNLQGIWCNSMHPAWDSKYTININTEMNYWPAEVTNLSDTHEPLFQMLKDLS 423

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
            +G +TA+  Y A GWV HH TDIW  +S         +WP GGAWL  HLWEHY +T D
Sbjct: 424 ESGRETAKTLYGADGWVAHHNTDIWRVTSPIDFAAA-GMWPTGGAWLSQHLWEHYLFTGD 482

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           R FL + AYP+L+G A F L +LIE   + G++  +PS SPEH           ++   T
Sbjct: 483 RKFLAE-AYPILKGSADFFLSFLIEHPKYKGWMVVSPSISPEH---------GPITAGVT 532

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWVQ 595
           MD  ++ +V +  + A E+L K+ + +    LKS+  R+ P +I +   + EW++
Sbjct: 533 MDNQLVFDVLTRTVVAGEMLGKDTNYIAR--LKSMAKRIPPMQIGKYTQLQEWLE 585


>gi|160886122|ref|ZP_02067125.1| hypothetical protein BACOVA_04129 [Bacteroides ovatus ATCC 8483]
 gi|423286896|ref|ZP_17265747.1| hypothetical protein HMPREF1069_00790 [Bacteroides ovatus
           CL02T12C04]
 gi|156108935|gb|EDO10680.1| hypothetical protein BACOVA_04129 [Bacteroides ovatus ATCC 8483]
 gi|392674434|gb|EIY67882.1| hypothetical protein HMPREF1069_00790 [Bacteroides ovatus
           CL02T12C04]
          Length = 822

 Score =  334 bits (856), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 218/602 (36%), Positives = 335/602 (55%), Gaps = 53/602 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E  S+    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+++W G P +  
Sbjct: 23  EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NP+A + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 83  NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y+   Y REL L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196

Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
           S           +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G 
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
             A  D  L VE +D AV+ +  +++F+    N  D   + T  + + L       + + 
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H+D Y++   RVS+ L R            +    V + +RV++F+   D  LV   
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLF  +  +S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
           LWE Y YT D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            A  +   TMD  +I ++++AIISA+++L+ +++     + + L  + P ++   G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581

Query: 593 WV 594
           W+
Sbjct: 582 WM 583


>gi|325103196|ref|YP_004272850.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
 gi|324972044|gb|ADY51028.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
          Length = 821

 Score =  334 bits (856), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 203/600 (33%), Positives = 326/600 (54%), Gaps = 44/600 (7%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           + NA +      LK+ ++ P++++ +A+PIGNGRLGAMV+G    E ++LNE+T+W+G P
Sbjct: 15  VANANAQQHDKTLKLWYDAPSRNWNEALPIGNGRLGAMVFGNPDREKIQLNEETVWSGGP 74

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLF--GHPADVYQLLGDIELEFDDSHL 117
                 ++  A+  +R L+   ++ EA A A V +F   +   +YQ +GD+ + F   H 
Sbjct: 75  NTNITAESGAAIPKLRQLIFEEKFLEAQALADVDMFPKKNSGMIYQPVGDLLINFP-GHA 133

Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
           +   E Y R+L++  A   V Y +  V + RE F+S PDQVI+ +++  +   ++FN SL
Sbjct: 134 QV--EKYYRDLNIEKAVTTVSYRLNGVNYKRETFASFPDQVIIVRLTADKPNKITFNASL 191

Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
            S  ++   +  N ++I+ G          A+   +   I+F   ++ K+   +G  + L
Sbjct: 192 TSPQNSAQKIE-NGKLILTGLT--------ADHEGEKGQIKFETQVKTKV---KGGKAEL 239

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
                KV  ++ A++ +  +++F    +  +D   +   ++ + L      +Y D   +H
Sbjct: 240 TGSLWKVTNANEAIIYISMATNF----VKYNDISGNQHVKASNYLDKAFVKNYDDALKQH 295

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
           +  YQ+ F+RV         D+  +    +     P+  R+  F    DP L  L FQFG
Sbjct: 296 IAFYQQYFNRVKF-------DVGVNASVNK-----PTDRRIYEFAKSFDPHLAALYFQFG 343

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI SS+PG Q   LQGIWN+ +   WDS   +NIN EMNYW +   NLSE  +PLF+
Sbjct: 344 RYLLICSSQPGNQPPTLQGIWNDRMDAPWDSKYTININTEMNYWPAEVTNLSELHQPLFN 403

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWE 476
            L  L++ G  TAQ  Y A GWV HH TD+W  +   DR      LWPMGG WL  HLW+
Sbjct: 404 MLEDLAVTGQATAQSMYGAKGWVTHHNTDLWRITGPVDRPYA--GLWPMGGNWLSQHLWD 461

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLAC 535
           HY +T ++DFL K+ YP+L+G + F LD L E     +L  +PS SPE+ ++  +GK   
Sbjct: 462 HYQFTGNKDFL-KKYYPVLKGASDFYLDILQEEPKHKWLVVSPSNSPENTYV--EGKRVS 518

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWV 594
           ++  +TMD  ++ ++FS    AAE+L  ++D     +LK  + RL P +I +   + EW+
Sbjct: 519 IAAGTTMDNQLLFDLFSKTAKAAEILGIDKD--YSTLLKQKINRLAPMQIGKYSQLQEWM 576


>gi|332663343|ref|YP_004446131.1| alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
 gi|332332157|gb|AEE49258.1| Alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
          Length = 818

 Score =  333 bits (854), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 202/597 (33%), Positives = 316/597 (52%), Gaps = 43/597 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA+ + +A+P+GNGRLGAMV+G    E ++LNE+T WTG P         + L +++
Sbjct: 37  YKEPAQKWEEALPVGNGRLGAMVFGKSGEERIQLNEETYWTGGPYSTVVKGGHEVLPEIQ 96

Query: 77  SLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
             V  G+  +A      +  G+P +   YQ L ++ L F ++        Y+R LDL T 
Sbjct: 97  KYVFEGKMLKAHNLFGRRTMGYPVEQQKYQSLANLHLFFAEAE---PATVYKRWLDLETG 153

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
              V+Y V  V + R+ F S PDQV+V +++ SE+  +SF  +L  + +      G +  
Sbjct: 154 ITSVEYRVQEVRYRRDVFVSAPDQVVVLRLTASEAQKISFKANLRGVRNPAHSNYGTDYF 213

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKLKVEGSDWAV 251
            M+    G+        + D  G++     E  +K+  + GT+   +D  L VE +D   
Sbjct: 214 TMDPY--GQDGLMLKGKSSDYLGVEGKLRFEGQVKVVAEGGTVRT-DDVDLWVEKADAVT 270

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           +   A+++F    +N  D   DP +   +  +++   SY  +    + D+QK F R ++Q
Sbjct: 271 VYFTAATNF----VNYHDVSADPHARVEAVWKNMAGKSYPQIRDAAVKDHQKYFQRTTLQ 326

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           L  +    +            P+ ER+ + Q   DPSL  L + FGRYLLI SSRPGTQ 
Sbjct: 327 LEIAASSYL------------PTNERMLNIQKTADPSLAALCYNFGRYLLIGSSRPGTQP 374

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
           ANLQGIWN D++P WDS    NIN EMNYW +   NL EC EPL   +  L   GS+ A+
Sbjct: 375 ANLQGIWNNDMNPAWDSKYTTNINTEMNYWPAETGNLPECVEPLIQMVKELMDQGSQVAK 434

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
            +Y   GWV H  TD+W + +A      W  +  GGAWLCT LWEHY ++MD+++L K  
Sbjct: 435 EHYGCRGWVFHQNTDLW-RVAAPMDGPSWGTFTTGGAWLCTQLWEHYLFSMDKEYL-KEI 492

Query: 492 YPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKL------------ACVSY 538
           YP+++G   F +D+L+E  D  +L TNPSTSPE+   +P  +               + Y
Sbjct: 493 YPVMQGSVQFFMDFLVETPDKKWLVTNPSTSPENFPASPGNQPYFDEVTGMNLPGTTICY 552

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            S++DM I+ ++F   + A+ +L+ +++    KV  +  R  P +I +DG++ EW +
Sbjct: 553 GSSIDMQILSDLFGYYVQASALLQVDQE-FAAKVAAARKRFPPPQIGKDGALQEWAE 608


>gi|336428235|ref|ZP_08608219.1| hypothetical protein HMPREF0994_04225 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336006471|gb|EGN36505.1| hypothetical protein HMPREF0994_04225 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 721

 Score =  333 bits (854), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 211/597 (35%), Positives = 312/597 (52%), Gaps = 61/597 (10%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + +   A+ + +++PIGNG LGAM+ GG   E L LNE+++W+G   D  N  A   L
Sbjct: 4   MMLWYEKSAERWEESLPIGNGSLGAMILGGAEEEILGLNEESVWSGYYKDKNNAKAADCL 63

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAE-ETYRRELDL 130
            +VRSLV SG+  EA       + G   + Y  LG+++L+F     K  + E YRR+LDL
Sbjct: 64  EEVRSLVFSGKNKEAERLIQNNMLGEYNESYLPLGNLKLKFAYGIGKEGKAEGYRRQLDL 123

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSYVNG 189
             A A+V Y+   V + RE+F+S P + I   ++ ++   + F VS  S L    S  +G
Sbjct: 124 ENAVAQVSYTCNEVHYQREYFASYPAKAIFVLLT-ADKPVMDFTVSFISQLCLAVSAEDG 182

Query: 190 NNQIIMEGRCPGKRIPP-----KANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
             Q+   GRCP    P      + +     KG+Q +A  E ++    G +   E++ L V
Sbjct: 183 ALQVT--GRCPEHVDPSYLPEREGSVVQGTKGMQVNA--EFRVVSCDGQVRE-EEEMLHV 237

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
            G+   +L+L A      P + P                   N+ Y  L   H+ DY+ +
Sbjct: 238 SGASRCLLMLSAMR----PPVLPD------------------NMDYEALKAAHIQDYRSI 275

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           + +V + L    KD+ T    EE ++ +   E        ED  L  L FQ+GRYLLI+S
Sbjct: 276 YDKVELYLGEQ-KDLPT----EERLELLKKGE--------EDNGLYGLFFQYGRYLLIAS 322

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           SR G+  ANLQGIW+ +L   W S   +NIN +MNYW +L CNL EC EP   F+  +S 
Sbjct: 323 SREGSLPANLQGIWSWELRAPWSSNWTININTQMNYWHALSCNLEECLEPYIRFVERVSE 382

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSS----------ADRGKVVWALWPMGGAWLCTHL 474
            G KTA VNY   G V HH  D W  +S           + G V WA WPMGGAWL   +
Sbjct: 383 EGKKTAAVNYRCRGSVAHHNVDYWGNTSPVGVPQGEKAGEDGCVNWAFWPMGGAWLTQEI 442

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
           +  Y Y+ D ++L+  A P++   A FL DWL+E + G   T PSTSPE++F  PDG++ 
Sbjct: 443 FRAYEYSGDEEYLKNTAAPIIREAALFLNDWLVE-YQGEWVTCPSTSPENQFRLPDGQIT 501

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
            ++Y+S MDMAI++EVF+      E+L   +D L  ++ + +P L P +    G ++
Sbjct: 502 GLTYASAMDMAIVKEVFTHYCRICEIL-GAQDELYREICEKMPCLAPFRTGSFGQLL 557


>gi|425767412|gb|EKV05986.1| hypothetical protein PDIG_81830 [Penicillium digitatum PHI26]
 gi|425779681|gb|EKV17720.1| hypothetical protein PDIP_30190 [Penicillium digitatum Pd1]
          Length = 740

 Score =  333 bits (853), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 213/585 (36%), Positives = 300/585 (51%), Gaps = 45/585 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA+ +  A+P+GNGRLGAMV+G   +E L+LNED++W G P D    DA + L  +R
Sbjct: 6   YQQPAEDWNSALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQDRNPQDALEYLPRLR 65

Query: 77  SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
             + +  +AEA   A +  F +P     Y+ LG++ L  D  H       YRR LDL  A
Sbjct: 66  EAIRAENHAEAEKIAKLAFFANPISQRNYEPLGNLFL--DLGHNPSQVTGYRRSLDLARA 123

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG-NNQ 192
           TA V+Y    + F RE  +SNPD V+  ++  S      F V L  + D     N   + 
Sbjct: 124 TAHVRYEYQGICFEREVLASNPDDVLAIRLHSSSKAE--FVVRLTRMSDVEFETNEWLDD 181

Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
           I   G      + P    +      +   ++ ++     GTI+ +  K L V  +D  +L
Sbjct: 182 ISASGNSITMHVTPGGKNSS-----RVCCVVSVRCDGADGTITKI-GKNLVVNSTD-TLL 234

Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
           ++ A ++F           +D    +    +    LS  DL TRH  DYQ L+ R+ +QL
Sbjct: 235 VIAAQTTF---------RHEDIDQRTKQDAEIALGLSLKDLRTRHTADYQSLYDRMELQL 285

Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV- 371
                +I TD             +R+KS     DP L+ L   + RYLLIS SR G +  
Sbjct: 286 GPGSPEIPTD-------------QRLKS---SRDPGLIALYHNYSRYLLISCSRDGHKSL 329

Query: 372 -ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWN    P W S    NINL+MNYW +  CNLSEC+ PLFD L  +   G  TA
Sbjct: 330 PANLQGIWNPSFHPAWGSRFTTNINLQMNYWSANVCNLSECEFPLFDLLERMVEPGKTTA 389

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
           Q+ Y   GW  H  TDIWA ++     +  ++WP+GGAWLC H+W+H+ YT D  FL +R
Sbjct: 390 QIMYGCRGWTAHSNTDIWADTAPVDRWMPASIWPLGGAWLCYHIWDHFQYTCDEVFL-RR 448

Query: 491 AYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
            +P L GC  FLLD+LI   +G YL T+PS SPE+ F    G+   +   ST+D+ II  
Sbjct: 449 MFPTLRGCVEFLLDFLIVDANGAYLITSPSASPENSFYDHKGQKGVLCEGSTIDIQIIDA 508

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +  A  S  + L+  +DAL+  V  +  RL P KI+  G + EW 
Sbjct: 509 ILGAFQSCTKKLDL-QDALLPAVYATKSRLPPLKISPAGYLQEWA 552


>gi|305665057|ref|YP_003861344.1| hypothetical protein FB2170_02120 [Maribacter sp. HTCC2170]
 gi|88709809|gb|EAR02041.1| hypothetical protein FB2170_02120 [Maribacter sp. HTCC2170]
          Length = 787

 Score =  333 bits (853), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 198/600 (33%), Positives = 326/600 (54%), Gaps = 56/600 (9%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA---PK 70
           K+ +  PAK +  A+P+GNGRLGAMV+G    E ++LNED++W   PG+   PD      
Sbjct: 30  KLWYGKPAKEWMQALPVGNGRLGAMVFGDPNHERIQLNEDSMW---PGEADWPDYRGNSD 86

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRREL 128
            L ++R+L++ G+  E  +  V+ F +   V  +Q +GD+ ++F++     + E Y R L
Sbjct: 87  DLEEIRNLLNEGKTGEVDSLIVEKFSYKTIVRSHQTMGDLYIDFENER---SVENYTRSL 143

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           +LN A     Y  G   ++++ FSS PD V+V ++S   +  + F + ++   D+     
Sbjct: 144 NLNDALITAAYQSGGNSYSQKVFSSKPDDVMVIELSTDATDGMDFTLRMNRPTDD----- 198

Query: 189 GNNQIIM----EGRCPGKRIPPKANANDDPK------GIQFSAILEIKISDDRGTISALE 238
           GN  +      E     K +  + +   D K      G++F   L  ++ ++ GT++A +
Sbjct: 199 GNATVTTRNPSESEISMKGVVTQYSGKRDSKSFPLDYGVKFETRL--RVHNEGGTVTA-D 255

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
             +L ++G    ++ LV ++SF           ++ T +++  L+ + N S+  L   H 
Sbjct: 256 KGQLTLKGVKTVLIHLVGNTSFY--------HGENYTKKNLETLEKVNNSSFKTLLKNHT 307

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFG 357
            DY++L++RV + L                +D++P   R++   + ++DP L   LF++G
Sbjct: 308 KDYEELYNRVGLDLGG------------RELDSLPIDARLQRIKEGNDDPDLAAKLFKYG 355

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI+SSR GT  ANLQGIWNE ++  W++  H+NINL+MNYW +   NLSE  +P F+
Sbjct: 356 RYLLIASSRQGTNPANLQGIWNEHITAPWNADYHLNINLQMNYWPAEVANLSELHQPFFE 415

Query: 418 FLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
           +L  +   G  TA+  Y +  G + HH +D+WA       +  W  W  GG W   H WE
Sbjct: 416 YLDRVLERGKNTAKKQYGINRGTMAHHASDLWATPFMRAERAYWGSWVHGGGWCAQHYWE 475

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLA 534
           HY YT D++FL+ RAYP+L+G + F LDWL+  E    ++ ++P TSPE+ +   DG  A
Sbjct: 476 HYRYTEDKEFLKNRAYPVLKGISEFYLDWLVWDETSKAWV-SSPETSPENSYFNADGNSA 534

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
            VS+ S M   II EVF  ++ AA+VL   +D   ++V     +L P   + +DG ++EW
Sbjct: 535 AVSFGSAMGHQIIAEVFDNVLEAAKVL-GIQDEFTKEVKAKREKLFPGIVVGDDGRLLEW 593


>gi|149196081|ref|ZP_01873137.1| putative large secreted protein [Lentisphaera araneosa HTCC2155]
 gi|149140928|gb|EDM29325.1| putative large secreted protein [Lentisphaera araneosa HTCC2155]
          Length = 790

 Score =  333 bits (853), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 210/605 (34%), Positives = 318/605 (52%), Gaps = 72/605 (11%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +   A+ F  ++PIGNGRLGAMV+G V  E + +NE+++W+G   +   P   K L+
Sbjct: 28  KLWYKQAAQGFEQSLPIGNGRLGAMVFGDVDEERIVINEESVWSGSKVENNIPVGYKHLA 87

Query: 74  DVRSLVDSGQYAEAT---------------AASVKLFGHPADVYQLLGDIELEFDDSHLK 118
            +R L+   ++ EA                A  +  FG     YQ+LG+I L+F  +  K
Sbjct: 88  KIRQLLGEEKFTEANKLMKQAFKVKNAPKYAKGISAFGR----YQVLGNIHLKFLGNKAK 143

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
            ++  Y+RELDLN+A A V Y  G  +FTREHF S PD+V V++ SG     +SF++S+D
Sbjct: 144 VSQ--YKRELDLNSALATVNYQAGKQQFTREHFVSAPDEVFVSRFSGP----ISFSISMD 197

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                 + V   ++++M G             ND  +    + +  +++      I A +
Sbjct: 198 RPERFKTSVVNKHELLMTGAL-----------NDGFEKDGLTYVARLRVIAPNAKIKA-D 245

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
             KL VE  +  +LLL A++ + G          DP   +   L      S+++L     
Sbjct: 246 GNKLIVESQEEVMLLLAAATDYRGI---AGRQLSDPFKATSEDLDKAEKKSFTELRQAQK 302

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
            D++K + RV + L+            E +   +P+ +R+ +++  + DP+L  L F  G
Sbjct: 303 ADHEKYYRRVKLNLA------------ESHNSALPTDQRLAAYRKGKADPALAALFFNVG 350

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RY LISSSRPG   ANLQGIW E++   W+   H NIN +MNYW +L CN+ E QEP+ +
Sbjct: 351 RYFLISSSRPGGLPANLQGIWAEEVHTMWNGDYHFNINTQMNYWPALSCNMVEMQEPMNN 410

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIW---AKSSADRGKVVWALWPMGGAWLCTHL 474
           F+  L   GSKTA+  Y + GW+ H  T+IW   A +  D G         G AWLC HL
Sbjct: 411 FIASLVEPGSKTAKAYYDSPGWIAHRLTNIWGYTAPAGMDIG---------GPAWLCEHL 461

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKL 533
           WE Y YT+DR+FL K  YP+++    F L  L E   + +L T PS SPE+ F  P  K 
Sbjct: 462 WEQYAYTLDREFL-KSVYPIMKSSIDFYLHNLWEEPENKWLVTGPSASPENGFKLPGNKR 520

Query: 534 --ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRLRPTKIAEDGSI 590
             + +    T+DM  +RE+F   + AA++L    DA ++K L +  PRL P +IA DG +
Sbjct: 521 GGSGICAGPTIDMQQLRELFGNTLRAAKIL--GIDAELQKELAEKRPRLAPNQIAPDGVL 578

Query: 591 MEWVQ 595
            EW++
Sbjct: 579 QEWLK 583


>gi|340347371|ref|ZP_08670480.1| alpha-L-fucosidase [Prevotella dentalis DSM 3688]
 gi|433651138|ref|YP_007277517.1| hypothetical protein Prede_0088 [Prevotella dentalis DSM 3688]
 gi|339609463|gb|EGQ14335.1| alpha-L-fucosidase [Prevotella dentalis DSM 3688]
 gi|433301671|gb|AGB27487.1| hypothetical protein Prede_0088 [Prevotella dentalis DSM 3688]
          Length = 784

 Score =  333 bits (853), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 212/591 (35%), Positives = 303/591 (51%), Gaps = 38/591 (6%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
           PL++ ++ PA  F +++PIGNG+LGA+++GG     + LN+ T W+G P D T + DA  
Sbjct: 26  PLRLWYDRPATCFEESLPIGNGKLGAIIYGGPDDNVIHLNDITFWSGKPVDLTIDSDAHV 85

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELD 129
            +  +R  +    Y  A +    + G  +  YQ LG + +      L+  E + Y R+L 
Sbjct: 86  WIPKIREALFREDYRLADSLQHHVQGANSQYYQPLGTLRIR----DLQPGEASGYHRQLS 141

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L++A    +Y  G V +TRE+F+S PD+VI  ++  S  G LS ++ L S +D H     
Sbjct: 142 LDSAVCHDRYVRGGVTYTREYFASAPDKVIAVRLRASRPGMLSCSIGLGSQVD-HGTKTS 200

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
           + QIIM G           NA  DP+  I F  +L  ++S+D G++    D  L V G++
Sbjct: 201 DRQIIMTG-----------NAAGDPQETIHFCTVL--RVSNDGGSVER-TDSSLVVTGAN 246

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
            A + LV  +SF+G   +P          +M     + N S   L  RHLDDYQ +FHRV
Sbjct: 247 GATIYLVNETSFNGYDKHPVTQGTPYIENAMDDAWHLANYSCDSLLRRHLDDYQPIFHRV 306

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
           S  L  S  +    T          S  R    Q   D  L  L FQFGRYLLISSSR  
Sbjct: 307 SFTLDGSRYNATQPT---------DSMLRAYGSQPAYDRYLEALYFQFGRYLLISSSRTP 357

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
              ANLQG+WNE     W     +NINLE NYW     N+ E   PL  F   L+  G++
Sbjct: 358 GVPANLQGLWNEKKKAPWRGNYTININLEENYWPCDVANMPEMFAPLATFCQNLAQTGAQ 417

Query: 429 TAQVNY-LASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            A+  Y +  GW   H +DIWA ++     R    W+ W MGGAWL  ++++HY YT DR
Sbjct: 418 NARNYYGIGRGWSCGHNSDIWAMTNPVGEKRESPTWSNWNMGGAWLMQNVYDHYLYTQDR 477

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           D+L   AYPL+ G + F+LDWL+    +   L T PSTSPE  ++   G      Y  T 
Sbjct: 478 DYLSGTAYPLMRGASDFILDWLVPNPRNPEELITAPSTSPEAYYVTDKGYKGATLYGGTA 537

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           D+AIIRE+ +  + AA  L ++  A  + +  +L RL P  +   G + EW
Sbjct: 538 DLAIIRELLTNTLEAARTLNRDR-AYQDTLRHTLARLHPYTVGRQGDLNEW 587


>gi|448410558|ref|ZP_21575263.1| alpha-L-fucosidase [Halosimplex carlsbadense 2-9-1]
 gi|445671594|gb|ELZ24181.1| alpha-L-fucosidase [Halosimplex carlsbadense 2-9-1]
          Length = 822

 Score =  333 bits (853), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 213/610 (34%), Positives = 312/610 (51%), Gaps = 56/610 (9%)

Query: 10  TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           T+  ++ ++ PA  + +A+P+GNGRLG MV G    E + LN+D LW G   D T    P
Sbjct: 20  THDDRLWYDAPATEWVEALPVGNGRLGGMVHGRPARERVALNDDRLWVGDHADRTADGGP 79

Query: 70  KALSDVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
             L  VR  +  G++  A     +LF G    V  YQ LGD+ +   D       + YRR
Sbjct: 80  DDLDAVRECLWDGEFERAQRLCNELFVGDLTGVAPYQPLGDLLI---DCPAHDDPDEYRR 136

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            LDL    +RV+Y+VG   F RE F+S PD V+  +I   ESG++   V LD      + 
Sbjct: 137 SLDLRAGVSRVEYTVGGTRFERECFASEPDGVLAMRIEADESGAVDARVRLDRDRSARTT 196

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKG--IQFSAILEIK----------------IS 228
           V  ++ +++ G+       P  + + DP G   +F A   ++                I 
Sbjct: 197 VV-DDTVVLRGQVIDL---PGDDESVDPGGWGQRFEARARVRAEGGIVAAAADEAAPSIG 252

Query: 229 DDRGTI--SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR 286
           D  G    +A     + V G+D   ++L A        + PSD   DP  E   AL  + 
Sbjct: 253 DGDGEREGAAYGTDGIVVAGADAVTVVLTAG-------VAPSDG--DPRDECREALAGVA 303

Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED 346
           +  Y+ +  RH+ D+++   RV + L   P D   D    E +D V   ER        D
Sbjct: 304 DDDYAAIRERHVADHREHMDRVDLDLG-EPVDAPVD----ERLDRVRDGER--------D 350

Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
           P L +L  Q+GRYLL+ SSRPGT  ANLQGIWNE+  P WDS    ++NLEMNYW +   
Sbjct: 351 PHLAQLYVQYGRYLLLGSSRPGTLPANLQGIWNEEFHPPWDSDYTQDVNLEMNYWHAEVA 410

Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
           NL EC +PL +F+      G +TA+  Y   G+  H  +D W  ++A      W  WPMG
Sbjct: 411 NLRECADPLVEFVDESREPGRETARERYGCEGFTTHLHSDRW-HTTAQTADAHWGHWPMG 469

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHE 525
            AWLC +LWE Y ++ DR+ LE R YP+L   A FLLD+L+E   + +L T PS SPE++
Sbjct: 470 AAWLCQNLWERYAFSGDREDLE-RIYPILREAAEFLLDYLVEHPEEEWLVTAPSASPENQ 528

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
           F   DG+ A       MD+ + R++F   + AAE L+++ D   E + ++L RL P  + 
Sbjct: 529 FRTADGQEATTCVMPAMDIQLTRDLFGHCVEAAETLDRDADFAAE-LAEALERLPPMGVD 587

Query: 586 EDGSIMEWVQ 595
           + G++ EW++
Sbjct: 588 DRGALREWLR 597


>gi|395213355|ref|ZP_10400162.1| alpha-L-fucosidase [Pontibacter sp. BAB1700]
 gi|394456724|gb|EJF10981.1| alpha-L-fucosidase [Pontibacter sp. BAB1700]
          Length = 827

 Score =  332 bits (852), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 214/591 (36%), Positives = 321/591 (54%), Gaps = 47/591 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +  PA ++ +A+P+GNGRLGAMV+     E L+LNE+T+W G PG+   P    AL 
Sbjct: 32  KLWYKQPAANWNEALPLGNGRLGAMVFSQPAREQLQLNEETVWAGEPGNNVLPALNSALP 91

Query: 74  DVRSLVDSGQYAEAT-AASVKLFGHPAD------VYQLLGDIELEFDDSHLKYAEETYRR 126
           ++R L+ +G++ EA   A  KL   PA        YQ +G++ + F   H +  +  Y R
Sbjct: 92  EIRQLIAAGKHKEAQDLAMEKLPRQPAADNNYGMPYQPVGNLFISFP-GHEQATD--YYR 148

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
           +LD+  A + V Y V  V F RE FSS  D V++ ++S  +  S++F +S DS   N++ 
Sbjct: 149 DLDIQQAISTVYYKVNGVTFKREMFSSFTDDVVIVRLSADKPKSINFTLSADSPHKNYTV 208

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVE 245
               NQ+I+ G          +   D+ KG ++F  ++E +   + G I++  +  ++V 
Sbjct: 209 RTRGNQLILSG---------VSGDVDNKKGKVKFQTLVEPET--EGGKITSTPEG-VQVS 256

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G++ A L +   ++F     +  D   D  +++   L S     Y      H   Y+  +
Sbjct: 257 GANAATLYISIGTNFK----SYRDLSGDGEAKAAKLLSSAVKKKYKKAKAEHTAFYRNYY 312

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            R S+ L  +  D+             P+ ER+ +F    DP L  L FQFGRYLLISSS
Sbjct: 313 DRASLNLGTT-ADLQK-----------PTDERLAAFARSNDPHLAALYFQFGRYLLISSS 360

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           +PGTQ ANLQGIWN+ ++P WDS   VNIN EMNYW +   NLSE   PLF  L  LS +
Sbjct: 361 QPGTQPANLQGIWNDKIAPPWDSKYTVNINTEMNYWPAEVTNLSEMHGPLFSMLKDLSES 420

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G ++A   Y A GW++HH TDIW  +    G   + +WPMGGAWL  HLW+HY YT D+ 
Sbjct: 421 GRESASKMYGARGWMMHHNTDIWRITGPIDG-AFYGMWPMGGAWLTQHLWQHYLYTGDQK 479

Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           FL K  YP+L+G A F  D L E   + +L  +PS SPE++  +       +S  +TMD 
Sbjct: 480 FL-KVVYPVLKGSAMFYADVLQEEPTNKWLVVSPSMSPENKHQSG----VSISAGTTMDN 534

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +I ++FS +I  AEVL  ++ A  + +     RL P +I +   + EW++
Sbjct: 535 QLIFDLFSNVIRTAEVLNTDQ-AFADSLRTMRDRLPPMQIGQHNQLQEWLR 584


>gi|336412946|ref|ZP_08593299.1| hypothetical protein HMPREF1017_00407 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942992|gb|EGN04834.1| hypothetical protein HMPREF1017_00407 [Bacteroides ovatus
           3_8_47FAA]
          Length = 799

 Score =  332 bits (852), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 201/594 (33%), Positives = 328/594 (55%), Gaps = 51/594 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKA 71
           +++ +  PAK +  ++PIGNGR+GAMV+GG+  ET+ LNE ++W+G   +    P   + 
Sbjct: 29  VELWYEQPAKEWMSSVPIGNGRIGAMVFGGIEEETIALNESSMWSGQYDENQEIPFGKER 88

Query: 72  LSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEET---YR 125
           ++++R L   G+  E    + +     GH    +  +GD++L F      Y E T   YR
Sbjct: 89  MNELRKLFFEGKIQEGNQIAGEFLHGNGHSFGTHLPIGDLKLTFS-----YPENTVSNYR 143

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           R LDL TA +   Y++G+V + RE F++NPD V+V ++S S+  +++  +SL  L ++  
Sbjct: 144 RSLDLGTAISTTNYTIGDVNYVRECFATNPDDVLVLRMSASKKKAINAKLSLSMLRESEI 203

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
             +GN Q+I EG       P +      P G+ F     I IS   GT+ A ED  + V 
Sbjct: 204 STDGN-QLIFEGTV---NFPKQG-----PGGVSFQG--RIAISAPNGTLQA-EDSSISVN 251

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            +D   +++   +++       +D+ K    E++   +     +Y  L   HL+DY  LF
Sbjct: 252 DADMLTIVIDVRTNYK------NDAYKSLCKETVVKAEK---KTYEKLKKTHLNDYTPLF 302

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            RVS+QL          T     + T    E+VK  +   DP L  LLFQ+GRYLL++SS
Sbjct: 303 DRVSLQLG---------TGEYAGLPTDKRWEQVK--KGGYDPGLDVLLFQYGRYLLLASS 351

Query: 366 RPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           R  + + A LQG +N++L+    W +  H++IN + NYW +   NL+EC  PLF ++  L
Sbjct: 352 RENSPLPAALQGFFNDNLACNMGWTNDYHLDINTQQNYWIANVGNLAECHLPLFKYIEDL 411

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           S++G+KTAQ  Y   GW  H   +IW   +A  G ++W L+P   +W+ +HLW  Y YT 
Sbjct: 412 SVHGAKTAQKIYGCKGWTAHTTANIWG-YTAPSGSILWGLFPTASSWIASHLWTQYEYTR 470

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           D+D+L K AYPLL+G A FLLD+++E  + GY+ T PS SPE+ F+     L C S   T
Sbjct: 471 DKDYLTKTAYPLLKGNAEFLLDYMVEDPNTGYMVTGPSISPENSFLYQGNNL-CASMMPT 529

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            D  +  E+F+A I +A++L  +++   + + +++ +  P ++  +G + EW++
Sbjct: 530 CDRVLAYEIFNACIQSAQILNIDKE-FSDSLQQAIKKFPPIRLRANGGVREWLE 582


>gi|313204128|ref|YP_004042785.1| alpha-L-fucosidase [Paludibacter propionicigenes WB4]
 gi|312443444|gb|ADQ79800.1| Alpha-L-fucosidase [Paludibacter propionicigenes WB4]
          Length = 826

 Score =  332 bits (852), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 203/592 (34%), Positives = 317/592 (53%), Gaps = 50/592 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA ++ +A+PI NGR+ AMV G    E L+LNE + W+G P    NPD  K L
Sbjct: 29  LKLWYDKPAANWNEALPIANGRIAAMVHGNPSKELLQLNESSFWSGGPSRNDNPDGLKGL 88

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R+ +  G Y  A   S +           +Q +G++ + F ++  K+ +  Y R+LD
Sbjct: 89  DSIRTYIFQGNYTRANTLSNQFLTAKQLHGSKFQSIGNLNISFPNAE-KFTD--YYRDLD 145

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           +  A + V Y V +V + RE  +S PDQVIV +++ S+ G L+F  + DS L   S    
Sbjct: 146 IENALSSVSYKVDDVIYKREILASIPDQVIVVRLTASKPGKLTFTTNFDSQLKKTSVALD 205

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
           N+ + M G          +  ++   G ++F A    K+ ++ GT+S + D  LKV+ ++
Sbjct: 206 NHTLEMTGL---------SGTHEGVIGQVKFDA--RAKVINNGGTVSFVSDS-LKVKNAN 253

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
             ++++  +++F    ++  +   + T + +  L       ++ +   H+  YQK F RV
Sbjct: 254 EVIIMVSIATNF----VDYQNLTANETQKCIQYLSVAEKKPFNTILKNHISTYQKYFKRV 309

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
           +  L  S     T            + +R+K+F    DP LV L +QFGRYLLI SS+P 
Sbjct: 310 NFDLGTSEAAKAT------------TKDRIKNFSKSYDPELVSLYYQFGRYLLICSSQPN 357

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
            Q +NLQGIWN   +P WDS   +NIN EMNYW +   NL+E  EPL   +  LS +G +
Sbjct: 358 GQPSNLQGIWNGSNNPMWDSKYTININTEMNYWPAEKTNLTEMHEPLIKMIKELSQSGKE 417

Query: 429 TAQVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           TA+V Y ++GWV HH TDIW  +     AD G+     WPMGGAWL  HLWE Y Y  + 
Sbjct: 418 TAKVMYGSNGWVAHHNTDIWRITGVVDFADAGQ-----WPMGGAWLSQHLWEKYLYNGNL 472

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
            +LE   YP+L+    F  D+LIE     +L  +PS SPE+    P G  + +    T+D
Sbjct: 473 KYLE-SVYPVLKSACEFYKDFLIEEPTHKWLVVSPSVSPEN---TPQGHKSALVAGCTID 528

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             ++ ++F+  I AA++L+K+   +V+   K L RL P +I   G + EW++
Sbjct: 529 NQLLFDLFTKTIKAAKLLKKDASLMVD-FQKILDRLPPMQIGRLGQLQEWLE 579


>gi|29346420|ref|NP_809923.1| hypothetical protein BT_1010 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29338316|gb|AAO76117.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 824

 Score =  332 bits (851), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 216/598 (36%), Positives = 337/598 (56%), Gaps = 45/598 (7%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           E   +    K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+T+W G P +  
Sbjct: 25  EKKVSAQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNA 84

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
           NP+A + +  VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +
Sbjct: 85  NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y++  Y R+L L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L 
Sbjct: 141 YSD--YYRDLSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 198

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
           S    H  V  +++   EG C    +   ++ ++  KG ++F   L  +   ++G   A 
Sbjct: 199 S---PHQDVMIHSE---EGNC--VTLSGVSSLHEGLKGKVEFQGRLTAR---NQGGKIAC 247

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            D  L VEG+D A + +  +++F+    N  D   + T  + S L       +++    H
Sbjct: 248 TDGVLSVEGADEATIYVSIATNFN----NYLDITGNQTERAKSYLSEALVRPFAEAKKNH 303

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
           ++ Y++   RVS+ L             E+    V + +RV++F+   D  LV   FQFG
Sbjct: 304 VEFYRRYLTRVSLDLG------------EDQYKNVTTDKRVENFKDTHDAHLVATYFQFG 351

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLS+  EPLF 
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSDLNEPLFR 411

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            +  +S +G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC HLWE 
Sbjct: 412 LIKEVSESGKETAKIMYGANGWVLHHNTDIWRITGA-LDKAPSGMWPSGGAWLCRHLWER 470

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           Y YT D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     DGK A  
Sbjct: 471 YLYTGDTEFL-RSVYPILKESGLFFDEIMVKEPVHNWLVVCPSNSPENVHSGSDGK-ATT 528

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +   TMD  +I ++++AIISA+ +L+ +++     + + L  + P ++   G + EW+
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASRILDTDKE-FAAHLEQRLKEMAPMQVGHWGQLQEWM 585


>gi|326790118|ref|YP_004307939.1| alpha-L-fucosidase [Clostridium lentocellum DSM 5427]
 gi|326540882|gb|ADZ82741.1| Alpha-L-fucosidase [Clostridium lentocellum DSM 5427]
          Length = 756

 Score =  332 bits (851), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 199/587 (33%), Positives = 313/587 (53%), Gaps = 54/587 (9%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +   A+++ +A+PIGNG LG M++GG+  E +++NE++LW G   D  N DA K L  +R
Sbjct: 8   YKQAARNWNEALPIGNGALGGMIFGGIKKELIQMNEESLWYGTFRDRNNKDARKYLPVIR 67

Query: 77  SLVDSGQYAEATAA-SVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEET----YRRELD 129
            L+  G+  EA    S+ +FG P     Y +LGD+ ++       + +E     YRR LD
Sbjct: 68  DLLWQGKIGEAEKLLSMSMFGTPDGQRQYSVLGDLVIQC------FGQEEPVSHYRRTLD 121

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L TA A V Y     +F RE+F S PD ++  ++   +   +     +D    N      
Sbjct: 122 LETACATVGYVSPKGKFEREYFCSKPDNLLAVRLRCDQEEQIELMAYIDRWKYNDEIEMS 181

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
            + + + G          ++     +GI +  ++  K+  + GT   +  ++L  +G + 
Sbjct: 182 KDGMSLYG----------SSGPCSSEGIGYHFMM--KLIPNGGTAQNI-GQRLYAKGCNE 228

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
            ++L+ A++ +        DS  +P S     L+      Y +L  RH+ DY+ L+ R+S
Sbjct: 229 VIILVTATTDY-------KDS--NPRSICEERLKKATQKGYEELKARHVADYKSLYKRLS 279

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG 368
           + L              E+++ +P+ ER++  +   ED  L+ + FQ+GRYLLIS SR G
Sbjct: 280 LDLKG------------ESLNHLPTDERLERIKKGGEDLDLIAMYFQYGRYLLISCSREG 327

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
              A LQGIWN +  P WDS   +NIN EMNYW +  C+LSEC  PL + L  + I+G K
Sbjct: 328 GLPATLQGIWNGEWLPPWDSKYTININTEMNYWLAEKCHLSECHLPLVEHLEKVRIHGEK 387

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           TA+  Y   G++ HH TDIW  ++     +   +WPMG AWL  H+WEHY YT+D+ FL 
Sbjct: 388 TAEQMYGCRGFMAHHNTDIWGDAAPQDMWMPATIWPMGAAWLVLHIWEHYEYTLDQAFL- 446

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
           K  Y LL+G   F  D+L+   +GYL T PSTSPE+ +    G+   V    +MD  I+ 
Sbjct: 447 KEKYHLLKGAGDFFKDYLMMDENGYLVTGPSTSPENTYRLSSGEQGTVCIGPSMDSQILF 506

Query: 549 EVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEW 593
           E+F+AII A +++ + E+ +   +++ K LP   P +I + G IMEW
Sbjct: 507 ELFTAIIEAGQLVGEAEEEIQCFKEMRKKLP---PIQIGKYGQIMEW 550


>gi|325298118|ref|YP_004258035.1| alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
 gi|324317671|gb|ADY35562.1| Alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
          Length = 820

 Score =  332 bits (851), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 216/595 (36%), Positives = 312/595 (52%), Gaps = 41/595 (6%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
           +S    LK+ +  PA  + +A+P+GN  +G MV+GG   E L+LNE+T+W G P    NP
Sbjct: 18  SSWAESLKLWYRQPAHVWVEALPLGNSNMGVMVYGGTGVEQLQLNEETMWGGGPHRNDNP 77

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETY 124
            A +AL +VR L+   +  EA     K F  G     YQ +G + +E    H ++A + Y
Sbjct: 78  KALQALPEVRKLIFDNRNMEAQQLIDKTFYSGRNGMPYQTIGSLMIE-QPGH-EHATDYY 135

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
           R +LDL  A A V+Y V  V + RE F+S  D+VI   ++    G L+F +   S L  H
Sbjct: 136 R-DLDLERAVATVRYQVDGVTYRREVFASLVDKVIRVHLTADRPGMLTFTLGYQSPLTRH 194

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI--KISDDRGTISALEDKKL 242
                         C GK +    N  +D +G++    +E   ++    G + A  DK L
Sbjct: 195 QVT-----------CKGKTLVLTGNG-EDHEGVKGVIRMETGTQVMAKGGKVKAQGDK-L 241

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            VEG+D  V L VAS++    F + +D   +P       L+     SY+     H   Y+
Sbjct: 242 CVEGAD-EVTLYVASAT---NFRSYNDVSGNPHRSVQELLKKAVKTSYTQALADHEAYYR 297

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           K F RV + L             E   D   + ER++ F   +D SL  L+FQ+GRYLLI
Sbjct: 298 KQFDRVRLDLG------------EGQGDQWETTERIRRFNEGKDVSLAALMFQYGRYLLI 345

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           SSS+PG Q ANLQGIWN+ L   WD    +NIN EMNYW +   NL E  +PLF+ +  L
Sbjct: 346 SSSQPGGQAANLQGIWNDKLLAPWDGKYTININTEMNYWPAEVTNLPETHQPLFELVKEL 405

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           S  G +TA+V Y A+GWV HH TDIW + +    K  +  WP GGAWL THLW+HY YT 
Sbjct: 406 SQTGQETARVMYGANGWVAHHNTDIW-RCTGPVDKAFYGTWPNGGAWLTTHLWQHYLYTG 464

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPD-GKLACVSYSS 540
           D++FLE+  YP L+G A F L +LI     G++   PS SPEH     + GK + +    
Sbjct: 465 DKEFLEE-VYPALKGAADFYLSYLIPHPKYGWMVEAPSMSPEHGPQGENTGKASTIVAGC 523

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           TMD  I+ +V +  + A  +L+ +  A  + +   + +L P +I +   + EW++
Sbjct: 524 TMDNQIVFDVLNNALHATRILDGSV-AYQDSLRWMIEQLPPMQIGQYNQLQEWLE 577


>gi|393782187|ref|ZP_10370376.1| hypothetical protein HMPREF1071_01244 [Bacteroides salyersiae
           CL02T12C01]
 gi|392674221|gb|EIY67670.1| hypothetical protein HMPREF1071_01244 [Bacteroides salyersiae
           CL02T12C01]
          Length = 1400

 Score =  332 bits (850), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 209/607 (34%), Positives = 327/607 (53%), Gaps = 55/607 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA ++ +A+P+GNGRLGAMV+G    +T+++NEDT W+G P +  NP+A   L
Sbjct: 27  LKLWYDRPADYWVEALPLGNGRLGAMVYGIASQDTIQINEDTYWSGSPYNNANPNALTHL 86

Query: 73  SDVRSLVDSGQYAEATA-------ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
            D+R+ +++G+YAEA         A   + GH   +Y+ +G++ L+F ++H       Y 
Sbjct: 87  EDIRNYINNGEYAEAQKLALANIIADRNITGHGM-IYESIGNLLLDFPENH--KTPSNYY 143

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH- 184
           RELDL+ A A++ Y+V  V +TRE F+S  DQ+I+ KIS  + G ++F  S    L  + 
Sbjct: 144 RELDLSNAVAKITYTVDGVNYTREVFTSLADQLIIIKISADQPGKVTFKTSFVGPLKTNR 203

Query: 185 -----SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
                  V G + ++      GK+          P  +   ++  IK+  D G+ +A  +
Sbjct: 204 TKVTVKLVEGADNMLSVYTEGGKKTEENI-----PNLLHAHSL--IKVVADGGSQTA-AN 255

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
             L V  ++ A + +  +++    F++  D   D  + +   L    +  Y      H+ 
Sbjct: 256 SSLNVTNANSACIYISTATN----FVSYKDISADSEARAKEYLDKF-DKDYEQAKADHIA 310

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
            YQ+ F RV++ L  +         SE+  +  P+  R++ F T  DPSL  L FQFGRY
Sbjct: 311 KYQEQFGRVTLNLGNN---------SEQ--EKKPTDVRIEEFSTVNDPSLAALYFQFGRY 359

Query: 360 LLISSSRPGTQVANLQGIWNEDLS--PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           LLISSS+PGTQ ANLQGIWN +    P WDS    NIN+EMNYW +   NLSEC  P   
Sbjct: 360 LLISSSQPGTQPANLQGIWNPNAGQYPAWDSKYTANINVEMNYWPAEVTNLSECHNPFLQ 419

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            +  +S+ G ++A   Y   GW +HH TDIW +S+    K    +WP   AW C HLWEH
Sbjct: 420 MVKDVSVTGEESAGKMYGCRGWTLHHNTDIW-RSTGAVDKSACGVWPTCNAWFCFHLWEH 478

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE---FIAPD--- 530
           Y +T D++FL +  YP+L+  + F  D+LI + + GY   +PS SPE+    F   D   
Sbjct: 479 YLFTGDKEFLAE-IYPVLKSASEFYQDFLITDPNTGYKVVSPSNSPENHPGLFSYTDDSG 537

Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDG 588
             + A +    TMD  ++ ++    I AAE+L  ++  + +  LK L  +L P  + + G
Sbjct: 538 SKQNAAIFSGVTMDNQMVYDLLRNTIEAAEILNTDKGFVAD--LKELKEQLPPMHVGKYG 595

Query: 589 SIMEWVQ 595
            + EW++
Sbjct: 596 QLQEWLE 602


>gi|423221590|ref|ZP_17208060.1| hypothetical protein HMPREF1062_00246 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392645917|gb|EIY39637.1| hypothetical protein HMPREF1062_00246 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 826

 Score =  331 bits (849), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 201/593 (33%), Positives = 318/593 (53%), Gaps = 48/593 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++GPA  + +A+P+GNGR+GAMV+G    E  +LNE+T+W G P + TNP A  AL
Sbjct: 27  LKLWYDGPATQWVEALPLGNGRIGAMVFGDPVHEQFQLNEETVWGGSPYNNTNPKAKDAL 86

Query: 73  SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
             +R L+  G+  EA      T  S    G P   YQ +G + L+FD     Y +  Y R
Sbjct: 87  PRIRQLIFEGKNKEAQELCGPTICSPSANGMP---YQTVGSLHLDFDGIS-NYND--YYR 140

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
           +LD+  A A  +++   V +TRE ++S PDQV+V +++ S+  S+SF     +   ++  
Sbjct: 141 DLDIAKAIATTRFTTNGVTYTREAYTSFPDQVLVIRLTASQKKSISFTAKYTTPYKSNVV 200

Query: 187 --VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
             ++   ++ + G         KAN ++  KG ++F+A+   +I +  G++ A  D  L+
Sbjct: 201 RSISSRKELQLSG---------KANDHEGIKGKVEFTAL--TRIENSGGSLEATSDSTLQ 249

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V+ ++ +V L V   S    F+N  D   +  S +   L+ + N +Y+     H++ YQK
Sbjct: 250 VKNAN-SVTLYV---SIGTNFVNYKDVSGNALSTAQKYLKQV-NKNYAKSKAAHINAYQK 304

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
            F+RVS+ L R+ +               P+  RVK F T  DP +  L FQFGRYLLI 
Sbjct: 305 YFNRVSLDLGRNAQA------------DKPTDVRVKEFSTSFDPQMAALYFQFGRYLLIC 352

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SS+PG Q ANLQGIWN  L   WD     +IN+EMNYW +   +L E  EP    +   +
Sbjct: 353 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMHEPFLQLVKEAA 412

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
           I G ++A + Y   GW +HH TDIW  + A  G   + +WP   AW C HLW+ Y ++ D
Sbjct: 413 IQGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGP-SYGVWPTCNAWFCQHLWDRYLFSGD 470

Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           +++L +  YPL+ G   F LD+L+ E  + +L   PS SPE+  +    +   V   +TM
Sbjct: 471 KNYLAE-VYPLMRGACEFYLDFLVREPENNWLVVAPSYSPENSPVVNGKRTFVVVAGTTM 529

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           D  ++ ++F   I+AA ++ +N  A  + +   +  L P ++   G + EW+ 
Sbjct: 530 DNQMVYDLFYNTIAAAGLMNENT-AFTDSLQTVVNNLAPMQVGRWGQLQEWMH 581


>gi|212540772|ref|XP_002150541.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
 gi|210067840|gb|EEA21932.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
          Length = 755

 Score =  331 bits (849), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 204/596 (34%), Positives = 316/596 (53%), Gaps = 50/596 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +  PA+ + +A+P+GNGRLG MV+G   +E L LNED++W G P   T   +   L+
Sbjct: 4   KLWYQQPAQCWNEALPVGNGRLGVMVYGRTSTELLALNEDSVWYGGPQSRTPQPSIGELA 63

Query: 74  DVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFD-DSHLKYAEETYRRELD 129
            +R L+   ++ +A   + K  F  PA    Y+ LG + ++F+ D+  K  +  Y+R LD
Sbjct: 64  LLRDLIRKEKHTDAEKLARKSFFASPASQRHYEPLGTVFIDFNHDNEQKLLD--YQRSLD 121

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS---- 185
           +  +   V+Y    +   R+  +S PD V+   I  S     +  ++  + LD  +    
Sbjct: 122 IEKSLCHVEYEYDGICIARDLIASYPDSVLAMHIQSSAPIEFTVRLTRVNELDYETNEFL 181

Query: 186 --YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
                  N ++M     GKR              +   +L  +  DD G ++A  +  L 
Sbjct: 182 DDVAAKGNSLVMSVTPGGKR------------SNRACCVLSARCIDDEGIVTARPNNSLH 229

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           + G +  +LL++A+ +        +D  K   ++  +ALQ     S+ +L TRH+ DY  
Sbjct: 230 IRGQN--ILLVIAAQTE----YRCNDIDKVTVTDCNNALQK----SWDELLTRHIQDYSA 279

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           L+ R+S+++         D+ +   +  +P+  R++      D  L+ L   + RYLLIS
Sbjct: 280 LYTRMSLRIG--------DSANLHELQKIPTDVRLRE---SRDLGLISLYHNYSRYLLIS 328

Query: 364 SSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           SSR G +   A LQGIWN   +P W S   +NINL+MNYW    CNLSEC +PLF  L  
Sbjct: 329 SSRNGYKALPATLQGIWNPSFTPAWGSKYTININLQMNYWPVNVCNLSECSQPLFALLRR 388

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           ++ NG KTA+  Y   GW  HH TDIWA +      +   LWP+GGAWLC H+WEH++YT
Sbjct: 389 MAENGVKTAKSMYNCGGWAAHHNTDIWADTDPQDRWMPATLWPLGGAWLCFHIWEHFDYT 448

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACV-SYS 539
            D++FL +  +P+L+GC  FLLD+LIE  DG YL TNPS SPE+ F   + +   V    
Sbjct: 449 QDKEFLSE-MFPVLQGCVEFLLDFLIESVDGKYLVTNPSLSPENTFYTHNRENQGVFCEG 507

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           ST+D+ II  VF+A +S+ +VL   ++ L  +V  +  RL P +I   G + EW+ 
Sbjct: 508 STIDIQIIEAVFTAFLSSVDVLNLTDNELGGRVQDAKKRLPPMQIGSFGQLQEWMH 563


>gi|224538245|ref|ZP_03678784.1| hypothetical protein BACCELL_03136 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520142|gb|EEF89247.1| hypothetical protein BACCELL_03136 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 827

 Score =  331 bits (849), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 201/593 (33%), Positives = 318/593 (53%), Gaps = 48/593 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++GPA  + +A+P+GNGR+GAMV+G    E  +LNE+T+W G P + TNP A  AL
Sbjct: 28  LKLWYDGPATQWVEALPLGNGRIGAMVFGDPVHEQFQLNEETVWGGSPYNNTNPKAKDAL 87

Query: 73  SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
             +R L+  G+  EA      T  S    G P   YQ +G + L+FD     Y +  Y R
Sbjct: 88  PRIRQLIFEGKNKEAQELCGPTICSPSANGMP---YQTVGSLHLDFDGIS-NYND--YYR 141

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
           +LD+  A A  +++   V +TRE ++S PDQV+V +++ S+  S+SF     +   ++  
Sbjct: 142 DLDIAKAIATTRFTTNGVTYTREAYTSFPDQVLVIRLTASQKKSISFTAKYTTPYKSNVV 201

Query: 187 --VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
             ++   ++ + G         KAN ++  KG ++F+A+   +I +  G++ A  D  L+
Sbjct: 202 RSISSRKELQLSG---------KANDHEGIKGKVEFTAL--TRIENSGGSLEATSDSTLQ 250

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V+ ++ +V L V   S    F+N  D   +  S +   L+ + N +Y+     H++ YQK
Sbjct: 251 VKNAN-SVTLYV---SIGTNFVNYKDVSGNALSTAQKYLKQV-NKNYAKSKAAHINAYQK 305

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
            F+RVS+ L R+ +               P+  RVK F T  DP +  L FQFGRYLLI 
Sbjct: 306 YFNRVSLDLGRNAQA------------DKPTDVRVKEFSTSFDPQMAALYFQFGRYLLIC 353

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SS+PG Q ANLQGIWN  L   WD     +IN+EMNYW +   +L E  EP    +   +
Sbjct: 354 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMHEPFLQLVKEAA 413

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
           I G ++A + Y   GW +HH TDIW  + A  G   + +WP   AW C HLW+ Y ++ D
Sbjct: 414 IQGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGP-SYGVWPTCNAWFCQHLWDRYLFSGD 471

Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           +++L +  YPL+ G   F LD+L+ E  + +L   PS SPE+  +    +   V   +TM
Sbjct: 472 KNYLAE-VYPLMRGACEFYLDFLVREPENNWLVVAPSYSPENSPVVNGKRTFVVVAGTTM 530

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           D  ++ ++F   I+AA ++ +N  A  + +   +  L P ++   G + EW+ 
Sbjct: 531 DNQMVYDLFYNTIAAAGLMNENT-AFTDSLQTVVNNLAPMQVGRWGQLQEWMH 582


>gi|237718536|ref|ZP_04549017.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
 gi|229452243|gb|EEO58034.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
          Length = 1100

 Score =  331 bits (849), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 210/592 (35%), Positives = 302/592 (51%), Gaps = 49/592 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +N PA+H+ +A+PIGN RLGAMV+GG   E L++NE+T W G P    +P A   L
Sbjct: 288 LKLWYNRPAQHWEEALPIGNSRLGAMVYGGAGCEELQINEETFWAGGPHHNNSPKAKTVL 347

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
            + R L+   +  EA    +   F  P  +  L     L     H K     Y RELD+ 
Sbjct: 348 DEARRLIFEDKTMEAQKLINPNFFSGPHGMSYLNMGSLLILQPGHEK--ATNYYRELDIE 405

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS--------LLDN 183
            ATA   Y V  V +TR  FSS  DQVI+ ++  +  G+L F++  D+        LL  
Sbjct: 406 DATATTCYEVDGVTYTRTAFSSMTDQVIIVRLEANRKGALQFSLGYDTPKEADGSALLHP 465

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
              V GN   +   +C G      A+A             ++++  D   ++  +  +L 
Sbjct: 466 VVKVRGNKLTM---QCIGMEQEGVASA--------IKGEWQVQVVHDGKQVN--QPDRLG 512

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V+G+  A + L A+++F    +N  D   + +  + + L++     Y      H   YQ 
Sbjct: 513 VQGATTATVYLSAATNF----VNYKDVSGNASRRAAAYLKTALKQPYPKALEAHSKAYQT 568

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
            F+RV + L   P  I +           P+ +RV  F   +D +L+ LL+Q+GRYLLI 
Sbjct: 569 QFNRVKLDL---PATIAS---------LAPTNQRVADFNRVDDRNLMALLYQYGRYLLIC 616

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SS+PG Q ANLQGIW   L   WDS   +NIN EMNYW +   NLSEC EPLF  L  LS
Sbjct: 617 SSQPGGQPANLQGIWCRSLHAPWDSKYTININTEMNYWPAEVTNLSECHEPLFSMLEDLS 676

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
           + G +TA+  Y A GWV HH TD+W  +    G   W +WP GGAWLC HLW+HY YT D
Sbjct: 677 VTGHETARTLYGAKGWVAHHNTDLWRIAGPVDG-ATWGMWPNGGAWLCQHLWQHYLYTGD 735

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           + FL K  YP+++G A F++  L++    G+L T PS SPEH + A      C     TM
Sbjct: 736 QAFLRKY-YPVMKGAADFMMSHLVKHPKYGWLVTVPSVSPEHGYTASTLTAGC-----TM 789

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           D  I  ++ +    AA +L +   A  + +  +  +L P +I +   I EW+
Sbjct: 790 DNQIAFDILNNTRLAATILGE-PTAYQDSLQATCTQLPPMQIGKYNQIQEWM 840


>gi|160882310|ref|ZP_02063313.1| hypothetical protein BACOVA_00258 [Bacteroides ovatus ATCC 8483]
 gi|156112318|gb|EDO14063.1| hypothetical protein BACOVA_00258 [Bacteroides ovatus ATCC 8483]
          Length = 1100

 Score =  330 bits (846), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 210/592 (35%), Positives = 302/592 (51%), Gaps = 49/592 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +N PA+H+ +A+PIGN RLGAMV+GG   E L++NE+T W G P    +P A   L
Sbjct: 288 LKLWYNRPAQHWEEALPIGNSRLGAMVYGGAGREELQINEETFWAGGPHHNNSPKAKTVL 347

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
            + R L+   +  EA    +   F  P  +  L     L     H K     Y RELD+ 
Sbjct: 348 DEARRLIFEDKTMEAQKLINPNFFSGPHGMSYLNMGSLLILQPGHEK--ATNYYRELDIE 405

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS--------LLDN 183
            ATA   Y V  V +TR  FSS  DQVI+ ++  +  G+L F++  D+        LL  
Sbjct: 406 DATATTCYEVDGVTYTRTAFSSMTDQVIIVRLEANRKGALLFSLGYDTPKEADGSALLHP 465

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
              V GN   +   +C G      A+A             ++++  D   ++  +  +L 
Sbjct: 466 VVKVRGNKLTM---QCIGMEQEGVASA--------IKGEWQVQVVHDGKQVN--QPDRLG 512

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V+G+  A + L A+++F    +N  D   + +  + + L++     Y      H   YQ 
Sbjct: 513 VQGATTATVYLSAATNF----VNYKDVSGNASRRAAAYLKTALKQPYPKALEAHSKAYQT 568

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
            F+RV + L   P  I +           P+ +RV  F   +D +L+ LL+Q+GRYLLI 
Sbjct: 569 QFNRVKLDL---PATIAS---------LAPTNQRVADFNRVDDRNLMALLYQYGRYLLIC 616

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SS+PG Q ANLQGIW   L   WDS   +NIN EMNYW +   NLSEC EPLF  L  LS
Sbjct: 617 SSQPGGQPANLQGIWCRSLHAPWDSKYTININTEMNYWPAEVTNLSECHEPLFSMLEDLS 676

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
           + G +TA+  Y A GWV HH TD+W  +    G   W +WP GGAWLC HLW+HY YT D
Sbjct: 677 VTGHETARTLYGAKGWVAHHNTDLWRIAGPVDG-ATWGMWPNGGAWLCQHLWQHYLYTGD 735

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           + FL K  YP+++G A F++  L++    G+L T PS SPEH + A      C     TM
Sbjct: 736 QAFLRKY-YPVMKGAADFMMSHLVKHPKYGWLVTVPSVSPEHGYTASTLTAGC-----TM 789

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           D  I  ++ +    AA +L +   A  + +  +  +L P +I +   I EW+
Sbjct: 790 DNQIAFDILNNTRLAATILGE-PTAYQDSLQATCTQLPPMQIGKYNQIQEWM 840


>gi|255936621|ref|XP_002559337.1| Pc13g09120 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211583957|emb|CAP91981.1| Pc13g09120 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 740

 Score =  330 bits (845), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 210/593 (35%), Positives = 313/593 (52%), Gaps = 55/593 (9%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA+ +  A+P+GNGRLGAMV+G   +E L+LNED++W G P D    DA + L 
Sbjct: 3   ELWYQQPAEDWNSALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQDRNPQDALEYLP 62

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
            +R  + +G +AEA   A +  F +P+    Y+ LG++ L  D  H       YRR LDL
Sbjct: 63  RLREAIRAGNHAEAEKIAKLAFFANPSSQRNYEPLGNLFL--DLGHDPSQVTGYRRSLDL 120

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD--NHSYVN 188
            +ATA V Y    V + R+  +S PD VI  K+  S        ++  S L+   H +++
Sbjct: 121 TSATAHVSYEYQGVRYERQVLASYPDDVIAIKMYSSSRAEFVVRLTRMSELEFETHEWLD 180

Query: 189 G----NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
                 N I M     GK      N+N      +   ++ I+      TI+ + +  L V
Sbjct: 181 DVSATGNSITMHVTPGGK------NSN------RACCMVSIRCDGAESTITRVGNN-LVV 227

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
             SD A+L++ A ++F           +D    +M   ++       D+  RH+ DYQ L
Sbjct: 228 NSSD-ALLVVAAQTTF---------RHEDNDQRTMQDAENALGFPLEDIRARHVADYQSL 277

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           ++R+ +QL     +I TD             +R+KS +   DP L+ L   + RYLLIS 
Sbjct: 278 YNRMELQLGPDSPEIPTD-------------QRLKSLR---DPGLIALYHNYNRYLLISC 321

Query: 365 SRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           SR   +   ANLQGIWN    P W S   +N+NL+MNYW +   NLSEC+ PLFD L  +
Sbjct: 322 SRDRHKSLPANLQGIWNPSFHPAWGSRFTINVNLQMNYWSANMGNLSECELPLFDLLERM 381

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
              G  TA++ Y   GW  H  TDIWA ++     +  ++WP+GGAWLC H+W+H+ YT 
Sbjct: 382 VEPGKVTARIMYGCRGWTAHPNTDIWADTAPFDRWMPASIWPLGGAWLCYHIWDHFRYTG 441

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSST 541
           D++FL +R +P L GC  FLLD+LIE  +G YL T+PSTSPE+ F    G+   +   ST
Sbjct: 442 DQNFL-RRMFPTLRGCVEFLLDFLIEDANGEYLVTSPSTSPENSFYDGKGQKGVLCEGST 500

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +D+ II  +  A  S A+ L   EDA++  V  +  R+ P +++  G + EW 
Sbjct: 501 IDIQIIDAILDAFQSCAKSLGL-EDAILPAVQATRSRIPPMRVSPAGYLQEWA 552


>gi|254446849|ref|ZP_05060324.1| hypothetical protein VDG1235_197 [Verrucomicrobiae bacterium
           DG1235]
 gi|198256274|gb|EDY80583.1| hypothetical protein VDG1235_197 [Verrucomicrobiae bacterium
           DG1235]
          Length = 800

 Score =  330 bits (845), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 209/613 (34%), Positives = 312/613 (50%), Gaps = 54/613 (8%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           +T+ T    + F+      T++IP+GNGRLGA  +G V  ET+ LNE  +W+G P +   
Sbjct: 21  ATAQTPERSVWFDSAGASLTESIPLGNGRLGASFFGMVEEETVILNESGMWSGSPQEADR 80

Query: 66  PDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIEL-- 110
            DA KAL +++ L+  G+ AEA A     F               P   YQ+L  + +  
Sbjct: 81  MDAHKALPEIKRLLLEGRNAEAEALVNANFTCAGRGSGYGGGANDPYGSYQILAKLHIVD 140

Query: 111 --EFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES 168
             E  D+ +K     YRRELDL TAT R  +  G V + RE F+S PD+ +V + + SE+
Sbjct: 141 RSESSDTVVK----NYRRELDLATATYRHSFERGGVGYIRESFASRPDEALVVRFTASEA 196

Query: 169 GSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
           G L  + SL           G + ++M G+          +      G++++ +L+   +
Sbjct: 197 GGLDLDFSLSREERMQVEPLGADALLMTGQL--------NDGYGGEDGVRYAGVLK---A 245

Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-----VASSSFDGPFINPSDSKKDPTSESMSALQ 283
             RG     E+ +L+V G+D  ++       +A  SF G  +      +DP + +   L 
Sbjct: 246 SARGGEVRSEEGRLEVRGADEVIVYFTTANDIAKRSFAGRMV------EDPIATAKLDLA 299

Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
            + + S+ +L  RH+  +++ + RVS+QL        ++  +            V  ++ 
Sbjct: 300 GVESYSFEELKRRHVAAFREYYGRVSLQLG-------SEELAASRAKVATPQRLVDHWEG 352

Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
            +DP L  L F FGRYLLISSSRPG Q ANLQGIW++ +   W+   H NIN++MNYW +
Sbjct: 353 VDDPDLAALYFDFGRYLLISSSRPGGQPANLQGIWSDTIQTPWNGDWHANINVQMNYWPA 412

Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
             CNLSE  EP+F  +  L   G KTA+  Y A GWV     + W  +S       W   
Sbjct: 413 ELCNLSELHEPMFKLIESLVEPGRKTAKAYYDAEGWVSFLLANPWGFTSPGE-SASWGST 471

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSP 522
               AWLC HLW+HY +T D  FL + AYP+L+  A F    L+E    G+L T PS SP
Sbjct: 472 VSCSAWLCQHLWDHYLFTKDEAFL-RWAYPILKDSAVFYSQMLMEDTRTGWLVTCPSNSP 530

Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
           E  F   +G+   VS   T+D  ++R +F A I AAE+L ++ +   E   KS  RL PT
Sbjct: 531 ESAFKLANGETVHVSMGPTIDQQLLRYLFGACIEAAEILGQDPEFAAELAEKS-ARLAPT 589

Query: 583 KIAEDGSIMEWVQ 595
           +I  DG +MEW++
Sbjct: 590 QIGSDGRVMEWLE 602


>gi|255035637|ref|YP_003086258.1| glycoside hydrolase [Dyadobacter fermentans DSM 18053]
 gi|254948393|gb|ACT93093.1| glycoside hydrolase family protein [Dyadobacter fermentans DSM
           18053]
          Length = 781

 Score =  329 bits (844), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 207/616 (33%), Positives = 319/616 (51%), Gaps = 59/616 (9%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
            +N  + +   PL++ +  PA  + + IP+GNGRLG M  GGV  ET+ LN+ TLW+G P
Sbjct: 13  FLNLAALAQQAPLRLWYTKPASQWEETIPLGNGRLGMMGDGGVTKETVVLNDITLWSGAP 72

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------GH------PADVYQLLGD 107
            D    DA ++L ++R L+ +G+  EA A   K F       GH      P   YQ+LG+
Sbjct: 73  QDANRYDAHESLPEIRRLILAGKNDEAQALVNKNFVAKGAGSGHGDGANVPFGCYQVLGN 132

Query: 108 IELEFDDSHLKYAE---ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS 164
           + LEF    +  A      Y+REL L+ A + V Y V  V +TRE+F+S  D + + KI+
Sbjct: 133 LHLEFGYKGVDTARVQVRDYKRELSLDEAVSSVIYQVNGVTYTREYFTSFGDDLGIIKIT 192

Query: 165 GSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
             + G L+  ++LD   +    V  NN + M G+          N   D KG+++   ++
Sbjct: 193 ADKPGQLNLRIALDRP-ERFQTVIKNNTLEMSGQL---------NNGTDGKGMRYLTKIK 242

Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
             +   + ++S    K++ +  +D  ++   A + F           K+  +E+   + +
Sbjct: 243 PLVKGGKTSVSG---KQIVISDADEIIVYFSAGTDF---------KNKNFETETQRLIDA 290

Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT- 343
               SYS     H  +YQKLF+R  I L  S  D             VP+ +R+ +FQ  
Sbjct: 291 AVKKSYSVQKNLHTTNYQKLFNRTKIHLGGSKGD------------GVPTDQRLSAFQKN 338

Query: 344 -DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
            ++D  L  L FQFGRYL ISS+R G    NLQG+W   +   W+   H+++N++MN+W 
Sbjct: 339 PEKDNELAVLYFQFGRYLSISSTRVGLLPPNLQGLWANQIRTPWNGDYHLDVNVQMNHWP 398

Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
               NLSE   PL D +  +   G KTA+  Y A+GWV H  T++W  +     +  W  
Sbjct: 399 VEVANLSELNLPLADLVKGMVKQGEKTAKAYYNANGWVAHVITNVWGYTEPGE-EASWGA 457

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTS 521
              G  W+C +LWEHY +T D+++L K  YP+L+G A F +  LI+    G+L T PS S
Sbjct: 458 SNAGSGWICNNLWEHYAFTHDKNYL-KDIYPVLKGSAEFYISALIKDPKTGWLVTAPSVS 516

Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRL 579
           PE+ F  P+GK A +    T+D  I RE+F+ +I+A EVL  + D    ++  LK LP  
Sbjct: 517 PENSFYLPNGKTAAICMGPTIDNQITRELFTNVITACEVLGVDADFAKSLQNKLKELP-- 574

Query: 580 RPTKIAEDGSIMEWVQ 595
            P  +  DG +MEW++
Sbjct: 575 PPGVVGSDGRLMEWLE 590


>gi|423298609|ref|ZP_17276665.1| hypothetical protein HMPREF1070_05330 [Bacteroides ovatus
           CL03T12C18]
 gi|392662352|gb|EIY55913.1| hypothetical protein HMPREF1070_05330 [Bacteroides ovatus
           CL03T12C18]
          Length = 822

 Score =  329 bits (844), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 214/593 (36%), Positives = 330/593 (55%), Gaps = 53/593 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+T+W G P +  NP+A + + 
Sbjct: 32  KLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNANPNALEYIP 91

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +Y+   Y RE
Sbjct: 92  KVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTRYS--NYYRE 145

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L S        
Sbjct: 146 LSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLTS-------- 197

Query: 188 NGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
              +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G   A  D  L
Sbjct: 198 --PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGGEIACADGIL 250

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            VE +D A++ +  +++F+    N  D   +    + + L       + +    H+D Y+
Sbjct: 251 SVEKADEAIIYVSIATNFN----NYQDITGNQIERAKNYLAKAMVHPFIESKRNHVDFYR 306

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           +   RVS+ L             E+    V + +RV++F+   D  LV   FQFGRYLLI
Sbjct: 307 QYLTRVSLDLG------------EDQYANVTTDKRVENFKNTNDTHLVATYFQFGRYLLI 354

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
            SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  EPLF  +  +
Sbjct: 355 CSSQPGGQPANLQGIWNDKLVPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFRLIKEV 414

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC HLWE Y YT 
Sbjct: 415 SDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRHLWERYLYTG 473

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK A  +   T
Sbjct: 474 DVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK-ATTAAGCT 531

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           MD  ++ ++++AIISA+++L+ + +     + + L  + P ++   G + EW+
Sbjct: 532 MDNQLVFDLWTAIISASQILDTDRE-FASHLTQRLKEMAPMQVGHWGQLQEWM 583


>gi|336416256|ref|ZP_08596592.1| hypothetical protein HMPREF1017_03700 [Bacteroides ovatus
           3_8_47FAA]
 gi|335938987|gb|EGN00866.1| hypothetical protein HMPREF1017_03700 [Bacteroides ovatus
           3_8_47FAA]
          Length = 822

 Score =  329 bits (844), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 214/593 (36%), Positives = 330/593 (55%), Gaps = 53/593 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+T+W G P +  NP+A + + 
Sbjct: 32  KLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNANPNALEYIP 91

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +Y+   Y RE
Sbjct: 92  KVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTRYS--NYYRE 145

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L S        
Sbjct: 146 LSLDSARAIVRYEVDGVQYQREMITSFTDQVVMVRLTANRPGQITFNAQLTS-------- 197

Query: 188 NGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
              +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G   A  D  L
Sbjct: 198 --PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGGEIACADGIL 250

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            VE +D A++ +  +++F+    N  D   +    + + L       + +    H+D Y+
Sbjct: 251 SVEKADEAIIYVSIATNFN----NYQDITGNQIERAKNYLAKAMVHPFIESKRNHIDFYR 306

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           +   RVS+ L             E+    V + +RV++F+   D  LV   FQFGRYLLI
Sbjct: 307 QYLTRVSLDLG------------EDQYANVTTDKRVENFKNTNDAHLVATYFQFGRYLLI 354

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
            SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  EPLF  +  +
Sbjct: 355 CSSQPGGQPANLQGIWNDKLVPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFRLIKEV 414

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC HLWE Y YT 
Sbjct: 415 SDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRHLWERYLYTG 473

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK A  +   T
Sbjct: 474 DVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK-ATTAAGCT 531

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           MD  ++ ++++AIISA+++L+ + +     + + L  + P ++   G + EW+
Sbjct: 532 MDNQLVFDLWTAIISASQILDTDRE-FASHLTQRLKEMAPMQVGHWGQLQEWM 583


>gi|423241186|ref|ZP_17222300.1| hypothetical protein HMPREF1065_02923 [Bacteroides dorei
           CL03T12C01]
 gi|392642334|gb|EIY36101.1| hypothetical protein HMPREF1065_02923 [Bacteroides dorei
           CL03T12C01]
          Length = 825

 Score =  329 bits (844), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 203/590 (34%), Positives = 318/590 (53%), Gaps = 45/590 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           KI ++ PA ++ +AIPIGNGR+ AMV+G    E L+LNE+T+  G P    N +   AL 
Sbjct: 27  KIWYDTPAHYWEEAIPIGNGRIAAMVFGNPQLEQLQLNEETISAGSPYQNYNKEGKGALK 86

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDDSHLKYAEETYRRELDL 130
           ++R L+  G Y EA   + K    P      YQ +G++ + + +       + Y RELDL
Sbjct: 87  EIRRLIFDGHYEEAQNMAEKKILSPVGREMPYQTVGNLNIRYKNHK---QIKKYYRELDL 143

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN-HSYVNG 189
             A A  +Y + +VE T E F+S  DQ+I+  I  S+ GS++  +   + +D       G
Sbjct: 144 TRAIATTRYQIKDVEITEETFASFTDQLIIKHIKSSKKGSINCELFFQTPMDAPKRSACG 203

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
             ++ +EG   G         N  P  + + A L +K SD  G + AL D  +KVE +  
Sbjct: 204 KKKLRLEGITSGN--------NHIPGKVHYCADLSVKNSD--GKVFALNDTLIKVEKATE 253

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ-SIRNLSYSDLYTRHLDDYQKLFHRV 308
             L +  +++F    +N  D   +P   +   L+ S+++   + +   H+  Y+K+F+RV
Sbjct: 254 ICLYVSMATNF----VNYKDISANPYERNEKYLKNSMKDFEKAKI--EHVAAYKKMFNRV 307

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
           +++L  SP+               P+  R+K F++  DP LV L FQFGRYLLISSS+PG
Sbjct: 308 TLELGHSPQI------------NKPTNIRLKEFESSYDPHLVSLYFQFGRYLLISSSQPG 355

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
            Q ANLQG WN  + P W S    NIN EMNYW +   NLSE  EPL   +   S +G +
Sbjct: 356 CQPANLQGKWNAKVRPPWSSNYTTNINTEMNYWPAEVTNLSELHEPLIQIIQDWSQSGRE 415

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           TA   Y   GWV+HH +D+W  + A DR      +WP  GAW+C HLW+ Y ++ ++++L
Sbjct: 416 TADQMYGCRGWVLHHNSDLWRVTGAVDRAYC--GVWPTAGAWMCQHLWDRYLFSGNKEYL 473

Query: 488 EKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
            K+ YP++   + F +D+L++  + GY    PS SPE+       K +  S  +TMD  +
Sbjct: 474 -KKIYPIMRSASKFFIDFLVQNPNTGYWVVGPSPSPENSPKKIKQKASLFS-GNTMDNQL 531

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWVQ 595
           I ++FS    AA++L  ++D+ +   LK++  +L P ++ E G + EW +
Sbjct: 532 IFDLFSNTCEAAKIL--SQDSTLCDTLKTMRNQLPPMQVGEYGQLQEWFE 579


>gi|153812246|ref|ZP_01964914.1| hypothetical protein RUMOBE_02645 [Ruminococcus obeum ATCC 29174]
 gi|149831653|gb|EDM86740.1| hypothetical protein RUMOBE_02645 [Ruminococcus obeum ATCC 29174]
          Length = 754

 Score =  329 bits (843), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 198/593 (33%), Positives = 301/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + F+ PA+ + +A+P+GNG +GAM +G +  E ++LN DTLW+G      N +     
Sbjct: 9   LTLAFDRPAEAWNEALPLGNGSMGAMSYGRLREEKIELNLDTLWSGTGRSKENKNTDVDW 68

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
             +R  +  G+Y EA A     + G   + Y   G++ ++ +   LK    +Y+R+L + 
Sbjct: 69  DFLRQKIFDGEYEEAEAYCKENILGDWTESYLPAGNLHIDANIPELK-EHGSYQRQLSIK 127

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
            A  +V Y      + RE F S  + V+          SL   +SLDS + +     G +
Sbjct: 128 DALEQVVYRQDGQGYLREFFVSMSEPVMALHYRADAGSSLELRISLDSQIRHVCSGYGTS 187

Query: 192 QIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           ++++EG+ P    P   +       ++ KG +F+  + I +   +G I   +D  L V  
Sbjct: 188 ELVLEGQAPVYAAPLYYSCEQPIVYEEGKGTRFA--IGISVQAPKGCIRQ-KDNTLLVTA 244

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
                + L   + F         ++    S     L+ I +LSY  L   H   Y   F 
Sbjct: 245 DGDVYIYLSGITDFQ--------AQDSYLSRKKQMLEQICDLSYPQLKEAHKKAYAAYFD 296

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           R+ + L                             Q D    L+  +F + RYL+ISSS+
Sbjct: 297 RMDLTLD-------------------------PGIQND----LITKMFHYARYLMISSSK 327

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PGTQ ANLQGIWN +L   W S   VNIN EMNYW +   NLS+C E LFD +   + +G
Sbjct: 328 PGTQCANLQGIWNHNLRAPWSSNYTVNINTEMNYWMAEKANLSDCHESLFDLIERTASHG 387

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS------ADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            KTA+  Y  +GWV HH  DIW  SS       D     +++WPM   WLC+HLWEHY Y
Sbjct: 388 KKTAKEVYHLNGWVSHHNVDIWGHSSPVGYFGQDENPCTYSMWPMSSGWLCSHLWEHYRY 447

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
           T+DR+FL K+A+PL+ G   F L +L+  +DGYL T PSTSPE+ F A D  +  V++ S
Sbjct: 448 TLDREFLRKKAFPLIRGAVEFYLGYLVP-YDGYLVTAPSTSPENTFTASDHSVHSVTFGS 506

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           TMD +I++E+F   + A E+L+  +  L+++V  +L +L P KI ++G + EW
Sbjct: 507 TMDCSILKELFGNYLKACEILDITD--LMDEVKAALKKLLPFKIGKEGQLQEW 557


>gi|349572636|gb|AEP84398.1| glycoside hydrolase family protein [bacterium enrichment culture
           clone g13]
          Length = 824

 Score =  328 bits (842), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 215/600 (35%), Positives = 315/600 (52%), Gaps = 53/600 (8%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
           ST    K+ +  PAK + +++P+GNGRLGAMV+G V S+ ++LNE+T W G P +  NP 
Sbjct: 21  STAVEQKLWYEQPAKQWEESLPLGNGRLGAMVYGDVLSDNIQLNENTFWAGGPHNNLNPA 80

Query: 68  APKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETY 124
           A  AL ++R L+  G Y  A   + K     G     YQ  G++ LEF + H  Y    Y
Sbjct: 81  ALNALPEIRRLITVGDYLAAEKLAAKTIASQGSNGMPYQTAGNLRLEFSE-HKNYNH--Y 137

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
            R+LD+ +A A  +Y V +V +TRE FSS  DQVIV K++ S+ G LSF+  +       
Sbjct: 138 YRDLDIGSAVATTRYRVNDVVYTREVFSSFVDQVIVVKLTASKRGQLSFDAYMSHPSAMV 197

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKL 242
                 N ++M+G+              D +GI+    L   + IS   G+I+   D ++
Sbjct: 198 FSREDANTLLMQGQSM------------DHEGIKGQVRLASLVNISTIGGSINQ-RDNRI 244

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY----TRHL 298
            V+ +D A++L+  +++F    +N  D   +  + +   +   +N   +D Y      H 
Sbjct: 245 TVKNADSALILVSMATNF----VNYKDVSANALARARHYMAQAKNNFANDHYELRKQAHS 300

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           + Y+  F RV + L +S         S+E+ D     +R+  F    DP L  L FQFGR
Sbjct: 301 NFYKNYFDRVILNLGKS-------EFSKESTD-----QRIALFSGRHDPELASLYFQFGR 348

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSS+PG Q ANLQG+WN    P WDS   +NIN EMNYW +   NLSE  EPL   
Sbjct: 349 YLLISSSQPGGQPANLQGLWNHRQDPPWDSKYTLNINAEMNYWPAEITNLSELHEPLITM 408

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
              LSI G ++A+  Y A GW+ HH TDIW  +        W  WP   AWL  HLWE Y
Sbjct: 409 TKELSITGQESAKTMYGARGWMAHHNTDIWRITGGV--DYTWGSWPTSSAWLSQHLWERY 466

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
            Y+ D+ +L +  YP+++    F  D+LI   +  +L  +PS SPE+   A   K+A   
Sbjct: 467 LYSGDKQYLAE-IYPVMKSAVVFFDDFLISSPNKKWLIVSPSMSPENVPKATGTKIAA-- 523

Query: 538 YSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
              TMD  ++ ++FS  I+AA++L  +K    L EK L  LP   P +I +   + EW++
Sbjct: 524 -GVTMDNQLLFDLFSNTIAAAKILGEDKQHIPLWEKTLSRLP---PMQIGKYHQLQEWLE 579


>gi|299145505|ref|ZP_07038573.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
 gi|298515996|gb|EFI39877.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
          Length = 822

 Score =  328 bits (841), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 214/593 (36%), Positives = 330/593 (55%), Gaps = 53/593 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G   +E ++LNE+T+W G P +  NP+A + + 
Sbjct: 32  KLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGHPNNNANPNALEYIP 91

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV +G+Y EA T A+ K+      G P   YQ  GD+ + F   H +Y+   Y RE
Sbjct: 92  KVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTRYS--NYYRE 145

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L L++A A V+Y V  V++ RE  +S  DQV++ +++ +  G ++FN  L S        
Sbjct: 146 LSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLTS-------- 197

Query: 188 NGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
              +Q +M    EG C    +   ++ ++  KG ++F   L  K   ++G   A  D  L
Sbjct: 198 --PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGGEIACADGIL 250

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            VE +D A++ +  +++F+    N  D   +    + + L       + +    H+D Y+
Sbjct: 251 SVEKADEAIIYVSIATNFN----NYQDITGNQIERAKNYLAKAMVHPFIESKRNHVDFYR 306

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           +   RVS+ L             E+    V + +RV++F+   D  LV   FQFGRYLLI
Sbjct: 307 QYLTRVSLDLG------------EDQYANVTTDKRVENFKNTNDAHLVATYFQFGRYLLI 354

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
            SS+PG Q ANLQGIWN+ L P+WDS    NINLEMNYW S   NLSE  EPLF  +  +
Sbjct: 355 CSSQPGGQPANLQGIWNDKLVPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFRLIKEV 414

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           S  G +TA++ Y A+GWV+HH TDIW  + A   K    +WP GGAWLC HLWE Y YT 
Sbjct: 415 SDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRHLWERYLYTG 473

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           D +FL +  YP+L+    F  + ++ E    +L   PS SPE+     +GK A  +   T
Sbjct: 474 DVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK-ATTAAGCT 531

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           MD  ++ ++++AIISA+++L+ + +     + + L  + P ++   G + EW+
Sbjct: 532 MDNQLVFDLWTAIISASQILDTDRE-FASHLTQRLKEMAPMQVGHWGQLQEWM 583


>gi|224536491|ref|ZP_03677030.1| hypothetical protein BACCELL_01366 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521893|gb|EEF90998.1| hypothetical protein BACCELL_01366 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 815

 Score =  328 bits (841), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 211/601 (35%), Positives = 316/601 (52%), Gaps = 60/601 (9%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKALS 73
           I ++ PA+ + +A+PIGNGRLGAM +GG+  E L+LN+ T+W+G P   ++  DA K L 
Sbjct: 35  IWYDKPAEIWEEALPIGNGRLGAMCFGGIHEEKLQLNDVTIWSGEPQPNSDRTDAYKKLP 94

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPA----DVY--------QLLGDIELEFDDSHLKYAE 121
           ++R  + +  Y  A   + +     +    D+Y        Q LGD+ L+F+    +   
Sbjct: 95  EIREALRNRDYKLAEVLTHQYMTCNSVSNEDIYNTIYSSSYQTLGDLSLKFELPEGEMG- 153

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
            +YRR LD+  A + V + +G   F+RE FSS PD VIV K+     G LSF++ LD   
Sbjct: 154 -SYRRWLDITRAISGVDFKIGEYSFSREIFSSAPDSVIVMKLGTDMKGGLSFSMLLDRKF 212

Query: 182 ------DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
                 D+H  V   N   ME R          N + + +         +K+  D G +S
Sbjct: 213 SAVTTSDSHGLVMKGNTDYMEHR---------GNCDYEAR---------VKVVADGGRVS 254

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
                K+ V+G+D A + +   +S+   +        D + +++  L  +    Y D+ +
Sbjct: 255 N-SKGKISVQGADSAYVYITCQTSYILDYKKNYRRAID-SKDAVRKLNIVSRKKYDDVKS 312

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLF 354
            H+ DYQ +F+R+S+ L            + ++ID +P+ +R+  F +  +D   V+L +
Sbjct: 313 IHVADYQGIFNRLSLNLG-----------NNKSID-IPTDQRLTRFNEKSDDLGFVDLFY 360

Query: 355 QFGRYLLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           QFGRYL+ISSSR    +  N QGIW +     W S    NIN +MNYW     NLSEC  
Sbjct: 361 QFGRYLMISSSRENNPLPGNCQGIWGDGYKLPWHSDYKANINYQMNYWMVEASNLSECHI 420

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           P+      L   G KTAQ  + ASGW+    T+ W  +S  +   +W  +  G  W C  
Sbjct: 421 PMLRLTASLVEPGRKTAQSYFNASGWMYAMMTNAWGWTSPGQ-YTIWGSFFGGSGWACQD 479

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
            WEHY YT D+++L K  YP+L+    F L  LIE  DGYL T+PSTSPE+ +IAPDG  
Sbjct: 480 FWEHYAYTQDKEYLRK-VYPILKEACEFYLSVLIENKDGYLVTSPSTSPENRYIAPDGSR 538

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRLRPTKIAEDGSIME 592
             V+  ST++++IIR +FS  I A  +L  NED   +++L KSL RLRP +I   G +ME
Sbjct: 539 VAVTEGSTIELSIIRNLFSNTIYATGIL--NEDNSFKEILEKSLARLRPLQIGRAGQLME 596

Query: 593 W 593
           W
Sbjct: 597 W 597


>gi|288929797|ref|ZP_06423640.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
           (Alpha-1,2-fucosidase 2) [Prevotella sp. oral taxon 317
           str. F0108]
 gi|288328898|gb|EFC67486.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
           (Alpha-1,2-fucosidase 2) [Prevotella sp. oral taxon 317
           str. F0108]
          Length = 792

 Score =  328 bits (840), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 213/597 (35%), Positives = 323/597 (54%), Gaps = 44/597 (7%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP-- 69
           PL+I  N P   F +++PIGNG+LGAMV G    + LKLN+ TLW+G P D  N DA   
Sbjct: 24  PLRIWDNRPGSFFENSMPIGNGKLGAMVDGNPHCDYLKLNDITLWSGKPID-PNEDAGAH 82

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLG-----DIELEFD-DSHLKYAEET 123
           K +  +R  +    YA A +  +++ GH +  YQ L      D++   + D+ LK     
Sbjct: 83  KWIPQIRKALFEENYALADSLQLRVQGHNSAWYQPLSTLCICDVKAAANADAPLK----N 138

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YRRELDL+++  +V Y    V + RE+F+S+P + I+ +++ ++  ++S  +SL SLL++
Sbjct: 139 YRRELDLDSSLVKVSYESEGVSYRREYFASHPGRAIMVRLTANKPHAISLQLSLTSLLNH 198

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
            + V GN   +M             +A   P   + F  +L+ K +   GTI+A +D  L
Sbjct: 199 QTRVEGNTIRLM------------GHAEGHPDSTVHFCNLLQAKATG--GTITA-QDSTL 243

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            +  +   VL +V  +S++G   +P          + + L++++N ++  L   H DDYQ
Sbjct: 244 LISNATQVVLYIVNETSYNGFDKHPVTQGAPYVQLAETDLKNLQNCTFEQLKQNHTDDYQ 303

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
            LF R+++ L  +  D+   T  ++  D     E         +P L  L FQFGRYLLI
Sbjct: 304 ALFGRLALHLDGTKLDM-HRTTEQQLQDYTKRGE--------TNPYLETLYFQFGRYLLI 354

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           SSSR     ANLQG+WN  +   W S   VNINLE NYW +   NL+E   PL   +  L
Sbjct: 355 SSSRTPGVPANLQGLWNPHVRAPWRSNYTVNINLEENYWPAQVANLAELTTPLVGMVKAL 414

Query: 423 SINGSKTAQVNY-LASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEHY 478
           S+NG   A+  Y +  GW   H TD+WA ++     R    WA W +GGAWL ++LWE Y
Sbjct: 415 SVNGRYAARNYYGINEGWCSSHNTDLWAMTNPVGEKRESPEWANWNLGGAWLLSNLWEQY 474

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACV 536
           ++T DR +L    YPL++G   F+L WL+E     G L T PSTSPE+E++ PDG     
Sbjct: 475 DFTRDRHYLRHTLYPLMKGACDFMLQWLVENPKQPGELITAPSTSPENEYVTPDGYHGTT 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            Y  T D+AI+RE+F+   +A E+L     A  + + +++ RL P  I ++G + EW
Sbjct: 535 VYGGTADLAILRELFANTATADEILNGRPTAYSKILRQTIGRLHPYTIGKEGDLNEW 591


>gi|300777551|ref|ZP_07087409.1| alpha-L-fucosidase [Chryseobacterium gleum ATCC 35910]
 gi|300503061|gb|EFK34201.1| alpha-L-fucosidase [Chryseobacterium gleum ATCC 35910]
          Length = 836

 Score =  328 bits (840), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 208/598 (34%), Positives = 314/598 (52%), Gaps = 63/598 (10%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PAK + +A+P+GNG + AMV+G    E L+LNE T W+G P    NPDAPK L 
Sbjct: 26  KLWYDKPAKQWVEALPVGNGNMAAMVYGDPYQEKLQLNEGTFWSGGPSRNDNPDAPKVLD 85

Query: 74  DVRSLVDSGQYAEA--------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
            +R  +  G Y  A        TA +V         +Q +GD  L+ ++  LK     Y 
Sbjct: 86  SIRYYLFHGNYKRAQILADKGLTAKTVH-----GSAFQNIGDFTLDLNN--LKEIR-NYY 137

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           RELD+  A A   ++ G + F RE F+S PD VIV K+S     +L+F    +S L  + 
Sbjct: 138 RELDIEKAIATTTFTSGGIYFKREVFASIPDHVIVIKLSSDHKNALNFTAKFNSELKKNV 197

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
                N + M+G          +  +  P  ++F+A+ +      +G  +   ++ + V 
Sbjct: 198 KAIDANTLQMDGIS--------STLDGIPGQVKFNALAKFIT---KGGKTQTSEEGISVS 246

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            +   ++L+  +++F     +  +   D  +++   +++  N S+  L   HL+ YQ  F
Sbjct: 247 NAHEVMILISIATNF----TDYKNLNTDEVAKARKYIEAAANKSFKTLVQNHLNAYQNYF 302

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            RV + L  S         + +N    P+  R+K+F T  DP L+ L +QFGRYLLISSS
Sbjct: 303 KRVDLNLGTSE--------AAKN----PTDVRIKNFATGYDPELISLYYQFGRYLLISSS 350

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           +PG Q ANLQGIWN    P WDS   +NIN EMNYW +   NLSE  EPL   +  LS  
Sbjct: 351 QPGGQPANLQGIWNNSNKPAWDSKYTININTEMNYWPAEKTNLSEMHEPLIQMIKDLSET 410

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTM 482
           G +TA+  Y + GWV HH TDIW  +    G V +A   +WPMGGAWL  HLWE Y Y+ 
Sbjct: 411 GKETAKTMYNSRGWVAHHNTDIWRIT----GVVDFANAGMWPMGGAWLSQHLWEKYLYSG 466

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDG-KLACVSYS 539
           D  +L +  YP+L+  A F  D+LIE   H  +L  +PS SPE+    P G + + ++  
Sbjct: 467 DEHYL-RTIYPVLKSAAQFYEDFLIEEPAHH-WLVASPSMSPEN---IPQGHQGSALAAG 521

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALV--EKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +TMD  ++ ++F+    AA++L  + D +     ++  LP   P KI   G + EW++
Sbjct: 522 NTMDNQLMFDLFTKTKKAAQILNTDSDKIQVWNTIISKLP---PMKIGSYGQLQEWME 576


>gi|423303028|ref|ZP_17281049.1| hypothetical protein HMPREF1057_04190 [Bacteroides finegoldii
           CL09T03C10]
 gi|408470357|gb|EKJ88892.1| hypothetical protein HMPREF1057_04190 [Bacteroides finegoldii
           CL09T03C10]
          Length = 1100

 Score =  327 bits (838), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 207/592 (34%), Positives = 301/592 (50%), Gaps = 49/592 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +N PA+ + +A+PIGN RLGAMV+GG   E L++NE+T W G P    +P A   L
Sbjct: 288 LKLWYNRPAQRWEEALPIGNSRLGAMVYGGAGHEELQINEETFWAGGPHHNNSPKAKAVL 347

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
            + R L+   +  EA    +   F  P  +  L     L     H K     Y RELD+ 
Sbjct: 348 DEARRLIFEDKTMEAQKLINPNFFSGPHGMSYLNMGSLLILQPGHEK--ATNYYRELDIE 405

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY----- 186
            ATA   Y V  V +TR  FSS  DQVI+ ++  +  G+L F++  D+  +   +     
Sbjct: 406 DATATTCYEVDGVTYTRTAFSSMTDQVIIVRLEANRKGALQFSLGYDTPKEADGFAPLHP 465

Query: 187 ---VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
              V GN   +   +C G      A+A             ++++  D   ++  +  +L 
Sbjct: 466 IVKVRGNRLTM---QCTGMEQEGVASA--------IKGEWQVQVVHDGKQVN--QPDRLG 512

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V+G+  A + L A+++F    +N  D   + +  + + L++     Y      H   YQ 
Sbjct: 513 VQGATTATVYLSAATNF----VNYKDVSGNASRRAAAYLKTALKQPYPKALEAHSKAYQT 568

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
            F+RV + L   P  I +           P+ +RV  F   +D +L+ LL+Q+GRYLLI 
Sbjct: 569 QFNRVKLDL---PATIAS---------LAPTNQRVADFNRVDDRNLMALLYQYGRYLLIC 616

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SS+PG Q ANLQGIW   L   WDS   +NIN EMNYW +   NLSEC EPLF  L  LS
Sbjct: 617 SSQPGGQPANLQGIWCRSLHAPWDSKYTININTEMNYWPAEVTNLSECHEPLFSMLEDLS 676

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
           + G +TA+  Y A GWV HH TD+W  +    G   W +WP GGAWLC HLW+HY YT D
Sbjct: 677 VTGHETARTLYGAKGWVAHHNTDLWRIAGPVDG-ATWGMWPNGGAWLCQHLWQHYLYTGD 735

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           + FL K  YP+++G A F++  L++    G+L T PS SPEH + A      C     TM
Sbjct: 736 QAFLRKY-YPVMKGAADFMMSHLVKHPKYGWLVTVPSVSPEHGYTASTLTAGC-----TM 789

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           D  I  ++ +    AA +L +   A  + +  +  +L P +I +   I EW+
Sbjct: 790 DNQIAFDILNNTRLAATILGE-PTAYQDSLQATCTQLPPMQIGKYNQIQEWM 840


>gi|423223626|ref|ZP_17210095.1| hypothetical protein HMPREF1062_02281 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638251|gb|EIY32098.1| hypothetical protein HMPREF1062_02281 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 814

 Score =  327 bits (838), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 211/601 (35%), Positives = 315/601 (52%), Gaps = 60/601 (9%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKALS 73
           I ++ PA+ + +A+PIGNGRLGAM +GG+  E L+LN+ T+W+G P   ++  DA K L 
Sbjct: 34  IWYDKPAEIWEEALPIGNGRLGAMCFGGIHEEKLQLNDVTIWSGEPQPNSDRTDAYKKLP 93

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPA----DVY--------QLLGDIELEFDDSHLKYAE 121
           ++R  + +  Y  A   + +     +    D+Y        Q LGD+ L+F     +   
Sbjct: 94  EIREALRNRDYKLAEVLTHQYMTCNSVSNEDIYNTIYSSSYQTLGDLSLKFKLPEGEMG- 152

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
            +YRR LD+  A + V + +G   F+RE FSS PD VIV K+     G LSF++ LD   
Sbjct: 153 -SYRRWLDITRAISGVDFKIGEYSFSREIFSSAPDSVIVMKLGTDMKGGLSFSMLLDRKF 211

Query: 182 ------DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
                 D+H  V   N   ME R          N + + +         +K+  D G +S
Sbjct: 212 SAVTTSDSHGLVMKGNTDYMEHR---------GNCDYEAR---------VKVVADGGRVS 253

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
                K+ V+G+D A + +   +S+   +        D + +++  L  +    Y D+ +
Sbjct: 254 N-SKGKISVQGADSAYVYITCQTSYILDYKKNYRRAID-SKDAVRKLNIVSRKKYDDVKS 311

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLF 354
            H+ DYQ +F+R+S+ L            + ++ID +P+ +R+  F +  +D   V+L +
Sbjct: 312 IHVADYQGIFNRLSLNLG-----------NNKSID-IPTDQRLTRFNEKSDDLGFVDLFY 359

Query: 355 QFGRYLLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           QFGRYL+ISSSR    +  N QGIW +     W S    NIN +MNYW     NLSEC  
Sbjct: 360 QFGRYLMISSSRENNPLPGNCQGIWGDGYKLPWHSDYKANINYQMNYWMVEASNLSECHI 419

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           P+      L   G KTAQ  + ASGW+    T+ W  +S  +   +W  +  G  W C  
Sbjct: 420 PMLRLTASLVEPGRKTAQSYFNASGWMYAMMTNAWGWTSPGQ-YTIWGSFFGGSGWACQD 478

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
            WEHY YT D+++L K  YP+L+    F L  LIE  DGYL T+PSTSPE+ +IAPDG  
Sbjct: 479 FWEHYAYTQDKEYLRK-VYPILKEACEFYLSVLIENKDGYLVTSPSTSPENRYIAPDGSR 537

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRLRPTKIAEDGSIME 592
             V+  ST++++IIR +FS  I A  +L  NED   +++L KSL RLRP +I   G +ME
Sbjct: 538 VAVTEGSTIELSIIRNLFSNTIYATGIL--NEDNSFKEILEKSLARLRPLQIGRAGQLME 595

Query: 593 W 593
           W
Sbjct: 596 W 596


>gi|389793150|ref|ZP_10196324.1| hypothetical protein UU9_03133 [Rhodanobacter fulvus Jip2]
 gi|388434883|gb|EIL91810.1| hypothetical protein UU9_03133 [Rhodanobacter fulvus Jip2]
          Length = 802

 Score =  327 bits (838), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 205/605 (33%), Positives = 311/605 (51%), Gaps = 42/605 (6%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           + +  S +  N L++ ++ PA  F +A+P+GNGR+G MV+GGV      L+E ++++G  
Sbjct: 28  LFSGASLAAQN-LQLHYDAPANTFNEALPLGNGRMGVMVYGGVQQARYSLSEISMFSGSR 86

Query: 61  GDYTN-PDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADV----YQLLGDIELEF 112
            D  +  +A   L  +R L+  G+  EA   + + F   G  A+     YQ LG + L+F
Sbjct: 87  YDGADRKEAVNYLPKIRQLLLQGRNVEAEQLTNQHFTWSGEGANAHYGTYQGLGTLTLDF 146

Query: 113 DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
             +    ++  YRR LD+ +AT+ V+Y+   V + RE F S PDQV+V  +S   +G+L+
Sbjct: 147 AANAAPVSD--YRRRLDIPSATSDVRYAQDGVRYRREMFVSAPDQVMVLHLSADRAGALN 204

Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
           F   LD         +G N ++M G           ++    KG+ F+A + +      G
Sbjct: 205 FVARLDRAERASVEGDGANGLLMRGEL---------DSGGSGKGLAFAARVRVIAP---G 252

Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
                +   ++VE      +L+  ++ +DG          DP + S + LQ + + S + 
Sbjct: 253 ASMHADAHGIRVEHGTDVTVLISEATDYDG---FAGRHTTDPVAASATDLQRVASRSVAQ 309

Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
           L+  H+ D+   F R S+QL             +   +T+    R+ ++    DP    L
Sbjct: 310 LHAAHVADFSSWFDRFSLQLG----------SVDNTRETMSMRARLDTYGASGDPGFAAL 359

Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
            FQ+ RYLLISSSRPG   ANLQG+W E  S  W+   H N+N+EMNYW + P  L E  
Sbjct: 360 YFQYARYLLISSSRPGGLPANLQGLWAEGTSTPWNGDYHTNVNIEMNYWPAEPTGLGELV 419

Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
           +PLF     L   G+KTAQ  Y A GWV+H  T++W   +A   +  W +W    AWL  
Sbjct: 420 QPLFALTASLQQPGAKTAQRYYGARGWVVHTLTNLWG-FTAPGAEASWGVWQGAPAWLSF 478

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPD 530
           H+W+HY YT DRDFL +R YP+L G A F  D LIE   H  +L T PS+SPE+     +
Sbjct: 479 HIWDHYRYTGDRDFL-RRYYPVLRGAAQFYADVLIEEPSHH-WLVTAPSSSPENTVYMEN 536

Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
           G  A +    TMD  +IR +F A+I A++ L  + D   E   K   RL P +I  DG I
Sbjct: 537 GGKAAIVMGPTMDEELIRFLFGAVIEASQTLHVDADFRRELEAKR-ARLAPIQIGPDGRI 595

Query: 591 MEWVQ 595
            E+++
Sbjct: 596 QEYLK 600


>gi|259485946|tpe|CBF83399.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
          Length = 757

 Score =  327 bits (837), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 200/594 (33%), Positives = 309/594 (52%), Gaps = 46/594 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + +A+P+GNGRLGAMV+G   +E L+LNED++W G P +    DA K L 
Sbjct: 3   ELWYQRPAAGWDEALPVGNGRLGAMVYGRTDTELLQLNEDSVWYGGPQNRLPEDALKCLP 62

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
            +R L+  G + EA   A    F  P     Y+ LG + LEF   H       YRR LDL
Sbjct: 63  RLRELIREGAHKEAERLARRAFFASPNSQRHYEPLGTLFLEF--GHPCEEVTGYRRSLDL 120

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           N     V Y    V++ R+  +S PD V+  ++  S        +S  S L+ +      
Sbjct: 121 NEGITHVHYEHNGVQYHRQVIASYPDNVLAMRVQASRCSEFLVRLSRLSELE-YETNEFL 179

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI-SDDRGTISA-LEDKKLKVEGSD 248
           + ++++G+     + P    ++     +   ++ I+  SDD+  I      K L +   D
Sbjct: 180 DDLVVDGQSIKMHVTPGGKDSN-----RACCMVAIRCGSDDQEPIKVDCVGKNLIINARD 234

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
            A++++VA S++            D    +++ L+++   S  D++ RH+ DYQ L+ R+
Sbjct: 235 -ALIVIVAQSTY-------RCDDADLDRATVADLEAVLASSVEDIWARHITDYQSLYGRL 286

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
            + L     DI TD             +R+   +    P LV +  ++ RYLLIS SRPG
Sbjct: 287 ELNLGPDATDIPTD-------------QRILHVR---GPELVAIYLRYSRYLLISCSRPG 330

Query: 369 TQ-------VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
            +        A LQGIWN    P W     +NINL+MNYW +   NL EC+EPLF  L  
Sbjct: 331 RKGSSDRVLPATLQGIWNASFHPPWGCRYTININLQMNYWPANVGNLLECEEPLFALLER 390

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           L++ G++TA+  Y   GW +HH TD+WA ++     +   LWP+GGAWLCTH+WE + + 
Sbjct: 391 LAVTGTETARKMYGCRGWTVHHNTDLWADTAPVDRWMPATLWPLGGAWLCTHVWERFLFN 450

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSS 540
            ++ FL KR +P+L GC  FL D+L++   G Y  TNPS SPE+ F    G+   +   S
Sbjct: 451 GNKAFL-KRMFPVLRGCVEFLQDFLVDDVSGQYKVTNPSLSPENTFRDEKGQEGVLCEGS 509

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           T+D+ ++R V  A + + EVL  ++D L+  V  +L RL P +I   G + EW+
Sbjct: 510 TIDIQLVRAVLKAFVESLEVLGYSQDELLPSVHDTLRRLPPARIGSKGQLQEWM 563


>gi|402307321|ref|ZP_10826347.1| putative lipoprotein [Prevotella sp. MSX73]
 gi|400378835|gb|EJP31686.1| putative lipoprotein [Prevotella sp. MSX73]
          Length = 796

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 206/595 (34%), Positives = 313/595 (52%), Gaps = 38/595 (6%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
           PLK+ +N PA  F +A+PIGNGRLGA+V+GG  ++++ +N+ TLWTG P +     DA +
Sbjct: 26  PLKLWYNRPATAFEEALPIGNGRLGALVYGGADTDSIYINDLTLWTGKPVELNEGGDAHR 85

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEE--TYR 125
            +  +R  + +G Y  A      + GH ++ YQ   LL   +L    +  +  E+    +
Sbjct: 86  WIPVIRKELIAGNYKNADILQHLVQGHNSEYYQSLALLTLTDLNASPAEGRSGEDFGGLK 145

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           R LD+++A  R  Y  G V + RE+F+S PD +I  +I  + SG+++  ++L S++ +  
Sbjct: 146 RSLDIDSAVCRDTYHRGGVTYIREYFASAPDSLIAIRIRANRSGAINCRLALTSVVPHQV 205

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
              G  Q+ M G   G          D  + I F AIL++K  D  G ++A  D  L V 
Sbjct: 206 KATGR-QLTMTGHAIG----------DPLQSIHFCAILKVKTDD--GQVAA-SDSSLTVN 251

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G+    +  V  +SF+G   +P  +     +++   +    N++Y++   RH+ DY++LF
Sbjct: 252 GASEVTVYFVNRTSFNGFDKHPVTAGAPYLAKATDDIWHTENITYNECRQRHVTDYKRLF 311

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            R    LS +  D  + T  E+ +    + ER        +P L  L  Q+GRYLLIS S
Sbjct: 312 DRFRFTLSGAKPD-YSRTTEEQLMAYSDNGER--------NPYLEMLYMQYGRYLLISCS 362

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           R     ANLQG+W       W     +NINLE NYW +   +L E   P+   +  ++  
Sbjct: 363 RTPGVPANLQGLWAPQKYSPWRGNYTININLEENYWPAEVTDLGELVMPVDGLVRAMAAT 422

Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           G  TA   Y +  GW   H +DIWA ++     +    W+ W MGGAWL   LW+HY++T
Sbjct: 423 GRHTAAHYYGIDEGWCAGHNSDIWAMTNPVGTGKESPKWSNWNMGGAWLVQTLWDHYDFT 482

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
            D  +L   AYPL++G A F+L WL+E     G L T P TSPE E+I   G   C  Y 
Sbjct: 483 RDTHYLRNTAYPLMKGAADFMLRWLVESPVKRGELITAPCTSPEAEYINDKGYQGCTFYG 542

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEW 593
            T D+AI+RE+F+  + AAE+L  N DA   + L+ SL  L P KI + G++ EW
Sbjct: 543 GTSDLAIVRELFTNTLRAAEIL--NIDAGYRQQLRSSLDHLAPYKIGKRGNLQEW 595


>gi|67525297|ref|XP_660710.1| hypothetical protein AN3106.2 [Aspergillus nidulans FGSC A4]
 gi|40744501|gb|EAA63677.1| hypothetical protein AN3106.2 [Aspergillus nidulans FGSC A4]
          Length = 1679

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 200/594 (33%), Positives = 309/594 (52%), Gaps = 46/594 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + +A+P+GNGRLGAMV+G   +E L+LNED++W G P +    DA K L 
Sbjct: 3   ELWYQRPAAGWDEALPVGNGRLGAMVYGRTDTELLQLNEDSVWYGGPQNRLPEDALKCLP 62

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
            +R L+  G + EA   A    F  P     Y+ LG + LEF   H       YRR LDL
Sbjct: 63  RLRELIREGAHKEAERLARRAFFASPNSQRHYEPLGTLFLEF--GHPCEEVTGYRRSLDL 120

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           N     V Y    V++ R+  +S PD V+  ++  S        +S  S L+ +      
Sbjct: 121 NEGITHVHYEHNGVQYHRQVIASYPDNVLAMRVQASRCSEFLVRLSRLSELE-YETNEFL 179

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI-SDDRGTISA-LEDKKLKVEGSD 248
           + ++++G+     + P    ++     +   ++ I+  SDD+  I      K L +   D
Sbjct: 180 DDLVVDGQSIKMHVTPGGKDSN-----RACCMVAIRCGSDDQEPIKVDCVGKNLIINARD 234

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
            A++++VA S++            D    +++ L+++   S  D++ RH+ DYQ L+ R+
Sbjct: 235 -ALIVIVAQSTY-------RCDDADLDRATVADLEAVLASSVEDIWARHITDYQSLYGRL 286

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
            + L     DI TD             +R+   +    P LV +  ++ RYLLIS SRPG
Sbjct: 287 ELNLGPDATDIPTD-------------QRILHVR---GPELVAIYLRYSRYLLISCSRPG 330

Query: 369 TQ-------VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
            +        A LQGIWN    P W     +NINL+MNYW +   NL EC+EPLF  L  
Sbjct: 331 RKGSSDRVLPATLQGIWNASFHPPWGCRYTININLQMNYWPANVGNLLECEEPLFALLER 390

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           L++ G++TA+  Y   GW +HH TD+WA ++     +   LWP+GGAWLCTH+WE + + 
Sbjct: 391 LAVTGTETARKMYGCRGWTVHHNTDLWADTAPVDRWMPATLWPLGGAWLCTHVWERFLFN 450

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSS 540
            ++ FL KR +P+L GC  FL D+L++   G Y  TNPS SPE+ F    G+   +   S
Sbjct: 451 GNKAFL-KRMFPVLRGCVEFLQDFLVDDVSGQYKVTNPSLSPENTFRDEKGQEGVLCEGS 509

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           T+D+ ++R V  A + + EVL  ++D L+  V  +L RL P +I   G + EW+
Sbjct: 510 TIDIQLVRAVLKAFVESLEVLGYSQDELLPSVHDTLRRLPPARIGSKGQLQEWM 563


>gi|427387089|ref|ZP_18883145.1| hypothetical protein HMPREF9447_04178 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725694|gb|EKU88563.1| hypothetical protein HMPREF9447_04178 [Bacteroides oleiciplenus YIT
           12058]
          Length = 826

 Score =  326 bits (836), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 198/603 (32%), Positives = 312/603 (51%), Gaps = 48/603 (7%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N ++      LK+ ++ PA  + +A+P+GNGR+GAMV+G    E  +LNE+T+W G P +
Sbjct: 17  NVQAQQADETLKLWYDTPATQWVEALPLGNGRIGAMVFGDPVHEQFQLNEETVWGGSPHN 76

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATA------ASVKLFGHPADVYQLLGDIELEFDDSH 116
            TNP A +AL  +R L+  G+ AEA A       S    G P   YQ +G + L+FD   
Sbjct: 77  NTNPKAKEALPRIRQLIFEGKNAEAQALCGPAICSQSANGMP---YQTVGTLHLDFDGIS 133

Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
             Y +  Y R+LD+  A +  +++   V +TRE ++S PDQV+V +++ S+  S+SF   
Sbjct: 134 -NYTD--YYRDLDIEKAISTTRFTANGVTYTREAYTSFPDQVLVIRLTASQKKSISFTAK 190

Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK---GIQFSAILEIKISDDRGT 233
                    Y     + I+    P K +     AND       ++F+ +   +I +  G 
Sbjct: 191 ---------YTTPYKENIVRCISPRKELQLNGKANDHEGIEGKVEFTTL--TRIENSGGN 239

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
           +  L D  L+V+ ++ +V L V   S    F+N  D   +  + +   L ++ N +Y+  
Sbjct: 240 LEVLSDSTLQVKNAN-SVTLYV---SIGTNFVNYKDVSGNAQTTAQKYLANV-NKNYTKS 294

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H   YQK F+RVS+ L R+ +               P+  RVK F +  DP +  L 
Sbjct: 295 KATHTSTYQKFFNRVSLDLGRNAQA------------DKPTDVRVKEFSSSFDPQMAALY 342

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+P  Q ANLQGIWN  L   WD     +IN+EMNYW +   +L E  E
Sbjct: 343 FQFGRYLLICSSQPDGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMHE 402

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           P    +  ++I G K+A + Y   GW +HH TDIW  + A  G   + +WP   AW C H
Sbjct: 403 PFLQLVKEVAIQGRKSAAM-YGCRGWTLHHNTDIWRSTGAVDGP-GYGIWPTCNAWFCQH 460

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
           LW+ Y ++ D+++L +  YPL+ G   F LD+L+ E  + +L   PS SPE+  +    +
Sbjct: 461 LWDRYLFSGDKNYLAE-VYPLMRGACEFYLDFLVREPENNWLVVAPSYSPENRPVVNGKR 519

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
              V   +TMD  ++ ++F   I+AA+++ +N     + +   +  L P ++   G + E
Sbjct: 520 DFVVVAGATMDNQMVYDLFYNTIAAAQLMNENT-TFTDSLQTVVNHLAPMQVGRWGQLQE 578

Query: 593 WVQ 595
           W+ 
Sbjct: 579 WMH 581


>gi|406660853|ref|ZP_11068981.1| hypothetical protein B879_00989 [Cecembia lonarensis LW9]
 gi|405555406|gb|EKB50440.1| hypothetical protein B879_00989 [Cecembia lonarensis LW9]
          Length = 778

 Score =  326 bits (836), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 201/589 (34%), Positives = 302/589 (51%), Gaps = 38/589 (6%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA-PKALSDV 75
           +  PA  + +A+P+GNGRLGAMV+G   +E ++LNED+LW G P D+   +  P+ L  +
Sbjct: 28  YEKPASIWEEALPLGNGRLGAMVFGQTSTERIQLNEDSLWPGGPDDWGPAEGKPEDLEFI 87

Query: 76  RSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           R L+  G+  +A +  V  F   +    +Q LGD+ L+     +      YRRELDL+ A
Sbjct: 88  RQLLLHGENKKADSLLVAKFSRKSITRSHQTLGDLWLDLGHEEVS----NYRRELDLDRA 143

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS-----YVN 188
              + Y+V    F ++ FSS PDQ IV ++       ++  + L    D+          
Sbjct: 144 LVTISYTVEGYVFLQKVFSSAPDQAIVIRLESKHPKGINGKIRLSRPEDDGYPTVTVQAT 203

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
            N  + MEG    +R    +  +    G++F  I  + I ++ G      D  +++EG +
Sbjct: 204 SNQTLQMEGEITQRRGQIDSKPSPILHGVKFQTI--VFIENESGKTFQKGDH-IELEGVE 260

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
              + LV ++S+           +D   ++   LQ+I+  ++ +L  RH+ DYQ LF RV
Sbjct: 261 ALNIKLVTNTSY---------YHQDFQRKNQEQLQNIKAKTFEELEQRHITDYQSLFQRV 311

Query: 309 SIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
              L   +P DI TD             ERVK  + + D  L  LLF FGRYLLISSSRP
Sbjct: 312 KFSLEEPNPLDIPTDQ----------RIERVK--EGNSDLYLESLLFDFGRYLLISSSRP 359

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           GT  ANLQG+WN  +   W++  H+NINL+MNYW +   NLSE  EP FD++  L ++G 
Sbjct: 360 GTLPANLQGLWNRHIEAPWNADYHLNINLQMNYWPAEVTNLSELHEPFFDYMDQLILSGK 419

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           KTA+  Y   G  + H +D+W  +     +  W  W   G W+  H WE Y +T D++FL
Sbjct: 420 KTARETYGMRGSALAHGSDLWHMTFLQAAQAYWGAWLGAGGWMMQHFWERYLFTQDKNFL 479

Query: 488 EKRAYPLLEGCASFLLDWLIE-GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
            +R  P +E  A+F LDWL+    DG   ++PSTSPE+ FI   G+    +  + MD  I
Sbjct: 480 RQRFLPAMEEIAAFYLDWLVPYPEDGTWVSSPSTSPENSFINAKGESVASTMGAAMDQQI 539

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           I EVF   + A+++L      L E   K        +   DG ++EW Q
Sbjct: 540 IAEVFDHFMQASKILGYQSPVLDEVKSKRQNLRSGLRTGNDGRLLEWDQ 588


>gi|404448807|ref|ZP_11013799.1| hypothetical protein A33Q_05728 [Indibacter alkaliphilus LW1]
 gi|403765531|gb|EJZ26409.1| hypothetical protein A33Q_05728 [Indibacter alkaliphilus LW1]
          Length = 778

 Score =  326 bits (836), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 202/599 (33%), Positives = 313/599 (52%), Gaps = 58/599 (9%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA-PKALSDV 75
           +  PA  + +A+P+GNGRLGAMV+G    E ++LNED+LW G P D+      P  L+ +
Sbjct: 29  YEQPADKWEEALPLGNGRLGAMVFGRTDVERIQLNEDSLWPGGPNDWGLAQGKPDDLACI 88

Query: 76  RSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           R L+  G+  +A +  V LF   +    +Q +GD+ LE     +      Y+R LDL+ A
Sbjct: 89  RELLVKGENKKADSLMVALFSRKSITRSHQTMGDLWLELGHQDIS----NYQRSLDLDKA 144

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN-----HSYVN 188
            A V Y     EF ++  +S  DQ I+ +I+ +    L+  + LD   D+          
Sbjct: 145 LATVTYQYEGYEFEQKAIASAKDQGIIIQITTTHPKGLNGKIRLDRPEDDGYPTVKISTP 204

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
            NN + M+G    ++    +       G++F             TI+ LE++  K+EG  
Sbjct: 205 ANNSLQMDGEVTQRKGQIDSKPAPILHGVRFQ------------TIALLENEGGKLEGKG 252

Query: 249 WAVLL---------LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
            A+ +         LVA++SF            D   ++ + L +++ L++++L  RH  
Sbjct: 253 DAIWIENVKTLSIKLVANTSF---------YHTDFRGKNQADLMALKELNFAELQKRHQK 303

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
           D+Q LF RV+ QL             E++IDT+P+  R+++ +    D  L +LLF +GR
Sbjct: 304 DHQGLFRRVNFQLG------------EKSIDTIPTDRRIENIKAGATDLHLEKLLFDYGR 351

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI SSRPGT  ANLQGIWN+ ++  W++  H+NIN++MNYW +   NLSE  +P F+F
Sbjct: 352 YLLIGSSRPGTLPANLQGIWNQHIAAPWNADYHMNINMQMNYWPAEVTNLSELHDPFFEF 411

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
              L  +G KTA+  Y   G    H TD+W  +     +  W  W   G W+  H WE Y
Sbjct: 412 TDALIPSGQKTAKETYGMRGAAFAHGTDLWKMTFLQAAQAYWGSWLGAGGWMMQHYWERY 471

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
            +T D +FL++R  P+ E   +F  DW++    DG L ++PSTSPE+ FI  +G  A  +
Sbjct: 472 LFTQDVEFLKERFIPVAEEIVAFYADWIVPHPLDGKLASSPSTSPENSFINSNGDHAAST 531

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWVQ 595
             + MD  II EVF   I+A E+L    D L++++ +   RLR   ++  DG +MEW Q
Sbjct: 532 IGAAMDQQIIAEVFDNYINAVELLGIQSD-LLQEIKEKRSRLRSGLQVGSDGRLMEWDQ 589


>gi|290962265|ref|YP_003493447.1| hypothetical protein SCAB_79571 [Streptomyces scabiei 87.22]
 gi|260651791|emb|CBG74917.1| putative secreted protein [Streptomyces scabiei 87.22]
          Length = 945

 Score =  326 bits (836), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 212/588 (36%), Positives = 310/588 (52%), Gaps = 44/588 (7%)

Query: 13  LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           L + ++ PA   +  A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D  N      
Sbjct: 42  LALWYDKPAGADWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAAN 101

Query: 72  LSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRREL 128
           ++++R  V + Q+  A    +  + G PA    YQ +G++ L F  +        Y+R L
Sbjct: 102 IAEIRRRVFADQWGPAQDLINQTMLGSPAGQLAYQPVGNLLLSFGSA---TGASQYKRTL 158

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           DL TATA   Y++  V + RE F    DQVIV +++   + +++ + + DS         
Sbjct: 159 DLTTATALTTYALNGVRYQREVFVGARDQVIVVRLTADRANAITCSATFDSPQRTTLSSP 218

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
               I ++G                   ++F A+     +   GT+S+     L+V G+ 
Sbjct: 219 DGATIALDG--------TSGTMEGITGRVRFLALAHAAATG--GTVSS-SGGTLRVSGAT 267

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
              +L+   SS+    ++  ++  D    +   L + R++    L +RH  D+Q LF RV
Sbjct: 268 SVTVLVSIGSSY----VDFRNTDGDHRGIARRHLDAARDIDIDALRSRHRTDHQALFDRV 323

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
           SI L R+       T +++     P+  R+       DP    LLFQFGRYLLISSSRPG
Sbjct: 324 SIDLGRT-------TAADQ-----PTDVRIAQHAQVSDPQFAALLFQFGRYLLISSSRPG 371

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           TQ ANLQGIWN+ ++P+WDS   +N NL MNYW +   NLSEC  P+FD +  L++ G++
Sbjct: 372 TQPANLQGIWNDQMAPSWDSKFTINANLPMNYWPADTTNLSECLLPVFDMIDDLTVTGAR 431

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
            A+  Y A GWV HH TD W  +S   G   W +W  GGAWL T +W+HY +T D DFL 
Sbjct: 432 VARAQYGAGGWVTHHNTDAWRGASVVDG-AQWGMWQTGGAWLATLIWDHYLFTGDTDFLR 490

Query: 489 KRAYPLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
              YP L+G A F LD L+  H   G+L TNPS SPE     P    A V    TMD  I
Sbjct: 491 SN-YPALKGAAQFFLDTLVA-HPTLGHLVTNPSNSPE----LPHHTNATVCAGPTMDNQI 544

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +R++F+++  A E L  +      + L +  RL PT++   G++ EW+
Sbjct: 545 LRDLFTSVARAGETLGVDA-GFRAQALAARDRLAPTRVGSRGNVQEWL 591


>gi|402814854|ref|ZP_10864447.1| hypothetical protein PAV_3c01950 [Paenibacillus alvei DSM 29]
 gi|402507225|gb|EJW17747.1| hypothetical protein PAV_3c01950 [Paenibacillus alvei DSM 29]
          Length = 810

 Score =  326 bits (835), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 203/611 (33%), Positives = 320/611 (52%), Gaps = 63/611 (10%)

Query: 21  AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVD 80
           A  +T+A PIGNGRLG +V+GG+  E ++LNED++W G   D  N  A  AL D+++L+ 
Sbjct: 15  ASKWTEAFPIGNGRLGGVVYGGIQREQIQLNEDSIWYGGARDNDNRAAQAALPDIKNLLL 74

Query: 81  SGQYAEATAASVKLFGHPADV------YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
            G   +A    +K   H  +V      YQ LG++ L+F+ +   +A   Y R+LDL+ A 
Sbjct: 75  QGNVRKAEKLVLK---HMTNVPQYFNPYQTLGNLFLDFEPNIEVHAINQYCRKLDLDHAL 131

Query: 135 ARVKYSVGN-------------------VEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
            +V Y VG                    ++++RE FSS  DQV+V +++ ++   L+F  
Sbjct: 132 VQVNYEVGRQDKEGRTATQATGEAQKEAIQYSREIFSSAADQVLVIRMTTTDEAGLTFAA 191

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
             D        V  ++         G+ I  +     D  G++++ +L+  +    G   
Sbjct: 192 KFDRRPFTGEMVQTDD---------GQGIAMQGQLGAD--GVRYAVVLQAVVE---GGQC 237

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
                 L +  +    L++ A +SF       +D+      +++ A +    + Y  L  
Sbjct: 238 QTAGNYLDIRQARAVTLIVAAQTSF-----RCADAYAVACQQAIQAAK----VPYEKLKQ 288

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLF 354
           RHLDDY+ LF+RV++ L     +             + +++R++ + Q   D  L  L +
Sbjct: 289 RHLDDYKPLFNRVTLDLEAEEGERTEPQQQVPGQQCLSTSQRLERYRQGATDNGLEALFY 348

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           Q+GRYLL++SSRPGT  ANLQGIWN+  +P W+S  H+NINL+MNYW +   NL+EC  P
Sbjct: 349 QYGRYLLLASSRPGTLPANLQGIWNDSFTPPWESDYHLNINLQMNYWLAETGNLAECHMP 408

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
           LFDF+  L ING +TA+  Y A G+V H  +++WA +      V   +WPMGGAW+  H+
Sbjct: 409 LFDFIERLVINGRQTARNIYGARGFVAHTSSNLWADTGIYGEYVSANMWPMGGAWIALHM 468

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
           WEHY Y     FL +RAYP+L+  A F LD+L+E   G L T PS SPE+ + +  G++ 
Sbjct: 469 WEHYCYNGSLSFLRERAYPVLKEAALFFLDFLLELPSGQLVTVPSLSPENSYRSEQGEVG 528

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA-----------LVEKVLKSLPRLRPTK 583
            + Y  +MD  I+  +F+A I A E+L+ +E+            L+ +  +   +L   +
Sbjct: 529 ALCYGPSMDSQILYALFTACIRAGELLQLDEEGHLKQGFHEDKDLLAQWQQVRSKLPQPQ 588

Query: 584 IAEDGSIMEWV 594
           I   G IMEW 
Sbjct: 589 IGRHGQIMEWA 599


>gi|386820649|ref|ZP_10107865.1| hypothetical protein JoomaDRAFT_2613 [Joostella marina DSM 19592]
 gi|386425755|gb|EIJ39585.1| hypothetical protein JoomaDRAFT_2613 [Joostella marina DSM 19592]
          Length = 780

 Score =  325 bits (834), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 201/595 (33%), Positives = 308/595 (51%), Gaps = 48/595 (8%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           + ++ PA  + +A+P+GNGR+GAM++GG+ +E  +LNED++W G P         + L+ 
Sbjct: 25  VWYSQPADTWMEALPVGNGRMGAMIYGGIETEHFQLNEDSMWPGSPNLSNAKGTAEDLAL 84

Query: 75  VRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
           +R L+D G+  EA +  +  F     V  +Q  GD+ L F +   +     Y+R LD   
Sbjct: 85  IRKLIDEGKVHEADSLIIDKFSRQDIVRSHQTAGDLFLHFKN---RGEVTNYKRSLDFEK 141

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH------SY 186
           AT+ V YSV    F    FSS PD V+V K+  S    + F++ +    D        + 
Sbjct: 142 ATSYVSYSVDGNTFKETAFSSQPDNVLVIKLETSNRNGMDFDIEMSRPKDEGVETVKVAT 201

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
                 ++M G         ++       G++F   L++K     G I++    +L V  
Sbjct: 202 FPEKQLMLMNGEVTQMGGVVESVPTPIKNGVKFQTRLKVK--SKSGIITS-NGNRLTVRN 258

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +   +LL+   +S+  P         D   ++   +++  +  Y  L   H+ D++ L++
Sbjct: 259 AKEVLLLIATETSYYHP---------DYIEKAELVIENAESKGYKALVNNHIQDFKNLYN 309

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
           RVS+        I TD  ++E     P+ +R++ ++    D  L E LF +GRYLLISSS
Sbjct: 310 RVSLH-------IETDNSNKE----FPTDKRLERYKAGVVDVGLQETLFNYGRYLLISSS 358

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           R GT  ANLQGIWN  ++  W++  H+NINL+MNYW +   NL+EC+ PLFDF   L I 
Sbjct: 359 RKGTNPANLQGIWNNHITAPWNADYHLNINLQMNYWLAPITNLAECELPLFDFGNRLIIR 418

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G +TA+   +  G + HH TD+W  +        W  W  G  WL  H W +Y +T D  
Sbjct: 419 GKETAKQYGINRGSMSHHATDLWGPAFMRARTPYWGAWIHGAGWLAQHYWGYYLFTEDEV 478

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETN------PSTSPEHEFIAPDGKLACVSYS 539
           FL+++ YP L+  A+F LDWL      Y E+       P TSPE+ +IA DGK A VS  
Sbjct: 479 FLKEQGYPYLKEVATFYLDWL-----QYDESTKEWFSYPETSPENSYIANDGKPAAVSRG 533

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
           + M   II EVF  IISA+E+L   +D L+++V K    LRP  +I  DG ++EW
Sbjct: 534 TAMGQQIIGEVFRNIISASEILAI-DDELIKEVKKKAENLRPGVQIGADGRVLEW 587


>gi|376260262|ref|YP_005146982.1| trehalose/maltose hydrolase or phosphorylase [Clostridium sp.
           BNL1100]
 gi|373944256|gb|AEY65177.1| trehalose/maltose hydrolase or phosphorylase [Clostridium sp.
           BNL1100]
          Length = 1159

 Score =  325 bits (834), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 204/586 (34%), Positives = 302/586 (51%), Gaps = 57/586 (9%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
           F  A+P+GNGR+GAMV+G  P E + LNE T W+  PG+     A  +L   +  + +GQ
Sbjct: 76  FYKALPLGNGRIGAMVYGNYPDERIDLNEATFWSSGPGNNNRAGAANSLKTAQDQLFAGQ 135

Query: 84  Y-AEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
           Y   +T  +  + G     YQ +GD++L F  S +      Y R+LD+NT      Y+  
Sbjct: 136 YKTGSTTIANSMIGGGEAKYQSIGDLKLLFGHSSV----SNYSRQLDMNTGVVSSDYTYN 191

Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN--GNNQIIMEGRCP 200
             ++ RE F S PDQ++VTKI+ S  GS+S     +S L     V+  GN+ ++M G   
Sbjct: 192 GKQYHRESFVSYPDQIMVTKITCSSPGSISLTAGYESSLTGQYTVSTSGNDTLVMNGH-- 249

Query: 201 GKRIPPKANANDDPKGIQFSAILEI--KISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
                      D   GI ++       KI +  G++SA  + ++ V  +D  V+L    +
Sbjct: 250 ----------GDSDNGISYAVWFSTRSKIINSNGSVSA-NNNQISVSNADSVVIL----T 294

Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
           S    F+N      D   ++ + + +    SY  LY  H+ DYQ LF RV + L  S   
Sbjct: 295 SIRTNFVNYKTCNGDEKGKATTDITNASAKSYDTLYNNHVADYQNLFKRVDVDLGGSG-- 352

Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIW 378
                 SE N    P  +R+  F T  DP L ++LFQ+GRYL+IS+SR  +Q  NLQGIW
Sbjct: 353 ------SENN---KPMGQRISEFGTTNDPKLAKVLFQYGRYLMISASRD-SQSMNLQGIW 402

Query: 379 NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY-LAS 437
           N+  +P W      NIN EMNYW +   NL+EC EP       L   G++TA+ +Y +++
Sbjct: 403 NKFRNPAWGCKMTTNINYEMNYWPAFTTNLAECFEPFVKKAKELQAPGNETARAHYNISN 462

Query: 438 GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEG 497
           GWV+HH TD+W +++   G+  W LWP G  W+   L++ YN+  D  +L +  YP+++G
Sbjct: 463 GWVLHHNTDLWNRTAPIDGE--WGLWPTGAGWVSNMLYDAYNFNQDTAYLNE-IYPVIKG 519

Query: 498 CASFLLDWL----IEGHDGYLETNPSTSPEHEFIAP----DGKLACVSYSSTMDMAIIRE 549
            A FL   +    I G + Y    PSTSPE   + P     G+ A  SY  TMD  I RE
Sbjct: 520 AADFLQTLMQSKSINGQN-YQVICPSTSPE---LTPPGTSGGQGAYNSYGVTMDNGISRE 575

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWV 594
           +F  +I AA +L  N D      L+S + +++P  I   G + EW 
Sbjct: 576 LFKDVIQAAGIL--NVDPAFRSTLQSKVSQIKPNTIGSWGQLQEWA 619


>gi|189468049|ref|ZP_03016834.1| hypothetical protein BACINT_04443 [Bacteroides intestinalis DSM
           17393]
 gi|189436313|gb|EDV05298.1| hypothetical protein BACINT_04443 [Bacteroides intestinalis DSM
           17393]
          Length = 830

 Score =  325 bits (834), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 201/603 (33%), Positives = 315/603 (52%), Gaps = 48/603 (7%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N ++      LK+ ++ PA  + +A+P+GNGR+G MV+G    E  +LNE+T+W G P +
Sbjct: 18  NLQAQQEDQTLKLWYDKPATQWVEALPLGNGRIGTMVFGDPVHEQFQLNEETVWGGSPHN 77

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSH 116
            TNP A  AL  +R L+  G+  EA      T  S    G P   YQ +G + L+FD  +
Sbjct: 78  NTNPKAKDALPRIRQLIFEGKNKEAQELCGPTICSQSANGMP---YQTVGSLHLDFDGIN 134

Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
            +Y +  Y R+LD+  A A  +++   V +TRE ++S PDQV+V +++ S+  S+SF   
Sbjct: 135 -EYND--YYRDLDIEKAIATTRFTANGVTYTREAYTSFPDQVLVIRLTASQKKSISFTAK 191

Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK---GIQFSAILEIKISDDRGT 233
                    Y       ++    P K +     AND       ++F+A+   +I ++ G 
Sbjct: 192 ---------YSTPYKSSVIRCISPRKELQLNGKANDHEGIEGKVEFTAL--TRIENNGGK 240

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
           +  L D  L+V+ ++ +V+L V   S    F+N  D   D  + +   L+ + N +Y   
Sbjct: 241 LEILSDSTLQVKDAN-SVILYV---SIGTNFVNYKDVSGDALNSAQQYLKLV-NKNYPKS 295

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H++ YQK F+RVS+ L            S   I+  P+  RVK F +  DP +  L 
Sbjct: 296 KASHINAYQKYFNRVSLNLG-----------SNAQINK-PTDVRVKEFSSSFDPQMAVLY 343

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQFGRYLLI SS+PG Q ANLQGIWN  L   WD     +IN+EMNYW +   +L E  E
Sbjct: 344 FQFGRYLLICSSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMHE 403

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           P    +  ++I G ++A + Y   GW +HH TDIW  + A  G   + +WP   AW C H
Sbjct: 404 PFLQLVKEVAIQGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGS-SYGVWPTCNAWFCQH 461

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
           LW+ Y ++ D+++L + AYPL+ G   F LD+L+ E  + +L   PS SPE+       +
Sbjct: 462 LWDRYLFSGDKNYLSE-AYPLMRGACEFYLDFLVREPENNWLVVAPSYSPENSPAVNGQR 520

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
              V   +TMD  ++ ++F   ISAA+++ +   A  + +   +  L P ++   G + E
Sbjct: 521 TFVVVAGTTMDNQMVYDLFYNTISAAKLMNETT-AFTDSLQTVVNNLAPMQVGRWGQLQE 579

Query: 593 WVQ 595
           W+ 
Sbjct: 580 WMH 582


>gi|261878761|ref|ZP_06005188.1| fibronectin type III domain protein [Prevotella bergensis DSM
           17361]
 gi|270334768|gb|EFA45554.1| fibronectin type III domain protein [Prevotella bergensis DSM
           17361]
          Length = 814

 Score =  325 bits (833), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 209/585 (35%), Positives = 306/585 (52%), Gaps = 45/585 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ ++ PA  + +A+PIGNG LGAMV+GG   ETL LNE T W+G P D  + ++   L 
Sbjct: 23  RLWYHQPASKWVEALPIGNGFLGAMVYGGTRQETLALNETTFWSGGPHDNNSTESLSYLP 82

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADVYQL-LGDIELEFDDSHLKYAEETYRRELDLN 131
           ++R  +  G+  EA       +   P  +  L LGD+ + F++ H +  +  Y R L+L 
Sbjct: 83  EIRQKIFEGKENEAQKLIDQHVVKGPHGMRFLPLGDVRIRFEE-HGEVGQ--YSRSLNLE 139

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
            A   V Y++G V+  R  F+S PD+VI  +I  S     SF +S+ SL  + +  +GN 
Sbjct: 140 KALHEVSYTIGGVKIQRVSFASLPDRVIGMRIKSSRR--TSFTISVHSLFQSEAQTHGN- 196

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
              +EG   G          D  +G+  +  A   I +  + G +    D  L+VE +  
Sbjct: 197 --ALEGTVYG----------DSQEGVAGRLRAHYRIVVKGN-GKVVPTGDS-LRVERASN 242

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
             + + A+++F    +N  D   D  +     +  +   S+  L  RH+  Y+  + RVS
Sbjct: 243 TEIYMAAATNF----VNFKDVSGDEKAVVNRLMAGVSGQSFDRLLKRHVRAYRCQYDRVS 298

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L         +  S      +P+ ER++ F   +D  +V L+F +GRYLLISSS+PG 
Sbjct: 299 LTL---------NGASPSPHAQLPTDERLRQFAGSQDMGMVALIFNYGRYLLISSSQPGG 349

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN + +  WDS   +NIN EMNYW +  CNL E  +PLF  +  LS+ G KT
Sbjct: 350 QPANLQGIWNGERNAPWDSKYTININTEMNYWPAETCNLREAVKPLFSLIGDLSLTGEKT 409

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+  Y   GWV HH TD+W  +    G   W ++P GG WL THLW+HY YT DR FL +
Sbjct: 410 ARQMYGCRGWVAHHNTDLWRIAGPVDG-AYWGMFPNGGGWLSTHLWQHYLYTGDRVFL-R 467

Query: 490 RAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
             Y +L+G A F LD++  +   GYL   PS SPEH    P GK + V    TMD  I  
Sbjct: 468 LWYSVLKGAADFYLDYMQTDPRTGYLVVVPSVSPEH---GPHGK-SPVGAGCTMDNQIAF 523

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +V S  + A E+L  N  A  + + K++  L P KI   G + EW
Sbjct: 524 DVLSNCLQATEILNGNR-AYADSLRKAIAALPPMKIGRHGQLQEW 567


>gi|374333663|ref|YP_005086791.1| hypothetical protein PSE_p0242 [Pseudovibrio sp. FO-BEG1]
 gi|359346451|gb|AEV39824.1| hypothetical protein PSE_p0242 [Pseudovibrio sp. FO-BEG1]
          Length = 798

 Score =  325 bits (833), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 207/602 (34%), Positives = 300/602 (49%), Gaps = 64/602 (10%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF- 95
           MV+G   S  + LNEDTL++G P   Y  P+    +  V +L+  G+  EA     K + 
Sbjct: 1   MVYGNPLSARIHLNEDTLYSGEPTRIYPVPEIAHQIDHVEALLRDGKLFEAQEFVRKNWT 60

Query: 96  GHPADVYQLLGDIELEF-DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSN 154
           G     YQ +G++ +   DDS +      YRR LD+  +     Y      F R  F+S 
Sbjct: 61  GRQGQAYQPVGNLFITMADDSPVS----NYRRALDIRHSLHHESYEQNRTTFERTSFASF 116

Query: 155 PDQVIVTKISGSESGSLSFNVSLDSLLD--NHSYVNGNNQIIMEGRCP------------ 200
           PD VIV +++  + G+LSF++  DS       ++   N ++ + G+ P            
Sbjct: 117 PDNVIVVRLTADKPGTLSFSLRYDSPHPTCRTTHEAENTRLHLRGQAPAFTSSRVIERIE 176

Query: 201 ---------------GKRIPPKANANDDPKG------------IQFSAILEIKISDDRGT 233
                          GK  P   N  D  +G              F A L +++   R  
Sbjct: 177 HDQEQSRNPEIFGADGKLRPFAENEQDGHRGGIVYSEDGLGEGTYFEAGLSVELEGGR-- 234

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
               E  +L +EG+    L +  ++SF+GP  +PS   KDP     SAL +  ++SY D 
Sbjct: 235 -IRPERGELHIEGATAVTLRIAIATSFNGPDKSPSREGKDPAPIVKSALDTAGSVSYEDT 293

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
             +H DD  +LF RVS++L  +             I  +P++ R++ FQ   DP+L  L 
Sbjct: 294 LQKHSDDVLRLFDRVSLKLGNNA------------IPDLPTSTRLEQFQEKGDPALAALQ 341

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQ+GRYLLI+SSR G+Q  NLQGIW+    P W S   +NINLEMNYW +    LS+  E
Sbjct: 342 FQYGRYLLIASSRGGSQPPNLQGIWSNLRRPQWSSNYTMNINLEMNYWPAEITGLSDLHE 401

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLF  +  L+++G++TA+  + A GW   H T IW  S         A WPM   WL +H
Sbjct: 402 PLFMLIEELAVSGARTAKKMFNAPGWCAFHNTTIWRDSVPSPCDPASAFWPMAAGWLLSH 461

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           +WEH+ YT D++FL+ RAYPL++  A F   WL E  DGYL    STSPE+ ++  DG +
Sbjct: 462 MWEHFLYTGDKEFLKNRAYPLMKSAAEFYEWWLCENKDGYLVPKVSTSPENRYLDEDGHV 521

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             V   STMD AIIRE F+   +AA++L  + + L   +     RL P +I   G + EW
Sbjct: 522 ITVDQGSTMDCAIIRETFTNTAAAAKLLGLDAE-LANTLEAKAARLLPYQIGAQGQVQEW 580

Query: 594 VQ 595
            Q
Sbjct: 581 SQ 582


>gi|410029118|ref|ZP_11278954.1| hypothetical protein MaAK2_07959 [Marinilabilia sp. AK2]
          Length = 754

 Score =  325 bits (832), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 203/592 (34%), Positives = 309/592 (52%), Gaps = 44/592 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA-PKALSDV 75
           +  PA  + +A+P+GNGRLGAMV+G   +E ++LNED+LW G P D+   +  P+ L  +
Sbjct: 4   YEKPASIWEEALPLGNGRLGAMVFGQTSTERIQLNEDSLWPGGPDDWGPAEGKPEDLEFI 63

Query: 76  RSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           R L+  G+  +A +  V  F   +    +Q LGD+ L+     +      YRRELDL+ A
Sbjct: 64  RQLLLHGENKKADSLLVAKFSRKSITRSHQTLGDLWLDLGHEEVS----NYRRELDLDRA 119

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS-----YVN 188
              + Y+V    F ++ FSS PDQ IV ++       ++  + L    D+          
Sbjct: 120 LVTISYTVEGYVFLQKVFSSAPDQAIVIRLESKHPKGINGKIKLSRPEDDGYPTVTVQAT 179

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
            N  + MEG    +R    +  +    G++F  I  + I ++ G      D  +++EG +
Sbjct: 180 SNQTLHMEGEITQRRGQIDSKPSPILHGVKFQTI--VFIENESGKTFQKGDH-IELEGVE 236

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
              + LV ++S+           +D   ++   LQ+I+  ++ +L  RH+ DYQ LFHRV
Sbjct: 237 ALNIKLVTNTSY---------YHQDFQRKNQEQLQNIKAKTFEELEQRHITDYQSLFHRV 287

Query: 309 SIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
              L   +P D  TD             ERVK  +TD    L  LLF FGRYLLISSSRP
Sbjct: 288 KFSLDDPNPLDSPTDQ----------RIERVKGGKTD--LYLESLLFDFGRYLLISSSRP 335

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           GT  ANLQG+WN  +   W++  H+NINL+MNYW +   NLSE  EP FD++  L ++G 
Sbjct: 336 GTLPANLQGLWNRHIEAPWNADYHLNINLQMNYWPAEVTNLSELHEPFFDYMDQLILSGK 395

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           KTA+  Y   G  + H +D+W  +     +  W  W   G W+  H WE Y +T D++FL
Sbjct: 396 KTARETYGMRGAALAHGSDLWNMTFLQAAEAYWGAWLGAGGWMMQHFWERYLFTQDKNFL 455

Query: 488 EKRAYPLLEGCASFLLDWLI---EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
            +R  P +E  A+F LDWL+   EG  G   ++PSTSPE+ FI   G+    +  + MD 
Sbjct: 456 RQRFLPAMEEIAAFYLDWLVPYPEG--GKWVSSPSTSPENSFINAKGESVASTMGAAMDQ 513

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWVQ 595
            +I EVF   + A+++L   +  ++++V      LR   +I  DG ++EW Q
Sbjct: 514 QVIAEVFDNFMQASKIL-GYQSPILDEVKSKRQNLRSGLRIGSDGRLLEWDQ 564


>gi|319642679|ref|ZP_07997325.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
 gi|345520274|ref|ZP_08799672.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
 gi|254836101|gb|EET16410.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
 gi|317385767|gb|EFV66700.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
          Length = 814

 Score =  325 bits (832), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 202/589 (34%), Positives = 321/589 (54%), Gaps = 46/589 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G    E ++LNE+T+W G P +  NPDA + + 
Sbjct: 25  KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPDALEYIP 84

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV  G+Y EA T A+ K+      G P   YQ  GD+ + F   H +Y++  Y RE
Sbjct: 85  KVRRLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L L++A   V+Y V  V + RE  +S  DQV++ +++ S+ G ++ N +L +   +    
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVA 198

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
               ++ + G          ++ ++  KG ++F   +  +    +G   +  D  L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTART---QGGTRSCRDGVLSIEG 246

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D AV+ +  +++F     N  D   +    + + L+   +  Y      H+D +++   
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYMTSRKAHVDFFKQYMD 302

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RVS+       D+  D  +    D      RV++F+  +D  LV   F+FGRYLLI SS+
Sbjct: 303 RVSL-------DLGIDKYAGVTTDM-----RVQNFKETKDDFLVATYFRFGRYLLICSSQ 350

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQGIWN+ L P+WDS    NIN+EMNYW +   NLSE  EPL   +  +S  G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            ++A++ Y A GWV+HH TDIW  + A   K    LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469

Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L + AYP+++    F  + ++ E    +L   PS SPE+     +GK A  +   T+D  
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +I ++++ II+ A +L  + +    ++ + L  + P +I   G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATRLEQRLKEMAPMQIGRWGQLQEWM 575


>gi|294777781|ref|ZP_06743227.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
 gi|294448369|gb|EFG16923.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
          Length = 814

 Score =  324 bits (831), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 202/589 (34%), Positives = 320/589 (54%), Gaps = 46/589 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G    E ++LNE+T+W G P +  NPDA + + 
Sbjct: 25  KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPDALEYIP 84

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV  G+Y EA T A+ K+      G P   YQ  GD+ + F   H +Y++  Y RE
Sbjct: 85  KVRRLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L L++A   V+Y V  V + RE  +S  DQV++ +++ S+ G ++ N +L +   +    
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVA 198

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
               ++ + G          ++ ++  KG ++F   +  +    +G   +  D  L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTART---QGGTRSCRDGVLSIEG 246

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D AV+ +  +++F     N  D   +    + + L+   +  Y      H+D +++   
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYMTSRKAHVDFFKQYMD 302

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RVS+ L       VT            +  RV++F+  +D  LV   F+FGRYLLI SS+
Sbjct: 303 RVSLNLGIDKYAGVT------------TDMRVQNFKETKDDFLVATYFRFGRYLLICSSQ 350

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQGIWN+ L P+WDS    NIN+EMNYW +   NLSE  EPL   +  +S  G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            ++A++ Y A GWV+HH TDIW  + A   K    LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469

Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L + AYP+++    F  + ++ E    +L   PS SPE+     +GK A  +   T+D  
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +I ++++ II+ A +L  + +    ++ + L  + P +I   G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATRLEQRLKEMAPMQIGRWGQLQEWM 575


>gi|150005172|ref|YP_001299916.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
 gi|149933596|gb|ABR40294.1| glycoside hydrolase family 95 [Bacteroides vulgatus ATCC 8482]
          Length = 814

 Score =  324 bits (831), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 202/589 (34%), Positives = 321/589 (54%), Gaps = 46/589 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G    E ++LNE+T+W G P +  NPDA + + 
Sbjct: 25  KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPDALEYIP 84

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV  G+Y EA T A+ K+      G P   YQ  GD+ + F   H +Y++  Y RE
Sbjct: 85  KVRRLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L L++A   V+Y V  V + RE  +S  DQV++ +++ S+ G ++ N +L +   +    
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVA 198

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
               ++ + G          ++ ++  KG ++F   +  +    +G   +  D  L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTART---QGGTRSCRDGVLSIEG 246

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D AV+ +  +++F     N  D   +    + + L+   +  Y      H+D +++   
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYMTSRKAHVDFFKQYMD 302

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RVS+       D+  D  +    D      RV++F+  +D  LV   F+FGRYLLI SS+
Sbjct: 303 RVSL-------DLGIDKYAGVTTDM-----RVQNFKETKDDFLVATYFRFGRYLLICSSQ 350

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQGIWN+ L P+WDS    NIN+EMNYW +   NLSE  EPL   +  +S  G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            ++A++ Y A GWV+HH TDIW  + A   K    LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469

Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L + AYP+++    F  + ++ E    +L   PS SPE+     +GK A  +   T+D  
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +I ++++ II+ A +L  + +    ++ + L  + P +I   G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATRLEQRLKEMAPMQIGRWGQLQEWM 575


>gi|423311885|ref|ZP_17289822.1| hypothetical protein HMPREF1058_00434 [Bacteroides vulgatus
           CL09T03C04]
 gi|392689264|gb|EIY82542.1| hypothetical protein HMPREF1058_00434 [Bacteroides vulgatus
           CL09T03C04]
          Length = 814

 Score =  324 bits (831), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 202/589 (34%), Positives = 321/589 (54%), Gaps = 46/589 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G    E ++LNE+T+W G P +  NPDA + + 
Sbjct: 25  KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPDALEYIP 84

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV  G+Y EA T A+ K+      G P   YQ  GD+ + F   H +Y++  Y RE
Sbjct: 85  KVRRLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L L++A   V+Y V  V + RE  +S  DQV++ +++ S+ G ++ N +L +   +    
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVA 198

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
               ++ + G          ++ ++  KG ++F   +  +    +G   +  D  L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTART---QGGTRSCRDGVLSIEG 246

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D AV+ +  +++F     N  D   +    + + L+   +  Y      H+D +++   
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYMTSRKAHVDFFKQYMD 302

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RVS+       D+  D  +    D      RV++F+  +D  LV   F+FGRYLLI SS+
Sbjct: 303 RVSL-------DLGIDKYAGVTTDM-----RVQNFKETKDDFLVATYFRFGRYLLICSSQ 350

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQGIWN+ L P+WDS    NIN+EMNYW +   NLSE  EPL   +  +S  G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            ++A++ Y A GWV+HH TDIW  + A   K    LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469

Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L + AYP+++    F  + ++ E    +L   PS SPE+     +GK A  +   T+D  
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +I ++++ II+ A +L  + +    ++ + L  + P +I   G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATRLEQRLKEMAPMQIGRWGQLQEWM 575


>gi|345569032|gb|EGX51901.1| hypothetical protein AOL_s00043g635 [Arthrobotrys oligospora ATCC
           24927]
          Length = 723

 Score =  324 bits (831), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 200/570 (35%), Positives = 295/570 (51%), Gaps = 60/570 (10%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
           MV+G   +E L+LNED++W G P D     A + L ++R L+  G+  EA A      F 
Sbjct: 1   MVYGQTTTEVLQLNEDSVWYGGPQDRLPKAALQNLPELRRLIREGRQKEAEALVRAAFFA 60

Query: 97  HPADVY--QLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSN 154
           +P+     + LG + L+FD  +       YRRELD++ A +RV+YS   +++ RE  +S 
Sbjct: 61  YPSSQRHSEPLGTLHLDFDYGYQGIDIRDYRRELDISQAISRVQYSCNGIQYQREAIASY 120

Query: 155 PDQVIVTKISGSESGSLSFNVS--------LDSLLDNHSYVNGNNQIIMEGRCPGKRIPP 206
           PDQVI   +S S+S   +  ++         +  LD  +  +G  +IIM           
Sbjct: 121 PDQVIGINLSSSQSSKYTIRLNRVSEREYETNEFLDTLTTRDG--KIIM----------- 167

Query: 207 KANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFIN 266
             +A     G +   ++  + +D  G +  L +  L V G   + +LL + ++F      
Sbjct: 168 --HATPGGGGSRLCCVVSARSNDPDGRVQVLGNT-LVVTGKS-STILLASQTTF------ 217

Query: 267 PSDSKKDPTSESMSALQSIRNL-SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCS 325
                +DP    ++AL  I    S++ +  RHL DY+ L+ RV ++LS     I TD   
Sbjct: 218 ---RVEDP---ELAALGDIEKCGSWTQILDRHLKDYKNLYGRVCLKLSSDDSHIPTDL-- 269

Query: 326 EENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLS 383
                           Q   DP LV L   +GRYLLIS SRPG +   A LQGIWN    
Sbjct: 270 --------------RLQRKPDPGLVGLYHNYGRYLLISCSRPGDKALPATLQGIWNPSFQ 315

Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
           P W S   +NIN +MNYW +   NL EC+ PLF+ L  + +NG++TA+  Y   GW  HH
Sbjct: 316 PPWGSKYTININTQMNYWPANISNLPECETPLFELLERVQVNGARTAKEMYGCRGWCAHH 375

Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
            TDIWA ++     +   LWP+GGAWLCTH+WE Y +  D+ FL+ R +P+LEGC  FLL
Sbjct: 376 NTDIWADTNPQDKWMPATLWPLGGAWLCTHIWERYLFFEDKSFLQ-RLFPVLEGCVRFLL 434

Query: 504 DWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK 563
           D+LI+   G+  TNPS SPE+ F    G+      +STMD+ I+  VF A I++  +LE 
Sbjct: 435 DFLIKDDHGFYVTNPSLSPENTFKNQRGEEGVFCEASTMDIQILTAVFKAYITSCHILEG 494

Query: 564 NEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
                + +V K+L  L P  ++  G + EW
Sbjct: 495 LGTVDMAEVNKALAGLPPVIVSSTGLLQEW 524


>gi|408787527|ref|ZP_11199255.1| hypothetical protein C241_15883 [Rhizobium lupini HPC(L)]
 gi|408486464|gb|EKJ94790.1| hypothetical protein C241_15883 [Rhizobium lupini HPC(L)]
          Length = 739

 Score =  324 bits (831), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 207/589 (35%), Positives = 313/589 (53%), Gaps = 48/589 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ ++  A  +T+A+PIGNGRLGAMV+GG   E +++NE T + G P    NPDA   L 
Sbjct: 5   RLWYDTAASAWTEALPIGNGRLGAMVFGGAWDERIQINESTFYNGGPYQPINPDAKDHLP 64

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDDSHLKYAEETYRRELDL 130
            VR  +  G+Y EA   +        D+   YQ +GD+++ F           YRRELDL
Sbjct: 65  AVRQRILDGKYMEAERLAYDHVMARPDLQTSYQPIGDLKIAFQHDMTTI---NYRRELDL 121

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
            T  A  +Y    V + R+ F+S    VIV K++  + GSLS ++ L S  +  +    +
Sbjct: 122 ETGIAVTRYDCDGVHYHRQIFASAIADVIVCKVTVDKPGSLSLSLLLSSPQNGEAEDRRD 181

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD---DRGTISALEDKKLKVEGS 247
           + +   GR            N  P  ++F+   ++  +    DRG  S      ++V  +
Sbjct: 182 HVLGYLGR--------NRKQNGIPGALRFAFRTQVVATGGFVDRGPES------IRVREA 227

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D  ++ + A +SF        D   DP   +   L      ++ DL   H++D+++LF R
Sbjct: 228 DSVIIFIDAGTSFR----RYDDVSGDPEKTTEMRLARASTRAFEDLLEEHVEDHRRLFGR 283

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           ++I +               ++  VP+ +RV+      DP L  L  Q+GRYL I+SSRP
Sbjct: 284 MAIDIG-------------PDLSHVPTDKRVRDNVAKPDPQLAALYTQYGRYLAIASSRP 330

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           GTQ +NLQGIWNE++ P W+S   +NIN +MNYW + P NL+E   PL + +  L+  G 
Sbjct: 331 GTQPSNLQGIWNEEILPPWNSKFTLNINTQMNYWLADPANLAETFIPLIEMVEDLAETGQ 390

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           + A+ +Y A GWV+HH TDIW  S    G   W LWP GGAWLC  L++HY+++ D   L
Sbjct: 391 EMARAHYGARGWVVHHNTDIWRASGPIDGP-KWGLWPTGGAWLCAQLYDHYSFSGDEAIL 449

Query: 488 EKRAYPLLEGCASFLLDWLIE-GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
            +R YPL++G A F+LD L++     Y  T PS SPE+    P G   C      MD  I
Sbjct: 450 -RRIYPLMKGSAEFILDILVDLPGTSYRVTCPSLSPENRH--PGGTSLCA--GPAMDNQI 504

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           IR+VF+A+ISA+E L  +E AL  +++ +  RL   K+ + G + EW++
Sbjct: 505 IRDVFAAVISASEALAIDE-ALRAELVAARARLPEDKVGKVGQLQEWIE 552


>gi|333382100|ref|ZP_08473777.1| hypothetical protein HMPREF9455_01943 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332829131|gb|EGK01795.1| hypothetical protein HMPREF9455_01943 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 820

 Score =  324 bits (830), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 200/588 (34%), Positives = 320/588 (54%), Gaps = 47/588 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKALSDV 75
           +  PA  +  ++P+GNGR+GAMV+GG+  E + LNE T+W+G P  +   P     L+D+
Sbjct: 47  YENPADEWMKSLPLGNGRIGAMVFGGIEKEVIALNEVTMWSGQPDKFQERPLGKTMLNDI 106

Query: 76  RSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
           R L   G+YA+      +      H    +   GD++L+F   +   A   Y+REL+L  
Sbjct: 107 RQLFFEGKYAKGNRVVSEFMSGTPHSFGSHVPAGDLKLDF--KYPAGAVSGYKRELNLEN 164

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG-NN 191
           A   V + VGN+ +TRE+F SNPD   + +++ +++ SL+ +VSLD L +  S +   +N
Sbjct: 165 AINTVSFKVGNILYTREYFCSNPDNAFIVRLTANKAKSLTLDVSLDMLRE--SVIKAVDN 222

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
            +   G+      P +      P G+ F   + +   D  G +SA  + K+ +  +    
Sbjct: 223 SLEFSGKVS---FPKQG-----PGGVDFMGKVGVTAKD--GNVSA-SNNKISIADATSVT 271

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           ++L   + +     N    K+D  +    AL       Y+ L  +H+ DY  LF RV + 
Sbjct: 272 IILDLRTDY-----NNKHYKEDCFATVNKALSQ----DYNRLKNKHVSDYSNLFKRVDLF 322

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           L +S  D          + T    ERVK+ +  ED  L  L FQ+ RYLLI++SR  + +
Sbjct: 323 LGKSEAD---------KLPTDKRWERVKAGK--EDVGLDALFFQYARYLLIAASREDSPL 371

Query: 372 -ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
            ANLQGIWN++L+    W +  H++IN + NYW S   NL EC  PLFD++  LS+ G K
Sbjct: 372 PANLQGIWNDNLACNMGWTNDYHLDINTQQNYWLSNIGNLHECNTPLFDYIKDLSVYGQK 431

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           TA+  Y A GWV +   ++W  +++ +G V W L+P+ G W+ +HLW HY YTMD ++L 
Sbjct: 432 TAKNVYGARGWVANTVANVWGYTASGQG-VNWGLFPLAGTWIASHLWTHYIYTMDENYLR 490

Query: 489 KRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
            +AYP+L+  A FLLD++++   +GYL T PSTSPE+ F     +L+ VS     D  + 
Sbjct: 491 NKAYPILKSNAEFLLDYMVQDPKNGYLMTGPSTSPENSFRYKGNELS-VSLMPACDRQLA 549

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            E F++ I A+++L   +D   + +  +L +L P  I ++G+I EW +
Sbjct: 550 YEAFASCIQASKILNV-DDKFRDSLSIALKKLPPIIIGKNGAIQEWFE 596


>gi|290769720|gb|ADD61497.1| putative multimodular carbohydrate-active enzyme [uncultured
           organism]
          Length = 1083

 Score =  324 bits (830), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 199/595 (33%), Positives = 312/595 (52%), Gaps = 42/595 (7%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           M+N +  +    +K+ ++ PA+ + +A+P+GN RLGAMV+GG   E ++LNE+T W G P
Sbjct: 282 MINKQEATR---MKLWYSAPARRWVEALPVGNSRLGAMVYGGTDKEEIQLNEETFWAGGP 338

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
               NP   + L+  R LV + + +EA     + F       + L    L  +    K  
Sbjct: 339 YRNDNPKGKEVLAKTRELVFANRLSEAQKLIDENFFTGQHGMRFLTMGSLLINQPEHKNV 398

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
           E  Y RELD+  A A  +Y V  V +TR  FSS  D VIV ++   +  +L+F++S +S 
Sbjct: 399 E-NYYRELDIENAVAVTRYMVDGVTYTRTVFSSFADNVIVVRMEADKPKALNFDLSYNSP 457

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
           L +     GN  ++   +C G           + +GI  +   E ++       S   +K
Sbjct: 458 LKHVVMAKGNELVV---KCEGM----------EQEGIPAALNAECRVLVRHNGKSGKSNK 504

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + V+ +  A L + A+++F    +N  D   + +  + S L+    + Y      H+  
Sbjct: 505 SVVVDQATVATLYISAATNF----VNYHDVGGNASKLASSILKRAVKVPYEQALANHIAA 560

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           Y++ F RV+  +        T+T       T+ + +RV +F   +D +L+ L+FQ+GRYL
Sbjct: 561 YKEQFDRVTFSIPS------TET------STLETDKRVVAFGEGKDLNLIALMFQYGRYL 608

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSS+PG Q ANLQG+W   +   WDS   +NIN EMNYW +   NLSE  +PLFD ++
Sbjct: 609 LISSSQPGGQPANLQGLWCNSVYAPWDSKYTININTEMNYWPAEVTNLSENHQPLFDMVS 668

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            LS+NG KTA+  Y A GWV HH TD+W ++        + +WP GGAWL  HLW+HY +
Sbjct: 669 DLSVNGKKTAETVYGARGWVAHHNTDLW-RACGPIDAAYFGMWPNGGAWLTQHLWQHYLF 727

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           T D++FL +R YP+++G A F L  L++   +G+L T PS SPEH +        C    
Sbjct: 728 TGDKEFL-RRYYPVMKGAADFYLSHLVKHPQNGWLVTAPSVSPEHGYAGSSITAGC---- 782

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            TMD  I  +     + AA +L +++ A  + +  +  +L P +I     I EW+
Sbjct: 783 -TMDNQIAFDALYNTMLAARILGESQ-AYQDSLAVAFKQLPPMQIGRHNQIQEWL 835


>gi|383778158|ref|YP_005462724.1| hypothetical protein AMIS_29880 [Actinoplanes missouriensis 431]
 gi|381371390|dbj|BAL88208.1| hypothetical protein AMIS_29880 [Actinoplanes missouriensis 431]
          Length = 746

 Score =  323 bits (829), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 202/588 (34%), Positives = 308/588 (52%), Gaps = 53/588 (9%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-TNPDAPKAL 72
           ++ ++ PA  + +A+PIGNGRLG MV GGV +E ++L+E T W+G P D+  NP A +++
Sbjct: 3   RLLYDRPASRWFEALPIGNGRLGGMVHGGVGTEIIRLSESTAWSGAPSDHDVNPAAAQSI 62

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
             +R L+  G++AEA   A+  L G P      L    L  D + L  A+  YRRELDL+
Sbjct: 63  PVIRRLLFEGEHAEAQRLAAEHLTGRPTSFGTNLPLPRLRLDFA-LDQAD-GYRRELDLD 120

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           T  A V++      F RE F+S+P  VI  ++S S + ++SF  +LD  +   ++  G +
Sbjct: 121 TGLASVEFDQNQTHFVRETFASHPHGVIAMRLSASRAAAISFTAALDDTVLPGTFTGGAD 180

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
            +   GR        +   +D  +G+     + ++   D GT+ A +D  + V G+D   
Sbjct: 181 GLAFRGRAV------ETLHSDGEQGVDVE--IRVRFVIDGGTLLAADDT-VTVTGADVVD 231

Query: 252 LLLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           + +  S+SF  P  + P+                     Y  +   H++D+Q+L  RVS+
Sbjct: 232 VFVTVSTSFCAPSLVEPA--------------------PYEVMRAAHVEDHQRLMRRVSL 271

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            L  +P D+ TD             ER+   + D+D  L+ L FQ+GRYL I+ SR  + 
Sbjct: 272 DLG-TPIDLPTDV----------RRERLARGERDDD--LIALYFQYGRYLTIAGSRADSP 318

Query: 371 VA-NLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           +   LQG+WN+  + +  W +  H++IN + NYW +   NL+EC  PLF FLT L+ +G 
Sbjct: 319 LPLALQGVWNDGFASSMGWSNDFHLDINTQQNYWAAESTNLAECHTPLFRFLTGLASSGR 378

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
            TAQ  Y A GWV H  T+ W  S+  RG + W L   GGAWL   LWEHY Y  D  FL
Sbjct: 379 STAQQMYGADGWVAHTVTNAWGYSAPGRG-IGWGLNVTGGAWLALQLWEHYEYRPDVRFL 437

Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
             +AYP+L  CA FLLD+L  E   G+L   PS SPE+ ++A DG    ++  +T D   
Sbjct: 438 RDQAYPVLRSCALFLLDYLTPEPSHGWLVAGPSESPENSYLAADGTPCSIAMGTTADRVF 497

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
              +      AA +L+ + + L  +V  +  RL P +I   G + EW+
Sbjct: 498 AEAILRICGQAAAILDVDPE-LRSRVAAARDRLSPFRIGRHGQLQEWL 544


>gi|347840685|emb|CCD55257.1| glycoside hydrolase family 95 protein [Botryotinia fuckeliana]
          Length = 747

 Score =  322 bits (826), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 201/596 (33%), Positives = 313/596 (52%), Gaps = 60/596 (10%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           + +  PAK +++++PIGNGRLGAMV+GG+  ETL+LNE+++W G P D T  DA + L  
Sbjct: 10  LHYTSPAKEWSESLPIGNGRLGAMVYGGISRETLQLNENSIWYGGPQDRTPKDAFRNLDR 69

Query: 75  VRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           +R  +  G + EA   + + F    H    Y+ LG + L+      K ++  Y R L+L+
Sbjct: 70  LRHFIRIGDHTEAEKLAEQAFFATPHSQRHYEPLGTLTLDLGHDPAKVSK--YWRGLELS 127

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--------LDSLLDN 183
           TA    +Y    V   R  F+S PD V+V ++  SE    +  +S         D  +D+
Sbjct: 128 TANVTTEYEHLGVRHKRTVFASYPDDVLVVQLESSEKAQFTIRLSRYSDREFATDEFVDS 187

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
               +G   I+M G  PG R     N+N+      F  ++ ++     G +  + +    
Sbjct: 188 IEAQDGT--IVMHG-TPGGR-----NSNN------FCCVVSVQELAGDGNVETVGN--CV 231

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           +  S  A++++ A ++F       +D +     ++ +AL S     ++DL  RH+ DY  
Sbjct: 232 IVNSSKAIIIISAQTTF-----RYTDVEAKTLIQARNALHS-----HADLSKRHVQDYSS 281

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           L+ R  ++L      I             P+ ER+    T  DP LV L   +GRYLLIS
Sbjct: 282 LYGRFKLRLFPDAAHI-------------PTNERL---LTSPDPGLVALYANYGRYLLIS 325

Query: 364 SSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
            SRPG +   A LQG+WN    P W S   +NIN +MNYW +  CNL EC++PLFD L  
Sbjct: 326 CSRPGDKALPATLQGLWNPSFQPAWGSKYTININTQMNYWPANVCNLEECEDPLFDMLER 385

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           ++  G KTA+V Y   GW  H  TDIWA +      +   LWPM GAWLCTH+W+ + + 
Sbjct: 386 MANRGEKTARVMYGCRGWASHSCTDIWADTDPQDRWMPGTLWPMSGAWLCTHIWQRHLFG 445

Query: 482 MDRDF-LEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYS 539
            D++    +R +P+L G   F+LD+L++   G YL TNPS SPE+ +I   G+   +   
Sbjct: 446 GDQNLKFLQRMFPVLRGSVQFILDFLVKDSSGDYLITNPSLSPENSYIDLKGQKGVLCEG 505

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S +D+ II+ +F A + + + L+  +D L E +  +  +L P++I E G + EW+Q
Sbjct: 506 SAIDIQIIKSLFKAFLLSVDSLQM-KDELTEPLKLARDKLPPSEIGEFGQLQEWLQ 560


>gi|218129730|ref|ZP_03458534.1| hypothetical protein BACEGG_01309 [Bacteroides eggerthii DSM 20697]
 gi|217988142|gb|EEC54466.1| hypothetical protein BACEGG_01309 [Bacteroides eggerthii DSM 20697]
          Length = 1063

 Score =  322 bits (826), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 197/585 (33%), Positives = 307/585 (52%), Gaps = 43/585 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ ++ PA  + +A+P+GN RLGAMV+GG   E ++LNE+T W G P    NP    AL
Sbjct: 271 MKLWYSAPAHRWVEALPVGNSRLGAMVYGGTDKEEIQLNEETFWAGGPYSNDNPKGKGAL 330

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           + VR LV + + +EA     + F  G     +  +G +   F +       E Y RELD+
Sbjct: 331 AKVRELVFANRLSEAQKMIDENFFTGQHGMRFLTMGSL---FINQPEHKNVENYYRELDI 387

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A  +Y V  V +TR  FSS  D VIV ++   +  +L+F++S +S L +     GN
Sbjct: 388 ENAVAVTRYMVDGVTYTRTVFSSFADDVIVVRMEADKPKALNFDLSYNSPLKHAVTAKGN 447

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
             I+   +C G           + +GI  +   E ++       S   ++ + V  +  A
Sbjct: 448 ELIV---KCEGA----------EQEGIPAALNAECRVLVKHNGKSGKSNESVVVNQATVA 494

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            L + A+++F    +N  D   + +    ++L+    + Y      H+  Y+K F RV  
Sbjct: 495 TLYISAATNF----VNYHDVSGNASKLVSTSLKRAVKIPYEQALANHIAAYKKQFDRVKF 550

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            +        T+T       T+ + +RV +F   +D +L+ L+FQ+GRYLLISSS+PG Q
Sbjct: 551 SIPS------TET------STLETDKRVAAFGEGKDQNLMALMFQYGRYLLISSSQPGGQ 598

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQG+W   +   WDS   +NIN EMNYW +   NLSE  +PLFD ++ LS++G KTA
Sbjct: 599 PANLQGLWCNSVYAPWDSKYTININTEMNYWPAEVTNLSENHQPLFDMVSDLSVSGKKTA 658

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
           +  Y A GWV HH TD+W ++        + +WP GGAWL  HLW+HY +T D++FL +R
Sbjct: 659 ETVYGARGWVAHHNTDLW-RACGPIDAAYFGMWPNGGAWLTQHLWQHYLFTGDKEFL-RR 716

Query: 491 AYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
            YP+++G A F L  L++   +G+L T PS SPEH +        C     TMD  I  +
Sbjct: 717 YYPVMKGAADFYLSHLVKHPQNGWLVTAPSVSPEHGYAGSSITAGC-----TMDNQIAFD 771

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
                + AA +L +++ A  + +  +  +L P +I     + EW+
Sbjct: 772 ALYNTMLAARILGESQ-AYQDSLAVAFKQLPPMQIGRHNQLQEWL 815


>gi|312131915|ref|YP_003999255.1| alpha-l-fucosidase [Leadbetterella byssophila DSM 17132]
 gi|311908461|gb|ADQ18902.1| Alpha-L-fucosidase [Leadbetterella byssophila DSM 17132]
          Length = 793

 Score =  322 bits (826), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 204/604 (33%), Positives = 317/604 (52%), Gaps = 69/604 (11%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           + F  PA+HFT+++P+GNGRLGAMV+G    E + LNE +LW+G P D    +A K+L  
Sbjct: 23  LLFYAPARHFTESLPLGNGRLGAMVFGQTAKERIALNEISLWSGGPQDADREEAYKSLKP 82

Query: 75  VRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLKYAE 121
           ++ L+  G+  EA     K F               P   YQ LGD+ LE+ D  +    
Sbjct: 83  IQQLLLEGKNKEAQTLLEKEFIAKGRGSGFGRGAKDPYGSYQTLGDLFLEWKDGEVS--- 139

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             Y+R LDL+ A A  +++   ++ T E F+   + +I  ++  S++  L   V L S  
Sbjct: 140 -NYKRWLDLDNALATTQFTRNGIQITEEVFTDFKNDLIWVRLRSSKAKGLYLKVGL-SRE 197

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
           +N      + +I + G+ P         A  +P G++F+AIL+           A  D K
Sbjct: 198 ENAQVQADSKEIKLWGQLP---------AGSEP-GMKFAAILQ----------EAHVDGK 237

Query: 242 LKVEGSDW-------AVLLLVASSSF-DGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
           ++VEG+ W        +L + A++++ +G  I     ++D T ++    Q  + L+YS  
Sbjct: 238 VEVEGNTWNIVGASEVILQISAATNYHEGKLI-----EEDVTQKARKYFQ--KGLTYSAA 290

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVEL 352
           +   L+ +Q  FHR  +QL             ++ +  + + +R+K   +   D  L  L
Sbjct: 291 FKSSLEKFQSYFHRSELQLK-----------GQDKLAHLSTPDRLKRLAEGKSDLDLYAL 339

Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
            + +GRYLLI SSRPG   ANLQG+W  +    W+   H+NIN++MNYW +    L E  
Sbjct: 340 YYHYGRYLLICSSRPGLLPANLQGLWAVEYQAPWNGDYHLNINVQMNYWPAELTGLGELA 399

Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
           EPL  F   L  NG KTA+  Y A GWV H  ++ W  +S   G   W     GGAWLC 
Sbjct: 400 EPLHRFTANLVKNGEKTAKAYYQAEGWVAHVISNPWFFTSPGEG-ADWGSTLTGGAWLCE 458

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDG 531
           H+WEHY +T D +FL K  YP+L+G A FL   LIE   +G+L T PS SPEH ++ PDG
Sbjct: 459 HIWEHYRFTKDIEFLRKY-YPVLKGSAQFLSSILIEEPKNGWLVTAPSNSPEHAYVLPDG 517

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
                +   TMDM I RE+F+A+I +AE+L  +++   +++   +  L P ++ ++G + 
Sbjct: 518 TKVNTAMGPTMDMQICRELFNAVIQSAEILGVDKE-FRDELSAKVRNLAPNRVGKNGDLN 576

Query: 592 EWVQ 595
           EW++
Sbjct: 577 EWLE 580


>gi|431798012|ref|YP_007224916.1| hypothetical protein Echvi_2666 [Echinicola vietnamensis DSM 17526]
 gi|430788777|gb|AGA78906.1| hypothetical protein Echvi_2666 [Echinicola vietnamensis DSM 17526]
          Length = 819

 Score =  322 bits (826), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 201/586 (34%), Positives = 305/586 (52%), Gaps = 41/586 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA  + +A+PIGNGRLGAMV+G    E ++LNE+TL+ G P    NPDA +AL
Sbjct: 30  LKLWYDDPAASWVEALPIGNGRLGAMVFGDPYEEVIQLNENTLYAGRPHRNDNPDAKEAL 89

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           ++V+S++  GQY  A     + F  G     YQ +G ++L FDD       + YRRELDL
Sbjct: 90  AEVQSMIFDGQYGAAQHRINETFFSGINGMPYQTMGQLKLYFDDER---EVKEYRRELDL 146

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A     Y  G+  FT +  +S+PDQV+V  ++  + G++ F   +D           N
Sbjct: 147 KKALVTTHYKKGDTHFTTQVLASHPDQVMVIHLTADKPGAIHFTALVDRPGPFQLQHAAN 206

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
            +++M G           +      G++F+  + +K S      +    + + V  ++ A
Sbjct: 207 GELLMTGTS--------GDHEGIKGGVEFATRVRVKHSKGEMVKTG---EGIAVNNANSA 255

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            + +  +++F        D   +    S   L+     S+  +   H +D+++ F RVS+
Sbjct: 256 TIYISMATNFK----QYDDISGNAVELSKQHLEKALGKSFDQIRKSHEEDHRRYFDRVSL 311

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            L             E   +  P+ +RV++F   +DP L  L FQFGRYLLI++SR G Q
Sbjct: 312 DLG------------ESEAEKDPTDKRVENFSKRDDPGLAALYFQFGRYLLIAASRAGGQ 359

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWN+ L+P WDS   VNIN EMNYW S   +LSE  EPL + +  LS  G KTA
Sbjct: 360 PANLQGIWNDQLNPAWDSKYTVNINTEMNYWPSEITHLSEMNEPLVEMVRELSQTGRKTA 419

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
           +  Y A GW +HH TD+W  +    G   W +WPMGGAWL  HL + ++++ D  +L K 
Sbjct: 420 KDMYGARGWAMHHNTDLWRITGPVDG-AFWGMWPMGGAWLTQHLLDKFDFSGDTTYL-KS 477

Query: 491 AYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHE-FIAPDGKLACVSYSSTMDMAIIR 548
            YP+L+    F LD L +    G+    PS SPE+  ++  D   A V    TMD  ++ 
Sbjct: 478 IYPILKEACLFYLDILKVAPETGWKVVVPSISPENAPYLDHD---ASVGAGHTMDNQLLS 534

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           ++F     AA +L+  + A  E++  S   L P +I   G + EW+
Sbjct: 535 DLFQRTSRAASILD--DKAFAEQLKDSWALLAPMQIGRWGQLQEWM 578


>gi|325263746|ref|ZP_08130479.1| fibronectin type III domain protein [Clostridium sp. D5]
 gi|324030784|gb|EGB92066.1| fibronectin type III domain protein [Clostridium sp. D5]
          Length = 769

 Score =  322 bits (824), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 211/605 (34%), Positives = 309/605 (51%), Gaps = 57/605 (9%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           +I F   A+ +T+A+PIGNG LGAMV+G    E +++NED++W+G   +  NPDA   L 
Sbjct: 3   EIWFRKEAEEWTEALPIGNGFLGAMVFGRTSVERIQVNEDSVWSGGYMERLNPDAKGHLD 62

Query: 74  DVRSLVDSGQYAEATA-ASVKLFG-HP-ADVYQLLGDIELEFDD--------------SH 116
           +VR L+  G+  EA   AS  ++  +P    YQ LGD+ ++F +              S 
Sbjct: 63  EVRQLLMQGRVQEAELLASRSMYAVYPHMRHYQTLGDVWIDFFNTRGRQTVKKKENGTSF 122

Query: 117 LKYAE---ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
           ++Y     E YRR L+L  A   + Y+       RE F+S+P  V+V ++   E  +L F
Sbjct: 123 VEYESPVFEEYRRSLNLEDAVGNIVYTAEKGAVKREFFASSPAGVLVYRMCAEEDEALDF 182

Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGK----RIPPKANANDDPKGIQFSAILEIKISD 229
            VSL +  DN S   G      +G         R+  K   ND   GI F   + ++I+ 
Sbjct: 183 EVSL-TRKDNRS---GRGSSFCDGTMAVGDDTIRLYGKNGGND---GIAFE--MAVRIAS 233

Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
             G    +    + VEG+  AVL +   +++           KDP +  M  L+    L 
Sbjct: 234 VGGRQYRM-GSHIIVEGAKEAVLYITGRTTY---------RSKDPAAWCMETLEKAAGLP 283

Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPS 348
           Y +L  +HL+DY  L++             V +   EE ++ + + ER+   +T  ED  
Sbjct: 284 YEELKMQHLEDYHSLYN-----------SCVLELDEEEELEQLSTPERLARMRTGKEDVG 332

Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
           LV L + FGRYLLISSSR  +  ANLQGIWNED  P W S   +NIN++MNYW +    L
Sbjct: 333 LVNLHYNFGRYLLISSSRENSLPANLQGIWNEDFEPAWGSKYTININIQMNYWMAEKTGL 392

Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGA 468
           S    PL + L  +  +G +TA+  Y A G+  HH TDIW   +     V   +WPMGGA
Sbjct: 393 SRLHMPLLEHLKTMRPHGQETAEKMYGARGFCCHHNTDIWGDCAPQDSHVSATIWPMGGA 452

Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIA 528
           WLC H+ EHY YT DR F+E+  Y +L     F  D++++   G+  T PS+SPE+ ++ 
Sbjct: 453 WLCLHIIEHYLYTKDRVFMEE-FYGILRDSVQFFADYMVQDEQGHWITGPSSSPENIYMN 511

Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
             G+  C+     MD  I+RE+FS  +   E L++  D L  +V   L  L P KI + G
Sbjct: 512 EQGECGCLCMGPAMDSEILRELFSGYLRITEELDRG-DGLEAEVKMRLEGLPPVKIGKYG 570

Query: 589 SIMEW 593
            I EW
Sbjct: 571 QIQEW 575


>gi|357043574|ref|ZP_09105265.1| hypothetical protein HMPREF9138_01737 [Prevotella histicola F0411]
 gi|355368238|gb|EHG15659.1| hypothetical protein HMPREF9138_01737 [Prevotella histicola F0411]
          Length = 808

 Score =  322 bits (824), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 206/608 (33%), Positives = 308/608 (50%), Gaps = 55/608 (9%)

Query: 7   TSTTNP----LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           TST N     + + ++ PA+ F +++P+GNG+LGA+++GG  ++T+ LN+ T WTG P  
Sbjct: 14  TSTINAQQQSMLLWYDHPAQFFEESLPMGNGKLGALIYGGTKNDTIYLNDITYWTGKP-- 71

Query: 63  YTNPDAPKALS----DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLK 118
             NP+     S     +R  + +  Y  A +    + G  +  YQ LG   L    +   
Sbjct: 72  -VNPNEGIGKSVWIPRIREALFAENYRLADSLQHYVQGEQSASYQPLGTFNL---INLTP 127

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
            A + YRREL++++A A V Y    V + +E+F S  D +I  +I+ ++ G ++F +SL 
Sbjct: 128 GAIQNYRRELNIDSAMAHVSYQQDGVTYKKEYFVSQSDSLIAIRITANKPGKVNFKISLT 187

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
           + +  H     + Q+ M G   GK            +     A   ++++   G  S   
Sbjct: 188 AQVP-HKTKASDEQLTMIGHATGK------------ENETIHACTIVRLTHKEGQDSH-T 233

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D  L VE +D A L +V ++SF+G   +P D   D  + ++ A    +N +Y++   RH+
Sbjct: 234 DSTLTVENADEATLYIVNATSFNGFNKHPVDDGADYMNNAIDAAWHTKNFTYNEFKQRHI 293

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP-------SLVE 351
           + YQ+L+ R+++QL     D           + +P+ E +K + T   P        L  
Sbjct: 294 NAYQRLYQRLNLQLGHDKYD-----------NNIPTDELLKKYSTPHTPLSVAAQRYLET 342

Query: 352 LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
           L FQFGRYLL+S SR     ANLQG+W   L   W     +NINLE NYW +   N+SE 
Sbjct: 343 LYFQFGRYLLLSCSRTPGVPANLQGLWTPYLFSPWRGNYTMNINLEENYWPANSTNISET 402

Query: 412 QEPLFDFLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSS-ADRGKVV--WALWPMGG 467
            +PLF FL  L+ NG  TA   Y +  GW   H +DIW K++    GK    WA W +GG
Sbjct: 403 IQPLFSFLKGLAANGKYTAHNFYGVNEGWCASHNSDIWCKTAPVGEGKESPEWANWNLGG 462

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHE 525
           AWL   LW++Y YT D   L+   YPL+EG + F   WLIE   H G L T PST+PE+E
Sbjct: 463 AWLVNTLWDYYLYTQDFQMLKSTIYPLMEGASRFCKQWLIENPKHPGELITAPSTTPENE 522

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
           ++   G      Y  T D+AIIRE+F     A  +L    D  +   LK   RL P  I 
Sbjct: 523 YLTDKGYHGTTCYGGTADLAIIRELFENTQQARRILNIKPDKQLNNTLK---RLHPYTIG 579

Query: 586 EDGSIMEW 593
            +G + EW
Sbjct: 580 AEGDLNEW 587


>gi|294674990|ref|YP_003575606.1| hypothetical protein PRU_2351 [Prevotella ruminicola 23]
 gi|294471732|gb|ADE81121.1| conserved hypothetical protein [Prevotella ruminicola 23]
          Length = 769

 Score =  322 bits (824), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 204/595 (34%), Positives = 308/595 (51%), Gaps = 48/595 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKA 71
           + + +N PA  F +++PIGNG++GA+++GG     + LN+ TLWTG P D   + DA K 
Sbjct: 1   MVLEYNKPATFFEESLPIGNGKMGALIYGGTDDNVIYLNDITLWTGKPVDRNLDADAHKW 60

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL-EFDDSHLKYAEETYRRELDL 130
           + ++R  + +  YA A +  + + G  +  YQ LG + + +     +KY    YRR LD+
Sbjct: 61  IPEIRKALFNENYALADSLQLHVQGPNSQHYQPLGTLHIKDLGLGEIKY----YRRTLDI 116

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           ++A  R  Y       TRE+F+SNPD++I  ++ G  +  ++    +      H   +G 
Sbjct: 117 DSAIVRDSYERDGRHITREYFASNPDKLIAIRLRGDINCQIALTAQVP-----HQVKSGL 171

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
            Q+ M G   G          D  +   F  IL +K   +     A  D  L +  +  A
Sbjct: 172 GQLTMTGHATG----------DAQESTHFCTILSVKTDGEM----AASDSSLTITKAKEA 217

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           ++ +V  +SF+G   +P     +      + L   +N+++ + Y RHL DY+ ++ RV I
Sbjct: 218 IIYIVNETSFNGFDKHPVREGANYLEAVTNDLWHTQNMTFDEFYARHLADYKTIYDRVKI 277

Query: 311 QLS---RSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSS 365
            L+   R+PKD+          D   + E +  +    D+ P L EL FQFGRYLLIS+S
Sbjct: 278 CLNKGGRNPKDLPGAK------DRRMTDEMLLDYTNGNDQTPYLEELYFQFGRYLLISAS 331

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           R     ANLQG+W   L   W     VNINLE NYW +   N++E  EPL  F+  L+ N
Sbjct: 332 RTKNVPANLQGLWAPQLWSPWRGNYTVNINLEENYWPAFVANMAEMAEPLDGFIAGLAAN 391

Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSSADRGK---VVWALWPMGGAWLCTHLWEHYNYT 481
           G  TA+  Y +  GW   H +DIWA ++    K     W+ W +GGAWL   LWE Y +T
Sbjct: 392 GKFTAKNYYNIHEGWCSSHNSDIWAMTNPVGEKNESPEWSNWNLGGAWLVNTLWERYQFT 451

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
            D+ +L+  AYPL++G A F L WLI+     G L T PSTSPE+E+    G      Y 
Sbjct: 452 QDKTYLKNIAYPLMKGAAQFCLRWLIDNPKQPGELITAPSTSPENEYKTDKGYHGTTCYG 511

Query: 540 STMDMAIIREVFSAIISAAEVLE-KNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            T D+AIIRE+F   I+A +VL  KN++     + ++L +L P  I   G + EW
Sbjct: 512 GTADLAIIRELFINTIAAGKVLGLKNKE-----MEQALAKLHPYTIGHMGDLNEW 561


>gi|288925542|ref|ZP_06419475.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
           (Alpha-1,2-fucosidase 2) [Prevotella buccae D17]
 gi|288337758|gb|EFC76111.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
           (Alpha-1,2-fucosidase 2) [Prevotella buccae D17]
          Length = 796

 Score =  322 bits (824), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 205/595 (34%), Positives = 309/595 (51%), Gaps = 38/595 (6%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
           PLK+ +N PA  F +A+PIGNGRLGA+V+GG  ++++ +N+ TLWTG P +     DA +
Sbjct: 26  PLKLWYNRPATAFEEALPIGNGRLGALVYGGADTDSIYINDLTLWTGKPVELNEGGDAHR 85

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEE--TYR 125
            +  +R  + +G Y  A      + GH ++ YQ   LL   +L    +  +  E+    +
Sbjct: 86  WIPVIRKELFAGNYKNADILQHLVQGHNSEYYQSLALLTLTDLNASPAEGRSGEDFGELK 145

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           R LD+++A  R  Y  G V + RE+F+S PD +I   I     G+++  ++L S++ +  
Sbjct: 146 RSLDIDSAVCRDTYHRGGVTYIREYFASAPDSLIAMHIRADRPGAINCCLALTSIVPHQV 205

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
              G  Q+ M G   G          D  + I F AIL++K SD  G ++A  D  L V 
Sbjct: 206 KATGR-QLTMTGHAIG----------DPLQSIHFCAILKVKTSD--GQVAA-SDSSLTVS 251

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G+    +  V  +SF+G   +P  +     +++   +    N++Y++   RH+ DY++LF
Sbjct: 252 GASEVTVYFVNRTSFNGFDKHPVTAGAPYLAKATDDIWHTENITYNECRQRHVADYKRLF 311

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            R    L  +  +    T  EE +          S Q + +P L  L  Q+GRYLLIS S
Sbjct: 312 DRFKFTLGGAKPNYSRTT--EEQL-------MAYSDQGERNPYLEMLYMQYGRYLLISCS 362

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           R     ANLQG+W       W     +NINLE NYW +   +L E   P+   +  ++  
Sbjct: 363 RTPGVPANLQGLWAPQKYSPWRGNYTININLEENYWPAEVTDLGELVMPVDGLVRAMAAT 422

Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           G  TA   Y +  GW   H +DIWA ++     +    W+ W MGGAWL   LW+HY++T
Sbjct: 423 GRHTAAHYYGIDEGWCAGHNSDIWAMTNPVGTGKESPKWSNWNMGGAWLVQTLWDHYDFT 482

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
            D  +L   AYPL++G A F+L WL+E     G L T P TSPE E+I   G   C  Y 
Sbjct: 483 RDTHYLRNTAYPLMKGAADFMLRWLVESPVKRGELITAPCTSPEAEYINDKGYQGCTFYG 542

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEW 593
            T D+AI+RE+F+  + AAE+L  N DA   + L+ SL  L P KI + G++ EW
Sbjct: 543 GTSDLAIVRELFTNTLRAAEIL--NIDAGYRQQLRSSLDHLAPYKIGKRGNLQEW 595


>gi|402087340|gb|EJT82238.1| hypothetical protein GGTG_02212 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 833

 Score =  322 bits (824), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 205/596 (34%), Positives = 313/596 (52%), Gaps = 46/596 (7%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
           S + PL++    P  +F D+  IGNGRLG  + GG  SE++ LNED+ W+G   D  NPD
Sbjct: 27  SASKPLRMWQTTPGVNFNDSFLIGNGRLGFSLPGGALSESIVLNEDSFWSGGEMDRVNPD 86

Query: 68  APKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETY 124
           A   + ++++L+  G+  EA+  AS+   G P  V  +  +G + +    S  +  +  Y
Sbjct: 87  AAAHMPEIQALIARGEIREASRLASMSYVGTPVSVRHFDWVGKLGISMRGSAGQVRD--Y 144

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-----S 179
            R LD+    A V Y+VG V + RE+ +S PD VI  +IS ++SG++SF++        +
Sbjct: 145 ERWLDVGEGVAGVYYTVGGVAYKREYTASFPDDVIAVQISANKSGAVSFDLHQSRGIGLN 204

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
           L  + +  +G + I+M G   G             K I F+A  ++ I  D G++  + D
Sbjct: 205 LFQDSAGGSGKDTILMGGGSFGA------------KAIVFAAGAKVTI--DGGSMKRIGD 250

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
             + V+G+D A +   A +++         S  +  S  M+ L       Y  L + H+ 
Sbjct: 251 T-IVVDGADSATIYWSAWTTY-------RKSAGELQSAVMADLSQASRKGYGALRSDHVK 302

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DYQ L  RV + L +S         SE+   T  +A+R++  +T  DP +  L F F RY
Sbjct: 303 DYQSLAGRVELSLGKS--------TSEQKAKT--TADRLRGLRTAFDPEIATLYFYFARY 352

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LLI+S RPGT  ANLQG+WN DL+P W S   +NINLEMNYW SL  N+ E  E +F+ +
Sbjct: 353 LLIASGRPGTLPANLQGLWNNDLNPMWGSKYTININLEMNYWPSLLTNMPELHESMFEHI 412

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
             +   G   A+  Y ASG V HH TDIW   +          WP G AW+ TH++EHY 
Sbjct: 413 MKMHEKGRDVAKRMYNASGSVCHHNTDIWGDCAPQDNYAASTFWPSGLAWMATHIYEHYQ 472

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-GKLACVSY 538
           +T D D L K  YP L   A F LD++ E HDG+L TNPS SPE  +  P+  +   ++ 
Sbjct: 473 FTGDVDVLRKY-YPALRDAAVFFLDFMTE-HDGHLVTNPSVSPEISYRLPNTTQSVALTL 530

Query: 539 SSTMDMAIIREVFSAIISAAEVL-EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             T D +II E+   ++ + ++L + + D + +++     RL P +  + G I E+
Sbjct: 531 GPTADNSIIWELVGMVLESQKILGDSDPDNIGQRLTGLRARLPPLRKDQYGGIAEF 586


>gi|326201460|ref|ZP_08191331.1| coagulation factor 5/8 type domain protein [Clostridium
           papyrosolvens DSM 2782]
 gi|325988060|gb|EGD48885.1| coagulation factor 5/8 type domain protein [Clostridium
           papyrosolvens DSM 2782]
          Length = 1026

 Score =  321 bits (823), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 197/586 (33%), Positives = 302/586 (51%), Gaps = 57/586 (9%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
           F  A+P+GNGR+GAMV+G  P E + LNE T W+  PG+     A  +L   +  + +GQ
Sbjct: 76  FYKALPLGNGRIGAMVYGNYPDERIDLNEATFWSSGPGNNNRAGAANSLKTAQDQLFAGQ 135

Query: 84  YAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
           Y   +    K + G     YQ +GD++L F  S +      Y R+LD+NT      Y+  
Sbjct: 136 YTNGSTTIAKSMIGGGEAKYQSIGDLKLSFGHSSV----SNYSRQLDMNTGVVSSDYTYN 191

Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN--GNNQIIMEGRCP 200
             ++ RE F S PDQ++VTKI+ S  GS+S     +S L     V+  GN+ ++M G   
Sbjct: 192 GKKYHRESFVSYPDQIMVTKITCSSPGSISLTAGYESSLSGQYTVSTSGNDTLVMNGH-- 249

Query: 201 GKRIPPKANANDDPKGIQFSAILEI--KISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
                      D   GI ++       K+ +  G++SA  + ++ V  +D  V+L    +
Sbjct: 250 ----------GDSDNGISYAVWFSTRSKLINTNGSVSA-NNNQISVSNADSVVIL----T 294

Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
           S    +IN      D   ++ + + +    SY  L   H+ DYQ LF RV + L  S  +
Sbjct: 295 SIRTNYINYKTCNGDEKGKATTDITNASAKSYDTLLNNHVADYQSLFKRVDVDLGGSGSE 354

Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIW 378
                      ++ P ++R+  F +  DP L ++LFQ+GRYL+IS+SR  +Q  NLQGIW
Sbjct: 355 -----------NSKPMSQRISEFGSTNDPKLAKVLFQYGRYLMISASRD-SQPMNLQGIW 402

Query: 379 NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY-LAS 437
           N+  +P W      NIN EMNYW +   NL+EC EP  +    L   G++TA+ +Y +++
Sbjct: 403 NKFRNPAWGCKMTTNINYEMNYWPAFTTNLAECFEPFVEKAKALQAPGNETARAHYNISN 462

Query: 438 GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEG 497
           GWV+HH TD+W +++   G+  W  WP G  W+   L++ YN+  D  +L +  YP+++G
Sbjct: 463 GWVLHHNTDLWNRTAPIDGE--WGFWPTGAGWVSNMLYDAYNFNQDTAYLNE-IYPVIKG 519

Query: 498 CASFLLDWL----IEGHDGYLETNPSTSPEHEFIAP----DGKLACVSYSSTMDMAIIRE 549
            A FL   +    I G + Y    P TSPE   + P     G+ A  SY  TMD  I RE
Sbjct: 520 AADFLQTLMQSKSINGQN-YQVICPGTSPE---LTPPGNSGGQGAYNSYGVTMDNGISRE 575

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWV 594
           +F A+I AA +L  N D+     L+S + +++P  I   G + EW 
Sbjct: 576 LFKAVIQAAGIL--NIDSSFRSTLQSKVSQIKPNTIGSWGQLQEWA 619


>gi|315606675|ref|ZP_07881686.1| alpha-L-fucosidase [Prevotella buccae ATCC 33574]
 gi|315251685|gb|EFU31663.1| alpha-L-fucosidase [Prevotella buccae ATCC 33574]
          Length = 807

 Score =  321 bits (823), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 204/595 (34%), Positives = 310/595 (52%), Gaps = 38/595 (6%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
           PLK+ +N PA  F +A+PIGNGRLGA+V+GG  ++++ +N+ TLWTG P +     DA +
Sbjct: 37  PLKLWYNRPATAFEEALPIGNGRLGALVYGGADTDSIYINDLTLWTGKPVELNEGGDAHR 96

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEE--TYR 125
            +  +R  + +G Y  A      + GH ++ YQ   LL   +L    +  +  E+    +
Sbjct: 97  WIPVIRKELFAGNYKNADILQHLVQGHNSEYYQSLALLTLTDLNASPAEGRSGEDFGELK 156

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           R LD+++A     Y  G V + RE+F+S PD +I  +   + SG+++  ++L S++ +  
Sbjct: 157 RSLDIDSAVCSDTYHRGGVTYIREYFASAPDSLIAIRFRANRSGAINCRLALTSVVPHQV 216

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
              G  Q+ M G   G          D  + I F AIL++K  D  G ++A  D  L V 
Sbjct: 217 KATGR-QLTMTGHAIG----------DPLQSIHFCAILKVKTDD--GQVAA-SDSSLTVN 262

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G+    +  V  +SF+G   +P  +     +++   +    N++Y++   RH+ DY++LF
Sbjct: 263 GASEVTVYFVNRTSFNGFDKHPVTAGAPYLAKATDDIWHTENITYNECRQRHVADYKRLF 322

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            R    LS +  +    T  EE +          S Q + +P L  L  Q+GRYLLIS S
Sbjct: 323 DRFKFTLSGAKPNYSRTT--EEQL-------MAYSDQGERNPYLEMLYMQYGRYLLISCS 373

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           R     ANLQG+W       W     +NINLE NYW +   +L E   P+   +  ++  
Sbjct: 374 RTPGVPANLQGLWAPQKYSPWRGNYTININLEENYWPAEMTDLGELVMPVDGLVRAMAAT 433

Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           G  TA   Y +  GW   H +DIWA ++     +    W+ W MGGAWL   LW+HY++T
Sbjct: 434 GRHTAAHYYGIDEGWCAGHNSDIWAMTNPVGTGKESPKWSNWNMGGAWLVQTLWDHYDFT 493

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
            D  +L   AYPL++G A F+L WL+E     G L T P TSPE E+I   G   C  Y 
Sbjct: 494 RDTHYLRNTAYPLMKGAADFMLRWLVESPVKRGELITAPCTSPEAEYINDKGYQGCTFYG 553

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEW 593
            T D+AI+RE+F+  + AAE+L  N DA   + L+ SL  L P KI + G++ EW
Sbjct: 554 GTSDLAIVRELFTNTLRAAEIL--NIDAGYRQQLRSSLDHLAPYKIGKRGNLQEW 606


>gi|325680593|ref|ZP_08160136.1| hypothetical protein CUS_5001 [Ruminococcus albus 8]
 gi|324107730|gb|EGC02003.1| hypothetical protein CUS_5001 [Ruminococcus albus 8]
          Length = 759

 Score =  321 bits (823), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 206/587 (35%), Positives = 307/587 (52%), Gaps = 49/587 (8%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA  +  A+P+GNGR+GAMV+     E ++LNED++W+G   +  N  A   L  VR
Sbjct: 9   YKTPADDWNKALPLGNGRIGAMVFSQPLEERIQLNEDSVWSGGFRERNNKSALPNLEKVR 68

Query: 77  SLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYR-RELDLNT 132
            L+   +  EA       F G P +   Y  LGD+ +     H K +E  ++ R LDLNT
Sbjct: 69  KLLFEEKINEAEKIIYDAFCGTPVNQRHYMPLGDMNV----IHYKESECDFKSRSLDLNT 124

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS---LLDNHSYVNG 189
           A    +Y++  V++TRE F S PDQV+V  I+ SE  ++S  V +D      D++S V+ 
Sbjct: 125 AVCTTEYAINGVDYTREVFISQPDQVLVMHITASEKKAISVRVRIDGRDDYFDDNSPVHD 184

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           N+ +   G           + ++D  GI F+A   IK+    G +       +  E  D 
Sbjct: 185 NDILFYGG-----------SGSED--GINFAAY--IKVLHKGGKVYPY-GSFITCEDCDE 228

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
             +LL A +S+           +D   +++  ++     +Y+ L   H+ DY+  + R +
Sbjct: 229 VTILLGAQTSY---------RCEDYKGQAVFDVERAEEKTYAQLKADHIADYKSYYDRAN 279

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPG 368
           I L         D  S  +  T+P+ +R+    + + D  L+E+   FGRYLLI+ SR  
Sbjct: 280 ISLC--------DNSSGNS--TLPTDKRLALVKEGNPDNKLIEMYHNFGRYLLIAGSREK 329

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           T   NLQGIWN+D+ P W     +NIN EMNYW +  CNLSE   PL D +  L  NG K
Sbjct: 330 TLPTNLQGIWNKDMWPAWGCKFTININTEMNYWCAENCNLSELHMPLIDHIEKLRPNGRK 389

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           TA+  Y   G+V HH TDIW  ++     +    WPMG AWLC H+WEHY Y  DR+FL 
Sbjct: 390 TARNMYGCRGFVCHHNTDIWGDTAPQDLWIPGTQWPMGAAWLCLHIWEHYLYVQDREFLS 449

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
           ++ Y  L+  A F LD+LIE   G L T PS SPE+ ++   G    +    +MD  II 
Sbjct: 450 EK-YDTLKEAAEFFLDFLIEDKKGRLVTCPSVSPENTYLTASGSKGSICIGPSMDSQIIY 508

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           E+F+A+  A+++LE  +    +KVL++  RL   +I + G IMEW +
Sbjct: 509 ELFTAVAEASKILE-TDGGFRKKVLEARDRLPAPEIGKYGQIMEWAE 554


>gi|333381846|ref|ZP_08473525.1| hypothetical protein HMPREF9455_01691 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332829775|gb|EGK02421.1| hypothetical protein HMPREF9455_01691 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 808

 Score =  320 bits (820), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 209/594 (35%), Positives = 298/594 (50%), Gaps = 57/594 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +N PAK + +A+P+GN RLG MV+G    E L+LNE+T+W G P    NP A  AL
Sbjct: 24  LKLWYNTPAKIWEEALPLGNSRLGVMVYGIPEKEELQLNEETIWGGGPYRNDNPKALGAL 83

Query: 73  SDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            + R L+  G+  EA     + F     G P   +Q  G + L F   H  Y  + Y RE
Sbjct: 84  PEARELIFKGKSREADQLINRTFFTKTHGMP---FQTAGSVILNFP-GHQNY--QDYSRE 137

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           LDL+ A A  +Y+V  V++TRE FSS  D VI+ +I+    G+L+F     +    H+  
Sbjct: 138 LDLDKALAITRYTVNGVKYTREVFSSFADDVIIMRITAGRKGTLNFETEYTNN-SQHTIS 196

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
             +N +I+EG+              D +GI      E KI     T+    D K++V GS
Sbjct: 197 KKDNILILEGK------------GSDHEGI------EGKIRYQIHTLIRNHDGKIEVTGS 238

Query: 248 DWAVLLLVASS---SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
             ++     ++   S    F+N    + DP  ++  AL       Y      H D Y K 
Sbjct: 239 KISISGATVATIYISIGTNFLNYKSVEGDPAKKASDALAKALKTDYRSALKNHSDIYGKQ 298

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F R  + L   P+ +   T            +R+  FQ + DP+LV LL QFGRYLLI S
Sbjct: 299 FKRFKLDLGNVPEAMKLTTT-----------QRIIDFQKNHDPALVTLLTQFGRYLLICS 347

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+ G Q ANLQGIW   + P WDS   +NIN EMNYW +   NLSE   P+   +  LS 
Sbjct: 348 SQLGGQPANLQGIWCNSMHPAWDSKYTININAEMNYWPAEVTNLSETHLPMIQMVKDLSE 407

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           +G +TA+  Y A GWV HH TDIW  +S         +WP GGAWL  HLWEHY +T D+
Sbjct: 408 SGQQTAKTMYGARGWVAHHNTDIWRVTSPVDFAAA-GMWPTGGAWLVQHLWEHYLFTGDK 466

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
            +L    YP ++G A + L  L+E    G++   PS SPEH           +S   TMD
Sbjct: 467 KYLAD-VYPAMKGAADYFLSSLVEHPQYGWMVVCPSVSPEH---------GPMSAGCTMD 516

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRR 597
             ++ +V +    A  +L +NE+    ++L  + +L P  I +   + EW++ +
Sbjct: 517 NQLVFDVLTRTAQANNILGENEE-YRNQLLAMVSKLPPMHIGKYSQLQEWLEDK 569


>gi|329962425|ref|ZP_08300425.1| hypothetical protein HMPREF9446_02012 [Bacteroides fluxus YIT
           12057]
 gi|328529981|gb|EGF56869.1| hypothetical protein HMPREF9446_02012 [Bacteroides fluxus YIT
           12057]
          Length = 827

 Score =  320 bits (819), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 201/604 (33%), Positives = 314/604 (51%), Gaps = 54/604 (8%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           ++    N LK+ ++ PA  + +A+P+GNGRLGAMV+G   +E  +LNE+T+W G P + T
Sbjct: 20  QAQQQENNLKLWYDKPATQWVEALPLGNGRLGAMVFGDPANEQFQLNEETVWGGSPYNNT 79

Query: 65  NPDAPKALSDVRSLVDSGQYAEATA------ASVKLFGHPADVYQLLGDIELEFDDSHLK 118
           NP A  AL  +R L+  G+ AEA A       S    G P   YQ +G + L+F+ +   
Sbjct: 80  NPKAKDALPRIRQLIFEGRNAEAQALCGPGICSQSANGMP---YQTVGSLHLDFEGTS-- 134

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
                Y RELDL  A    +++ G + +TRE ++S P+Q++V +++ S+  S+SF     
Sbjct: 135 -GYTNYYRELDLEKAVTTTRFTAGGITYTREAYTSFPEQLLVIRLTASQKKSISFTAR-- 191

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ----FSAILEIKISDDRGTI 234
                  Y     + +     P K +     AND  +GI+    F+A+   +I +  G++
Sbjct: 192 -------YTTPYKKNVERSISPDKELQLDGKANDH-EGIEGKVRFTAL--TRIENSGGSL 241

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL-QSIRNLSYSDL 293
             L D  L+V+ ++ +V L V   S    F+N  D   D  + +   + Q+ +N +   L
Sbjct: 242 EVLSDSTLQVKNAN-SVTLYV---SIGTNFVNYKDVSGDALATARKYMKQAGKNYTKGKL 297

Query: 294 YTRHLDDYQKLFHRVSIQL-SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
              H++ Y+K F RVS+ L S +  D  TD              RVK F    DP +  L
Sbjct: 298 --AHINAYRKYFDRVSLNLGSNAQADKPTDV-------------RVKEFSGSFDPQMAAL 342

Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
            FQFGRYLLI SS+PG Q ANLQGIWN  L   WD     +IN+EMNYW +   +L E  
Sbjct: 343 YFQFGRYLLICSSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMH 402

Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
           EP    +  +++ G ++A + Y   GW +HH TDIW  + A  G   + +WP   AW C 
Sbjct: 403 EPFLQLVKEVALTGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGP-GYGIWPTCNAWFCQ 460

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
           HLW+ Y ++ D+ +L +  YPL+ G   F LD+L+ E  + +L   PS SPE+  +    
Sbjct: 461 HLWDRYLFSGDKAYLAE-IYPLMRGACEFYLDFLVREPKNNWLVVAPSYSPENRPVVNGK 519

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
           +   V   +TMD  ++ ++F   I AA+++ +N  A  + +      L P ++   G + 
Sbjct: 520 RDFVVVAGTTMDNQMVYDLFYNTIQAAKLMNEN-IAFTDSLQAVSDHLAPMQVGRWGQLQ 578

Query: 592 EWVQ 595
           EW++
Sbjct: 579 EWME 582


>gi|220928668|ref|YP_002505577.1| family 6 carbohydrate binding protein [Clostridium cellulolyticum
           H10]
 gi|219998996|gb|ACL75597.1| Carbohydrate binding family 6 [Clostridium cellulolyticum H10]
          Length = 1164

 Score =  320 bits (819), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 199/586 (33%), Positives = 300/586 (51%), Gaps = 57/586 (9%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
           F  A+P+GNGR+GAMV+G  P E + LNE T W+  PG+     A   L   +  + +GQ
Sbjct: 76  FYKALPLGNGRIGAMVYGNYPDERIDLNEATFWSSGPGNNNRAGAANFLKTAQDQLFAGQ 135

Query: 84  YAEATAA-SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
           Y   +A  +  + G     YQ +GD++L F  S +      Y R+LD+NT      Y+  
Sbjct: 136 YKTGSATIANNMIGGGEAKYQSIGDLKLSFGHSSV----SNYSRQLDMNTGVVSSDYTYN 191

Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN--GNNQIIMEGRCP 200
             ++ RE F S PDQV+VTKI+ S  GS+S     +S L     V+  GN+ ++M G   
Sbjct: 192 GKKYHRESFVSYPDQVMVTKITCSSPGSISLTAGYESSLTGQYTVSTSGNDTLVMNGH-- 249

Query: 201 GKRIPPKANANDDPKGIQFSAILEI--KISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
                      D   GI ++       KI +  G++SA  + ++ V  +D  V+L    +
Sbjct: 250 ----------GDSDNGISYAVWFSTRSKIINSNGSVSA-NNNQISVSNADSVVIL----T 294

Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
           S    F+N      D   ++ + + +    SY  LY  H+ DYQ LF RV + L  S  +
Sbjct: 295 SIRTNFVNYKTCNGDEKGKATTDIANASAKSYDTLYNNHVTDYQNLFKRVDVDLGGSGSE 354

Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIW 378
                      +  P  +R+  F T  DP L ++LFQ+GRYL+IS+SR  +Q  NLQGIW
Sbjct: 355 -----------NGKPMGQRISEFGTTNDPKLAKVLFQYGRYLMISASRD-SQPMNLQGIW 402

Query: 379 NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY-LAS 437
           N+  +P W      NIN EMNYW +   NL+EC EP       L   G++TA+V+Y +++
Sbjct: 403 NKFRNPAWGCKMTTNINYEMNYWPAFTTNLAECFEPFVKKAKELQAPGNETARVHYNISN 462

Query: 438 GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEG 497
           GWV+HH TD+W +++   G   W  WP G  W+   L++ Y++  D  +L +  YP+++G
Sbjct: 463 GWVLHHNTDLWNRTAPIDGD--WGFWPTGAGWVSNMLFDAYSFNQDTVYLNE-IYPVIKG 519

Query: 498 CASFLLDWL----IEGHDGYLETNPSTSPEHEFIAP----DGKLACVSYSSTMDMAIIRE 549
            A FL   +    I G + Y    PSTSPE   + P     G+ A  SY  TMD  I RE
Sbjct: 520 AADFLQTLMQSKSINGQN-YQVICPSTSPE---LTPPGTSGGQGAYNSYGVTMDNGISRE 575

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWV 594
           +F  +I A+++L  N D+     L S + +++P  +   G + EW 
Sbjct: 576 LFKDVIQASKIL--NIDSSFRSTLASKVSQIKPNTVGSWGQLQEWA 619


>gi|262405238|ref|ZP_06081788.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294646990|ref|ZP_06724607.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
 gi|345508052|ref|ZP_08787692.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
 gi|229444703|gb|EEO50494.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
 gi|262356113|gb|EEZ05203.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292637661|gb|EFF56062.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
          Length = 811

 Score =  319 bits (817), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 204/591 (34%), Positives = 307/591 (51%), Gaps = 57/591 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PAK++++A+PIGN RLGAMV+GG   E L+LNE+T W G P +  NP+A   L
Sbjct: 22  LKLWYSQPAKNWSEALPIGNSRLGAMVYGGTEREELQLNEETFWAGSPYNNNNPNAVHVL 81

Query: 73  SDVRSLVDSGQYAEATA---ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
             VR L+  G+  EA     A+     H    Y  LG++ LEF     K A++ YR +L+
Sbjct: 82  PIVRKLIFEGRNKEAQRLIDANFLTRQHGMS-YLTLGNLYLEFPGH--KDADDFYR-DLN 137

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L  AT   +Y V  + +TR  F+S  D VI+  I  S+  +L+FNVS +  L N   V  
Sbjct: 138 LENATTTTRYQVNGINYTRTTFASFTDNVIIMHIKASQPNALNFNVSYNCPLKNEVNVQN 197

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           +  II    C GK          + +G++ +   E ++      I       L++ G   
Sbjct: 198 DKLIIT---CQGK----------EQEGMKAALRAECQVQVKTDGIIHPAGNILQINGGTE 244

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L + A++++    +N  +   D +  +   L+    + Y      H+  Y+K F RV 
Sbjct: 245 ATLYISAATNY----VNYQNVSADESRRTTDYLEEAILIPYEKALKEHIAFYKKQFDRVQ 300

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L  S                + +  R+++F    D ++  LLFQ+GRYLLISSS+PG 
Sbjct: 301 LHLPSS------------EASQIETPRRIENFGQGNDMAMAALLFQYGRYLLISSSQPGG 348

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
           A+  Y   GWV HH TD+W       G V +A   +WP GGAWL  H+W+HY +T +++F
Sbjct: 409 ARTMYDCWGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L K  YP+L+G A F +D+L+E H  Y  L  +PS SPEH           ++   TMD 
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPTYKWLVVSPSVSPEH---------GPITAGCTMDN 513

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I  +     + A+ +  +   +  + + ++L +L P +I +   + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563


>gi|315499511|ref|YP_004088314.1| alpha-l-fucosidase [Asticcacaulis excentricus CB 48]
 gi|315417523|gb|ADU14163.1| Alpha-L-fucosidase [Asticcacaulis excentricus CB 48]
          Length = 789

 Score =  318 bits (816), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 210/603 (34%), Positives = 315/603 (52%), Gaps = 52/603 (8%)

Query: 2   MNAESTSTTNP---LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG 58
           + A+S     P   L + +  PA  +  A+P+GNGRLG MV+GGV  E ++LNEDT + G
Sbjct: 24  VKAQSAPPEQPSPDLSLWYERPADEWVKALPVGNGRLGGMVFGGVAFERIQLNEDTFFAG 83

Query: 59  VPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFD-- 113
            P   TNP +   L  V+SL+  G+YAEA   A+  L   PA    YQ +GD+ L F   
Sbjct: 84  SPYTPTNPRSRDGLPQVQSLIFEGKYAEAERLANETLISQPAKQMAYQPVGDLILLFPGL 143

Query: 114 DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
           D+  KY      R LDL+   A  +++ G+    RE F S  DQV+V ++S  +  +++ 
Sbjct: 144 DNTSKYV-----RRLDLSEGVAVTEFNAGSNRHRREVFVSAVDQVMVVRLSSEKGKAITV 198

Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI--KISDDR 231
           ++SL +           + +I++G  P +            +GI+     E+  K+    
Sbjct: 199 DLSLSTPQKAEIDTIDGDTLIIKGVSPTQ------------QGIEGKLPFELRAKVIAPT 246

Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
           GT+++ E   + + G+  AV+L+ A++ +    +   D   DP+  +   +       Y+
Sbjct: 247 GTLTSREGG-VYISGAQDAVVLISAATGY----VRYDDISGDPSVLNAGRIAIAAAKGYA 301

Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE 351
            L   HL DY+ LF RVS+ L   P               +P+ +R+  +   +DP L  
Sbjct: 302 ALKADHLKDYKALFDRVSLSLGEGPNA------------RLPTDQRIARYGEGKDPGLAA 349

Query: 352 LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
           L  Q+GRYLL+SSSR   Q ANLQGIWN+ L+P+W S   +NIN +MNYW +  CNL+E 
Sbjct: 350 LYLQYGRYLLVSSSRGSRQPANLQGIWNDKLNPSWQSKWTLNINTQMNYWPAEMCNLTET 409

Query: 412 QEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLC 471
            +PL   +  L+  G+K A+  Y A GWV  + TD+W  +S   G  VWALWPMGGAWL 
Sbjct: 410 IDPLVCLVEDLAETGAKLAKDMYGAPGWVAFNNTDVWRVASPPDG-AVWALWPMGGAWLL 468

Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPD 530
            +LWE + Y  D  +L +R YPL++G + F    L++     Y+ TNPS SPE+    P 
Sbjct: 469 QNLWEPWLYNGDEAYL-RRIYPLMKGASEFYQATLLKDPRSDYMVTNPSNSPENRH--PF 525

Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
           G   C      MD  ++R++F+    AA+VL K + A     L    +L P KI + G +
Sbjct: 526 GSSVCA--GPAMDNQLLRDLFAHTAEAAKVL-KTDAAFARACLAMRSKLPPEKIGKAGQL 582

Query: 591 MEW 593
            EW
Sbjct: 583 QEW 585


>gi|325299782|ref|YP_004259699.1| alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
 gi|324319335|gb|ADY37226.1| Alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
          Length = 826

 Score =  318 bits (816), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 203/607 (33%), Positives = 318/607 (52%), Gaps = 53/607 (8%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           + N          KI ++ PA ++ +A+P+GNGR+ AMV+G    E L+LNE+T+  G P
Sbjct: 15  VCNVTGLCAQESYKIWYDKPAAYWEEALPVGNGRIAAMVFGNARMERLQLNEETVSAGSP 74

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEAT----AASVKLFGHPADVYQLLGDIELEFDDSH 116
               NP+A  AL ++R L+  G+  EA      A +   G+    YQ +G++ + + + H
Sbjct: 75  YQNYNPEAKAALPEIRRLIFEGKNEEAQLLAGKAIISQVGNEMP-YQTVGNLNIRYKN-H 132

Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
              ++  Y R+LD++ A A  +Y VG+ E+T E F+S  DQ+IV  I  S++G++  +V 
Sbjct: 133 ENVSD--YYRDLDISRAVATTRYRVGSTEYTEETFASFTDQLIVKHIKASKAGAIDCDVF 190

Query: 177 LDSLLDN-HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
            D+ +        G   + +EG   G +  P          + + A L++K+   +   S
Sbjct: 191 FDTPMKRPQRSAIGKKGLRLEGMADGTKFFPGK--------VHYCADLQVKLKGGKAETS 242

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
              D  L V+G+    L +  +++F    +N  D   DP   +   L++     Y    +
Sbjct: 243 --NDTLLSVKGATELTLYISMATNF----VNYKDVSADPYVRNRVYLKNAGK-EYEKAKS 295

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
            H+  Y++ F RV++ +  +P+       +++ +D      R+K F +  DP L+ L FQ
Sbjct: 296 AHIAAYREQFDRVTLDMGTTPQ-------ADKPMDV-----RIKEFASSYDPHLIALYFQ 343

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           +GRYLLISSS+PG Q ANLQG WN    P W+     NIN EMNYW +   NL E  EPL
Sbjct: 344 YGRYLLISSSQPGCQPANLQGKWNAKTKPAWNCNYTTNINTEMNYWPAEVTNLPELHEPL 403

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL---WPMGGAWLCT 472
              +  LS NG + A   Y   GWV+HH TD+W  +    G V +A    WP+  AWLC 
Sbjct: 404 IRMIRELSENGKEAASKMYGCRGWVLHHNTDLWRMT----GAVDYAYCGTWPVCNAWLCQ 459

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPD- 530
           HLW+ Y Y+ D+ +L K  YP+++  + F +D+L+ + + GYL   PS SPE+   AP  
Sbjct: 460 HLWDRYLYSGDKQYL-KEVYPIMKSASQFFVDFLVRDPNTGYLVVTPSNSPEN---APRW 515

Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDG 588
             K A +    TMD  ++ ++FS    AA VL  NED L    L+S+ R L P ++ + G
Sbjct: 516 IKKKANLFAGITMDNQLVFDLFSNTCRAASVL--NEDTLFCDTLRSMRRQLPPMQVGQYG 573

Query: 589 SIMEWVQ 595
            + EW +
Sbjct: 574 QLQEWFE 580


>gi|281422553|ref|ZP_06253552.1| putative large secreted protein [Prevotella copri DSM 18205]
 gi|281403377|gb|EFB34057.1| putative large secreted protein [Prevotella copri DSM 18205]
          Length = 807

 Score =  318 bits (815), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 196/600 (32%), Positives = 316/600 (52%), Gaps = 63/600 (10%)

Query: 2   MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           M A+++ T N   + +N PA+ + +A+PIGN  LG MV+GG   E ++LNE+T W+G P 
Sbjct: 21  MMAKTSCTDNSTLLWYNAPAQQWLEALPIGNSHLGGMVYGGTTDENIQLNEETFWSGGPH 80

Query: 62  DYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE 121
           +  +  + + L  VR L+ +G+  EA A   + F       + L    L     +   AE
Sbjct: 81  NNNSKKSLENLPKVRELIFNGREEEAAALINQTFIPGPHGMRFLPMANLHITMKNQGKAE 140

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
           + + R LDL  A A   + +  V +TR  F+S  D VIV  I  S  G+L+ +V+LDS  
Sbjct: 141 Q-FVRNLDLKRAIATTSFVMDGVRYTRTTFASLADGVIVCHIKASRKGALNIDVTLDS-- 197

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK- 240
                                  P +      P G+    +L++K  D  G  +AL  + 
Sbjct: 198 -----------------------PFEHQTQKMPSGV----MLKVKGQDQEGIKAALTAEC 230

Query: 241 --KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
              ++ +G++  +++  A++     F+N  D   +    +   +  ++ +SY+ L  RH+
Sbjct: 231 VADVRKDGTEATIIVSAATN-----FVNYHDVSGNAAQRNADYINKVKLMSYAQLEKRHV 285

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           + YQK F   S+ L   P DI           ++P+ +R++ F   +D ++V L++ +GR
Sbjct: 286 EAYQKQFATSSLIL---PTDINA---------SLPTNQRLEKFAGSKDMAMVALMYNYGR 333

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSS+PG Q ANLQG+WN+  +  WDS   +NIN EMNYW +   NL    EPL+  
Sbjct: 334 YLLISSSQPGGQAANLQGVWNDSKNAPWDSKYTININTEMNYWPAEVTNLGNTTEPLYSL 393

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +  LS+ G++TA+  Y   GW+ HH TDIW  +    G   W ++P GGAWL THLW+HY
Sbjct: 394 IKDLSVTGAQTAREMYGCRGWMAHHNTDIWRIAGPVDG-AQWGMFPNGGAWLTTHLWQHY 452

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
            YT D+ FL K+ YP+++G A F LD++  + G +  +   PS SPE     P GK   V
Sbjct: 453 LYTGDKAFL-KQWYPVIKGAAEFYLDYMQKLPGTEWKVSV-PSVSPEQ---GPKGKRTAV 507

Query: 537 SYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +   TMD  I  +  ++ + A+E+L  ++ E   +++++  +P   P +I + G + EW+
Sbjct: 508 TAGCTMDNQIAFDALTSAVKASEILGVDEAERKDMQQLVSQIP---PMQIGKYGQLQEWL 564


>gi|225011898|ref|ZP_03702336.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
 gi|225004401|gb|EEG42373.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
          Length = 792

 Score =  318 bits (815), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 206/590 (34%), Positives = 309/590 (52%), Gaps = 44/590 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--YTNPDAP-KALS 73
           +   A  + +A+P+GNGRLG MV+G    E ++LN+D+LW   P D  + NP+   + L 
Sbjct: 40  YEQAASEWEEALPLGNGRLGVMVFGNPTKEHIQLNDDSLW---PKDIEWGNPEGTFEDLK 96

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
            +R+L+  G   +     ++ F     V  +Q LGD+ +  D   +      Y+R L+LN
Sbjct: 97  QIRNLLIDGDIEKTDHLLIEKFSRKTVVRSHQTLGDLHIRLDHDSIS----DYKRSLNLN 152

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSE----SGSLSFNVSLDSLLDNHSYV 187
            ATA V Y           F S+P Q IV  I        +GS+  +  +D      S +
Sbjct: 153 KATAYVNYKTEGYPVKESVFVSHPHQAIVVIIESEHPKGINGSIQLSRPMDEGFPTVSVL 212

Query: 188 NGNN-QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           + NN +IIM G    +     +      +G+ F  IL  K S + G+I++ E+K L+++G
Sbjct: 213 SRNNSEIIMTGEVTQRGGKFDSKTLPILEGVSFETIL--KTSHEGGSIASNENK-LELKG 269

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
              AVL +V++SSF           ++ TS++      I   S SD+  +H+ D+Q  + 
Sbjct: 270 VRKAVLYIVSNSSF---------YHENYTSQNQKNFAVIEKTSLSDIEEQHIRDHQNYYE 320

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSS 365
           R+         +I T   S+     +P+ +R+++ +  + D  L ELLF FGRYLLI+SS
Sbjct: 321 RIDF-------NIETKNISQ----LIPTDKRIEAVKKGNVDLELQELLFHFGRYLLIASS 369

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           R GT  ANLQG+WN+ +S  W++  H+NINL+MNYW +    L E   PLFD++  L IN
Sbjct: 370 REGTLPANLQGLWNQHISAPWNADYHLNINLQMNYWLANVTQLDELNNPLFDYVDRLLIN 429

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G KTAQ N+ A G  + H TDIWA +        W      G W+  H W H+ YT D +
Sbjct: 430 GKKTAQENFGARGSFLPHATDIWAPTWLRAPTAYWGASFGAGGWMVQHYWNHFEYTQDYN 489

Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           FL  RA+P +E  A F  DWLIE   DG L + PSTSPE+ +I   G        S MD 
Sbjct: 490 FLRNRAFPAIEEVAKFYSDWLIEDPRDGSLISAPSTSPENRYINDQGVAVSSCLGSAMDQ 549

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI-AEDGSIMEW 593
            +I+EVF+  + A  +L  + +  ++K+ K L +LRP  +   DG I+EW
Sbjct: 550 QVIKEVFTNYLKAVRLLNIDNE-WIQKIEKQLKQLRPGFVLGSDGRILEW 598


>gi|333377780|ref|ZP_08469513.1| hypothetical protein HMPREF9456_01108 [Dysgonomonas mossii DSM
           22836]
 gi|332883800|gb|EGK04080.1| hypothetical protein HMPREF9456_01108 [Dysgonomonas mossii DSM
           22836]
          Length = 788

 Score =  318 bits (815), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 194/595 (32%), Positives = 323/595 (54%), Gaps = 49/595 (8%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           N  +  F+ P+  + ++IP+GNGR+G M WGGV  E + LNE +LW+G   D  NP+A K
Sbjct: 25  NEWQYYFDKPSSIWEESIPLGNGRIGMMPWGGVERERVVLNEISLWSGNKQDADNPEAYK 84

Query: 71  ALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEFDDSHLKYAEE 122
            L ++R L+   +  EA     K F        G     +Q+  ++ ++F       A +
Sbjct: 85  YLGEIRRLLFEKKNKEAQELMYKTFTCKGKGSAGLEYGKFQIFANLYVDFLYPDKSEATQ 144

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            Y+R LD+N A + V +S  +VE+ RE+F+S  + + + K + S+S +LS  +SL    +
Sbjct: 145 -YKRVLDMNNALSTVSFSKNDVEYKREYFTSFSNDIGLVKYTASKSEALSLKISLQRDEN 203

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
             +Y +GN   I            +  A ++  G+++  +  +K+ +  G +SA  DK +
Sbjct: 204 FKTYASGNTLYIF----------GQLEAGENHSGMKYLGM--VKVINKGGKLSA-TDKVI 250

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            ++ ++   L +  +++++G              +  S L +   ++Y  L  +H+  YQ
Sbjct: 251 DIKNANEVTLYVSLATNYNGT----------NHEKVASDLLNNAGVNYEKLKKKHIAKYQ 300

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLL 361
            LF+RV + L ++    +        ID     +R+++F TD+ D +L  L  Q+GRYLL
Sbjct: 301 ALFNRVDLTLEKNKNSSLA-------ID-----KRLEAFATDKTDYNLAALYMQYGRYLL 348

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           ISS+R G    NLQG+W   ++  W++  H+NINL+MN W +   NLSE  +P  +F+  
Sbjct: 349 ISSTREGGLPPNLQGLWAPQINTPWNADYHLNINLQMNLWGAEMFNLSELHKPTIEFVKS 408

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           L   G KTA++ Y + GWV+H  +++W  +S       W      GAW+C HLWEHY YT
Sbjct: 409 LVEPGEKTAKIYYNSRGWVVHILSNVWGFTSPGE-HPSWGATNTAGAWMCQHLWEHYLYT 467

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
            D+++L K  YP ++  A F  D LIE  ++GYL T P+TSPE+ +I P G +  +   S
Sbjct: 468 QDKEYL-KSVYPTMKSAALFFEDMLIEDPNNGYLVTAPTTSPENAYITPSGDVVSICAGS 526

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            MD  IIRE+F+ + +AA++LE + +  ++ +     RL PT I + G +MEW++
Sbjct: 527 AMDNQIIRELFTNVENAAKILEVDNE-WIKDISAKKERLAPTSIGKYGQVMEWLE 580


>gi|312621676|ref|YP_004023289.1| alpha-L-fucosidase [Caldicellulosiruptor kronotskyensis 2002]
 gi|312202143|gb|ADQ45470.1| Alpha-L-fucosidase [Caldicellulosiruptor kronotskyensis 2002]
          Length = 786

 Score =  318 bits (814), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 209/616 (33%), Positives = 309/616 (50%), Gaps = 71/616 (11%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           +P  ++F  PA  + +A+P+GNGRLGAMV+G    E ++LN+D+LW+G   D  NP   +
Sbjct: 3   HPYHLSFYKPASTWYEALPLGNGRLGAMVYGHTAVERIQLNDDSLWSGTFIDRNNPSLKE 62

Query: 71  ALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAE------ 121
            L ++R LV  G    A    ++ + G PA +  Y  LG++++  +  HL +A       
Sbjct: 63  KLPEIRRLVLVGDLYHAEELIMQYMVGTPASMRHYTTLGELDIALN-QHLPFATGWIPNS 121

Query: 122 ---ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
              E Y  +LDL      + +    V + RE F S P QV+  +    + G+++ ++ LD
Sbjct: 122 NGCEDYYCDLDLMNGILSITHRQAGVRYCREMFVSYPAQVMCIRFVSEKPGTINMDIMLD 181

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIP---PKANANDDPKGIQFSAILEIKISDDRGTIS 235
             + +       ++ + + R PG+R+    P  N       + F   ++ +    RG  S
Sbjct: 182 RTVIS-------DETVPDERRPGQRVRRGWPTVN-------VDFIRTMDERTILMRGNES 227

Query: 236 ALE---------DKKLKVEGSDW------AVLLLVASSSFDGPFINPSDSKKDPTSESMS 280
            +E         D KL+   S         V+L +ASS+        ++  +DP SE   
Sbjct: 228 GVEFATAVRVVCDGKLQNPVSQLLARNCGEVILYLASST--------TNRSEDPVSEVFR 279

Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
            L +     Y  L   H++D+  L  R  + L  SP                P+ ER+ +
Sbjct: 280 LLDAAEKKGYVALREEHINDFSNLMWRCVLDLGPSPDK--------------PTDERIAA 325

Query: 341 FQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
            +  D DP+L  L FQ GRYL++S SR G+   NLQGIWN D  P WDS   +NINL+MN
Sbjct: 326 LRAGDNDPALAALYFQLGRYLIVSGSREGSAPLNLQGIWNADFMPIWDSKYTLNINLQMN 385

Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV 459
           YW    CNLSE   PL + L  +   G +TA+V Y   G V HH TD +   +     + 
Sbjct: 386 YWPVEICNLSELHMPLMELLGKMHEKGRETARVMYGMRGMVCHHNTDFYGDCAPQDRYMA 445

Query: 460 WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPS 519
              W +GGAWL  H+WEHY +T D +FL +  YP+L   A F  D+LIE  DG L T PS
Sbjct: 446 ATPWVIGGAWLGLHVWEHYLFTKDLNFL-REMYPILRDIAMFYEDFLIE-VDGKLVTCPS 503

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPE+ +I PDG    +  S  MD  I+RE+F+A I AA +L  +++ L EK L+   RL
Sbjct: 504 VSPENRYILPDGYDTPMCVSPAMDNQILRELFAACIEAANLLGVDQE-LTEKWLEISQRL 562

Query: 580 RPTKIAEDGSIMEWVQ 595
              KI   G ++EW Q
Sbjct: 563 PKDKIGSKGQLLEWDQ 578


>gi|29348564|ref|NP_812067.1| hypothetical protein BT_3155 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340469|gb|AAO78261.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 808

 Score =  318 bits (814), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 201/598 (33%), Positives = 313/598 (52%), Gaps = 43/598 (7%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-P 66
           +T N +K+ ++ PA  +  ++P+GNGRLG M++GG+ +ETL LNE T+W+G   ++   P
Sbjct: 24  ATENKMKLWYDKPADEWMKSLPLGNGRLGVMIYGGIETETLALNESTMWSGEYDEHQQRP 83

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEET 123
              + L+ VR L      +E    +  +     H    +  +GD+++ F  S+ +     
Sbjct: 84  FGREKLNQVRKLFFENNLSEGNHVAGNMMAGSPHSVGTHLPIGDLKINF--SYPQGEISD 141

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YR ELDL+TA   V Y VGN E+ R+  +SNPD V+   I  S   +++  + L  LL  
Sbjct: 142 YRHELDLHTAINTVSYKVGNTEYIRQCIASNPDDVVAMHIKASRPKAITMELEL-KLLRQ 200

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
            + V   NQ+I  G    ++            G+ F   + ++I    GTI A E KKL 
Sbjct: 201 ANVVASGNQLIYTGNAEFEK--------HGKGGVHFEGRIAVQIKG--GTIKA-EGKKLY 249

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           +E +    LL    S     F N + S  +   +    ++      +  L  +H++DY  
Sbjct: 250 IEKATEVTLL----SDVRTNFKNNTFSGYNYKIKCEKTIELASKKDFKTLKKKHIEDYSP 305

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLI 362
           LF RV +      K            D +P+ ER    +  E DP L  L FQ+ RYLLI
Sbjct: 306 LFSRVGLSFEHHAK-----------FDHLPNDERWARVKKGESDPGLDALFFQYARYLLI 354

Query: 363 SSSRPGTQV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           +SSRP + +   LQG +N++L+    W +  H++IN E NYW +   NL+EC  PLFD++
Sbjct: 355 ASSRPNSPLPVALQGFFNDNLACHMGWTNDYHLDINTEQNYWIANVGNLAECHLPLFDYI 414

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
             LSI+G+KTA+  Y   GW  H   + W  ++   G ++W L+P   +WL +HLW  Y+
Sbjct: 415 KDLSIHGAKTAKDLYGCKGWTAHTTANPWGYTAVS-GSILWGLFPTASSWLASHLWTQYD 473

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
           YT D+DFL+  AYPLL+  A FLLD++ I+  + YL T PS SPE+ F    G+  C S 
Sbjct: 474 YTQDKDFLKNTAYPLLKSNAEFLLDYMVIDPRNNYLVTGPSISPENSF-RHQGQEFCASM 532

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             T D  +  E+FSA + + E+L  N DA   + +  ++ +L P +I+ +G + EW +
Sbjct: 533 MPTCDRVLAYEIFSACLQSTEIL--NVDASFADSLRTAISKLPPFRISTNGGVQEWFE 588


>gi|380693852|ref|ZP_09858711.1| alpha-L-fucosidase [Bacteroides faecis MAJ27]
          Length = 772

 Score =  318 bits (814), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 199/566 (35%), Positives = 293/566 (51%), Gaps = 46/566 (8%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEAT-AASVKL-- 94
           MV+G   +E ++LNE+T+  G P    N +A +AL  +R L+  G YAEA   A  K+  
Sbjct: 1   MVYGDPVNEEIQLNEETVSAGSPYKNYNSEAKEALPAIRKLIFDGNYAEAQLMAGEKILS 60

Query: 95  ---FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHF 151
              FG P   YQ +G + L F           YRRELD++ A A   Y V  VE+ RE F
Sbjct: 61  KNGFGMP---YQTVGSLRLHFQGQE---NHTDYRRELDIDKALAITTYRVNGVEYKRETF 114

Query: 152 SSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN 211
           +S  DQ+++ +++ S+ G L+F  +L         V+G N I M G   G +    A   
Sbjct: 115 TSFTDQLVIVRLTASKPGMLTFTAALTCPQAVEVSVSGKNTIKMSGITKGDQFTEGA--- 171

Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSK 271
                I+F+A L++++   +G  S  +D  L V  +D AVL +  +++F    +N  D  
Sbjct: 172 -----IRFAADLKLEL---QGGKSIAQDSVLSVSNADSAVLYIAMATNF----VNYKDIS 219

Query: 272 KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKDIVTDTCSEENID 330
            D    +   L++    +YS     H+  YQK +HRVS+ L   S  D  TD        
Sbjct: 220 ADAVKRNQVYLRNAGK-NYSKALQEHIAAYQKYYHRVSLDLGYTSQADKPTDV------- 271

Query: 331 TVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAP 390
                 RVK F   +DP L+ L FQ+GRYLLISSS+PG Q ANLQGIWN+ L+P W    
Sbjct: 272 ------RVKEFAVSDDPQLISLYFQYGRYLLISSSQPGRQPANLQGIWNDKLNPVWKCRY 325

Query: 391 HVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAK 450
             N+N EMNYW +   NLSE  EP    +  L  NG + A+  Y   GWV+HH TD+W  
Sbjct: 326 TTNVNAEMNYWPAEVTNLSEMHEPFLQMIRELYENGQEAAREMYGCRGWVLHHNTDLWRM 385

Query: 451 SSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EG 509
           + A   K     WP   AWLC HLWE Y Y+ D+DFL    YP+++  + F +D+L+ + 
Sbjct: 386 NGA-VDKAYCGTWPTCNAWLCHHLWERYLYSGDKDFLAS-VYPIMKSASEFFVDFLVRDP 443

Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALV 569
           + GY+   PS SPE+      GK A +    TMD  ++ ++F+   +AA +L   ++   
Sbjct: 444 NTGYMVVTPSNSPENAPRQWKGK-ANLFAGITMDNQLVFDLFTNTEAAAHILNGKDEQFC 502

Query: 570 EKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           + +     +L P ++ + G + EW +
Sbjct: 503 DTIRSLKKQLPPMQVGQYGQLQEWFE 528


>gi|298386944|ref|ZP_06996498.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
 gi|298260094|gb|EFI02964.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
          Length = 809

 Score =  317 bits (813), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 201/600 (33%), Positives = 322/600 (53%), Gaps = 43/600 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           S +TT+ +K+ ++ PA  +  ++P+GNGRLGAMV+GGV +ET+ LNE T+W+G   ++  
Sbjct: 24  SEATTDNMKLWYDKPADEWMKSLPLGNGRLGAMVYGGVETETIGLNESTMWSGEYDEHQQ 83

Query: 66  -PDAPKALSDVRSLVDSGQYAEAT-AASVKLFG--HPADVYQLLGDIELEFDDSHLKYAE 121
            P   + L ++R L   G  AE    A   + G  H A  +  +GD++L F     + ++
Sbjct: 84  RPLGREKLDEIRKLFFEGNLAEGNHIAGNTMAGSPHSAGTHLPIGDLKLNFTYPEGELSD 143

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             Y  ELDL+TA   V Y +G+ E+TR+  +SNPD VI   I+ S   +++  + L+ LL
Sbjct: 144 --YHHELDLSTAVNTVTYKIGDTEYTRQSIASNPDDVIAMYITASRPEAITMELELN-LL 200

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
            N   +   NQ+I  G    ++            G+ F   + ++I    GTI A + KK
Sbjct: 201 RNAEVIASGNQLIYTGNAEFEK--------HGRGGVLFEGRIAVEIKG--GTIKA-DGKK 249

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           L ++ +    LL    S     + N + +  D   +    +++    S+  L   H++DY
Sbjct: 250 LLIDKATEVTLL----SDVRTNYKNTTFAGYDYKQKCKETIEAASKKSFKTLRNIHVEDY 305

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYL 360
             LF RV++    + K           +  +P+ +R    +  E DP L  L FQ+ RYL
Sbjct: 306 APLFSRVALSFGDNGK-----------LSHLPNDQRWARVKAGESDPGLDALFFQYARYL 354

Query: 361 LISSSRPGTQV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           LI+SSRP + +   LQG +N++L+    W +  H++IN E NYW +   NL EC  PLFD
Sbjct: 355 LIASSRPNSPLPVALQGFFNDNLACHMGWTNDYHLDINTEQNYWIANVGNLPECHLPLFD 414

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           ++  LS++GSK AQ  Y   GW  H  ++ W  ++   G ++W L+P   +WL +H+W  
Sbjct: 415 YIKDLSVHGSKIAQDLYGCKGWTAHTTSNPWGYTAVS-GSILWGLFPTASSWLTSHVWTQ 473

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           Y YT D+ FL++ AYPLL+  A FLLD++ I+  + YL T PS SPE+ F    G+  C 
Sbjct: 474 YEYTQDKKFLQETAYPLLKSNAEFLLDYMVIDPRNNYLVTGPSISPENSF-HYQGQEFCA 532

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S   T D  +  E+FSA + + E+L  N DA   + +  ++ +L P +I+ +G + EW +
Sbjct: 533 SMMPTCDRVLAYEIFSACLQSTEIL--NVDASFADSLRTAISQLPPFRISANGGVQEWFE 590


>gi|429860996|gb|ELA35710.1| alpha-l-fucosidase [Colletotrichum gloeosporioides Nara gc5]
          Length = 776

 Score =  317 bits (812), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 209/599 (34%), Positives = 304/599 (50%), Gaps = 47/599 (7%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           + +S     PL + +  PA  +++A+PIGNGRLGAMV G   +E L+LNED++W G P D
Sbjct: 12  SGQSQQQPRPLLLHYESPASEWSEALPIGNGRLGAMVHGRTQTELLQLNEDSVWYGGPQD 71

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKY 119
            T  DA + L  +R L+   ++AEA +      F  PA +  Y+ LG   +EF   H+  
Sbjct: 72  RTPKDALRHLPKLRQLIRDEEHAEAESLVREAFFATPASMRHYEPLGTCTIEF--GHVVE 129

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
               YRR L L TA   V+Y    V + R+  +S PD V+  ++  SE+    F V L+ 
Sbjct: 130 DVTDYRRYLCLETAQTTVEYRCRGVSYRRDAIASFPDNVLAFRVVASEA--TRFVVRLNR 187

Query: 180 LLDNHSYVNGNNQII--MEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
           L +     N     I    GR   K  P   N+N      + +  L +   D  G++ A+
Sbjct: 188 LSEIEYETNEFLDSIDATNGRIVLKATPGGHNSN------RLAIALGVSCDDAEGSVEAI 241

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            +    +  S    +++ A ++F           +DP + ++  +    +  +SDL  RH
Sbjct: 242 GNAL--IVNSTSCTIVIGAQTTF---------RTEDPEAAAVDDVLKALSHQWSDLVERH 290

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
             DY  LF+R S+++S        D C       +P+ ER+K+     DP LV L   +G
Sbjct: 291 QQDYAGLFNRTSLRMS-------PDACH------LPTDERIKN---SRDPGLVALYHNYG 334

Query: 358 RYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           RYLLIS SR   +   A LQGIWN   +P W S   +NINL+MNYW + PC+L EC  P+
Sbjct: 335 RYLLISCSRNSKKALPATLQGIWNPSFAPPWGSKYTININLQMNYWPAGPCSLIECAIPV 394

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
              L  ++  G KTA+V Y   GW   H TDIWA +      +   +WP+GG W+C  ++
Sbjct: 395 LGLLEKMAERGKKTARVMYGCEGWCARHNTDIWADTDPHDRWMPSTIWPLGGVWVCIDIF 454

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLA 534
           E   Y  D + L KRA  +LEG   FLL++LI    G YL TNPS SPE+ F++  G+  
Sbjct: 455 EMLQYQYDEN-LHKRAAVVLEGAIMFLLEYLIPSACGRYLVTNPSLSPENTFLSVSGEPG 513

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            +   S +DM II   F   + +  +L   E+ L  KV ++L RL P  I  DG I EW
Sbjct: 514 ILCEGSVIDMTIIHIAFEKFLWSTNIL-GGENPLRAKVEEALERLPPLVINSDGLIQEW 571


>gi|302413419|ref|XP_003004542.1| alpha-L-fucosidase [Verticillium albo-atrum VaMs.102]
 gi|261357118|gb|EEY19546.1| alpha-L-fucosidase [Verticillium albo-atrum VaMs.102]
          Length = 765

 Score =  317 bits (812), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 207/597 (34%), Positives = 300/597 (50%), Gaps = 61/597 (10%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + ++ PA  +++A+P+GNGRLG MV+G   +E L+LNED++W G P D T  DA + L
Sbjct: 8   LTLHYDAPAASWSEALPVGNGRLGGMVYGRTSTELLQLNEDSVWYGGPQDRTPRDARRHL 67

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVY--QLLGDIELEFDDSHLKYAEETYRRELD 129
             +R L+   ++A A A      F  PA +   + LG+  LEF   H       YRR LD
Sbjct: 68  DTLRQLIRDEEHAAAEALVREAFFATPASMRHSEPLGNCTLEF--GHEAQDVTGYRRSLD 125

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--------LDSLL 181
           L TA A V+Y    V + RE  +S PD V+  + S SE       ++         +  L
Sbjct: 126 LATAQATVEYQCRGVSYRRETIASFPDNVVALRFSASEPTRFVVRLNRVSEIEWETNEFL 185

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALED 239
           D+    NG  +I++     GK        N +P     S +L I    SDD G+I A+ +
Sbjct: 186 DSIQAANG--RIVLNATPGGK--------NSNP----LSLVLGISCDASDDGGSIEAIGN 231

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
             +    S    L++ A ++F            DP + +   + +    S+ +L  R   
Sbjct: 232 ALVVKAFS--CTLVIAAHTAF---------RNADPEAAARQDVDNALKRSWHELVLRQRT 280

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DY  LF R S+++  +  D+             P+ ER+   + + DP LV L + +GRY
Sbjct: 281 DYASLFQRSSLRMWPAAHDL-------------PTNERI---EKNRDPGLVALYYNYGRY 324

Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           LLISSSR   +   A LQGIWN   +P W     +NINL+MNYW + P NL EC  P+  
Sbjct: 325 LLISSSRDSDKALPATLQGIWNPSFAPPWGCKYTININLQMNYWLAAPGNLVECALPMLG 384

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            +  +++ G+KTA++ Y   GW  HH TDIWA +      +   +WP+GG WLC  + E 
Sbjct: 385 LVERMAVRGAKTARIMYDCGGWCAHHNTDIWADTDPQDRWMPSTIWPLGGVWLCIDVLEM 444

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
             Y  DR  L +RA  LLEGC  FLLD+LI      +L TNPS SPE+ F++  G    +
Sbjct: 445 LLYHYDRK-LHERAAVLLEGCIVFLLDFLIPSACRTFLVTNPSLSPENTFVSKSGDTGIL 503

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
              S +D  I+R  F   + +  +LEK  + LV KV  ++ RL    I  DG I EW
Sbjct: 504 CEGSAIDTTIVRIAFEKFLWSTAILEKG-NPLVPKVRDAMARLPDLTINNDGLIQEW 559


>gi|329957629|ref|ZP_08298104.1| hypothetical protein HMPREF9445_02986 [Bacteroides clarus YIT
           12056]
 gi|328522506|gb|EGF49615.1| hypothetical protein HMPREF9445_02986 [Bacteroides clarus YIT
           12056]
          Length = 827

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 194/604 (32%), Positives = 320/604 (52%), Gaps = 48/604 (7%)

Query: 2   MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           ++ + T+  N LK+ ++ PAK + +A+P+GNGR+GAMV+G    E  +LNE+T+W G P 
Sbjct: 15  ISGKITAHDNSLKLWYDKPAKQWVEALPLGNGRIGAMVFGDPAHERFQLNEETVWGGSPH 74

Query: 62  DYTNPDAPKALSDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDS 115
           + TNP+A +AL  +R L+  G+  EA         S    G P   YQ +G + L+F+  
Sbjct: 75  NNTNPNAKEALPRIRRLIFEGKNKEAQELCGPAICSQSANGMP---YQTVGTLHLDFEGI 131

Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
           +     + + R+LD+  A A  +++   + + RE F+S PD++++ K++ S+  S+SF  
Sbjct: 132 N---QYDDFYRDLDIEKAIATTRFTANGITYIREAFTSFPDRLLIIKLTASKKKSISFTA 188

Query: 176 SLDS-LLDNHSY-VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRG 232
              +   +N  + ++   ++ + G         KAN ++  +G I+F+A+   +I ++ G
Sbjct: 189 HYTTPYTENTEFCISPRKELQLNG---------KANDHEGIEGKIRFTAL--TRIDNNGG 237

Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
           T+    D  L+V+ +D +V L V   S    FIN  D   D    +   ++     +Y+ 
Sbjct: 238 TLKVTSDSTLQVKNAD-SVTLYV---SIGTNFINYKDVSGDALKAARQYMKQAGK-NYTK 292

Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
               H+  YQ+ F+RVS+ L            S + I   P+  RV+ F +  DP +  L
Sbjct: 293 RKEAHIAAYQQYFNRVSLDLG-----------SNDQIKK-PTDRRVREFSSVTDPQMAAL 340

Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
            FQFGRYLLI SS+PG Q ANLQGIWN  L   WD     +IN+EMNYW +    LSE  
Sbjct: 341 YFQFGRYLLICSSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAETTALSEMH 400

Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
           EP    +  ++I G ++A + Y   GW +HH TDIW  + A  G   + +WP   AW C 
Sbjct: 401 EPFLQLVKEVAIQGRESASM-YSCRGWTLHHNTDIWRTTGAVDG-AKYGVWPTCNAWFCQ 458

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
           HLW+ Y ++ D+++L +  YP++ G   F LD+L+ E  + +L   PS SPE+       
Sbjct: 459 HLWDRYLFSGDKNYLAE-VYPIMRGACEFYLDFLVREPKNNWLVVAPSYSPENSPSVNGK 517

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
           +   +   +TMD  ++ ++F   I AA ++ +N  A  + +      L P ++   G + 
Sbjct: 518 RGFVIVAGTTMDNQMVYDLFYNTIQAANLMNENT-AFTDSLQTVANHLAPMQVGRWGQLQ 576

Query: 592 EWVQ 595
           EW++
Sbjct: 577 EWME 580


>gi|154503234|ref|ZP_02040294.1| hypothetical protein RUMGNA_01058 [Ruminococcus gnavus ATCC 29149]
 gi|153796228|gb|EDN78648.1| hypothetical protein RUMGNA_01058 [Ruminococcus gnavus ATCC 29149]
          Length = 784

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 198/614 (32%), Positives = 302/614 (49%), Gaps = 73/614 (11%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           I F   A+ +  A PIGNG LGAMV+G V  E +++NED++W+G   +  NPDA + L  
Sbjct: 20  IYFRKEAEEWNQAFPIGNGFLGAMVFGNVAKERIQVNEDSVWSGGFSNRVNPDAGRYLEK 79

Query: 75  VRSLVDSG--QYAEATAASVKLFGHP-ADVYQLLGDIELEF-----------DDSHLKYA 120
           +R  +  G  Q AE  A        P   VYQ LGDI + F           D+S L Y 
Sbjct: 80  IRECLFEGNVQEAEKLAEQSMYATSPNMRVYQPLGDIWIRFMDQEAERKLARDESGLPYL 139

Query: 121 EET------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
           +E+      Y+R L+L  A  +++Y VG  ++ RE F+SNP +V +  I       ++  
Sbjct: 140 KESAAEVEAYQRILNLEQAVGKIEYCVGRTKWNREFFASNPAKVAMYSICAESGEDINLE 199

Query: 175 VSLDSLLDNHS----------YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
           +S  +  DN S              N  I +EG   G+            +GI F+  + 
Sbjct: 200 ISA-TRKDNRSGRGVSFCDRILAEENQYIWLEGSSGGR------------EGIGFA--MG 244

Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
           +++    G    +   ++ VE +   ++     ++F            +P       L S
Sbjct: 245 VRVCSCGGRQYQM-GSRIIVEKARKVLICFTGRTTF---------RSAEPKQWCREHLAS 294

Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QT 343
           +   +Y++    H+ DYQ  F+   +   +           E N+D + + ER+K   + 
Sbjct: 295 LSLDTYAERKREHIQDYQTYFNASRLTFRQ-----------EMNLDNLTTPERLKRIREG 343

Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
             D  LV L + F RYLLISSSR G+  ANLQGIWNE+  P W S   +NIN++MNYW +
Sbjct: 344 HHDIGLVNLYYDFARYLLISSSREGSLPANLQGIWNEEFEPMWGSKYTININIQMNYWMA 403

Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
               L     PL + L  +   G + A   Y   G+  HH TDIW   +         +W
Sbjct: 404 EKTGLQALHLPLLEHLKRMHPRGKEVAASMYHVEGFCCHHNTDIWGDCAPQDYHTSSTIW 463

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
           PMGGAWLC H++EHY YT D+ FLE+  +P+L+    F ++++++  DG   T PS+SPE
Sbjct: 464 PMGGAWLCLHIYEHYQYTKDKGFLEE-YFPILKDSVQFFMNYMVQNSDGKWVTGPSSSPE 522

Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRP 581
           + +I    +  C+    TMD+ I+RE+FS  +   E+LEK E    LV+  +++LP+L  
Sbjct: 523 NIYITAKNQYGCLCMGPTMDIEIVRELFSNYLKTVEILEKEEPLTGLVKDRIENLPKL-- 580

Query: 582 TKIAEDGSIMEWVQ 595
            K+ + G I EW Q
Sbjct: 581 -KVGKYGQIQEWDQ 593


>gi|336432957|ref|ZP_08612787.1| hypothetical protein HMPREF0991_01906 [Lachnospiraceae bacterium
           2_1_58FAA]
 gi|336017627|gb|EGN47385.1| hypothetical protein HMPREF0991_01906 [Lachnospiraceae bacterium
           2_1_58FAA]
          Length = 768

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 198/614 (32%), Positives = 302/614 (49%), Gaps = 73/614 (11%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           I F   A+ +  A PIGNG LGAMV+G V  E +++NED++W+G   +  NPDA + L  
Sbjct: 4   IYFRKEAEEWNQAFPIGNGFLGAMVFGNVAKERIQVNEDSVWSGGFSNRVNPDAGRYLEK 63

Query: 75  VRSLVDSG--QYAEATAASVKLFGHP-ADVYQLLGDIELEF-----------DDSHLKYA 120
           +R  +  G  Q AE  A        P   VYQ LGDI + F           D+S L Y 
Sbjct: 64  IRECLFEGNVQEAEKLAEQSMYATSPNMRVYQPLGDIWIRFMDQEAERKLARDESGLPYL 123

Query: 121 EET------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
           +E+      Y+R L+L  A  +++Y VG  ++ RE F+SNP +V +  I       ++  
Sbjct: 124 KESAAEVEAYQRILNLEQAVGKIEYCVGRTKWNREFFASNPAKVAMYSICAESGEDINLE 183

Query: 175 VSLDSLLDNHS----------YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
           +S  +  DN S              N  I +EG   G+            +GI F+  + 
Sbjct: 184 ISA-TRKDNRSGRGVSFCDRILAEENQYIWLEGSSGGR------------EGIGFA--MG 228

Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
           +++    G    +   ++ VE +   ++     ++F            +P       L S
Sbjct: 229 VRVCSCGGRQYQM-GSRIIVEKARKVLICFTGRTTF---------RSAEPKQWCREHLAS 278

Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QT 343
           +   +Y++    H+ DYQ  F+   +   +           E N+D + + ER+K   + 
Sbjct: 279 LSLDTYAERKREHIQDYQTYFNASRLTFRQ-----------EMNLDNLTTPERLKRIREG 327

Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
             D  LV L + F RYLLISSSR G+  ANLQGIWNE+  P W S   +NIN++MNYW +
Sbjct: 328 HHDIGLVNLYYDFARYLLISSSREGSLPANLQGIWNEEFEPMWGSKYTININIQMNYWMA 387

Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
               L     PL + L  +   G + A   Y   G+  HH TDIW   +         +W
Sbjct: 388 EKTGLQALHLPLLEHLKRMHPRGKEVAASMYHVEGFCCHHNTDIWGDCAPQDYHTSSTIW 447

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
           PMGGAWLC H++EHY YT D+ FLE+  +P+L+    F ++++++  DG   T PS+SPE
Sbjct: 448 PMGGAWLCLHIYEHYQYTKDKGFLEE-YFPILKDSVQFFMNYMVQNSDGKWVTGPSSSPE 506

Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRP 581
           + +I    +  C+    TMD+ I+RE+FS  +   E+LEK E    LV+  +++LP+L  
Sbjct: 507 NIYITAKNQYGCLCMGPTMDIEIVRELFSNYLKTVEILEKEEPLTGLVKDRIENLPKL-- 564

Query: 582 TKIAEDGSIMEWVQ 595
            K+ + G I EW Q
Sbjct: 565 -KVGKYGQIQEWDQ 577


>gi|383124735|ref|ZP_09945397.1| hypothetical protein BSIG_1516 [Bacteroides sp. 1_1_6]
 gi|251841110|gb|EES69191.1| hypothetical protein BSIG_1516 [Bacteroides sp. 1_1_6]
          Length = 808

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 200/598 (33%), Positives = 313/598 (52%), Gaps = 43/598 (7%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-P 66
           +T N +K+ ++ PA  +  ++P+GNGRLG +++GG+ +ETL LNE T+W+G   ++   P
Sbjct: 24  ATENKMKLWYDKPADEWMKSLPLGNGRLGVIIYGGIETETLALNESTMWSGEYDEHQQRP 83

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEET 123
              + L+ VR L      +E    +  +     H    +  +GD+++ F  S+ +     
Sbjct: 84  FGREKLNQVRKLFFENNLSEGNHVAGNMMAGSPHSVGTHLPIGDLKINF--SYPQGEISD 141

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           YR ELDL+TA   V Y VGN E+ R+  +SNPD V+   I  S   +++  + L  LL  
Sbjct: 142 YRHELDLHTAINTVSYKVGNTEYIRQCIASNPDDVVAMHIKASRPKAITMELEL-KLLRQ 200

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
            + V   NQ+I  G    ++            G+ F   + ++I    GTI A E KKL 
Sbjct: 201 ANVVASGNQLIYTGNAEFEK--------HGKGGVHFEGRIAVQIKG--GTIKA-EGKKLY 249

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           +E +    LL    S     F N + S  +   +    ++      +  L  +H++DY  
Sbjct: 250 IEKATEVTLL----SDVRTNFKNNTFSGYNYKIKCEKTIELASKKDFKTLKKKHIEDYSP 305

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLI 362
           LF RV +      K            D +P+ ER    +  E DP L  L FQ+ RYLLI
Sbjct: 306 LFSRVGLSFEHHAK-----------FDHLPNDERWARVKKGESDPGLDALFFQYARYLLI 354

Query: 363 SSSRPGTQV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           +SSRP + +   LQG +N++L+    W +  H++IN E NYW +   NL+EC  PLFD++
Sbjct: 355 ASSRPNSPLPVALQGFFNDNLACHMGWTNDYHLDINTEQNYWIANVGNLAECHLPLFDYI 414

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
             LSI+G+KTA+  Y   GW  H   + W  ++   G ++W L+P   +WL +HLW  Y+
Sbjct: 415 KDLSIHGAKTAKDLYGCKGWTAHTTANPWGYTAVS-GSILWGLFPTASSWLASHLWTQYD 473

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
           YT D+DFL+  AYPLL+  A FLLD++ I+  + YL T PS SPE+ F    G+  C S 
Sbjct: 474 YTQDKDFLKNTAYPLLKSNAEFLLDYMVIDPRNNYLVTGPSISPENSF-RHQGQEFCASM 532

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             T D  +  E+FSA + + E+L  N DA   + +  ++ +L P +I+ +G + EW +
Sbjct: 533 MPTCDRVLAYEIFSACLQSTEIL--NVDASFADSLRTAISQLPPFRISTNGGVQEWFE 588


>gi|393789783|ref|ZP_10377902.1| hypothetical protein HMPREF1068_04182 [Bacteroides nordii
           CL02T12C05]
 gi|392650186|gb|EIY43857.1| hypothetical protein HMPREF1068_04182 [Bacteroides nordii
           CL02T12C05]
          Length = 800

 Score =  315 bits (808), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 191/587 (32%), Positives = 317/587 (54%), Gaps = 50/587 (8%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP---KALSDVR 76
           PAK + +++PIGNGRLGAM +GG+  ETL LNE ++W+G   +  N D P     L ++R
Sbjct: 35  PAKEWMESLPIGNGRLGAMTYGGIEEETLALNESSMWSGQFNE--NQDKPFGRAKLDNLR 92

Query: 77  SLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            L   G+  E    A   L G       +  +GD++++F  ++ K     YRR L+LN A
Sbjct: 93  KLFFEGKLWEGNQTAGDNLNGMQTSFGTHLPIGDLKMKF--TYPKGDITGYRRSLNLNEA 150

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            + V ++ G V + RE+F++NPD V+V ++S  +  S++ +++LD L+   ++   NNQ+
Sbjct: 151 ISSVSFNAGGVNYKREYFATNPDNVLVLRLSADKPKSVTMDMALD-LMRQSAFTVENNQL 209

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
           I  G+      P        P G+ F     I +  D G +  +++  + V  +D   ++
Sbjct: 210 IFTGKV---DFPLHG-----PGGVNFEG--RIAVLADNGEVK-MDEAGISVSNADAVTMI 258

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
           +   + +  P         D  +   + ++      Y  L   H+ DY  LF+RV + L 
Sbjct: 259 VDVRTDYKSP---------DYKALCATTVEEAGMKPYEALKLMHIKDYSNLFNRVELSLG 309

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQV- 371
           +   D            T+P+  R K  ++ + D S   L FQ+GRYL I+SSR  + + 
Sbjct: 310 KDSND------------TIPTDIRWKQIRSGKTDTSFDALYFQYGRYLTIASSRENSPLP 357

Query: 372 ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
             LQG +N++ +    W +  H++IN + NYW S   NL+EC  PLF+++  LS++G+KT
Sbjct: 358 IALQGFFNDNQACNMGWTNDYHLDINTQQNYWVSNVGNLAECNTPLFNYIKDLSVHGAKT 417

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+V Y   GW  +   +IW  + A  G ++W L+P+ G+W+ THLW  Y YT D+ +L +
Sbjct: 418 AEVVYGCKGWTANTTANIWGYTPAS-GSIIWGLFPLAGSWIATHLWTQYEYTQDKKYLAE 476

Query: 490 RAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
            AYPLL+G A F+LD++ E   +GYL T PS SPE+ F   +G+    S   T D  ++ 
Sbjct: 477 VAYPLLKGNAEFILDYMTENPANGYLMTGPSISPENWFKTANGQEMVASMMPTCDRELVY 536

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           E+F++ I AA++L  ++ A    +  +L +L P ++  +G+I EW +
Sbjct: 537 EIFTSCIQAADILGIDK-AFSNNLQTALAKLPPIQLRANGAIREWFE 582


>gi|423346901|ref|ZP_17324588.1| hypothetical protein HMPREF1060_02260 [Parabacteroides merdae
           CL03T12C32]
 gi|409218562|gb|EKN11530.1| hypothetical protein HMPREF1060_02260 [Parabacteroides merdae
           CL03T12C32]
          Length = 809

 Score =  315 bits (808), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 205/609 (33%), Positives = 316/609 (51%), Gaps = 50/609 (8%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N     T  PL   F+ PA  +  + P+GNGRLG M  GGV +E + LNE ++W+G   D
Sbjct: 17  NMPEVKTGKPLSYHFDAPAGIWEASFPLGNGRLGLMPDGGVDTENIVLNEISMWSGSKQD 76

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIE 109
             NP A  +L  +R L+  G+  EA       F         G  A+V    YQLLG++ 
Sbjct: 77  TDNPQAYHSLGTIRKLLFEGRNDEAQELMYNTFVCKGEGSGQGQGANVPYGSYQLLGNLV 136

Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
           L +D      +   YRREL+L+ A A   +  G V + RE F+S  D + V  ++     
Sbjct: 137 LNYDYQGTSDSIFGYRRELNLDNAIATASFRRGKVTYNREVFTSFADDLGVIHLTADADR 196

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
           +L+F+  ++   +++      N ++M+G+ P            + KG+++++   +++  
Sbjct: 197 ALNFSFGMNRP-EHYKVTADGNDLLMQGQLP------DGVDTLEMKGLRYAS--RVRVIL 247

Query: 230 DRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
            +G      D  + V  +  A+LL+ +A+  FD          KD   +  S L +    
Sbjct: 248 PKGGNVTPGDSTVSVRNASEAILLVSMATDYFD----------KDLAGKVSSLLANAEKK 297

Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDP 347
            ++ L   H+  Y+ LF RV + L  S         S EN+   P  ER+ +F  + +DP
Sbjct: 298 DFASLKKGHIAAYRSLFGRVELDLGHS---------SRENL---PMDERLAAFHENPDDP 345

Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
           SL  L FQFGRYLLISS+R G    NLQG+W   ++  W+   H+NINL+MN+W +   N
Sbjct: 346 SLAALYFQFGRYLLISSTRVGLLPPNLQGLWCNTINTPWNGDYHLNINLQMNHWPAEVAN 405

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           LSE   PL ++      +G +TA+  Y A GWV H   ++W + +A      W       
Sbjct: 406 LSELHLPLVEWTKQQVASGERTAKAYYNARGWVTHILGNVW-EFTAPGEHPSWGATNTSA 464

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
           AWLC HL+ HY YT+D+++L K  YP+L+G + F +D L+E   + YL T P+TSPE+ +
Sbjct: 465 AWLCEHLYMHYLYTLDKEYL-KDVYPVLKGASLFFVDMLVEDPRNKYLVTAPTTSPENGY 523

Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
             P+GK A +   STMD  I+RE+F+  I AA++L   + A   ++     RL PT I +
Sbjct: 524 KLPNGKTAHICAGSTMDNQIVRELFTNTIEAADILGL-DSAFAGELAAKRARLMPTTIGK 582

Query: 587 DGSIMEWVQ 595
           DG IMEW++
Sbjct: 583 DGCIMEWLE 591


>gi|282878225|ref|ZP_06287021.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
 gi|281299643|gb|EFA92016.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
          Length = 793

 Score =  315 bits (808), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 204/597 (34%), Positives = 313/597 (52%), Gaps = 46/597 (7%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
           P+++ ++ PA++F +++PIGNGR+GA+V+GG     + LN+ TLWTG P D   + +A +
Sbjct: 23  PMQLWYDKPAQYFEESMPIGNGRMGALVYGGTRDNLIYLNDITLWTGQPVDPNLDQNAHQ 82

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            +  +R  +    Y +A +  +++ G  +  YQ L  + L  D    +     Y R LD+
Sbjct: 83  WIPAIREALFKEDYRKADSLQLRVQGPNSQYYQPLATLHL-LDPRGGQ--ATNYTRTLDI 139

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           + A     YS+  V+  RE+F+S+PD VI   I+ ++  S+S  V+L + +  HS     
Sbjct: 140 DKALLTDSYSLNGVKIKREYFASHPDSVICIHITANKPRSISLEVNLSAQIP-HSVKAAG 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           N I M+G   G          +    I F ++L  +    +G I A +   L ++ ++ A
Sbjct: 199 NLITMKGHAMG----------NPENSIHFCSVL--RAVTKQGKIQATDSTLLIIDATE-A 245

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            L  V  +SF+G   +P    K     +++  +++    Y  +  +H+ DY   + R+ +
Sbjct: 246 TLFFVNETSFNGFDKHPVRQGKPCEQLALAHQKALEKKDYQTIKKQHVADYTHYYDRMKL 305

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF--QTDEDPSLVELLFQFGRYLLISSSRPG 368
            L  S    VTD CS        + +++K +  Q   +P L  L  Q+GRYLLI+SSR  
Sbjct: 306 FLGGS----VTD-CSRT------TEQQLKDYTDQGGHNPYLETLYMQYGRYLLIASSRTK 354

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
              ANLQG+W+  L   W S   VNINLE NYW +   NL E  +PLF F+  L+ NG  
Sbjct: 355 GIPANLQGLWSHYLRAPWRSNYTVNINLEENYWLAEVANLGEMAKPLFTFMQALAANGRH 414

Query: 429 TAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           TA+  Y +  GW   H +D+WA ++     R    W+ W MGGAWL  +LWEHY +  D 
Sbjct: 415 TAKNYYGINRGWCSSHNSDVWAMTNPVGEKRESPEWSNWNMGGAWLTQNLWEHYRFNPDA 474

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
            FL   A PLLEG ++F+LDWL+E   +   L T PSTSPE+E+  P+G      Y  T 
Sbjct: 475 QFLNDTALPLLEGASAFMLDWLVENPKNPSELITAPSTSPENEYKTPEGYHGTTCYGGTA 534

Query: 543 DMAIIREVFSAIISAAEVLEKN------EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           D+AIIRE+F   I+ AE + K       +  L++ +  SL RL P  I   G + EW
Sbjct: 535 DLAIIRELF---INTAEAINKKGADYARQSQLLKDIEASLKRLHPYTIGHLGDLNEW 588


>gi|410096023|ref|ZP_11291014.1| hypothetical protein HMPREF1076_00192 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409227429|gb|EKN20327.1| hypothetical protein HMPREF1076_00192 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 821

 Score =  315 bits (808), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 199/602 (33%), Positives = 317/602 (52%), Gaps = 48/602 (7%)

Query: 9   TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
           T N +   F+ PA+ + + +P+GNGRLG M  GG+  E + LNE ++W+G   D  NP A
Sbjct: 35  TANKIAYHFDEPARIWEETLPLGNGRLGMMPDGGINKENILLNEISMWSGSKQDTDNPQA 94

Query: 69  PKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDS 115
             +L+++R L+  G+  EA     + F               P   YQLLG++ L++   
Sbjct: 95  VWSLANIRRLLFEGKNDEAQDLMYRTFVCKGAGSGQGQGANVPYGSYQLLGNLVLDYVYV 154

Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
               +   YRREL+LN A A   +  G V ++RE F+S    + V  +      +L+F V
Sbjct: 155 DGSDSVAAYRRELNLNDAIASTSFRKGKVNYSRESFTSFSGDLGVVHLMADADKALNFTV 214

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
            ++        V+G + ++M+G+ P            + KGI++ A + + +      IS
Sbjct: 215 GMNRPEHYALSVDGKD-LLMKGQLP------DGVDTLEMKGIKYGARVRVLLPKGGSLIS 267

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
              D  L V+ +  A+LL+  ++++       ++  +D   +  S L       YS L  
Sbjct: 268 G--DSSLTVQNASEAILLVSMATNYK------NEGFED---QLFSLLAESERKDYSTLRK 316

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLF 354
            H++ Y+ LF RV + L RS +D             +P  ER+ +FQ D+ DPSL  L F
Sbjct: 317 EHVNAYRSLFDRVDLDLGRSARD------------EMPINERLHAFQEDQNDPSLGALYF 364

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           QFGRYLLISS+R G+   NLQG+W   ++  W+   H+NIN +MN+W +   NLSE   P
Sbjct: 365 QFGRYLLISSTRTGSLPPNLQGLWCNTINTPWNGDYHLNINFQMNHWPAEVTNLSELHLP 424

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
           + ++      +G +TA+V Y A G V H   ++W + +A      W       AWLC HL
Sbjct: 425 MIEWTKQQVESGERTAKVFYNARGLVTHILGNVW-EFTAPGEHPSWGATNTSAAWLCEHL 483

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
           + HY YT+D+++L K  YP+++G A F  D L+ +  + YL T P+TSPE+ +  P+GK+
Sbjct: 484 FTHYQYTLDKEYL-KEVYPVMKGAALFFTDMLVRDPRNNYLVTAPTTSPENAYRMPNGKV 542

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             +   STMD  I+RE+F+  I+AA +L   + A  +++     RL PT I +DG I+EW
Sbjct: 543 VHICAGSTMDNQIVRELFTNTIAAANILGI-DSAFCQELADKRSRLMPTTIGKDGRILEW 601

Query: 594 VQ 595
           ++
Sbjct: 602 LE 603


>gi|291550959|emb|CBL27221.1| hypothetical protein RTO_27700 [Ruminococcus torques L2-14]
          Length = 775

 Score =  315 bits (807), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 200/605 (33%), Positives = 308/605 (50%), Gaps = 57/605 (9%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           KI F   AK + +A+PIGNG LGAMV+G   +E L++NED++WTG   +  NPDA +   
Sbjct: 3   KICFREEAKDWNEALPIGNGFLGAMVFGKTGTERLQINEDSVWTGSFMERVNPDARENYP 62

Query: 74  DVRSLVDSGQY--AEATAASVKLFGHP-ADVYQLLGDIELEFDDS--------------- 115
            VR L+ +G+   AE  A       +P    YQ LGD+ ++F                  
Sbjct: 63  KVRELLLNGEIEQAELLAERSMYATYPHMRHYQTLGDVWIDFYKQRGKTIFKKDQGGLLS 122

Query: 116 --HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
             H     +TY RELD++ A  +++Y     ++ RE F+SNPD +IV ++   +   L+F
Sbjct: 123 VQHESVEVQTYNRELDISRAVGKIQYESEKGKYEREFFASNPDHIIVYQMKSIDGELLNF 182

Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGR--CPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
           ++SL +  DN S   G      +G     G +I        D  GI F  +++++  +  
Sbjct: 183 DLSL-TRKDNRS---GRGSSFCDGTEVLDGNKIRLYGKQGGD-HGIAFELLVQVRTKN-- 235

Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
           G IS +    L VE +  A L + A +SF           + P    M  L +    SY 
Sbjct: 236 GKISRM-GSHLLVEDAKEATLFITARTSF---------RSEQPLQWCMDVLSNAEKESYG 285

Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLV 350
            L  RH+ DY   + + +++L+            +++ + + + ER++  +   ED  L+
Sbjct: 286 TLQERHIKDYLSYYEKSNLKLN-----------YKDSYEHLTTPERLEQMRNGIEDIELI 334

Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
              + F RYLLISSSR G+  +NLQGIWNE+  P W S   +NIN+EMNYW +    LS+
Sbjct: 335 NTYYNFARYLLISSSREGSLPSNLQGIWNEEFEPMWGSKYTININIEMNYWIAEKTGLSK 394

Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
              PL + L  +  +G   A+  Y   G+  HH TDIW   +     V   LWPMGGAW 
Sbjct: 395 LHMPLLEHLQRMYPHGKDVAEKMYGIDGFCCHHNTDIWGDCAPQDNHVSSTLWPMGGAWF 454

Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
           C HL EHY YT DR+FL K  Y +L+    F L ++++   G   + PS+SPE+ ++   
Sbjct: 455 CLHLIEHYKYTKDREFL-KEYYGILKDAVKFFLQYMVKDAHGKWISGPSSSPENIYLNQK 513

Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRPTKIAEDG 588
           G+  C+   ++MD  IIRE+F+  +   E+ E+N+  + L E + + L  +   +I + G
Sbjct: 514 GEAGCLCMGASMDTEIIRELFNGYL---EITEENQLPNDLNEAINERLNHMPELQIGKYG 570

Query: 589 SIMEW 593
            I EW
Sbjct: 571 QIQEW 575


>gi|346972979|gb|EGY16431.1| alpha-L-fucosidase [Verticillium dahliae VdLs.17]
          Length = 765

 Score =  315 bits (806), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 205/597 (34%), Positives = 299/597 (50%), Gaps = 61/597 (10%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + ++ PA  +++A+P+GNGRLG MV+G   +E L+LNED++W G P D T  DA + L
Sbjct: 8   LTLHYDAPAASWSEALPVGNGRLGGMVYGRTSTELLQLNEDSVWYGGPQDRTPRDARRHL 67

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADVY--QLLGDIELEFDDSHLKYAEETYRRELD 129
             +R L+   ++A A A      F  PA +   + LG+  LEF   H       YRR LD
Sbjct: 68  DTLRQLIRDEKHAAAEALVREAFFATPASMRHSEPLGNCTLEF--GHEAQDVTGYRRSLD 125

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--------LDSLL 181
           L TA A V+Y    V + RE  +S PD V+  + S SE       ++         +  L
Sbjct: 126 LATAQATVEYQCTGVSYRRETIASFPDNVVALRFSASEPTRFVVRLNRVSEIEWETNEFL 185

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALED 239
           D+    NG  +I++     GK        N +P     S +L I    +D+ G+I A+ +
Sbjct: 186 DSIQAANG--RIVLNATPGGK--------NSNP----LSLVLGISCDANDEGGSIEAVGN 231

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
                       L++ A S       + +  K DP + +   +      S+ +L  R   
Sbjct: 232 -----------ALVVKAFSCTIAIAAHTTYRKADPEAAARQDVDKALKRSWHELVLRQRT 280

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DY  LF R S+++  +  D+             P+ ER+   + + DP LV L + +GRY
Sbjct: 281 DYASLFQRSSLRMWPAAHDL-------------PTNERI---EKNRDPGLVALYYNYGRY 324

Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           LLISSSR   +   A LQGIWN   +P W     +NINL+MNYW + PCNL +C  P+  
Sbjct: 325 LLISSSRDSDKALPATLQGIWNPSFAPPWGCKYTININLQMNYWLAAPCNLVDCALPMLG 384

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            +  +++ G+KTA+  Y   GW  HH TDIWA +      +   +WP+GG WLC  + E 
Sbjct: 385 LVERMAVRGAKTARTMYDCGGWCAHHNTDIWADTDPQDRWMPSTIWPLGGVWLCIDVLEM 444

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACV 536
             Y  DR  L +RA  LLEGC  FLLD+LI    G +L TNPS SPE+ F++  G    +
Sbjct: 445 LLYQYDRK-LHERAAVLLEGCIVFLLDFLIPSACGKFLVTNPSLSPENTFVSKSGDTGIL 503

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
              S +D  IIR  F   + +  +L+K  + LV +V  ++ RL    I  DG I EW
Sbjct: 504 CEGSAIDTTIIRIAFEKFLWSTAILDKG-NPLVPEVRDAMARLPNLTINNDGLIQEW 559


>gi|253580291|ref|ZP_04857557.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA]
 gi|251848384|gb|EES76348.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA]
          Length = 751

 Score =  315 bits (806), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 190/595 (31%), Positives = 303/595 (50%), Gaps = 60/595 (10%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + F+ PA+ + +A+P+GNG +GAM +G   +E ++LN D+LW+G   +  NP+     
Sbjct: 4   LALIFDKPAEAWNEALPLGNGTMGAMSYGRFQNERIELNLDSLWSGNGRNKENPNKNVDW 63

Query: 73  SDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
              R  + +G Y  A       + G   + Y   G + +   +  ++     YRREL L 
Sbjct: 64  DLFRKHIFAGDYQGAENYCKENVLGDWTESYLPAGTLSINVKEP-IQNGNSFYRRELCLT 122

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
            AT ++++   ++ + RE F S  + V+      S + +L  +++L+S + + S     N
Sbjct: 123 NATEKIEFCQDDLIYQREFFVSMSEPVMAIHYHTSPNCNLEMSITLESEIKHKSAFFAEN 182

Query: 192 QIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
            II+EG+ P    PP  +       ++ +GI+F+  + + +  + G +    DK      
Sbjct: 183 GIILEGQAPIYVAPPYYSCEVPVVYEEGQGIRFA--IGLYVQTNGGNVYQQADKLFINTP 240

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D  V + V+        +     K+   S+    +++I+++ Y      H+D Y   F 
Sbjct: 241 ND--VYIYVSG-------VTDFKQKELFFSKRNCMMENIQHIQYEKQKKAHMDVYANYFD 291

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           R+ + ++ +P                             D  L   +F + RYL+I SS 
Sbjct: 292 RMHLDINYTP-----------------------------DNELALKMFHYARYLMICSSV 322

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG+Q  NLQGIWN  +   W S   VNIN EMNYW +   NLS+C  PL + +   S  G
Sbjct: 323 PGSQCTNLQGIWNHHMRAPWSSNYTVNINTEMNYWMAEKANLSDCHMPLLELIERTSKKG 382

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS------ADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            KTAQ  Y  +GWV HH  DIW  SS       D     +++WPM   WLC HLWEHY Y
Sbjct: 383 EKTAQDVYHLAGWVSHHNLDIWGHSSPVGQFGQDENPCTYSMWPMSSGWLCCHLWEHYCY 442

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
           T+D  FL+K+A+P+++G   F L +L+  + GY  T PSTSPE+ F+APD     V+++S
Sbjct: 443 TLDEAFLKKKAFPIIQGAVEFYLGYLVP-YKGYYVTAPSTSPENTFLAPDMTTHGVTFAS 501

Query: 541 TMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           TMD++I+RE+F   + A E+L  E   +A V+ VL+ LP   P KI ++G + EW
Sbjct: 502 TMDISILRELFGLYLKACEILGVEDFTNA-VKNVLQKLP---PYKIGKEGQLQEW 552


>gi|315500597|ref|YP_004089399.1| Alpha-L-fucosidase [Asticcacaulis excentricus CB 48]
 gi|315418609|gb|ADU15248.1| Alpha-L-fucosidase [Asticcacaulis excentricus CB 48]
          Length = 788

 Score =  314 bits (805), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 199/591 (33%), Positives = 290/591 (49%), Gaps = 62/591 (10%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK---- 70
           ++FN PA  + +A+P+GNGRLGAMV+GGV SE L+LN   LW+G     T  D PK    
Sbjct: 38  LSFNAPAARWMEALPVGNGRLGAMVYGGVRSERLQLNHIELWSG----RTVEDNPKTTRA 93

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPAD-----VYQLLGDIELEFDDSHLKYAEETYR 125
           AL  VR L+ + + AEA   +      P +      YQ+LGD+ LE        A   Y 
Sbjct: 94  ALPKVRELLFADKRAEANRLAQDDMMAPMNEVDYGSYQMLGDLRLEMGHEE---AVSDYS 150

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           RELD+ T    V+Y +G   ++R   +S PDQ +  +I  S    LS   +L    D   
Sbjct: 151 RELDMATGQVTVRYRIGKATYSRTVLASAPDQCLAVRIETSAPEGLSLKATLKR--DRDV 208

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
             +   Q++            K +    P G+ + A L  +     G   A +    +V 
Sbjct: 209 AFDWQGQVL------------KMSGQPQPFGVHYCAYLACR---SEGGSVAPDGHGFRVS 253

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G+   VL L  ++    P         +P   + +A   +   S+  L      D++ LF
Sbjct: 254 GARAVVLNLTGATDLLAP---------EPEKVAQAAQAKLVARSWQALARDQERDHRALF 304

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVP--SAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
            RV + L+ +                VP  ++ER+ +     + +L+E  F FGRYLLI 
Sbjct: 305 ERVELTLASA---------------GVPRLASERLAAASDAAEMALIETYFNFGRYLLIG 349

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           S+RPG+   NLQG+W +  +P W +  H+NIN++MNYW +  C LSE  E LFD++  L 
Sbjct: 350 SNRPGSLPPNLQGLWADGFAPPWSADYHININIQMNYWPAEVCGLSELHESLFDYVDRLM 409

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
               +TAQ+ Y   G V H+ T+ W  ++ D GKV W LWP G AWL  H WEHY YT D
Sbjct: 410 PYARQTAQIAYGCRGAVAHYTTNPWGHTALD-GKVQWGLWPEGLAWLTLHYWEHYLYTGD 468

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
            +FL+ RA P+   CA F LD+L+E    G L + P++SPE+ ++  +G++  V     M
Sbjct: 469 LEFLKTRALPVFRACAEFTLDYLVEDPRTGKLVSGPASSPENSYVMDNGEVGYVDMGCAM 528

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             ++   V +    A E L   E  L E    +L RL   KI  DG + EW
Sbjct: 529 SQSMAFTVLTLTQKATEALSV-EPELREACAAALARLDRLKIGPDGRVQEW 578


>gi|154494326|ref|ZP_02033646.1| hypothetical protein PARMER_03681 [Parabacteroides merdae ATCC
           43184]
 gi|423725485|ref|ZP_17699622.1| hypothetical protein HMPREF1078_03511 [Parabacteroides merdae
           CL09T00C40]
 gi|154085770|gb|EDN84815.1| hypothetical protein PARMER_03681 [Parabacteroides merdae ATCC
           43184]
 gi|409234609|gb|EKN27437.1| hypothetical protein HMPREF1078_03511 [Parabacteroides merdae
           CL09T00C40]
          Length = 809

 Score =  314 bits (804), Expect = 9e-83,   Method: Compositional matrix adjust.
 Identities = 202/609 (33%), Positives = 315/609 (51%), Gaps = 50/609 (8%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N     T  PL   F+ PA  +  + P+GNGRLG M  GGV +E + LNE ++W+G   D
Sbjct: 17  NMPEVKTGKPLSYHFDAPAGIWEASFPLGNGRLGLMPDGGVDTENIVLNEISMWSGSKQD 76

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIE 109
             NP A  +L  +R L+  G+  EA       F         G  A+V    YQLLG++ 
Sbjct: 77  TDNPQAYHSLGTIRKLLFEGRNDEAQELMYNTFVCKGEGSGQGQGANVPYGSYQLLGNLV 136

Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
           L +D      +   YRREL+L+ A A   +  G V + RE F+S  D + V  ++     
Sbjct: 137 LNYDYQGTSDSIFGYRRELNLDNAIATASFRRGKVTYNREVFTSFADDLGVIHLTADADR 196

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
           +L+F+  ++   +++      N ++M+G+ P            + KG+++++   +++  
Sbjct: 197 ALNFSFGMNRP-EHYKVTADGNDLLMQGQLP------DGVDTLEMKGLRYAS--RVRVIL 247

Query: 230 DRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
            +G      D  + V  +  A+LL+ +A+  FD          KD   +  S L +    
Sbjct: 248 PKGGNVTPGDSTVSVRNASEAILLVSMATDYFD----------KDLEGKVSSLLANAEKK 297

Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDP 347
            ++ L   H+  Y+ LF RV + L  S ++             +P  ER+ +F  + +DP
Sbjct: 298 DFASLKKGHIAAYRSLFGRVELDLGHSSRE------------DLPMDERLAAFHENPDDP 345

Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
           SL  L FQFGRYLLISS+R G    NLQG+W   ++  W+   H+NINL+MN+W +   N
Sbjct: 346 SLAALYFQFGRYLLISSTRVGLLPPNLQGLWCNTINTPWNGDYHLNINLQMNHWPAEVAN 405

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           LSE   PL ++      +G +TA+  Y A GWV H   ++W + +A      W       
Sbjct: 406 LSELHLPLVEWTKQQVASGERTAKAYYNARGWVTHILGNVW-EFTAPGEHPSWGATNTSA 464

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
           AWLC HL+ HY YT+D+++L K  YP+L+G + F +D L+E   + YL T P+TSPE+ +
Sbjct: 465 AWLCEHLYMHYLYTLDKEYL-KDVYPVLKGASLFFVDMLVEDPRNKYLVTAPTTSPENGY 523

Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
             P+GK A +   STMD  I+RE+F+  I AA++L   + A   ++     RL PT I +
Sbjct: 524 KLPNGKTAHICAGSTMDNQIVRELFTNTIEAADILGL-DSAFAGELAAKRARLMPTTIGK 582

Query: 587 DGSIMEWVQ 595
           DG IMEW++
Sbjct: 583 DGRIMEWLE 591


>gi|160885575|ref|ZP_02066578.1| hypothetical protein BACOVA_03577 [Bacteroides ovatus ATCC 8483]
 gi|156109197|gb|EDO10942.1| hypothetical protein BACOVA_03577 [Bacteroides ovatus ATCC 8483]
          Length = 826

 Score =  314 bits (804), Expect = 9e-83,   Method: Compositional matrix adjust.
 Identities = 200/594 (33%), Positives = 312/594 (52%), Gaps = 47/594 (7%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           N  +I ++ PA ++ +A+P+GNGR+ AMV+G    E ++LNE+T+  G P    N +A  
Sbjct: 25  NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKT 84

Query: 71  ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
           AL ++R L+  G+Y EA   A+ K+     +   YQ +G + + + D H K     Y R+
Sbjct: 85  ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 141

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
           LD++ A A  +Y V  VEFT E F+S  DQ+++  I  S+ G+++  +  ++ + D    
Sbjct: 142 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 201

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           + G   + +EG   G R  P          + + A L++K     G +    D  L V+G
Sbjct: 202 IYGKKGLRLEGITHGSRYFPGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 251

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    L +  +++F    +N  D   DP   + + L++     YS     H+  YQK F+
Sbjct: 252 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 306

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV++ L  +         S+ N    P   R+K F +  DP+L+ L FQ+GRYLLISSS+
Sbjct: 307 RVTLDLGET---------SQAN---KPMDVRIKEFSSSYDPALIALYFQYGRYLLISSSQ 354

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQG WN +  P W      NIN EMNYW +   NL+E  +P    +  LS NG
Sbjct: 355 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 414

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
            + A   Y   GWV+HH TD+W  + A DR       WP+  AWLC HLW+ Y ++ D+ 
Sbjct: 415 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 472

Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
           +LE+  YP+++  + F +D+L+ + + GYL   PS SPE+   +I     L       TM
Sbjct: 473 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 528

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
           D  ++ ++FS    AA+VL  N D      LK++ R L P ++ + G + EW +
Sbjct: 529 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFE 580


>gi|393781489|ref|ZP_10369684.1| hypothetical protein HMPREF1071_00552 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676552|gb|EIY69984.1| hypothetical protein HMPREF1071_00552 [Bacteroides salyersiae
           CL02T12C01]
          Length = 821

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 192/593 (32%), Positives = 310/593 (52%), Gaps = 48/593 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ ++ PA  + +++P+GNGRLGAMV+G    E  +LNE+T+W G P + TNP A +AL
Sbjct: 24  MKLWYDRPATQWVESLPLGNGRLGAMVYGDPIHEEFQLNEETIWGGSPYNNTNPKAKEAL 83

Query: 73  SDVRSLVDSGQYAEATA------ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
             +R L+  G+  EA A       S    G P   YQ +G + L+F+      +   Y R
Sbjct: 84  PQIRQLIFEGRNKEAQALCGPNICSQTANGMP---YQTVGSLHLDFEGIS---SYSNYYR 137

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH-- 184
           ELD+  A    +++ G V +TRE F+S PDQ+++ +++ SE G LSF     +    +  
Sbjct: 138 ELDIEKAVTTTRFTAGGVTYTREAFTSFPDQLLIIRLTASEKGKLSFTARYSTPYQENIT 197

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
             ++   ++ M+G         KAN ++  +G +QF+A+   +I  + G + ++ D  L+
Sbjct: 198 KSISSRKELQMDG---------KANDHEGIEGKVQFTAL--TRIERNGGHMESVSDTLLR 246

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V  ++ +V + V   S    FIN  D   +    + + L++    +Y      H   Y K
Sbjct: 247 VRNAN-SVTIYV---SIGTNFINYKDISGNARKTAQTYLKNAGK-NYLKAKEAHCATYGK 301

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
            F+RVS+ L  + +               P+  RV  F +  DP L  L FQFGRYLLI 
Sbjct: 302 WFNRVSLDLGSNAQA------------AKPTDVRVHEFASAFDPQLAALYFQFGRYLLIC 349

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SS+PG Q ANLQGIWN  L   WD     +IN+EMNYW + P NL+E  EP    +  ++
Sbjct: 350 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAEPTNLTEMHEPFLQLVKEVA 409

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
             G ++A + Y   GW +HH TDIW  + +  G   + +WP   AW C HLW+ Y ++ +
Sbjct: 410 EQGRQSAAM-YGCRGWTLHHNTDIWRSTGSVDGP-GYGIWPTCNAWFCQHLWDRYLFSGN 467

Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           RD+L +  YPL+     F LD+LI E  + +L  +PS SPE+       +   V   +TM
Sbjct: 468 RDYLAE-VYPLMRSACEFYLDFLIREPQNNWLVVSPSYSPENRPSVNGKRDFVVVAGATM 526

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           D  ++ ++F   + AA ++ ++    ++ +   +  L P ++   G + EW++
Sbjct: 527 DNQMVSDLFHNTLEAASLMGES-STFMDSLQTVVQNLAPMQVGRWGQLQEWME 578


>gi|333380444|ref|ZP_08472135.1| hypothetical protein HMPREF9455_00301 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826439|gb|EGJ99268.1| hypothetical protein HMPREF9455_00301 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 786

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 194/599 (32%), Positives = 316/599 (52%), Gaps = 52/599 (8%)

Query: 9   TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
           + N  +  FN PA  + ++IP+GNGR+G M WGGV  E + LNE +LW G   D  NPDA
Sbjct: 20  SQNKWQYYFNEPASAWEESIPLGNGRIGMMPWGGVDKERIVLNEISLWAGNKQDADNPDA 79

Query: 69  PKALSDVRSLVDSGQYAEATAASVKLF------GHPADV--YQLLGDIELEFDDSHLKYA 120
            K L ++R L+   +  EA     K F      G  AD   ++  G++ ++        A
Sbjct: 80  YKHLGEIRKLLFEKKNREAQELMYKTFTCKGEGGSGADYGKFENFGNLYIDITYPDASAA 139

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              YRR LD+N A + V Y+ G +++TRE+F+S  D + + + +  +S +L+  +SLD  
Sbjct: 140 VSDYRRTLDMNNALSDVTYTKGGIKYTREYFTSFTDDIGIARYTADKSKALNMCISLDRD 199

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
            +  +Y +G    I  G+ P         A +  +G+++  +++   ++ +G       +
Sbjct: 200 ENYETYASGPVLYIF-GQLP---------AGEGKEGMKYLGMVK---AEHKGGQLFTNAR 246

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA--LQSIRNLSYSDLYTRHL 298
            ++++ +D   L +  +++++G              E ++   L  ++   Y     +H+
Sbjct: 247 DIEIKNADEVTLFISLATNYNGV-----------EHEKLAGYLLNKLKG-DYKTRKQKHI 294

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
           + YQ LF+RV + L ++           +N D +P  +R+++F  D  D  L  L  Q+G
Sbjct: 295 EKYQNLFNRVDLTLGKN-----------KNSD-LPINKRLEAFVNDRSDYDLAALYMQYG 342

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLISS+R G    NLQG+W   +   W+   H+NINL+MN W +  CNLSE   P  +
Sbjct: 343 RYLLISSTREGGLPPNLQGLWAPQIHTPWNGDYHLNINLQMNLWPAEVCNLSELHLPTIE 402

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           ++  L+  G KTA+V Y + GWV H   ++W  +S       W      GAW+C HLWEH
Sbjct: 403 YVKSLTEPGHKTAKVYYNSDGWVTHILGNVWGFTSPGESP-SWGATNTSGAWMCQHLWEH 461

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACV 536
           Y Y+ D ++L K  YP ++G A F  + L+E  ++GYL T P+TSPE+ +I   G +  V
Sbjct: 462 YLYSQDVEYL-KSVYPTMKGAALFFENMLVEDPNNGYLVTAPTTSPENTYITESGDVLSV 520

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
              STMD  I+RE+F+ +  AA++L  +E   +  +     RL PT I + G IMEW++
Sbjct: 521 CAGSTMDNQIVRELFTNVSEAAKILNTDEQ-WIRTIETKKQRLAPTTIGKYGQIMEWLE 578


>gi|255692382|ref|ZP_05416057.1| fibronectin type III domain protein [Bacteroides finegoldii DSM
           17565]
 gi|260621848|gb|EEX44719.1| hypothetical protein BACFIN_07502 [Bacteroides finegoldii DSM
           17565]
          Length = 826

 Score =  313 bits (803), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 200/594 (33%), Positives = 312/594 (52%), Gaps = 47/594 (7%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           N  +I ++ PA ++ +A+P+GNGR+ AMV+G    E ++LNE+T+  G P    N +A  
Sbjct: 25  NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKA 84

Query: 71  ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
           AL ++R L+  G+Y EA   A+ K+     +   YQ +G + + + D H K     Y R+
Sbjct: 85  ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 141

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
           LD++ A A  +Y V  VEFT E F+S  DQ+++  I  S+ G+++  +  ++ + D    
Sbjct: 142 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 201

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           + G   + +EG   G R  P          + + A L++K     G +    D  L V+G
Sbjct: 202 IYGKKGLRLEGITYGSRYFPGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 251

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    L +  +++F    +N  D   DP   + + L++     YS     H+  YQK F+
Sbjct: 252 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 306

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV++ L  +         S+ N    P   R+K F +  DP+L+ L FQ+GRYLLISSS+
Sbjct: 307 RVTLDLGET---------SQAN---KPMDVRIKEFSSSYDPALIALYFQYGRYLLISSSQ 354

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQG WN +  P W      NIN EMNYW +   NL+E  +P    +  LS NG
Sbjct: 355 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 414

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
            + A   Y   GWV+HH TD+W  + A DR       WP+  AWLC HLW+ Y ++ D+ 
Sbjct: 415 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 472

Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
           +LE+  YP+++  + F +D+L+ + + GYL   PS SPE+   +I     L       TM
Sbjct: 473 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 528

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
           D  ++ ++FS    AA+VL  N D      LK++ R L P ++ + G + EW +
Sbjct: 529 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFE 580


>gi|423290259|ref|ZP_17269108.1| hypothetical protein HMPREF1069_04151 [Bacteroides ovatus
           CL02T12C04]
 gi|423294445|ref|ZP_17272572.1| hypothetical protein HMPREF1070_01237 [Bacteroides ovatus
           CL03T12C18]
 gi|392665646|gb|EIY59169.1| hypothetical protein HMPREF1069_04151 [Bacteroides ovatus
           CL02T12C04]
 gi|392675636|gb|EIY69077.1| hypothetical protein HMPREF1070_01237 [Bacteroides ovatus
           CL03T12C18]
          Length = 816

 Score =  313 bits (803), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 200/594 (33%), Positives = 312/594 (52%), Gaps = 47/594 (7%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           N  +I ++ PA ++ +A+P+GNGR+ AMV+G    E ++LNE+T+  G P    N +A  
Sbjct: 15  NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKT 74

Query: 71  ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
           AL ++R L+  G+Y EA   A+ K+     +   YQ +G + + + D H K     Y R+
Sbjct: 75  ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 131

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
           LD++ A A  +Y V  VEFT E F+S  DQ+++  I  S+ G+++  +  ++ + D    
Sbjct: 132 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 191

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           + G   + +EG   G R  P          + + A L++K     G +    D  L V+G
Sbjct: 192 IYGKKGLRLEGITHGSRYFPGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 241

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    L +  +++F    +N  D   DP   + + L++     YS     H+  YQK F+
Sbjct: 242 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 296

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV++ L  +         S+ N    P   R+K F +  DP+L+ L FQ+GRYLLISSS+
Sbjct: 297 RVTLDLGET---------SQAN---KPMDVRIKEFSSSYDPALIALYFQYGRYLLISSSQ 344

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQG WN +  P W      NIN EMNYW +   NL+E  +P    +  LS NG
Sbjct: 345 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 404

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
            + A   Y   GWV+HH TD+W  + A DR       WP+  AWLC HLW+ Y ++ D+ 
Sbjct: 405 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 462

Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
           +LE+  YP+++  + F +D+L+ + + GYL   PS SPE+   +I     L       TM
Sbjct: 463 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 518

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
           D  ++ ++FS    AA+VL  N D      LK++ R L P ++ + G + EW +
Sbjct: 519 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFE 570


>gi|336415223|ref|ZP_08595564.1| hypothetical protein HMPREF1017_02672 [Bacteroides ovatus
           3_8_47FAA]
 gi|335941256|gb|EGN03114.1| hypothetical protein HMPREF1017_02672 [Bacteroides ovatus
           3_8_47FAA]
          Length = 816

 Score =  313 bits (802), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 200/594 (33%), Positives = 312/594 (52%), Gaps = 47/594 (7%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           N  +I ++ PA ++ +A+P+GNGR+ AMV+G    E ++LNE+T+  G P    N +A  
Sbjct: 15  NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKT 74

Query: 71  ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
           AL ++R L+  G+Y EA   A+ K+     +   YQ +G + + + D H K     Y R+
Sbjct: 75  ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 131

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
           LD++ A A  +Y V  VEFT E F+S  DQ+++  I  S+ G+++  +  ++ + D    
Sbjct: 132 LDISNAVAVARYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 191

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           + G   + +EG   G R  P          + + A L++K     G +    D  L V+G
Sbjct: 192 IYGKKGLRLEGITHGSRYFPGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 241

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    L +  +++F    +N  D   DP   + + L++     YS     H+  YQK F+
Sbjct: 242 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 296

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV++ L  +         S+ N    P   R+K F +  DP+L+ L FQ+GRYLLISSS+
Sbjct: 297 RVTLDLGET---------SQAN---KPMDVRIKEFSSSYDPALIALYFQYGRYLLISSSQ 344

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQG WN +  P W      NIN EMNYW +   NL+E  +P    +  LS NG
Sbjct: 345 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 404

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
            + A   Y   GWV+HH TD+W  + A DR       WP+  AWLC HLW+ Y ++ D+ 
Sbjct: 405 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 462

Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
           +LE+  YP+++  + F +D+L+ + + GYL   PS SPE+   +I     L       TM
Sbjct: 463 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 518

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
           D  ++ ++FS    AA+VL  N D      LK++ R L P ++ + G + EW +
Sbjct: 519 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFE 570


>gi|380692991|ref|ZP_09857850.1| hypothetical protein BfaeM_03308 [Bacteroides faecis MAJ27]
          Length = 779

 Score =  313 bits (801), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 201/593 (33%), Positives = 314/593 (52%), Gaps = 43/593 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKA 71
           +K+ ++ PA  +  ++P+GNGRLGAMV+GGV +ET+ LNE T+W+G   ++   P   + 
Sbjct: 1   MKLWYDKPADKWMKSLPLGNGRLGAMVYGGVETETIGLNESTMWSGEYDEHQQRPLGREK 60

Query: 72  LSDVRSLVDSGQYAEAT-AASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
           L  +R L      AE    A   + G  H A  +  +GD++L F     + ++  Y  EL
Sbjct: 61  LDQIRKLFFEDNLAEGNHIAGNTMAGSPHSAGTHLPIGDLKLNFTYPEGELSD--YHHEL 118

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           DL TAT  V Y VG+ E+TR+  +SNPD VI   I  S   S++  + L  LL N   V 
Sbjct: 119 DLTTATNTVTYKVGDTEYTRQCIASNPDDVIAMHIKASRPESITVELELQ-LLRNAEVVA 177

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
             NQ+I  G    ++            G+ F   +  +I    GTI A + KKL ++ + 
Sbjct: 178 SGNQLIYTGNAEFEK--------HGRGGVLFEGRIAAEIKG--GTIKA-DGKKLLIDKAT 226

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
             +LL    S     + N + +  D   +    +++    S+  L   H++DY  LF RV
Sbjct: 227 EVLLL----SDVRTNYKNTTFAGYDYQQKCKETIEAASKKSFKTLRNTHVEDYTPLFSRV 282

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
           ++    + K              +P+ +R    +  E DP L  L FQ+ RYLLISSSRP
Sbjct: 283 ALSFGENGK-----------FSHLPNDQRWARVKAGESDPGLDALFFQYARYLLISSSRP 331

Query: 368 GTQV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
            + +   LQG +N++L+    W +  H++IN E NYW +   NL EC  PLFD++  LS+
Sbjct: 332 NSPLPVALQGFFNDNLACHMGWTNDYHLDINTEQNYWIANVGNLPECHLPLFDYIKDLSV 391

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           +GSK AQ  Y   GW  H  ++ W  ++   G ++W L+P   +W+ +H+W  Y YT D+
Sbjct: 392 HGSKIAQDLYGCKGWTAHTTSNPWGYAAVS-GSILWGLFPTASSWITSHVWTQYEYTQDK 450

Query: 485 DFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
           +FL++ AYPLL+  A FLLD+++ +  + YL T PS SPE+ F    G+  C S   T D
Sbjct: 451 NFLKETAYPLLKSNAEFLLDYMVTDPRNNYLVTGPSISPENSF-RYQGQEFCASMMPTCD 509

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
             ++ E+FSA + + E+L  N DA     L++ + +L P +I+ +G + EW +
Sbjct: 510 RVLVYEIFSACLKSTEIL--NVDAAFADSLRTAISKLPPFRISANGGVQEWFE 560


>gi|218262384|ref|ZP_03476870.1| hypothetical protein PRABACTJOHN_02544 [Parabacteroides johnsonii
           DSM 18315]
 gi|218223418|gb|EEC96068.1| hypothetical protein PRABACTJOHN_02544 [Parabacteroides johnsonii
           DSM 18315]
          Length = 809

 Score =  313 bits (801), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 199/609 (32%), Positives = 313/609 (51%), Gaps = 50/609 (8%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N     T   L   F+ PA+ + + +P+GNGRLG M  GGV +E + LNE ++W+G   D
Sbjct: 17  NMPEVKTGKSLSYHFDAPAEIWEETLPLGNGRLGLMPDGGVDTEKIVLNEISMWSGSKQD 76

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
             NP A  +L  +R L+  G+  EA       F               P   YQLLG++ 
Sbjct: 77  TDNPQAYYSLGTIRKLLFEGRNDEAQELMYNTFVCKGEGSALGQGANAPYGSYQLLGNLV 136

Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
           L +D      +   YRREL+L+ A A   +  G V++ RE F+S  D + V  ++     
Sbjct: 137 LNYDYQGSSDSISGYRRELNLDNAIATASFRRGKVKYDREVFTSFADDLGVIHLTADADK 196

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
           +L+F+  ++   +++      N ++M+G+ P            + KG+++++ + + +  
Sbjct: 197 ALNFSFGMNRP-EHYKVTADGNDLLMQGQLP------DGVDTLEMKGLRYASRVRVVLPK 249

Query: 230 DRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
               I    D  + +  +  A+LL+ +A+  FD          KD   +  S L +    
Sbjct: 250 GGNVIPG--DSTVTIRNASEAILLVSMATDYFD----------KDLDEKVASLLANAEKK 297

Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDP 347
            ++ L   H+  Y+ LF RV + L  S ++             +P  ER+ +F  D +DP
Sbjct: 298 DFASLKKGHIAAYRSLFGRVDLDLGHSSRE------------DLPIDERLATFNADPDDP 345

Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
           SL  L FQFGRYLLISS+R G    NLQG+W   ++  W+   H+NINL+MN+W +   N
Sbjct: 346 SLGALYFQFGRYLLISSTRVGLLPPNLQGLWCNTVNTPWNGDYHLNINLQMNHWPAEVAN 405

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           LSE   PL ++      +G +TA+  Y A GWV H   ++W + +A      W       
Sbjct: 406 LSELHLPLVEWTKQQVASGEQTAKAYYNAGGWVTHILGNVW-EFTAPGEHPSWGATNTSA 464

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
           AWLC HL+ HY YT+D+++L K  YP+L+G + F +D L+E   + YL T P+TSPE+ +
Sbjct: 465 AWLCEHLYMHYLYTLDKEYL-KDVYPVLKGASRFFVDMLVEDPRNKYLVTAPTTSPENGY 523

Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
             P+GK A +   STMD  I+RE+F+  I AA +L   + A   +++    RL PT I +
Sbjct: 524 KLPNGKTAHICAGSTMDNQIVRELFTNTIEAANILGI-DSAFAGELVAKRARLMPTTIGK 582

Query: 587 DGSIMEWVQ 595
           DG IMEW++
Sbjct: 583 DGRIMEWLE 591


>gi|294806382|ref|ZP_06765225.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
 gi|294446397|gb|EFG15021.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 562

 Score =  312 bits (800), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 202/582 (34%), Positives = 302/582 (51%), Gaps = 57/582 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PAK++++A+PIGN RLGAMV+GG   E L+LNE+T W G P +  NP+A   L
Sbjct: 22  LKLWYSQPAKNWSEALPIGNSRLGAMVYGGTEREELQLNEETFWAGSPYNNNNPNAVHVL 81

Query: 73  SDVRSLVDSGQYAEATA---ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
             VR L+  G+  EA     A+     H    Y  LG++ LEF     K A++ YR +L+
Sbjct: 82  PIVRKLIFEGRNKEAQRLIDANFLTRQHGMS-YLTLGNLYLEFPGH--KDADDFYR-DLN 137

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L  AT   +Y V  + +TR  F+S  D VI+  I  S+  +L+FNVS +  L N   V  
Sbjct: 138 LENATTTTRYQVNGINYTRTTFASFTDNVIIMHIKASQPNALNFNVSYNCPLKNEVNVQN 197

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           +  II    C GK          + +G++ +   E ++      I       L++ G   
Sbjct: 198 DKLIIT---CQGK----------EQEGMKAALRAECQVQVKTDGIIHPAGNILQINGGTE 244

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L + A++++    +N  +   D +  +   L+    + Y      H+  Y+K F RV 
Sbjct: 245 ATLYISAATNY----VNYQNVSADESRRTTDYLEEAILIPYEKALKEHIAFYKKQFDRVQ 300

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L  S                + +  R+++F    D ++  LLFQ+GRYLLISSS+PG 
Sbjct: 301 LHLPSS------------EASQIETPRRIENFGQGNDMAMAALLFQYGRYLLISSSQPGG 348

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
           A+  Y   GWV HH TD+W       G V +A   +WP GGAWL  H+W+HY +T +++F
Sbjct: 409 ARTMYDCWGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L K  YP+L+G A F +D+L+E H  Y  L  +PS SPEH           ++   TMD 
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPTYKWLVVSPSVSPEH---------GPITAGCTMDN 513

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
            I  +     + A+ +  +   +  + + ++L +L P +I +
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGK 554


>gi|373956599|ref|ZP_09616559.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
 gi|373893199|gb|EHQ29096.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
          Length = 783

 Score =  312 bits (800), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 204/611 (33%), Positives = 317/611 (51%), Gaps = 62/611 (10%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           S   ++PL++ +N PA+ + + +P+GNGRLG M  GGV  ET+ LN+ TLW+G P D  N
Sbjct: 20  SFGQSHPLRLWYNKPAQMWEETLPLGNGRLGMMPDGGVSQETIVLNDITLWSGAPQDANN 79

Query: 66  PDAPKALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEFD---- 113
             A K+L  +R L+  G+  EA A   + F        G     YQ+LG++ L F     
Sbjct: 80  YQAYKSLPQIRKLLMEGKNDEAQALVDQAFICTGKGSGGVNYGCYQVLGNLSLNFQYPDH 139

Query: 114 ---DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS 170
              +S + Y  + Y REL L+ A A+  Y V  V + RE+ +S  D V + K++  + G 
Sbjct: 140 NTANSPVNY--QNYERELTLDNAIAKCTYQVNGVTYKREYITSFGDDVDIIKLTADKPGQ 197

Query: 171 LSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD 230
           L+ ++ +     + + V  N  + MEG+          +   D KG+Q+ AI++   ++ 
Sbjct: 198 LNLSIGISRPERSATSV-ANGALQMEGQL---------DNGIDGKGMQYQAIVK---AEQ 244

Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY 290
           +G        ++ ++ +   ++ + A + F  P       K+   S    A+Q      Y
Sbjct: 245 QGGSVNYSSSQINIKDATSVIIYISAGTDFRNPHF-----KQSIQSVLTKAIQK----PY 295

Query: 291 SDLYTRHLDDYQKLFHRVSIQLSRSP-KDIVTDTCSEENIDTVPSAERVKSFQTDE--DP 347
           S    +H+  YQKLF+RV + L   P K++ TD             +R+ +F  D   D 
Sbjct: 296 SLQKQQHIARYQKLFNRVHVNLGAEPAKELTTD-------------QRLIAFHADRKADN 342

Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
            L  L FQFGRYL I S+R G    NLQG+W   +S  W    H+++N++MN+W     N
Sbjct: 343 GLPALFFQFGRYLSICSTRVGLLPPNLQGLWANQISTPWTGDYHLDVNVQMNHWPLEVAN 402

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           LSE   PL D +  +  +G KTA+  Y A GWV H  T++W  +        W     G 
Sbjct: 403 LSELNLPLADLVKRMVPHGEKTAKAYYNAKGWVAHVITNVWQFTEPGE-SASWGATKAGS 461

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
            WLC +LWEHY +T D ++L +  YP+L+G A F  D LI+    G+L T+PS+SPE+ F
Sbjct: 462 GWLCDNLWEHYAFTNDVNYL-RDIYPVLKGAAQFYNDMLIKDPKSGWLVTSPSSSPENSF 520

Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRPTKI 584
             P+GK A +    T+D  IIRE+F+ +I+A+  L  +    A +++ +  LP   P +I
Sbjct: 521 YLPNGKHASICLGPTIDNQIIRELFNNVITASGKLGVDAALSAELQQRVTQLP--PPGRI 578

Query: 585 AEDGSIMEWVQ 595
           A DG IMEW++
Sbjct: 579 ASDGRIMEWME 589


>gi|383110853|ref|ZP_09931671.1| hypothetical protein BSGG_1961 [Bacteroides sp. D2]
 gi|382949363|gb|EFS31261.2| hypothetical protein BSGG_1961 [Bacteroides sp. D2]
          Length = 810

 Score =  312 bits (800), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 207/592 (34%), Positives = 309/592 (52%), Gaps = 59/592 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+++++A+PIGN RLGAMV+GG   E L+LNE+T W G P    N +A   L
Sbjct: 22  LKLWYSQPARNWSEALPIGNSRLGAMVYGGTEREELQLNEETFWAGGPYSNNNSNAKYVL 81

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR+L+  G+  EA +     F        Y  LG++ ++F     K A   YR +L+L
Sbjct: 82  PVVRNLIFDGKNREAQSLVDANFLTKQHGMSYLTLGNLYIDFPGH--KDASGFYR-DLNL 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             AT   +Y V  V +TR  F+S  D VI+  I   ++ +L+FN++ +  L+ +     +
Sbjct: 139 ENATTTTRYEVNGVTYTRTTFASFTDNVIIVHIQADKTQALNFNMTYNCPLEYNVNAQDD 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
             II    C GK              IQ   ++++K +   G IS    K L+VE +  A
Sbjct: 199 KLIIT---CQGKE------QEGIKAAIQAECVVQVKTN---GAISP-AGKVLQVEKATEA 245

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            L + A++++    +N  +   + +  +   L+      Y+     H+  Y+K F RV +
Sbjct: 246 TLYIAAATNY----VNYQNVSANASERANKFLEKAIQTPYNKALKDHIAFYKKQFDRVRL 301

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            L            SE +    P   R+++F   ED ++  LLFQFGRYLLISSS+PG Q
Sbjct: 302 NLP----------SSEASKAETP--RRIENFNKGEDMAMAALLFQFGRYLLISSSQPGGQ 349

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS+ G++TA
Sbjct: 350 PANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVANLSETHSPLFSMLKDLSVTGAETA 409

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDFL 487
           Q  Y   GWV HH TD+W       G V +A   +WP GGAWL  H+W+HY +T D++FL
Sbjct: 410 QSMYNCRGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGDKEFL 465

Query: 488 EKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
            K  YP+L+G A F +D+L+E  D  +L   PS SPEH           ++   TMD  I
Sbjct: 466 -KEYYPILKGTAQFYMDFLVEHPDYKWLVVAPSVSPEH---------GPITAGCTMDNQI 515

Query: 547 IREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             +     + A+ +  +    +D+L +++L  LP   P +I +   + EW++
Sbjct: 516 AFDALHNTLLASRITGETSSFQDSL-QQILDKLP---PMQIGKHHQLQEWLE 563


>gi|167764888|ref|ZP_02437009.1| hypothetical protein BACSTE_03280 [Bacteroides stercoris ATCC
           43183]
 gi|167697557|gb|EDS14136.1| hypothetical protein BACSTE_03280 [Bacteroides stercoris ATCC
           43183]
          Length = 825

 Score =  312 bits (799), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 195/595 (32%), Positives = 315/595 (52%), Gaps = 52/595 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+ + +A+P+GNG LGAMV+G    E  +LNE+T+W G P + TNP A +AL
Sbjct: 27  LKLWYDSPARQWVEALPLGNGSLGAMVFGDPIHERFQLNEETVWGGSPHNNTNPKAKEAL 86

Query: 73  SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
             +R L+  G+  EA         S    G P   YQ +G + L+F+    KY  + Y R
Sbjct: 87  PRIRQLIFEGKNKEAQELCGPAICSQSANGMP---YQTVGTLHLDFEGIS-KY--DDYYR 140

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL--DNH 184
           +LD+  A A  +++   + + RE F+S PD+++V +++ S+  S+SF     +    +  
Sbjct: 141 DLDIEKAIATTRFTANGITYVRETFTSFPDRLLVIRLTASKKRSISFTAHYTTPYTENTE 200

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
             ++  N++ + G         KAN ++  +G ++F+A+   +I ++ GT+ A  D  L+
Sbjct: 201 RRISSLNELQLNG---------KANDHEGIEGKVRFTAL--TRIENNGGTLKATSDSTLQ 249

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL-QSIRNLSYSDLYTRHLDDYQ 302
           V+ ++  VL +   ++F    IN  D   D    +   + Q+ +N  Y+     H+  YQ
Sbjct: 250 VKNANSVVLYVSIGTNF----INYKDISGDALKTAQQYMKQAGKN--YTKRKEAHIAAYQ 303

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           K F+RVS+ L            S   I   P+  RVK F +  DP +  L FQFGRYLLI
Sbjct: 304 KYFNRVSLDLG-----------SNSQIKK-PTDRRVKEFSSTADPQMAALYFQFGRYLLI 351

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
            SS+PG Q ANLQGIWN  L   WD     +IN+EMNYW +    L E  EP    +  +
Sbjct: 352 CSSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAETTALPEMHEPFLQLVKEV 411

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           +I G ++A + Y   GW +HH TDIW  + A  G   + +WP   AW C HLW+ Y ++ 
Sbjct: 412 AIQGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGP-KYGIWPTCNAWFCQHLWDRYLFSG 469

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           D+++L +  YP++ G   F LD+L+ E  + +L   PS SPE+       +   +   +T
Sbjct: 470 DKNYLAE-VYPIMRGACEFYLDFLVREPQNNWLVVAPSYSPENSPSVNGKRDFVIVAGAT 528

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
           MD  ++ ++F   I AA ++  NE       L+++ + L P ++   G + EW++
Sbjct: 529 MDNQMVYDLFHNTIQAATLM--NEHKSFTDSLQTVAKHLAPMQVGRWGQLQEWME 581


>gi|160879031|ref|YP_001557999.1| hypothetical protein Cphy_0874 [Clostridium phytofermentans ISDg]
 gi|160427697|gb|ABX41260.1| conserved hypothetical protein [Clostridium phytofermentans ISDg]
          Length = 760

 Score =  312 bits (799), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 205/593 (34%), Positives = 300/593 (50%), Gaps = 60/593 (10%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PAK + +A+P+GNGRLGAM++G    E +++NED++W+G   D  NPDA K L  +R
Sbjct: 8   YQDPAKDWDEALPLGNGRLGAMIYGKPEHEIIQVNEDSIWSGYAMDRNNPDAKKNLPIIR 67

Query: 77  SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           SL+  G   EA  A++  L G P ++  YQ  G+I +    S +      Y+R+L+L+ A
Sbjct: 68  SLIADGNLEEAQNATLHSLSGTPDNMRCYQTAGEIHITTGHSEVT----NYKRQLNLSEA 123

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKIS--GSESGSLSFNVSLDSLLDNHSYVNGNN 191
           T  V Y      F REH  S P  V V + +  G    +LS  +S    +D   Y    +
Sbjct: 124 TVTVSYDFEGTTFIREHLISTPADVFVMRFTSKGPRKLNLSILLSRPHFMDR-LYCENGD 182

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
            I++  R                 GI F   L           +A  D K+K  G+   V
Sbjct: 183 SIVLTYR----------------GGIPFCNRL----------TAASCDGKIKTIGAHLVV 216

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
                 + F    I  +   ++ T++  S L  +++L + +L   H  DYQ  F R  + 
Sbjct: 217 SEATTVTLFFD--IRTAYRSENYTNDVKSHLMDVKSLQFDELKRSHKKDYQSFFKRNDLI 274

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQ 370
           L+ S ++       E ++ T+ +A+R++  +    D  L+E  F FGRYLLIS SRPGT 
Sbjct: 275 LTPSAEE-------EADVATLDTAKRLERMRMGHSDLKLLEDYFHFGRYLLISCSRPGTL 327

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWN  ++P W     +NIN EMNYW +   NL E   PLFD L  +  NG  TA
Sbjct: 328 PANLQGIWNNSMTPPWGGKFTININTEMNYWFAEKLNLPELHLPLFDLLKRMHQNGKVTA 387

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
           +  Y   G+V HH TD+W   +     +    W +GGAWLC H+WEHY YT D +FL   
Sbjct: 388 EKMYGCHGFVAHHNTDLWGDCAPQDYWLPGTYWVLGGAWLCLHIWEHYEYTKDINFL-IN 446

Query: 491 AYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
            +P+L     FL ++L E  +G L  +P+ SPE+++  P+G++  +    TMD  I+RE+
Sbjct: 447 MFPVLSDACLFLTEFLTEDENGKLILSPTASPENKYRHPNGRIGYLCAGCTMDHQIMREL 506

Query: 551 FSAIISAAEVL--EKNED-------ALVEKVLKS----LPRLRPTKIAEDGSI 590
           F   I A   L   KN         AL EK+ KS    L RL  T++  +G+I
Sbjct: 507 FHHYIDAYHTLLDAKNSTENKEVPIALNEKLTKSVKDCLSRLPETRVHSNGTI 559


>gi|289773991|ref|ZP_06533369.1| large secreted protein [Streptomyces lividans TK24]
 gi|289704190|gb|EFD71619.1| large secreted protein [Streptomyces lividans TK24]
          Length = 693

 Score =  311 bits (798), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 183/511 (35%), Positives = 269/511 (52%), Gaps = 46/511 (9%)

Query: 92  VKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTRE 149
            +  G P++   YQ+LGD+EL       +     Y RELDL TA AR  Y+ G V   RE
Sbjct: 15  AEFLGSPSEQAAYQVLGDLELTLAG---EGEAADYERELDLETAVARTTYTRGGVRHVRE 71

Query: 150 HFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKAN 209
            F+S PDQV+V ++S    G++ F     S   +       + I ++G           +
Sbjct: 72  VFASAPDQVLVVRLSADTPGAVGFTARFTSPQRSGGSAVDAHTIALDG--------VGGD 123

Query: 210 ANDDPKGIQFSAILEI-----KISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
               P  ++F  +        ++S D GT        L VEG+D A L++  ++S+    
Sbjct: 124 WYGRPGSVRFRGLARAESEGGRVSTDGGT--------LTVEGADAATLVISLATSYR--- 172

Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
            N  D   DP S + + L       Y+ L  RH+ D+++LF RV++ L  S +       
Sbjct: 173 -NYLDVGADPASRARNHLAPAARKPYAHLRARHVADHRRLFGRVALDLGPSERA------ 225

Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSP 384
                  +P+ +R+  F   +DP L  L FQ+GRYLL S SR   Q ANLQG+WN+ L+P
Sbjct: 226 ------ELPTDQRIPLFADGKDPQLAALYFQYGRYLLASCSRSPGQPANLQGLWNDSLNP 279

Query: 385 TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHK 444
            W+S   VNIN EMNYW + P NL+EC +P    +  L+ +G++TA+  Y A GWV+HH 
Sbjct: 280 AWESKYTVNINFEMNYWPAGPGNLAECWDPAVRMVHELAESGTRTAKALYDAPGWVLHHN 339

Query: 445 TDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLD 504
           TD W + +A      + +WP GGAWLC  LW+HY +T D   L  R YP+++G   F LD
Sbjct: 340 TDGW-RGTAPVDAAQYGMWPTGGAWLCVMLWDHYRFTGDTGAL-SRNYPVMKGAVEFFLD 397

Query: 505 WL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK 563
            L ++   G+L TNPS SPE      +G+   +    TMDM ++R++F A   AAEVL++
Sbjct: 398 TLQVDAETGWLVTNPSQSPEVTHHQDEGESVSICAGPTMDMQLLRDLFDAYRQAAEVLDR 457

Query: 564 NEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +   LV +V +   RL PT++   G I EW+
Sbjct: 458 DSR-LVGRVTEVRDRLAPTRVGHLGQIQEWL 487


>gi|375144807|ref|YP_005007248.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
 gi|361058853|gb|AEV97844.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
          Length = 780

 Score =  311 bits (798), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 204/603 (33%), Positives = 318/603 (52%), Gaps = 55/603 (9%)

Query: 9   TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
           T  PL++ ++ PA  + + +P+GNGRLG M  GGV  E + LN+ TLW+G P D  N  A
Sbjct: 27  TNKPLRLWYDKPAAQWEETLPLGNGRLGMMPDGGVLQENIVLNDITLWSGAPQDANNYKA 86

Query: 69  PKALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEFD-DSHLKY 119
            + L +++ L+  G+  EA A   K F          P   +Q LG + + F+ D     
Sbjct: 87  NQKLPEIQKLLLEGKNDEAQALINKDFICTGKGSGAEPFGCFQTLGRLGIAFNYDGPANA 146

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
           A   Y R+L LN A A   Y VG+V + RE+F+S  + V + K++ S +G L+F VSL S
Sbjct: 147 AFTNYSRQLSLNDAAAACTYKVGDVTYNREYFTSFGNDVGIIKLTASAAGKLNFEVSL-S 205

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
             +  +     N++ M G+              D KG+Q+ A++  K++   G++SA  +
Sbjct: 206 RPEKATVTVAGNKLEMAGQLEN---------GTDGKGMQYVALVSAKLTG--GSLSAAGN 254

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
           K L V+ +  A+L   A +S+            D    +   L     ++Y     +HL+
Sbjct: 255 K-LVVKNATKAILFFSAKTSY---------KDADYRQHAQQLLDKAMLVAYDAEKKKHLN 304

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF--QTDEDPSLVELLFQFG 357
           +Y KLF+R+ + L  S              D +P+ +R+  F   T  D  L  L +Q+ 
Sbjct: 305 NYGKLFNRLQVDLGSS------------GADELPTDQRLDKFYNATTPDNRLTVLFYQYS 352

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYL ISS+R G    NLQG+W  ++   W+   H+++N++MN+W   P NLSE   PL D
Sbjct: 353 RYLSISSTRVGLLPPNLQGLWAHEVHTPWNGDYHLDVNVQMNHWGVEPANLSELNLPLAD 412

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            +  +  +G KTA+  Y A GWV H  T+ W  +        W +   G  WLC +LW+H
Sbjct: 413 LVKEMGPHGEKTAKAYYNARGWVAHVITNPWLFTEPGE-SASWGVTKAGSGWLCNNLWDH 471

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDG-KLAC 535
           Y ++ D ++L K+ YP+L+G A F  D LI+  + G+L T PS+SPE+ F  PDG K + 
Sbjct: 472 YTFSNDLNYL-KKIYPVLKGSALFYSDILIKDPETGWLVTAPSSSPENWFYMPDGSKQSS 530

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNE---DALVEKVLKSLPRLRPTKIAEDGSIME 592
           +   +T+D  IIRE+F+ +I+A+E L  +E     L EK LK +P     +I+ DG +ME
Sbjct: 531 ICMGATIDNQIIRELFNNVITASEQLHIDEPFRKELKEK-LKQIP--PAAQISADGRVME 587

Query: 593 WVQ 595
           W++
Sbjct: 588 WLK 590


>gi|375256587|ref|YP_005015754.1| putative lipoprotein [Tannerella forsythia ATCC 43037]
 gi|363407344|gb|AEW21030.1| putative lipoprotein [Tannerella forsythia ATCC 43037]
          Length = 850

 Score =  311 bits (798), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 201/626 (32%), Positives = 319/626 (50%), Gaps = 76/626 (12%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           F+ PA  + ++ P+GNGR+G M  GG+  E + LNE ++W+G      NP A K+L  +R
Sbjct: 32  FDEPATLWEESFPLGNGRIGLMPDGGIEKENIVLNEISMWSGSKQQTDNPAAQKSLGRIR 91

Query: 77  SLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEF-----DDSHLK 118
            L+ +G+  EA       F               P   YQLLG++ L+F     DD+ + 
Sbjct: 92  ELLFAGRNDEAQELMYDTFVCYGDGSGRGSGANKPYGSYQLLGNLMLDFTYDAADDAQVS 151

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
                YRRELDL  A   + +  G  E++RE F+S  D V V ++  +    L   + ++
Sbjct: 152 ----DYRRELDLEQALTTLSFRKGKTEYSREVFTSFADDVAVIRLKVNNGRKLQCQIGMN 207

Query: 179 SLLDNHSYVNGNNQIIMEGRC-----------------------PGKRIPPKANAN---- 211
              + ++    N+++ M GR                            IP          
Sbjct: 208 RP-ERYAVRAENSELEMRGRLYEGDAYKTKEQLEREEAMRNRTNNSDSIPAAEQKTMPGA 266

Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSK 271
           +D +G+++++ +++ + +  G + A  D  L VE +   +LL+  ++ + G  +   D++
Sbjct: 267 EDGQGVRYASRVQVVLPNG-GEVKAFNDTTLIVEEASEIILLVGMATDYFGKAV---DAQ 322

Query: 272 KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDT 331
            D      S L +  + SY  L   H+  YQ+L+HRV++   R+ +            + 
Sbjct: 323 ID------SLLTAAASKSYETLKEEHIRAYQELYHRVAVHFGRNAQK-----------EA 365

Query: 332 VPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAP 390
           +P  +R+++FQ D+ DPSL+ L +QFGRYLLISS+RPG    NLQG+W   +   W+   
Sbjct: 366 LPMNKRLEAFQNDKNDPSLLALYYQFGRYLLISSTRPGLLPPNLQGLWCNTIHTPWNGDY 425

Query: 391 HVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAK 450
           H+NINL+MN W +   NLSE   PL ++      +G +TA+  Y A GWV H   ++W +
Sbjct: 426 HLNINLQMNLWPAETGNLSELHLPLIEWTKQQVESGRQTAKAFYNARGWVTHILGNVW-E 484

Query: 451 SSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG- 509
            +A      W       AWLC HL+ HY +T+D  +L +  YP++   A F +D L+E  
Sbjct: 485 FTAPGEHPSWGATNTSAAWLCEHLYTHYLFTLDTAYL-RDVYPVMRESALFFVDMLVEDP 543

Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALV 569
              YL T P+TSPE+ ++ P+GK   V   STMD  I+RE+FS  I AA +L+ +E+ LV
Sbjct: 544 RSHYLVTAPTTSPENAYVMPNGKKVSVCAGSTMDNQILRELFSNTIQAARLLKTDEE-LV 602

Query: 570 EKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           + +     RL PT I  DG IMEW++
Sbjct: 603 QTLAAYQARLMPTTIGPDGRIMEWLE 628


>gi|423343039|ref|ZP_17320753.1| hypothetical protein HMPREF1077_02183 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409216715|gb|EKN09698.1| hypothetical protein HMPREF1077_02183 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 809

 Score =  311 bits (797), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 198/609 (32%), Positives = 312/609 (51%), Gaps = 50/609 (8%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N     T   L   F+ PA+ + + +P+GNGR G M  GGV +E + LNE ++W+G   D
Sbjct: 17  NMPEVKTGKSLSYHFDAPAEIWEETLPLGNGRFGLMPDGGVDTEKIVLNEISMWSGSKQD 76

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
             NP A  +L  +R L+  G+  EA       F               P   YQLLG++ 
Sbjct: 77  TDNPQAYYSLGTIRKLLFEGRNDEAQELMYNTFVCKGEGSALGQGANAPYGSYQLLGNLV 136

Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
           L +D      +   YRREL+L+ A A   +  G V++ RE F+S  D + V  ++     
Sbjct: 137 LNYDYQGSSDSISGYRRELNLDNAIATASFRRGKVKYDREVFTSFADDLGVIHLTADADK 196

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
           +L+F+  ++   +++      N ++M+G+ P            + KG+++++ + + +  
Sbjct: 197 ALNFSFGMNRP-EHYKVTADGNDLLMQGQLP------DGVDTLEMKGLRYASRVRVVLPK 249

Query: 230 DRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
               I    D  + +  +  A+LL+ +A+  FD          KD   +  S L +    
Sbjct: 250 GGNVIPG--DSTVTIRNASEAILLVSMATDYFD----------KDLDEKVASLLANAEKK 297

Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDP 347
            ++ L   H+  Y+ LF RV + L  S ++             +P  ER+ +F  D +DP
Sbjct: 298 DFASLKKGHIVAYRSLFGRVDLDLGHSSRE------------DLPIDERLAAFNADPDDP 345

Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
           SL  L FQFGRYLLISS+R G    NLQG+W   ++  W+   H+NINL+MN+W +   N
Sbjct: 346 SLGALYFQFGRYLLISSTRVGLLPPNLQGLWCNTVNTPWNGDYHLNINLQMNHWPAEVAN 405

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           LSE   PL ++      +G +TA+  Y A GWV H   ++W + +A      W       
Sbjct: 406 LSELHLPLVEWTKQQVASGEQTAKAYYNAGGWVTHILGNVW-EFTAPGEHPSWGATNTSA 464

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
           AWLC HL+ HY YT+D+++L K  YP+L+G + F +D L+E   + YL T P+TSPE+ +
Sbjct: 465 AWLCEHLYMHYLYTLDKEYL-KDVYPVLKGASRFFVDMLVEDPRNKYLVTAPTTSPENGY 523

Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
             P+GK A +   STMD  I+RE+F+  I AA +L   + A   +++    RL PT I +
Sbjct: 524 KLPNGKTAHICAGSTMDNQIVRELFTNTIEAANILGI-DSAFAGELVAKRARLMPTTIGK 582

Query: 587 DGSIMEWVQ 595
           DG IMEW++
Sbjct: 583 DGRIMEWLE 591


>gi|46118818|ref|XP_384910.1| hypothetical protein FG04734.1 [Gibberella zeae PH-1]
          Length = 768

 Score =  311 bits (797), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 202/602 (33%), Positives = 304/602 (50%), Gaps = 55/602 (9%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           +E  ST   L + +  PA  +++A+PIGNGRLGAMV+G   +E L+LNED++W G P D 
Sbjct: 5   SEKASTDKSLLLHYAAPASSWSEALPIGNGRLGAMVYGRTSTELLQLNEDSVWYGGPQDR 64

Query: 64  TNPDAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYA 120
           T  DA   L+ +R L+   ++ +A T A    F  PA +  Y+ LG   +EF   H +  
Sbjct: 65  TPRDACSNLATLRQLIRDEKHKDAETLAREAFFATPASMRHYEPLGQCTIEF--GHDEKN 122

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              Y+R LDL T+ +  KY    V + R+  +S P+ V+  +   S        ++  S 
Sbjct: 123 VSDYKRHLDLATSQSTTKYDYEGVSYRRDVIASFPNNVLAFRFQASAPTRFVVRLNRQSE 182

Query: 181 LDNHS--YVNG----NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
           ++  +  Y++     +N II++    GK      N+N      + +  L +      GT+
Sbjct: 183 VEGETNEYLDSIRAQDNHIILQATPGGK------NSN------RLALALGVSCKSINGTV 230

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
                   KV G+    L++ A         + +    +P + ++  + S     +  L 
Sbjct: 231 --------KVVGN---CLIVNAEECIIAIGAHTTYRSYNPDASALRDVNSALREPWETLV 279

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
           +RH  DY +LF + ++++               +   VP+ ER+   Q++ DP +V L  
Sbjct: 280 SRHRRDYGRLFGKTALRM-------------WPDASHVPTEERI---QSNRDPGVVALYH 323

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
            +GRYLLISSSR   +   A LQGIWN   +P W S   +NINL+MNYW + PCNL EC 
Sbjct: 324 NYGRYLLISSSRKSAKALPATLQGIWNPSFAPPWGSKFTININLQMNYWPAAPCNLIECA 383

Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
            PL D +  ++  G +TA++ Y   GW  HH TDIWA +      +   LWP+GG WLC 
Sbjct: 384 IPLIDHIERMAEKGKRTAKMMYNCRGWCAHHNTDIWADTDPQDRWMPATLWPLGGVWLCI 443

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDG 531
            + +   Y  D   L  R  PLLEGC  FLLD+LI    G YL T+PS SPE+ FI+  G
Sbjct: 444 DVVKMLIYQYDH-MLHIRIAPLLEGCIQFLLDFLIPSACGKYLVTSPSLSPENSFISESG 502

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
           +       S MDM I+R    + I +  +L K E  L + V+ +L +L P +I + G I 
Sbjct: 503 ETGTFCEGSVMDMTIVRIALESFIWSTSILNK-EHPLQKDVMATLGKLPPFRINKSGLIQ 561

Query: 592 EW 593
           EW
Sbjct: 562 EW 563


>gi|390958734|ref|YP_006422491.1| hypothetical protein Terro_2921 [Terriglobus roseus DSM 18391]
 gi|390413652|gb|AFL89156.1| hypothetical protein Terro_2921 [Terriglobus roseus DSM 18391]
          Length = 837

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 202/624 (32%), Positives = 300/624 (48%), Gaps = 74/624 (11%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPS-------------------------- 45
           P ++ +  PA  +T+A+PIGNGR+GAMV+GG  +                          
Sbjct: 37  PARLWYRAPAPVWTEALPIGNGRIGAMVFGGANTGPNNGDLEDAAKNADILSGDKTRGQD 96

Query: 46  ETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV------DSGQYAEATA-ASVKLFGHP 98
           E L+LNE T+W G   D  NP A +    VR+L+      D  + AEA   A   +  +P
Sbjct: 97  EHLQLNESTVWAGSRADRLNPRAAEGFRRVRALLLESKGTDGKKIAEAEKLAQETMIANP 156

Query: 99  ADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPD 156
             +  Y  +GD+ L    S    A   Y R+LDL T   R+ Y  G V FTRE F+S PD
Sbjct: 157 KAMPPYSTVGDLYLRSSSSE---AIADYHRQLDLKTGVVRITYRQGPVHFTREIFASAPD 213

Query: 157 QVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG 216
            VIV  ++     ++S   S+D   D     +G   +++      K              
Sbjct: 214 HVIVMHLTADRPNAISLTASMDRPGDFAIRASGQRDLVLTQSATTK------------NA 261

Query: 217 IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
             F A  + + +   G + A  D+ +  +  +  VL+  AS    GP +       DP +
Sbjct: 262 THFQA--QARFATHGGAVHADGDRIVVEKAQELTVLIAAASDFKGGPILG-----GDPAT 314

Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
                L S +  +++ L      D  +   R+S+ L   P D          +  +P+ E
Sbjct: 315 LCGDILASAQKKNFAALSAAATKDQFRYIDRMSLSLG--PVDAA--------LAAMPTDE 364

Query: 337 RVKSFQTDEDP-SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNIN 395
           R+K     +D   L  L FQ+ RYLL+ SSRPG   ANLQG+W   LS  W S   +N+N
Sbjct: 365 RLKRVAAGQDDFGLQALYFQYARYLLLGSSRPGGLAANLQGLWASGLSNPWGSKWTINVN 424

Query: 396 LEMNYWQSLPCNLSECQEPLFDFLTYL----SINGSKTAQVNYLASGWVIHHKTDIWAKS 451
            EMNYW +   NLSE  +PLFD +  +    S  G K A+  Y A G+VIHH TDIW  +
Sbjct: 425 TEMNYWLAEAANLSEMHQPLFDLVGMVRDPASGTGVKVAKEYYGAKGFVIHHNTDIWGDA 484

Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
               G   + +WP GGAWL  H W+HY +T ++ FL  +A+PLL   + F LD+L +   
Sbjct: 485 EPIDG-YQYGIWPDGGAWLTLHAWDHYAFTGNKQFLRSQAWPLLHDASLFFLDYLTDDGS 543

Query: 512 GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEK 571
           G+L T PS SPE+++   DG    ++   TMD+ I+RE+F   + A  +L ++  A +++
Sbjct: 544 GHLVTGPSLSPENKYKLADGTSHSLTMGPTMDIEIVRELFQRTMQAGTILGEDA-AFLQQ 602

Query: 572 VLKSLPRLRPTKIAEDGSIMEWVQ 595
           V ++  RL P  +   G + EW Q
Sbjct: 603 VRQASDRLPPFHVGSLGQLQEWQQ 626


>gi|302917285|ref|XP_003052415.1| hypothetical protein NECHADRAFT_105964 [Nectria haematococca mpVI
           77-13-4]
 gi|256733355|gb|EEU46702.1| hypothetical protein NECHADRAFT_105964 [Nectria haematococca mpVI
           77-13-4]
          Length = 765

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 191/587 (32%), Positives = 300/587 (51%), Gaps = 43/587 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + +  PA  +++A+P+GNGRLGAM++G   +E L+LNED++W G P D T  DA + L
Sbjct: 8   LALHYTSPASSWSEALPVGNGRLGAMIYGRTTTELLQLNEDSVWYGGPQDRTPRDAKRNL 67

Query: 73  SDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
           + +R L+ + ++ EA T      F  P  +  Y+ LG+  +EF+  H       +RR LD
Sbjct: 68  AKLRELIRAERHQEAETLVREAFFATPTSMRHYEPLGNCTIEFN--HGVEDVTDFRRRLD 125

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L+T+    +Y+   V + R+  +S PD V+  +   SE       ++  S ++  +    
Sbjct: 126 LSTSQNTTEYTCRGVSYRRDVIASFPDNVLAIRFEASEKTRFVVRLTRRSDVEWETNEFL 185

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           ++    +GR      P   N+N      Q + +L +    + G + A+ +    +  +  
Sbjct: 186 DSIRAEDGRIILHATPGGRNSN------QLALVLGVSCDANDGEVEAIGN--CLIVNTTR 237

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
            V+ + A +++            DP + ++  +       +S+L   H  DY  LF R+S
Sbjct: 238 CVIAIGAQTTY---------RVADPEASALHDVDEALKRPWSELAEHHRQDYTNLFGRMS 288

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           +++               N   +P+ ER+K+   + DP LV L   +GRYLLISSSR   
Sbjct: 289 LRMG-------------PNAGHIPTDERIKN---NRDPGLVALYHNYGRYLLISSSRNSH 332

Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           +   A LQGIWN   +P W S   +NINL+MNYW +  CNL EC  P+ D L  ++  G 
Sbjct: 333 KALPATLQGIWNPFFAPPWGSKYTININLQMNYWPAAQCNLLECALPVMDLLEKMAERGR 392

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           KTA+  Y   GW  HH TDIW  +      +  +LWP+GG W+C  ++    Y  D   L
Sbjct: 393 KTAETMYGCRGWCAHHNTDIWGDTDPQDTWMPASLWPLGGVWVCIDVFNMLKYEYD-SAL 451

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
             R  P+LEGC  FLLD+LI    G YL TNPS SPE+ F++  GK   +   S +DM I
Sbjct: 452 HSRVAPVLEGCIEFLLDFLIPSACGKYLVTNPSLSPENTFLSESGKPGILCEGSVIDMTI 511

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +R  F + + + ++L ++   L  +V ++L +L P  I  DG I EW
Sbjct: 512 VRIAFESFLLSVDILNQDH-PLRSQVQEALEKLPPLTINNDGLIQEW 557


>gi|293370624|ref|ZP_06617176.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
 gi|292634358|gb|EFF52895.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
          Length = 811

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 200/591 (33%), Positives = 309/591 (52%), Gaps = 57/591 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+++++A+PIGN RLGAMV+GG+  E L+LNE+T W G P +  NP+A   L
Sbjct: 22  LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIKREELQLNEETFWAGSPYNNNNPNAIHVL 81

Query: 73  SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+  G+  EA     +  L       Y  LG + LEF +         + R+L+L
Sbjct: 82  PIVRKLIFEGRNKEAQRLIDTNFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             AT   +Y V +V +TR  F+S  D VI+  I  S++ +L+F ++ +  L +   V  +
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQND 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
              +    C GK          + +G++ +   E +I     GT+    +     EG++ 
Sbjct: 199 KLTVT---CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L + A++++    +N  D   + +  +   L+    + Y      H+  Y+K F RV 
Sbjct: 245 ATLYISAATNY----VNYQDVSANESRRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L        T   S+     + + +R+++F   ED ++  LLF +GRYLLISSS+PG 
Sbjct: 301 LTLP-------TGKASQ-----LETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS+ G+KT
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGTKT 408

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
           A+  Y + GWV HH TD+W       G V +A   +WP GGAWL  H+W+HY +T D++F
Sbjct: 409 ARNMYNSRGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGDQEF 464

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L K  YP+L+G A F +D+L+E H  Y  L   PS SPEH           V+   TMD 
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPTYKWLVVAPSVSPEH---------GPVTAGCTMDN 513

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I  +     + A+ +  +   +  + + ++L +L P +I +   + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563


>gi|423290387|ref|ZP_17269236.1| hypothetical protein HMPREF1069_04279 [Bacteroides ovatus
           CL02T12C04]
 gi|392665774|gb|EIY59297.1| hypothetical protein HMPREF1069_04279 [Bacteroides ovatus
           CL02T12C04]
          Length = 811

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 199/591 (33%), Positives = 309/591 (52%), Gaps = 57/591 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+++++A+PIGN RLGAMV+GG+  E L+LNE+T W G P +  NP+A   L
Sbjct: 22  LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81

Query: 73  SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+  G+  EA     +  L       Y  LG + LEF +         + REL+L
Sbjct: 82  PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NGSGFYRELNL 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             AT   +Y V +V +TR  F+S  D VI+  I  S++ +L+F ++ +  L +   V  N
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
           +Q+ +   C GK          + +G++ +   E +I     GT+    +     EG++ 
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L + A++++    +N  D   D +  +   L+    + Y      H+  Y+K F RV 
Sbjct: 245 ATLYISAATNY----VNYQDVSADESHRTSEYLKRAMQIPYEKALKNHIAYYKKQFDRVR 300

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L                   + + +R+++F   ED ++  LLF +GRYLLISSS+PG 
Sbjct: 301 LTLPAG------------KASQLETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
           A+  Y   GWV HH TD+W       G V +A   +WP GGAWL  H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L K  YP+L+G A F +D+L+E H  Y  L  +PS SPEH           ++   TMD 
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I  +     + A+ +  +   +  + + ++L +L P +I +   + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563


>gi|299147445|ref|ZP_07040510.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
 gi|298514723|gb|EFI38607.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
          Length = 826

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 197/594 (33%), Positives = 313/594 (52%), Gaps = 47/594 (7%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           N  +I ++ PA ++ +A+P+GNGR+ AMV+G    E ++LNE+T+  G P    N +A  
Sbjct: 25  NIYRIWYDKPASYWEEALPVGNGRIAAMVFGNPQLEQIQLNEETVSAGSPYQNYNEEAKT 84

Query: 71  ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
           AL ++R L+  G+Y EA   A+ K+     +   YQ +G + + + D H K     Y R+
Sbjct: 85  ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYPD-HKKV--NNYYRD 141

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
           LD++ A A  +Y V  VEFT E F+S  DQ+++  I  S+ G+++  +  ++ + D    
Sbjct: 142 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 201

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           + G   + +EG   G R             + + A L++K     G +    D  L V+G
Sbjct: 202 IYGKKGLRLEGITHGSRYFSGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 251

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    L +  +++F    +N  D   DP   + + L++     YS     H+  YQK F+
Sbjct: 252 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 306

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV++ L  + +       + +++D      R+K F +  DP+L+ L FQ+GRYLLISSS+
Sbjct: 307 RVTLDLGETSQ-------ANKSMDV-----RIKEFSSSYDPALIALYFQYGRYLLISSSQ 354

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQG WN +  P W      NIN EMNYW +   NL+E  +P    +  LS NG
Sbjct: 355 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 414

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
            + A   Y   GWV+HH TD+W  + A DR       WP+  AWLC HLW+ Y ++ D+ 
Sbjct: 415 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 472

Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
           +LE+  YP+++  + F +D+L+ + + GYL   PS SPE+   +I     L       TM
Sbjct: 473 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 528

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
           D  ++ ++FS    AA+VL  N D      LK++ R L P ++ + G + EW +
Sbjct: 529 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFE 580


>gi|423215145|ref|ZP_17201673.1| hypothetical protein HMPREF1074_03205 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692408|gb|EIY85646.1| hypothetical protein HMPREF1074_03205 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 816

 Score =  310 bits (794), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 197/594 (33%), Positives = 313/594 (52%), Gaps = 47/594 (7%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           N  +I ++ PA ++ +A+P+GNGR+ AMV+G    E ++LNE+T+  G P    N +A  
Sbjct: 15  NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKT 74

Query: 71  ALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
           AL ++R L+  G+Y EA   A+ K+     +   YQ +G + + + D H K     Y R+
Sbjct: 75  ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 131

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
           LD++ A A  +Y V  VEFT E F+S  DQ+++  I  S+ G+++  +  ++ + D    
Sbjct: 132 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 191

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           + G   + +EG   G R             + + A L++K     G +    D  L V+G
Sbjct: 192 IYGKKGLRLEGITHGSRYFSGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 241

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    L +  +++F    +N  D   DP   + + L++     YS     H+  YQK F+
Sbjct: 242 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 296

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV++ L  + +       + +++D      R+K F +  DP+L+ L FQ+GRYLLISSS+
Sbjct: 297 RVTLDLGETSQ-------ANKSMDV-----RIKEFSSSYDPALIALYFQYGRYLLISSSQ 344

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQG WN +  P W      NIN EMNYW +   NL+E  +P    +  LS NG
Sbjct: 345 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 404

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
            + A   Y   GWV+HH TD+W  + A DR       WP+  AWLC HLW+ Y ++ D+ 
Sbjct: 405 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 462

Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
           +LE+  YP+++  + F +D+L+ + + GYL   PS SPE+   +I     L       TM
Sbjct: 463 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 518

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
           D  ++ ++FS    AA+VL  N D      LK++ R L P ++ + G + EW +
Sbjct: 519 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFE 570


>gi|317474862|ref|ZP_07934132.1| glycoside hydrolase [Bacteroides eggerthii 1_2_48FAA]
 gi|316909000|gb|EFV30684.1| glycoside hydrolase [Bacteroides eggerthii 1_2_48FAA]
          Length = 801

 Score =  310 bits (794), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 210/590 (35%), Positives = 312/590 (52%), Gaps = 48/590 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G    E ++LNE+TLW G P +  NP+A + + 
Sbjct: 12  KLWYDEPAQVWTEALPLGNGRLGAMVYGTPAMENIQLNEETLWAGRPNNNANPNALEYIP 71

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV  G+Y EA T A+ K+      G P   YQ  G + + F   H +Y +  Y RE
Sbjct: 72  KVRQLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGHLRIAFP-GHTRYTD--YYRE 125

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L L++A   V Y+V  V + RE  +S  DQV++ ++S S  G ++ N  L S   +    
Sbjct: 126 LSLDSARTVVCYTVDGVRYRRETITSLADQVVMVRLSASRPGMITCNAHLTSPHQDVMIA 185

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
           +  ++I + G          ++ ++  KG + F   + ++    +G  S+  D  L VE 
Sbjct: 186 SEGDEITLSG---------VSSWHEGLKGKVLFQGRMAVRT---QGGHSSCADGVLAVEK 233

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D A   L  +++F    +N  D   +    S + L +    SY      HL  Y+    
Sbjct: 234 ADEATFYLSIATNF----VNYKDITGNEVERSKNYLHAALKHSYRQSLLEHLAIYKSYMD 289

Query: 307 RVSIQLSRSP-KDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
           RV + L      D+ TD              RV++F+  +D  LV   F+FGRYLLI SS
Sbjct: 290 RVDLDLGHDRYADVTTDM-------------RVQNFRETQDDFLVATYFRFGRYLLICSS 336

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           +PG Q ANLQGIWN+ L P+WDS    NINLEMNYW +   NLSE  +PL   ++ +S  
Sbjct: 337 QPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPAEVTNLSELHQPLMQLISEVSET 396

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G +TA+  Y A GWV+HH TDIW  + A   K    LWP GGAWLC HLWE Y YT D  
Sbjct: 397 GRETAKTMYGAEGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDVG 455

Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           FL + AYP+++  A F    ++ E    +L   PS SPE+      GK +  +   TMD 
Sbjct: 456 FL-RTAYPIMKEAAKFFDQIMVKEPVHNWLVVCPSNSPENVHAGSKGK-STTAPGCTMDN 513

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            +I ++++ +I+ A +L  +E  L     + L  + P ++   G + EW+
Sbjct: 514 QLIFDLWNQVITTARLLNTDE-TLAVHYEQRLREMAPMQVGRWGQLQEWM 562


>gi|405378422|ref|ZP_11032344.1| hypothetical protein PMI11_02312 [Rhizobium sp. CF142]
 gi|397325094|gb|EJJ29437.1| hypothetical protein PMI11_02312 [Rhizobium sp. CF142]
          Length = 750

 Score =  310 bits (794), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 212/584 (36%), Positives = 300/584 (51%), Gaps = 42/584 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA+ +TDA+P+GNGRLGAMV+G   SE L++N+ T W G P    NPD+   L  +R
Sbjct: 10  YDAPARLWTDALPLGNGRLGAMVFGDPVSERLQINDSTFWAGGPYRPVNPDSYGHLEKIR 69

Query: 77  SLVDSGQYAEATAASV-KLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            L+ +G YAEA A +   L   P     YQ +GD+ ++F  S       +YRR LDL+TA
Sbjct: 70  ELIFAGHYAEAEAMAEEHLMARPIKQMSYQPIGDLHIDFLHSQ---TIGSYRRTLDLDTA 126

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y    + F RE F S  D V+V ++S    G++   +SLDS      +      +
Sbjct: 127 IATTSYVADGITFFREAFISTVDGVLVLRLSADRPGAIRCRISLDSPQQGQLFDQDAAGL 186

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
              G   GK     A A      ++F+  + +    + G   +     + V+ +D  V+L
Sbjct: 187 TFSGT--GKAEWGIAAA------LRFAFGIRVI---NTGGSLSSSSGIISVDSTDELVIL 235

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
           L A++SF        D   DP     + L      S   +   H+ ++Q+LF   +I L 
Sbjct: 236 LDAATSFR----RFDDVSGDPDGAITARLSKATGHSIEAMRRDHIIEHQRLFRAFAIDLG 291

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
                  T   S       P+  R+  F   EDP+L  L  QFGRYL+I+SSRPGTQ AN
Sbjct: 292 ------TTQAASH------PTDRRIAGFADGEDPALAALYVQFGRYLMIASSRPGTQPAN 339

Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
           LQGIWNE++ P W S    NINL+MNYW   P NL +C  PL +    L+  G +TAQV+
Sbjct: 340 LQGIWNEEVDPPWGSKYTANINLQMNYWLPAPANLPQCIVPLVEMAEELAEAGRETAQVH 399

Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
           Y A GWV+HH TD+W  +    G   W LWP GGAWL T L +  +Y  D D L +R +P
Sbjct: 400 YRARGWVMHHNTDLWRATGPIDG-AKWGLWPTGGAWLMTQLLDLSDYLDDADRLRRRLFP 458

Query: 494 LLEGCASFLLDWL--IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
           + +  A F+ D L  + G + YL T PS SPE+  + P G   C      MD  IIR+  
Sbjct: 459 VAKAAAEFVFDALASLPGTN-YLVTTPSLSPEN--VHPHGASICA--GPAMDNQIIRDFL 513

Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           + +   A  +   ED  V ++ + LPRL P +I   G + EW++
Sbjct: 514 NLLRPIATSI-GGEDEFVSEIDRVLPRLPPDRIGSAGQLQEWLE 556


>gi|408387708|gb|EKJ67420.1| hypothetical protein FPSE_12405 [Fusarium pseudograminearum CS3096]
          Length = 768

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 196/602 (32%), Positives = 305/602 (50%), Gaps = 55/602 (9%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           +E  +T   L + +  PA  +++A+PIGNGRLGAMV+G   +E L+LNED++W G P D 
Sbjct: 5   SEKANTDKSLLLHYAAPASSWSEALPIGNGRLGAMVYGRASTELLQLNEDSVWYGGPQDR 64

Query: 64  TNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYA 120
           T  DA   L+ +R L+   ++ +A A A    F  PA +  Y+ LG   +EF   H +  
Sbjct: 65  TPRDAYSNLATLRQLIRDEKHKDAEALAREAFFATPASMRHYEPLGQCTIEF--GHDERI 122

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              Y+R LDL T+ +  KY    V + R+  +S P+ V+  +   S        ++  S 
Sbjct: 123 VSDYKRHLDLATSQSTTKYDYEGVTYRRDVIASFPNNVLAIRFQASAPTRFVVRLNRQSE 182

Query: 181 LDNHS--YVNG----NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
           ++  +  Y++     +N II++    GK      N+N      + +  L +    + G +
Sbjct: 183 VEGETNEYLDSIRAQDNHIILQATPGGK------NSN------RLALALGVSCKSNNGNV 230

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
             + +    +  ++  ++ + A +++            +P + ++  + S     + +L 
Sbjct: 231 KVVGN--CLIVNTEECIIAIGAHTTY---------RSYNPDASALRDVNSALREPWENLV 279

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
           +RH  DY +LF + ++++               +   VP+ ER+   Q++ DP L+ L  
Sbjct: 280 SRHRQDYGRLFSKTALRM-------------WPDASHVPTDERI---QSNRDPGLIALYH 323

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
            + RYLLISSSR   +   A LQGIWN   +P W S   +NINL+MNYW +  CNL EC 
Sbjct: 324 NYSRYLLISSSRKSAKALPATLQGIWNPSFAPPWGSKFTININLQMNYWPAASCNLIECA 383

Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
            PL D +  ++  G +TA+V Y   GW  HH TDIWA +      +   LWP+GG WLC 
Sbjct: 384 VPLIDHIERMAQKGKRTAKVMYNCRGWCAHHNTDIWADTDPQDRWMPATLWPLGGVWLCI 443

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDG 531
            + +   Y  D   L  R  PLLEGC  FLLD+LI    G YL TNPS SPE+ FI+  G
Sbjct: 444 DVVKMLIYQYDH-MLHIRIAPLLEGCIQFLLDFLIPSACGKYLVTNPSLSPENSFISESG 502

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
           +       S MDM I+R    + I +  +L K E  L + V+ +L +L P +I + G I 
Sbjct: 503 ETGTFCEGSVMDMTIVRIALESFIWSTSILNK-EHPLQKDVMATLGKLPPFRINKSGLIQ 561

Query: 592 EW 593
           EW
Sbjct: 562 EW 563


>gi|298481665|ref|ZP_06999856.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
 gi|298272206|gb|EFI13776.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
          Length = 812

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 198/591 (33%), Positives = 311/591 (52%), Gaps = 56/591 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+++++A+PIGN RLGAM++GG+  E L+LNE+T W G P +  NP+A   L
Sbjct: 22  LKLWYSQPAQNWSEALPIGNSRLGAMIYGGIEREELQLNEETFWAGSPYNNNNPNAIHVL 81

Query: 73  SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+  G+  EA     +  L       Y  LG + LEF +         + R+L+L
Sbjct: 82  PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             AT   +Y V +V +TR  F+S  D VI+  I  S++ +L+F ++ +  L +   V  N
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
           +Q+ +   C GK          + +G++ +   E +I     GT+    +     EG++ 
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L + A++++    +N  D   D +  +   L+    + Y      H+  Y+K F RV 
Sbjct: 245 ATLYISAATNY----VNYQDVSADESHRTSEYLKKAMQIPYEKALKSHIAYYKKQFDRVR 300

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L  + K    +T            +R+++F   ED ++  LLF +GRYLLISSS+PG 
Sbjct: 301 LTLPAAGKASQLET-----------PKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 349

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS+ G++T
Sbjct: 350 QSANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 409

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
           A+  Y   GW+ HH TD+W       G V +A   +WP GGAWL  H+W+HY +T +++F
Sbjct: 410 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 465

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L K  YP+L+G A F +D+L+E H  Y  L  +PS SPEH           ++   TMD 
Sbjct: 466 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 514

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I  +     + A+ +  +   +  + + ++L +L P +I +   + EW++
Sbjct: 515 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 564


>gi|423575217|ref|ZP_17551336.1| hypothetical protein II9_02438 [Bacillus cereus MSX-D12]
 gi|401209825|gb|EJR16582.1| hypothetical protein II9_02438 [Bacillus cereus MSX-D12]
          Length = 940

 Score =  309 bits (792), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 205/603 (33%), Positives = 314/603 (52%), Gaps = 75/603 (12%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
           L + +N PAK +   A+PIGNG +G MV+GGV  E ++ NE TLWTG P       Y N 
Sbjct: 84  LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 143

Query: 67  D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
           D A   L  +R  +  G  + A   S +        FG     YQ  GDI L+F   D S
Sbjct: 144 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 199

Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
                   YRREL+LN   + V Y+   V++ RE+F+S PD+V+V +++ SES  LS +V
Sbjct: 200 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 255

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
              S        + +N+I ++G+           AN+   G+++ +  E K+ ++ GT++
Sbjct: 256 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 299

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
           A E+ K+KV  +D   +++ A++ ++  +  PS   +DP  +    + +I N SY  L  
Sbjct: 300 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 356

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
            H+ DY  LF+RVS+ L                  +VP+ E + S+  +    L EL FQ
Sbjct: 357 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 403

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           +GRYLLISSSRPGT  ANLQG+WN   +P W+S  H NINL+MNYW +   NLSE  EPL
Sbjct: 404 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 463

Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            D++  L   G  +A+ ++     GW ++   + +  ++   G + W   P   A++  +
Sbjct: 464 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 522

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           LWEHY +T D+ +L+++ YP+L+  A F   +L+E  +  L  +P  SPE         L
Sbjct: 523 LWEHYKFTNDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 573

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
             +S     D  ++ E+FS +I A+EVL+ +    D L  K  K  P   P +I   G +
Sbjct: 574 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 630

Query: 591 MEW 593
            EW
Sbjct: 631 QEW 633


>gi|345515268|ref|ZP_08794774.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
 gi|229434306|gb|EEO44383.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
          Length = 814

 Score =  309 bits (792), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 202/589 (34%), Positives = 320/589 (54%), Gaps = 46/589 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G    E ++LNE+T+W G P +  NP+A + + 
Sbjct: 25  KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPNALEYIP 84

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV  G+Y EA T A+ K+      G P   YQ  GD+ + F   H +Y++  Y RE
Sbjct: 85  KVRQLVFEGKYLEAQTLATEKIMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L L++A   V+Y V  V + RE  +S  DQV++ +++ S+ G ++ N +L +   +    
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVS 198

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
               ++ + G          ++ ++  KG ++F   +  +    +G   A  D  L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTAR---SQGGTQACRDGVLSIEG 246

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D AV+ +  +++F     N  D   +    + + L+   +  Y      H+D +++   
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYVTSRKAHVDFFKQYMD 302

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RVS+       D+  D  +    D      RV++F+  +D  LV   F+FGRYLLI SS+
Sbjct: 303 RVSL-------DLGIDKYAGVTTDM-----RVQNFKETKDDFLVATYFRFGRYLLICSSQ 350

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQGIWN+ L P+WDS    NIN+EMNYW +   NLSE  EPL   +  +S  G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            ++A++ Y A GWV+HH TDIW  + A   K    LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469

Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L + AYP+++    F  + ++ E    +L   PS SPE+     +GK A  +   T+D  
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +I ++++ II+ A +L  + +     + + L  + P +I   G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATHLEQRLKEMAPMQIGRWGQLQEWM 575


>gi|212694638|ref|ZP_03302766.1| hypothetical protein BACDOR_04169 [Bacteroides dorei DSM 17855]
 gi|237711097|ref|ZP_04541578.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
 gi|265750683|ref|ZP_06086746.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
 gi|423239195|ref|ZP_17220311.1| hypothetical protein HMPREF1065_00934 [Bacteroides dorei
           CL03T12C01]
 gi|212663139|gb|EEB23713.1| hypothetical protein BACDOR_04169 [Bacteroides dorei DSM 17855]
 gi|229454941|gb|EEO60662.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
 gi|263237579|gb|EEZ23029.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
 gi|392646982|gb|EIY40688.1| hypothetical protein HMPREF1065_00934 [Bacteroides dorei
           CL03T12C01]
          Length = 814

 Score =  309 bits (792), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 202/589 (34%), Positives = 319/589 (54%), Gaps = 46/589 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G    E ++LNE+T+W G P +  NP+A + + 
Sbjct: 25  KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPNALEYIP 84

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV  G+Y EA T A+ K+      G P   YQ  GD+ + F   H +Y++  Y RE
Sbjct: 85  KVRQLVFEGKYLEAQTLATEKIMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L L++A   V+Y V  V + RE  +S  DQV++ +++ S+ G ++ N +L +   +    
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVS 198

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
               ++ + G          ++ ++  KG ++F   +  +    +G   A  D  L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTAR---SQGGTQACRDGVLSIEG 246

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D AV+ +  +++F     N  D   +    + + L+   +  Y      H+D +++   
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYVTSRKAHVDFFKQYMD 302

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RVS+ L       VT            +  RV++F+  +D  LV   F+FGRYLLI SS+
Sbjct: 303 RVSLDLGIDKYAGVT------------TDMRVQNFKETKDDFLVATYFRFGRYLLICSSQ 350

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQGIWN+ L P+WDS    NIN+EMNYW +   NLSE  EPL   +  +S  G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            ++A++ Y A GWV+HH TDIW  + A   K    LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469

Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L + AYP+++    F  + ++ E    +L   PS SPE+     +GK A  +   T+D  
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +I ++++ II+ A +L  + +     + + L  + P +I   G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATHLEQRLKEMAPMQIGRWGQLQEWM 575


>gi|342884136|gb|EGU84463.1| hypothetical protein FOXB_05018 [Fusarium oxysporum Fo5176]
          Length = 767

 Score =  309 bits (792), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 198/601 (32%), Positives = 304/601 (50%), Gaps = 47/601 (7%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           M   ES+ T   + + +  PA  +++A+PIGNGRLGAMV+G   +E L+LNED++W G P
Sbjct: 1   MDEGESSDTDKGMLLHYAAPASSWSEALPIGNGRLGAMVYGRTSTELLQLNEDSVWYGGP 60

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHL 117
            D T  DA   L+ +R L+   ++ +A        F  P+ +  Y+ LG  ++EFD  H 
Sbjct: 61  QDRTPRDAHSHLATLRQLIRDEKHKDAEDLVKEAFFATPSSMRHYEPLGQCKIEFD--HD 118

Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
           +     Y R LDLNT+    +Y      + R+  +S PD V+  ++  SE     F V L
Sbjct: 119 ESEVTDYTRYLDLNTSQVTTRYKCDGRSYRRDIIASFPDSVLAVQVQASEKSR--FVVRL 176

Query: 178 DSLLDNHSYVNG--NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
           +   +N    N   ++    + R     IP  AN+N      + S +L +      GT+ 
Sbjct: 177 NRQSENEGETNEYLDSIFAQDSRIILNAIPGGANSN------RLSLVLGVSCGPGDGTVK 230

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
           A+ +    +  +   V+ + A ++F          K+DP   ++  +       +  L  
Sbjct: 231 AVGN--CLIVNATKCVIAIGAHTTF---------RKEDPERSALLNVDDALRRPWDVLVR 279

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
           RH  DY  LF R+S++L               + + +P+ +R+ S   + DP LV L   
Sbjct: 280 RHRSDYTNLFGRMSLRLF-------------PDANHLPTNKRIVS---NRDPGLVALYHN 323

Query: 356 FGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           +GRYLLISSSR   +   A LQGIWN   SP W S   +NINL+MNYW ++PC+L +C  
Sbjct: 324 YGRYLLISSSRNSDKALPATLQGIWNPSFSPPWGSKFTININLQMNYWPAIPCSLIQCAI 383

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PL + L  ++  G +TA++ Y   GW  HH TDIWA +      +   +WP+GGAWLCT 
Sbjct: 384 PLINLLERMAERGKRTAKMMYNCKGWCAHHNTDIWADTDPQDRWMPATIWPLGGAWLCTD 443

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGK 532
           +     Y  +   L  R  P+LEGC  FLLD+LI    G YL TNPS SPE+ F++  G+
Sbjct: 444 VVRMLIYQYE-PTLHCRIAPILEGCVQFLLDFLIPSACGRYLVTNPSLSPENSFVSQSGE 502

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
                  S +DM I+R    + + +  +L+ +     + +  +L +L P  + +DG I E
Sbjct: 503 TGIFCEGSVIDMTIVRIALESFLWSISILDPDHPRRNDAI-AALDKLPPMSLNKDGLIQE 561

Query: 593 W 593
           W
Sbjct: 562 W 562


>gi|423228044|ref|ZP_17214450.1| hypothetical protein HMPREF1063_00270 [Bacteroides dorei
           CL02T00C15]
 gi|423243307|ref|ZP_17224383.1| hypothetical protein HMPREF1064_00589 [Bacteroides dorei
           CL02T12C06]
 gi|392637080|gb|EIY30955.1| hypothetical protein HMPREF1063_00270 [Bacteroides dorei
           CL02T00C15]
 gi|392645314|gb|EIY39042.1| hypothetical protein HMPREF1064_00589 [Bacteroides dorei
           CL02T12C06]
          Length = 814

 Score =  309 bits (792), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 202/589 (34%), Positives = 319/589 (54%), Gaps = 46/589 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G    E ++LNE+T+W G P +  NP+A + + 
Sbjct: 25  KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPNALEYIP 84

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV  G+Y EA T A+ K+      G P   YQ  GD+ + F   H +Y++  Y RE
Sbjct: 85  KVRQLVFEGKYLEAQTLATEKIMTKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L L++A   V+Y V  V + RE  +S  DQV++ +++ S+ G ++ N +L +   +    
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVS 198

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
               ++ + G          ++ ++  KG ++F   +  +    +G   A  D  L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTAR---SQGGTQACRDGVLSIEG 246

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D AV+ +  +++F     N  D   +    + + L+   +  Y      H+D +++   
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYVTSRKAHVDFFKQYMD 302

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RVS+ L       VT            +  RV++F+  +D  LV   F+FGRYLLI SS+
Sbjct: 303 RVSLDLGIDKYAGVT------------TDMRVQNFKETKDDFLVATYFRFGRYLLICSSQ 350

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG Q ANLQGIWN+ L P+WDS    NIN+EMNYW +   NLSE  EPL   +  +S  G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            ++A++ Y A GWV+HH TDIW  + A   K    LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469

Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L + AYP+++    F  + ++ E    +L   PS SPE+     +GK A  +   T+D  
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +I ++++ II+ A +L  + +     + + L  + P +I   G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATHLEQRLKEMAPMQIGRWGQLQEWM 575


>gi|423482848|ref|ZP_17459538.1| hypothetical protein IEQ_02626 [Bacillus cereus BAG6X1-2]
 gi|401143214|gb|EJQ50752.1| hypothetical protein IEQ_02626 [Bacillus cereus BAG6X1-2]
          Length = 1156

 Score =  309 bits (791), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 204/600 (34%), Positives = 318/600 (53%), Gaps = 69/600 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG---DYT--NP 66
           L + +N PAK +   A+PIGNG +G MV+GGV  E ++ NE TLWTG P    DYT  N 
Sbjct: 47  LTLWYNQPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPNSTSDYTYGNR 106

Query: 67  D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK 118
           D A   L  +R  V  G  + A   S +        FG     YQ  GDI L+F+    +
Sbjct: 107 DGAASHLDSIREKVSKGDKSGAEEESSQFLTGLQNGFGS----YQNFGDIYLDFNMPD-Q 161

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
            +   YRREL+LN   A V Y+  +V++ RE+F+S PD+V+V +++ SES  LS +V   
Sbjct: 162 ASFSNYRRELNLNEGIATVSYNYKDVQYNREYFTSYPDRVMVMRLTASESKQLSLDVRPT 221

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
           S        + +N+I ++G+           AN+   G+++ +  E K+ ++ GT++A E
Sbjct: 222 SA-QGGEITSIDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLTA-E 264

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           + K+KV  +D   +++ A++ ++  +  P+   +DP  +    + +I N SY  L   H+
Sbjct: 265 NGKIKVANADSLTIIMTAATDYENKY--PTYKGEDPHQKVEKIMAAISNKSYEVLKYTHI 322

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
            DY  LF+RVS+ L                  +VP+ E + S+       L EL FQ+GR
Sbjct: 323 KDYHSLFNRVSLDLGGEKP-------------SVPTNELLASYNKQNSKYLEELFFQYGR 369

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSSRPGT  ANLQG+WN   +P W+S  H NINL+MNYW +   NLSE  EPL D+
Sbjct: 370 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPLMDY 429

Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
           +  L   G  +A+ ++     GW ++   + +  ++   G + W   P   A++  +LWE
Sbjct: 430 VDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNLWE 488

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           HY +T D+ +L+++ YP+L+  A F   +L+E  +  L  +P  SPE         +  +
Sbjct: 489 HYKFTDDKQYLQEKIYPILKEAAEFHSKFLVEDQNKKLVVSPCWSPE---------IGGI 539

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNE---DALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           S     D  ++ E+FS +I A+EVL+ ++   D L  K  +  P   P +I   G + EW
Sbjct: 540 SNGCAFDQQLVYELFSNVIEASEVLQTDKVFRDELKAKRDRLFP---PIQIGRYGQVQEW 596


>gi|218129080|ref|ZP_03457884.1| hypothetical protein BACEGG_00654 [Bacteroides eggerthii DSM 20697]
 gi|217988715|gb|EEC55034.1| hypothetical protein BACEGG_00654 [Bacteroides eggerthii DSM 20697]
          Length = 828

 Score =  309 bits (791), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 210/590 (35%), Positives = 312/590 (52%), Gaps = 48/590 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ ++ PA+ +T+A+P+GNGRLGAMV+G    E ++LNE+TLW G P +  NP+A + + 
Sbjct: 39  KLWYDEPAQVWTEALPLGNGRLGAMVYGTPAMENIQLNEETLWAGRPNNNANPNALEYIP 98

Query: 74  DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            VR LV  G+Y EA T A+ K+      G P   YQ  G + + F   H +Y +  Y RE
Sbjct: 99  KVRQLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGHLRIAFP-GHTRYTD--YYRE 152

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           L L++A   V Y+V  V + RE  +S  DQV++ ++S S  G ++ N  L S   +    
Sbjct: 153 LSLDSARTVVCYTVDGVRYRRETITSLADQVVMVRLSASRPGMITCNAHLTSPHQDVMIA 212

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
           +  ++I + G          ++ ++  KG + F   + ++    +G  S+  D  L VE 
Sbjct: 213 SEGDEITLSG---------VSSWHEGLKGKVLFQGRMAVRT---QGGHSSCADGVLAVEK 260

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D A   L  +++F    +N  D   +    S + L +    SY      HL  Y+    
Sbjct: 261 ADEATFYLSIATNF----VNYKDITGNEVERSKNYLHAALKHSYRQSLLEHLAIYKSYMD 316

Query: 307 RVSIQLSRSP-KDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
           RV + L      D+ TD              RV++F+  +D  LV   F+FGRYLLI SS
Sbjct: 317 RVDLDLGPDRYADVTTDM-------------RVQNFRETQDDFLVATYFRFGRYLLICSS 363

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           +PG Q ANLQGIWN+ L P+WDS    NINLEMNYW +   NLSE  +PL   ++ +S  
Sbjct: 364 QPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPAEVTNLSELHQPLMQLISEVSET 423

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G +TA+  Y A GWV+HH TDIW  + A   K    LWP GGAWLC HLWE Y YT D  
Sbjct: 424 GRETAKTMYGAEGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDVG 482

Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           FL + AYP+++  A F    ++ E    +L   PS SPE+      GK +  +   TMD 
Sbjct: 483 FL-RTAYPIMKEAAKFFDQIMVKEPVHNWLVVCPSNSPENVHAGSKGK-STTAPGCTMDN 540

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            +I ++++ +I+ A +L  +E  L     + L  + P ++   G + EW+
Sbjct: 541 QLIFDLWNQVITTARLLNTDE-TLAVHYEQRLREMAPMQVGRWGQLQEWM 589


>gi|225157647|ref|ZP_03725037.1| alpha/beta hydrolase fold-3 domain protein [Diplosphaera
           colitermitum TAV2]
 gi|224802714|gb|EEG20967.1| alpha/beta hydrolase fold-3 domain protein [Diplosphaera
           colitermitum TAV2]
          Length = 852

 Score =  309 bits (791), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 197/618 (31%), Positives = 309/618 (50%), Gaps = 80/618 (12%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A+P+GNGRLGAM++G + SE L+LNED+LW G P D  NPD  + L  +R L+  G+ A 
Sbjct: 25  ALPVGNGRLGAMIFGDIVSERLQLNEDSLWNGGPRDRRNPDTREHLPVLRQLLADGRLAA 84

Query: 87  ATAASVKLFGHPAD---VYQLLGDIELEF-----------DDSHLKYAEET--------- 123
           A      +     D    Y+ L D+ L F           D+  L     T         
Sbjct: 85  AHELVHDVMAGIPDSQRCYEPLADLFLNFEHPGAPVSVSADEMALAAGYTTPRFDPSLLS 144

Query: 124 -YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS--- 179
            YRR LDL TA A V Y++ ++ ++R   +S  DQVI  ++     GSL+  V ++    
Sbjct: 145 HYRRALDLRTAVASVDYTLNSIAYSRRMVASAVDQVIAIQLRAGRPGSLTLRVRMERGPR 204

Query: 180 ------LLDNHSYVN----GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
                   D   +V+     +  +++ GR  G+            +G++F+  L  +IS 
Sbjct: 205 NSYSTRYADTVGFVSDACSSSPTLLLRGRAGGE------------EGVRFATGLRAQISG 252

Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
             G +  +  + L ++G+D   L+L A++SF          + DP +  +   ++     
Sbjct: 253 --GALRHI-GETLYIDGADSVTLVLAAATSF---------READPAASVIERTRAALARG 300

Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRS-PKDIVTDTCSEENIDTVPSAERVK-SFQTDEDP 347
           +  +   H  +Y+  F R S+ L      +  T T       T+P+ ER++ + +T  DP
Sbjct: 301 WEKILADHEREYRSFFDRASLTLGAGFASEAPTATA------TLPTDERLRHAHETSGDP 354

Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
           +L  L F + RYLLISSSRPG+  +NLQG+WN D  P+W S   +NIN EMNYW + P N
Sbjct: 355 ALASLYFNYARYLLISSSRPGSLPSNLQGLWNGDFWPSWGSKYTININTEMNYWIAEPAN 414

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           L++C +PLFD L  +  +G +TA+V Y   G+V+HH TDIWA +         + W +GG
Sbjct: 415 LADCHKPLFDHLERMVESGRETARVMYGCRGFVVHHNTDIWADTCPTDRNAGASYWLLGG 474

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
           AW   H W+ +++  D   L   AY  L+  A F LD+L+E   G L  +PS SPE+ + 
Sbjct: 475 AWFVLHAWDRFDFDRDPASLAA-AYERLKEAALFFLDFLVEDARGRLVISPSCSPENTYR 533

Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK----------NEDALVEKVLKSLP 577
            P+G+   +   STMD  ++  +F   + AA +LE+          +E   + +V  +  
Sbjct: 534 LPNGEAGVLCVGSTMDSQMLAILFRRTLQAARLLEQRNATAGGGGGDEREFLAQVAAAAE 593

Query: 578 RLRPTKIAEDGSIMEWVQ 595
           RL    I   G ++EW++
Sbjct: 594 RLPKMTIGRHGQLLEWLE 611


>gi|217960596|ref|YP_002339160.1| alpha-fucosidase [Bacillus cereus AH187]
 gi|375285103|ref|YP_005105542.1| hypothetical protein BCN_3009 [Bacillus cereus NC7401]
 gi|423352889|ref|ZP_17330516.1| hypothetical protein IAU_00965 [Bacillus cereus IS075]
 gi|423567917|ref|ZP_17544164.1| hypothetical protein II7_01140 [Bacillus cereus MSX-A12]
 gi|217068135|gb|ACJ82385.1| alpha-fucosidase [Bacillus cereus AH187]
 gi|358353630|dbj|BAL18802.1| conserved hypothetical protein [Bacillus cereus NC7401]
 gi|401090895|gb|EJP99046.1| hypothetical protein IAU_00965 [Bacillus cereus IS075]
 gi|401211256|gb|EJR18004.1| hypothetical protein II7_01140 [Bacillus cereus MSX-A12]
          Length = 1193

 Score =  309 bits (791), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 205/603 (33%), Positives = 314/603 (52%), Gaps = 75/603 (12%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
           L + +N PAK +   A+PIGNG +G MV+GGV  E ++ NE TLWTG P       Y N 
Sbjct: 84  LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 143

Query: 67  D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
           D A   L  +R  +  G  + A   S +        FG     YQ  GDI L+F   D S
Sbjct: 144 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 199

Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
                   YRREL+LN   + V Y+   V++ RE+F+S PD+V+V +++ SES  LS +V
Sbjct: 200 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 255

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
              S        + +N+I ++G+           AN+   G+++ +  E K+ ++ GT++
Sbjct: 256 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 299

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
           A E+ K+KV  +D   +++ A++ ++  +  PS   +DP  +    + +I N SY  L  
Sbjct: 300 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 356

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
            H+ DY  LF+RVS+ L                  +VP+ E + S+  +    L EL FQ
Sbjct: 357 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 403

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           +GRYLLISSSRPGT  ANLQG+WN   +P W+S  H NINL+MNYW +   NLSE  EPL
Sbjct: 404 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 463

Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            D++  L   G  +A+ ++     GW ++   + +  ++   G + W   P   A++  +
Sbjct: 464 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 522

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           LWEHY +T D+ +L+++ YP+L+  A F   +L+E  +  L  +P  SPE         L
Sbjct: 523 LWEHYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 573

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
             +S     D  ++ E+FS +I A+EVL+ +    D L  K  K  P   P +I   G +
Sbjct: 574 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 630

Query: 591 MEW 593
            EW
Sbjct: 631 QEW 633


>gi|160885438|ref|ZP_02066441.1| hypothetical protein BACOVA_03438 [Bacteroides ovatus ATCC 8483]
 gi|423294310|ref|ZP_17272437.1| hypothetical protein HMPREF1070_01102 [Bacteroides ovatus
           CL03T12C18]
 gi|156109060|gb|EDO10805.1| hypothetical protein BACOVA_03438 [Bacteroides ovatus ATCC 8483]
 gi|392675501|gb|EIY68942.1| hypothetical protein HMPREF1070_01102 [Bacteroides ovatus
           CL03T12C18]
          Length = 811

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 198/591 (33%), Positives = 309/591 (52%), Gaps = 57/591 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+++++A+PIGN RLGAMV+GG+  E L+LNE+T W G P +  NP+A   L
Sbjct: 22  LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81

Query: 73  SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+  G+  EA     +  L       Y  LG + LEF +         + R+L+L
Sbjct: 82  PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NGSGFYRDLNL 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             AT   +Y V +V +TR  F+S  D VI+  I  S++ +L+F ++ +  L +   V  N
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
           +Q+ +   C GK          + +G++ +   E +I     GT+    +     EG++ 
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L + A++++    +N  D   D +  +   L+    + Y      H+  Y+K F RV 
Sbjct: 245 ATLYISAATNY----VNYQDVSADESHRTSEYLKRAMQIPYEKALKNHIAYYKKQFDRVR 300

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L                   + + +R+++F   ED ++  LLF +GRYLLISSS+PG 
Sbjct: 301 LTLPAG------------KASQLETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
           A+  Y   GWV HH TD+W       G V +A   +WP GGAWL  H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L K  YP+L+G A F +D+L+E H  Y  L  +PS SPEH           ++   TMD 
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I  +     + A+ +  +   +  + + ++L +L P +I +   + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563


>gi|423373036|ref|ZP_17350376.1| hypothetical protein IC5_02092 [Bacillus cereus AND1407]
 gi|401097368|gb|EJQ05391.1| hypothetical protein IC5_02092 [Bacillus cereus AND1407]
          Length = 1193

 Score =  308 bits (790), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 205/603 (33%), Positives = 314/603 (52%), Gaps = 75/603 (12%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
           L + +N PAK +   A+PIGNG +G MV+GGV  E ++ NE TLWTG P       Y N 
Sbjct: 84  LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 143

Query: 67  D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
           D A   L  +R  +  G  + A   S +        FG     YQ  GDI L+F   D S
Sbjct: 144 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 199

Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
                   YRREL+LN   + V Y+   V++ RE+F+S PD+V+V +++ SES  LS +V
Sbjct: 200 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 255

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
              S        + +N+I ++G+           AN+   G+++ +  E K+ ++ GT++
Sbjct: 256 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 299

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
           A E+ K+KV  +D   +++ A++ ++  +  PS   +DP  +    + +I N SY  L  
Sbjct: 300 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 356

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
            H+ DY  LF+RVS+ L                  +VP+ E + S+  +    L EL FQ
Sbjct: 357 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 403

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           +GRYLLISSSRPGT  ANLQG+WN   +P W+S  H NINL+MNYW +   NLSE  EPL
Sbjct: 404 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 463

Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            D++  L   G  +A+ ++     GW ++   + +  ++   G + W   P   A++  +
Sbjct: 464 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 522

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           LWEHY +T D+ +L+++ YP+L+  A F   +L+E  +  L  +P  SPE         L
Sbjct: 523 LWEHYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 573

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
             +S     D  ++ E+FS +I A+EVL+ +    D L  K  K  P   P +I   G +
Sbjct: 574 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 630

Query: 591 MEW 593
            EW
Sbjct: 631 QEW 633


>gi|229139796|ref|ZP_04268363.1| hypothetical protein bcere0013_29050 [Bacillus cereus BDRD-ST26]
 gi|228643676|gb|EEK99940.1| hypothetical protein bcere0013_29050 [Bacillus cereus BDRD-ST26]
          Length = 1172

 Score =  308 bits (790), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 205/603 (33%), Positives = 314/603 (52%), Gaps = 75/603 (12%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
           L + +N PAK +   A+PIGNG +G MV+GGV  E ++ NE TLWTG P       Y N 
Sbjct: 63  LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 122

Query: 67  D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
           D A   L  +R  +  G  + A   S +        FG     YQ  GDI L+F   D S
Sbjct: 123 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 178

Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
                   YRREL+LN   + V Y+   V++ RE+F+S PD+V+V +++ SES  LS +V
Sbjct: 179 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 234

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
              S        + +N+I ++G+           AN+   G+++ +  E K+ ++ GT++
Sbjct: 235 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 278

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
           A E+ K+KV  +D   +++ A++ ++  +  PS   +DP  +    + +I N SY  L  
Sbjct: 279 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 335

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
            H+ DY  LF+RVS+ L                  +VP+ E + S+  +    L EL FQ
Sbjct: 336 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 382

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           +GRYLLISSSRPGT  ANLQG+WN   +P W+S  H NINL+MNYW +   NLSE  EPL
Sbjct: 383 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 442

Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            D++  L   G  +A+ ++     GW ++   + +  ++   G + W   P   A++  +
Sbjct: 443 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 501

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           LWEHY +T D+ +L+++ YP+L+  A F   +L+E  +  L  +P  SPE         L
Sbjct: 502 LWEHYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 552

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
             +S     D  ++ E+FS +I A+EVL+ +    D L  K  K  P   P +I   G +
Sbjct: 553 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 609

Query: 591 MEW 593
            EW
Sbjct: 610 QEW 612


>gi|300726087|ref|ZP_07059544.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
 gi|299776557|gb|EFI73110.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
          Length = 824

 Score =  308 bits (790), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 199/594 (33%), Positives = 309/594 (52%), Gaps = 48/594 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +N PA  + +A+PIGNGR+  M++GGV SE ++LNE+T+W G P           L
Sbjct: 22  LKLWYNHPASIWQEALPIGNGRIAGMIYGGVQSEEIQLNEETVWGGGPHSNVRAIPVDTL 81

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+  GQ   A A   + F  G     Y+ +G ++++F+  +       YRRELDL
Sbjct: 82  RQVRQLIFDGQEKAAHAMINRNFMTGQHGMPYESVGSLKIDFN--YRAGDTRNYRRELDL 139

Query: 131 NTATARVKYSVGNVEFTREHFS--SNPDQ---VIVTKISGSESGSLSFNVSLDSLLDNHS 185
           N A +   + VG V + RE F+  S+P+    V+V +++ S+ GS+SF +   S L +  
Sbjct: 140 NRAVSTTTFQVGKVTYKREVFTTFSSPEHHANVMVIRLTASKRGSISFKLHYTSPLRHAI 199

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ--FSAILEIKISDDRGTISALEDKKLK 243
            +N    + M G               D +GI+    A    ++ +  G I     + ++
Sbjct: 200 TLNQQGDLCMLGYGA------------DHEGIKGVIQASTVTRVLNIGGKIKR-NGESIE 246

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V  ++   + L   ++F     + ++   D  +++   LQ+    +Y  L  +H   YQ 
Sbjct: 247 VTNANQVEIRLAMGTNFK----SYNEVSLDAKAQTFGELQTASPYTYEALLQQHEQVYQN 302

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
            F RVS+ L  +            N  ++P+ ER++ FQ   DP+L  L+FQ+GRYLLIS
Sbjct: 303 QFGRVSLDLGEN-----------TNETSLPTDERLRRFQQSNDPALATLVFQYGRYLLIS 351

Query: 364 SSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           SS+  ++  ANLQGIWN+D++  WD    +NIN EMNYW +   NLS+ + PL+  +  L
Sbjct: 352 SSQIDSRTPANLQGIWNKDMNAPWDGKYTININTEMNYWPAQTTNLSDNEWPLYRLVQNL 411

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           S  G + A   Y A G++ HH TDIWA +    G   W +WP G  WL THLW+ Y +T 
Sbjct: 412 SKTGVEAASKMYGAKGYMAHHNTDIWATTGMVDG-ATWGIWPNGAGWLSTHLWQRYLFTG 470

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSST 541
           D+ FL +  YP L+G A F L  ++     GY+ T PS SPEH    P GK   V+   T
Sbjct: 471 DQQFL-RTFYPQLKGAADFYLTAMVRHPKYGYMVTVPSISPEH---GPHGK-PSVTAGCT 525

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           MD  I  +V    + A EVL ++E A  + + + + +L P ++     + EW++
Sbjct: 526 MDNQIAFDVLQDALQATEVLGESE-AYADSLRQHIRQLAPMQVGRYCQLQEWLE 578


>gi|222096655|ref|YP_002530712.1| alpha-fucosidase [Bacillus cereus Q1]
 gi|221240713|gb|ACM13423.1| alpha-fucosidase [Bacillus cereus Q1]
          Length = 1172

 Score =  308 bits (790), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 205/603 (33%), Positives = 314/603 (52%), Gaps = 75/603 (12%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
           L + +N PAK +   A+PIGNG +G MV+GGV  E ++ NE TLWTG P       Y N 
Sbjct: 63  LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 122

Query: 67  D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
           D A   L  +R  +  G  + A   S +        FG     YQ  GDI L+F   D S
Sbjct: 123 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 178

Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
                   YRREL+LN   + V Y+   V++ RE+F+S PD+V+V +++ SES  LS +V
Sbjct: 179 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 234

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
              S        + +N+I ++G+           AN+   G+++ +  E K+ ++ GT++
Sbjct: 235 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 278

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
           A E+ K+KV  +D   +++ A++ ++  +  PS   +DP  +    + +I N SY  L  
Sbjct: 279 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 335

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
            H+ DY  LF+RVS+ L                  +VP+ E + S+  +    L EL FQ
Sbjct: 336 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 382

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           +GRYLLISSSRPGT  ANLQG+WN   +P W+S  H NINL+MNYW +   NLSE  EPL
Sbjct: 383 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 442

Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            D++  L   G  +A+ ++     GW ++   + +  ++   G + W   P   A++  +
Sbjct: 443 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 501

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           LWEHY +T D+ +L+++ YP+L+  A F   +L+E  +  L  +P  SPE         L
Sbjct: 502 LWEHYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 552

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
             +S     D  ++ E+FS +I A+EVL+ +    D L  K  K  P   P +I   G +
Sbjct: 553 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 609

Query: 591 MEW 593
            EW
Sbjct: 610 QEW 612


>gi|325678667|ref|ZP_08158277.1| hypothetical protein CUS_6446 [Ruminococcus albus 8]
 gi|324109717|gb|EGC03923.1| hypothetical protein CUS_6446 [Ruminococcus albus 8]
          Length = 761

 Score =  308 bits (789), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 203/589 (34%), Positives = 302/589 (51%), Gaps = 49/589 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           KI F  PA+ +  A+P+GNGR+G M +G    E ++LNED++++G      NP A + L 
Sbjct: 10  KIWFKAPAEDWNVALPVGNGRIGGMCFGQPLYEKIQLNEDSIFSGGQRKRNNPSARENLE 69

Query: 74  DVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
            VR L+   + AEA    ++ F G P +   Y  LGD+ ++    HL+   E   R LDL
Sbjct: 70  KVRQLLKEEKIAEAEKIVLEAFCGTPVNQRHYMPLGDLVIQ---HHLESECEYKCRSLDL 126

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS---LLDNHSYV 187
             A    +YS+  V + R    S P QV+   I+  +S S+S  ++LD      D++S +
Sbjct: 127 ENAVCTAEYSIKGVNYVRRVICSEPAQVMAINITADKSASISLKLTLDGRDDYFDDNSPM 186

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
           N +  I+  G C G+             GI F+A L  ++    G++       +  E  
Sbjct: 187 N-DTDILYYGGCGGE------------DGINFAAYL--RVIGVGGSVHRW-GSSIVTEDC 230

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D   +L+   +S+       SD KK    + ++A +      + +L   H++DY+  F R
Sbjct: 231 DSVTILIGVQTSY-----RVSDYKKSAELDVITAAEK----DFEELLKEHIEDYRSYFDR 281

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSR 366
                     +IV D   E   D++P+ ER+K  +    D  LV L F FGRYL+IS SR
Sbjct: 282 T---------EIVFD---EGGNDSLPTDERLKLVKEGGVDNGLVSLYFDFGRYLMISGSR 329

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
            GT   NLQGIWN+D+ P W     VNIN EMNYW +   ++ +   PLFD +  +  NG
Sbjct: 330 EGTLPLNLQGIWNKDMWPAWGCRFTVNINTEMNYWLAEVADMGDLHMPLFDHIERMRPNG 389

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
             TA+  Y   G+V HH TDIW  ++     +    W  G AWLCTH+WEH+ Y+ DR+F
Sbjct: 390 RATAREMYGCGGFVCHHNTDIWGDTAPQDLWMPGTQWVTGAAWLCTHIWEHWLYSRDREF 449

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           L ++ Y  L+  + F +D+LI+   G L T PS SPE+ +I   G    V    +MD  I
Sbjct: 450 LAEK-YDTLKEASLFFVDFLIDNGKGQLVTCPSVSPENTYITASGAKGSVCMGPSMDSQI 508

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           I E+F+A+I A EVL  + D   EK+     +L   +I + G IMEW +
Sbjct: 509 IYELFTAVIEAGEVLGIDAD-YREKLKGMREKLPKPQIGKYGQIMEWAE 556


>gi|423215045|ref|ZP_17201573.1| hypothetical protein HMPREF1074_03105 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692308|gb|EIY85546.1| hypothetical protein HMPREF1074_03105 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 811

 Score =  308 bits (789), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 199/591 (33%), Positives = 312/591 (52%), Gaps = 57/591 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+++++A+PIGN RLGAMV+GG+  E L+LNE+T W G P +  NP+A   L
Sbjct: 22  LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81

Query: 73  SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+  G+  EA     +  L       Y  LG + LEF +         + R+L+L
Sbjct: 82  PIVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             AT   +Y V +V +TR  F+S  D VI+  I  S++ +L+F ++ +  L +   V  N
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
           +Q+ +   C GK          + +G++ +   E +I     GT+    +     EG++ 
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L + A++++    +N  D   D +  +   L+    + Y      H+  Y+K F RV 
Sbjct: 245 ATLYISAATNY----VNYQDVSADESRRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L        T   S+     + + +R+++F   ED ++  LLF +GRYLLISSS+PG 
Sbjct: 301 LTLP-------TGKTSQ-----LETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
           A+  Y   GW+ HH TD+W       G V +A   +WP GGAWL  H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L K  YP+L+G A F +D+L+E H  Y  L  +PS SPEH           ++   TMD 
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I  +     + A+ +  +   +  + + ++L +L P +I +   + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563


>gi|319936285|ref|ZP_08010703.1| hypothetical protein HMPREF9488_01536 [Coprobacillus sp. 29_1]
 gi|319808661|gb|EFW05205.1| hypothetical protein HMPREF9488_01536 [Coprobacillus sp. 29_1]
          Length = 749

 Score =  308 bits (789), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 192/587 (32%), Positives = 304/587 (51%), Gaps = 43/587 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ FN PA  + +A+P+GNG LGAMV+G    E + +NED+L++G P +  NP+    L 
Sbjct: 6   KLIFNKPALQWEEAMPLGNGYLGAMVFGQTQKELICMNEDSLYSGGPIERGNPNTLDHLD 65

Query: 74  DVRSLVDSGQYAEATAASVKLF----GHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
           ++R+L+  G+  EA   +   F     HP   YQ LG + +EF   ++    + Y++ LD
Sbjct: 66  EMRTLLLDGKVEEAQKKAPNYFYATTPHPRH-YQPLGQVWMEFHHQNV----QDYQKVLD 120

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L  +   ++Y   NVE+ RE F S P+QV V KI  S++  L+F    D  L       G
Sbjct: 121 LKNSIGSIQYRYNNVEYQRECFISYPNQVFVYKIKASQNQQLNF----DLYLTRRDIRPG 176

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
            ++  ++     K     +  N + K GI ++    +++ D  G +      +L +E + 
Sbjct: 177 RSESYVDDIHIEKDYLYLSGYNGNQKNGISYTMATTVQLKD--GCLKKY-GSRLVIENAT 233

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
            A++ +V  +S+            +P       L      SY +L   H+ DYQ  F ++
Sbjct: 234 EAIVYVVGRTSY---------RSHNPFQWCQKQLDKTLLKSYRNLKQDHIRDYQNYFDQL 284

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSA-ERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
            + L              EN+ ++P   +++K  Q D D  L+E  F FGRYLLISSSR 
Sbjct: 285 ELTLGDH---------KNENMMSIPERLQKMKEGQIDLD--LIETYFHFGRYLLISSSRE 333

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           G+  ANLQGIWN +  P W S   +NIN++MNYW +    LS    PL      +   G 
Sbjct: 334 GSLAANLQGIWNGEFEPPWGSRYTININIQMNYWLAEKTGLSRLHLPLMQLQKIMLPRGQ 393

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           K A+  Y   G   HH TDIW   +     V   LWPMG  WL  H++EHY YT +++F+
Sbjct: 394 KIAKEMYGCRGTCAHHNTDIWGDCAPADYYVPSTLWPMGSLWLSLHIFEHYQYTHNQEFI 453

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
            +  +P+L+  A F LD++ +  +G+  T PS SPE+ ++  DG+ A V  S +MD+ ++
Sbjct: 454 LE-YFPILKENALFFLDYMFKDANGFYATGPSVSPENAYMTQDGQAATVCLSPSMDIQLL 512

Query: 548 REVFSAIISAAEVLEKNE-DALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           RE F++ +   + L +++ +A + + L+ LP   P +I + G IMEW
Sbjct: 513 REFFTSYLQLLKELNRHDLEAEINEYLEKLP---PIQIGKYGQIMEW 556


>gi|336415344|ref|ZP_08595684.1| hypothetical protein HMPREF1017_02792 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940940|gb|EGN02802.1| hypothetical protein HMPREF1017_02792 [Bacteroides ovatus
           3_8_47FAA]
          Length = 648

 Score =  307 bits (787), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 198/591 (33%), Positives = 309/591 (52%), Gaps = 57/591 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+++++A+PIGN RLGAMV+GG+  E L+LNE+T W G P +  NP+A   L
Sbjct: 22  LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81

Query: 73  SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+  G+  EA     +  L       Y  LG + LEF +         + R+L+L
Sbjct: 82  PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             AT   +Y V +V +TR  F+S  D VI+  I  S++ +L+F ++ +  L +   V  +
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNFPLVHKVNVQND 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
              +    C GK          + +G++ +   E +I     GT+    +     EG++ 
Sbjct: 199 KLTVT---CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L + A++++    +N  D   D +  +   L+    + Y      H+  Y+K F RV 
Sbjct: 245 ATLYISAATNY----VNYQDVSADESRRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L        TD  S+     + + +R+++F   ED ++  LLF +GRYLLISSS+PG 
Sbjct: 301 LTLP-------TDKTSQ-----LETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS  G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSATGAET 408

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
           A+  Y   GW+ HH TD+W       G V +A   +WP GGAWL  H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L K  YP+L+G A F +D+L+E H  Y  L  +PS SPEH           ++   TMD 
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I  +     + A+ +  +   +  + + ++L +L P +I +   + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563


>gi|302669281|ref|YP_003832431.1| glycoside hydrolase family 95 Gh95A [Butyrivibrio proteoclasticus
           B316]
 gi|302396945|gb|ADL35849.1| glycoside hydrolase family 95 Gh95A [Butyrivibrio proteoclasticus
           B316]
          Length = 714

 Score =  307 bits (787), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 195/610 (31%), Positives = 301/610 (49%), Gaps = 86/610 (14%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +   AK +  A+P+GNG +GAM +GG   +  +LN D++W   P D  NPDA +++ 
Sbjct: 3   RLWYKEAAKDWNSALPLGNGFMGAMCFGGTLIDRFQLNNDSIWWSGPRDRINPDAKESIP 62

Query: 74  DVRSLVDSGQYAEAT-AASVKLFGHP--ADVYQLLGDIEL--------------EFDDSH 116
            +R L+  G+ ++A   A+  + G P     Y+ LGD+ +              E     
Sbjct: 63  VIRRLIREGRISDAEDLANEAMAGIPEYQSHYEPLGDLFIIPEGKERIQILGIREHWSGQ 122

Query: 117 LKYAEET--YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGS------ES 168
           L   EE   Y+RELD+      V Y+   V+F RE F SN D+V+  K  GS      E 
Sbjct: 123 LNRIEEIPDYKRELDIEKGIHTVSYTKDGVKFCRESFISNVDKVMAIKCLGSRLRIFAER 182

Query: 169 GSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
           G     V          Y    N + MEGR                 G++F  ++ +   
Sbjct: 183 GDQCEKV----------YKLSENTLCMEGRTGAD-------------GVRFCMVIRVVNG 219

Query: 229 DD--RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR 286
           +   RG +         +   D A +L+ + + F           +DP ++++  L + +
Sbjct: 220 NPYIRGRM---------LHADDDAEILIASQTDF---------YNEDPVADAVRTLDAAQ 261

Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDE 345
            L Y +L  RH+ D Q+L  R ++++              +N D +P+ +R+++  +   
Sbjct: 262 KLGYDELKKRHVCDVQELMDRCTLEID------------SDNRDNIPTDKRLQAVAEGGT 309

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
           D  L+ LLF +GRYLLISSSRPG+  ANLQGIWN+  SP WDS   +NIN +MNYW +  
Sbjct: 310 DNGLINLLFAYGRYLLISSSRPGSLPANLQGIWNDSFSPAWDSKFTININAQMNYWPAEV 369

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
             LSE  EPLFD +  +  NG + A   Y A GW+ HH TDIW   +        + W M
Sbjct: 370 TGLSELHEPLFDLMKRMLPNGRRAAAEMYCARGWMAHHNTDIWGDCAPQDTWQAASYWQM 429

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHE 525
           G AWLC H+ EHY YT D +F+ +   P+++  A F  D LIE   G L  +PS SPE+ 
Sbjct: 430 GAAWLCLHILEHYRYTQDENFM-REYLPMVKEAALFFEDSLIENEAGQLVVSPSVSPENT 488

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
           ++ P G+   +   ++MD  I+ E+FS +I   ++L   E      +L  LP+    +I+
Sbjct: 489 YVLPSGERGMMCEGASMDAQILYELFSGLI-GTDMLSSEEKERYTTILCKLPK---PQIS 544

Query: 586 EDGSIMEWVQ 595
           E G++ EW +
Sbjct: 545 EIGTVQEWAE 554


>gi|325281855|ref|YP_004254397.1| Alpha-L-fucosidase [Odoribacter splanchnicus DSM 20712]
 gi|324313664|gb|ADY34217.1| Alpha-L-fucosidase [Odoribacter splanchnicus DSM 20712]
          Length = 807

 Score =  307 bits (787), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 202/602 (33%), Positives = 307/602 (50%), Gaps = 59/602 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ +  PA+ + + +P+GNGRLG M  GGV  ET+ LN+ T+W+G   D  NP+A K L
Sbjct: 28  LKLWYTRPAERWEETLPLGNGRLGMMPDGGVVQETIVLNDITMWSGSFQDTRNPEALKYL 87

Query: 73  SDVRSLVDSGQYAEATAASVKLFG-------------HPADVYQLLGDIELEF---DDSH 116
            ++R L+  G+  EA     K F               P   +QLLG++ L++   D S 
Sbjct: 88  PEIRRLLLEGKNDEAQELMYKHFACGGQGSAFGQGANAPYGAFQLLGNLHLQYHFPDSSD 147

Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
           + Y+   Y R L L+ A A   +  G V++ RE+F S  + V++ K++    G L F+V+
Sbjct: 148 VGYS--AYERGLSLDKALAWTCFRKGKVKYRREYFVSQTEDVMIMKLTADRKGMLDFDVA 205

Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
           +D   +   Y N +  + MEG+          +      G ++   L++  +D R     
Sbjct: 206 IDRPENYTCYAN-DGVVYMEGQL---------DNGKGKAGTKYMVQLKVWTADGR---QV 252

Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
            +   + V+ +  A +L+ A +S             D        +Q   N+ Y  L  R
Sbjct: 253 ADSACIHVKEATTAYVLVSAGTSL---------WAADYPERVEKLMQIAGNMDYGYLLER 303

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           H   ++  ++RV + L  +P+DI+            P+ +R+  FQ  EDP LV L FQ+
Sbjct: 304 HDSAWRYKYNRVELDLG-TPQDIL------------PTDQRLARFQEQEDPGLVALYFQY 350

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLIS +R  +   NLQG+W   +   W+   H+NINL+MNYW     NLSE   PL 
Sbjct: 351 GRYLLISGTRENSFPLNLQGLWANSVQTPWNGDYHLNINLQMNYWPVEIVNLSELHTPLK 410

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
           + +  L  +G  TA   Y A GWV H  T+ W + +A      W     GGAWLC HLWE
Sbjct: 411 NLVKDLVTSGEVTAHSFYGAQGWVAHMMTNPW-RFTAPGEHASWGATNTGGAWLCEHLWE 469

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDG-KLA 534
           HY +T+D+++L +  YP+L G + F L  +IE    G+L T PS+SPE+ F  P   K  
Sbjct: 470 HYAFTLDQEYL-REVYPVLSGASRFFLSSMIEEPTQGWLVTAPSSSPENAFYMPGTRKEV 528

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA-EDGSIMEW 593
            V     MD  IIRE+FS  I AA +LE +  A  + + K+L +L P +I+ + G + EW
Sbjct: 529 SVCMGPAMDTQIIRELFSNTIQAARLLEIDA-AFADSLEKALDKLPPMQISPKGGYLQEW 587

Query: 594 VQ 595
           ++
Sbjct: 588 LE 589


>gi|260591756|ref|ZP_05857214.1| putative alpha-L-fucosidase 2 [Prevotella veroralis F0319]
 gi|260536040|gb|EEX18657.1| putative alpha-L-fucosidase 2 [Prevotella veroralis F0319]
          Length = 804

 Score =  307 bits (786), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 190/596 (31%), Positives = 303/596 (50%), Gaps = 47/596 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
           +++ +N PA  F ++IP+GNG+LGA+V+GG   +T+ LN+ T WTG P D  N    KA 
Sbjct: 24  MRLWYNQPAHFFEESIPLGNGKLGALVYGGTQKDTIYLNDITYWTGKPVD-PNEGLGKAK 82

Query: 72  -LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            + ++R  + +  Y  A +    + G  +  YQ LG + +   ++    A   Y REL+L
Sbjct: 83  WIPEIRKALFAENYRTADSLQHFVQGEQSASYQPLGTLNIINLNTG---AVSNYYRELNL 139

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           ++A A + Y    ++FTRE+F+++ D +I   I  +++G+++ ++ L +    H     N
Sbjct: 140 DSALAHISYQQNGIQFTREYFATHRDSLIAIHIKANQAGAINLHIQLTAQTP-HKVKATN 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           NQ+ M G   G                   A   +++    G + A  D  L +  +D A
Sbjct: 199 NQLTMTGHTTGSETE------------SVHACTIVRLLPQGGKVIA-SDSTLTLTNADNA 245

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            + +V ++SF+G   +P          +++A    +N +YS+   RH+ +YQ++++R+ +
Sbjct: 246 TIYIVNATSFNGFDKHPVKDGASYIDNAVNAAWHTQNFTYSEFKDRHIKEYQQIYNRIKL 305

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP-------SLVELLFQFGRYLLIS 363
           QL            ++E  + +P+ + ++ + +   P        L  L FQFGRYLL+S
Sbjct: 306 QLG-----------NKEYTNNLPTDQLLRRYSSSTAPLPEAAQRYLETLYFQFGRYLLLS 354

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
            SR     ANLQG+W   L   W     +NINLE NYW + P N+SE  +PL  F+  LS
Sbjct: 355 CSRTPNIPANLQGLWTPHLFSPWRGNYTMNINLEENYWPADPANMSETIQPLIGFVKGLS 414

Query: 424 INGSKTAQVNY-LASGWVIHHKTDIWAKSS-ADRGKVV--WALWPMGGAWLCTHLWEHYN 479
             G  TA+  Y +  GW   H +D W K+S    GK    WA W +GGAWL   LW+HY 
Sbjct: 415 ATGKHTARNFYGINEGWCAAHNSDPWCKTSPVGEGKESPEWANWNLGGAWLVNALWDHYL 474

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVS 537
           Y+ D+  L+   YPL+EG + F   WL+   +    L T PSTSPE+E++   G      
Sbjct: 475 YSQDKQLLQNTIYPLMEGSSKFFQQWLVTNPNKPNELITAPSTSPENEYVTDKGYHGTTC 534

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           Y  T D+AIIRE+F  +  A + L    D  ++     L RL P  +   G + EW
Sbjct: 535 YGGTADLAIIRELFMNMQQARKSLGLKPDKEMD---DKLHRLHPYTVGSQGDLNEW 587


>gi|198275795|ref|ZP_03208326.1| hypothetical protein BACPLE_01970 [Bacteroides plebeius DSM 17135]
 gi|198271424|gb|EDY95694.1| hypothetical protein BACPLE_01970 [Bacteroides plebeius DSM 17135]
          Length = 816

 Score =  307 bits (786), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 201/590 (34%), Positives = 309/590 (52%), Gaps = 50/590 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+++ +A+P+GN  LG MV+GG+  E ++LNE+T W G P       A   L
Sbjct: 26  LKLWYSAPARNWWEALPVGNSHLGGMVFGGINHEEIQLNEETFWAGGPYSNNRTGASGYL 85

Query: 73  SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            +VR L+   +  EA     + F   H    Y  LG + ++F+    +   ++Y R+L+L
Sbjct: 86  DEVRRLIFENKNLEARTLLDEKFMTSHHGMRYLTLGSLLMDFN---CEGKVDSYYRDLNL 142

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             ATA V++    VE+TR  F+S  D V+V +++ ++ G+   +V L       S V   
Sbjct: 143 EDATASVRFRCDGVEYTRRVFTSFSDNVMVVEMA-TDKGNKKLDVDLRYTCPLTSEVKSE 201

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
              ++  +C G      A     P  +   A++ +++  D G I   +D +L V G+  A
Sbjct: 202 GDYLIM-KCNG------AEHEGIPAALH--AVVMMRVKSD-GKIEC-KDGRLSVRGASSA 250

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            + L A+++F    +N  D   D  +++  A++   +     LY  H   Y   F RV++
Sbjct: 251 TVFLSAATNF----VNYQDVSGDAYAKARCAIEGAWDKQNKKLYDEHKAIYSAQFGRVAL 306

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            L  S       +  E N+       R+  F   +D SL  L+FQ+GRYLLISSS+PG+Q
Sbjct: 307 HLPSS-----EFSKKETNV-------RINEFNKVKDCSLAALMFQYGRYLLISSSQPGSQ 354

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWN+DL   WDS   +NIN EMNYW +   NLSE   P F     LS+ G + A
Sbjct: 355 PANLQGIWNKDLYAPWDSKYTININAEMNYWPAEVTNLSETHVPFFQMAHELSVTGKEAA 414

Query: 431 QVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
           +V Y A GWV HH TDIW  +     AD G     +WP GGAW+  HLW+HY Y+ D++F
Sbjct: 415 RVLYGAKGWVAHHNTDIWRAAGPVDFADAG-----MWPNGGAWVAQHLWQHYLYSGDKNF 469

Query: 487 LEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L +  YP+L+G A FLL ++ +    G+  T PS SPEH    P+G    +    TMD  
Sbjct: 470 L-REYYPVLKGTADFLLSFMTKHPRYGWRVTAPSVSPEH---GPNG--VSIVAGCTMDNQ 523

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           I  +V S  + AA ++  +  A  + +   + +L P +I +   + EW++
Sbjct: 524 IAFDVLSNTLRAARII-GDSKAYCDSLQSLISQLPPMQIGQYNQLQEWLE 572


>gi|453085568|gb|EMF13611.1| glycoside hydrolase family 95 protein [Mycosphaerella populorum
           SO2202]
          Length = 811

 Score =  306 bits (785), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 215/608 (35%), Positives = 309/608 (50%), Gaps = 70/608 (11%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           + +  PA  + D +PIGNGRLGAM+ G    E L LNED++W G P +  NP A K L  
Sbjct: 7   LFYESPANLWEDGLPIGNGRLGAMIRGTTNVERLWLNEDSVWYGGPQNRVNPAAHKNLEL 66

Query: 75  VRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLK----YAEETYRRE 127
           VR L+D  + AEA     + F G P  +  Y+ LGD+ + F           A ++YRR 
Sbjct: 67  VRELIDQNKIAEAENIMSRTFTGMPESMRHYEPLGDVFMHFGHGRFSGRGGAAVQSYRRA 126

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           LDL T  A V Y+     F RE FSS   +VI  +IS  +   LSF ++L+   DN ++ 
Sbjct: 127 LDLQTGLATVSYACQGGNFQREVFSSTVAEVICMRISSDQC--LSFLLTLNRGDDNDAH- 183

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE----------IKISDDRGTISAL 237
                     R   +      N +D   G+  +A++           +KI  D G     
Sbjct: 184 ----------RQFDRAFDTLTNTDD---GLVLTAVMGGRNAVELAIGVKIVCDDGVKVDS 230

Query: 238 EDKKLKVEGSDWAVLLLVAS-SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
               ++V     +VL+L+A  ++F     N  D+ +    E+  +       ++  L + 
Sbjct: 231 CGIDVEVSMQKGSVLILIAGETTFRN--TNAVDAVQQRLEEAAKS-------TWDQLLSA 281

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLF 354
           H+  + +L++RV + L +           E N+D V + +R++  +    +D  L  LLF
Sbjct: 282 HVAHFGRLYNRVELHLDQ-----------ELNVDHVSTDQRLEQARQHPGQDNELTALLF 330

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
            +GRYLLISSS      ANLQGIWN D  P W S    NINLEMNYW +   NL EC + 
Sbjct: 331 HYGRYLLISSSLS-GLPANLQGIWNCDAKPVWGSKYTANINLEMNYWPAEVTNLPECHQV 389

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
           LF+FL  L+  G++TAQ  Y   GW  HH TDIWA ++     +    W + GAWL TH+
Sbjct: 390 LFNFLERLAERGTQTAQQMYGCRGWTCHHNTDIWADTAPQDRSICATYWNLTGAWLSTHI 449

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK-- 532
           WEHY +T+D DFL+ R +P++ G A F  D+LIE  DG+L T+PS S E+ +  P+    
Sbjct: 450 WEHYLFTLDLDFLQ-RYFPIMRGSAQFFQDFLIE-RDGHLVTSPSISAENSYFLPNSNSN 507

Query: 533 -----LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
                +  +    T D  I+RE+F A I A  +L +   A  E VL  LP   PT+I + 
Sbjct: 508 NNKPVVGSICAGPTWDSQILRELFHACIQAGNLLHE-PVAEYEHVLNKLP---PTQIGKH 563

Query: 588 GSIMEWVQ 595
           G IMEW+ 
Sbjct: 564 GQIMEWLH 571


>gi|395804734|ref|ZP_10483969.1| glycoside hydrolase [Flavobacterium sp. F52]
 gi|395433122|gb|EJF99080.1| glycoside hydrolase [Flavobacterium sp. F52]
          Length = 778

 Score =  306 bits (785), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 203/608 (33%), Positives = 320/608 (52%), Gaps = 57/608 (9%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           S S  N L++ +  PA  + + +P+GNGRLG M  GG+ +E L LN+ TLW+G P D  N
Sbjct: 18  SFSQNNQLELWYTKPASQWEETLPLGNGRLGIMPDGGIETEKLVLNDITLWSGSPQDANN 77

Query: 66  PDAPKALSDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEF 112
             A   L  +R L+ + + +EA     + F         G  A+V    YQ+LGD+ L+F
Sbjct: 78  YKAYTFLPQIRELLLANKNSEAEQLINQNFVCTGPGSGSGDGANVQFGCYQVLGDMTLKF 137

Query: 113 DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
           D    K     Y R L++ TA A  ++++  V + RE+F+   D V+  K++ S+ G L+
Sbjct: 138 D-YKTKSKAINYSRNLNIQTALASTQFTIDGVIYKREYFAGFGDDVLFVKLTSSKKGKLN 196

Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
           F V LD   ++   VN +N ++M G+          N   D KG+++ A ++ K +D  G
Sbjct: 197 FTVKLDRS-EHFKTVNSDNSLVMTGQL---------NNGIDGKGMKYKAKVKAKTAD--G 244

Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
           ++    +  ++V+ +   VL + A + F        ++  D T E   ALQ      Y +
Sbjct: 245 SV-LYTNNTIEVKNATEVVLYVSAGTDFKNQNF---ETAVDKTLEI--ALQK----KYDE 294

Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLV 350
               H+ +YQKLF+RV++   ++ ++            T+P+ ER+ +F    D D  L 
Sbjct: 295 QKKTHIQNYQKLFNRVALNFGKTARN------------TLPTNERLDAFMKNPDSDTGLP 342

Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
            L +Q+GRYL ISS+R G    NLQG+W   +   W+   H+++N++MN+W     NLSE
Sbjct: 343 VLFYQYGRYLSISSTRVGLLPPNLQGLWAHQIQTPWNGDYHLDVNVQMNHWALETGNLSE 402

Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
              PL D +  +   G KTA+  Y A GWV H  T+IW  +        W +   G  WL
Sbjct: 403 LNLPLKDLVKEMVPYGEKTAKAYYNADGWVAHVITNIWGFTEPGE-SASWGIAKAGSGWL 461

Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAP 529
           C +LW HY YT D+ +L    YP+++G A F    L++  + G+L T+PS SPE+ F  P
Sbjct: 462 CNNLWNHYLYTNDQAYLAD-IYPIIKGAAQFYNSMLVKDPETGWLVTSPSVSPENSFFLP 520

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEV--LEKNEDALVEKVLKSLPRLRPTKIAED 587
           +G+ A V    T+D  I+RE+F+ +I+A+    L+    A +EK LK LP   P  ++ D
Sbjct: 521 NGQDAHVCMGPTIDNQIVRELFNNVIAASSKLGLDNTLKAELEKRLKLLP--PPGVVSPD 578

Query: 588 GSIMEWVQ 595
           G I EW++
Sbjct: 579 GRIQEWLK 586


>gi|340619499|ref|YP_004737952.1| alpha-L-fucosidase [Zobellia galactanivorans]
 gi|339734296|emb|CAZ97673.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
          Length = 809

 Score =  306 bits (785), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 194/603 (32%), Positives = 319/603 (52%), Gaps = 42/603 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + +  PAK +TDA P+GNGRL AM +GGV  E  +LNE++LW GVP +    D    L
Sbjct: 36  LTLWYTSPAKKWTDAFPLGNGRLAAMTFGGVAQERFQLNEESLWAGVPSNPFAEDYRAKL 95

Query: 73  SDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDS-HLKYAEETYRREL 128
           + ++ L+  G+  EA A  ++ +   PA    Y+ LGDI L+F D+ H+      Y+R L
Sbjct: 96  TKLQKLILEGKTLEANAFGLENMTAAPASFRSYEPLGDIVLDFKDTTHIS----NYKRAL 151

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           DL T  ++V Y   + E  RE F S  D  +  ++S   S  ++  +SL    D      
Sbjct: 152 DLETGISKVTYRTEDSEMVRESFISAEDDALFIRLSAKGSKKINCTISLARPKDVRITAT 211

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKG-----IQFSAILEIKISDDRGTISALEDKKLK 243
              ++ M G+      P   + N    G     + F+A L+ K+S   G      +  L 
Sbjct: 212 PEGKLYMLGQIVDIEAPEAHDENAGGSGEGGEHMSFAAGLQTKVS---GGKLCHTEHNLV 268

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           +E +D  ++   A++++D   +N  D+  DP+ +    L+ +   S+ +L   H ++++ 
Sbjct: 269 IENADEVLIAYTAATNYDLSKLN-FDASVDPSLKVRGILEKLDQKSWKELEYTHREEHRN 327

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLI 362
           +F RV   L  SP D            ++P+ ER+ +F+   +D  L   LFQFGRYLL+
Sbjct: 328 MFDRVQFDLGTSPND------------SLPTDERLLAFKNGAKDTGLPVQLFQFGRYLLM 375

Query: 363 SSSR-PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
            SSR P    ANLQG W+E +   W++  H+N+NL+MNYW +   N+SE  +PL ++   
Sbjct: 376 GSSRGPAVLPANLQGKWSERMWAPWEADYHLNVNLQMNYWPADVTNISETIDPLVNWFEL 435

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV-----WALWPMGGAWLCTHLWE 476
           +       A+  Y + GW  HH ++ + + +     +        L P+ GAW+  +LW+
Sbjct: 436 IVETSKPLAKEMYGSDGWFSHHASNPFGRVTPSASTLPSQFNNAVLDPLPGAWMAMNLWD 495

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP-DGKLAC 535
           HY +T D+ FL++R YPLL+G + F+LD L+E  +G L   PSTSPE+++  P  G++  
Sbjct: 496 HYEFTQDKVFLKERLYPLLKGASEFILDVLVEDSEGVLHFVPSTSPENQYKDPATGQMMR 555

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL---KSLPRLRPTKIAEDGSIME 592
           ++ +ST  ++IIR +F A + AA +L +  +   ++++   K+LP     K   +G +ME
Sbjct: 556 ITSTSTYHLSIIRAMFKATLEAATILGEGNNERCKRIVEAGKALPDFPIDKT--NGRMME 613

Query: 593 WVQ 595
           W Q
Sbjct: 614 WRQ 616


>gi|423605155|ref|ZP_17581048.1| hypothetical protein IIK_01736 [Bacillus cereus VD102]
 gi|401244303|gb|EJR50667.1| hypothetical protein IIK_01736 [Bacillus cereus VD102]
          Length = 1193

 Score =  306 bits (784), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 202/600 (33%), Positives = 313/600 (52%), Gaps = 69/600 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
           L + +N PAK +   A+PIGNG +G MV+GGV  E ++ NE TLWTG P       Y N 
Sbjct: 84  LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 143

Query: 67  D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK 118
           D A   L  +R  +  G  + A   S +        FG     YQ  GDI L+F+     
Sbjct: 144 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDAS 199

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
            +   YRREL+LN   + V YS   V++ RE+F+S PD+V+V +++ SES  LS +V   
Sbjct: 200 -SFSNYRRELNLNEGISTVSYSYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDVRPT 258

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
           S        + + +I ++G+           AN+   G+++ +  E K+ ++ GT++A E
Sbjct: 259 SAQGGQ-VTSKDKKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLTA-E 301

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           + K+KV  +D   +++ A++ ++  +  P+   +DP  +    + +I N SY  L   H+
Sbjct: 302 NGKIKVANADSLTIIMTAATDYENKY--PNYKGEDPHQKVEKIMSAISNKSYEVLKYTHI 359

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
            DY  LF+RVS+ L                  +VP+ E + S+  +    L EL FQ+GR
Sbjct: 360 KDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQYGR 406

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSSRPGT  ANLQG+WN   +P W+S  H NINL+MNYW +   NLSE  EPL D+
Sbjct: 407 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPLMDY 466

Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
           +  L   G  +A+ ++     GW ++   + +  ++   G + W   P   A++  +LWE
Sbjct: 467 VDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNLWE 525

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           HY +T D+ +L+++ YP+L+  A F   +L+E  +  L  +P  SPE         L  +
Sbjct: 526 HYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------LGGI 576

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           S     D  ++ E+FS +I A+EVL+ +    D L  K  K  P   P +I   G + EW
Sbjct: 577 SNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQVQEW 633


>gi|229197298|ref|ZP_04324028.1| hypothetical protein bcere0001_28460 [Bacillus cereus m1293]
 gi|228586175|gb|EEK44263.1| hypothetical protein bcere0001_28460 [Bacillus cereus m1293]
          Length = 1172

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 203/603 (33%), Positives = 314/603 (52%), Gaps = 75/603 (12%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
           L + +N PAK +   A+PIGNG +G MV+GGV  E ++ NE TLWTG P       Y N 
Sbjct: 63  LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 122

Query: 67  D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
           D A   L  +R  +  G  + A   S +        FG     YQ  GDI L+F   D S
Sbjct: 123 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 178

Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
                   YRREL+LN   + V Y+   V++ RE+F+S PD+V+V +++ SES  LS +V
Sbjct: 179 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 234

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
              S        + +N+I ++G+           AN+   G+++ +  E K+ ++ GT++
Sbjct: 235 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 278

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
           A E+ K+KV  +D   +++ A++ ++  +  PS   +DP  +    + +I N SY  L  
Sbjct: 279 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 335

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
            H+ DY  LF+RVS+ L                  +VP+ E + S+  +    L EL FQ
Sbjct: 336 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 382

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           +GRYLLISSSRPGT  ANLQG+WN   +P W+S  H NINL+MNYW +   NLSE  EPL
Sbjct: 383 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 442

Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            D++  L   G  +A+ ++     GW ++   + +  ++   G + W   P   A++  +
Sbjct: 443 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 501

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           LWEHY +T D+ +L+++ YP+L+  A F   +L+E  +  L  +P  SPE         L
Sbjct: 502 LWEHYKFTDDKQYLQEKIYPILKEAAEFHSKFLVEDQNKKLVVSPCWSPE---------L 552

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNE---DALVEKVLKSLPRLRPTKIAEDGSI 590
             +S     D  ++ E+FS +I A+ +L+ ++   D L  K  K  P   P +I   G +
Sbjct: 553 GGISNGCAFDQQLVYELFSNVIEASNLLQIDKGFRDELKAKRDKLFP---PIQIGRYGQV 609

Query: 591 MEW 593
            EW
Sbjct: 610 QEW 612


>gi|393786769|ref|ZP_10374901.1| hypothetical protein HMPREF1068_01181 [Bacteroides nordii
           CL02T12C05]
 gi|392658004|gb|EIY51634.1| hypothetical protein HMPREF1068_01181 [Bacteroides nordii
           CL02T12C05]
          Length = 821

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 187/593 (31%), Positives = 312/593 (52%), Gaps = 48/593 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA  + +A+P+GNGR+GAMV+G V  E  +LNE+++W G P +  NP A +AL
Sbjct: 24  LKLWYDRPATQWVEALPLGNGRIGAMVYGDVLHEEFQLNEESIWGGSPYNNVNPKAKEAL 83

Query: 73  SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
             +R L+  G+  EA         S    G P   YQ +G + L+F+  +  Y++  Y R
Sbjct: 84  PRIRQLIFEGRNKEAQEMCGHAICSQTANGMP---YQTVGSLHLDFEGVN-NYSD--YYR 137

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL--DNH 184
           ELD+  A    K++   V +TRE F+S PDQ+++ +++ S+   +SF    ++    D  
Sbjct: 138 ELDIEKAIVTTKFTSEGVTYTREAFTSFPDQLLIIRLTASQKRKISFTARYNTPYGKDII 197

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
             V+   ++ + G         KAN ++  +G ++FS +   ++  + G   A+ D  L+
Sbjct: 198 RNVSSRKELQLHG---------KANDHEGIEGKVRFSTL--TRVEHNGGYTEAIADTLLR 246

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           +  ++ +V L V   S    FIN +D   +    + + L++    +Y      H   Y+K
Sbjct: 247 ISNAN-SVTLYV---SIGTNFINYNDVSGNALKTAQNYLKNAGK-NYQKAKETHCSTYRK 301

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
            F+RVS+ L  + +               P+  RV+ F +  DP L  L FQFGRYLLI 
Sbjct: 302 WFNRVSLDLGSNAQSFK------------PTDVRVREFTSTFDPQLAALYFQFGRYLLIC 349

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SS+PG Q ANLQGIWN  L   WD     +IN+EMNYW +   NL E  EP    +  ++
Sbjct: 350 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTNLPEMHEPFLQLIKEVA 409

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
             G ++A + Y   GW +HH TDIW  + +  G   + +WP   +W C HLW+HY ++ +
Sbjct: 410 EKGKQSAAM-YGCRGWTLHHNTDIWRSTGSVDGP-GYGIWPTCNSWFCQHLWDHYLFSGN 467

Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           RD+L +  YPL+     F LD+LI +  + +L  +PS SPE+  +    +   +   +TM
Sbjct: 468 RDYLTE-IYPLMRSACEFYLDFLIRDPKNNWLVVSPSYSPENRPVVNGKRDFTIVAGATM 526

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           D  ++ ++F   + AA ++ ++  A ++ +   +  L P ++   G + EW++
Sbjct: 527 DNQMVNDLFRNTLEAASLIGES-SAFIDSLQTVIQNLAPMQVGRWGQLQEWME 578


>gi|299147305|ref|ZP_07040370.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
 gi|298514583|gb|EFI38467.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
          Length = 811

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 192/590 (32%), Positives = 303/590 (51%), Gaps = 55/590 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+++++A+PIGN RLGAMV+GG+  E L+LNE+T W G P +  NP+A   L
Sbjct: 22  LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81

Query: 73  SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+  G+  EA     +  L       Y  LG + LEF +         + R+L+L
Sbjct: 82  PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NGSGFYRDLNL 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             AT   +Y V +V +TR  F+S  D VI+  I  S++ +L+F ++ +  L +   V  +
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANTLNFTIAYNFPLVHKVNVQND 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
              +    C GK          + +G++ +   E +I     +        L++     A
Sbjct: 199 KLTVT---CQGK----------EQEGLKAALRAECQIQVKTNSTLRPGGNTLQINEGTEA 245

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            L + A++++    +N  +   D +  +   L+    + Y      H+  Y+K F RV +
Sbjct: 246 TLYISAATNY----VNYQNVSADESHRTSEYLKRATQIPYEKALKSHIAYYKKQFDRVRL 301

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            L                I  + + +R+++F   ED ++  LLF +GRYLLISSS+PG Q
Sbjct: 302 TLPTG------------KISQLETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGGQ 349

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS+ G++TA
Sbjct: 350 PANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAETA 409

Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDFL 487
           +  Y   GW+ HH TD+W       G V +A   +WP GGAWL  H+W+HY +T +++FL
Sbjct: 410 RTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEFL 465

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
            K  YP+L+G A F +D+L+E H  Y  L  +PS SPEH           ++   TMD  
Sbjct: 466 -KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDNQ 514

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           I  +     + A+ +  +   +  + + ++L +L P +I +   + EW++
Sbjct: 515 IAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563


>gi|336404644|ref|ZP_08585337.1| hypothetical protein HMPREF0127_02650 [Bacteroides sp. 1_1_30]
 gi|335941548|gb|EGN03401.1| hypothetical protein HMPREF0127_02650 [Bacteroides sp. 1_1_30]
          Length = 811

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 194/591 (32%), Positives = 306/591 (51%), Gaps = 57/591 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+++++A+PIGN RLGAM++GG+  E L+LNE+T W G P +  NP+A   L
Sbjct: 22  LKLWYSQPAQNWSEALPIGNSRLGAMIYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81

Query: 73  SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+  G+  EA     +  L       Y  LG + LEF +         + R+L+L
Sbjct: 82  PVVRKLIFEGRNKEAQRLIDTNFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             AT   +Y V +V +TR  F+S  D VI+  I  S++ +L+F ++ +  L +   V  +
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQND 198

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
              +    C GK          + +G++ +   E +I     GT+    +     EG++ 
Sbjct: 199 KLTVT---CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L + A++++    +N  D   D +  +   L+    + Y      H+  Y+K F RV 
Sbjct: 245 ATLYISAATNY----VNYQDVSADESHRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L                   + + +R+++F   ED ++  LLF +GRYLLISSS+PG 
Sbjct: 301 LTLPAG------------KASQLETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
           A+  Y   GW+ HH TD+W       G V +A   +WP GGAWL  H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L K  YP+L+G A F +D+L+E H  Y  L  +PS SPEH           ++   TMD 
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I  +     + A+ +  +   +  + + ++L +L P +I +   + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563


>gi|295086436|emb|CBK67959.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
          Length = 811

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 197/591 (33%), Positives = 312/591 (52%), Gaps = 57/591 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+++++A+PIGN RLGAMV+GG+  E L+LNE+T W G P +  NP+A   L
Sbjct: 22  LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAIHVL 81

Query: 73  SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+  G+  EA     +  L       Y  LG + LEF +         + R+L+L
Sbjct: 82  PAVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             AT   +Y + +V +TR  F+S  D VI+  I  S++ +L+F ++ +  L +   V  N
Sbjct: 139 ENATTTTRYQMDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
           +Q+ +   C GK          + +G++ +   E +I     GT+    +     EG++ 
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L + A++++    +N  D   + +  +   L+    + Y      H+  Y+K F RV 
Sbjct: 245 ATLYISAATNY----VNYQDVSANESHRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L        T   S+     + + +R+++F   ED ++  LLF +GRYLLISSS+PG 
Sbjct: 301 LTLP-------TGKASQ-----LETPKRIENFGYGEDMAMAALLFHYGRYLLISSSQPGG 348

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
           A+  Y   GW+ HH TD+W       G V +A   +WP GGAWL  H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L K  YP+L+G A F +D+L+E H  Y  L  +PS SPEH           ++   TMD 
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I  +     + A+ +  +   +  + + ++L +L P +I +   + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563


>gi|160883519|ref|ZP_02064522.1| hypothetical protein BACOVA_01491 [Bacteroides ovatus ATCC 8483]
 gi|156110932|gb|EDO12677.1| hypothetical protein BACOVA_01491 [Bacteroides ovatus ATCC 8483]
          Length = 793

 Score =  305 bits (782), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 199/593 (33%), Positives = 303/593 (51%), Gaps = 58/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PAK + +A+P+GN RLGAMV+G    E L+LNE+T+W G P    NP A +AL
Sbjct: 10  LKLWYDRPAKVWEEALPLGNSRLGAMVYGIPQREELQLNEETIWGGSPYRNDNPKAVQAL 69

Query: 73  SDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
            + R L+ +G+  EA     + F     G P   +Q  G I L F   H  Y  + + RE
Sbjct: 70  PEARKLIFAGKNTEADKLINETFFTRAHGMP---FQTAGSIILNFP-GHENY--QNFYRE 123

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
           LDL  A +  +Y+V  VE+ RE ++S  D VIV +I+ S   +++F +     ++ +  V
Sbjct: 124 LDLGRAVSTTRYTVDGVEYAREAYASFADDVIVMRITASRKRAINFVLEYSRPVNFNVSV 183

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
            G+  I        + IP + N             +  ++  + G    L ++ + V+ +
Sbjct: 184 KGSTLIFHSKGTDHEGIPGEINYQ-----------IHTRVVTNDGEAEVLNNR-IVVKNA 231

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
             A L +   S+F        D      ++ +    +I+N +Y     +H++ + + F+R
Sbjct: 232 TVATLYISIGSNFIDYKTLGGDEYVAKVTQKLDC--AIKN-NYKAALKKHIEIFSQQFNR 288

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
             + L      +  +T            +R+  FQ D+DPSLV LL QFGRYLLI SS+P
Sbjct: 289 FKLNLGNRSDGVKKNTL-----------QRIADFQIDQDPSLVTLLTQFGRYLLICSSQP 337

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           G Q ANLQGIW   ++P+WDS   +NIN EMNYW +   NLSE   P    +  LS NG 
Sbjct: 338 GGQPANLQGIWCHQMNPSWDSKYTLNINAEMNYWPAEVTNLSETHLPFLQMVKDLSENGR 397

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDR 484
           +TA + Y A GW +HH TDIW  +    G + +A   +WP GGAW+C HLWEHY YT D+
Sbjct: 398 RTAAMMYNAEGWTVHHNTDIWRVT----GPIDFARSGMWPTGGAWVCQHLWEHYLYTGDK 453

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
            FL    YP ++G A + L  +++ H  Y  +   PS SPE            V    TM
Sbjct: 454 KFLAD-VYPAMKGAADYFLSSMVK-HPKYDWMVVCPSVSPEQ---------GGVVAGCTM 502

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           D  +I E+ +    A E+L ++     +K+ + L +L P  I +   + EW++
Sbjct: 503 DNQLIIELLTKTAKANEILGESP-VYRQKLYELLEKLPPMHIGKHTQLQEWLE 554


>gi|160879541|ref|YP_001558509.1| hypothetical protein Cphy_1395 [Clostridium phytofermentans ISDg]
 gi|160428207|gb|ABX41770.1| conserved hypothetical protein [Clostridium phytofermentans ISDg]
          Length = 758

 Score =  305 bits (782), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 196/588 (33%), Positives = 302/588 (51%), Gaps = 60/588 (10%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
           + +A+P+GNG  GAM++G V  E +KLN++++W G   +  NPD+ K L  VR L+  GQ
Sbjct: 18  WEEALPLGNGSFGAMLYGNVEEEVIKLNQESVWYGGFRNRINPDSRKVLPKVRELIFDGQ 77

Query: 84  YAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE---------TYRRELDLN 131
              A       +FG P     Y+ L D+ + F+   L ++E+          Y+R LDL 
Sbjct: 78  LKAAEELVYTSMFGTPISQGHYEPLADLRIAFNKRILHHSEQWQERQINHSNYKRFLDLQ 137

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN- 190
           TA     Y+    ++ RE   S PDQV+  +++      +   + LD   +N+  V  N 
Sbjct: 138 TACYNSSYTWRETDYKREALISYPDQVMAIRLTAD--NPMGVRIELDRG-ENYEKVEANE 194

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           N I + G C G              G +F A +++ ISD  GTI       L+VE +   
Sbjct: 195 NTITLSGSCGGN-------------GSKFIAKVQV-ISD--GTI-VRAGAFLEVENASEI 237

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           VL +   + F          ++DP       L       Y ++   H+ DY  L+ RV +
Sbjct: 238 VLYVAGRTDF---------YEEDPMDWCNEKLALAAQKGYEEIKKDHIADYASLYQRVDL 288

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGT 369
            L+            ++N   +P+ ER++ F+ ++ D  L+EL + +GRYLLISSSR G 
Sbjct: 289 DLN-----------GDKNYLNLPTDERLRLFKENKLDDGLLELYYNYGRYLLISSSREGA 337

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
             ANLQGIWN+D+ P W S   +NIN +MNYW +   NLSEC  PLF+ +  +  +G + 
Sbjct: 338 LPANLQGIWNKDMMPAWGSKYTININTQMNYWPAEVTNLSECHTPLFEHIKRMVPHGREV 397

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+  Y   G V HH TDI+         +   +WPMG AWL TH+ EHY YT D  F+ K
Sbjct: 398 AEKMYGCRGIVAHHNTDIYGDCVPQGKWMPATMWPMGFAWLATHVIEHYRYTKDVSFV-K 456

Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
             Y +L+  + F +D+L+   +  L T PSTSPE+ +I  +G+ + + Y  +MD  II+E
Sbjct: 457 DFYSILKDASLFYVDYLVRDKENQLVTCPSTSPENTYILENGEKSTLCYGPSMDSQIIKE 516

Query: 550 VFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +++  I  +  LE + D +  VE +LK LP+    K+   G ++EW +
Sbjct: 517 LWTGFIEVSSDLEVSNDVVSAVENMLKELPK---AKVGSRGQLLEWTK 561


>gi|237719758|ref|ZP_04550239.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
 gi|229451027|gb|EEO56818.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
          Length = 811

 Score =  305 bits (781), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 196/591 (33%), Positives = 312/591 (52%), Gaps = 57/591 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           LK+ ++ PA+++++A+PIGN RLGAM++GG+  E L+LNE+T W G P +  NP+A   L
Sbjct: 22  LKLWYSQPAQNWSEALPIGNSRLGAMIYGGIEREELQLNEETFWAGSPYNNNNPNAIHVL 81

Query: 73  SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             VR L+  G+  EA     +  L       Y  LG + LEF +         + R+L+L
Sbjct: 82  PAVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             AT   +Y + +V +TR  F+S  D VI+  I  S++ +L+F ++ +  L +   V  N
Sbjct: 139 ENATTTTRYQMDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
           +Q+ +   C GK          + +G++ +   E +I     GT+    +     EG++ 
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L + A++++    +N  D   + +  +   L+    + Y      H+  Y+K F RV 
Sbjct: 245 ATLYISAATNY----VNYQDVSANESHRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L        T   S+     + + +R+++F   ED ++  LLF +GRYLLISSS+PG 
Sbjct: 301 LTLP-------TGKASQ-----LETPKRIENFGYGEDMAMAALLFHYGRYLLISSSQPGG 348

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
           Q ANLQGIWN      WDS   +NIN EMNYW +   NLSE   PLF  L  LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
           A+  Y   GW+ HH TD+W       G V +A   +WP GGAWL  H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           L K  YP+L+G A F +D+L+E H  Y  L  +PS SPEH           ++   TMD 
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I  +     + A+ +  +   +  + + ++L +L P +I +   + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563


>gi|384181040|ref|YP_005566802.1| alpha-fucosidase [Bacillus thuringiensis serovar finitimus YBT-020]
 gi|324327124|gb|ADY22384.1| alpha-fucosidase [Bacillus thuringiensis serovar finitimus YBT-020]
          Length = 1172

 Score =  305 bits (781), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 200/600 (33%), Positives = 310/600 (51%), Gaps = 69/600 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
           L + +N PAK +   A+PIGNG +G MV+GGV  E ++ NE TLWTG P       Y N 
Sbjct: 63  LTLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPNSTSEYTYGNR 122

Query: 67  D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK 118
           D A   L  +R  +  G  + A   S +        FG     YQ  GDI L+F+     
Sbjct: 123 DGAASHLGSIREKLAKGDKSGAERESSQFLTGLQKGFGS----YQNFGDIYLDFNMPDAS 178

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
            A   YRREL+LN   A V Y+  +V++ RE+F+S PD+V+V +++ SE+  +S +V   
Sbjct: 179 -AFSNYRRELNLNEGIATVSYNYKDVQYNREYFTSYPDRVMVMRLTASEAKKISLDVRPT 237

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
           S        + +N+I M+G+                 G+++ A    K+ ++ GT++A E
Sbjct: 238 SAQGGQ-VTSVDNKITMKGQITNN-------------GMKYEAAF--KVLNEGGTLTA-E 280

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           + K+KV  +D   +++ A++ ++  +  P+   +DP  +    + +I   SY  L   H+
Sbjct: 281 NGKIKVANADSLTIIMTAATDYENKY--PTYKGQDPHEKVEKVMSAISKKSYEVLKYTHI 338

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
            DY  LF+RVS+ L                  +VP+ E + S+  +    L EL FQ+GR
Sbjct: 339 KDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQYGR 385

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSSRPGT  ANLQG+WN   +P W+S  H NINL+MNYW +   NLSE  EPL D+
Sbjct: 386 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPLMDY 445

Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
           +  L   G  +A+ ++     GW ++   + +  ++   G + W   P   A++  +LWE
Sbjct: 446 VDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNLWE 504

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           HY +T D+ +L+++ YP+L+  A F   +L+E  +  L  +P  SPE         L  +
Sbjct: 505 HYKFTDDKQYLQEKIYPILKEAAEFHSKFLVEDQNKKLVVSPCWSPE---------LGGI 555

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           S     D  ++ E+FS +I A+EVL+ +    D L  K  K  P   P +I   G + EW
Sbjct: 556 SNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQVQEW 612


>gi|269793879|ref|YP_003313334.1| hypothetical protein Sked_05400 [Sanguibacter keddieii DSM 10542]
 gi|269096064|gb|ACZ20500.1| hypothetical protein Sked_05400 [Sanguibacter keddieii DSM 10542]
          Length = 856

 Score =  305 bits (781), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 205/592 (34%), Positives = 297/592 (50%), Gaps = 57/592 (9%)

Query: 5   ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG--- 61
           ++ +   PL + ++ PA  +T+A+P+GNGRLGAM +GG   + +++N+DT W+G P    
Sbjct: 16  DNEAAARPLVLAYDAPAGRWTEALPVGNGRLGAMCFGGTTDDRVQVNDDTCWSGSPATTA 75

Query: 62  ---DYTNPDAPKALSDVRSLVDSGQYAEATAASVKL-FGHPADVYQLLGDIEL-EFDDSH 116
               +   + P  + D R+ + +G    A  A  +L  GH +  YQ L D+ L E D + 
Sbjct: 76  GRRHFETGEGPGIVDDARAALAAGDVRAAERAVQRLQHGH-SQAYQPLVDLLLVEVDPAG 134

Query: 117 LKYAEET---YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
                E    Y R LDL TA AR  ++       +E +SS P  V+V     ++    + 
Sbjct: 135 GAVDPEPRTGYARSLDLRTAVARHTWTGAGGTVVQETWSSAPRGVLVVDRRATDGTLPAL 194

Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKIS 228
            VSL S             + +  R P   +P    A+     D   G   +A + + + 
Sbjct: 195 RVSLTSPHPTLDVQGTPTGLAVTVRMPSDVVPDHEPADVHVRYDPAPGAAVTAAVHVAVH 254

Query: 229 DDR----GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSE--SMSAL 282
            D     G  SA  D  ++V G+ +  L+L   + F        D++  P  +  S+ A 
Sbjct: 255 TDGIVGDGGPSATADA-VEVVGATYVTLVLGTETDF-------VDAETAPHGDVDSLRAA 306

Query: 283 QSIRNLSYSD---------LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
            ++R     D         L   H+ D+  LF RV I L  +P   +T          VP
Sbjct: 307 VALRTSGVVDAITASGLPALRAEHVADHDALFGRVEIDLGPAPDSGLT----------VP 356

Query: 334 SAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
             ER+        DP+L  L  Q+GRYL+I+ SRPGT+  NLQGIWNE + P W S    
Sbjct: 357 --ERLARHAAGAPDPALAALQAQYGRYLMIAGSRPGTRPMNLQGIWNESVVPPWSSNYTT 414

Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS- 451
           NIN EMNYW + P NL EC EPL  +L  L+  G  TA+  Y   GW  HH +D+W  S 
Sbjct: 415 NINTEMNYWPAGPANLDECHEPLTSWLADLARTGGDTAREVYGLPGWAAHHNSDVWGFSL 474

Query: 452 SADRGKV--VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
            A  G     W  WP+GG WL THLW+ Y+++ D  FL   A+PLL G A F L WL+E 
Sbjct: 475 PAGDGDSDPSWTAWPLGGVWLATHLWDRYDWSRDLGFLAD-AWPLLRGAADFALAWLVEQ 533

Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL 561
            DG L T+P+TSPE+ ++APDG  A V+ S+T D+A++RE+    + AA+VL
Sbjct: 534 PDGTLGTSPATSPENRYVAPDGLPAAVTVSTTSDLAMVRELLGRCLDAAQVL 585


>gi|317057786|ref|YP_004106253.1| alpha-L-fucosidase [Ruminococcus albus 7]
 gi|315450055|gb|ADU23619.1| Alpha-L-fucosidase [Ruminococcus albus 7]
          Length = 756

 Score =  305 bits (780), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 198/588 (33%), Positives = 298/588 (50%), Gaps = 49/588 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           +I F  PA+ +  A+P+GNGR+G M +G   +E ++LNED++W+G P    N  A   L 
Sbjct: 5   RIWFRRPAEDWNVALPVGNGRIGGMCFGQALNEKIQLNEDSVWSGGPRKRNNASARANLE 64

Query: 74  DVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
            VR L+   + AEA    ++ F G P +   Y  LGD+ ++    H +   E   R LDL
Sbjct: 65  KVRQLLREEKIAEAEKIVMEAFCGTPVNERHYMPLGDLSIQ---HHKEDTFEYTERSLDL 121

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS---LLDNHSYV 187
             A    +YS+  V +TR    S P QV+   I   +  S+S  VS+D      D++S V
Sbjct: 122 ENAVCETRYSINGVNYTRRVICSEPAQVMAVCIDADKPASVSVKVSIDGRDDYFDDNSPV 181

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
           N +  I+  G C  +             GI F+A   I++    GT+       +  +  
Sbjct: 182 N-DTDILYYGGCGSE------------DGICFAAY--IRVLGYGGTVGRW-GSSIVTDCC 225

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           D  +++L A + F       +D KK    + ++A       ++ +L   H +DY+  F R
Sbjct: 226 DRVMIILGAQTDF-----RVTDYKKGAELDVITAAGK----TFEELLAEHTEDYRSYFDR 276

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSR 366
             I        +  D  S     ++P+ ER+K  +    D  LV L F FGRYL+I+ SR
Sbjct: 277 AEI--------VFEDGGSY----SLPTDERLKLVKDGGVDNGLVSLYFDFGRYLMIAGSR 324

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
            GT   NLQGIWN+D+ P W     VNIN EMNYW + PC L +   PLFD +  +  +G
Sbjct: 325 EGTLPLNLQGIWNKDMWPAWGCRFTVNINTEMNYWCAEPCGLGDLHIPLFDHIERMRPHG 384

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
             TA+  Y  SG+V HH TDIW  ++     +    W  G AWLCTH+WEH+ +T D++F
Sbjct: 385 RDTAREMYGCSGFVCHHNTDIWGDTAPQDLWIPGTQWVTGAAWLCTHIWEHWLFTQDKEF 444

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           L ++ Y  ++  A F +D+LI+   G L T PS SPE+ +I   G    V    +MD  I
Sbjct: 445 LAQK-YDTMKEAAKFFVDFLIDDGSGRLVTAPSVSPENTYITESGARGSVCIGPSMDSQI 503

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           I ++F+A+I A ++L  ++ +  EK+     RL   +I + G I EW 
Sbjct: 504 IYQLFTAVIEAGKILGIDK-SFGEKLSAMRERLPKPEIGKYGQIKEWA 550


>gi|452000004|gb|EMD92466.1| glycoside hydrolase family 95 protein [Cochliobolus heterostrophus
           C5]
          Length = 806

 Score =  304 bits (779), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 200/604 (33%), Positives = 308/604 (50%), Gaps = 57/604 (9%)

Query: 11  NPLKITFNGP--AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
           N  ++ +  P  +  +TDA+PIGNGRLGAM++G    E ++LNE+T+W+G   D  N + 
Sbjct: 21  NSTRLWYTAPVASSTWTDALPIGNGRLGAMIYGIPVQELIQLNEETIWSGGRRDRVNQNG 80

Query: 69  PKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYR 125
            + +S+VR L+  G    A   A++ + G P     YQ LGD+E+ FD +  +Y   TY 
Sbjct: 81  AQTVSEVRDLLARGDAGGAQKLANLGMMGTPQTCRNYQTLGDMEISFDGTS-EYDNTTYE 139

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           R LDL+TA A V++ V +  + RE F S PD V V  +  + +G LSF + +    D  +
Sbjct: 140 RWLDLDTALAGVRFKVNDTLYEREMFVSVPDDVFVHHLKATGNGKLSFQIRVHRPKDGLN 199

Query: 186 YV-----NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
                  N N    M G   G           DP  + F+  L ++      T+      
Sbjct: 200 EASDQNWNENGWTYMTGGTGGI----------DP--VVFTTALAVESDGHVRTLGEF--- 244

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + VE +  A   L A++S+            D  +   S +Q  R  +Y +L  RH++D
Sbjct: 245 -IVVENATEATAFLAAATSY---------RHNDTRAAVDSTIQKARQHTYEELRRRHIED 294

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRY 359
           Y  L++   + L+    D+ T +        +P+  R+ + +    DP LV L + +GRY
Sbjct: 295 YSPLYNASVLNLN--GPDLGTSS--------LPTNARINATRRGANDPGLVALAYNYGRY 344

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LLISSSR G   +NLQGIWN++  P W S   VNINL+MNYW +   +LS   EP FD L
Sbjct: 345 LLISSSRAGNLPSNLQGIWNKEFDPLWGSKYTVNINLQMNYWPAEVTSLSSLHEPFFDLL 404

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
             +  +G+ TA+  Y ASGW+ HH TD+W  ++     +    W +   WL TH+ EHY 
Sbjct: 405 ELMRKDGTHTAKAMYNASGWMSHHNTDLWGDTAPVDTYLPATYWTLSSGWLVTHILEHYW 464

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           YT D+ FL    + + E    F LD L      G + YL TNPS SPE+ ++ PDGK   
Sbjct: 465 YTGDKSFLASNLHIVSEAI-EFYLDTLQPYKTNGTE-YLVTNPSVSPENTYVGPDGKSYN 522

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAED--GSIM 591
              + T D+ I+ E+F+  ++A   L  +  + A + ++  +  +L P + +    G++ 
Sbjct: 523 FDIAPTCDVEILNELFTNYLNAVATLSNSTVDSAFLTRIRDTQAKLPPYRYSTRYPGTLQ 582

Query: 592 EWVQ 595
           EW+Q
Sbjct: 583 EWMQ 586


>gi|343083763|ref|YP_004773058.1| glycoside hydrolase [Cyclobacterium marinum DSM 745]
 gi|342352297|gb|AEL24827.1| glycoside hydrolase family 65 central catalytic [Cyclobacterium
           marinum DSM 745]
          Length = 806

 Score =  303 bits (777), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 192/606 (31%), Positives = 325/606 (53%), Gaps = 35/606 (5%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           A  T+ +  +++ +  PA  + +A+PIGNGRLGAM++GGV  E ++LNE++LW G+P D 
Sbjct: 32  ARKTNNSKKMQLWYTSPANEWLEALPIGNGRLGAMIFGGVKEEQIQLNEESLWAGMPEDP 91

Query: 64  TNPDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYA 120
              D  K  +  + L   G+Y EA    ++ L   P  +  Y+ LG++ + FD  H K +
Sbjct: 92  YPEDVQKHYAAFQQLNMEGKYEEALKYGMEHLAVSPTSIRSYEPLGELHITFD--HQK-S 148

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
            E YRR LDL T      Y++    + RE FSS+   VI  +    +   ++  +  D  
Sbjct: 149 PENYRRTLDLETGVVISTYTIDGKRYLREAFSSDKYDVIFYRFQSLDGEPVNSTIRFDRE 208

Query: 181 LDNHSYVNGNNQIIMEGRC---PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
            D    +     +I++G+    P         + +  + ++F++  +I  + D G++S  
Sbjct: 209 KDIVQSIGEGELLIVDGQVFDDPDGYEDNPGGSGETGRHMKFAS--QITATLDEGSMSGN 266

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
           E+  L +E S    +++ A++ ++   +N  D   D   +++ +L+     +Y      H
Sbjct: 267 ENT-LNIENSTGYTVIVSAATDYNLAKLN-FDRNIDAKDKALKSLKGALETAYQTAKDAH 324

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQF 356
              + K+F+RV++ L  SP             DT+P+ +R+   +    D  + EL FQ+
Sbjct: 325 TAAHSKMFNRVALSLG-SPLQ-----------DTIPTDKRLDQVREGTNDNHITELFFQY 372

Query: 357 GRYLLISSS-RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           GRYLL+ SS       ANLQGIWN+++   W+S  H+NINL+MNYW +   NLSE   PL
Sbjct: 373 GRYLLMGSSVNRAILPANLQGIWNKEMWAPWESDFHLNINLQMNYWPADQTNLSESFVPL 432

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAK-----SSADRGKVVWALWPMGGAWL 470
            +F+  L+ NG  TA+    +SGW+ HH ++ + +     S+ D         P+ GAW+
Sbjct: 433 SNFMEKLAKNGEITAEKFIGSSGWMAHHVSNPFGRTTPSGSTKDSQMTNGYSNPLAGAWM 492

Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
              LW HY +T D+++L++ AYP+L G A F+LD+L E   G L T+PS SPE+ +I P 
Sbjct: 493 SLSLWRHYEFTQDQEYLKETAYPVLAGTAQFILDFLKENEKGELVTSPSYSPENAYIDPK 552

Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
            GK    + +++MD+ II ++F+A + A E++   +  L   + K+  +L P KI ++G+
Sbjct: 553 TGKATRNTTAASMDIQIINDIFNACLKAEEII--GDKQLTAAIKKASSKLPPIKIGKNGT 610

Query: 590 IMEWVQ 595
           + EW +
Sbjct: 611 LQEWYE 616


>gi|336425540|ref|ZP_08605561.1| hypothetical protein HMPREF0994_01567 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336012115|gb|EGN42041.1| hypothetical protein HMPREF0994_01567 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 835

 Score =  303 bits (776), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 203/631 (32%), Positives = 308/631 (48%), Gaps = 75/631 (11%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
           PA+ F DA  +GNG LG  V G    E + +NEDTLW+G  G Y NP       + R L 
Sbjct: 11  PAEQFWDAHYLGNGSLGMSVMGDPVLEEVYINEDTLWSGSEGFYLNPQHYDRFMEARRLA 70

Query: 80  DSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSH------LKYAEET-------YR 125
             G+  EA T  +  + G   + Y  L  + +    +       LK   E        YR
Sbjct: 71  LEGKGKEANTIINNDMEGRWLETYLPLASLHITMGQADNRRNMPLKMVIEPQPGDIEDYR 130

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT------KISGSESGSLSFNVSLDS 179
           R L L+ A   V +    + + RE+F S PD+          K        L F   +DS
Sbjct: 131 RCLSLSDAAETVSWIRDGIRYRREYFVSYPDRTAYVYCTAEPKEGEGRDKVLDFAFGVDS 190

Query: 180 LLDNHSYVNG--NNQIIMEGRCPGKRIP------PKANAND--DPKGIQFSAILEIKISD 229
            L    Y+NG  + +  + G  P    P      P+    D  +   ++F+    +  +D
Sbjct: 191 SL---HYINGAEDGEAFLTGIAPDHAEPSYTAVAPRFIYKDPENSDALRFACCARVISTD 247

Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM-SALQSIRNL 288
             GT+++ +  ++ V G+ +A+L + A +S+ G F  P D       E +   L  ++  
Sbjct: 248 --GTVAS-DGARVYVNGASYALLAVRAGTSYAG-FRVPRDRDAGKVLEELRKGLDGLQKA 303

Query: 289 S--YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK-SFQTDE 345
              Y      H+ DYQ L++RV + L              E    +P+ +R+    +  +
Sbjct: 304 GRDYEGARKDHVTDYQALYNRVDLDLG------------TELSGNLPTTQRLHFCGEGVD 351

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
           DPSL  L+ Q+ RYL I+ SRPG+Q  NLQGIWN+  +P W S    NIN+EMNYW    
Sbjct: 352 DPSLAALMLQYSRYLTIAGSRPGSQALNLQGIWNDTPNPPWSSNYTNNINVEMNYWPCEV 411

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
             L EC  P+ D LT L+  G +TA+  Y  +GWV HH  D+W  +        W+ WP 
Sbjct: 412 LGLPECHLPMMDLLTELADAGKQTAKEYYHMNGWVAHHNADLWRSTEPSCEDASWSWWPF 471

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHE 525
           GGAW+C H+W HY YT DR+FL K  YP+L   A+F+LD+L+E  +GYL T PS SPE++
Sbjct: 472 GGAWMCEHIWTHYEYTQDREFLRK-MYPVLREAAAFMLDFLVENKEGYLVTAPSLSPENK 530

Query: 526 F--------------IAPDGK-------LACVSYSSTMDMAIIREVFSAIISAAEVLEKN 564
           F              +A + +       ++ V+  STMDM+I+RE+FS +  AA++L+ +
Sbjct: 531 FLTSGEETVIELIDEVAKESRCSPNHPCISAVTIGSTMDMSILRELFSNVARAAQILDIS 590

Query: 565 EDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +D +  + L+S+ +  P +    G + EW +
Sbjct: 591 DDPVPVQALESMKKFPPYRTGRFGQLQEWYE 621


>gi|451854086|gb|EMD67379.1| glycoside hydrolase family 95 protein [Cochliobolus sativus ND90Pr]
          Length = 805

 Score =  302 bits (774), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 197/589 (33%), Positives = 300/589 (50%), Gaps = 55/589 (9%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
           +TDA+PIGNGRLGAM++G    E ++LNE+T+W+G   D  N +  + +S+VR L+  G 
Sbjct: 36  WTDALPIGNGRLGAMIYGIPVQERIQLNEETIWSGGRRDRVNQNGAQTVSEVRDLLARGD 95

Query: 84  YAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
            A A   A++ + G P     YQ LGD+E+ FD +  KY + TY R LDL+TA A V++ 
Sbjct: 96  AAGAQKLANLGMMGTPQTCRNYQTLGDMEISFDGTS-KYDKTTYERWLDLDTALAGVRFR 154

Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV-----NGNNQIIM 195
           V +  + RE F S PD V V ++  + +  LSF + +    D  +       N N    M
Sbjct: 155 VNDTLYEREMFVSVPDDVFVHRLKATGNEKLSFQIRVHRPKDGLNEASDQNWNENGWTYM 214

Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
            G   G           DP  + F+  L I+      T+       + VE +  A   L 
Sbjct: 215 TGGTGGI----------DP--VVFTTALAIESDGHVRTLGEF----IVVENATEATAFLA 258

Query: 256 ASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
           A++S+            D  +   S +Q  R  +Y +L  RH++DY   ++   + L+  
Sbjct: 259 AATSY---------RHNDTRAAVESTIQKARQHTYEELRRRHIEDYAPFYNASVLNLN-G 308

Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANL 374
           P    +D         +P+  R+ + +    DP LV L + +GRYLLI+SSR G   +NL
Sbjct: 309 PDLKTSD---------LPTNARINATRKGANDPGLVALAYNYGRYLLIASSRAGNLPSNL 359

Query: 375 QGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY 434
           QGIWN++  P W S   VNINL+MNYW +   +LS    P FD L  +  +G  TA+  Y
Sbjct: 360 QGIWNKEFDPLWGSKYTVNINLQMNYWPAEVTSLSSLHAPFFDLLELMRKDGMHTAKAMY 419

Query: 435 LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPL 494
            ASGW+ HH TD+W  ++     +    W +   WL TH+ EHY YT D+ FL     P+
Sbjct: 420 NASGWMSHHNTDLWGDTAPVDTYLPATYWTLSSGWLVTHILEHYWYTGDKGFLASN-LPI 478

Query: 495 LEGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
           +     F LD L      G + YL TNPS SPE+ ++ PDGK      + T D+ I+ E+
Sbjct: 479 VSEAIEFYLDTLQPYKANGTE-YLVTNPSVSPENTYVGPDGKSYNFDTAPTCDVQILNEL 537

Query: 551 FSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAED--GSIMEWVQ 595
           F+  ++A   L  +  + A + ++  +  +L P + +    G++ EW+Q
Sbjct: 538 FTNYLNAVATLSNSTVDSAFLTRIRDTQAKLPPYRYSTRYPGTLQEWMQ 586


>gi|383812006|ref|ZP_09967453.1| hypothetical protein HMPREF9969_0999 [Prevotella sp. oral taxon 306
           str. F0472]
 gi|383355392|gb|EID32929.1| hypothetical protein HMPREF9969_0999 [Prevotella sp. oral taxon 306
           str. F0472]
          Length = 781

 Score =  302 bits (773), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 187/596 (31%), Positives = 301/596 (50%), Gaps = 47/596 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
           +++ +N PA  F +++P+GNG+LGA+V+GG   +T+ LN+ T WTG P D  N    KA 
Sbjct: 1   MRLWYNQPAHFFEESLPLGNGKLGALVYGGTQKDTIYLNDITYWTGNPVD-PNEGLGKAK 59

Query: 72  -LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
            + ++R  + +  Y  A +    + G  +  YQ LG + +   ++    A   Y REL+L
Sbjct: 60  WIPEIRKALFAENYRTADSLQHFVQGEQSASYQPLGTLNIINLNTG---AVSNYYRELNL 116

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
           ++A   + Y    ++FTRE+F+++ D +I   I  +++G+++  + L +    H     N
Sbjct: 117 DSALVHISYQQNGIQFTREYFATHRDSLIAIHIKANQAGAINLRIQLTAQTP-HKVKATN 175

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           NQ+ M G   G                   A   +++    G + A  D  L +  +D A
Sbjct: 176 NQLTMTGHTTGSETE------------SVHACTIVRLLPQGGKVIA-SDSTLTLTNADNA 222

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            + +V ++SF+G   +P          +++A    +N +Y++   RH+ +YQ++++RV +
Sbjct: 223 TIYIVNATSFNGFDKHPVKDGASYIDNAVNAAWHTQNFTYNEFKDRHIKEYQQIYNRVKL 282

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP-------SLVELLFQFGRYLLIS 363
           +L            ++E  + +P+ + ++ + +   P        L  L FQFGRYLL+S
Sbjct: 283 KLG-----------NKEYTNNLPTDQLLRRYSSSTAPLPEAAQRYLETLYFQFGRYLLLS 331

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
            SR     ANLQG+W   L   W     +NINLE NYW + P N+SE  +PL  F+  LS
Sbjct: 332 CSRTPNIPANLQGLWTPHLFSPWRGNYTMNINLEENYWPADPANMSETIQPLIGFVKGLS 391

Query: 424 INGSKTAQVNY-LASGWVIHHKTDIWAKSS-ADRGKVV--WALWPMGGAWLCTHLWEHYN 479
             G  TA+  Y +  GW   H +D W K+S    GK    WA W +GGAWL   LW+HY 
Sbjct: 392 ATGKHTARNFYGINEGWCAAHNSDPWCKTSPVGEGKESPEWANWNLGGAWLVNALWDHYL 451

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVS 537
           Y+ D+  L+   YPL+EG + F   WL+   +    L T PSTSPE+E++   G      
Sbjct: 452 YSQDKQLLQNTIYPLMEGSSKFFQQWLVTNPNKPNELITAPSTSPENEYVTDKGYHGTTC 511

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           Y  T D+AIIRE+F  +  A + L    D  ++     L RL P  +   G + EW
Sbjct: 512 YGGTADLAIIRELFMNMQQARKSLGLKPDKEID---DKLHRLHPYTVGSQGDLNEW 564


>gi|325298040|ref|YP_004257957.1| alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
 gi|324317593|gb|ADY35484.1| Alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
          Length = 1004

 Score =  301 bits (770), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 186/592 (31%), Positives = 310/592 (52%), Gaps = 45/592 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PAK + + +P+GNGRLG M  GG+  E + LNE ++W+G   DY NP+A ++L  +R
Sbjct: 232 YDKPAKQWEETLPLGNGRLGMMPDGGITKEHIVLNEISMWSGSEADYRNPEAAESLPRIR 291

Query: 77  SLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
            L+  G+  EA       F       G     +Q+L D+ + +         + Y R L+
Sbjct: 292 QLLFEGKNKEAQELMYTSFVPKKPEKGGTFGCFQMLADMYINYTFPDTISQAKDYLRWLN 351

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L+   A   ++     + RE+F S    V++  +      +L F+++L      H     
Sbjct: 352 LDEGVAYTTFTKNATRYIREYFVSRNKDVMLIHLQADRPDALGFHLTLSRPERGHVRKLS 411

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
             ++ + G           + N+  +GI+++AI  +K+S  +  +    D  ++V  +D 
Sbjct: 412 EGKLEITGTL--------DSGNERQEGIRYAAIAGVKLSGKKSRMHTHADG-IEVSDADE 462

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A +++ A++S+    I  +++++       S L   +  +          +YQ+LFHR  
Sbjct: 463 AWIIVSANTSYMKGEIYQTETQRLLDQALASDLTQAKQEA--------TGEYQQLFHRAG 514

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           I+L  +       T S+ + D     +R+++FQT +DPSL  L + +GRYLLISS+RPG+
Sbjct: 515 IELPEN------KTVSQLSTD-----KRLEAFQTQDDPSLAALYYNYGRYLLISSTRPGS 563

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
              NLQG+W   +   W+   H NIN++MN+W   PCNLSE  +PL D +  L  +G +T
Sbjct: 564 LPPNLQGLWANGVMTPWNGDYHTNINVQMNHWPVEPCNLSELYQPLVDLIKRLVPSGEET 623

Query: 430 AQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           A+  Y   A GWV+H  T++W  +S       W     GGAWLC HLWEHY YT ++ +L
Sbjct: 624 AKAFYGSEAKGWVLHMMTNVWNYTSPGE-HPSWGATNTGGAWLCAHLWEHYLYTGNKQYL 682

Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIA--PDGKLACVSYSSTMDM 544
               YPLL+G + F    ++ E   G+L T P++SPE+EF     D     V    TMD+
Sbjct: 683 AD-IYPLLKGASEFFYSTMVREPEHGWLVTAPTSSPENEFYVSKKDRTPISVCMGPTMDI 741

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEWVQ 595
            ++RE+++ +I AA +L  + D+L    LK +  +L P +I++ G +MEW++
Sbjct: 742 QLVRELYTHVIEAASIL--HTDSLYANQLKEASAQLPPHQISKKGYLMEWLK 791


>gi|197302771|ref|ZP_03167824.1| hypothetical protein RUMLAC_01500 [Ruminococcus lactaris ATCC
           29176]
 gi|197298169|gb|EDY32716.1| hypothetical protein RUMLAC_01500 [Ruminococcus lactaris ATCC
           29176]
          Length = 773

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 202/596 (33%), Positives = 301/596 (50%), Gaps = 38/596 (6%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
           +K+ ++ PA+++ +++P+GNGR+GAMV+GG   E L LNEDTLW+G P + T    P+  
Sbjct: 1   MKLYYDHPAENWHESLPLGNGRIGAMVYGGTKKEILALNEDTLWSGYP-EKTQKKLPEGY 59

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
           L  VR L +  +Y +A     + F    DV  Y   G++ +E  D   + ++  Y REL 
Sbjct: 60  LEKVRELTEKREYQKAMEYLEECFSSSEDVQMYVPFGNVYMEMLDGTEEISD--YHRELC 117

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L+TA  R+ Y        +    S P QV+V KI   ++ SL   V      ++      
Sbjct: 118 LDTAEVRITYKNQGALVEKSCIVSQPAQVLVYKIRSEKAFSLKLYVEGGYARES---CCT 174

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQ-FSAILEIKISDDRGTISALEDKKLK----- 243
           +  +  +G+CPG R+P         K +  F    E +     G    + D K+      
Sbjct: 175 DGILKTKGQCPG-RVPFTVGEGGSEKAVPVFPEEPEKQGMCYEGWGKIVTDGKVNEAGNA 233

Query: 244 --VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
             VE ++   L     SSF G   +P    + P  E + A       SY  L T HL +Y
Sbjct: 234 VIVENAEEVTLYYGIRSSFAGFDRHPVIEGRCP-EELLKADFDCTGKSYEALRTEHLKEY 292

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYL 360
           QK + RVS  L         D  +E+++      +R+  FQ   ED  L  LLFQ+GRYL
Sbjct: 293 QKYYKRVSFSLGEK------DEYAEKDLR-----QRLTDFQDHPEDVGLNALLFQYGRYL 341

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LI++SRPGTQ ANLQGIWN +L P W S   +NIN EMNYWQ+ PCNL E  EPL     
Sbjct: 342 LIAASRPGTQAANLQGIWNAELVPPWFSDYTININTEMNYWQTGPCNLEEMGEPLVRLCE 401

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            ++ +G +TA   +   G    H TD+W K++   G+  W  WPMG AWLC +L++ Y +
Sbjct: 402 EMAADGKETAMHYFGKEGVCSFHNTDLWRKTTPADGRAEWNFWPMGYAWLCRNLYDQYLF 461

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD---GKLACVS 537
           T DR +LE R YP+L+    F ++ ++    GY   +P+TSPE++F+  +    KL    
Sbjct: 462 TEDRAYLE-RIYPVLKENVRFCVESVVGTAQGYA-MSPATSPENDFLFGEEKKEKLTVAQ 519

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           Y+   + AI+R +    + A  +L    D L  +  K    +    +  +G I+EW
Sbjct: 520 YTEN-ENAIVRNLLRDYLEAGRIL-GIRDELTGQAEKIFEEMAAPAVGSNGQILEW 573


>gi|389642921|ref|XP_003719093.1| hypothetical protein MGG_00050 [Magnaporthe oryzae 70-15]
 gi|351641646|gb|EHA49509.1| hypothetical protein MGG_00050 [Magnaporthe oryzae 70-15]
 gi|440473491|gb|ELQ42283.1| alpha-L-fucosidase 2 [Magnaporthe oryzae Y34]
 gi|440483559|gb|ELQ63936.1| alpha-L-fucosidase 2 [Magnaporthe oryzae P131]
          Length = 827

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 199/603 (33%), Positives = 300/603 (49%), Gaps = 50/603 (8%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           + S  NPL++        + D+  IGNGRLG  + G   +E + LNED+ W+G   D  N
Sbjct: 26  ANSAANPLRLWQTTAGVTYNDSFLIGNGRLGFSLPGSALTEAITLNEDSFWSGGKMDRVN 85

Query: 66  PDAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
           PDA   +  ++ L+  G+  EA T A +   G P  V  Y  LG + L       +    
Sbjct: 86  PDAAANMPQIQQLITQGRIEEAATLAGMAYKGLPDSVRHYDWLGRLHLAMKGPAGQAGN- 144

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS------ 176
            Y R LD+    A V Y++    F+RE+ +S PDQ+I  ++  ++SGS+SF +S      
Sbjct: 145 -YERWLDVGEGLAGVDYTLNGTAFSREYLASFPDQIIAVRMKSNQSGSISFTLSQSRGSG 203

Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
           L+   D  + ++G+  I+M G   G               I FS+  ++ +S   G+I  
Sbjct: 204 LNRFQDYTTSLDGDT-ILMGGGSMGS------------DAIVFSSGAKVTVSG--GSIKT 248

Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
           +  + + V  +D AV+   A +++  P       K+      +  L++     Y  + + 
Sbjct: 249 I-GETIVVSDADSAVIYWTAWTTYRKP-------KEQLRESVLVDLRTAAAKGYDAIRSE 300

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           H+ DYQKL  RV + L  S         SE+   +  +A+R++      DP +  L F F
Sbjct: 301 HVKDYQKLAGRVDLNLGMS--------SSEQK--SKSTAQRLRGMSQAFDPEMATLYFYF 350

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
            RYLLI+S RPGT  ANLQGIWN D+SP W S   VNINL+MNYW +L  N+ E    L 
Sbjct: 351 ARYLLIASGRPGTLPANLQGIWNTDISPQWGSKYTVNINLQMNYWPALLTNMPELHHSLL 410

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
           D L  +  NG   A+  Y ASG V HH TD+W   +          WP G  WL TH++E
Sbjct: 411 DHLKIMHENGKDVARRMYNASGSVCHHNTDLWGDCAPQDNYAASTFWPTGLGWLVTHVYE 470

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG---KL 533
           HY +T D   L +  YP+L   A F LD+L E + G+L TNPS SPE ++  P+    + 
Sbjct: 471 HYLFTGDEQVL-RDYYPVLRDSALFFLDFLTE-YQGHLVTNPSVSPEIQYYLPNSTTRQG 528

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSIME 592
             ++   T D +II EVF  +  A E+L   E     ++++ +  RL P +  + G + E
Sbjct: 529 VALTLGPTCDNSIIWEVFGLVFHATEILGNVEGKEFQDRLMSARARLPPLRRDQYGGLAE 588

Query: 593 WVQ 595
           ++ 
Sbjct: 589 FIH 591


>gi|423330223|ref|ZP_17308007.1| hypothetical protein HMPREF1075_00020 [Parabacteroides distasonis
           CL03T12C09]
 gi|409231839|gb|EKN24687.1| hypothetical protein HMPREF1075_00020 [Parabacteroides distasonis
           CL03T12C09]
          Length = 809

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 197/611 (32%), Positives = 310/611 (50%), Gaps = 53/611 (8%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N     T   +   F+ PA+ + + +P+GNGR+G M  GG+  E + LNE +LW+G   D
Sbjct: 16  NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
             NP A  +L+++R L+  G+  EA     K F               P   YQL G++ 
Sbjct: 76  TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135

Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
           L++   +   +   YRR L+L+ A A V +  GNV + RE F+S    + V  +      
Sbjct: 136 LKYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADTDR 195

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
           +L+F++ ++     H+ ++ + + ++M G+ P            + KG++F++   ++I 
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245

Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
             +G   A  D  L V  +  A++L+ + +  FD          KD   +S+   L    
Sbjct: 246 LPKGGDLATTDSCLSVRSASEAIILVSLGTDYFD----------KDGAGQSLEKYLSQAE 295

Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
           +  +S L   H   Y+ LF RVS+ L R  +D             +P  ER+ +F  D+ 
Sbjct: 296 SKDFSTLRREHTLAYRSLFDRVSLDLGRGERD------------HLPINERLAAFAQDKN 343

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
           DP L  L FQFGRYLLISS+R G    NLQG+W   +   W+   H+NINL+MN+W +  
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 403

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
            NLSE   PL ++      +G +TA+  Y A GWV H   ++W + +A      W     
Sbjct: 404 TNLSELHLPLIEWTKLQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 462

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
             AWLC HL+ HY YT+D+ +L +  YP ++G A F +D L++     YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 521

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
            +  P+G +  +   STMD  I+RE+F+  I AA +L   + A   ++     RL PT I
Sbjct: 522 AYKMPNGSVVSICAGSTMDNQIVRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 580

Query: 585 AEDGSIMEWVQ 595
            +DG IMEW++
Sbjct: 581 GKDGRIMEWLE 591


>gi|380472541|emb|CCF46724.1| alpha-L-fucosidase [Colletotrichum higginsianum]
          Length = 780

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 198/589 (33%), Positives = 292/589 (49%), Gaps = 51/589 (8%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           + +  PA  + +A+PIGNGRLGAMV+G   +E ++LNED++W G P D T  DA + L  
Sbjct: 21  LHYQSPASEWAEALPIGNGRLGAMVYGRTGTELVQLNEDSVWYGGPQDRTPKDALRHLPK 80

Query: 75  VRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           +R L+   ++AEA +      F  PA +  Y+ LG   +E    H       YRR L L+
Sbjct: 81  LRQLIRDEKHAEAESLVREAFFATPASMRHYEPLGTCTIEL--GHAVEDVTGYRRHLCLD 138

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TA   V+Y    V + R+  +S P+ V+  +++ SE       ++  S ++  +    ++
Sbjct: 139 TAQTTVEYLSRGVSYRRDAIASFPNNVLAFRVTASEPTRFVVRLNRVSEIEWETNEFLDS 198

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
               +GR      P   N+N      + S +L +   D +G++ A+ +  L V+ S    
Sbjct: 199 IEADDGRIVLNATPGGRNSN------RLSIVLGVSCHDAQGSVEAIGNS-LVVKSSS-CT 250

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           + + A +++             P + +   ++   +L + DL   H  DYQ LF R +++
Sbjct: 251 IAIGAQTTY---------RTLHPETVATEDVRKALDLPWDDLIRHHRSDYQTLFGRTALR 301

Query: 312 L----SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           +    S +P D+                      +   D  LV L   +GRYLLISSSR 
Sbjct: 302 MWPDASHNPTDM--------------------RIEKGRDAGLVALYHNYGRYLLISSSRH 341

Query: 368 GTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
             +   A LQGIWN   +P W S   +NINL+MNYW + PCNL EC  P+ D L  ++  
Sbjct: 342 AEKALPATLQGIWNPSFAPPWGSKYTININLQMNYWPAGPCNLVECAIPVLDLLERMAER 401

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G KTAQ  Y   GW  HH TDIWA +      +   +WP+GG WLC  ++E   Y  D D
Sbjct: 402 GRKTAQAMYGCRGWCAHHNTDIWADTDPQDRWMPSTIWPLGGVWLCIDVFEMLQYHHD-D 460

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
            L +RA  +LEGC  FLLD+LI    G YL TNPS SPE+ FI+  GK   +   S +D 
Sbjct: 461 GLHRRAAAVLEGCILFLLDFLIPSSCGKYLVTNPSLSPENTFISNSGKAGILCEGSAIDT 520

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            IIR  F   + +  +L  NE  L  KV ++L +L        G I EW
Sbjct: 521 TIIRIAFEKFLWSNSMLGTNE-PLCSKVREALGKLPELMTNAHGLIQEW 568


>gi|256840971|ref|ZP_05546478.1| glycoside hydrolase, family 95 [Parabacteroides sp. D13]
 gi|298375740|ref|ZP_06985696.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_19]
 gi|256736814|gb|EEU50141.1| glycoside hydrolase, family 95 [Parabacteroides sp. D13]
 gi|298266777|gb|EFI08434.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_19]
          Length = 811

 Score =  300 bits (768), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 197/611 (32%), Positives = 310/611 (50%), Gaps = 53/611 (8%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N     T   +   F+ PA+ + + +P+GNGR+G M  GG+  E + LNE +LW+G   D
Sbjct: 18  NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 77

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
             NP A  +L+++R L+  G+  EA     K F               P   YQL G++ 
Sbjct: 78  TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 137

Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
           L++   +   +   YRR L+L+ A A V +  GNV + RE F+S    + V  +      
Sbjct: 138 LKYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADTDR 197

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
           +L+F++ ++     H+ ++ + + ++M G+ P            + KG++F++   ++I 
Sbjct: 198 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 247

Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
             +G   A  D  L V  +  A++L+ + +  FD          KD   +S+   L    
Sbjct: 248 LPKGGDLATTDSCLSVRSASEAIILVSLGTDYFD----------KDGAGQSLEKYLSQAE 297

Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
           +  +S L   H   Y+ LF RVS+ L R  +D             +P  ER+ +F  D+ 
Sbjct: 298 SKDFSTLRREHTLAYRSLFDRVSLDLGRGERD------------HLPINERLAAFAQDKN 345

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
           DP L  L FQFGRYLLISS+R G    NLQG+W   +   W+   H+NINL+MN+W +  
Sbjct: 346 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 405

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
            NLSE   PL ++      +G +TA+  Y A GWV H   ++W + +A      W     
Sbjct: 406 TNLSELHLPLIEWTKLQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 464

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
             AWLC HL+ HY YT+D+ +L +  YP ++G A F +D L++     YL T P+TSPE+
Sbjct: 465 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 523

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
            +  P+G +  +   STMD  I+RE+F+  I AA +L   + A   ++     RL PT I
Sbjct: 524 AYKMPNGSVVSICAGSTMDNQIVRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 582

Query: 585 AEDGSIMEWVQ 595
            +DG IMEW++
Sbjct: 583 GKDGRIMEWLE 593


>gi|296130834|ref|YP_003638084.1| alpha-L-fucosidase [Cellulomonas flavigena DSM 20109]
 gi|296022649|gb|ADG75885.1| Alpha-L-fucosidase [Cellulomonas flavigena DSM 20109]
          Length = 809

 Score =  300 bits (768), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 195/611 (31%), Positives = 300/611 (49%), Gaps = 41/611 (6%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           M++  + +T + L +  + PA+ +TDA P+GNGRLGAMV GG  +E L++N+DT W+G P
Sbjct: 1   MIDDGAVTTASGLVLRLDEPARWWTDAFPVGNGRLGAMVHGGTGAERLQVNDDTCWSGAP 60

Query: 61  GDYT-------NPD-APKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEF 112
            D T        PD AP  +   R L+  G    A     KL       YQ L D+ +E 
Sbjct: 61  HDGTVEPVGPLGPDGAPGVVRRARHLLAEGDPLAAQDELAKLQSGWVQAYQPLVDVLVEQ 120

Query: 113 DDSHLKYAEETYRRELDLNTATARVKY-SVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
             +      + YRR LDL        + S     + +E   S+PD  ++ + +G+  G  
Sbjct: 121 PGA---AGRDDYRRVLDLARGVVTTTWRSAAGEPWRQEVLVSHPDGALLLERAGA-PGET 176

Query: 172 SFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA----ILEIKI 227
              ++      +     G+  ++     P   +P   +  D P  +Q+            
Sbjct: 177 RVRLASPHPWASTPAAAGDGILVATLDMPSHVLP---DWVDGPDPVQYGGRSVHAAVALA 233

Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN 287
                   A+ D +++V G+    ++L +++  D   +       D    +  AL  +R 
Sbjct: 234 VLADDAPVAVVDGEVRVTGARRVRVVLTSATDHD---VATGTLHGDRERVAADALAGLRG 290

Query: 288 L--SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE 345
                  +  RH+ D+  L  RVS+ L  +P D+  D            A   +    + 
Sbjct: 291 ALADVDGIPARHVADHAALLGRVSLDLVAAPPDLPLD------------ARLARHAAGEP 338

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
           D  L  L FQ GRYL ++ SRPGT   NLQGIWNE + P W S   +NIN EMNYW +L 
Sbjct: 339 DAHLAVLAFQLGRYLTVAGSRPGTLPLNLQGIWNERVRPPWSSNYTININTEMNYWPALV 398

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA-KSSADRG--KVVWAL 462
            +L+EC EPL  +L  L+  G +TA+  Y A GWV HH +D W       RG     W+ 
Sbjct: 399 GDLAECHEPLLSWLDRLAAAGRQTARTLYGARGWVAHHNSDPWCFTGPTGRGHDSASWSA 458

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
           WP+GGAWL  H+ +H+++T D D L +R +P++   A  +LD L+E  DG L T+P TSP
Sbjct: 459 WPLGGAWLARHVVDHHDWTGDDDAL-RRHWPVVRDAARAVLDLLVELPDGTLGTSPGTSP 517

Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
           E+ ++ PDG+ A V+ S+T D+AI+R++   +   A V+   ++ L   V  +L RL   
Sbjct: 518 ENHYLLPDGRPAAVAVSTTADLAIVRDLLEQVRRLAPVVRDRDEDLRAAVDGALERLPTE 577

Query: 583 KIAEDGSIMEW 593
           ++A DG + EW
Sbjct: 578 RVAPDGRLAEW 588


>gi|149199940|ref|ZP_01876968.1| hypothetical protein LNTAR_09881 [Lentisphaera araneosa HTCC2155]
 gi|149137009|gb|EDM25434.1| hypothetical protein LNTAR_09881 [Lentisphaera araneosa HTCC2155]
          Length = 793

 Score =  300 bits (767), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 193/591 (32%), Positives = 300/591 (50%), Gaps = 65/591 (10%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           +PIGNG++GAMV+GGV  E +    D+LW+G V G      + K +  +R ++   +Y  
Sbjct: 55  LPIGNGKIGAMVYGGVEQEKINFTIDSLWSGKVDGTQNLAGSYKGMEQLRGMLMKDEYDA 114

Query: 87  ATAASVKLFGHP--AD----VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKY 139
           A   +  L G    AD     +Q  GD+     D+ +K+   + Y+R+LD+N A + V++
Sbjct: 115 AHKLAKDLIGSSPSADGNFGTFQTFGDLVF---DTGIKFESVSDYQRKLDINNALSVVEF 171

Query: 140 SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV---NGNNQIIME 196
           ++G  ++TR  F S+PDQ +V +   S  GS   N+ L     N  +V   NGN+ I++ 
Sbjct: 172 TMGKHKYTRTAFVSHPDQCLVLRFEVSAGGSQ--NIKLGFETPNKDWVPRINGND-IVIS 228

Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
           G+     +P  A      +G +FSA         +GT+S        VEG+      L A
Sbjct: 229 GKAAQNHMPVNARIRVKHEGGKFSA--------SKGTLS--------VEGARVVEFYLSA 272

Query: 257 SSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSP 316
            ++FD  +  P+   + P  E +  L      SY++L  RHL+DY+ LF R++I +  S 
Sbjct: 273 DTAFD--YKAPNRIGEAPDQEVLKTLNQASEKSYAELLERHLEDYKDLFDRLTIDIGDSS 330

Query: 317 KDIVTDTCSEENIDTVPSAERVKSF------QTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            ++            +P   R+K++        + DP L+E ++Q+GRYLLI+SSRPGT 
Sbjct: 331 LEL----------RNMPMEARLKNYGDSLASNANPDPDLIETIYQYGRYLLIASSRPGTL 380

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQG+WN  L+P W +  H+NINL+MNYW + P NL EC+EPL  F+  L   G  TA
Sbjct: 381 PANLQGVWNNSLTPPWAADYHININLQMNYWLAGPTNLIECEEPLLKFIESLVEPGRITA 440

Query: 431 QVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
           +  + + GW+ +H T+IW  ++      +GK+ W        WL  HL+EH+ Y  D+  
Sbjct: 441 KEYFNSEGWMSYHATNIWGHTAPRVGRGKGKLTWKALTTCSLWLSHHLYEHFAYRQDKSQ 500

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           L+   +P+L   A F   +L +  DG   + PS S EH           +S  +  D+A 
Sbjct: 501 LKNEIWPVLAEAADFAAGYLTQLPDGAYTSMPSWSSEH---------GLISKGAITDIAT 551

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRR 597
            REV    +  AE+L  N +    K       L   KI + G + EW++ R
Sbjct: 552 TREVLQCALECAEILGINNER-TAKWKNRKDNLLAYKIGQHGQLQEWLEDR 601


>gi|429848646|gb|ELA24104.1| alpha-l-fucosidase [Colletotrichum gloeosporioides Nara gc5]
          Length = 791

 Score =  299 bits (766), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 207/621 (33%), Positives = 307/621 (49%), Gaps = 91/621 (14%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +N PA  + DA PIGNGRLGAMV G    E L +NED++W G P +  NP A  AL  VR
Sbjct: 8   YNKPANLWDDATPIGNGRLGAMVRGTTDVERLWINEDSVWYGGPQNRLNPAARDALPKVR 67

Query: 77  SLVDSGQYAEA--------TAASVKL------------FGH----PADVYQLLGDIELEF 112
            L+D  +  EA        TA    L            FGH    P D  ++ G +  E 
Sbjct: 68  ELIDQNRIREAEQLIKKTQTARPRSLRHYEPLGDVFLTFGHGQDPPGDEVRVSGIVNFEN 127

Query: 113 DDSH-LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
             S  L  + + YRRELDL T  + V Y  G   + R+ FSS  D+VI   IS    G  
Sbjct: 128 SFSRDLNRSPQNYRRELDLRTGISSVSYDFGGAHYERQVFSSTVDEVIA--ISVRSEGEY 185

Query: 172 SFNV------------SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQF 219
           SF +             L+   D+   ++G + I       G               ++F
Sbjct: 186 SFQIDLNRGDHPEWDRRLNQRYDSLEEIDGGHMITGSMGLKG--------------AVEF 231

Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLL-----LVASSSFDGPFINPSDSKKDP 274
           +  + +++  D G      D +++V+ + + V++     ++   S +  F NP+  +   
Sbjct: 232 A--MGVRVIADPG------DGEVQVDNTGYNVVVNAKDRVIVLVSGETTFRNPNAGEAVQ 283

Query: 275 TSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPS 334
              + ++++S     ++DL + H++ +  L+ RV +QL  S                VP 
Sbjct: 284 NRLATASMKS-----WNDLKSAHVERFSALYDRVELQLPGSGDKT-----------AVPI 327

Query: 335 AERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
            +R+++  Q   D  L +LLF FGRYLLIS S  G   ANLQGIWN D  P W S   +N
Sbjct: 328 DQRIQAVKQGAVDNGLAQLLFHFGRYLLISCSLSGLP-ANLQGIWNRDHMPVWGSKYTIN 386

Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
           IN++MNYW +   NL+E  + LF FL   +  G++TA+  Y   GWV+HH TDIWA ++ 
Sbjct: 387 INIQMNYWPAEVANLAETHDVLFRFLERTAERGAETAKAMYGCRGWVMHHNTDIWADTAP 446

Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
               V    W + GAW   HLWEHY +  D+DFL +R YPL+ G A F  D+L+E  DG 
Sbjct: 447 QDDGVQCTYWTLSGAWFMIHLWEHYRFGRDKDFL-RRVYPLMAGSALFFQDFLVE-RDGK 504

Query: 514 LETNPSTSPEHE-FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
           L T+PS+S E+  +I     +A ++     D  I+ E+F A++ A ++L ++     EKV
Sbjct: 505 LITSPSSSAENSYYILGTKTVASIAAGPAWDGQILTELFRAVVEAGKLLGEDTSEF-EKV 563

Query: 573 LKSLPRLRPTKIAEDGSIMEW 593
           L  LP     ++ + G +MEW
Sbjct: 564 LAKLP---TPQMGKHGQVMEW 581


>gi|423304137|ref|ZP_17282136.1| hypothetical protein HMPREF1072_01076 [Bacteroides uniformis
           CL03T00C23]
 gi|423310748|ref|ZP_17288732.1| hypothetical protein HMPREF1073_03482 [Bacteroides uniformis
           CL03T12C37]
 gi|392681018|gb|EIY74381.1| hypothetical protein HMPREF1073_03482 [Bacteroides uniformis
           CL03T12C37]
 gi|392685663|gb|EIY78977.1| hypothetical protein HMPREF1072_01076 [Bacteroides uniformis
           CL03T00C23]
          Length = 820

 Score =  299 bits (765), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 194/604 (32%), Positives = 311/604 (51%), Gaps = 50/604 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + + +P+GNGRLG M  GG+  E + LNE +LW+G+  DY+NPDA K+L 
Sbjct: 29  QLYYTSPAAIWEETLPLGNGRLGMMPDGGILREHIVLNEISLWSGMEADYSNPDASKSLP 88

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPAD-------VYQLLGDIELEFDD----SHLKYAEE 122
            +R L+  G+  EA       F             YQ+LGD++++F      S L     
Sbjct: 89  AIRQLLFEGKNREAQELMYSSFVPKKQETDGRYGTYQVLGDLDIDFTYNSSLSILNSPLN 148

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            YRR L+L  A A   + + +V++ RE+F S    V++  +     G+L+F+  L     
Sbjct: 149 NYRRWLNLRDAVAYTAFRLEDVDYRREYFVSRDRDVMLIHLVAGREGALNFSARLSRAEH 208

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
           +   V GN  ++M+G     +  P  +      G+++   +++  +    ++S     +L
Sbjct: 209 SSVTVQGNT-LLMDGMLESGK--PGLD------GMKYRVAMQLVQNGGESSVSPENGIRL 259

Query: 243 KVEGSDWAVL-----LLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
           K     W +L        A + F G  +    DS   P +   ++  SI + S+S     
Sbjct: 260 KNGQEAWLILSAATSYAAAGTDFPGERYAEVCDSLLRPFTAPANSPCSILHSSFSS---- 315

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           H+  ++ L+ RVS+ L  +P D            T+P+ ER+  F   E P+L  L + +
Sbjct: 316 HVTAHRFLYDRVSLTLPATPDD------------TLPTNERILRFTQQESPALAALYYNY 363

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLISS+RPG+   NLQG+W   +S  W+   H NIN++MN+W      LSE  +PL 
Sbjct: 364 GRYLLISSTRPGSLPPNLQGLWANGVSTPWNGDYHTNINIQMNHWPLEQAGLSELYQPLT 423

Query: 417 DFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             +  L  +G  +A+  Y   A GWV+H  T++W   +A      W     GGAWLC HL
Sbjct: 424 TLMERLIPSGEASARTFYGDEADGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHL 482

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
           WEHY YT D+D+L +R YP+L+G A F     + E   G+L T P++SPE+ F  P   +
Sbjct: 483 WEHYLYTQDKDYL-RRIYPVLKGAARFFSSTTVQEPSHGWLVTAPTSSPENSFYVPGDSV 541

Query: 534 ACVS--YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
             VS     TMD+ ++ E++  +I+AA +L+ + D  V K+   L R  P +I+++G + 
Sbjct: 542 TPVSICMGPTMDVQLLTELYINVIAAARLLDCDAD-YVAKLEADLKRFPPMQISKEGYLQ 600

Query: 592 EWVQ 595
           EW++
Sbjct: 601 EWLE 604


>gi|229173820|ref|ZP_04301360.1| hypothetical protein bcere0006_29180 [Bacillus cereus MM3]
 gi|228609670|gb|EEK66952.1| hypothetical protein bcere0006_29180 [Bacillus cereus MM3]
          Length = 1156

 Score =  299 bits (765), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 199/600 (33%), Positives = 314/600 (52%), Gaps = 69/600 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
           L + +N PAK +   A+PIGNG +G MV+GGV  E ++ NE TLWTG P       Y N 
Sbjct: 47  LSLWYNQPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPNSTSEYTYGNR 106

Query: 67  D-APKALSDVRSLV--DSGQYAEATAASV-----KLFGHPADVYQLLGDIELEFDDSHLK 118
           D A   L  +R  +  D    AE  ++       K FG     YQ  GDI L+F+     
Sbjct: 107 DGAASHLGSIREKLAKDDKSGAERESSQFLTGLQKGFGS----YQNFGDIYLDFNMPDAS 162

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
            +   YRREL++N   A V Y+   V++ RE+F+S PD+V+V +++ SES  LS +V   
Sbjct: 163 -SFSNYRRELNVNEGIATVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDVRPT 221

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
           S          +N+I ++G+           AN+   G+++ +  E K+ ++ GT++A E
Sbjct: 222 SAQGGQVSAT-DNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLTA-E 264

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           + K+KV  +D   +++ A++ ++  +  P+   +DP  +    + +I   SY  L   H+
Sbjct: 265 NGKIKVANADSLTIIMTAATDYENKY--PTYKGEDPHQKIEKIMSAISKKSYEVLKYTHM 322

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
            DY  LF+RVS+ L                  +VP+ E + S+  +    L EL FQ+GR
Sbjct: 323 KDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQYGR 369

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSSRPGT  ANLQG+WN   +P W+S  H NINL+MNYW +   NLSE  EPL D+
Sbjct: 370 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPLMDY 429

Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
           +  L   G  +A+ ++     GW ++   + +  ++   G + W   P   A++  ++WE
Sbjct: 430 VDSLREPGRVSAEKHFGVKGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNVWE 488

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           HY +T D+ +L+++ YP+++  A F  ++L+E  +  L  +P  SPE         L  +
Sbjct: 489 HYKFTDDKQYLQEKIYPIIKEAAEFHSNFLVEDQNKKLVVSPCWSPE---------LGGI 539

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           S     D  ++ E+FS +I A+EVL+ +    D L  K  +  P   P +I   G + EW
Sbjct: 540 SNGCAFDQQLVYELFSNVIEASEVLQIDNVFRDELKAKRERLFP---PIQIGRYGQVQEW 596


>gi|254475685|ref|ZP_05089071.1| conserved hypothetical protein [Ruegeria sp. R11]
 gi|214029928|gb|EEB70763.1| conserved hypothetical protein [Ruegeria sp. R11]
          Length = 792

 Score =  298 bits (764), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 186/601 (30%), Positives = 291/601 (48%), Gaps = 62/601 (10%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA--LSDVRSLVDSGQYAEATAASVKLF 95
           MV+GG     + LNEDTL++G P +   P  P A  +  V  L++ G+Y EA     + F
Sbjct: 1   MVYGGADIFKMHLNEDTLYSGEPSEVFKP-TPVADQVPKVSKLLEQGEYEEAQELVRRSF 59

Query: 96  -GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSN 154
            G     YQ +G   +E  +   + +   Y R LD+          V + +  R+ + S+
Sbjct: 60  LGKQGASYQPVGYFLVEPRN---RVSASAYERRLDIGNGVHSETIEVDDAKILRDCYISH 116

Query: 155 PDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG------------K 202
             Q IV  +  S    L+ +  + +   N    +   + +  G+ P             +
Sbjct: 117 EHQAIVITMETSADEGLNLDARIVTQHPNGKATHRGRRYVFSGQAPSFTQHAKSLLQMHQ 176

Query: 203 RI---------------------PPKANA------NDDPKGIQFSAILEIKISDDRGTIS 235
           R+                     P + ++      N D +G+       + +  D GT+ 
Sbjct: 177 RLGDTWKQPALYDRNGDIHPYLTPAEMSSEHTVLYNQDGRGLGMFFEAAVDVRHDGGTVE 236

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
            + D  + +        L+  ++S++G   +PS    DP   + + L ++  ++   + +
Sbjct: 237 -VSDAGISLTNVQSVTFLISLATSYNGFDKSPSREGADPVRRNNNVLDALVGVAEPKIRS 295

Query: 296 RHLDDYQKLFHRVSIQL-SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
            H DD Q L  RVS+ L   SP ++ TD             +R+K  Q   DP L  L F
Sbjct: 296 SHTDDIQALMSRVSLHLDGESPANLTTD-------------QRLKQAQDRPDPELAALAF 342

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           Q+GRYLLISSSRPG+Q  NLQGIWN      W S   +NINL+MNYW + P  L+E  EP
Sbjct: 343 QYGRYLLISSSRPGSQPPNLQGIWNNSTCAMWSSNYTMNINLQMNYWPAEPTGLAELTEP 402

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
           LF+ +  LS+ G++ A+  + A GW+  H T +W + +        A WP+G  WL  HL
Sbjct: 403 LFNLIDELSVTGARQAKHMFDAPGWMAFHNTTLWREVTPSHATPQSAFWPVGAGWLVAHL 462

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
           WE Y Y+ D +FL  RA+P +EG   FLLDW++EG DG+L T  STSPE++F+  +G   
Sbjct: 463 WERYEYSGDLEFLRDRAWPRMEGALEFLLDWMVEGSDGFLTTPISTSPENKFLDENGVEC 522

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            V   STMD+AIIR +   ++ AAE L+K  + +  +   +L +L P +    G ++EW 
Sbjct: 523 TVHQGSTMDIAIIRGLLEQMLQAAEALDKPAE-ISARYQTALDKLPPYRTGAKGELLEWA 581

Query: 595 Q 595
           +
Sbjct: 582 E 582


>gi|330933451|ref|XP_003304180.1| hypothetical protein PTT_16648 [Pyrenophora teres f. teres 0-1]
 gi|311319408|gb|EFQ87743.1| hypothetical protein PTT_16648 [Pyrenophora teres f. teres 0-1]
          Length = 792

 Score =  298 bits (764), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 203/606 (33%), Positives = 311/606 (51%), Gaps = 47/606 (7%)

Query: 4   AESTSTTNPLKITFNGPAKH--FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           A   S  N  ++ +  PA+   +TDA+PIGNGRLGAM +G    E + LNE+T+W+G   
Sbjct: 14  ASLASAGNNTRLWYTTPAQSSAWTDALPIGNGRLGAMAFGIPVQERIALNEETIWSGGQQ 73

Query: 62  DYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLK 118
           D    ++P+ +S+VR L+  G   +A   A++ + G P     YQ LGD+++ FD +   
Sbjct: 74  DRIGQNSPQTVSEVRDLLAQGHAGDAEKLANMGMMGTPQSCRNYQPLGDMDIFFDGT-TG 132

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y   TY+R LD++TA A V++ V    + RE F S PD V+V  +  + SG LSF + + 
Sbjct: 133 YDNATYKRWLDVDTALAGVQFQVNGTLYEREMFVSAPDDVLVHHLKATGSGKLSFQIRV- 191

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
               +     GN     E    G           DP  + F+  L ++ SD  G +  L 
Sbjct: 192 ----HRPEKGGNEASDHEWNADGLAYMTGGAGGIDP--VVFTTALAVQ-SD--GHVKNL- 241

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
              + +E +  A  +  AS+S+            D  +   S +Q  R  +Y +L  RH+
Sbjct: 242 GPFIVIENATEATAIFAASTSY---------RHNDTRAAVESTIQQARQHTYEELRQRHI 292

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFG 357
            DY  L++   + LS S  DI           ++P+  R+ + +    DP+L  L + +G
Sbjct: 293 ADYAPLYNASVLDLSGS--DI--------EASSLPTDARINATREGASDPALAALSYNYG 342

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI+SSR G   +NLQGIWN++ +P W S   VNINL+MNYW +   +LS   EPLFD
Sbjct: 343 RYLLIASSRAGNLPSNLQGIWNKEFAPQWGSKYTVNINLQMNYWPAEVTSLSSLHEPLFD 402

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            L  +  +G+KTA+  Y ASGWV HH TD+W  ++     +    W +   WL TH+ EH
Sbjct: 403 LLDLMRKDGTKTARQMYNASGWVTHHNTDLWGDTAPVDRWLPATYWTLSSGWLVTHILEH 462

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAPDGKL 533
           Y YT D+ FL  +   + E  A F LD L    I G   YL TNPS SPE+ ++  D   
Sbjct: 463 YWYTGDKKFLASKLDVVSEAIA-FYLDILQPYSINGTQ-YLVTNPSVSPENSYLDADNNT 520

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAED--GS 589
                + T D+ I+ E+F+  ++A   L  +  +   +  +  +  +L P + ++   G+
Sbjct: 521 YHFDIAPTCDIEILNELFTNYLNAVATLPNSTVDSTFLTHIRDTQAKLPPYRYSKRYPGT 580

Query: 590 IMEWVQ 595
           + EW+Q
Sbjct: 581 LQEWMQ 586


>gi|317477822|ref|ZP_07937009.1| glycoside hydrolase [Bacteroides sp. 4_1_36]
 gi|316906021|gb|EFV27788.1| glycoside hydrolase [Bacteroides sp. 4_1_36]
          Length = 820

 Score =  298 bits (762), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 194/604 (32%), Positives = 311/604 (51%), Gaps = 50/604 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + + +P+GNGRLG M  GG+  E + LNE +LW+G+  DY+NPDA K+L 
Sbjct: 29  QLYYTSPAAIWEETLPLGNGRLGMMPDGGILREHIVLNEISLWSGMEADYSNPDASKSLP 88

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPAD-------VYQLLGDIELEFDD----SHLKYAEE 122
            +R L+  G+  EA       F             YQ+LGD++++F      S L     
Sbjct: 89  AIRQLLFEGKNREAQELMYSSFVPKKQETDGRYGTYQVLGDLDIDFTYNSSLSILNSPLN 148

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            YRR L+L  A A   + + +V++ RE+F S    V++  +     G+L+F+  L     
Sbjct: 149 NYRRWLNLRDAVAYTAFRLEDVDYRREYFVSRDRDVMLIHLVAGREGTLNFSARLSRAEH 208

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
           +   V GN  ++M+G     +  P  +      G+++   +++  +    ++S      L
Sbjct: 209 SSVTVQGNT-LLMDGMLESGK--PGLD------GMKYRVAMQLVQNGGESSVSPGNGICL 259

Query: 243 KVEGSDWAVL-----LLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
           K     W +L        A + F G  +    DS   P +   ++  SI + S S+    
Sbjct: 260 KNGQEAWLILSAATSYAAAGTDFPGERYAEVCDSLLRPFTTPANSPCSILHSSLSN---- 315

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           H+  ++ L+ RVS+ L  +P D            T+P+ ER+  F   E P+L  L + +
Sbjct: 316 HVTAHRFLYDRVSLTLPATPDD------------TLPTNERILRFTQQESPALAALYYNY 363

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLISS+RPG+   NLQG+W   +S  W+   H NIN++MN+W      LSE  +PL 
Sbjct: 364 GRYLLISSTRPGSLPPNLQGLWANGVSTPWNGDYHTNINIQMNHWPLEQAGLSELYQPLT 423

Query: 417 DFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             +  L  +G  +A+  Y   A GWV+H  T++W   +A      W     GGAWLC HL
Sbjct: 424 TLMERLVPSGEASARTFYGDEADGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHL 482

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
           WEHY YT DRD+L +R YP+L+G A F     + E   G+L T P++SPE+ F  P   +
Sbjct: 483 WEHYLYTQDRDYL-RRIYPVLKGAARFFSSTTVQEPSHGWLVTAPTSSPENSFYVPGDSV 541

Query: 534 ACVS--YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
             VS     TMD+ ++ E+++ +I+AA +L+ + D  V K+   L +  P +I+++G + 
Sbjct: 542 TPVSICMGPTMDVQLLTELYTNVIAAARLLDCDAD-YVAKLEADLKKFPPMQISKEGYLQ 600

Query: 592 EWVQ 595
           EW++
Sbjct: 601 EWLE 604


>gi|115443166|ref|XP_001218390.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114188259|gb|EAU29959.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 796

 Score =  298 bits (762), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 195/596 (32%), Positives = 308/596 (51%), Gaps = 55/596 (9%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA  +  ++PIGNGRLGA VWG    E + LNE+++W+G   D  NP+A    +  R
Sbjct: 31  YESPASDYAGSLPIGNGRLGATVWG-TAVEKITLNENSIWSGPFQDRVNPNAYDGFTQAR 89

Query: 77  SLVDSGQYAEATAASVKLFGH----PADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
           SL++ G    A   +++        P + Y  LG + L+F+  H       YRR LDL +
Sbjct: 90  SLLEKGDMTGAGEVTLRDMASIPTSPRE-YHPLGVLHLDFN--HDVNLMTNYRRSLDLYS 146

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGN 190
             A V+Y    V ++RE+ +S P  VI  +++ SE G+L+   SL  D  + ++S  + N
Sbjct: 147 GNAVVEYDYNGVRYSREYIASAPAGVIAIRVTASEPGNLTVACSLARDRYVIDNSASSPN 206

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
              I+       R+   AN  D    IQF  I E +I    G + +     +  + +   
Sbjct: 207 ETGIL-------RL--MANTGDMEDPIQF--ISEARIIGHGGRVVSNSTTVVVRDATSVE 255

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           +     +S     +  P + K++  +E    L +     Y+ + T  + D+  L  RV+I
Sbjct: 256 IFFDAETS-----YRYPDEDKRE--AEMDRKLSTAMGRGYNAVKTAAVADHLSLARRVNI 308

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ--TDEDPSLVELLFQFGRYLLISSSR-- 366
           +L            S  +   +P+  R+K+++   D DP L  L+F FGR+ LI+SSR  
Sbjct: 309 KLG-----------SSGSAGQLPTDTRLKNYKDNPDSDPELATLMFNFGRHSLIASSRQS 357

Query: 367 --PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
             PG   ANLQGIWN+D SP W     V++NLEMNYW +   NL++  +P  D +  +  
Sbjct: 358 GSPGLP-ANLQGIWNQDYSPAWGGKYTVDVNLEMNYWPAEVTNLADTFDPFMDLMDTVVP 416

Query: 425 NGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           +G   A+  Y     G+V+HH TD+W  ++       W +WPMG AWL  +L +HY +T 
Sbjct: 417 HGIDVAKRMYQCDNGGYVLHHNTDLWGDAAPVDNGTTWTMWPMGSAWLSENLMQHYRFTQ 476

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVS 537
           +++ L +R +PLL+  A F   +L E  DGY  + PS SPE+ FI P      GK   + 
Sbjct: 477 NKEVLRERIWPLLKSAAQFYYCYLFE-FDGYFSSGPSISPENAFIVPSDMSVAGKSEGID 535

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            S TMD A++ E+F+++I  A++LE   +  V+K  + L +++P +I  DG I+EW
Sbjct: 536 ISPTMDNALLYELFNSVIETADILEITGEE-VDKAKEYLAKIKPPQIGSDGQILEW 590


>gi|270294825|ref|ZP_06201026.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270274072|gb|EFA19933.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 820

 Score =  297 bits (761), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 193/604 (31%), Positives = 309/604 (51%), Gaps = 50/604 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + + +P+GNGRLG M  GG+  E + LNE +LW+G+  DY+NPDA K+L 
Sbjct: 29  QLYYTSPAAIWEETLPLGNGRLGMMPDGGILREHIVLNEISLWSGMEADYSNPDASKSLP 88

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPAD-------VYQLLGDIELEFDD----SHLKYAEE 122
            +R L+  G+  EA       F             YQ+LGD++++F      S L     
Sbjct: 89  AIRQLLFEGKNREAQELMYSSFVPKKQETDGRYGTYQVLGDLDIDFTYNSSLSILNSPLN 148

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            YRR L+L  A A   + + +V++ RE+F S    V++  +     G+L+F+  L     
Sbjct: 149 NYRRWLNLRDAVAYTAFRLEDVDYRREYFVSRDRDVMLIHLVAGREGTLNFSARLSRAEH 208

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
           +   V GN  ++M+G           +      G+++   +++  +    ++S      L
Sbjct: 209 SLVTVQGNT-LLMDGML--------ESGKPGLDGMKYRVAMQLVQNGGESSVSPENGICL 259

Query: 243 KVEGSDWAVL-----LLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
           K     W +L        A + F G  +    DS   P +   ++  SI + S S+    
Sbjct: 260 KNGQEAWLILSAATSYAAAGTDFPGERYAEVCDSLLRPFTTPANSPCSILHSSLSN---- 315

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           H+  ++ L+ RVS+ L  +P D            T+P+ ER+  F   E P+L  L + +
Sbjct: 316 HVTAHRFLYDRVSLTLPATPDD------------TLPTNERILRFTQQESPALAALYYNY 363

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLISS+RPG+   NLQG+W   +S  W+   H NIN++MN+W      LSE  +PL 
Sbjct: 364 GRYLLISSTRPGSLPPNLQGLWTNGVSTPWNGDYHTNINIQMNHWPLEQAGLSELYQPLT 423

Query: 417 DFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             +  L  +G  +A+  Y   A GWV+H  T++W   +A      W     GGAWLC HL
Sbjct: 424 TLMERLVPSGEASARTFYGDEADGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHL 482

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
           WEHY YT DRD+L +R YP+L+G A F     + E   G+L T P++SPE+ F  P   +
Sbjct: 483 WEHYLYTQDRDYL-RRIYPVLKGAARFFSSTTVQEPSHGWLVTAPTSSPENSFYVPGDSV 541

Query: 534 ACVS--YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
             VS     TMD+ ++ E+++ +I+AA +L+ + D  V K+   L +  P +I+++G + 
Sbjct: 542 TPVSICMGPTMDVQLLTELYTNVIAAARLLDCDAD-YVAKLEADLKKFPPMQISKEGYLQ 600

Query: 592 EWVQ 595
           EW++
Sbjct: 601 EWLE 604


>gi|302884741|ref|XP_003041265.1| hypothetical protein NECHADRAFT_88794 [Nectria haematococca mpVI
           77-13-4]
 gi|256722164|gb|EEU35552.1| hypothetical protein NECHADRAFT_88794 [Nectria haematococca mpVI
           77-13-4]
          Length = 765

 Score =  297 bits (760), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 194/589 (32%), Positives = 304/589 (51%), Gaps = 44/589 (7%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L++ +N P+  +++++P+GNGRLGA+V G   +E L+LNE+++W+G P + T PDA + L
Sbjct: 8   LRLQYNSPSSQWSESLPVGNGRLGAVVHGQPGAEVLQLNENSVWSGGPQERTPPDARRML 67

Query: 73  SDVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +RSL+ + ++AEA A +   F  +P     Y+ +G    EF    +      Y R LD
Sbjct: 68  PKLRSLIRADKHAEAEALAKLAFYANPKSQRHYEPMGTASFEFGHEQVS----NYHRHLD 123

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L TA A V+Y  G   + R+  +S PD V++ + + S+     F V LD + D+    N 
Sbjct: 124 LATAQAVVEYEHGGASYRRDMIASFPDNVLLWRFTASQ--KTRFIVRLDRINDDPIETNT 181

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
               I   +  G RI   A       G +  ++L     D+ G I A+      V  S  
Sbjct: 182 YADTI---KSEGSRIVLHATPR-GAGGNRLCSVLRAVCDDEEGAIEAV--GSCLVINSAS 235

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
             + + A ++F  P         DP   + + +      ++S+L  RH  DY+ LF R+S
Sbjct: 236 CTIAIGAQTTFRHP---------DPELVATTDVDCALMRTWSELVVRHRRDYEGLFGRMS 286

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           +++     +  TD              R+++ Q+  DP LV L   +GRYLLISSSR G 
Sbjct: 287 LRMWPDASEKPTDA-------------RLETRQS-RDPGLVALYHNYGRYLLISSSRDGH 332

Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL-SECQEPLFDFLTYLSING 426
           +   A LQGIWN   +P W S   +NINL+MNYW + PC+L  EC  P+ D L  +SI G
Sbjct: 333 RALPATLQGIWNPSFTPPWGSKYTININLQMNYWLTAPCSLVDECTLPVIDLLERMSIRG 392

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            +TA+  Y   GW  HH TDIWA +S     +   +WP+GG W+   + +   Y    + 
Sbjct: 393 QETAKAMYGCRGWCAHHNTDIWADTSPQDHWISATVWPLGGLWVSVTVMDMLRYQYSEE- 451

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L +R +   EG   F++D+L+   DG YL  NPS SPE+ F +  G++      STMDM 
Sbjct: 452 LHRRIFACHEGAVQFVIDFLVPSSDGLYLIANPSISPENTFYSTTGEVGVFCEGSTMDMT 511

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEW 593
           +IR   +  + + + LE  ++  ++ V++ +L R+ P  + + G I EW
Sbjct: 512 LIRVALTQFLWSLDRLEGLQEHTLKTVVQDTLDRIPPILVNDAGRIQEW 560


>gi|150009027|ref|YP_001303770.1| glycoside hydrolase [Parabacteroides distasonis ATCC 8503]
 gi|149937451|gb|ABR44148.1| glycoside hydrolase family 95 [Parabacteroides distasonis ATCC
           8503]
          Length = 809

 Score =  296 bits (759), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 197/611 (32%), Positives = 308/611 (50%), Gaps = 53/611 (8%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N     T   +   F+ PA+ + + +P+GNGR+G M  GG+  E + LNE +LW+G   D
Sbjct: 16  NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
             NP A  +L+++R L+  G+  EA     K F               P   YQL G++ 
Sbjct: 76  TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135

Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
           L++   +   +   YRR L+L+ A A V +  GNV + RE F+S    + V  +      
Sbjct: 136 LKYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADTDR 195

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
           +L+F++ ++     H+ ++ + + ++M G+ P            + KG++F++   ++I 
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245

Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
             +G   A  D  L V  +  A++L+ + +  FD          KD   +S+   L    
Sbjct: 246 LPKGGDLATTDSCLSVRSASEAIILVSLGTDYFD----------KDGAGQSLEKYLSQAE 295

Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
           +  +S L   H   Y+ LF RVS+ L R  +D             +P  ER+ +F  D+ 
Sbjct: 296 SKDFSTLRREHTLAYRSLFDRVSLDLGRGERD------------HLPINERLAAFAQDKN 343

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
           DP L  L FQFGRYLLISS+R G    NLQG+W   +   W+   H+NINL+MN+W +  
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 403

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
            NLSE   PL ++      +G +TA+  Y A GWV H   ++W + +A      W     
Sbjct: 404 TNLSELHLPLIEWTKLQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 462

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
             AWLC HL+ HY YT+D+ +L +  YP ++G A F +D L++     YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 521

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
            +  P+  +  +   STMD  I+RE+F+  I AA +L  +     E   K   RL PT I
Sbjct: 522 AYKMPNESVVSICAGSTMDNQILRELFTNTIEAAGILGVDSTFAAELAAKR-DRLMPTTI 580

Query: 585 AEDGSIMEWVQ 595
            +DG IMEW++
Sbjct: 581 GKDGRIMEWLE 591


>gi|160887922|ref|ZP_02068925.1| hypothetical protein BACUNI_00326 [Bacteroides uniformis ATCC 8492]
 gi|156862608|gb|EDO56039.1| hypothetical protein BACUNI_00326 [Bacteroides uniformis ATCC 8492]
          Length = 820

 Score =  296 bits (758), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 193/604 (31%), Positives = 311/604 (51%), Gaps = 50/604 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + + +P+GNGRLG M  GG+  E + LNE +LW+G+  DY+NPDA K+L 
Sbjct: 29  QLYYTSPAAIWEETLPLGNGRLGMMPDGGILREHIVLNEISLWSGMEADYSNPDASKSLP 88

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPAD-------VYQLLGDIELEFDD----SHLKYAEE 122
            +R L+  G+  EA       F             YQ+LGD++++F      S L     
Sbjct: 89  AIRQLLFEGKNREAQELMYSSFVPKKQETDGRYGTYQVLGDLDIDFTYNSSLSILNSPLN 148

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            YRR L+L  A A   + + +V++ RE+F S    V++  +     G+L+F+  L     
Sbjct: 149 NYRRWLNLRDAVAYTAFRLEDVDYRREYFVSRDRDVMLIHLVAGHEGTLNFSARLSRAEH 208

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
           +   V GN  ++M+G     +  P  +      G+++   +++  +    ++S      L
Sbjct: 209 SLVTVQGNT-LLMDGMLESGK--PGLD------GMKYRVAMQLVQNGGESSVSPENGICL 259

Query: 243 KVEGSDWAVL-----LLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
           K     W +L        A + F G  +    DS   P +   ++  +I + S S+    
Sbjct: 260 KNGQEAWLILSAATSYAAAGTDFPGERYAEVCDSLLRPFTAPANSPCAILHSSLSN---- 315

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           H+  ++ L+ RVS+ L  +P D            T+P+ ER+  F   E P+L  L + +
Sbjct: 316 HVTAHRSLYDRVSLTLPATPDD------------TLPTNERILRFTQQESPALAALYYNY 363

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLISS+RPG+   NLQG+W   +S  W+   H NIN++MN+W      LSE  +PL 
Sbjct: 364 GRYLLISSTRPGSLPPNLQGLWANGVSTPWNGDYHTNINIQMNHWPLEQAGLSELYQPLT 423

Query: 417 DFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             +  L  +G  +A+  Y   A GWV+H  T++W   +A      W     GGAWLC HL
Sbjct: 424 TLMERLIPSGEASARTFYGDEADGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHL 482

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
           WEHY YT D+D+L +R YP+L+G A F     + E   G+L T P++SPE+ F  P   +
Sbjct: 483 WEHYLYTQDKDYL-RRIYPVLKGAARFFSSTTVQEPSHGWLVTAPTSSPENSFYVPGDSV 541

Query: 534 ACVS--YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
             VS     TMD+ ++ E+++ +I+AA +L+ + D  V K+   L R  P +I+++G + 
Sbjct: 542 TPVSICMGPTMDVQLLTELYTNVIAAARLLDCDAD-YVAKLEVDLKRFPPMQISKEGYLQ 600

Query: 592 EWVQ 595
           EW++
Sbjct: 601 EWLE 604


>gi|301312083|ref|ZP_07218005.1| alpha-L-fucosidase 2 [Bacteroides sp. 20_3]
 gi|423339363|ref|ZP_17317104.1| hypothetical protein HMPREF1059_03029 [Parabacteroides distasonis
           CL09T03C24]
 gi|300830185|gb|EFK60833.1| alpha-L-fucosidase 2 [Bacteroides sp. 20_3]
 gi|409230744|gb|EKN23605.1| hypothetical protein HMPREF1059_03029 [Parabacteroides distasonis
           CL09T03C24]
          Length = 809

 Score =  296 bits (757), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 194/611 (31%), Positives = 307/611 (50%), Gaps = 53/611 (8%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N     T   +   F+ PA+ + + +P+GNGR+G M  GG+  E + LNE +LW+G   D
Sbjct: 16  NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
             NP A  +L+++R L+  G+  EA     K F               P   YQL G++ 
Sbjct: 76  TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135

Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
           L +   +   +   YRR L+L+ A A V +  GNV + RE F+S    + V  +      
Sbjct: 136 LRYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADADR 195

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
           +L+F++ ++     H+ ++ + + ++M G+ P            + KG++F++   ++I 
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245

Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
             +G      D  L V  +  A++L+ + +  FD          KD   +S+   L    
Sbjct: 246 LPKGGDLTATDSSLSVRSASEAIILVSLGTDYFD----------KDGVGQSLEKYLSQAE 295

Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
           +  +S L   H   Y+ LF RVS+ L +  +D             +P  ER+ +F  D+ 
Sbjct: 296 SKDFSTLRREHTFAYRSLFDRVSLDLGKGERD------------HLPIHERLAAFAQDKN 343

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
           DP L  L FQFGRYLLISS+R G    NLQG+W   +   W+   H+NINL+MN+W +  
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 403

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
            NLSE   PL ++      +G +TA+  Y A GW  H   ++W + +A      W     
Sbjct: 404 TNLSELHLPLIEWTKQQVPSGERTAKAFYNARGWGTHILGNVW-EFTAPGEHPSWGATNT 462

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
             AWLC HL+ HY YT+D+ +L +  YP ++G A F +D L++     YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAAQFFVDMLVQDPRTKYLVTAPTTSPEN 521

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
            +  P+G +  +   STMD  I+RE+F+  I AA +L   + A   ++     RL PT I
Sbjct: 522 AYKMPNGSVVSICAGSTMDNQILRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 580

Query: 585 AEDGSIMEWVQ 595
            +DG IMEW++
Sbjct: 581 GKDGRIMEWLE 591


>gi|423668781|ref|ZP_17643810.1| hypothetical protein IKO_02478 [Bacillus cereus VDM034]
 gi|423675093|ref|ZP_17650032.1| hypothetical protein IKS_02636 [Bacillus cereus VDM062]
 gi|401300760|gb|EJS06350.1| hypothetical protein IKO_02478 [Bacillus cereus VDM034]
 gi|401309028|gb|EJS14402.1| hypothetical protein IKS_02636 [Bacillus cereus VDM062]
          Length = 1156

 Score =  295 bits (756), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 194/600 (32%), Positives = 308/600 (51%), Gaps = 69/600 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
           L + +N PAK +   A+PIGNG +G MV+GGV  E ++ NE TLWTG P       Y N 
Sbjct: 47  LTLWYNQPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 106

Query: 67  D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK 118
           D A   L  +R  +  G  + A   S +        FG     YQ  GDI L+F+     
Sbjct: 107 DGAASHLGSIREKLAKGDKSGAEKESSQFLTGLEKGFGS----YQNFGDIYLDFNMPDAS 162

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
            +   YRREL++N   A V Y+  +V++ RE+F+S PD+V+V +++ SE+  +S +V   
Sbjct: 163 -SFSNYRRELNINEGIATVSYNYKDVQYNREYFTSYPDRVMVMRLTASEAKKISLDVRPT 221

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
           S        + +N+I M+G+                 G+++ A    K+ ++ GT++A E
Sbjct: 222 SAQGGQ-VTSVDNKITMKGQITNN-------------GMKYEAAF--KVLNEGGTLTA-E 264

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           + K+KV  +D   +++ A++ ++  +  P+   +DP  +    + +I   SY  L   H+
Sbjct: 265 NGKIKVANADSLTIIMTAATDYENKY--PAYKGEDPHEKVEKTMAAISKKSYEVLKYTHI 322

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
            DY  LF+RVS+ L                  +VP+ E + S+  +    L EL FQ+GR
Sbjct: 323 KDYHSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQYGR 369

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSSRPGT  ANLQG+WN   +P W+S  H NINL+MNYW +   NLSE   PL D+
Sbjct: 370 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETALPLMDY 429

Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
           +  L   G  +A+ ++     GW ++   + +  ++   G + W   P   A++  ++WE
Sbjct: 430 VDSLREPGRVSAEKHFGVKGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNVWE 488

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           HY +T D+ +L+++ YP++   A F   +L+E  +  L  +P  SPE         L  +
Sbjct: 489 HYKFTDDKQYLKEKIYPIINEAAEFHSKFLVEDQNKKLVVSPCWSPE---------LGGI 539

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           S     D  ++ E+FS +I A+EVL+ +    D L  K  +  P   P +I   G + EW
Sbjct: 540 SNGCAFDQQLVYELFSNVIEASEVLQIDNVFRDELKAKRDRLFP---PIQIGRYGQVQEW 596


>gi|265753143|ref|ZP_06088712.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
 gi|263236329|gb|EEZ21824.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
          Length = 803

 Score =  295 bits (756), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 182/598 (30%), Positives = 311/598 (52%), Gaps = 44/598 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           +T +    ++ +  PAK + +++PIGNGRLGAM +GG+  E L LNE T+W+G   +  N
Sbjct: 24  ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 83

Query: 66  -PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
            P   + ++ +R L   G+ +E    A   L G+      +  +GD++++F     K  +
Sbjct: 84  KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 143

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             YRR L L+ A + V ++ G V + RE+F++NPD V+V +++  +  S++ N+ LD L+
Sbjct: 144 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 200

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
                   NNQ++  G+      P        P G+ F     I +  D G +  +E   
Sbjct: 201 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 249

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           + ++ +D   L++   + +  P         D  +     ++     SY +L   H+ DY
Sbjct: 250 VSIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVEKAAAKSYDELKQAHIKDY 300

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
             L++RVSI   +          +   + T    ++VK  +TD    L  L FQ+GRYL 
Sbjct: 301 NTLYNRVSIHFGQD---------ANRAMPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 349

Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           I+SSR  + +   LQG +N++ +    W +  H++IN E NYW +   NL+EC  PLF +
Sbjct: 350 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 409

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +  L+ +G+KTA+V Y   GW  H   ++W  + A    ++W L+PM G+W+ +HLW  Y
Sbjct: 410 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQY 468

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
            +T D+ +L + AYPLL+G A F+LD+L +    GYL T PS SPE+ F    G+    S
Sbjct: 469 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 528

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
                D  +  E+ S  + A+E+L+ + +   + +  ++ +L P ++  +G+I EW +
Sbjct: 529 MMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFE 585


>gi|212695253|ref|ZP_03303381.1| hypothetical protein BACDOR_04793 [Bacteroides dorei DSM 17855]
 gi|237711725|ref|ZP_04542206.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
 gi|212662163|gb|EEB22737.1| hypothetical protein BACDOR_04793 [Bacteroides dorei DSM 17855]
 gi|229454420|gb|EEO60141.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
          Length = 818

 Score =  295 bits (756), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 181/599 (30%), Positives = 309/599 (51%), Gaps = 46/599 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           +T +    ++ +  PAK + +++PIGNGRLGAM +GG+  E L LNE T+W+G   +  N
Sbjct: 39  ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 98

Query: 66  -PDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
            P   + ++ +R L   G+ +E    A   L G+      +  +GD++++F     K  +
Sbjct: 99  KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 158

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             YRR L L+ A + V ++ G V + RE+F++NPD V+V +++  +  S++ N+ LD L+
Sbjct: 159 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 215

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
                   NNQ++  G+      P        P G+ F     I +  D G +  +E   
Sbjct: 216 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 264

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           + ++ +D   L++   + +  P         D  +     ++     SY +L   H+ DY
Sbjct: 265 VSIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVEKAAAKSYDELKQAHIKDY 315

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYL 360
             L++RVSI   +                 +P+  R K  +  + D  L  L FQ+GRYL
Sbjct: 316 NTLYNRVSIHFGQDANR------------AMPTDVRWKQVKEGKTDTGLDALFFQYGRYL 363

Query: 361 LISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
            I+SSR  + +   LQG +N++ +    W +  H++IN E NYW +   NL+EC  PLF 
Sbjct: 364 TIASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFT 423

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           ++  L+ +G+KTA+V Y   GW  H   ++W  + A    ++W L+PM G+W+ +HLW  
Sbjct: 424 YIKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQ 482

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACV 536
           Y +T D+ +L + AYPLL+G A F+LD+L +    GYL T PS SPE+ F    G+    
Sbjct: 483 YEFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVA 542

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S     D  +  E+ S  + A+E+L+ + +   + +  ++ +L P ++  +G+I EW +
Sbjct: 543 SMMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFE 600


>gi|312131012|ref|YP_003998352.1| alpha-l-fucosidase [Leadbetterella byssophila DSM 17132]
 gi|311907558|gb|ADQ17999.1| Alpha-L-fucosidase [Leadbetterella byssophila DSM 17132]
          Length = 805

 Score =  295 bits (756), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 210/603 (34%), Positives = 315/603 (52%), Gaps = 61/603 (10%)

Query: 6   STSTTNPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
           S S     K+ +  PA K +  A+P+GNG +G MV+G    E + LNE + W+G P   +
Sbjct: 14  SLSFAQEYKMWYQNPAGKVWEKALPVGNGFIGGMVYGNTEEERIDLNETSFWSGGPYATS 73

Query: 65  NPDAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAE 121
                 +L  +RSLV S +Y EA   A+  LF H +   ++  +G + L+F     +   
Sbjct: 74  PTLNRDSLEKLRSLVFSEKYKEAENMANRVLFSHGSHGQMFLPIGSLILKFPG---QKEA 130

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
            +Y RELDL+ A A  ++SVG   + RE F+   ++V+V K+S +E+ ++          
Sbjct: 131 TSYYRELDLSKAVASTRFSVGKQNYEREVFTPLQEKVLVMKLSSTEAMNVEVLYRTPLPE 190

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDK 240
                V GN     E +  G+ I     A++  +G ++F  I+ +K S   G  S+  D 
Sbjct: 191 GRVVQVQGN-----ELQIGGRNI-----AHEGSEGALRFHGIIHVKQS---GGNSSRTDS 237

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            L +  +   VL +  ++++        D K    +   SAL+S     Y++L  +H++ 
Sbjct: 238 SLIISNAKELVLYVSLATNYQSYQDVSGDEKALARARLTSALKS----PYTELKRKHIEK 293

Query: 301 YQKLFHRVSIQLS---RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
           YQ L++RV + L    R P DI                 R++ F+   DP    L FQFG
Sbjct: 294 YQSLYNRVELTLGSDRREPTDI-----------------RLEKFREGNDPGFAALYFQFG 336

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLISSS+PG Q ANLQGIWN  + P WDS   +NIN EMNYW +   NLSE  +PLF+
Sbjct: 337 RYLLISSSQPGGQPANLQGIWNASIRPPWDSKYTININTEMNYWPAERTNLSEMHKPLFE 396

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            +  L+  G+ TA+  Y A GWV HH TD+W + +       + LWP GGAWL  H+WEH
Sbjct: 397 MVKDLTKTGAVTAKRLYGAGGWVAHHNTDLW-RLTWPVDAAFYGLWPSGGAWLSQHIWEH 455

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG--YLETNPSTSPEHEFIAPDG-KLA 534
           Y YT +  FL K    +L G A F +D +++ H    YL  NPSTSPE+   AP+  + +
Sbjct: 456 YQYTGNLHFL-KENQEVLFGAARFYVD-ILQKHPKYPYLVINPSTSPEN---APEAHQRS 510

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
            +S   TMD  +  +VF   I A+++L    +  D+L +++LK LP   P  I + G + 
Sbjct: 511 SLSAGVTMDNQLAFDVFQNAIWASKILGVKTQFSDSL-KQLLKQLP---PMHIGKHGQLQ 566

Query: 592 EWV 594
           EW+
Sbjct: 567 EWL 569


>gi|423241353|ref|ZP_17222466.1| hypothetical protein HMPREF1065_03089 [Bacteroides dorei
           CL03T12C01]
 gi|392641729|gb|EIY35503.1| hypothetical protein HMPREF1065_03089 [Bacteroides dorei
           CL03T12C01]
          Length = 800

 Score =  295 bits (756), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 182/598 (30%), Positives = 311/598 (52%), Gaps = 44/598 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           +T +    ++ +  PAK + +++PIGNGRLGAM +GG+  E L LNE T+W+G   +  N
Sbjct: 21  ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 80

Query: 66  -PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
            P   + ++ +R L   G+ +E    A   L G+      +  +GD++++F     K  +
Sbjct: 81  KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 140

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             YRR L L+ A + V ++ G V + RE+F++NPD V+V +++  +  S++ N+ LD L+
Sbjct: 141 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 197

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
                   NNQ++  G+      P        P G+ F     I +  D G +  +E   
Sbjct: 198 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 246

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           + ++ +D   L++   + +  P         D  +     ++     SY +L   H+ DY
Sbjct: 247 VSIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVEKAAAKSYDELKQAHIKDY 297

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
             L++RVSI   +          +   + T    ++VK  +TD    L  L FQ+GRYL 
Sbjct: 298 NTLYNRVSIHFGQD---------ANRAMPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 346

Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           I+SSR  + +   LQG +N++ +    W +  H++IN E NYW +   NL+EC  PLF +
Sbjct: 347 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 406

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +  L+ +G+KTA+V Y   GW  H   ++W  + A    ++W L+PM G+W+ +HLW  Y
Sbjct: 407 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQY 465

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
            +T D+ +L + AYPLL+G A F+LD+L +    GYL T PS SPE+ F    G+    S
Sbjct: 466 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 525

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
                D  +  E+ S  + A+E+L+ + +   + +  ++ +L P ++  +G+I EW +
Sbjct: 526 MMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFE 582


>gi|423231014|ref|ZP_17217418.1| hypothetical protein HMPREF1063_03238 [Bacteroides dorei
           CL02T00C15]
 gi|423244725|ref|ZP_17225800.1| hypothetical protein HMPREF1064_02006 [Bacteroides dorei
           CL02T12C06]
 gi|392630134|gb|EIY24136.1| hypothetical protein HMPREF1063_03238 [Bacteroides dorei
           CL02T00C15]
 gi|392641574|gb|EIY35350.1| hypothetical protein HMPREF1064_02006 [Bacteroides dorei
           CL02T12C06]
          Length = 800

 Score =  295 bits (756), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 182/598 (30%), Positives = 311/598 (52%), Gaps = 44/598 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           +T +    ++ +  PAK + +++PIGNGRLGAM +GG+  E L LNE T+W+G   +  N
Sbjct: 21  ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 80

Query: 66  -PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
            P   + ++ +R L   G+ +E    A   L G+      +  +GD++++F     K  +
Sbjct: 81  KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 140

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             YRR L L+ A + V ++ G V + RE+F++NPD V+V +++  +  S++ N+ LD L+
Sbjct: 141 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 197

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
                   NNQ++  G+      P        P G+ F     I +  D G +  +E   
Sbjct: 198 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 246

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           + ++ +D   L++   + +  P         D  +     ++     SY +L   H+ DY
Sbjct: 247 VSIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVEKAAAKSYDELKQAHIKDY 297

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
             L++RVSI   +          +   + T    ++VK  +TD    L  L FQ+GRYL 
Sbjct: 298 NTLYNRVSIHFGQD---------ANRAMPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 346

Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           I+SSR  + +   LQG +N++ +    W +  H++IN E NYW +   NL+EC  PLF +
Sbjct: 347 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 406

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +  L+ +G+KTA+V Y   GW  H   ++W  + A    ++W L+PM G+W+ +HLW  Y
Sbjct: 407 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQY 465

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
            +T D+ +L + AYPLL+G A F+LD+L +    GYL T PS SPE+ F    G+    S
Sbjct: 466 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 525

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
                D  +  E+ S  + A+E+L+ + +   + +  ++ +L P ++  +G+I EW +
Sbjct: 526 MMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFE 582


>gi|311746349|ref|ZP_07720134.1| fibronectin type III domain protein [Algoriphagus sp. PR1]
 gi|126575233|gb|EAZ79565.1| fibronectin type III domain protein [Algoriphagus sp. PR1]
          Length = 778

 Score =  295 bits (756), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 192/591 (32%), Positives = 310/591 (52%), Gaps = 42/591 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA-LSDV 75
           +  PA+ + +A+P+GNGRLGAMV+G    E ++LNED+LW G  GD+      ++ L  +
Sbjct: 27  YTSPAEIWEEALPVGNGRLGAMVFGKPSMERIQLNEDSLWPGEQGDWGIAKGRRSDLDQI 86

Query: 76  RSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           R+ + +G+  ++ +  V  F   A    +Q LGD+ L+FD   +      Y+R LDL TA
Sbjct: 87  RAYLRAGENEKSDSLLVAAFSRKAITRSHQTLGDLWLDFDFQEIS----DYKRSLDLTTA 142

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN--- 190
            A   +       T+E  SS PD  IV ++  +        + L S  ++  +       
Sbjct: 143 VASSTFKSQGYTVTQEVLSSAPDDAIVIRLKTNHPDGFVGKIRL-SRPEDEGFATAETKS 201

Query: 191 ---NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
              N + M G    ++    +N      G++F  ++ ++  D  G ++   D  L++ GS
Sbjct: 202 LSENTLSMAGMITQRKGQLDSNPYPLLTGVKFKTLVYVETED--GNLNNGVDY-LELSGS 258

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
              ++ LV  +SF           +D    +   L++++  ++  +   H+ DY + F R
Sbjct: 259 KEVLIKLVTETSF---------YNQDFDHAAELELENVKTKNWEGILEPHIQDYSQWFER 309

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSR 366
           + ++L ++             +  VP+  R+++ Q    D  L +LLF +GRYLLISSSR
Sbjct: 310 MELKLGKAA------------MSEVPTDVRIENVQAGGVDLHLEKLLFDYGRYLLISSSR 357

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG   ANLQGIWN+D++  W++  H+NINL+MNYW +   NLS+  +PLFDF+  +   G
Sbjct: 358 PGNNPANLQGIWNKDINAPWNADYHLNINLQMNYWPADVTNLSKLNQPLFDFVDGVIHRG 417

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            + AQ N+  +G  + H TD+W           W  W   G W+  H W+HY +T D  F
Sbjct: 418 QEVAQTNFGMAGTFLPHATDLWQVPFMRAATAYWGGWVGAGGWMARHYWDHYLFTKDERF 477

Query: 487 LEKRAYPLLEGCASFLLDWLIE-GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L +RA+P +    +F  DWL+E   +  L + PSTSPE+ F    G+    +  + MD  
Sbjct: 478 LRERAFPAISQVTAFYSDWLVEYPGENTLVSAPSTSPENRFFNEAGRPVATTMGAAMDQQ 537

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWVQ 595
           II +VFS+ ++A+E+L  +E  L ++V + L RLRP  +IAEDG I+EW Q
Sbjct: 538 IIADVFSSFLAASEIL-NSESRLRDRVKEQLARLRPGVQIAEDGRILEWDQ 587


>gi|332881351|ref|ZP_08449001.1| hypothetical protein HMPREF9074_04789 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|357045233|ref|ZP_09106870.1| hypothetical protein HMPREF9441_00875 [Paraprevotella clara YIT
           11840]
 gi|332680727|gb|EGJ53674.1| hypothetical protein HMPREF9074_04789 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|355531816|gb|EHH01212.1| hypothetical protein HMPREF9441_00875 [Paraprevotella clara YIT
           11840]
          Length = 798

 Score =  295 bits (755), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 184/590 (31%), Positives = 307/590 (52%), Gaps = 49/590 (8%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKALSDV 75
           ++ PA  +  ++P+GNGR+GAMV+GGV  ET+ LNE ++W G    +   P   + L ++
Sbjct: 29  YDAPADEWMKSLPVGNGRVGAMVFGGVNEETVALNESSMWAGEYDPNQEKPFGREKLDEL 88

Query: 76  RSLVDSGQYAEATA-ASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
           R L   G+  E    A  +L G  H    +  +GD++++FD +  +   E YRRELDL  
Sbjct: 89  RKLFFEGKLIEGNGIAGRELVGTPHSFGTHLPIGDLKIKFDYTGKEGGVEDYRRELDLTN 148

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           A   V +  G  ++ RE  SSNP   +V   +  +  S+SF++ +  +        GN  
Sbjct: 149 AVVTVSFKKGGTKYKREFISSNPQDAVVMHFTADKKQSVSFDMRMKMITAAQVRTEGNLL 208

Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
           +       G+ + PK        G+ F   + +K+  DRG + A   + ++V+ +D   +
Sbjct: 209 VF-----DGQALFPKLGTG----GVHFQGRVVVKV--DRGEVEA-TGETVRVKHAD--AV 254

Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS--YSDLYTRHLDDYQKLFHRVSI 310
            +VA    D          K+   ES+      + ++  +  +   H+ DY  LF RVS+
Sbjct: 255 TIVADVRTD---------YKNGQYESLCEKTVEKAIARPFETMKEEHVADYAPLFARVSL 305

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGT 369
           +L+   K             ++P   R K+  + ++D  L  L FQ+GRYL I+SSR  +
Sbjct: 306 KLADDSKK------------SIPVDRRWKALCEGNKDAGLQALFFQYGRYLTIASSRENS 353

Query: 370 QV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
            +   LQG +N++L+    W S  H++IN E NYW +   NL+EC  PLF ++  L+ +G
Sbjct: 354 PLPIALQGFFNDNLACNMCWTSDYHLDINTEQNYWLTNVGNLAECNAPLFTYIADLAHHG 413

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
           +KT +  Y   GW  H   ++W  ++   G + W L+P+ G+W+ THLW  Y YT+D+D+
Sbjct: 414 AKTVRTVYGCKGWTAHTVANVWGFTAPSEG-MGWGLFPLAGSWMATHLWTQYEYTLDKDY 472

Query: 487 LEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
           L + AYPLL+G A FLLD+++E  + GY+ T P  SPE+ F     +L   S  +T D  
Sbjct: 473 LRRTAYPLLKGNAEFLLDYMVEDPNTGYMVTGPCVSPENSFRYQGWELG-ASMMTTCDKV 531

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +  E+ SA + A+++L  ++ A  + +  +L +  P +I   G + EW +
Sbjct: 532 LAHEIMSACVQASDILGVDK-AFADSLRLALAKFPPFRINSFGGLCEWYE 580


>gi|262383921|ref|ZP_06077057.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_33B]
 gi|262294819|gb|EEY82751.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_33B]
          Length = 809

 Score =  295 bits (754), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 196/611 (32%), Positives = 308/611 (50%), Gaps = 53/611 (8%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N     T   +   F+ PA+ + + +P+GNGR+G M  GG+  E + LNE +LW+G   D
Sbjct: 16  NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
             NP A  +L+++R L+  G+  EA     K F               P   YQL G++ 
Sbjct: 76  TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135

Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
           L++   +   +   YRR L+L+ A A V +  GNV + RE F+S    + V  +      
Sbjct: 136 LKYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADADR 195

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
           +L+F++ ++     H+ ++ + + ++M G+ P            + KG++F++   ++I 
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245

Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
             +G   A  D  L V  +  A++L+ + +  FD          KD   +S+   L    
Sbjct: 246 LPKGGDLATTDSCLSVRSASEAIILVSLGTDYFD----------KDGVGQSLEKYLSQAE 295

Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
           +  +S L   H   Y+ LF RVS+ L +  +D             +P  ER+ +F  D+ 
Sbjct: 296 SKDFSTLRREHTLAYRSLFDRVSLDLGKGERD------------HLPINERLAAFAQDKN 343

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
           DP L  L FQFGRYLLISS+R G    NLQG+W   +   W+   H+NINL+MN+W +  
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDFHLNINLQMNHWPAEV 403

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
            NLSE   PL ++      +G +TA+  Y A GWV H   ++W + +A      W     
Sbjct: 404 TNLSELHLPLIEWTKQQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 462

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
             AWLC HL+ HY YT+D+ +L +  YP ++G A F +D L++     YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 521

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
            +  P+  +  +   STMD  I+RE+F+  I AA +L  +     E   K   RL PT I
Sbjct: 522 AYKMPNESVVSICAGSTMDNQILRELFTNTIEAAGILGVDSTFAAELAAKR-DRLMPTTI 580

Query: 585 AEDGSIMEWVQ 595
            +DG IMEW++
Sbjct: 581 GKDGRIMEWLE 591


>gi|256425749|ref|YP_003126402.1| alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
 gi|256040657|gb|ACU64201.1| Alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
          Length = 778

 Score =  295 bits (754), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 191/607 (31%), Positives = 310/607 (51%), Gaps = 48/607 (7%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           M  A   +  NPL + ++ PA  + + +P+GNGRLG M  GG+ +E + LN+ TLW+G P
Sbjct: 16  MPAALCKAQQNPLTLKYDKPAAVWEETLPLGNGRLGMMPDGGIQTEKVVLNDITLWSGAP 75

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEF 112
            +  N +A K L  ++ L+  G+  EA +   K F          P   YQ LG+++++F
Sbjct: 76  QNANNYEAYKQLPKIQELLKEGRNDEAQSLMDKDFICTGKGSGDVPFGCYQTLGELQIQF 135

Query: 113 D-DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
             D   K     Y R+L L  A A   Y V NV + RE+F+S  D +   +++ S++G L
Sbjct: 136 AYDKADKVEPTAYERKLSLQQAIASCSYKVNNVTYNREYFTSFGDDLSFIRLTASQAGKL 195

Query: 172 SFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
           +  +++ S  +  +    N ++++ G+          ++ +D KG+Q+ A   +K     
Sbjct: 196 NLRITM-SRPEKAATRTENGELLLYGQL---------DSGNDTKGMQYQA--NVKAQLKG 243

Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
           GTI+  E+  L ++ +   +L + A + F     + +D KK  ++   +A++      Y 
Sbjct: 244 GTITT-EEHALVIKNATEVILYVAAGTDF-----HKNDFKKQISTVLATAVKK----PYE 293

Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSL 349
                H+ +Y KLF+RV + L +                T+ + +R+ +F  +   D  L
Sbjct: 294 AQKQAHMRNYTKLFNRVQVDLGKG------------TAGTLTTDKRLAAFYNNAAADNEL 341

Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
             L +QFGRYL I S+R G    NLQG+W   +   W+   H+++N++MN+W     NLS
Sbjct: 342 PVLFYQFGRYLTICSTRKGLLPPNLQGLWANQVHTPWNGDYHLDVNVQMNHWPVEVSNLS 401

Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
           E   PL D +  L   G +TA+  Y A GWV H  T++W  +        W     G  W
Sbjct: 402 ELNLPLADLVKGLVAPGQRTAKAYYNAPGWVAHVITNVWGFTEPGE-SASWGATKSGSGW 460

Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIA 528
           LC +LWEHY +T D+ +L    YP+L+G A F    LI+    G+L  +PS+SPE+ F  
Sbjct: 461 LCNNLWEHYAFTNDKKYLAD-IYPVLKGSAEFYNSLLIKDEKTGWLVMSPSSSPENAFYL 519

Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
           P+GK A +   +T+D  I+R++F+ II+A+  L  + D   E   K      P  IA DG
Sbjct: 520 PNGKHASICIGATIDNQIVRDLFNNIITASTELGIDADFKKELQQKVALLPPPGVIAPDG 579

Query: 589 SIMEWVQ 595
            IMEW++
Sbjct: 580 RIMEWLE 586


>gi|345513833|ref|ZP_08793348.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
 gi|345456122|gb|EEO45721.2| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
          Length = 800

 Score =  295 bits (754), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 182/598 (30%), Positives = 311/598 (52%), Gaps = 44/598 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           +T +    ++ +  PAK + +++PIGNGRLGAM +GG+  E L LNE T+W+G   +  N
Sbjct: 21  ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 80

Query: 66  -PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
            P   + ++ +R L   G+ +E    A   L G+      +  +GD++++F     K  +
Sbjct: 81  KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 140

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             YRR L L+ A + V ++ G V + RE+F++NPD V+V +++  +  S++ N+ LD L+
Sbjct: 141 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 197

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
                   NNQ++  G+      P        P G+ F     I +  D G +  +E   
Sbjct: 198 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 246

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           + ++ +D   L++   + +  P         D  +     ++     SY +L   H+ DY
Sbjct: 247 VSIKEADTVTLIVDVRTDYKSP---------DYKTLCADGVEKAAVKSYDELKQAHIKDY 297

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
             L++RVSI   +          +   + T    ++VK  +TD    L  L FQ+GRYL 
Sbjct: 298 NTLYNRVSIHFGQD---------ANRAMPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 346

Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           I+SSR  + +   LQG +N++ +    W +  H++IN E NYW +   NL+EC  PLF +
Sbjct: 347 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 406

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +  L+ +G+KTA+V Y   GW  H   ++W  + A    ++W L+PM G+W+ +HLW  Y
Sbjct: 407 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQY 465

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
            +T D+ +L + AYPLL+G A F+LD+L +    GYL T PS SPE+ F    G+    S
Sbjct: 466 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 525

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
                D  +  E+ S  + A+E+L+ + +   + +  ++ +L P ++  +G+I EW +
Sbjct: 526 MMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFE 582


>gi|281419724|ref|ZP_06250723.1| putative alpha-L-fucosidase 2 [Prevotella copri DSM 18205]
 gi|281406253|gb|EFB36933.1| putative alpha-L-fucosidase 2 [Prevotella copri DSM 18205]
          Length = 1246

 Score =  295 bits (754), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 202/634 (31%), Positives = 313/634 (49%), Gaps = 66/634 (10%)

Query: 7   TSTTNPL---------KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
           T+ TNP+          + +N PA ++ +A+P+GNGRLG M  G V  +TL+LNEDT W 
Sbjct: 333 TADTNPIPAPTIESKNHLWYNKPAGYWEEALPLGNGRLGVMHSGSVACDTLQLNEDTFWD 392

Query: 58  GVPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDD 114
             P    N +A   L +V+  + +  YA     +V  +   G     Y+  G + L F  
Sbjct: 393 QGPNTNYNANAFGVLREVQQGIFNKDYASVQNLAVTNWMSQGSHGASYRAAGVVLLGFPG 452

Query: 115 SHLKYAE----------ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS 164
                 E          + Y R LD+NTAT+ V+Y V  V + R  F+S  D V V ++ 
Sbjct: 453 QRFDDMESAQTSDAVDAQGYVRYLDMNTATSNVEYHVKGVGYKRTVFTSFKDNVTVVRLE 512

Query: 165 GSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
             + G L FNV+      ++     +N +  E        P +    +    +     L 
Sbjct: 513 ADQKGKLDFNVAYAGCNKSNIEKLTSNVLYDEHTVKATMGPARDKCENVENKLNLCTYLR 572

Query: 225 I-----KISDD------RGTISALEDK-KLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
           I      I++D      +GT+ A  +  +L V G+ +A +++  +++F        D   
Sbjct: 573 IVDTDGTITNDNVNIYAQGTVGAATNAPRLNVTGATYATIIISQATNFK----KYDDVSG 628

Query: 273 DPTSESMSALQSIRNLS--YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENID 330
           D ++ +++ L++  N    Y    + H   Y+  F RV + L+ +         ++E+ +
Sbjct: 629 DASASALAYLEAYENSKKDYVTTLSDHESVYRAQFDRVDLTLAGN--------ATQESKN 680

Query: 331 TVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLS--PTWDS 388
           T    +R+K F    DP L    FQFGRYLLISSS+PGTQ ANLQGIWN D    P WDS
Sbjct: 681 T---EQRIKEFHKTSDPQLAANYFQFGRYLLISSSQPGTQPANLQGIWNPDARQYPAWDS 737

Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
               NIN+EMNYW +   NL+EC EP  + +  +S+ G++TA+  Y A GW +HH TDIW
Sbjct: 738 KYTSNINVEMNYWPAEVTNLAECHEPFVEMVKDVSVTGAETAKKMYGARGWALHHNTDIW 797

Query: 449 AKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI 507
             + A D G V   +WP   AW C+HLWE Y ++ D+ +L +  YP+++G A F  D+L+
Sbjct: 798 RTTGAVDNGTV--GVWPTCNAWFCSHLWERYLFSGDKTYLAE-VYPIMKGAAEFFQDFLV 854

Query: 508 EG-HDGYLETNPSTSPEHE-----FIAPDGKLACVSY--SSTMDMAIIREVFSAIISAAE 559
           +  + GY+   PS SPE+      +  PDGK A ++      MD  ++ ++      AA 
Sbjct: 855 KDPNTGYMVVCPSNSPENHPGIGSYTKPDGKTANIALFGGVAMDNEMVYDLLKNTALAAR 914

Query: 560 VLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            L+K+ D           ++ P KI + G + EW
Sbjct: 915 ALDKDADFADALDALK-AQITPWKIGQYGQVQEW 947


>gi|329962213|ref|ZP_08300219.1| hypothetical protein HMPREF9446_01800 [Bacteroides fluxus YIT
           12057]
 gi|328530321|gb|EGF57198.1| hypothetical protein HMPREF9446_01800 [Bacteroides fluxus YIT
           12057]
          Length = 834

 Score =  294 bits (752), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 195/619 (31%), Positives = 315/619 (50%), Gaps = 68/619 (10%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + + +P+GNGRLG M  GGV  E + LNE +LW+G+  DY+NPDA K+L 
Sbjct: 29  RLYYTKPASVWEETLPLGNGRLGMMPDGGVLREHIVLNEISLWSGMEADYSNPDASKSLP 88

Query: 74  DVRSLVDSGQYAEATAASVKLF---GHPAD----VYQLLGDIELEF-----------DDS 115
            +R L+  G+  EA       F      AD     YQ LG ++++F           +  
Sbjct: 89  AIRKLLFEGKNREAQELMYSSFVPKKQEADGRYGTYQTLGTLDIDFAYQSQTSVSKSESL 148

Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
            L      YRR LDL  A A   +++  V++ RE+F S    V++  ++    G+L+F+ 
Sbjct: 149 ALDGGTSRYRRCLDLRDAVAYTAFALEGVDYRREYFVSRDRDVMLVHLTAGSKGALNFSA 208

Query: 176 SLDSLLDNHSYVNGNNQII---MEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
            L         V GN  ++   +E   PG+            +G+++   + +++  D G
Sbjct: 209 RLGRAEHGTVTVKGNALLMDGTLESGSPGR------------EGMKYR--VAMQLVSDGG 254

Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM--SALQSIRN--- 287
            ++A  +  + ++    A L+L A++S+     +   S+     +S+  +A   I+N   
Sbjct: 255 EVAADPENGISLKHGQEAWLVLSATTSYAAEGTDFPGSRYAEVCDSLLKNAGVQIKNEMR 314

Query: 288 ----LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
                + +     H   ++ L+ RVS+ L  +P D            T+P+ ER+  F  
Sbjct: 315 MRGMAAEATALKSHAAAHRSLYDRVSLTLPSTPDD------------TLPTDERILRFTR 362

Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
            E P+L  L + +GRYLLISS+RPG+   NLQG+W   L   W+   H NIN++MN+W  
Sbjct: 363 QESPALAALYYNYGRYLLISSTRPGSLPPNLQGLWANSLLTPWNGDYHTNINVQMNHWPL 422

Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWA 461
               LSE  +PL   +  L  +G  TA+  Y   A GWV+H  T++W   +A      W 
Sbjct: 423 EQAGLSELYQPLTTLMERLVPSGEATARTFYGKEAEGWVLHMMTNVW-NYTAPGEHPSWG 481

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPST 520
               GGAWLC HLWEHY YT D+D+L +R YP+L+G A F     + E   G+L T P++
Sbjct: 482 ATNTGGAWLCAHLWEHYLYTQDKDYL-RRIYPVLKGAARFFSSTTVEEPSHGWLVTAPTS 540

Query: 521 SPEHEFIAPDGKLACVS--YSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSL 576
           SPE+ F  P   +  VS     TMD+ ++ E+++ +I+AA +L  + +  A +E  LK  
Sbjct: 541 SPENSFYVPGDSVTPVSICMGPTMDVQLLTELYTNVITAARLLGCDAEYAAKLEADLKKF 600

Query: 577 PRLRPTKIAEDGSIMEWVQ 595
           P   P +I+++G + EW++
Sbjct: 601 P---PMQISKEGYLQEWLE 616


>gi|330996466|ref|ZP_08320348.1| hypothetical protein HMPREF9442_01433 [Paraprevotella xylaniphila
           YIT 11841]
 gi|329573022|gb|EGG54641.1| hypothetical protein HMPREF9442_01433 [Paraprevotella xylaniphila
           YIT 11841]
          Length = 798

 Score =  294 bits (752), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 182/588 (30%), Positives = 306/588 (52%), Gaps = 45/588 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKALSDV 75
           ++ PA  +  ++P+GNGR+GAMV+GGV  ET+ LNE ++W G    +   P     L  +
Sbjct: 29  YDAPADEWMKSLPVGNGRVGAMVFGGVDEETVALNESSMWAGEYDPNQEKPFGRARLDSL 88

Query: 76  RSLVDSGQYAEATA-ASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
           R L  +G+  E    A  +L G  H    +  +GD++++FD +  +   E YRRELDL  
Sbjct: 89  RELFFAGKLIEGNGIAGRELVGTPHSFGTHLPIGDLKIKFDYAGKEGGVEDYRRELDLTN 148

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           A A V +  G  ++ RE+ SSNP   +V   +  +  S+SF++ +  +        GN  
Sbjct: 149 AVATVSFKKGGTKYKREYISSNPQDAVVMHFTADKKQSVSFDMRMKMITAAQVRTEGNLL 208

Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
           +       G+ + PK        G++F   + +K+  D G + A   + ++V+ +D   +
Sbjct: 209 VF-----DGQALFPKLGTG----GVKFQGRVVVKV--DNGEVEA-AGETVRVKHAD--AV 254

Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
            +VA    D      +   +    E+++         +  +   H+ DY  LF RVS++L
Sbjct: 255 TIVADVRTDYKNGQYASLCEKTVGEAIAR-------PFETMKEEHVADYAPLFARVSLKL 307

Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           +   K             +VP   R K+  + ++D  L  L FQ+GRYL I+SSR  + +
Sbjct: 308 ADDSKK------------SVPVDRRWKALCEGNKDAGLQALFFQYGRYLTIASSRENSPL 355

Query: 372 -ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
              LQG +N++L+    W S  H++IN E NYW +   NL+EC  PLF ++  L+ +G+K
Sbjct: 356 PIALQGFFNDNLACNMCWTSDYHLDINTEQNYWLANVGNLAECNAPLFTYIADLARHGAK 415

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           T +  Y   GW  H   ++W  ++   G + W L+P+ G+W+ THLW  Y YT+D+D+L 
Sbjct: 416 TVRTVYGCKGWTAHTVANVWGFTAPSEG-MGWGLFPLAGSWMATHLWTQYEYTLDKDYLR 474

Query: 489 KRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
           + AYPLL+G A FLLD+++E  + GY+ T P  SPE+ F     +L   S  +T D  + 
Sbjct: 475 RTAYPLLKGNAEFLLDYMVEDPNTGYMVTGPCVSPENSFRYQGWELG-ASMMTTCDRVLA 533

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            E+ SA + A+++L  ++D   + +  +L +  P ++   G + EW +
Sbjct: 534 HEIMSACVQASDILGVDKD-FADSLRLALAKFPPFRVNSYGGLCEWYE 580


>gi|427388255|ref|ZP_18884138.1| hypothetical protein HMPREF9447_05171 [Bacteroides oleiciplenus YIT
           12058]
 gi|425724838|gb|EKU87712.1| hypothetical protein HMPREF9447_05171 [Bacteroides oleiciplenus YIT
           12058]
          Length = 829

 Score =  293 bits (751), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 192/611 (31%), Positives = 319/611 (52%), Gaps = 59/611 (9%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + + +P+GNGRLG M  GG+  E + LNE +LW+G+  DY NPDA ++L 
Sbjct: 33  QLYYTTPATIWEETLPLGNGRLGMMPDGGIDKEHIVLNEISLWSGMEADYGNPDASRSLP 92

Query: 74  DVRSLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFDDSHLK--YAEET- 123
            ++ L+  G+  EA       F       G     YQ+L D+ L F     K  ++ +T 
Sbjct: 93  AIQQLLFEGKNREAQELMYNSFVPKKPESGGTYGNYQMLADLTLNFSIPVKKEFFSGDTV 152

Query: 124 ----YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
               YRR LDL  A A   ++ G +++ RE+++S    V++  ++ S   SL F  SL  
Sbjct: 153 PVTGYRRWLDLRDAVAYTTFTKGGIDYQREYYTSRDKDVMIIHLTASRRRSLFFTASLSR 212

Query: 180 LLDNH-SYVNGNNQ----IIMEGRC----PGKRIPPKANANDDPKGIQFSAILEIKISDD 230
                 S+V GN +    +++EG      PG+             G+++   + +   D 
Sbjct: 213 PQQGTVSFVPGNGKESGTLLLEGVLDSGKPGQ------------DGMKYRVAMRVVSKDG 260

Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM--SALQSIRNL 288
           +  ISA E+  +  +G++ A L++ A++S+     + S S+     +S+  +A QS   L
Sbjct: 261 KQHISA-ENGVMLTQGTE-AWLVISATTSYAAAGTDFSGSRYKEVCDSLLNAATQSHSQL 318

Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
           S  +   ++   +++L+ RVS+ L  +  D             +P+ ER+  F   E P+
Sbjct: 319 SILNSQLKNAS-HRELYDRVSLTLPATEDD------------ALPTNERIVRFTERESPA 365

Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
           L  L + +GRYLLISS+RPG+   NLQG+W   +   W+   H NIN++MN+W      L
Sbjct: 366 LATLYYNYGRYLLISSTRPGSLPPNLQGLWANGIQTPWNGDYHTNINIQMNHWPLEQAGL 425

Query: 409 SECQEPLFDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
           SE  +PL   +  L  +G +TA   Y   A GWV+H  T++W   +A      W     G
Sbjct: 426 SELYQPLTTLIERLVPSGKETACTFYGNRAQGWVLHMMTNVW-NYTAPGEHPSWGATNTG 484

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE 525
           GAWLCTHLWEHY YT D ++L K+ YP+L+G + F    ++ E   G+L T P++SPE+ 
Sbjct: 485 GAWLCTHLWEHYQYTQDLEYL-KKIYPILKGASEFFYSTMVQEPKHGWLVTAPTSSPENA 543

Query: 526 -FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
            F+  D     +    TMD+ ++ E+++ ++ AA +L K +D    K+  +L +  P +I
Sbjct: 544 FFVGDDPTPVSICMGPTMDVQLLTELYTNVVQAASIL-KCDDGYAAKLRAALEKFPPMQI 602

Query: 585 AEDGSIMEWVQ 595
           +++G + EW++
Sbjct: 603 SKEGYLQEWLE 613


>gi|255014859|ref|ZP_05286985.1| glycoside hydrolase family protein [Bacteroides sp. 2_1_7]
          Length = 850

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 193/611 (31%), Positives = 305/611 (49%), Gaps = 53/611 (8%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N     T   +   F+ PA+ + + +P+GNGR+G M  GG+  E + LNE +LW+G   D
Sbjct: 57  NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 116

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
             NP A  +L+++R L+  G+  EA     K F               P   YQL G++ 
Sbjct: 117 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 176

Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
           L +   +   +   YRR L+L+ A A V +  GNV + RE F+S    + V  +      
Sbjct: 177 LRYMYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADADR 236

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
           +L+F++ ++     H+ ++ + + ++M G+ P            + KG++F++   ++I 
Sbjct: 237 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 286

Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
             +G      D  L V  +  A++L+ + +  FD          KD   + +   L    
Sbjct: 287 LPKGGDLTATDSSLSVRSASEAIILVSLGTDYFD----------KDGVGQFLEKYLSQAE 336

Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
           +  +S L   H   Y+ LF RVS+ L +  +D             +P  ER+ +F  D+ 
Sbjct: 337 SKDFSTLRREHTLAYRSLFDRVSLDLGKGERD------------HLPIHERLAAFAQDKN 384

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
           DP L  L FQFGRYLLISS+R G    NLQG+W   +   W+   H+NINL+MN+W +  
Sbjct: 385 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 444

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
            NLSE   PL +       +G +TA+  Y A GWV H   ++W + +A      W     
Sbjct: 445 TNLSELHLPLIELTKQQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 503

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
             AWLC HL+ HY YT+D+ +L +  YP ++G A F +D L++     YL T P+TSPE+
Sbjct: 504 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 562

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
            +  P+G +  +   S MD  I+RE+F+  I AA +L   + A   ++     RL PT I
Sbjct: 563 AYKMPNGSVVSICAGSMMDNQILRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 621

Query: 585 AEDGSIMEWVQ 595
            +DG IMEW++
Sbjct: 622 GKDGRIMEWLE 632


>gi|410102732|ref|ZP_11297657.1| hypothetical protein HMPREF0999_01429 [Parabacteroides sp. D25]
 gi|409237859|gb|EKN30654.1| hypothetical protein HMPREF0999_01429 [Parabacteroides sp. D25]
          Length = 809

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 193/611 (31%), Positives = 305/611 (49%), Gaps = 53/611 (8%)

Query: 3   NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           N     T   +   F+ PA+ + + +P+GNGR+G M  GG+  E + LNE +LW+G   D
Sbjct: 16  NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
             NP A  +L+++R L+  G+  EA     K F               P   YQL G++ 
Sbjct: 76  TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135

Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
           L +   +   +   YRR L+L+ A A V +  GNV + RE F+S    + V  +      
Sbjct: 136 LRYMYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADADR 195

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
           +L+F++ ++     H+ ++ + + ++M G+ P            + KG++F++   ++I 
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245

Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
             +G      D  L V  +  A++L+ + +  FD          KD   + +   L    
Sbjct: 246 LPKGGDLTATDSSLSVRSASEAIILVSLGTDYFD----------KDGVGQFLEKYLSQAE 295

Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
           +  +S L   H   Y+ LF RVS+ L +  +D             +P  ER+ +F  D+ 
Sbjct: 296 SKDFSTLRREHTLAYRSLFDRVSLDLGKGERD------------HLPIHERLAAFAQDKN 343

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
           DP L  L FQFGRYLLISS+R G    NLQG+W   +   W+   H+NINL+MN+W +  
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 403

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
            NLSE   PL +       +G +TA+  Y A GWV H   ++W + +A      W     
Sbjct: 404 TNLSELHLPLIELTKQQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 462

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
             AWLC HL+ HY YT+D+ +L +  YP ++G A F +D L++     YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 521

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
            +  P+G +  +   S MD  I+RE+F+  I AA +L   + A   ++     RL PT I
Sbjct: 522 AYKMPNGSVVSICAGSMMDNQILRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 580

Query: 585 AEDGSIMEWVQ 595
            +DG IMEW++
Sbjct: 581 GKDGRIMEWLE 591


>gi|409099481|ref|ZP_11219505.1| alpha-L-fucosidase [Pedobacter agri PB92]
          Length = 937

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 175/503 (34%), Positives = 265/503 (52%), Gaps = 50/503 (9%)

Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
           YQ  GD+ L F    L      Y+R LDL TA AR  Y++  V +TRE+F+S P+Q IV 
Sbjct: 293 YQPFGDLNLAFQHKGLI---TKYKRSLDLTTAIARTNYTIAGVNYTREYFASQPNQSIVI 349

Query: 162 KISGSESGSLSFNVSLDSLLDNHSY-VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFS 220
            +S  +  S+S   +L SL         G N I +  +     +  ++         + +
Sbjct: 350 HLSADKKASISLTAALSSLHQQSGIKALGKNTISLSVQVKDGALKGES---------RLT 400

Query: 221 AILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMS 280
           A+++       G +  L +K + +  +D   L L A ++F    IN  D   DP + ++ 
Sbjct: 401 AVIK------NGAVKVLNNK-ISISKADEVTLYLTAGTNF----INAQDVSGDPAAANIK 449

Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
           AL ++ + + +++  RH+ +YQ  +++  +   +S K+             +P+ ER+  
Sbjct: 450 ALNTVTDKTSAEIKNRHIKEYQSYYNKFHVDFGQSGKE------------NLPTNERLNK 497

Query: 341 FQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
           F T  DP    L  Q+GRYLLISSSRPGTQ ANLQGIWN+ L+P W S    NIN+EMNY
Sbjct: 498 FATSNDPGFAALYMQYGRYLLISSSRPGTQPANLQGIWNDLLTPPWGSKYTTNINMEMNY 557

Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
           W +   NLS   EPLF+ +  L+  G++TA+  Y   GWV+HH TD+W   +A       
Sbjct: 558 WPAEVLNLSALNEPLFNKINGLAKTGTETAKEYYNTPGWVLHHNTDLW-NGTAPINASNH 616

Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPS 519
            +W  G AWL  HLWEHY +T D+ FL   AYPL++  A F   +LI+    G+L + PS
Sbjct: 617 GIWVTGAAWLSQHLWEHYAFTGDQTFLRNEAYPLMKQAALFFDAFLIKDPKTGWLISTPS 676

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPR 578
            SPE      +G L       TMD  IIR +F   I+A E+L  N DA    +L++ + +
Sbjct: 677 NSPE------NGGLVA---GPTMDHQIIRSLFKNCIAATEIL--NVDADFRTILQAKMKQ 725

Query: 579 LRPTKIAEDGSIMEWVQRRLNTS 601
           + P +I + G + EW + + +T+
Sbjct: 726 IAPNQIGKYGQLQEWREDKDDTT 748



 Score = 82.0 bits (201), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 57/82 (69%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +N PA+ +TDA+PIGNGRLGAMV+ GV ++ ++ NE+TLWTG P +Y    A K L+
Sbjct: 29  QLWYNQPAEKWTDALPIGNGRLGAMVFAGVENDHIQFNEETLWTGKPRNYNRKGAYKYLA 88

Query: 74  DVRSLVDSGQYAEATAASVKLF 95
           ++R L+  G+  EA   + K F
Sbjct: 89  EIRKLLFEGKQKEAEVLAQKEF 110


>gi|189460419|ref|ZP_03009204.1| hypothetical protein BACCOP_01058 [Bacteroides coprocola DSM 17136]
 gi|189432851|gb|EDV01836.1| GDSL-like protein [Bacteroides coprocola DSM 17136]
          Length = 1006

 Score =  292 bits (748), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 182/592 (30%), Positives = 308/592 (52%), Gaps = 45/592 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ PA  + + +P+GNGRLG M  GG+  E + LNE ++W+G   +Y NPDA K+L ++R
Sbjct: 233 YDEPAAQWEETLPLGNGRLGMMPDGGIVKEHIVLNEISMWSGSEANYLNPDASKSLPEIR 292

Query: 77  SLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFD-DSHLKYAEETYRREL 128
            L+  G+  EA       F       G     +Q+LG++ LE     H K     Y R L
Sbjct: 293 RLLFEGKNKEAQELMYTSFVPKKPEKGGTYGTFQMLGNLFLEHQYGVHEKDVPADYHRWL 352

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           DL+   A   +S GNV + RE+  S    V++  +  +  GS++F ++L           
Sbjct: 353 DLSKGIAYTTFSRGNVNYVREYVVSRDKDVMLIHLKANVPGSINFKMNLSRP------ER 406

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
           G+ + + EG+     +    ++     G++++AI  I     R T  + +++ + V+ +D
Sbjct: 407 GSVRKLAEGKL---ELYGSLDSGSSQTGVRYAAIAGI-TCKGRQTNQSTDEQSITVQNAD 462

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
            A +++ A +SF    I  +++ +         L      +  +  +  +  YQ LF+R 
Sbjct: 463 EAWIVVSAKTSFLAGEIYETEADR--------ILNDALKSNLCETVSEAILSYQALFNRA 514

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
            I+L  +           E +  + + +R++ FQ  +DPSL  L + +GRYLLISS+RPG
Sbjct: 515 GIRLPEN-----------EAVSHLTTDQRIERFQQQDDPSLAALYYNYGRYLLISSTRPG 563

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           +   NLQG+W  +    W+   H NIN++MN+W     NLSE   PL D +  L  +G +
Sbjct: 564 SLPPNLQGLWANEPGTPWNGDYHTNINVQMNHWPVEQANLSELYLPLVDLVKRLVPSGEE 623

Query: 429 TAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
           +A+  Y   A GWV+H  T++W   +A      W     GGAWLC HLWEHY ++ DR++
Sbjct: 624 SAKAFYGPQAKGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHLWEHYLFSGDRNY 682

Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMD 543
           L    YP+++G + F    ++ E   G+L T P++SPE+ F  P  D     V    TMD
Sbjct: 683 LAD-IYPIMKGASEFFYSTMVREPKHGWLVTAPTSSPENAFYLPGKDRTPISVCMGPTMD 741

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           + ++RE+++ +I A+ +L   + A  E + +++  L P +I++ G +MEW++
Sbjct: 742 IQLVRELYTNVIEASHILH-TDTAYAEALQEAIGLLPPHQISKKGYLMEWLE 792


>gi|340621763|ref|YP_004740215.1| alpha-1,2-fucosidase 2 [Capnocytophaga canimorsus Cc5]
 gi|339902029|gb|AEK23108.1| Alpha-1,2-fucosidase 2 [Capnocytophaga canimorsus Cc5]
          Length = 806

 Score =  292 bits (748), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 201/605 (33%), Positives = 312/605 (51%), Gaps = 60/605 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + F+ PA  FT+++P+GNGRLGAMV+G    ET+ LNE +LW+G   +  + +A K L
Sbjct: 23  VSVVFDQPATFFTESLPLGNGRLGAMVFGKTDVETIVLNEISLWSGGKQEADDENAHKYL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFD-DSHLK 118
            ++++L+  G+  EA +  +K F         G+ A+     YQ LG +++++  D+ + 
Sbjct: 83  KEIQNLLLQGKNLEAQSLLMKHFVAKGKGTCHGNGANCHYGCYQTLGQLKIDWKSDASVT 142

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           +    Y+R LDL  A A  +Y     +  +  F+   + VI  KI  ++   L  ++   
Sbjct: 143 H----YKRVLDLEKAVATTQYVRNGNQIEQIVFTDFNNDVIWVKIKSAQKTDLGLSLFRK 198

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
              +N  +    N++IM+G  P          N++ KG++F+ I E+    +  T  A  
Sbjct: 199 ---ENAHFSYDKNKLIMQGTLP----------NENQKGMEFATIAEVTTDGELTTSLA-- 243

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
              L+V  +   ++ + AS+++   + N      D   ++++ L++I +LS+ +    + 
Sbjct: 244 --GLEVRSASEVIVKISASTNYS--YENGELENTDVVKQTLAYLKAINSLSFQNALLENQ 299

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
             Y K+F+R   ++  S  D        EN+ T    +R ++  TD    L  L + FGR
Sbjct: 300 VTYGKIFNRNRWEMPTSLTD--------ENLTTWQRLQRYQAGNTD--AQLPVLYYNFGR 349

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSSR G   ANLQG+W E+    W+   H+NIN++MNYW +   NLS+  EPL  F
Sbjct: 350 YLLISSSRKGLLPANLQGLWAEEYQTPWNGDYHLNINVQMNYWLAEVTNLSDLAEPLLRF 409

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
              L  NG KTA+  Y A GWV H  ++ W  +S   G   W     GGAWLC H+WEHY
Sbjct: 410 TKNLVPNGKKTAKAYYNAEGWVAHVVSNPWFFTSPGEG-ASWGSTLTGGAWLCQHIWEHY 468

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAP---DGK-- 532
            +T + DFL K  Y +L+  A F  D LI E   GY  T PS SPE+ +  P   DGK  
Sbjct: 469 QFTQNIDFL-KEYYFVLKEAAHFFEDMLIKEPKSGYWVTAPSNSPENAYYLPELKDGKKQ 527

Query: 533 --LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
               C+    TMDM I+RE+FS ++ A+E+L K+ D    K    +    P  I E G +
Sbjct: 528 HGFTCM--GPTMDMQIVRELFSNVLKASEILNKDTDKH-PKWKDIIKNTVPNTIGEQGDL 584

Query: 591 MEWVQ 595
            EW  
Sbjct: 585 NEWFH 589


>gi|326798066|ref|YP_004315885.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
 gi|326548830|gb|ADZ77215.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
          Length = 794

 Score =  292 bits (748), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 185/608 (30%), Positives = 304/608 (50%), Gaps = 69/608 (11%)

Query: 8   STTNPLKITFNGPAKHFT-DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN- 65
           S    L++ ++ PA  +  +A+PIGNG +GAM +GG+  E ++ +E +LW+G PG   N 
Sbjct: 25  SQQKALQLWYDRPATDWMREALPIGNGYIGAMFFGGIGEEQIQFSEGSLWSGGPGANPNY 84

Query: 66  -----PDAPKALSDVRSLVDSGQYAEAT---------AASVKLFGHPAD-----VYQLLG 106
                P+A K L +VR+L+  G+  EA           A VKL G   D       Q +G
Sbjct: 85  NFGNRPNAWKYLGEVRALIKQGKLKEANELVEKQMTGMAPVKLAGDSTDWGDYGAQQTMG 144

Query: 107 DIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGS 166
           D+ ++    H     + YRR LD+  A  +V YSV   ++ R  F S P  V+V K +  
Sbjct: 145 DLFIKV--GHGSIPVQDYRRTLDIQRAIGKVSYSVAGNKYQRSFFGSYPQGVMVYKFTSD 202

Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
           +S S + + S     +  S+       +  G  P  ++  +           +  + + +
Sbjct: 203 KSESYTLHFSTPQYKEKESFEGLRYSCV--GYVPNNKLAFET---------AYQLVTDGR 251

Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR 286
           +    GT+S  + K L        +++  A++++   +  P  +  D  S     L + +
Sbjct: 252 VKYTNGTVSVEKAKSL--------LIIHTAATAYTMQY--PHYNGNDFRSIIKKRLDAAK 301

Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS-FQTDE 345
             SY  L+  H +DYQ LF RVS QL              ++ D +P+ +R ++ F+  E
Sbjct: 302 GKSYKQLFQIHQEDYQPLFDRVSFQLQ------------GKSADHLPTDKRQQALFEGAE 349

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
           D  L +L FQ+GRYL+I++SRPGT   +LQG WN  ++P W +  H NIN +M YW +  
Sbjct: 350 DVGLEQLYFQYGRYLMIAASRPGTMPMHLQGKWNNSVNPPWAADYHTNINEQMLYWPAEV 409

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
            NLSEC EPL D++  L   G K+A   +   GW+++   + +  ++ + G + W  +P 
Sbjct: 410 TNLSECHEPLIDYIESLVEPGKKSAHDFFHTRGWIVNTMNNAFGYTAVNWG-LPWGFYPA 468

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHE 525
           G AWL  H+WEHY YT D+ +L  RAYP+++  A F +D+L    +G+L ++PS SPEH 
Sbjct: 469 GAAWLTQHVWEHYAYTQDKAYLRNRAYPIMKEAARFWIDYLTLDENGHLVSSPSYSPEH- 527

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  ++MD  I  ++ +  + AA VL+  + A  +       R+ P ++ 
Sbjct: 528 --------GGISGGASMDHQIAWDILNNSLEAAMVLD--DKAFADTAQHVRDRILPPQVG 577

Query: 586 EDGSIMEW 593
             G + EW
Sbjct: 578 RWGQLQEW 585


>gi|350633298|gb|EHA21663.1| hypothetical protein ASPNIDRAFT_53702 [Aspergillus niger ATCC 1015]
          Length = 833

 Score =  291 bits (746), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 198/597 (33%), Positives = 297/597 (49%), Gaps = 58/597 (9%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA +FT  +PIGNGRLGA +WG   +E + LNE+++W+G   +  NP +  AL  VR
Sbjct: 70  YTTPANNFTSTLPIGNGRLGAAIWG-TATENVTLNENSIWSGPFINRVNPRSYDALWPVR 128

Query: 77  SLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           SL+  G   E   A++  + G P     Y  LG + L+F   H +     Y R LDL + 
Sbjct: 129 SLLAEGNMTEGNDATLANMVGIPDSPQSYSALGSLVLDF--GHDEAGISNYTRYLDLRSG 186

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A V+Y+   V + RE+ +S+PD V+  ++S SE G L  NV+  S L    YV  NN  
Sbjct: 187 MAVVEYTYRAVRYRREYLASHPDNVVAVRLSSSEPGGL--NVA--SSLVRDRYVVSNNAT 242

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
           +      G  +  +A +N+    IQF+A   + +SD R T             S+   L+
Sbjct: 243 LSHD---GGLLTLRAYSNNVSNPIQFTAEARV-VSDGRAT-------------SNGTSLV 285

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRV 308
           +  +S+ D  FI+   S +    E+  A     L +  +  +  +    + DY  L  RV
Sbjct: 286 VRNASTID-IFIDTETSYRYSAQENWEAEIKSKLDTACSSGFVAVKKNAIADYSALAQRV 344

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR 366
            + L            S  +   +P+  R+ +++ D   DP LV L+F FGR+ LI+SSR
Sbjct: 345 DLNLG-----------SSGSAGNLPTDSRLVNYRIDPDSDPELVVLMFHFGRHSLIASSR 393

Query: 367 PGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
                A   NLQG+WN+D  P W     ++INLEMNYW +   NL++   P  D L  + 
Sbjct: 394 ATESPALPANLQGLWNQDFDPAWGGRFTIDINLEMNYWPAEVTNLADTFSPFIDLLDVVH 453

Query: 424 INGSKTAQVNYLAS--GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
             G   A+  Y  S  G+V+HH TD+W  ++       W +WPMGGAWL  +L EHY ++
Sbjct: 454 DRGLDVAESMYHCSNGGYVLHHNTDLWGDAAPVDNGTTWTMWPMGGAWLSANLIEHYRFS 513

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACV 536
            D   L  R +PLL+  A F   +L    +GY  T PS SPE  +I P+     GK   +
Sbjct: 514 RDESILRNRIWPLLQSAARFYYCYLFP-FEGYYSTGPSLSPEASYIVPNDMTTAGKEEGI 572

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             + TMD +++ E+F A+I   +VL  N           L +++P +I   G I+EW
Sbjct: 573 DIAPTMDNSLLHELFQAVIETCDVLAINNTDCTTAA-SYLAKIKPPQIGSSGRILEW 628


>gi|159125849|gb|EDP50965.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
          Length = 792

 Score =  291 bits (746), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 187/598 (31%), Positives = 307/598 (51%), Gaps = 47/598 (7%)

Query: 11  NPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           NP   T +  PA  F   +PIGNGRL   +WGG   + + LNE+++W+G   D  NP+A 
Sbjct: 22  NPSTYTWYTSPAADFASTLPIGNGRLATAIWGGA-VDNITLNENSIWSGPFQDRVNPNAY 80

Query: 70  KALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
           +  +D R+++++G  + A    ++ +   P+    Y  LG ++L+F   H   +   Y R
Sbjct: 81  EGFTDSRAMLEAGNLSSANDVVLREMVSIPSSPREYHPLGSLKLDF--GHEASSLHNYTR 138

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            LDL T  A V+Y VG+V ++RE+ +S+PD V+  ++  S+  +L+  VSL+     + Y
Sbjct: 139 FLDLGTGVAGVRYHVGDVVYSREYVASHPDGVLAVRLRASKDSALNVVVSLE----RNRY 194

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           V     +  +G      +  KAN+  +   I+F++   +   + R T +      + V G
Sbjct: 195 VESLTAVSSKGMG---TLTLKANSGQNTDPIRFTSQARVVSREGRITTNG---TSVVVTG 248

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    +     +S+      P ++++D  S     L +   L+Y  +      DYQ L  
Sbjct: 249 ASTVDIFFDTQTSYR----YPDETERD--SAVKKQLDAAVKLNYPAVKQAATSDYQSLSG 302

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQFGRYLLISS 364
           RV +           D  S  +    P+  R+ +++T+   DP LV L+F FGR+ LI+S
Sbjct: 303 RVKL-----------DLGSSGSAGNQPTDIRLTNYKTNPNGDPELVTLMFNFGRHSLIAS 351

Query: 365 SRPGTQV---ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           SR G+     ANLQGIWN+D SP W     V++NLEMNYW +   NL++  EP+ D +  
Sbjct: 352 SREGSSSGLPANLQGIWNQDYSPAWGGKYTVDVNLEMNYWHAQVTNLADTFEPVIDLMDK 411

Query: 422 LSINGSKTAQVNYLA-SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
           +  +G   A+  Y   +G+++HH TD+W  ++       W +WPMG AWL  +L + Y +
Sbjct: 412 VLPHGQDVARKMYHCDTGYILHHNTDLWGDAAPVDNGTKWTMWPMGSAWLSMNLMDQYRF 471

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
           T D+  L +R +PLL+  A F   +L E  +GY  + PS SPE+ F  P+     GK   
Sbjct: 472 TQDKTLLRERIWPLLKSAADFYYCYLFE-FEGYYTSGPSISPENAFRIPEDMTIAGKSTG 530

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +  + TMD  ++ E+F A+I   + L+   + L     K + R+R  +I   G I+EW
Sbjct: 531 IDLAPTMDNLLLHELFLAVIETCKALDITGEDLA-NAQKYISRIRQPQIGSYGQILEW 587


>gi|375101342|ref|ZP_09747605.1| alpha-galactosidase family protein [Saccharomonospora cyanea
           NA-134]
 gi|374662074|gb|EHR61952.1| alpha-galactosidase family protein [Saccharomonospora cyanea
           NA-134]
          Length = 1130

 Score =  291 bits (745), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 201/595 (33%), Positives = 308/595 (51%), Gaps = 55/595 (9%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG----DYTNPD 67
           L + ++ PA  + ++ +PIG+G LGA V+GGV +E L+ NE TLWTG PG    D+ N  
Sbjct: 52  LTLWYDEPASDWESEILPIGSGALGAGVFGGVATERLQFNEKTLWTGGPGSAGYDFGNWK 111

Query: 68  APK--ALSDVRSLVDSGQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEE 122
            P+  A+ +V+  +D+ Q  +    + KL G P      YQ  G++ +    S  +  E 
Sbjct: 112 EPRPGAIEEVQERIDAEQRVDPEWVASKL-GQPKQGYGAYQTFGEVRV----SGAEPQEV 166

Query: 123 T-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
           T YRR LD+  A A V Y    V  TRE+F++  D VIV + SG E+G++   V + +  
Sbjct: 167 TDYRRYLDIADAVAGVSYEADGVRHTREYFATAADDVIVARFSGDETGAVDVTVGV-TAP 225

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
           DN S     N    +GR         A A DD  G+++ A L++    + G+ +   D  
Sbjct: 226 DNRS----KNVTAKDGRIT------FAGALDD-NGLRYEAQLQVLT--EGGSRTDNPDGS 272

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           + V  +D   L+L A + +   +  P+    DP +     + +     Y  L   H+ D+
Sbjct: 273 VTVADADTMTLVLAAGTDYSDEY--PAYRGDDPHAAVTERVDAAVAEGYDALRAAHVADH 330

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           ++LF RVS+ L +   D+ TD       D   +AE  ++ +         L FQ+GRYLL
Sbjct: 331 RELFDRVSLDLGQRMPDLPTDELLARYRDGGLAAEERRALEA--------LYFQYGRYLL 382

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           I+SSRPG+  ANLQG+WN+  SP W +  HVNINL+MNYW +   NLSE  +PLFD++  
Sbjct: 383 IASSRPGSLPANLQGVWNDSTSPPWSADYHVNINLQMNYWPAEVTNLSETTDPLFDYVDS 442

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNY 480
           L   G  TA+  +   GWV+H++T  +  +   D     W  +P  GAWL    WEHY +
Sbjct: 443 LVAPGEVTAREMFDNRGWVVHNETTPFGYTGVHDWATAFW--FPEAGAWLAQSYWEHYLF 500

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           T D  FL +RAYP+L+  + F +D L+ +  DG L  NPS SPE             S  
Sbjct: 501 TRDETFLRERAYPMLKSLSQFWIDELVTDPRDGKLVVNPSYSPEQ---------GDFSAG 551

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
           ++M   I+ ++ ++   AAE++   E+A   ++  +L  L P  ++   G + EW
Sbjct: 552 ASMSQQIVWDLLTSTAEAAELV-GGEEAFRSELAGTLAELDPGLRVGSWGQLQEW 605


>gi|70985434|ref|XP_748223.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
 gi|66845851|gb|EAL86185.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
          Length = 792

 Score =  290 bits (743), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 187/598 (31%), Positives = 307/598 (51%), Gaps = 47/598 (7%)

Query: 11  NPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           NP   T +  PA  F   +PIGNGRL A +WGG   + + +NE+++W+G   D  NP+A 
Sbjct: 22  NPSTYTWYTSPAADFASTLPIGNGRLAAAIWGGA-VDNITVNENSIWSGPFQDRVNPNAY 80

Query: 70  KALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
           +  +D R+++++G  + A    ++ +   P+    Y  LG ++L+F   H   +   Y R
Sbjct: 81  EGFTDSRAMLEAGNLSSANDVVLREMVSIPSSPREYHPLGPLKLDF--GHEASSLHNYTR 138

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            LDL T  A V+Y VG+V ++RE+ +S+PD V+  ++  S+  +L+  VSL+     + Y
Sbjct: 139 FLDLGTGVAGVRYHVGDVVYSREYVASHPDGVLAVRLRASKDSALNVVVSLE----RNRY 194

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           V     +  +G      +  KAN+  +   I+F++   +   + R T +      + V G
Sbjct: 195 VESLTAVSSKGMG---TLTLKANSGQNTDPIRFTSQARVVSREGRITTNG---TSVVVTG 248

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    +     +S+      P ++++D  S     L +   L Y  +      DYQ L  
Sbjct: 249 ASTVDIFFDTQTSYR----YPDETERD--SAVKKQLDAAVKLIYPAVKQAATSDYQSLSG 302

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQFGRYLLISS 364
           RV +           D  S  +    P+  R+ +++T+   DP LV L+F FGR+ LI+S
Sbjct: 303 RVKL-----------DLGSSGSAGNQPTDIRLTNYKTNPNGDPELVTLMFNFGRHSLIAS 351

Query: 365 SRPGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           SR G+  A   NLQGIWN+D SP W     V++NLEMNYW +   NL++  EP+ D +  
Sbjct: 352 SREGSSSALPANLQGIWNQDYSPAWGGKYTVDVNLEMNYWHAQVTNLADTFEPVIDLMDK 411

Query: 422 LSINGSKTAQVNYLA-SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
           +  +G   A+  Y   +G+++HH TD+W  ++       W +WPMG AWL  +L + Y +
Sbjct: 412 VLPHGQAVARKMYHCDTGYILHHNTDLWGDAAPVDNGTKWTMWPMGSAWLSMNLMDQYRF 471

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
           T D+  L +R +PLL+  A F   +L E  +GY  + PS SPE+ F  P+     GK   
Sbjct: 472 TQDKTLLRERIWPLLKSAADFYYCYLFE-FEGYYTSGPSISPENAFRIPEDMTIAGKSTG 530

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +  + TMD  ++ E+F A+I   + L+   + L     K + R+R  +I   G I+EW
Sbjct: 531 IDLAPTMDNLLLHELFLAVIETCKALDITGEDLA-NAQKYISRIRQPQIGSYGQILEW 587


>gi|238504526|ref|XP_002383494.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus flavus
           NRRL3357]
 gi|220690965|gb|EED47314.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus flavus
           NRRL3357]
          Length = 792

 Score =  290 bits (743), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 205/598 (34%), Positives = 293/598 (48%), Gaps = 60/598 (10%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA +FT  +P+GNGRLGA VWG    E + LNE+++W+G   D  NPD+  AL  VR
Sbjct: 28  YTSPASNFTSTLPLGNGRLGAAVWGST-VENITLNENSIWSGQFMDRVNPDSYSALDPVR 86

Query: 77  SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           S++  G    A   +++ + G P +   Y  LG + L+F   H     E Y R LDL   
Sbjct: 87  SMLKEGNMTAAGQTTLEHMVGSPDEPRAYHPLGSLVLDF--GHEDSQVENYTRSLDLLKG 144

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A V Y    VEF RE+ +S+P  VI  +++ SE+G L+   SL        YV  N   
Sbjct: 145 RAVVHYGYHGVEFRREYIASHPAGVIAARLTASEAGRLNVAASLS----RGRYVTENT-- 198

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG---SDWA 250
                         A A +D   ++  A      SDD   IS     ++   G   S  A
Sbjct: 199 --------------ATAGNDTGSLKLRA--STAESDD---ISFSAAARIVTHGGWVSRSA 239

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLF 305
             +++ +++    FI+   S +  T E+  A     L +     +  +      D++ L 
Sbjct: 240 SSVVIQNATTVDIFIDAETSYRFETQEAWEAEIERKLDAAMRAGFPAIEQAATADHEALA 299

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            RV + L+ S         +  N+ T    ER K+   D DP LV L+FQFGRY LI+SS
Sbjct: 300 GRVHLDLASS--------GAAGNLPTDVRLERYKT-HPDADPELVTLMFQFGRYSLIASS 350

Query: 366 R-PGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           R  GT     NLQG+WNED  P W     VNINLEMNYW +   NL+E   PL   L  +
Sbjct: 351 RETGTSPLPPNLQGLWNEDYEPAWGGRYTVNINLEMNYWPAGVTNLAETLGPLIFLLETV 410

Query: 423 SINGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
              G   A+  Y     G+V+HH TDIW  +        W +WPMGGAWL  +L E+Y +
Sbjct: 411 KPRGQDIARRMYNCDNGGYVLHHNTDIWGDAVPVNNGTKWTMWPMGGAWLSANLMEYYRF 470

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
           T D + L++R +PLL   A F   ++    +GYL T PS+SPE+ F+ P+     G    
Sbjct: 471 TQDTNLLKERIWPLLRSAAQFYHCYVFS-FNGYLSTGPSSSPENAFVVPNDMSESGNEEG 529

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +  + TMD  ++ E+F +II   +VL  N +    K   SLP ++  +I   G I+EW
Sbjct: 530 IDIAPTMDNTLLSELFHSIIETGKVLGIN-NTDTTKAASSLPLIKLPQIGSYGQILEW 586


>gi|429751943|ref|ZP_19284832.1| hypothetical protein HMPREF9073_00790 [Capnocytophaga sp. oral
           taxon 326 str. F0382]
 gi|429178378|gb|EKY19657.1| hypothetical protein HMPREF9073_00790 [Capnocytophaga sp. oral
           taxon 326 str. F0382]
          Length = 806

 Score =  290 bits (742), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 203/603 (33%), Positives = 315/603 (52%), Gaps = 59/603 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + F+ PAKHFT+++PIGNGRLGAM++G    + + LNE +LW+G   D  +PDA   L
Sbjct: 23  VSVVFHEPAKHFTESLPIGNGRLGAMLFGKTDIDRIVLNEISLWSGGTQDADDPDAHIHL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLKY 119
             ++ L+  G+  EA +   K F                   YQ+LG+++L++  +    
Sbjct: 83  KTIQQLLLDGKNLEAQSLLQKHFIAKGKGSCNGNGANGNYGCYQILGELQLDWKTN---L 139

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
             + Y+R L L+ ATA   +  G+    +  F+   + +I  KI+ S+   L  ++SL+ 
Sbjct: 140 PIKNYQRILRLDQATAFTSFKRGDNHIQQIAFADFKNDLIWIKITASQP--LDMDISLNR 197

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALE 238
             +N +    +N+II+ G  P          N+D +G+QF+++++I+   + + T SA  
Sbjct: 198 K-ENATTSYKSNKIILSGALP----------NNDIQGMQFASVIDIQTDGNLQNTASATS 246

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
            +K K       VL + A++++D  F     ++ D   ++ + LQ    + + +      
Sbjct: 247 VQKAKE-----IVLKISAATNYD--FTKGRLTQDDVLQKANNYLQKT-TIPFDNAIIESQ 298

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-QFG 357
             YQ LF+R     +R   D  TDT S        + ER++ F   +  +L+ +L+  FG
Sbjct: 299 KAYQVLFNR-----NRWYSDANTDTSS------FSTFERLQRFYKGKKDALLPILYYNFG 347

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLISSSR G   ANLQG+W E+    W+   H+NINL+MNYW +   NLSE   PL  
Sbjct: 348 RYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINLQMNYWLAESTNLSELTTPLHQ 407

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           F   L  NG KTA+  Y A GWV H  ++ W  +S       W     GGAWLC H+W+H
Sbjct: 408 FTKNLVANGRKTAKAYYNAKGWVAHVISNPWFYTSPGES-AEWGSTLTGGAWLCEHIWQH 466

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DGK- 532
           Y YT++ DFL K  YP+L+  A F    LI+    GY  T PS SPE+ +I P   DGK 
Sbjct: 467 YLYTLNTDFL-KEYYPVLKEAADFFQSLLIKDPKTGYWVTAPSNSPENAYIMPQLKDGKK 525

Query: 533 -LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
            +     + TMDM I+RE+FS  + AA++L  + D L  +  + +    P +I   G + 
Sbjct: 526 QIGNTCIAPTMDMQIVRELFSNTLQAAKILGVDSD-LYSQWQEIITHTVPNRIGRKGDLN 584

Query: 592 EWV 594
           EW+
Sbjct: 585 EWL 587


>gi|18310857|ref|NP_562791.1| hypothetical protein CPE1875 [Clostridium perfringens str. 13]
 gi|18145539|dbj|BAB81581.1| conserved hypothetical protein [Clostridium perfringens str. 13]
          Length = 1479

 Score =  290 bits (741), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 189/604 (31%), Positives = 313/604 (51%), Gaps = 70/604 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
           L + ++ PA ++  +A+PIGNG +G M++G V SE ++ NE TLW+G PG +        
Sbjct: 48  LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107

Query: 66  PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
             A +A+ ++R ++  G        +      + +G     YQ  GDI L+F  SH +  
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              YRREL++  + + VKY+   V + RE+F S PD V+V K+   ++ SL+ +V  +  
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
            +  +    NN +I+ G               +  G+++ +  +IK+ ++ G+I   ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINNGGSIQDKEDR 267

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + VE +D   +++ A + +   +  P+   +DP S     + +  NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           Y+ LF RV + L     D  TD             E +  ++T++  SL  L FQ+GRYL
Sbjct: 325 YKNLFDRVDLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSR G+  ANLQG+WN   +P W S  H N+N++MNYW +   NLSE   PL +++ 
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431

Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            L   G KTA+++          +GW ++   + +   +A   +  W   P   AW+  +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
           LWEHYN+T D+D+L +  YP+++  A F   +L+E    DG  YL ++PS SPEH     
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
                  +  +T D  +I ++F+  I A+E L  +E+   E   K    L+P +I + G 
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELENKRERLLKP-QIGKHGQ 600

Query: 590 IMEW 593
           + EW
Sbjct: 601 VQEW 604


>gi|168211677|ref|ZP_02637302.1| fibronectin type III domain protein [Clostridium perfringens B str.
           ATCC 3626]
 gi|170710364|gb|EDT22546.1| fibronectin type III domain protein [Clostridium perfringens B str.
           ATCC 3626]
          Length = 1479

 Score =  290 bits (741), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 193/605 (31%), Positives = 315/605 (52%), Gaps = 72/605 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
           L + ++ PA ++  +A+PIGNG +G M++G V SE ++ NE TLW+G PG +        
Sbjct: 48  LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107

Query: 66  PDAPKALSDVRSLVDSGQYAEATAASVKLF----GHPAD--VYQLLGDIELEFDDSHLKY 119
             A +A+ ++R ++     AE    S  L+    G   D   YQ  GDI L+F  SH + 
Sbjct: 108 EGAWEAVQEIRKIL-----AEGGTPSNDLYQRVCGDQRDYGAYQNFGDIFLDFK-SHEES 161

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
               YRREL++  + + VKY+   V + RE+F S PD V+V K+   ++ SL+ +V  + 
Sbjct: 162 KVTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLNVDVRNEG 221

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
             +  +    NN +I+ G               +  G+++ +  +IK+ +  G+I   ED
Sbjct: 222 AHNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKED 266

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
           + + VE +D   +++ A + +   +  P+   +DP S     + +  NL Y +L +RH++
Sbjct: 267 R-ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIE 323

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DY+ LF RV++ L     D  TD             E +  ++T++  SL  L FQ+GRY
Sbjct: 324 DYKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRY 370

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LLISSSR G+  ANLQG+WN   +P W S  H N+N++MNYW +   NLSE   PL +++
Sbjct: 371 LLISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYI 430

Query: 420 TYLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
             L   G KTA+++          +GW ++   + +   +A   +  W   P   AW+  
Sbjct: 431 ESLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQ 489

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIA 528
           +LWEHYN+T D+D+L +  YP+++  A F   +L+E    DG  YL ++PS SPEH    
Sbjct: 490 NLWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH---- 545

Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
                   +  +T D  +I ++F+  I A+E L  +E+   E   K    L+P +I + G
Sbjct: 546 -----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELENKRERLLKP-QIGKHG 599

Query: 589 SIMEW 593
            + EW
Sbjct: 600 QVQEW 604


>gi|422346543|ref|ZP_16427457.1| hypothetical protein HMPREF9476_01530 [Clostridium perfringens
           WAL-14572]
 gi|373226088|gb|EHP48415.1| hypothetical protein HMPREF9476_01530 [Clostridium perfringens
           WAL-14572]
          Length = 1479

 Score =  290 bits (741), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 189/604 (31%), Positives = 313/604 (51%), Gaps = 70/604 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
           L + ++ PA ++  +A+PIGNG +G M++G V SE ++ NE TLW+G PG +        
Sbjct: 48  LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107

Query: 66  PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
             A +A+ ++R ++  G        +      + +G     YQ  GDI L+F  SH +  
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              YRREL++  + + VKY+   V + RE+F S PD V+V K+   ++ SL+ +V  +  
Sbjct: 163 ITNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVLVIKLKADKASSLTVDVRNEGA 222

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
            +  +    NN +I+ G               +  G+++ +  +IK+ +  G+I   ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + VE +D   +++ A + +   +  P+   +DP S     + +  NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           Y+ LF RV++ L     D  TD             E +  ++T++  SL  L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSR G+  ANLQG+WN   +P W S  H N+N++MNYW +   NLSE   PL +++ 
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431

Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            L   G KTA+++          +GW ++   + +   +A   +  W   P   AW+  +
Sbjct: 432 SLREPGRKTAEIHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
           LWEHYN+T D+D+L +  YP+++  A F   +L+E    DG  YL ++PS SPEH     
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
                  +  +T D  +I ++F+  I A+E L  +E+   E   K    L+P +I + G 
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELEDKRERLLKP-QIGKHGQ 600

Query: 590 IMEW 593
           + EW
Sbjct: 601 VQEW 604


>gi|182626122|ref|ZP_02953882.1| fibronectin type III domain protein [Clostridium perfringens D str.
           JGS1721]
 gi|177908559|gb|EDT71084.1| fibronectin type III domain protein [Clostridium perfringens D str.
           JGS1721]
          Length = 1479

 Score =  289 bits (740), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 190/604 (31%), Positives = 314/604 (51%), Gaps = 70/604 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG---DYTNPD- 67
           L + ++ PA ++  +A+PIGNG +G M++G V SE ++ NE TLW+G PG   DY   + 
Sbjct: 48  LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEDYNGGNK 107

Query: 68  --APKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
             A +A+ ++R ++  G        +      + +G     YQ  GDI L+F  SH +  
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              YRREL++  + + VKY+   V + RE+F S PD V+V K+   ++ SL+ +V  +  
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
            +  +    NN +I+ G               +  G+++ +  +IK+ +  G+I   ED+
Sbjct: 223 YNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + VE +D   +++ A + +   +  P+   +DP S     + +  NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           Y+ LF RV++ L     D  TD             E +  ++T++  SL  L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSR G+  ANLQG+WN   +P W S  H N+N++MNYW +   NLSE   PL +++ 
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431

Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            L   G KTA+++          +GW ++   + +   +A   +  W   P   AW+  +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
           LWEHY +T D+D+L +  YP+++  A F   +L+E    DG  YL ++PS SPEH     
Sbjct: 491 LWEHYKFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
                  +  +T D  +I ++F+  I A+E L  +E+   E   K    L+P +I + G 
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELENKRERLLKP-QIGKHGQ 600

Query: 590 IMEW 593
           + EW
Sbjct: 601 VQEW 604


>gi|110798918|ref|YP_696557.1| fibronectin type III [Clostridium perfringens ATCC 13124]
 gi|110673565|gb|ABG82552.1| fibronectin type III domain protein [Clostridium perfringens ATCC
           13124]
          Length = 1479

 Score =  289 bits (739), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 189/604 (31%), Positives = 313/604 (51%), Gaps = 70/604 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
           L + ++ PA ++  +A+PIGNG +G M++G V SE ++ NE TLW+G PG +        
Sbjct: 48  LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107

Query: 66  PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
             A +A+ ++R ++  G        +      + +G     YQ  GDI L+F  SH +  
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              YRREL++  + + VKY+   V + RE+F S PD V+V K+   ++ SL+ +V  +  
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLNVDVRNEGA 222

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
            +  +    NN +I+ G               +  G+++ +  +IK+ +  G+I   ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + VE +D   +++ A + +   +  P+   +DP S     + +  NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           Y+ LF RV++ L     D  TD             E +  ++T++  SL  L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSR G+  ANLQG+WN   +P W S  H N+N++MNYW +   NLSE   PL +++ 
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYIE 431

Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            L   G KTA+++          +GW ++   + +   +A   +  W   P   AW+  +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
           LWEHYN+T D+D+L +  YP+++  A F   +L+E    DG  YL ++PS SPEH     
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
                  +  +T D  +I ++F+  I A+E L  +E+   E   K    L+P +I + G 
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELENKRERLLKP-QIGKHGQ 600

Query: 590 IMEW 593
           + EW
Sbjct: 601 VQEW 604


>gi|336417082|ref|ZP_08597411.1| hypothetical protein HMPREF1017_04519 [Bacteroides ovatus
           3_8_47FAA]
 gi|335936707|gb|EGM98625.1| hypothetical protein HMPREF1017_04519 [Bacteroides ovatus
           3_8_47FAA]
          Length = 859

 Score =  288 bits (738), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 206/633 (32%), Positives = 318/633 (50%), Gaps = 74/633 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--------Y 63
           LK T+N PAK + ++A+PIGNG +GAM++GGV  + ++ NE TLW+G PG+         
Sbjct: 32  LKATYNKPAKVWESEALPIGNGYMGAMIFGGVEVDVIQTNEHTLWSGGPGEDPSYNGGHL 91

Query: 64  TNPDAPKA-LSDVRSL---------VDSGQYAEATAASV------------------KLF 95
             P+  K+ L   R L         V+   Y +A    +                  KL 
Sbjct: 92  GTPETNKSYLHKTRVLLQQKMNDFTVNHSAYIDADGKLITHNYEGDGNGTELRNLIDKLA 151

Query: 96  GHPADV--YQLLGDIELEFDDSHLKY-AEETYRRELDLNTATARVKYSVGNVEFTREHFS 152
           G       +Q L +I +E  +S     A   Y R LD++ A  RV Y  G + F RE+F 
Sbjct: 152 GTKEHFGSFQTLSNIIVEVVNSATSEPAYSDYTRTLDIDNAIHRVTYKEGGITFKREYFM 211

Query: 153 SNPDQVIVTKI-SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN 211
           S PD ++V ++ S S+ G +S  +SL+SL  +      +N I + G  P      K   +
Sbjct: 212 SYPDNIMVMRLTSDSKKGKISRMISLESLHTDKVIRASDNTITLTGY-PTPTSGDKRVGD 270

Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD-- 269
               G++++  L +K +   G I+ ++ KKLK+E +   ++L+ A++++     +     
Sbjct: 271 HWKNGLKYAQQLLVKHTG--GKITVVDGKKLKIEEAKEIIVLMSAATNYVQCMDDSYHYF 328

Query: 270 SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
           S ++P  +  + L+   N  Y+ L   H  DY  L+ R+ + L   P+  V  T      
Sbjct: 329 SGEEPLDKVKATLKKAANKKYTALLATHEKDYHSLYDRMKLNLGNLPEMPVATT------ 382

Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
           D++       +    E+  L  L FQFGRYLLISSSR G+  ANLQG+W E LS  W+S 
Sbjct: 383 DSLLKGMDAHANSESENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNSD 442

Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHH 443
            H NIN++MNYW + P NLS C  P+ +++  L   G  TAQ  Y         GWV HH
Sbjct: 443 YHTNINVQMNYWPTQPTNLSRCHLPMVEYVKSLVPRGKYTAQQYYCKPDGGNVRGWVTHH 502

Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
           + +IW  ++  + K     +P G  W+C  +WE+Y + +D+DFLE   Y ++   A F +
Sbjct: 503 ENNIWGNTAPAK-KDTPHHFPAGAIWMCQDIWEYYQFNLDKDFLEAY-YDVMLQAALFWV 560

Query: 504 D--WLIEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV 560
           D  W  E  DG L  NPS SPEH EF      L C     +   A+I E+F  +I A++V
Sbjct: 561 DNLWTDE-RDGTLVANPSHSPEHGEF-----SLGC-----STSQAMIAEMFDMMIKASKV 609

Query: 561 LEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           L K+++  + ++  ++ +L   KI   G +MEW
Sbjct: 610 LGKDKEPEIAEIKTAMNKLSGPKIGLGGQLMEW 642


>gi|168214908|ref|ZP_02640533.1| fibronectin type III domain protein [Clostridium perfringens CPE
           str. F4969]
 gi|170713641|gb|EDT25823.1| fibronectin type III domain protein [Clostridium perfringens CPE
           str. F4969]
          Length = 1479

 Score =  288 bits (738), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 188/604 (31%), Positives = 312/604 (51%), Gaps = 70/604 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
           L + ++ PA  +  +A+PIGNG +G M++G V SE ++ NE TLW+G PG +        
Sbjct: 48  LALWYDEPATSWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107

Query: 66  PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
             A +A+ ++R ++  G        +      + +G     YQ  GDI L+F  SH +  
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              YRREL++  + + VKY+   V + RE+F S PD V+V K+   ++ SL+ +V  +  
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
            +  +    NN +I+ G               +  G+++ +  +IK+ +  G+I   ED+
Sbjct: 223 HNGKNLSVENNTLILSGEI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + VE +D   +++ A + +   +  P+   +DP S     + +  NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           Y+ LF RV++ L     D  TD             E +  ++T++  SL  L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSR G+  ANLQG+WN   +P W S  H N+N++MNYW +   NLSE   PL +++ 
Sbjct: 372 LISSSRAGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431

Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            L   G KTA+++          +GW ++   + +   +A   +  W   P   AW+  +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
           LWEHYN+T D+D+L +  YP+++  A F   +L+E    DG  YL ++PS SPEH     
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
                  +  +T D  +I ++F+  I A+E L  +E+   E   K    L+P ++ + G 
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGVDEEFRAELEDKRERLLKP-QVGKHGQ 600

Query: 590 IMEW 593
           + EW
Sbjct: 601 VQEW 604


>gi|317138010|ref|XP_001816599.2| alpha-fucosidase [Aspergillus oryzae RIB40]
 gi|195972741|dbj|BAG68493.1| probable secreted protein [Aspergillus oryzae]
          Length = 792

 Score =  288 bits (738), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 204/598 (34%), Positives = 292/598 (48%), Gaps = 60/598 (10%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA +FT  +P+GNGRLGA VWG    E + LNE+++W+G   D  NPD+  AL  VR
Sbjct: 28  YTSPASNFTSTLPLGNGRLGAAVWGST-VENITLNENSIWSGQFMDRVNPDSYSALDPVR 86

Query: 77  SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            ++  G    A   +++ + G P +   Y  LG + L+F   H     E Y R LDL   
Sbjct: 87  YMLKEGNMTAAGQTTLEHMVGSPDEPRAYHPLGSLVLDF--GHEDSQVENYTRSLDLLKG 144

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A V Y    VEF RE+ +S+P  VI  +++ SE+G L+   SL        YV  N   
Sbjct: 145 RAVVHYGYHGVEFRREYIASHPAGVIAARLTASEAGRLNVAASLS----RGRYVTENT-- 198

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG---SDWA 250
                         A A +D   ++  A      SDD   IS     ++   G   S  A
Sbjct: 199 --------------ATAGNDTGSLKLRA--STAESDD---ISFSAAARIVTHGGWVSRSA 239

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLF 305
             +++ +++    FI+   S +  T E+  A     L +     +  +      D++ L 
Sbjct: 240 SSVVIQNATTVDIFIDAETSYRFETQEAWEAEIERKLDAAMRAGFPAIEQAATADHEALA 299

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
            RV + L+ S         +  N+ T    ER K+   D DP LV L+FQFGRY LI+SS
Sbjct: 300 GRVHLDLASS--------GAAGNLPTDVRLERYKT-HPDADPELVTLMFQFGRYSLIASS 350

Query: 366 RP-GTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           R  GT     NLQG+WNED  P W     VNINLEMNYW +   NL+E   PL   L  +
Sbjct: 351 RKTGTSPLPPNLQGLWNEDYEPAWGGRYTVNINLEMNYWPAGVTNLAETLGPLIFLLETV 410

Query: 423 SINGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
              G   A+  Y     G+V+HH TDIW  +        W +WPMGGAWL  +L E+Y +
Sbjct: 411 KPRGQDIARRMYNCDNGGYVLHHNTDIWGDAVPVNNGTKWTMWPMGGAWLSANLMEYYRF 470

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
           T D + L++R +PLL   A F   ++    +GYL T PS+SPE+ F+ P+     G    
Sbjct: 471 TQDTNLLKERIWPLLRSAAQFYHCYVFS-FNGYLSTGPSSSPENAFVVPNDMSESGNEEG 529

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +  + TMD  ++ E+F +II   +VL  N +    K   SLP ++  +I   G I+EW
Sbjct: 530 IDIAPTMDNTLLSELFHSIIETGKVLGIN-NTDTTKAASSLPLIKLPQIGSYGQILEW 586


>gi|300771448|ref|ZP_07081323.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300761437|gb|EFK58258.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 778

 Score =  288 bits (737), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 192/613 (31%), Positives = 309/613 (50%), Gaps = 68/613 (11%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
            + +N LK+ ++  AK + + +P+GNG +G M  GGV  E + LNE ++W+G   D  N 
Sbjct: 22  VAQSNSLKLWYDKAAKRWEETLPLGNGLIGMMPDGGVQKEKIVLNEISMWSGSEEDPNNY 81

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLF-------GH------PADVYQLLGDIELEFD 113
            A K++ +++ L+  G+  EA     K F       GH      P   YQ LG + L+F 
Sbjct: 82  TAYKSVGEIQKLLFEGKNDEAERLVNKNFVTSGKGSGHGNGANVPFGCYQNLGFLNLQFT 141

Query: 114 DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
            ++       Y R LDL  A AR  +++  V++TRE+F+S    V V +++ S+ G+L+F
Sbjct: 142 GTN---QPTGYERSLDLKDAVARTHFTINGVKYTREYFTSYDQNVGVVRLTSSKKGALNF 198

Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
           + SL S  +   Y +  N+  M G      + P     D   GI FS+ + I     RG 
Sbjct: 199 SASL-SREERARYTSKGNEFSMSG------VLPDGKGGD---GISFSSKIRIF---HRGG 245

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
             A  D  L V  +   ++   A++S+  P         DP       L+   +  Y  L
Sbjct: 246 KVAASDTALTVSKASEVLIFFAAATSYFHP---------DPQQYVNEQLKLAYDTPYPQL 296

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDT--VPSAERVKSFQTD--EDPSL 349
           + +HL  Y+ +F+RV +QL             E++ID   + + +R+++F  +  +D  L
Sbjct: 297 FKQHLSRYESVFNRVDLQL-------------EDDIDKSDITTDKRLRAFYDNPAQDNGL 343

Query: 350 VELLFQFGRYLLISSSRPGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
             L +QFGRYL ISS+ P  + A   NLQG+W   +   W+   H+NIN +MN+W     
Sbjct: 344 AALYYQFGRYLNISSTAPDVKGALPPNLQGLWAHQIQTPWNGDYHLNINAQMNHWGVEVN 403

Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
           NLSE   P  + +  ++  G KTA+  Y A GWV++  T++W  S+    +  W      
Sbjct: 404 NLSEYHTPFIELIKKIAKTGEKTARAYYNAPGWVVYMMTNVWGYSAPGE-QASWGASTAS 462

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE 525
           G WLC HLWEHY +T D  +L K  YP+++G A F    ++ +   G+L T+PS SPE+ 
Sbjct: 463 G-WLCNHLWEHYQFTKDSVYL-KEVYPVMQGAARFYAHTMVTDPKTGWLVTSPSVSPENA 520

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE---DALVEKVLKSLPRLRPT 582
           F   +GK A V     +D  I+RE++  +I A  +L ++    D L  ++ +  P   P 
Sbjct: 521 FRMKNGKTAAVVMGPAIDNQIVRELYKNLIDADSILGQHNTFTDTLRTQIQQLAP---PV 577

Query: 583 KIAEDGSIMEWVQ 595
            I++ G + EW++
Sbjct: 578 LISKSGRVQEWLE 590


>gi|169343800|ref|ZP_02864799.1| fibronectin type III domain protein [Clostridium perfringens C str.
           JGS1495]
 gi|169298360|gb|EDS80450.1| fibronectin type III domain protein [Clostridium perfringens C str.
           JGS1495]
          Length = 1479

 Score =  288 bits (737), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 188/604 (31%), Positives = 313/604 (51%), Gaps = 70/604 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
           L + ++ PA ++  +A+PIGNG +G M++G V SE ++ NE TLW+G PG +        
Sbjct: 48  LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107

Query: 66  PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
             A +A+ ++R ++  G        +      + +G     YQ  GDI L+F  SH +  
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              YRREL++  + + VKY+   V + RE+F S PD ++V K+   ++ SL+ +V  +  
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNIMVIKLKADKASSLTVDVRNEGA 222

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
            +  +    NN +I+ G               +  G+++ +  +IK+ +  G+I   ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + VE +D   +++ A + +   +  P+   +DP S     + +  NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           Y+ LF RV++ L     D  TD             E +  ++T++  SL  L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EILNEYKTNQSNSLETLFFQYGRYL 371

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSR G+  ANLQG+WN   +P W S  H N+N++MNYW +   NLSE   PL +++ 
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431

Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            L   G KTA+++          +GW ++   + +   +A   +  W   P   AW+  +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
           LWEHYN+T D+D+L +  YP+++  A F   +L+E    DG  YL ++PS SPEH     
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
                  +  +T D  +I ++F+  I A+E L  +E+   E   K    L+P +I + G 
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELEDKRERLLKP-QIGKHGQ 600

Query: 590 IMEW 593
           + EW
Sbjct: 601 VQEW 604


>gi|150003335|ref|YP_001298079.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
 gi|149931759|gb|ABR38457.1| glycoside hydrolase family 95 [Bacteroides vulgatus ATCC 8482]
          Length = 803

 Score =  288 bits (737), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 179/598 (29%), Positives = 309/598 (51%), Gaps = 44/598 (7%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           +T +    ++ +  PA+ + +++PIGNGRLGAM +GG+  E L LNE T+W+G   +  N
Sbjct: 24  ATDSCETTELWYAQPAEVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 83

Query: 66  -PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
            P   + ++ +R L   G+ +E    A   L G+      +  +GD++++F     K   
Sbjct: 84  IPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVT- 142

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             YRR L L+ A + V ++ G V + RE+F++NPD V+V +++  +  S++ N+ LD L+
Sbjct: 143 -GYRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 200

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
                   +NQ++  G+      P        P G+ F     I +  D G +  +E  +
Sbjct: 201 RQADLSVEDNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSE 249

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           + ++ +D   L++   + +  P         D  +     ++     SY +L   H+ DY
Sbjct: 250 VGIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVKKAAAKSYDELKQAHIKDY 300

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
             L++RVSI   +          +   + T    ++VK  +TD    L  L FQ+GRYL 
Sbjct: 301 NTLYNRVSIHFGQD---------ANRALPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 349

Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           I+SSR  + +   LQG +N++ +    W +  H++IN E NYW +   NL+EC  PLF +
Sbjct: 350 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 409

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +  L+ +G+KTA+V Y   GW  H   ++W  + A    ++W L+PM  +W+ +HLW  Y
Sbjct: 410 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMASSWIASHLWTQY 468

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
            +T D+ +L + AYPLL+G A F+LD+L +    GYL T PS SPE+ F    G+    S
Sbjct: 469 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 528

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
                D  +  E+ S  + A+E+L  + +   + +  ++ +L P ++  +G+I EW +
Sbjct: 529 MMPACDRELAYEILSNCVQASEILNTDRE-FADSLRTAIAQLPPIQLRANGAIREWFE 585


>gi|271969414|ref|YP_003343610.1| alpha-L-fucosidase [Streptosporangium roseum DSM 43021]
 gi|270512589|gb|ACZ90867.1| Alpha-L-fucosidase [Streptosporangium roseum DSM 43021]
          Length = 991

 Score =  288 bits (736), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 200/617 (32%), Positives = 317/617 (51%), Gaps = 72/617 (11%)

Query: 1   MMNAE------STSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNED 53
           M NAE      +  T + L + ++ PA ++ T A+PIGNG LGAMV+GGV SE ++ NE 
Sbjct: 1   MANAEPEKSAAAVQTPDDLTLWYDKPATNWETQALPIGNGALGAMVFGGVASEQIQFNEK 60

Query: 54  TLWTGVPG-------DYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD---VYQ 103
           TLWTG PG       ++T+P  P A+++V++ +D       +A + KL G P      YQ
Sbjct: 61  TLWTGGPGSGGYNAGNWTSPR-PNAIAEVQAQIDRDGRMSPSAVTAKL-GQPKSGFGAYQ 118

Query: 104 LLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI 163
             GD+ L+  D+    +   YRREL L  A ARV Y+ G V ++RE+F+S+P  VIV +I
Sbjct: 119 TFGDLWLDVPDA--PASPTGYRRELSLREAVARVGYTAGGVTYSREYFASHPGGVIVGRI 176

Query: 164 SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
           S S++G +SF +   S   +      N ++ + G                  G++F +  
Sbjct: 177 SASQAGKVSFTLRTSSPRSDKQVSVANGRLTVRGTLA-------------DNGMRFES-- 221

Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
           +I++    G+ +   D+ + V G+D A+ +L A + + G   +P+    DP ++  +A+ 
Sbjct: 222 QIQVVTQGGSRTDGTDR-VTVTGADSAMFVLSAGTDYAG--THPAYRGPDPHAKVTAAVD 278

Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
           +    ++  L T H +DY+KLF RV + L +    I TD              R+++  T
Sbjct: 279 AAAARTFDQLRTAHQNDYRKLFDRVRLDLGQRVPAIPTD--------------RLRAAYT 324

Query: 344 D----EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
                +D +L  + F +GRYLLISSSR     ANLQG+WN   SP W +  HVNINL+MN
Sbjct: 325 GRASADDRALEAMFFAYGRYLLISSSRDEALPANLQGVWNNSTSPPWSADYHVNINLQMN 384

Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKV 458
           YW +   NL+E       ++  +   G KTAQ  + + GWV+H++T+ +  +   D    
Sbjct: 385 YWLAEQTNLAETTVAYDRYIKAMVAPGRKTAQEMFGSRGWVVHNETNPFGFTGVHDWATA 444

Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETN 517
            W  +P   AW+   +++HY +  D  +L   AYP+++G A F LD L  +  DG L  +
Sbjct: 445 FW--FPEAAAWVTQQMYDHYRFNGDTAYLRDTAYPVMKGAAEFWLDNLHADPRDGKLVVS 502

Query: 518 PSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP 577
           PS SPE             S  ++M   I+ +V +  + AA  L  +  A   +V  +L 
Sbjct: 503 PSYSPEQ---------GDFSAGASMSQQIVFDVLTNSLEAARKLNVDP-AFQAEVTAALA 552

Query: 578 RL-RPTKIAEDGSIMEW 593
           +L R  ++   G + EW
Sbjct: 553 KLDRGIRVGSWGQLQEW 569


>gi|388259769|ref|ZP_10136938.1| alpha-L-fucosidase, putative, afc95A [Cellvibrio sp. BR]
 gi|387936495|gb|EIK43057.1| alpha-L-fucosidase, putative, afc95A [Cellvibrio sp. BR]
          Length = 806

 Score =  288 bits (736), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 204/610 (33%), Positives = 298/610 (48%), Gaps = 64/610 (10%)

Query: 4   AESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG- 61
           A S        I F+ PA  +  + +PIGNG LGA++ G V  + ++ NE TLWTG PG 
Sbjct: 28  ASSVQAAGGESIWFDAPAADWEREGLPIGNGALGAVIAGDVTRDRIQFNEKTLWTGGPGA 87

Query: 62  ---DYTNPDAPK--ALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGD--IELE 111
              D+  P   +  A++ VR+ ++  Q +     + KL GH    Y   Q  GD  I+  
Sbjct: 88  QGYDFGWPQQAQGDAVAQVRTTINE-QGSITPEDAAKLLGHKITAYGDYQTFGDLIIDSN 146

Query: 112 FDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
            +DS +K     YRREL L+ A   V Y  G V + RE+ +S PD VI  K S  +  S+
Sbjct: 147 KNDSDVKSVFTNYRRELSLSDAQINVSYEQGGVRYRREYLASYPDGVIAIKYSADQPASI 206

Query: 172 SFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
           SF  S+  + DN S        I +GR         A+      G+QF    +I++ +  
Sbjct: 207 SFTASVQ-VPDNRSLAVA----IDQGRI-------TASGKLHSNGLQFET--QIQLLNQG 252

Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
           G ++ ++  KL+V  +D  V+LL A + +   +  P      P       L      S+ 
Sbjct: 253 GELAVIDGNKLQVTAADSVVILLAAGTDYAQSY--PKYRGAHPHKRLHKQLNKASKKSFE 310

Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT--CSEENIDTVPSAERVKSFQTDEDPSL 349
            L   H  DYQ LF+RV++ + + P+ + T       +  D V             D +L
Sbjct: 311 QLQATHRADYQTLFNRVALDIGQKPQSLTTPKLLAGYKKGDAV------------LDRTL 358

Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
               FQFGRYLLISSSRPG+  ANLQG+WN  ++P W++  HVNINL+MNYW +   NL 
Sbjct: 359 EATYFQFGRYLLISSSRPGSLPANLQGVWNNSITPPWNADYHVNINLQMNYWLAETTNLP 418

Query: 410 ECQEPLFDFLTYLSINGSKTAQ-VNYLASGWVIHHKTDIWAKSSADRGKVVW--ALW-PM 465
           E   PLFDF+  L + G+  AQ V  +  GW +   T+IW  +    G + W  A W P 
Sbjct: 419 ELTAPLFDFVDSLVVPGTIAAQKVAGVDKGWTLFLNTNIWGFT----GVIDWPTAFWQPE 474

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
             AWL  H +EHY ++ D+ FL  RAYPL++  + F L++L++   DG    +PS SPEH
Sbjct: 475 AAAWLAQHYYEHYLFSGDKKFLRNRAYPLMKSASEFWLEFLVKDPRDGQWIVSPSFSPEH 534

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTK 583
               P  + A +S     D+  +R    A       L   +    + V + L  L R  +
Sbjct: 535 ---GPFTRAAAMSQQIVFDL--LRNTHEA------ALLTGDKKFAQAVQEKLANLDRGMR 583

Query: 584 IAEDGSIMEW 593
           I + G + EW
Sbjct: 584 IGKWGQLQEW 593


>gi|373850041|ref|ZP_09592842.1| Alpha-L-fucosidase [Opitutaceae bacterium TAV5]
 gi|372476206|gb|EHP36215.1| Alpha-L-fucosidase [Opitutaceae bacterium TAV5]
          Length = 839

 Score =  288 bits (736), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 196/624 (31%), Positives = 301/624 (48%), Gaps = 68/624 (10%)

Query: 17  FNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDV 75
           F+ PA+  +  A+PIGNGR GAM++G + +E L+LNED+LW G P D  NPDA + L  +
Sbjct: 14  FDQPAQQDWNRALPIGNGRFGAMIYGNIVAERLQLNEDSLWNGGPRDRRNPDAREHLPVL 73

Query: 76  RSLVDSGQYAEA-TAASVKLFGHP--ADVYQLLGDIELEF-----------DDSHLKYAE 121
           R L+  G+ A A       L G P     Y+ L D+ L F           D+  L    
Sbjct: 74  RQLLADGRLAAAHDLVHDALAGIPDSQRCYEPLADLFLHFEHPGAPVAVSADEMALASGY 133

Query: 122 ET----------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
            T          YRR LDL TA   V Y++ N  + R H +S  DQVI   +     G L
Sbjct: 134 ATPRFDPALLSHYRRALDLRTAVMSVDYTLNNTSYARRHLASTVDQVIALHLRAGRPGGL 193

Query: 172 SFNVSLDS---------LLDNHSYVNGNNQIIMEGRC-PGKRIPPKANANDDPKGIQFSA 221
           +  + L+            D   +V    +   + R  P   +  +A   D   G++F+ 
Sbjct: 194 TLRLRLERGPRESYSTRYADTVGFVADAAREPADARTSPALLLRGRAGGED---GVRFAV 250

Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
            L  +I+   G +  +  + L ++ +D   L+L A+++F          + DP +  +  
Sbjct: 251 GLRARIAG--GALRRI-GETLCIDAADSVTLVLAAATTF---------REDDPAAFVIGR 298

Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK-S 340
             +     +  +   H  +Y+  F R S+ L            +E    ++P   R+K +
Sbjct: 299 TGAALARGWDKIRADHEREYRSRFDRASLTLG-------APAAAEAGAGSIPVDLRLKRA 351

Query: 341 FQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
            ++  DP L  L F + RYLLISSSRPG+  ANLQG+WN D  P+W S   +NIN EMNY
Sbjct: 352 RESGGDPVLASLYFNYARYLLISSSRPGSLPANLQGLWNADFWPSWGSKYTININTEMNY 411

Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
           W + P NL++C +PLFD L  +  +G +TA+V Y   G+V HH TD+WA +         
Sbjct: 412 WIAEPANLADCHQPLFDHLGRVVESGRETARVMYGCRGFVAHHNTDLWADTCPTDRNAGA 471

Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
           + W +GGAWL  H W+ ++Y  D   L   AY LL   + F LD+LIE   G L  +P+ 
Sbjct: 472 SYWTIGGAWLVLHAWDRFDYDRDPGSLAA-AYALLREASLFFLDFLIEDARGRLVLSPTC 530

Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA---------LVEK 571
           SPE+ +  P+G+   +    TMD  ++  +F     AA++L +   A          + +
Sbjct: 531 SPENTYRLPNGEAGVLCAGCTMDSQLLSILFRRTAQAAQLLGRRPLAAAAIAGDHDFLAR 590

Query: 572 VLKSLPRLRPTKIAEDGSIMEWVQ 595
           V  +  RL    +   G ++EW++
Sbjct: 591 VAAAAARLPQPAVGRHGQLLEWLE 614


>gi|168206072|ref|ZP_02632077.1| fibronectin type III domain protein [Clostridium perfringens E str.
           JGS1987]
 gi|170662403|gb|EDT15086.1| fibronectin type III domain protein [Clostridium perfringens E str.
           JGS1987]
          Length = 1479

 Score =  288 bits (736), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 189/604 (31%), Positives = 311/604 (51%), Gaps = 70/604 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
           L + ++ PA  +  +A+PIGNG +G M++G V SE ++ NE TLW+G PG +        
Sbjct: 48  LALWYDEPATSWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107

Query: 66  PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
             A +A+ ++R ++  G        +      K +G     YQ  GDI L+F  SH +  
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQKAYG----AYQNFGDIFLDFK-SHEESK 162

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              YRREL++  + + VKY+   V + RE+F S PD V+V K+   ++ SL+ +V  +  
Sbjct: 163 ITNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
            +  +    NN +I+ G               +  G+++ +  +IK+ +  G+I   ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIKDKEDR 267

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + VE +D   +++ A + +   +  P+   +DP S     + +  NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           Y+ LF RV++ L     D  TD             E +  ++T++  SL  L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSR G+  ANLQG+WN   +P W S  H N+N++MNYW +   NLSE   PL +++ 
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431

Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            L   G KTA+++          +GW ++   + +   +A   +  W   P   AW+  +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
           LWEHY +T D+D+L +  YP+++  A F   +L+E    DG  YL ++PS SPEH     
Sbjct: 491 LWEHYQFTEDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
                  +  +T D  +I ++F+  I A+E L  +E+   E   K    L+P +I + G 
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGVDEEFRAELENKRERLLKP-QIGKHGQ 600

Query: 590 IMEW 593
           + EW
Sbjct: 601 VQEW 604


>gi|346311070|ref|ZP_08853080.1| hypothetical protein HMPREF9452_00949 [Collinsella tanakaei YIT
           12063]
 gi|345901764|gb|EGX71561.1| hypothetical protein HMPREF9452_00949 [Collinsella tanakaei YIT
           12063]
          Length = 770

 Score =  288 bits (736), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 191/598 (31%), Positives = 290/598 (48%), Gaps = 62/598 (10%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +++ +  PA  + +A+PIGNG +  MV+GGV +E   LN++T+W   P D  NP +   L
Sbjct: 1   MRLWYTSPASVWNEALPIGNGHIAGMVFGGVENEKFSLNDETIWYRGPADRNNPSSADNL 60

Query: 73  SDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R L+  G    A    ++ +F  P D   Y++LG++ LE     L+ A E+Y RELD
Sbjct: 61  GKIRELLAVGDVEAAEDLVALTMFATPRDQSHYEVLGEMFLEQRGVALE-ACESYERELD 119

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L  A  RV +S G V++ RE+FSS    VI+ +++ S+ GS+S   +L            
Sbjct: 120 LENALCRVSFSCGGVDYRREYFSSFARNVILARLTASKEGSISLRATL------------ 167

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQ-----------FSAILEIKISDDRGTISALE 238
                  GRC  KR         D   I            F   L +   D  G++  L 
Sbjct: 168 -------GRC--KRFNDSVRQYRDRGVIMAAHAGGAAGVGFEVGLRVVSCD--GSVRVLG 216

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           +  +  E ++  VL LV+S+ +       S    +P + S+  +     L +      H+
Sbjct: 217 ETIVVDEATE-VVLALVSSTDY------WSAGAVEPDASSL--MDGFDGLDFDCALDDHV 267

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED-PSLVELLFQFG 357
             Y++ + RV++           D  ++E   ++P+   +   +     P L+ L F +G
Sbjct: 268 AAYREQYGRVAL-----------DIAADEEAPSIPTDGLIACAREGRHVPYLLNLAFDYG 316

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLL+SSS+PG   ANLQGIW ED+ P W S   +NIN EMNYW   P +L E Q PLFD
Sbjct: 317 RYLLLSSSQPGGLPANLQGIWCEDIDPIWGSKYTININTEMNYWMCGPADLPEAQLPLFD 376

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            L  +   G +TA+  Y A G+  HH TD +A ++     +  A+WP+   WL TH+WE 
Sbjct: 377 LLERMREPGRRTARAMYGARGFTCHHNTDGFADTAPQSHAIGAAVWPLTVPWLLTHVWEQ 436

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y +  D   L +    + +    F  D+L E + GYL T PS SPE+ +  P+G    V 
Sbjct: 437 YRFFGDASVLAEH-LDMFKEALLFFEDYLFE-YQGYLVTGPSASPENRYRLPNGVEGNVC 494

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            S  +D  I+R  F   +  A VL    D   ++      RL PT+I   G I EW++
Sbjct: 495 LSPAIDNQILRFFFDCCVDVARVLGDQSD-FADRAKALAERLPPTRIGSHGQIQEWLE 551


>gi|299149390|ref|ZP_07042447.1| fibronectin type III domain protein [Bacteroides sp. 3_1_23]
 gi|298512577|gb|EFI36469.1| fibronectin type III domain protein [Bacteroides sp. 3_1_23]
          Length = 859

 Score =  288 bits (736), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 205/633 (32%), Positives = 319/633 (50%), Gaps = 74/633 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--------Y 63
           LK T+N PAK + ++A+PIGNG +GAM++GGV  + ++ NE TLW+G PG+         
Sbjct: 32  LKATYNKPAKVWESEALPIGNGYMGAMIFGGVEVDVIQTNEHTLWSGGPGEDPSYNGGHL 91

Query: 64  TNPDAPKA-LSDVRSL---------VDSGQYAEATAASV------------------KLF 95
             P+  K+ L   R L         V+   Y +A    +                  KL 
Sbjct: 92  GTPETNKSYLHKTRVLLQQKMNDFTVNHSAYIDADGKLITHNYEGDGNGTELRNLIDKLA 151

Query: 96  GHPADV--YQLLGDIELEF-DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFS 152
           G       +Q L +I +E  + +  + A   Y R LD++ A  RV Y  G + F RE+F 
Sbjct: 152 GTKEHFGSFQTLSNIIVEVVNPATSEPAYSDYTRTLDIDNAIHRVTYKEGGITFKREYFM 211

Query: 153 SNPDQVIVTKI-SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN 211
           S PD ++V ++ S S+ G +S  +SL+SL  +      +N I + G  P      K   +
Sbjct: 212 SYPDNIMVMRLTSDSKKGKISRMISLESLHTDKVIRASDNTITLTGY-PTPTSGDKRVGD 270

Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD-- 269
               G++++  L +K +   G I+ ++ KKLK+E +   ++L+ A++++     +     
Sbjct: 271 HWKNGLKYAQQLLVKHTG--GKITVVDGKKLKIEEAKEIIVLMSAATNYVQCMDDSYHYF 328

Query: 270 SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
           S ++P  +  + L+   N  Y+ L   H  DY  L+ R+ + L   P+  V  T      
Sbjct: 329 SGEEPLDKVKATLKKAANKKYTALLATHEKDYHSLYDRMKLNLGNLPEMPVVTT------ 382

Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
           D++       +    E+  L  L FQFGRYLLISSSR G+  ANLQG+W E LS  W+S 
Sbjct: 383 DSLLKGMDAHTNSESENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNSD 442

Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHH 443
            H NIN++MNYW + P NLS C  P+ +++  L   G  TAQ  Y         GWV HH
Sbjct: 443 YHTNINVQMNYWPTQPTNLSRCHLPMVEYVKSLVPRGKYTAQQYYCKPDGGNVRGWVTHH 502

Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
           + +IW  ++  + K     +P G  W+C  +WE+Y + +D+DFLE   Y ++   A F +
Sbjct: 503 ENNIWGNTAPAK-KDTPHHFPAGAIWMCQDIWEYYQFNLDKDFLEAY-YDVMLQAALFWV 560

Query: 504 D--WLIEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV 560
           D  W  E  DG L  NPS SPEH EF      L C     +   A+I E+F  +I A++V
Sbjct: 561 DNLWTDE-RDGTLVANPSHSPEHGEF-----SLGC-----STSQAMIAEMFDMMIKASKV 609

Query: 561 LEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           L K+++  + ++  ++ +L   KI   G +MEW
Sbjct: 610 LGKDKEPEIAEIETAMNKLSGPKIGLGGQLMEW 642


>gi|119499317|ref|XP_001266416.1| hypothetical protein NFIA_040960 [Neosartorya fischeri NRRL 181]
 gi|119414580|gb|EAW24519.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 792

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 187/598 (31%), Positives = 309/598 (51%), Gaps = 47/598 (7%)

Query: 11  NPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           NP   T +  PA  F   +PIGNGRL A +WGG   + + LNE+++W+G   D  NP+A 
Sbjct: 22  NPSTYTWYTTPAADFASTLPIGNGRLAAAIWGGA-VDNITLNENSIWSGPFQDRVNPNAY 80

Query: 70  KALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
           +  +D R+++++G  + A    ++ +   P+    Y  LG + L+F   H   + ++Y R
Sbjct: 81  EGFTDSRAMLEAGNLSSANDVVLQDMVSIPSSPREYHPLGSLRLDF--GHDATSLQSYTR 138

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            LDL T  A V+Y VG+V ++RE+ +S+PD V+  ++  S++G+L+   SL+       Y
Sbjct: 139 FLDLGTGVAGVRYQVGDVVYSREYVTSHPDGVLAVRLRASKNGALNVVTSLE----RSRY 194

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           V     +   G      +  KAN+      I+F+A   +    +RG         + V G
Sbjct: 195 VESLTAVSSRGMG---TLTLKANSGQSTDPIRFTAQARVV---NRGGRITTNGTAVVVAG 248

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +    +     +S+      P ++++D   +    L +    SY  +      DY+ L  
Sbjct: 249 ASTVDIFFDTQTSYR----YPDETERDAVVKKQ--LDAAVKASYPAVKQAATSDYKSLSG 302

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISS 364
           RV + L            S  +    P+  R+K+++TD   DP L+ L+F FGR+ LI+S
Sbjct: 303 RVKLDLG-----------SSGSAGNQPTDIRLKNYKTDPDRDPELMTLMFNFGRHSLIAS 351

Query: 365 SRPGTQV---ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           SR G+     ANLQGIWN+D SP W     V++NL+MNYW +   NL++  EP+ D +  
Sbjct: 352 SRAGSSSGLPANLQGIWNQDYSPAWGGKYTVDVNLQMNYWHAQVTNLADTFEPVIDLMDK 411

Query: 422 LSINGSKTAQVNYLA-SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
           +  +G   A+  Y   +G+++HH TD+W  ++       W +WPMG AWL  +L + + +
Sbjct: 412 VVPHGQDVAKKMYHCDTGYILHHNTDLWGDAAPVDNGTKWTMWPMGSAWLSMNLMDQFRF 471

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
           T D+  L++R +PLL+  A F   +L +  +GY  + PS SPE+ FI P+     GK   
Sbjct: 472 TQDKTLLQERIWPLLKSAADFYYCYLFD-FEGYYTSGPSISPENAFIIPEDMTIAGKSTG 530

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +  S TMD  ++ E+F+A+I   + L+   + L     K + R+R  +I   G I+EW
Sbjct: 531 IDLSPTMDNLLLHELFTAVIETCKALDITGEDLT-NAHKYISRIRHPQIGSYGQILEW 587


>gi|224537148|ref|ZP_03677687.1| hypothetical protein BACCELL_02025 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521203|gb|EEF90308.1| hypothetical protein BACCELL_02025 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 776

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 188/583 (32%), Positives = 288/583 (49%), Gaps = 43/583 (7%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
           PA  +  A+PIGNGR+G M++G   +E + +NE+T+W G P    NP  P+ ++ +R+L+
Sbjct: 32  PASDWKSALPIGNGRIGGMIFGDPSTEQIVINENTIWCGPPLPPNNPKGPELINKMRNLI 91

Query: 80  DSGQYAEATAASVKLFG----HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
            +G+Y EA     K F       A  YQ  G + ++F D   K A   Y+R LD   A  
Sbjct: 92  FNGKYEEAVIVCEKEFADGVHENARSYQPFGFLNIDFKD---KGAISNYKRWLDYTKAIT 148

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
            V Y+   V +TRE F S P++V+V +I+  + G +SF           +    N    +
Sbjct: 149 YVSYTQNGVTYTREAFVSKPNEVMVVRITADKPGQVSFKSKYTRPFGATTKAENNRSQYV 208

Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
           +G+   +        N +  G++F  I  I   ++ G I A E   +++  ++   +++ 
Sbjct: 209 QGQAYAE--------NGEFVGVKFEGI--INYYNEGGKIKANETD-IEINNANSVTIMIA 257

Query: 256 ASSSFDGPFINPSDSKKDPTSESMS----ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
            S+ +     N  D+K   T          L   + L Y  L   H+D+Y  L++R S  
Sbjct: 258 ISTDY-----NIHDTKNVLTHNRKKICEKQLSQAQKLGYKKLKQTHIDEYSALYNRSSF- 311

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQ 370
                 DI  +T    N    P  +R++   + + D  L+   + + RYL ISSSR G  
Sbjct: 312 ------DITFNTPVNNN----PIDKRIQLAASGQIDSELLFEYYNYCRYLFISSSRKGGL 361

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
             NLQGIWN  +   W S  H+N+N++  YW +   NLSEC EP+F     L  NG +TA
Sbjct: 362 PMNLQGIWNPLMLAPWRSNFHINVNIQEAYWFAEQANLSECHEPIFTLTENLIKNGKETA 421

Query: 431 QVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           QV +    G V  H+TD W  +     K  W +     AWLC H  EHY YT+D++FL+ 
Sbjct: 422 QVMFGTKRGSVAGHRTDAWFYAPPTFLKAHWGMSITNAAWLCLHHMEHYRYTLDKEFLKT 481

Query: 490 RAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
           RA P+L   A F +DWL+ +   G L + P+ SPE+ F   +GK+A ++   T D  II 
Sbjct: 482 RALPILRETALFFVDWLVPDPRSGKLVSGPTASPENRFKV-NGKVASLTMGCTYDQEIIW 540

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
             F   + A ++L  N +  VE V  S+ +L    IA DG +M
Sbjct: 541 NTFRDFLEACKILGINNEETVE-VEASMKKLSMPTIANDGRLM 582


>gi|168215503|ref|ZP_02641128.1| fibronectin type III domain protein [Clostridium perfringens NCTC
           8239]
 gi|182382428|gb|EDT79907.1| fibronectin type III domain protein [Clostridium perfringens NCTC
           8239]
          Length = 1479

 Score =  287 bits (734), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 188/604 (31%), Positives = 311/604 (51%), Gaps = 70/604 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
           L + ++ PA  +  +A+PIGNG +G M++G V SE ++ NE TLW+G PG +        
Sbjct: 48  LALWYDEPATKWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107

Query: 66  PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
             A +A+ ++R ++  G        +      + +G     YQ  GDI L+F  SH +  
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              YRREL++  + + VKY+   V + RE+F S PD V+V K+   ++ SL+ +V  +  
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
            +  +    NN +I+ G               +  G+++ +  +IK+ +  G+I   ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + VE +D   +++ A + +   +  P+   +DP S     + +  NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           Y+ LF RV++ L     D  TD             E +  ++T++  SL  L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSR G+  ANLQG+WN   +P W S  H N+N++MNYW +   NLSE   PL +++ 
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431

Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            L   G KTA+++          +GW ++   + +   +A   +  W   P   AW+  +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
           LWEHY +T D+D+L +  YP+++  A F   +L+E    DG  YL ++PS SPEH     
Sbjct: 491 LWEHYQFTEDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
                  +  +T D  +I ++F+  I A+E L  +E+   E   K    L+P +I + G 
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGVDEEFRAELENKRERLLKP-QIGKHGQ 600

Query: 590 IMEW 593
           + EW
Sbjct: 601 VQEW 604


>gi|255532706|ref|YP_003093078.1| alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
 gi|255345690|gb|ACU05016.1| Alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
          Length = 940

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 176/502 (35%), Positives = 263/502 (52%), Gaps = 47/502 (9%)

Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
           YQ  GD+ L F   +   A   Y+R+LDLNTA A   Y++  + + RE+ +S PDQ IV 
Sbjct: 295 YQPFGDLYLNFKTEN--EAVTNYKRKLDLNTAVASTTYTLKGINYLREYLASQPDQAIVI 352

Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
           +++  + GS+SF    D+LL +    +G  +I         ++            ++  +
Sbjct: 353 RLTADKKGSISF----DALLGSPHKYSGVKKINANTIALSLKVRDGV--------LKGES 400

Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
            L+  I+  +  ++A    K+ +  +D   L L A +SF    +N  D   +P S ++ A
Sbjct: 401 RLQAIITKGKLLVTA---NKISIVAADAVTLYLTAGTSF----VNDKDVSGNPASAAVKA 453

Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
           L  +   SY+ +   H+ +YQK +   S+      K             ++P+ ER++ F
Sbjct: 454 LTGLNGKSYAQVKAAHIKEYQKYYTAFSVSFGPDSKA------------SLPTDERIEQF 501

Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
               DP+   L  Q+GRYLLISSSRPGTQ ANLQGIWNE L+P W S    NINLEMNYW
Sbjct: 502 SDGNDPAFAALFMQYGRYLLISSSRPGTQPANLQGIWNELLTPPWGSKYTTNINLEMNYW 561

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
            +   NLS   EPL   +  L+ NG  TA+V+Y A GWV+HH TD+W   +A        
Sbjct: 562 PTGVLNLSAMAEPLIRKINALAKNGEVTAKVHYNAKGWVLHHNTDLW-NGTAPINASNHG 620

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPST 520
           +W  G  WL  HLWEHY +T D +FL+  AYP+++  A F  D+LI+    G+L + PS 
Sbjct: 621 IWVSGAGWLSQHLWEHYLFTQDLNFLKNEAYPVMKQAAVFFNDFLIKDPKTGWLISTPSN 680

Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRL 579
           SPE      +G L       TMD  IIR +F   I+A  +L    DA  +K L + +  +
Sbjct: 681 SPE------NGGLVA---GPTMDHQIIRTLFRNCIAATALL--GVDADFKKTLEQKITLI 729

Query: 580 RPTKIAEDGSIMEWVQRRLNTS 601
            P +I + G + EW++ + +T+
Sbjct: 730 APNQIGKYGQLQEWLEDKDDTT 751



 Score = 75.9 bits (185), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 52/82 (63%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA+ +TDA+PIGNGRLGAM++ GV  + ++ NE+TLWTG P DY +  A   L 
Sbjct: 32  QLWYTKPAEKWTDALPIGNGRLGAMIFAGVEKDHIQFNEETLWTGGPRDYNHKGAAAYLP 91

Query: 74  DVRSLVDSGQYAEATAASVKLF 95
            +R L+  G   EA   + + F
Sbjct: 92  QIRQLLFEGNQQEAEKLAAEKF 113


>gi|374376430|ref|ZP_09634088.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
 gi|373233270|gb|EHP53065.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
          Length = 946

 Score =  286 bits (732), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 173/495 (34%), Positives = 263/495 (53%), Gaps = 40/495 (8%)

Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
           YQ  GD+    +    K  +  YRR LDL TA     Y+   V+F R + +S P QV+  
Sbjct: 289 YQPFGDVVFHVNADETKVKD--YRRVLDLETAVLTTAYNYNGVDFKRTYIASQPQQVLAV 346

Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
             + S  GS+SF   L S    H  V   +Q         + +  K    D    ++  +
Sbjct: 347 NFTASRPGSVSFETELTSP-HQHFIVEAVDQ---------QTLVLKIQVKDG--ALRGES 394

Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
            ++++++  +G++ A++D KL V  +D A + + A+++F     N  D   DP++   +A
Sbjct: 395 YVQVRVT--KGSV-AVKDNKLIVSKADEATVFIAAATNFK----NFKDVSADPSARCRAA 447

Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
           ++ I+  S++ +   H+ +YQ+ F+ +S+           +       +++P+  R++ F
Sbjct: 448 IKGIQQQSFASVLKAHVKEYQQYFNTLSVNFYGQKNQPSAN-------ESLPTDLRLEKF 500

Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
               DP  V L  Q+GRYLLISSSRPGT  ANLQGIWNE LSP W S    NIN EMNYW
Sbjct: 501 ARSGDPEFVALYMQYGRYLLISSSRPGTYPANLQGIWNELLSPPWGSKYTTNINAEMNYW 560

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
            +    LS   + LF  +  L+++G +TA+  Y A GWV+HH TD+W  ++A        
Sbjct: 561 PAELLGLSPLHDALFKMVEELAVSGKETAKEYYNAPGWVLHHNTDLWRGTAAINASNH-G 619

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPST 520
           +W  GGAWLC+HLWE Y +T D  FL+  AYP++   A F   +LI+    GYL + PS 
Sbjct: 620 IWVTGGAWLCSHLWERYLFTKDERFLKDTAYPIMREAALFFNHFLIKDPVTGYLISTPSN 679

Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
           SPEH      G L       TMD  IIR +F + I A+++L K + AL +++ +  PR+ 
Sbjct: 680 SPEH------GGLVA---GPTMDHQIIRALFKSTIEASQIL-KTDAALRKELEEKYPRIA 729

Query: 581 PTKIAEDGSIMEWVQ 595
           P KI   G + EW+Q
Sbjct: 730 PNKIGRFGQLQEWMQ 744



 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 36/75 (48%), Positives = 53/75 (70%)

Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
          LK+ +  PAK + +A+PIGNGRLGAMV+GGV ++ ++ NE+TLW+G P DY    A + L
Sbjct: 24 LKLWYQHPAKEWVEALPIGNGRLGAMVFGGVQTDRVQFNEETLWSGYPRDYNKKGAYRYL 83

Query: 73 SDVRSLVDSGQYAEA 87
            +R L+ +G+  EA
Sbjct: 84 DSIRGLLFAGKQKEA 98


>gi|391227681|ref|ZP_10263888.1| hypothetical protein OpiT1DRAFT_00166 [Opitutaceae bacterium TAV1]
 gi|391223174|gb|EIQ01594.1| hypothetical protein OpiT1DRAFT_00166 [Opitutaceae bacterium TAV1]
          Length = 839

 Score =  286 bits (731), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 196/633 (30%), Positives = 302/633 (47%), Gaps = 86/633 (13%)

Query: 17  FNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDV 75
           F+ PA+  +  A+PIGNGR GAM++G + +E L+LNED+LW G P D  NPDA + L  +
Sbjct: 14  FDQPAQQDWNRALPIGNGRFGAMIYGNIVAERLQLNEDSLWNGGPRDRRNPDAREHLPVL 73

Query: 76  RSLVDSGQYAEA-TAASVKLFGHP--ADVYQLLGDIELEF-----------DDSHLKYAE 121
           R L+  G+ A A       L G P     Y+ L D+ L F           D+  L    
Sbjct: 74  RKLLADGRLAAAHDLVHDALAGIPDSQRCYEPLADLFLHFEHPGAPVAVSADEMALASGY 133

Query: 122 ET----------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
            T          YRR LDL TA   V Y++ N  + R H +S  DQVI   +     G L
Sbjct: 134 ATPRFDPALLSHYRRALDLRTAVMSVDYTLNNTSYARRHLASAADQVIALHLRAGRPGGL 193

Query: 172 SFNVSLDS---------LLDNHSYVN----------GNNQIIMEGRCPGKRIPPKANAND 212
           +  + L+            D   +V            +  +++ GR  G+          
Sbjct: 194 TLRLRLERGPRKSYSTRYADTVGFVADAAREPSDACASPALLLRGRAGGE---------- 243

Query: 213 DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
              G++F+  L  +I+   G +  +  + L ++ +D   L+L A+++F          + 
Sbjct: 244 --DGVRFAVGLRARIAG--GALRRI-GETLCIDAADSVTLVLAAATTF---------RED 289

Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
           DP +  +    +     +  +   H  +Y+  F R S+ L            +E   ++V
Sbjct: 290 DPAAFVIGRTGAALARGWDKIRADHEREYRSRFDRASLTLG-------APAAAEAGAESV 342

Query: 333 PSAERVK-SFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPH 391
           P   R+K + ++  DP L  L F + RYLLISSSRPG+  ANLQG+WN D  P+W S   
Sbjct: 343 PVDLRLKRARESGGDPVLASLYFNYARYLLISSSRPGSLPANLQGLWNADFWPSWGSKYT 402

Query: 392 VNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS 451
           +NIN EMNYW + P NL++C +PLFD L  +  +G +TA+V Y   G+V HH TD+WA +
Sbjct: 403 ININTEMNYWIAEPANLADCHQPLFDHLGRVVESGRETARVMYGCRGFVAHHNTDLWADT 462

Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
                    + W +GGAWL  H W+ ++Y  D   L   AY LL   + F LD+LIE   
Sbjct: 463 CPTDRNAGASYWTIGGAWLVLHAWDRFDYDRDPGSLAA-AYALLREASLFFLDFLIEDAR 521

Query: 512 GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA---- 567
           G L  +P+ SPE+ +  P+G+   +    TMD  ++  +F     AA++L +   A    
Sbjct: 522 GRLVLSPTCSPENTYRLPNGEAGVLCAGCTMDSQLLSILFRRTAQAAQLLGRRPLAAAAI 581

Query: 568 -----LVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
                 + +V  +  RL    +   G ++EW++
Sbjct: 582 AGDHDFLARVAAAAARLPQPAVGRHGQLLEWLE 614


>gi|383113365|ref|ZP_09934137.1| hypothetical protein BSGG_3069 [Bacteroides sp. D2]
 gi|313695534|gb|EFS32369.1| hypothetical protein BSGG_3069 [Bacteroides sp. D2]
          Length = 859

 Score =  286 bits (731), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 202/633 (31%), Positives = 319/633 (50%), Gaps = 74/633 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--------Y 63
           LK T+N PAK + ++A+PIGNG +GAM++GGV  + ++ NE TLW+G PG+         
Sbjct: 32  LKATYNKPAKVWESEALPIGNGYMGAMIFGGVEVDVIQTNEHTLWSGGPGEDPSYNGGHL 91

Query: 64  TNPDAPKA-LSDVRSLVD------SGQYAEATAASVKLFGHPAD---------------- 100
             P+  K+ L   R L+       +  ++    A  KL  H  +                
Sbjct: 92  GTPETNKSYLHKTRVLLQQKMNDFTANHSAYIDADGKLITHNYEGDGNGTELRNLIDKLA 151

Query: 101 -------VYQLLGDIELEF-DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFS 152
                   +Q L +I +E  + +  + A   Y R LD++ A  RV Y  G + F RE+F 
Sbjct: 152 GTKEHFGSFQTLSNIIVEVVNPATSEPAYSDYTRTLDIDNAIHRVTYKEGGITFKREYFM 211

Query: 153 SNPDQVIVTKI-SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN 211
           S PD ++V ++ S S+ G +S  +SL+SL  +      +N I + G  P      K   +
Sbjct: 212 SYPDNIMVMRLTSDSKKGKISRMISLESLHTDKVIRASDNTITLTGY-PTPTSGDKRVGD 270

Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD-- 269
               G++++  L +K +   G I+ ++ KKLK+E +   ++L+ A++++     +     
Sbjct: 271 HWKNGLKYAQQLLVKHTG--GKITVVDGKKLKIEEAKEIIVLMSAATNYVQCMDDSYHYF 328

Query: 270 SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
           S ++P  +  + L+   N  Y+ L   H  DY  L+ R+ + L    +  V  T      
Sbjct: 329 SGEEPLDKVKATLKKAANKKYTALLAAHEKDYHSLYDRMKLNLGNLTEMPVVTT------ 382

Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
           D++      ++    E+  L  L FQFGRYLLISSSR G+  ANLQG+W E LS  W+S 
Sbjct: 383 DSLLKGMDARTNSESENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNSD 442

Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHH 443
            H NIN++MNYW + P NLS C  P+ +++  L   G  TAQ  Y         GWV HH
Sbjct: 443 YHTNINVQMNYWPTQPTNLSRCHLPMVEYVKSLVPRGKYTAQQYYCKPDGGNVRGWVTHH 502

Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
           + +IW  ++  + K     +P G  W+C  +WE+Y + +D+DFLE   Y ++   A F +
Sbjct: 503 ENNIWGNTAPAK-KDTPHHFPAGAIWMCQDIWEYYQFNLDKDFLEAY-YDVMLQAALFWV 560

Query: 504 D--WLIEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV 560
           D  W  E  DG L  NPS SPEH EF      L C     +   A+I E+F  +I A++V
Sbjct: 561 DNLWTDE-RDGTLVANPSHSPEHGEF-----SLGC-----STSQAMIAEMFDMMIKASKV 609

Query: 561 LEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           L K+++  + ++  ++ +L   KI   G +MEW
Sbjct: 610 LGKDKEPEIAEIETAMNKLSGPKIGLGGQLMEW 642


>gi|423223092|ref|ZP_17209561.1| hypothetical protein HMPREF1062_01747 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392639998|gb|EIY33805.1| hypothetical protein HMPREF1062_01747 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 776

 Score =  285 bits (730), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 187/583 (32%), Positives = 288/583 (49%), Gaps = 43/583 (7%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
           PA  +  A+PIGNGR+G M++G   +E + +NE+T+W G P    NP  P+ ++ +R+L+
Sbjct: 32  PASDWKSALPIGNGRIGGMIFGDPSTEQIVINENTIWCGPPLPPNNPKGPELINKMRNLI 91

Query: 80  DSGQYAEATAASVKLFG----HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
            +G+Y EA     K F       A  YQ  G + ++F D   K A   Y+R LD   A  
Sbjct: 92  FNGKYEEAVIVCEKEFADGVHENARSYQPFGFLNIDFKD---KGAISNYKRWLDYTKAIT 148

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
            V Y+   V +TRE F S P++V+V +I+  + G +SF           +    N    +
Sbjct: 149 YVSYTQNGVTYTREAFVSKPNEVMVVRITADKPGQVSFKSKYTRPFGATTKAENNRSQYV 208

Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
           +G+   +        N +  G++F  I  I   ++ G I A     +++  ++   +++ 
Sbjct: 209 QGQAYAE--------NGEFVGVKFEGI--INYYNEGGKIKA-NGTDIEINNANSVTIMIA 257

Query: 256 ASSSFDGPFINPSDSKKDPTSESMS----ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
            S+ +     N  D+K   T          L   + L Y  L   H+D+Y  L++R S  
Sbjct: 258 ISTDY-----NIHDTKNVLTHNRKKICEKQLSQAQKLGYKKLKQTHIDEYSALYNRSSF- 311

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQ 370
                 DI  +T    N    P  +R++   + + D  L+   + + RYL ISSSR G  
Sbjct: 312 ------DIAFNTPVNNN----PIDKRIQLAASGQIDSELLFEYYNYCRYLFISSSRKGGL 361

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
             NLQGIWN  +   W S  H+N+N++  YW +   NLSEC EP+F     L  NG +TA
Sbjct: 362 PMNLQGIWNPLMLAPWRSNFHINVNIQEAYWFAEQANLSECHEPMFTLTENLIKNGKETA 421

Query: 431 QVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           QV +    G V  H+TD W  +     K  W +     AWLC H  EHY YT+D++FL+ 
Sbjct: 422 QVMFGTKRGSVAGHRTDAWFYAPPTFLKAHWGMSITNAAWLCLHHMEHYRYTLDKEFLKT 481

Query: 490 RAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
           RA P+L   A F +DWL+ +   G L + P+ SPE+ F   +GK+A ++ S T D  II 
Sbjct: 482 RALPVLRETALFFVDWLVPDPRSGKLVSGPTASPENRFKV-NGKVASLTMSCTYDQEIIW 540

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
             F   + A ++L  + +  VE V  S+ +L    IA DG +M
Sbjct: 541 NTFRDFLEACKILGISNEETVE-VEASMKKLSMPTIANDGRLM 582


>gi|149277534|ref|ZP_01883675.1| hypothetical protein PBAL39_05083 [Pedobacter sp. BAL39]
 gi|149231767|gb|EDM37145.1| hypothetical protein PBAL39_05083 [Pedobacter sp. BAL39]
          Length = 780

 Score =  285 bits (729), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 185/611 (30%), Positives = 315/611 (51%), Gaps = 54/611 (8%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           M+++    +   LK+ +  PA+ + + + +GNGRLG M  GG+  ET+ LN+ TLW+G P
Sbjct: 15  MLSSNGVFSQAKLKLWYEHPAQKWEETLALGNGRLGMMPDGGITRETVVLNDITLWSGAP 74

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEF 112
            D  N +A K+L  +R L+  G+  EA     + F        G     +Q+LG +++ F
Sbjct: 75  QDANNYEASKSLPQIRKLLAEGKNDEAQELVNRDFICTGKGSGGVNYGCFQVLGTLQMNF 134

Query: 113 D---DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
                +  +  +  Y REL +  A A   Y +  V++ +E+ +S  D + + +I+  + G
Sbjct: 135 SYPGATADQLKDNGYHRELSIGEAIASSSYQINGVKYKKEYLTSFDDDICLIRITADKPG 194

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
           +L+F VS+       + + G  ++ ++G+          +   D KG+Q+ + +   +  
Sbjct: 195 ALNFKVSISRPERGEASIAGQ-ELQLQGQL---------DNGIDGKGMQYLSRVRAVLKG 244

Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
            + T     +K+  V      V+L VAS    G     SD +   T + M+A    R   
Sbjct: 245 GKLTT----EKEALVISKATEVILFVAS----GTDFRASDFRMK-TEQVMAAAMKKR--- 292

Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDP 347
           Y+   + H+ ++Q LF+RVS+            +   + +D+VP+  R++ F  +   D 
Sbjct: 293 YALQRSNHIRNFQHLFNRVSV------------SIGHQLMDSVPTDLRLERFHKNPAADL 340

Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
               L +QFGRYL ISS+R G    NLQG+W   +   W    H+++N++MN+W     N
Sbjct: 341 GFPALFYQFGRYLSISSTRVGLLPPNLQGLWANQIQTPWTGDYHLDVNVQMNHWPVEVSN 400

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           LSE   PL + +  L   G +TA+  Y A GW+ H  T++W  +        W     G 
Sbjct: 401 LSELNLPLAELVRGLVKPGQRTAKAYYNADGWIAHVITNVWGFTEPGE-SASWGSSNAGS 459

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEF 526
            WLC +LW+HY ++ D+++L +  YP+L+G A F    L+   + G+L T PS SPE+ F
Sbjct: 460 GWLCNNLWDHYAFSNDKEYL-RSIYPILKGSAEFYNSVLVRDEETGWLVTAPSVSPENSF 518

Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV--LEKNEDALVEKVLKSLPRLRPTKI 584
             P+GK A +S   T+D  I+RE+F  +I+A+E+  L+    A++++ LKS+P      I
Sbjct: 519 YLPNGKTASISMGPTIDNQIVRELFGNVIAASEMLGLDAGFRAILQEKLKSIP--PAGNI 576

Query: 585 AEDGSIMEWVQ 595
           ++DG IMEW++
Sbjct: 577 SKDGRIMEWLR 587


>gi|422874794|ref|ZP_16921279.1| fibronectin type III domain-containing protein [Clostridium
           perfringens F262]
 gi|380304435|gb|EIA16724.1| fibronectin type III domain-containing protein [Clostridium
           perfringens F262]
          Length = 1479

 Score =  285 bits (729), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 187/604 (30%), Positives = 312/604 (51%), Gaps = 70/604 (11%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
           L + ++ PA  +  +A+PIGNG +G M++G V SE ++ NE TLW+G PG +        
Sbjct: 48  LALWYDEPATKWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107

Query: 66  PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
             A +A+ ++R ++  G        +      + +G     YQ  GDI L+F  SH +  
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
              YRREL+++ + + VKY+   V + RE+F S PD V+V K+   ++ SL+ +V  +  
Sbjct: 163 VTNYRRELNIDESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
            +  +    NN +I+ G               +  G+++ +  +IK+ +  G+I   ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + VE ++   +++ A + +   +  P+   +DP S     + +  NL Y +L +RH++D
Sbjct: 268 -ISVENANEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           Y+ LF RV++ L     D  TD             E +  ++T++  SL  L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKSDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LISSSR G+  ANLQG+WN   +P W S  H N+N++MNYW +   NLSE   PL +++ 
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431

Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            L   G KTA+++          +GW ++   + +   +A   +  W   P   AW+  +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
           LWEHYN+T D+D+L +  YP+++  A F   +L+E    DG  YL ++PS SPE      
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEQ----- 545

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
                  +  +T D  +I ++F+  I A+E L  +E+   E   K    L+P +I + G 
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELEDKRERLLKP-QIGKHGQ 600

Query: 590 IMEW 593
           + EW
Sbjct: 601 VQEW 604


>gi|325261844|ref|ZP_08128582.1| fibronectin type III domain protein [Clostridium sp. D5]
 gi|324033298|gb|EGB94575.1| fibronectin type III domain protein [Clostridium sp. D5]
          Length = 805

 Score =  284 bits (726), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 196/597 (32%), Positives = 297/597 (49%), Gaps = 46/597 (7%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA-PKA-LSD 74
           +  PAK FT A+P+GNG LGAMV+GG P E + LN DTLW+G PG +      P+  +  
Sbjct: 10  YTHPAKDFTQALPLGNGHLGAMVYGGFPRERISLNLDTLWSGHPGHWHGKQKIPQGTMER 69

Query: 75  VRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           VRSL+D+G Y EA     K + G   + Y   G +EL+FD +   Y  E   R L L  A
Sbjct: 70  VRSLIDAGAYWEAQKQIQKHMLGCNNESYLSAGSLELQFD-TEADY--EGCERRLSLEEA 126

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
             R  + +   +   + F S     +  +I  +E   +S  +SL + L         + +
Sbjct: 127 ITRTDWELKGQKVREDVFVSAVQNGMYIRIF-TEGAPVSVAISLQTQLRVLQSAAEADGL 185

Query: 194 IMEGRCPG----KRIPPKA--NANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
           ++  + P       +P +     +++  G+ +   L I   D  G I   E+  + VE  
Sbjct: 186 LLVAQAPSHVEPNYVPSREPIQYDEEKPGMIYGLFLGINECD--GGIKRTEEG-ICVENF 242

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNL-SYSDLYTRHLDDYQKLFH 306
               + L   + ++G +  P + + +     +        L S+ + +  HL ++Q+L+ 
Sbjct: 243 TCLTMFLSGETEYEG-YGKPLNGQAESIIRYLRERGHRAKLKSWEENFRAHLREHQRLYL 301

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSS 365
           R            V +    E  +  P+ ER++  ++  EDP L  LLF +GRYL+++SS
Sbjct: 302 RT-----------VLELEGGEEEEQRPTDERLEMVRSGKEDPGLSALLFHYGRYLILASS 350

Query: 366 RPG---TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           RP     Q A LQGIW ED+   W S   VNIN +MNYW   P NL EC+ PL   +  L
Sbjct: 351 RPLDGLVQPATLQGIWCEDVRSVWSSNWTVNINTQMNYWICGPGNLPECEIPLIRMVKEL 410

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           S +  + A  N    G+V+HH  D+W +     G+V WA WPMGG WL THL+ HY YT 
Sbjct: 411 S-DAGREAAANLNCRGFVVHHNVDLWRQCIPALGEVKWAYWPMGGLWLTTHLYRHYLYTG 469

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSST 541
           D+++LEK  YP+ + C +F+LD+L   HDG   +T PSTSPE+ F     +      S T
Sbjct: 470 DKEYLEK-IYPVFQECTAFILDYLY--HDGSAYQTCPSTSPENTFYDEQERECAACVSPT 526

Query: 542 MDMAIIREVFSAIISAAEVL-----EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           MD+A+IREV   ++   E++     E  +     +VL  LP     +    G ++EW
Sbjct: 527 MDIALIREVLCNLLEIDEIIRGTRPESGQCREARRVLNELPAF---QTGSRGQLLEW 580


>gi|256394373|ref|YP_003115937.1| alpha-L-fucosidase [Catenulispora acidiphila DSM 44928]
 gi|256360599|gb|ACU74096.1| alpha-L-fucosidase, putative, afc95A [Catenulispora acidiphila DSM
           44928]
          Length = 742

 Score =  283 bits (725), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 199/598 (33%), Positives = 300/598 (50%), Gaps = 78/598 (13%)

Query: 17  FNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNPDA 68
           ++ PA  +  +A+PIGNGR+GAMV+GGV +E ++  E+TLWTG PG       D+  P  
Sbjct: 7   YDAPASDWEREALPIGNGRIGAMVFGGVAAERVQFTEETLWTGGPGHPGYDHGDWREP-R 65

Query: 69  PKALSDVRSLVDSGQYAEATAASVKLFGHPA---DVYQLLGDIELEFDDSHLKYAEETYR 125
           P AL +VR  +D    +  T    +L G P      +Q  GD+ +EF    L    + YR
Sbjct: 66  PGALEEVRRRIDE-HGSLPTQTVTELLGQPKTGFGAFQNYGDLIIEF--PGLSEEAQDYR 122

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLD 182
           R LD++ A A V +    V  TRE+F S+P  V++ +++  + G+L   +  +      D
Sbjct: 123 RTLDISDALAGVAFEADGVHHTREYFVSHPAGVLLGRLTADQPGALHCVLRYEPGTDATD 182

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                  +  +++ G  P               G++ +A   IK+  + G +   ED+ L
Sbjct: 183 ATRVTTEDATLVIIGALPDN-------------GLRHAA--RIKVIPEGGRLIEGEDR-L 226

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            +EG+D  V++L A++ +   +    +   DP      A+      +Y DL   H+ D+ 
Sbjct: 227 TIEGADRVVIILAAATDYADTYPAYRNGI-DPAGPVAEAVAKAAASTYDDLRAAHIADHS 285

Query: 303 KLFHRVSIQLSRS-PKDIVTDTC-SEENID-TVPSAERVKSFQTDEDPSLVELLFQFGRY 359
            LF RV + L  S P D+ TD   +    D + P+A+R          +L +L F  GRY
Sbjct: 286 ALFDRVVLDLGGSLPGDVPTDRLLTAYGTDASTPAADR----------ALEQLFFDHGRY 335

Query: 360 LLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           LLI+SSRP +Q+ ANLQG+WN   +P W    HVNINL+MNYW + PC L EC EPLF +
Sbjct: 336 LLIASSRPASQLPANLQGVWNASPTPPWAGDYHVNINLQMNYWLAEPCALGECAEPLFAY 395

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEH 477
           +  L   G  +A+  +   GWV+H++T  +  +   D     W  +P   AWLC HLWEH
Sbjct: 396 IEALRAPGRVSARTLFGTEGWVVHNETTPFGFTGVHDWPDAFW--FPEAAAWLCRHLWEH 453

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLAC 535
           Y +T+D +FL++RAYP+++  A F L  L  +  DG L  NPS SPE  E+ A       
Sbjct: 454 YAFTLDEEFLKERAYPVMKEAAQFWLANLRRDPRDGKLVANPSFSPEQGEYTA------- 506

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
               S M   IIR++F   +  A  +E  +  L              +I   G + EW
Sbjct: 507 ---GSAMAQQIIRDLFKNTVGLAAEVEDLDTGL--------------RIGSWGQLQEW 547


>gi|224537768|ref|ZP_03678307.1| hypothetical protein BACCELL_02651 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520588|gb|EEF89693.1| hypothetical protein BACCELL_02651 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 833

 Score =  283 bits (725), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 185/614 (30%), Positives = 308/614 (50%), Gaps = 69/614 (11%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + + +P+GNGRLG M  GGV  E + LNE +LW+G+  DY NPDA ++L 
Sbjct: 41  QLYYTAPATIWEETLPLGNGRLGMMPDGGVDREHIVLNEISLWSGMEADYGNPDASRSLP 100

Query: 74  DVRSLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFDDSHLKYAEET--- 123
            ++ L+  G+  EA       F       G     YQ+L D+ ++F   H +        
Sbjct: 101 AIQQLLFEGKNKEAQELMYSSFVPKKPESGGTYGNYQMLADLNIDFSFPHRRKTISENDA 160

Query: 124 -----YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
                YRR LDL  A A   ++   +++ RE+F+S    V++  ++ S   +LSF+  L 
Sbjct: 161 APVTDYRRWLDLRDAVAYTSFTKEGIDYQREYFTSRDKDVMIIHLTTSRRRALSFSAQLS 220

Query: 179 -------SLLDNHSYVNGNNQIIMEGRC----PGKRIPPKANANDDPKGIQFSAILEIKI 227
                  S+L       G   +++EG      PG+            +G+++   + +  
Sbjct: 221 RPKQGAVSMLPGIGKEEGT--LLLEGTLDSGKPGR------------EGMKYRVAMRLIS 266

Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM--SALQSI 285
              +  ISA  ++ + +     A L+L A++S+     + S ++     +S+  +A Q +
Sbjct: 267 KGGKQNISA--ERGITLTQGREAWLVLSATTSYAASGTDFSGNRYKEVCDSLLNAATQHV 324

Query: 286 RNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE 345
           +      +   H+  ++  + RVS+ L  +  D++            P+ ER+  F   E
Sbjct: 325 Q------IKESHIASHRTFYDRVSLTLPFTEDDVL------------PTNERITRFTERE 366

Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
            P+L  L + +GRYL ISS+RPG+   NLQG+W   +   W+   H NIN++MN+W    
Sbjct: 367 SPALAALYYNYGRYLFISSTRPGSLPPNLQGLWANGVETPWNGDYHTNINIQMNHWPLEQ 426

Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALW 463
             LSE  +PL   +  L  +G +TA+  Y   A GWV+H  T+IW   +A      W   
Sbjct: 427 AGLSELYQPLTALVERLIPSGEETARTFYGTHAQGWVLHMMTNIW-NYTAPGEHPSWGAT 485

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSP 522
             GGAWLC HLWEHY YT D +FL KR YP+L+G + F    ++ E   G+L T P++SP
Sbjct: 486 NTGGAWLCAHLWEHYQYTQDIEFL-KRIYPVLKGASEFFYSTMVREPKHGWLVTAPTSSP 544

Query: 523 EHE-FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
           E+  F+  D     V    TMD+ ++ E+++ +I A  +LE + D    K+ ++L +  P
Sbjct: 545 ENAFFVGNDPTPVSVCMGPTMDVQLLTELYTNVIEATSILECDAD-YAAKLREALDKFPP 603

Query: 582 TKIAEDGSIMEWVQ 595
            +I++ G + EW++
Sbjct: 604 MQISKGGYLQEWLE 617


>gi|423227144|ref|ZP_17213608.1| hypothetical protein HMPREF1062_05794 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392624284|gb|EIY18376.1| hypothetical protein HMPREF1062_05794 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 825

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 193/621 (31%), Positives = 310/621 (49%), Gaps = 79/621 (12%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ +  PA  + + +P+GNGRLG M  GG+  E + LNE +LW+G+  DY NPDA ++L 
Sbjct: 29  QLYYTAPATIWEETLPLGNGRLGMMPDGGIDKEHIVLNEISLWSGMEADYGNPDASRSLP 88

Query: 74  DVRSLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFD-DSHLKYAEE--- 122
            ++ L+  G+  EA       F       G     YQ+L D+ L F      K+A +   
Sbjct: 89  AIQQLLFEGKNKEAQELMYNSFVPKKPESGGTYGNYQMLADLTLNFSIPVKKKFASDEVV 148

Query: 123 ---TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
               YRR LDL  A A   ++ G +++ RE+++S    V++  ++ S   SL F  SL  
Sbjct: 149 PVTNYRRWLDLRDAVAYTTFTKGGIDYQREYYTSRDKDVMIIHLTVSRRRSLFFTASLSR 208

Query: 180 LLDNH-SYVNGNNQ----IIMEGRC----PGKRIPPKANANDDPKGIQFSAILEIKISDD 230
                 S V G+ +    +++EG      PG+             G+++   + +     
Sbjct: 209 PQQGTVSLVPGSGKEAGTLLLEGALDSGKPGQ------------DGMKYRVAMRVVSKGG 256

Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFIN-PSDSKKD----------PTSESM 279
           +  ISA ED  +  +G++ A L++ A++S+     + P    K+          P S  +
Sbjct: 257 KQFISA-EDGIMLTQGTE-AWLIISATTSYAAAGTDFPGSRYKEVCDSLLNAATPPSSQL 314

Query: 280 SALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
           S L S + N S+ +LY R                       V+ T      D +P+ ER+
Sbjct: 315 SILNSPLTNASHRELYDR-----------------------VSLTLPATEDDALPTNERI 351

Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
             F   E P+L  L + +GRYLLISS+RPG+   NLQG+W   +   W+   H NIN++M
Sbjct: 352 VRFAERESPALAALYYNYGRYLLISSTRPGSLPPNLQGLWANGVQTPWNGDYHTNINIQM 411

Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRG 456
           N+W      LSE  +PL   +  L  +G  TA+  Y   A GWV+H  T++W   +A   
Sbjct: 412 NHWPLEQAGLSELYQPLTGLVERLVPSGKGTARTFYGNHAQGWVLHMMTNVW-NYTAPGE 470

Query: 457 KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLE 515
              W     GGAWLC HLWEHY YT D ++L K+ YP+L+G + F    ++ E   G+L 
Sbjct: 471 HPSWGATNTGGAWLCAHLWEHYQYTQDIEYL-KKIYPILKGASEFFYSTMVREPKHGWLV 529

Query: 516 TNPSTSPEHE-FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK 574
           T P++SPE+  F+  D     V    TMD+ ++ E+++ +I AA +LE ++D    K+ +
Sbjct: 530 TAPTSSPENAFFVGDDPTPVSVCMGPTMDVQLLTELYTNVIEAASILECDDD-YAAKLRE 588

Query: 575 SLPRLRPTKIAEDGSIMEWVQ 595
           +L +  P +I++ G + EW++
Sbjct: 589 ALGKFPPMQISKGGYLQEWLE 609


>gi|227536429|ref|ZP_03966478.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227243805|gb|EEI93820.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 798

 Score =  283 bits (723), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 192/610 (31%), Positives = 310/610 (50%), Gaps = 62/610 (10%)

Query: 7   TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
            + +  L++ ++ PAK + + +P+GNG +G M  GGV  E + LNE ++W+G   D  N 
Sbjct: 42  VAQSGSLRLWYDKPAKRWEETLPLGNGLIGMMPDGGVQKEKIVLNEISMWSGSEEDPNNY 101

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLF-------GH------PADVYQLLGDIELEFD 113
            A K++ +++ L+  G+  EA     K F       GH      P   YQ LG + L+F 
Sbjct: 102 AAYKSVGEIQKLLVEGKNDEAEQLVNKNFVTSGKGSGHGNGANVPFGCYQNLGFLNLQFK 161

Query: 114 DSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
           ++    A+ T Y R LDL  A AR  +++  V++TRE+F+S    V V ++  S+ G+L+
Sbjct: 162 EA----AQSTDYERSLDLKDAVARTNFTINGVKYTREYFTSFDQNVGVVRLKSSKKGALN 217

Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
           F+ SL S  +   Y +  N+  M G      I P     D   GI FS+  +IK+    G
Sbjct: 218 FSASL-SREEGVQYSSKGNEFSMSG------ILPDGKGGD---GISFSS--KIKVFHRGG 265

Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
            + A  D  L V  +   ++   A++S+            DP       L+   +  Y  
Sbjct: 266 KVVA-SDTALTVSKASEVLIFFAAATSY---------FHADPLQYVDEQLKQANDTPYPQ 315

Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLV 350
           L+ +HL  Y+ +F+RV +QL         D   +  I T    +R+++F  +  +D  L 
Sbjct: 316 LFKQHLSRYESVFNRVDLQLE--------DDADKSGITT---DKRLRAFYDNPAQDNGLA 364

Query: 351 ELLFQFGRYLLISSSRPGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
            L +QFGRYL ISS+ P  + A   NLQG+W   +   W+   H+NIN +MN+W     N
Sbjct: 365 ALYYQFGRYLNISSTAPDVKGALPPNLQGLWAHQIQTPWNGDYHLNINAQMNHWGVEVNN 424

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           LSE   P  + +  ++  G KTA+  Y A GWV++  T++W  S+    +  W      G
Sbjct: 425 LSEYHIPFIELIKKIAKTGEKTARAYYNAPGWVVYMMTNVWGYSAPGE-QASWGASTASG 483

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEF 526
            WLC HLWEHY +T D  +L K  YP+++G A F    ++ +   G+L T+PS SPE+ F
Sbjct: 484 -WLCNHLWEHYQFTKDSVYL-KEVYPVMQGAARFYAHTMVTDPKTGWLVTSPSVSPENAF 541

Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PTKIA 585
              +GK A V     +D  I+RE++  +I A  +L ++ +A  + +   + +L  P  I+
Sbjct: 542 RMKNGKTAAVVMGPAIDNQIVRELYRNLIDADSILGQH-NAFTDTLRIQIQQLAPPVLIS 600

Query: 586 EDGSIMEWVQ 595
           + G + EW++
Sbjct: 601 KSGRVQEWLE 610


>gi|192360052|ref|YP_001983169.1| alpha-L-fucosidase [Cellvibrio japonicus Ueda107]
 gi|190686217|gb|ACE83895.1| alpha-L-fucosidase, putative, afc95A [Cellvibrio japonicus Ueda107]
          Length = 782

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 199/610 (32%), Positives = 309/610 (50%), Gaps = 60/610 (9%)

Query: 2   MNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           M+AE +  + PL I F+ PA  +  + +PIGNG +GA++ GGV  + ++ NE TLWTG P
Sbjct: 1   MSAEVSRESVPLAIAFDRPATDWEREGLPIGNGAMGAVISGGVEQDIIQFNEKTLWTGGP 60

Query: 61  G-----DYTNPDAPKA--LSDVR-SLVDSGQYAEATAASV---KLFGHPADVYQLLGDIE 109
           G     D+  P   +A  L+ VR S+   G  +   AA +   K+ G+    YQ  GD+ 
Sbjct: 61  GSVRGYDFGIPAESQASALAKVRDSIRKDGSISPEKAAELMGRKILGYGD--YQTFGDLI 118

Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
           L F ++     +  Y R L L+     + Y    V +TRE+F+S PD VIV ++S  + G
Sbjct: 119 LSFPENDSGVIK--YNRRLSLDEGRVILGYQQEGVTYTREYFASYPDGVIVVRLSADKPG 176

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
            +   V L +          N Q+    R  G ++       D+  G  F+A   I +  
Sbjct: 177 QIHLRVGLRT--------PDNRQVTT--RIEGNQLDIVGELQDNKLG--FAA--RIAVVA 222

Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMS-ALQSIRNL 288
           + G +     + L+V+ +D   ++  A++++   + +   +      + +S  L +    
Sbjct: 223 EGGNLDNSGQQSLQVKRADAVTIVFAAATNYAQRYPHYRQADASYAQQKISNTLAAALQK 282

Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
           +Y+ L  RH  DYQ L+ RV++ + +    + T     +           K+     D S
Sbjct: 283 NYAQLLARHTQDYQSLYKRVALDIGQGVHSLATPALLAQ----------YKTGNAALDRS 332

Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
           L  + FQFGRYLLI+SSRPG+  ANLQG+WN  ++P W++  HVNINL+MNYW +   NL
Sbjct: 333 LEAIYFQFGRYLLIASSRPGSLPANLQGVWNNSITPPWNADYHVNINLQMNYWLAETANL 392

Query: 409 SECQEPLFDFLTYLSINGSKTAQ-VNYLASGWVIHHKTDIWAKSSADRGKVVW--ALW-P 464
            E  +P FDF+  L   G+ +AQ +  ++ GW +   T+IW  +    G + W  A W P
Sbjct: 393 PELMQPYFDFVDSLVEPGNISAQRIADVSKGWALFLNTNIWGFT----GVIDWPTAFWQP 448

Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPE 523
             GAWL  H +EH+ ++ D+ FL  RAYPL++G A F LD+L++   DG     PS SPE
Sbjct: 449 EAGAWLAQHYYEHFLFSGDQAFLRNRAYPLMKGAAEFWLDFLVKDPRDGLWVVTPSFSPE 508

Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
           H    P    A +S     D+  +R    A   AA V +K    LV++ LK++   R  +
Sbjct: 509 H---GPFTTGAAMSQQIVFDL--LRNTSEA---AALVGDKKFKRLVDQTLKNMD--RGIR 558

Query: 584 IAEDGSIMEW 593
           I   G + EW
Sbjct: 559 IGSWGQLQEW 568


>gi|325103050|ref|YP_004272704.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
 gi|324971898|gb|ADY50882.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
          Length = 938

 Score =  282 bits (721), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 178/498 (35%), Positives = 267/498 (53%), Gaps = 55/498 (11%)

Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
           YQ  GDI L F   H +Y    Y+RELDLN+A A+  YS     +TR +F + P   +V 
Sbjct: 294 YQPFGDIYLNF--KHQEYT--NYKRELDLNSALAKTSYSHKGTNYTRTYFVNAPQNTLVI 349

Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
            +  ++  +++F  S DS     S                ++I  +  A D    +++ A
Sbjct: 350 HLEANQPKNVTFTASFDSPHSQKSI---------------RKIDDRTIALDVK--VKYGA 392

Query: 222 ILE---IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSES 278
           +     + + +  G IS +++ +L VEG+D A L+L A+++F    +N  D    P+ ++
Sbjct: 393 LFGESILHLKNKNGKIS-VKNNQLVVEGADEATLMLFAATNF----VNFHDVSGKPSVKN 447

Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
              L S +NL Y  L   HL DY  L++R S+    + ++             +P+ ER+
Sbjct: 448 QQTLASAKNLDYQTLKQNHLQDYTSLYNRFSLSFGGNSRE------------DLPTDERI 495

Query: 339 KSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLE 397
           + F +T  DP+L+ L  Q+GRYLLISSSR  TQ ANLQGIWN  L+P+W S    NIN+E
Sbjct: 496 REFSKTANDPALLALYAQYGRYLLISSSRANTQPANLQGIWNHLLAPSWGSKYTTNINVE 555

Query: 398 MNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK 457
           MNYW S   NLS+  +PLF  +  LS +G++TA+  Y   GWV+HH TDIW + +A    
Sbjct: 556 MNYWLSEMLNLSDLHQPLFGMIEDLSKSGAETAKNYYNLPGWVLHHNTDIW-RGAAPINN 614

Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLET 516
               +WP GGAWL THL EHY +T D+ FL K+ YP+++    F  D+L ++   G L +
Sbjct: 615 SNHGIWPTGGAWLTTHLLEHYAFTKDQAFL-KKYYPIIKNSVLFYKDFLVVDPISGCLIS 673

Query: 517 NPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
            PS SPEH      G L       TMD  IIR +F   ++ +  L  +ED L +++    
Sbjct: 674 TPSNSPEH------GGLVA---GPTMDHQIIRALFDGFVNVSAALGLDED-LRKEIQTKK 723

Query: 577 PRLRPTKIAEDGSIMEWV 594
            ++ P KI + G + EW+
Sbjct: 724 QQILPNKIGKYGQLQEWM 741



 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 37/79 (46%), Positives = 57/79 (72%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PAK +T+A+PIGNG++GAM++GGV  + ++ NE+TLWTG P +Y  PDA K L  +R
Sbjct: 32  YKQPAKEWTEALPIGNGKIGAMIFGGVAQDRIQFNEETLWTGSPRNYNKPDAYKYLPQIR 91

Query: 77  SLVDSGQYAEATAASVKLF 95
           +L+  G+  EA A +++ F
Sbjct: 92  TLLQQGKQREAEALAMQEF 110


>gi|254785612|ref|YP_003073041.1| alpha-L-fucosidase [Teredinibacter turnerae T7901]
 gi|237683920|gb|ACR11184.1| alpha-L-fucosidase [Teredinibacter turnerae T7901]
          Length = 814

 Score =  281 bits (720), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 194/605 (32%), Positives = 308/605 (50%), Gaps = 71/605 (11%)

Query: 15  ITFNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG------DYTNPD 67
           + F  PA  + +  +PIGNG +GA++ G +  E ++ NE +LW G PG          P+
Sbjct: 44  LLFFSPASDWENQGLPIGNGAMGAVITGEINKELVQFNEKSLWEGGPGAQGYNFGLAAPN 103

Query: 68  APKALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAE-ETY 124
            P  L  V+  +  G    A   + +L   P +   YQ  GD+ +E    HL   E + Y
Sbjct: 104 FPAKLKAVQQQLAKGAVLSAETVATQLGQDPTEYGNYQTFGDLIIE----HLHSTEVQDY 159

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
           RR L++  A A V+Y++  V + RE+F+S PD+VIV +I+  + G+L+ NV L +  +  
Sbjct: 160 RRNLNIENALASVEYTITGVGYRREYFASFPDKVIVLQIASDKPGALNLNVGLHTSDNRS 219

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
             +N              R+      N++  G++++A++E++     GT++   DK L++
Sbjct: 220 QLLNATTH----------RMSLSGALNNN--GLRYAAMVEVRTQS--GTVARTSDK-LQI 264

Query: 245 EGSDWAVLLLVASSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
             +D   L+L  ++ +    P    +     P +   + L S+    Y  L +RH+ DY+
Sbjct: 265 RSADKVTLVLATATDYAPVYPTYRVASGAPSPLAVVETRLNSLTKKGYPLLKSRHITDYR 324

Query: 303 KLFHRVSIQLS--RSPKDIVTDTCSEENIDTVPSAERVKSFQTD---EDPSLVELLFQFG 357
            LF RV++ L+   SP  +          DT P   R++++  D      +L  L F +G
Sbjct: 325 SLFQRVTLNLTPNSSPNSVA---------DTKPLPARLEAYHKDTPENKRALETLYFNYG 375

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI+SSR G+  ANLQG+WN   +P W++  HVNINL+MNYW +L  NLSE   PL+D
Sbjct: 376 RYLLIASSRAGSLPANLQGVWNHSNTPPWNADYHVNINLQMNYWPALVTNLSETTPPLYD 435

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW--ALW-PMGGAWLCTHL 474
           F+  L   G K+AQ     +GW +   T+I+  S    G + W  A W P   AWL    
Sbjct: 436 FVDALRAPGEKSAQTLGADAGWAVLLNTNIFGFS----GLISWPTAFWQPEANAWLMRLY 491

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
           ++ Y +T D+ FL +RAYP ++  + F + +L +  DG    NPS SPEH          
Sbjct: 492 FDFYQFTGDKKFLRERAYPAMKSTSQFWMTFLTQ-RDGTYWVNPSYSPEH---------G 541

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT----KIAEDGSI 590
             S  ++M   I+ E+F    +AAE+L+  + A   + LK  P L+ T    +I + G +
Sbjct: 542 PFSEGASMSQQIVSELFRNTHAAAEMLKDRQFA---RSLK--PFLQNTDDGLRIGKWGQL 596

Query: 591 MEWVQ 595
            EW Q
Sbjct: 597 QEWQQ 601


>gi|189464509|ref|ZP_03013294.1| hypothetical protein BACINT_00851 [Bacteroides intestinalis DSM
           17393]
 gi|189438299|gb|EDV07284.1| hypothetical protein BACINT_00851 [Bacteroides intestinalis DSM
           17393]
          Length = 817

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 198/598 (33%), Positives = 297/598 (49%), Gaps = 72/598 (12%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
           ++PIGNG LGA + G V +E + LNE TLW G P      DY    N  +   L ++R  
Sbjct: 64  SLPIGNGSLGANILGSVAAERITLNEKTLWRGGPNTSGGADYYWNVNKQSAPILKEIRQA 123

Query: 79  VDSGQYAEATAASVKLFG----------HPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
              G   +A   + K F           HP     +  +G++ +E D S L+   + YRR
Sbjct: 124 FTEGNGEKAAQLTRKNFNGLAAYEEKDEHPFRFGSFTTMGELYIETDLSELRM--KNYRR 181

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            L L++A A V++    V++ R++F S PD V+  + S  ++G  +  +S     +  S 
Sbjct: 182 ILSLDSAMAVVQFDKEGVQYRRKYFISYPDSVMAMEFSADKAGKQNLVLSYAPNPEAQSN 241

Query: 187 V--NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
           +  +G + ++  G               +  G++F+    IK     GT+ A  D+ L V
Sbjct: 242 IRTDGTDGLVYTGVL-------------NNNGMKFA--FRIKAIAKGGTVIAQNDR-LIV 285

Query: 245 EGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLD 299
           +G+D  V LL A +    +F+  F NP      DP   + S +       Y  L   H  
Sbjct: 286 KGADRVVFLLTADTDYKMNFNPDFKNPKTYVGDDPELTTQSMMNQALLKGYETLANNHKA 345

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
           DY  LF+RV + L+  P    +D         +P+ +R+ +++  + D  L EL +QFGR
Sbjct: 346 DYTALFNRVKLTLN--PDVTGSD---------LPTYQRLANYRKGQPDFRLEELYYQFGR 394

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI+SSRPG   ANLQG+W+ +L   W    H NIN++MNYW + P NLSEC  PL DF
Sbjct: 395 YLLIASSRPGNLPANLQGMWHNNLDGPWRVDYHNNINIQMNYWPAGPTNLSECTWPLIDF 454

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV-WALWPMGGAWLCTHLWEH 477
           +  L   G KTAQ  + A GW      +I+  +S    +++ W   PM G WL TH+WE+
Sbjct: 455 IRGLVKPGEKTAQAYFAARGWTASISANIFGFTSPLSSEIMAWNFNPMAGPWLATHIWEY 514

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y+YT DR+FL++  Y L++  A F +D+L    DG     PSTSPEH           V 
Sbjct: 515 YDYTRDRNFLKEVGYDLIKSSAQFTVDYLWHKPDGTYTAAPSTSPEH---------GPVD 565

Query: 538 YSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             +T   A++RE+    I A++VL  +  E    ++VL     L P KI   G ++EW
Sbjct: 566 EGATFVHAVVREILLDAIEASKVLGVDSRERKHWQEVLA---HLVPYKIGRYGQLLEW 620


>gi|383638758|ref|ZP_09951164.1| alpha-L-fucosidase [Streptomyces chartreusis NRRL 12338]
          Length = 740

 Score =  280 bits (717), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 189/583 (32%), Positives = 286/583 (49%), Gaps = 61/583 (10%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG--------DYTNPDAPKALSDVRSL 78
           A+P+GNG LGAMV+G + SE ++ NE TLWTG PG        D+  P  P A+  V+  
Sbjct: 15  ALPVGNGALGAMVFGSIASERVQFNEKTLWTGGPGSVQGYDHGDWREPR-PTAIDAVQDD 73

Query: 79  VDSGQYAEATAASVKLFGHPA---DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
           +D+ +       + +L G P      YQ  GD+ L+F  +      E YRREL L+T  A
Sbjct: 74  LDTRRRLAPEDVAGRL-GQPRVGFGAYQTFGDLYLDFPGTP---TPEAYRRELALDTGVA 129

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
            V Y+       RE F+S PD VIV +I       ++F +   S   + +      ++ +
Sbjct: 130 SVAYTHRQTRHRREFFASFPDGVIVGRIGADRPAGITFTLRYTSPRGDFTTTATGGRLTV 189

Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
            G         K N      G++F A  ++++  D G +++  D  + V G+D A  +L 
Sbjct: 190 RGAL-------KDN------GLRFEA--QVQVRSDGGAVTSGADGTITVTGADSAWFVLA 234

Query: 256 ASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
           A + +     +P     DP      A+    +  Y  L  RH+ D++ LF RV++ + +S
Sbjct: 235 AGTDYAD--THPDYRGADPHPAVTRAVDRASSRGYDSLRARHIADHRTLFARVTLDIGQS 292

Query: 316 -PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANL 374
            P ++ TD           +A+R          +L  L FQ+GRYLLI+SSR G+  ANL
Sbjct: 293 APAEVPTDRLLASYTGGTSAADR----------ALEALFFQYGRYLLIASSRAGSLPANL 342

Query: 375 QGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY 434
           QG+WN   SP W +  HVNINL+MNYW +   NL E   P   F+  L   G  TA+  +
Sbjct: 343 QGVWNHSTSPPWSADYHVNINLQMNYWLAEAANLPETTVPYDRFVQALRAPGRHTARQMF 402

Query: 435 LASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
            + GWV+H++T+ +  +   D     W  +P   AWL   L+EHY +    D+L   AYP
Sbjct: 403 GSRGWVVHNETNPYGFTGVHDWATAFW--FPEAAAWLTQQLYEHYRFGGSTDYLRTTAYP 460

Query: 494 LLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVF 551
           +++  A F LD L  +  DG L   PS SPEH +F A           + M   I+ ++F
Sbjct: 461 VMKEAAEFWLDNLRTDPRDGRLVVTPSYSPEHGDFTA----------GAAMSQQIVHDLF 510

Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
           +  + AA VL  + D   ++V ++L  L P  +I   G + EW
Sbjct: 511 TNTLEAARVLGDSRD-FRQRVEQALAHLDPGLRIGSWGQLQEW 552


>gi|345881344|ref|ZP_08832866.1| hypothetical protein HMPREF9431_01530 [Prevotella oulorum F0390]
 gi|343920009|gb|EGV30749.1| hypothetical protein HMPREF9431_01530 [Prevotella oulorum F0390]
          Length = 834

 Score =  280 bits (716), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 189/598 (31%), Positives = 296/598 (49%), Gaps = 71/598 (11%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT--------NPDAPKALSDVRSL 78
           ++PIGNG LGA + G + +E + LNE +LW G PG  +        N  A   L  +R+ 
Sbjct: 82  SLPIGNGSLGANILGSIAAERITLNEKSLWRGGPGVSSDASYYWNVNKHAAPVLKAIRAA 141

Query: 79  VDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETYRR 126
             +G  A+A + + K F   A              +  +G++ +E   +  ++++  YRR
Sbjct: 142 FLAGDKAKADSLTRKNFNGLAAYESYAEKPFRFGNFTTMGELTIETGLNDAQFSD--YRR 199

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNH 184
           EL L++A   V++    V + R  F S PD V+V +   +  G  +L F+ + + +    
Sbjct: 200 ELSLDSARTLVQFVHDGVRYARTAFISYPDNVMVLRFKANAKGMQNLCFHYAPNPVSTGK 259

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
              +G N ++  G               D  G+Q+  ++ I+     GT+     + L +
Sbjct: 260 MQADGANGLVYRGAL-------------DSNGMQY--VVRIQAVTHSGTLEN-SGQTLTI 303

Query: 245 EGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRHLD 299
           +G+D  V L+ A +    +FD  F NP       P   +   +Q      Y+ L+ RH  
Sbjct: 304 KGADEVVFLITADTDYRINFDPDFHNPKTYVGVQPEVTTEKWMQQAAERGYAQLFQRHFK 363

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
           DY  LF RV +QL+           ++ N   VP+A+R+ +++    D  L EL +QFGR
Sbjct: 364 DYSPLFQRVKLQLN----------AAQTNDKDVPTAQRLAAYRNGATDNYLEELYYQFGR 413

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI+SSRPG   ANLQG+W+ ++   W    H NIN++MNYW     NL+EC  PL DF
Sbjct: 414 YLLIASSRPGNLPANLQGLWHNNVDGPWRVDYHNNINVQMNYWPVHTTNLNECALPLVDF 473

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
           +  L   G+ TA+  Y A GW     ++I+  ++    + + W L PMGG WL THLWE+
Sbjct: 474 VRTLVKPGAVTAKAYYGARGWTTSVSSNIFGFTAPLASEDMSWNLCPMGGPWLATHLWEY 533

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y++T D+ FL    Y +++  A+F +D+L    DG     PSTSPEH           + 
Sbjct: 534 YDFTRDKRFLRSTLYDIIKQSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPID 584

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALV--EKVLKSLPRLRPTKIAEDGSIMEW 593
              T   A+IRE+    I+A++VL+ +E A    + VL  LP   P +I   G + EW
Sbjct: 585 EGVTFVHAVIREILLDAIAASKVLQVDETARKQWQMVLLHLP---PYRIGRYGQLQEW 639


>gi|429852446|gb|ELA27582.1| alpha-l-fucosidase [Colletotrichum gloeosporioides Nara gc5]
          Length = 796

 Score =  280 bits (716), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 179/580 (30%), Positives = 280/580 (48%), Gaps = 35/580 (6%)

Query: 22  KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDS 81
           + F +A+PIGNGRLGAM+ G    E ++LNE+++W G P D     A  AL  +R  +  
Sbjct: 37  RDFYEALPIGNGRLGAMIHGYTDKELIRLNEESIWNGGPRDKIPTTALDALEPLREQILD 96

Query: 82  GQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVK 138
           G+  EA    V  F    D    YQ  G++ L+F+  H       YR  LD++   + + 
Sbjct: 97  GRLTEADQNWVANFTPEYDDMRRYQPAGELRLDFN--HTLNETSGYRHSLDVSKGLSSLS 154

Query: 139 YSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGR 198
           Y  G VE+TRE F + P  V+  + S + SGSLS + SL             N   +   
Sbjct: 155 YVFGGVEYTREAFGNAPKNVLAFRFSCNSSGSLSLDASLS---------RDRNVTELTAD 205

Query: 199 CPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
             G+ +       +D    +F +  ++ + D  G I +     L +  +    ++  A +
Sbjct: 206 AAGRILKLDGTGEEDDT-YRFVSQAQVLLPDGVGDIIS-NGTALHIRNATDVFIIYTAET 263

Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
           +F     +P  +     +     L++ +   Y  +    + DY++ + R SI    S   
Sbjct: 264 AFR----HPDATMAQLETIVNGRLETAQEAGYETIQREAVKDYKQYYDRTSIDFGTS--- 316

Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIW 378
              +  S++ I  +   +R  +  TD  P L+ L F  G+YLLI SSRPG+  ANLQGIW
Sbjct: 317 --QEIGSKDTIARLEDWKRGSNITTD--PELMALQFNVGKYLLIQSSRPGSLPANLQGIW 372

Query: 379 NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASG 438
           N D  P WDS   +N+NLEMNYW + P NL E   P+ DFL  L++ GS+ A+  Y A G
Sbjct: 373 NRDFGPPWDSKFTINVNLEMNYWPAQPLNLPEIAGPVVDFLDRLAVTGSEVAKGMYGADG 432

Query: 439 WVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
           W  HH TDI    +      + A +P+GGAWL     E++ +T D  +   R  P+L+G 
Sbjct: 433 WCCHHNTDITGDCTPFHAITIAAPYPLGGAWLAFEAIEYFRFTGDTTYARDRILPILKGA 492

Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMDMAIIREVFSA 553
             F+  W  E  DG+  TNPS SPE+ +  P+     G+   +   +  D AI+ E+ S 
Sbjct: 493 MDFIYSWATE-RDGWRITNPSCSPENSYYIPENMTVAGETTGIDAGAMNDRAIMWEIMSG 551

Query: 554 IISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            +  +E L  +E A   +  +   +++P      G ++E+
Sbjct: 552 FLEISEALSSDEGADRARSFRD--KIQPPVAGSFGQLLEY 589


>gi|336427807|ref|ZP_08607799.1| hypothetical protein HMPREF0994_03805 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336008767|gb|EGN38776.1| hypothetical protein HMPREF0994_03805 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 784

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 187/603 (31%), Positives = 289/603 (47%), Gaps = 95/603 (15%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYA 85
           DA P+GNG LGAMV+G    + ++LNED+LW G   D  NP+A + L +V+ L+   ++ 
Sbjct: 37  DATPMGNGFLGAMVYGHTARDRIQLNEDSLWHGKFRDRINPNAKEHLKEVQELILDRKFE 96

Query: 86  EATAASVKLFGH----PADV--YQLLGDIELEFDDS---HLKYAEET----YRRELDLNT 132
           EA      +F H    P ++  +  LG++ L  + +    + +  E+    Y  +L++  
Sbjct: 97  EAEEL---MFSHMVSAPGNMRNFSPLGELNLALNTALPFQMGWLPESDGENYVSDLNMEE 153

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
               + +    V++TRE F SNPD+V+  ++   +  +    + LD LL+   + +   Q
Sbjct: 154 GILCISHEDKGVQYTREMFVSNPDRVLCIRLKSDKEKA----IRLDMLLNRVPFTD---Q 206

Query: 193 IIMEGRCPGKRIPPKA-----------------NANDDPKGIQFSAILEIKISDDRGTIS 235
            + + R PGK +                         D  G +F+  L + ++D R    
Sbjct: 207 RLPDDRRPGKFVSAGVWPVTRCERIYTENGHTLMMEGDENGTRFACGLTV-VTDGR---- 261

Query: 236 ALED--KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
            +ED   KL    +   V+ L ASS          + ++D      S+L + R   Y+D+
Sbjct: 262 -IEDCYAKLVAHEAGEVVIYLAASSD---------NREEDFVGNVKSSLAAARAKGYADI 311

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
            T H+ D+     R ++ L                    P  E+   +            
Sbjct: 312 RTDHIADFTSYMKRCTLAL--------------------PEDEKAGMY------------ 339

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           FQ+ RY+++S+ R G    NLQGIWN +  P+W+S    NINL+MNYW +  CNLS   E
Sbjct: 340 FQYARYMMVSAGREGATAMNLQGIWNHEFCPSWESKYTTNINLQMNYWPAEICNLSTLHE 399

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PLFD +  +   G   A+  Y   G + HH TDI+            A W MGGAW+  H
Sbjct: 400 PLFDLIHTVQERGRDVAKRMYGCRGTMCHHNTDIYGDCGTQDMYAAAAFWQMGGAWMAMH 459

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           LWEHY +T+D DFL K  YP++E  A F +D+LI+  +GYL T PS SPE+ F+  DG  
Sbjct: 460 LWEHYLFTLDEDFLRKE-YPVMEEFALFFVDFLIKDKEGYLVTCPSVSPENRFVLEDGSD 518

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
             +    TMD  IIR + SA + AA++L  E    A  E++++    LRP +I   G + 
Sbjct: 519 TPICAGPTMDNQIIRGLMSACLEAAKILGIESPYKADFERIIRE---LRPNQIDSIGRLK 575

Query: 592 EWV 594
           EW 
Sbjct: 576 EWA 578


>gi|374384834|ref|ZP_09642351.1| hypothetical protein HMPREF9449_00737 [Odoribacter laneus YIT
           12061]
 gi|373227638|gb|EHP49951.1| hypothetical protein HMPREF9449_00737 [Odoribacter laneus YIT
           12061]
          Length = 780

 Score =  279 bits (713), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 180/585 (30%), Positives = 283/585 (48%), Gaps = 59/585 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG---DYTNPDAP---KALSDVRSLV 79
           +A+P+GNG +GAM +GG   + ++L E++ W G PG    Y   +     K L +VR L+
Sbjct: 36  EALPVGNGYMGAMWFGGPVRDEIQLAEESFWAGGPGASKSYKGGNKEGSWKYLKEVRELL 95

Query: 80  DSGQYAEATAASVKLFGH---PADVYQLLGDIELEFDDSHLKYAEET-------YRRELD 129
           +SG+  +A   + + F     P +     GD         L    E        YRR LD
Sbjct: 96  ESGEKEKAAELAGRYFVGEITPTEAGDQFGDFGGNQPFGSLGVTVEAADTSWTDYRRSLD 155

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L  A  +V+Y +G   F   +F+S P ++ V K + +  G   + V+ ++          
Sbjct: 156 LERAMGKVEYDMGGTHFRNTYFASYPARMFVFKYTNNAPGGKDYRVTFETPHQGTKITVR 215

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
            +  I++G+     +P +                 IK+  D G I   +    ++EG+  
Sbjct: 216 KDLWIIQGKLASNGLPFEGR---------------IKVKTD-GKIR-FQKGVFRIEGAKN 258

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
               +  +S++   +  P     D    +  A++     ++ DL   H  DY+ LF RV 
Sbjct: 259 TEFYVSIASAYANTY--PLYRGNDYEEVNRKAIERAERGTWEDLQAEHETDYRSLFERVK 316

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPG 368
           ++L  S             ++ +P+ +R   +     DP L  L FQ+GRYLLISSSRPG
Sbjct: 317 LELGHS------------GLEKLPTDKRQLRYSLGAYDPGLEALYFQYGRYLLISSSRPG 364

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           T  A+LQG WN  L+  W    H+NINL+M YW +   NLSEC  PL +++  L   G  
Sbjct: 365 TLPAHLQGRWNHQLNAPWACDYHMNINLQMIYWPAEVANLSECHLPLLEYIDKLREPGRV 424

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           TA+  + A GWV+H   + +   +A      W   P   AWLC HLWEH+NYT DR+FL 
Sbjct: 425 TAREYFNARGWVVHTMNNAFG-YTAPGWDFYWGYAPNSAAWLCAHLWEHFNYTRDREFLG 483

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
           ++AYP+++  A F +D+L+   DG+L ++PS SPEH  IA           +TMD  I  
Sbjct: 484 RKAYPIMKEVARFWMDYLVADEDGFLVSSPSYSPEHGDIA---------IGATMDQEIAW 534

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           ++F+ ++ A + + K + A  + V     RL P +I + G + EW
Sbjct: 535 DLFTNVLQAMDYV-KEDPAFADSVSDFRKRLLPLRIGKFGQLQEW 578


>gi|358368086|dbj|GAA84703.1| alpha-L-fucosidase 2 precursor [Aspergillus kawachii IFO 4308]
          Length = 790

 Score =  279 bits (713), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 193/599 (32%), Positives = 290/599 (48%), Gaps = 62/599 (10%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +N PA +FT  +PIGNGRLGA +WG   +E + LNE+++W G   +  NP +  AL  VR
Sbjct: 27  YNTPANNFTSTLPIGNGRLGAAIWG-TATENVTLNENSIWNGPFINRVNPRSYDALWPVR 85

Query: 77  SLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           SL+  G   E    ++  + G P     +  LG + L+F   H +     Y R LDL T 
Sbjct: 86  SLLAQGNMTEGNDVTLANMVGIPDSPQSFSALGSLVLDF--GHDQAGISNYTRYLDLRTG 143

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNN 191
            A V+Y+   V + RE+ +S PD V+  ++S S+ G L+   SL  D  + ++     ++
Sbjct: 144 VAVVEYTYREVHYRREYVASYPDGVVAVRLSSSQPGRLNVASSLARDRYVVSNQAAVSSD 203

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
             ++  R   K I        DP  IQF+    I +SD R T +               V
Sbjct: 204 LGVLTLRAYSKNI-------SDP--IQFTTEARI-VSDGRATSNG--------------V 239

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFH 306
            L+V ++S    FI+   S +  T E+  A     L +     +  +    + DY  L  
Sbjct: 240 SLVVRNASTVDIFIDTETSYRYTTRETREAEIKDKLDTASRSGFLTVKQNAIADYSTLAQ 299

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISS 364
           RV + L            S  +   +P+  R+ +++TD   DP L  L+F FGR+ LI+S
Sbjct: 300 RVDLNLG-----------SSGSAGNLPTDTRLVNYRTDPDSDPELAVLMFHFGRHSLIAS 348

Query: 365 SRPGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           SR     A   NLQG+WN++  P W     ++INLEMNYW +   NL++   P  D L  
Sbjct: 349 SRATESPALPANLQGLWNQEFDPAWGGRFTIDINLEMNYWPAEVTNLADTFSPFIDLLDI 408

Query: 422 LSINGSKTAQVNYLAS--GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
           +   G   A+  Y  S  G+V+HH TD+W  ++       W +WPMGGAWL  +L EHY 
Sbjct: 409 VHGRGLDVAESMYHCSNGGYVLHHNTDLWGDAAPVDNGTTWTMWPMGGAWLSANLIEHYR 468

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLA 534
           +T D   L  R +PLL+  A F   +L    +GY  T  S SPE  +I PD     G + 
Sbjct: 469 FTRDETILRDRIWPLLQSAARFYYCYLFP-FEGYYSTGLSLSPEASYIVPDDMTTAGNVE 527

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            +  + TMD +++ E+F A+    +VL  N         K L +++  +I   G I+EW
Sbjct: 528 GIDIAPTMDNSLLHELFQAVTETCDVLGINNTDCTTAA-KYLSKIKQPQIGSSGRILEW 585


>gi|384566468|ref|ZP_10013572.1| hypothetical protein SacglDRAFT_02625 [Saccharomonospora glauca
           K62]
 gi|384522322|gb|EIE99517.1| hypothetical protein SacglDRAFT_02625 [Saccharomonospora glauca
           K62]
          Length = 924

 Score =  279 bits (713), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 203/609 (33%), Positives = 306/609 (50%), Gaps = 58/609 (9%)

Query: 3   NAESTSTTNP----LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
            A  TS   P    L + ++ PA  + ++ +P+GNG LG  V+GGV +E L+ NE TLWT
Sbjct: 39  GAAETSDLRPSPEGLTLWYDEPASDWESEVLPVGNGALGVGVFGGVATERLQFNEKTLWT 98

Query: 58  GVPG-----DYTNPDAPK--ALSDVRSLVDSGQYAEATAASVKLFGHPA---DVYQLLGD 107
           G PG     D+ N   P+  A+ +VR  +D+   A+      KL G P      YQ  G+
Sbjct: 99  GGPGAADGYDFGNWREPRPGAIEEVRQRLDTELRADPEWVVSKL-GQPKRGYGAYQTFGE 157

Query: 108 IELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSE 167
           I +    + L+   + YRR L+L  A A V Y    V  TRE+F+S  D V+V + SG  
Sbjct: 158 IRVS--GAELEEVAD-YRRYLNLADAVAGVSYEADGVHHTREYFASAADDVVVARFSGEV 214

Query: 168 SGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI 227
            G++   V + +  DN S     N     GR         + A DD  G+++ A  +I++
Sbjct: 215 PGAVDVTVGV-TAPDNRS----KNLTARGGRIT------FSGALDD-NGLRYEA--QIQV 260

Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN 287
             D G+     D  + V  +D   L+L A + +   +  P    +DP +     + +   
Sbjct: 261 LTDGGSRVDNPDGSVTVTDADTMTLVLAAGTDYSAEY--PVYRGEDPHAAVTERVDAAVA 318

Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP 347
             Y  L   H+ D++ LF RVS+ L +   D+ TD       D   +AE  ++ +     
Sbjct: 319 KGYDALRAAHVADHRGLFDRVSLDLGQRMPDLPTDELLARYRDGGLAAEERRALEV---- 374

Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
               L FQ+GRYLLI+SSR G+  ANLQG+WN+  SP W +  HVNINL+MNYW +   N
Sbjct: 375 ----LYFQYGRYLLIASSRSGSLPANLQGVWNDSTSPPWSADYHVNINLQMNYWPAEVTN 430

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMG 466
           LSE  EPLFD++  L   G+ TA+  +   GWV+H++T  +  +   D     W  +P  
Sbjct: 431 LSETTEPLFDYVDSLVAPGTVTAKEMFGNRGWVVHNETTPFGYTGVHDWATSFW--FPEA 488

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE 525
           GAWL    WEHY +T D  FL +RAYP+L+  + F +D L+ +  DG L  +PS SPE  
Sbjct: 489 GAWLAQSYWEHYLFTRDETFLAERAYPMLKSLSRFWIDELVTDSRDGRLVVSPSYSPEQ- 547

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KI 584
                      S  ++M   I+ ++ +    AAE++ ++E+   E +  +L  L P  +I
Sbjct: 548 --------GDFSAGASMSQQIVWDLLTNTAEAAELVGEDEEFRAE-LAATLADLDPGLRI 598

Query: 585 AEDGSIMEW 593
              G + EW
Sbjct: 599 GSWGQLQEW 607


>gi|375146879|ref|YP_005009320.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
 gi|361060925|gb|AEV99916.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
          Length = 943

 Score =  278 bits (710), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 172/499 (34%), Positives = 257/499 (51%), Gaps = 64/499 (12%)

Query: 110 LEFDDSHLKYAEET----YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISG 165
           L F D + ++A       Y+R LDL+ A + V Y+   V + RE+F S P Q +V  ++ 
Sbjct: 296 LPFGDLYFRFAHGNNSSDYQRSLDLDNAISTVSYTANGVSYNREYFISAPHQCVVMHVTA 355

Query: 166 SESGSLSFNVSLDS--------LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI 217
           S+ G+LS    L++         +D+H+       + +E             +N   K +
Sbjct: 356 SKPGALSLQAVLNTPHKKYVVKKIDDHTL-----SLSLE------------VSNGVLKAV 398

Query: 218 QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSE 277
            +   L    +  R T++   D  + ++ +      LVA++SF     N  D   DP + 
Sbjct: 399 GY---LYATATGGRLTVN---DTAINLQQATEVNFYLVAATSFK----NYKDVSGDPVAA 448

Query: 278 SMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER 337
             +AL  ++ + Y+ + T HL++Y KLF   S             T        +P+ ER
Sbjct: 449 CKAALARVKGVPYASIKTAHLNEYHKLFETFSF------------TVPAGKNSGLPTNER 496

Query: 338 VKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLE 397
           ++ F   +D +LV L   + RYLLISSSRPGTQ ANLQGIWN+ L+P W S    NINLE
Sbjct: 497 IRQFNMKDDAALVPLFLMYSRYLLISSSRPGTQPANLQGIWNDLLTPPWGSKYTTNINLE 556

Query: 398 MNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK 457
           MNYW +   NLS C +PLF+ +  L++ G +TA+ +Y A GWV+HH TD+W + +A    
Sbjct: 557 MNYWTAEVLNLSTCTQPLFNMINELAVAGHQTAKDHYNAPGWVLHHNTDLW-RGTAPINA 615

Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLET 516
               +W  G AWL  H+WEH+ YT D  FL  + YP L+G A F   +L++    GYL +
Sbjct: 616 SNHGIWVTGAAWLTLHIWEHFLYTQDTAFLRAQ-YPNLQGAAQFFEHFLVKDPKTGYLIS 674

Query: 517 NPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
            PS SPEH      G L       TMD  IIRE+F    +AA VL K + A  E++   +
Sbjct: 675 TPSNSPEH------GGLVA---GPTMDHQIIRELFRNCSAAAAVL-KTDAAFAERLKTLI 724

Query: 577 PRLRPTKIAEDGSIMEWVQ 595
           P++ P KI +   + EW++
Sbjct: 725 PQIAPNKIGKHNQLQEWME 743



 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 99/199 (49%), Gaps = 25/199 (12%)

Query: 4   AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           ++S +   PL++ +  PA  +TDA+P+GNGRLGAMV+GGV  E L+LNE+TLW+G P  Y
Sbjct: 20  SQSYAQKQPLRLWYQQPAATWTDALPLGNGRLGAMVFGGVGEEHLQLNEETLWSGRPRSY 79

Query: 64  TNPDAPKALSDVRSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEFDDSHLKYAEE 122
           ++P A + L  +R L+  G+ AE+ A   K F G  A             DDS  +  ++
Sbjct: 80  SHPGAAQYLQPMRQLLAEGKQAESEAMGEKYFMGLKAP------------DDSAYELQKD 127

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
           T+ R +      A V Y+  N    +       ++V +    GS     SFNV    L  
Sbjct: 128 TWFRSVRAQIEPAGVTYNDNNWPAMQLPTPEGWERVGLEGTDGSLWFRTSFNVPAKWLGK 187

Query: 183 N------------HSYVNG 189
           N            ++YVNG
Sbjct: 188 NLVLDLGRIRDLDYTYVNG 206


>gi|421235008|ref|ZP_15691623.1| large secreted protein [Streptococcus pneumoniae 2061617]
 gi|395599385|gb|EJG59558.1| large secreted protein [Streptococcus pneumoniae 2061617]
          Length = 764

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 190/593 (32%), Positives = 300/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LPR   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547


>gi|148998038|ref|ZP_01825551.1| hypothetical protein CGSSp11BS70_05545 [Streptococcus pneumoniae
           SP11-BS70]
 gi|168576031|ref|ZP_02721936.1| large secreted protein [Streptococcus pneumoniae MLV-016]
 gi|307068776|ref|YP_003877742.1| hypothetical protein SPAP_2209 [Streptococcus pneumoniae AP200]
 gi|419472044|ref|ZP_14011900.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA07914]
 gi|419504884|ref|ZP_14044547.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47760]
 gi|421315019|ref|ZP_15765603.1| hypothetical protein SPAR100_2088 [Streptococcus pneumoniae
           GA47562]
 gi|147756048|gb|EDK63091.1| hypothetical protein CGSSp11BS70_05545 [Streptococcus pneumoniae
           SP11-BS70]
 gi|183578125|gb|EDT98653.1| large secreted protein [Streptococcus pneumoniae MLV-016]
 gi|306410313|gb|ADM85740.1| hypothetical protein SPAP_2209 [Streptococcus pneumoniae AP200]
 gi|379543433|gb|EHZ08583.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA07914]
 gi|379604070|gb|EHZ68832.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47760]
 gi|395911603|gb|EJH22468.1| hypothetical protein SPAR100_2088 [Streptococcus pneumoniae
           GA47562]
          Length = 764

 Score =  277 bits (708), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 190/593 (32%), Positives = 300/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LPR   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547


>gi|410477499|ref|YP_006744258.1| hypothetical protein HMPREF1038_02170 [Streptococcus pneumoniae
           gamPNI0373]
 gi|421269340|ref|ZP_15720202.1| hypothetical protein SPAR95_2166 [Streptococcus pneumoniae SPAR95]
 gi|444387345|ref|ZP_21185368.1| hypothetical protein PCS125219_00757 [Streptococcus pneumoniae
           PCS125219]
 gi|444391139|ref|ZP_21189052.1| hypothetical protein PCS70012_02198 [Streptococcus pneumoniae
           PCS70012]
 gi|444391645|ref|ZP_21189459.1| hypothetical protein PCS81218_00254 [Streptococcus pneumoniae
           PCS81218]
 gi|444395928|ref|ZP_21193466.1| hypothetical protein PNI0002_01943 [Streptococcus pneumoniae
           PNI0002]
 gi|444398446|ref|ZP_21195928.1| hypothetical protein PNI0006_02051 [Streptococcus pneumoniae
           PNI0006]
 gi|444399000|ref|ZP_21196473.1| hypothetical protein PNI0007_00277 [Streptococcus pneumoniae
           PNI0007]
 gi|444402193|ref|ZP_21199365.1| hypothetical protein PNI0008_00811 [Streptococcus pneumoniae
           PNI0008]
 gi|444404331|ref|ZP_21201289.1| hypothetical protein PNI0009_00393 [Streptococcus pneumoniae
           PNI0009]
 gi|444408063|ref|ZP_21204730.1| hypothetical protein PNI0010_01508 [Streptococcus pneumoniae
           PNI0010]
 gi|444415928|ref|ZP_21212144.1| hypothetical protein PNI0199_01872 [Streptococcus pneumoniae
           PNI0199]
 gi|444417791|ref|ZP_21213797.1| hypothetical protein PNI0360_01205 [Streptococcus pneumoniae
           PNI0360]
 gi|444419629|ref|ZP_21215476.1| hypothetical protein PNI0427_00501 [Streptococcus pneumoniae
           PNI0427]
 gi|395866259|gb|EJG77390.1| hypothetical protein SPAR95_2166 [Streptococcus pneumoniae SPAR95]
 gi|406370444|gb|AFS44134.1| conserved hypothetical membrane protein [Streptococcus pneumoniae
           gamPNI0373]
 gi|444253440|gb|ELU59896.1| hypothetical protein PCS125219_00757 [Streptococcus pneumoniae
           PCS125219]
 gi|444255297|gb|ELU61653.1| hypothetical protein PCS70012_02198 [Streptococcus pneumoniae
           PCS70012]
 gi|444255745|gb|ELU62088.1| hypothetical protein PNI0002_01943 [Streptococcus pneumoniae
           PNI0002]
 gi|444259175|gb|ELU65491.1| hypothetical protein PNI0006_02051 [Streptococcus pneumoniae
           PNI0006]
 gi|444265102|gb|ELU71130.1| hypothetical protein PCS81218_00254 [Streptococcus pneumoniae
           PCS81218]
 gi|444266940|gb|ELU72867.1| hypothetical protein PNI0008_00811 [Streptococcus pneumoniae
           PNI0008]
 gi|444269354|gb|ELU75162.1| hypothetical protein PNI0007_00277 [Streptococcus pneumoniae
           PNI0007]
 gi|444271659|gb|ELU77410.1| hypothetical protein PNI0010_01508 [Streptococcus pneumoniae
           PNI0010]
 gi|444277109|gb|ELU82631.1| hypothetical protein PNI0009_00393 [Streptococcus pneumoniae
           PNI0009]
 gi|444278655|gb|ELU84090.1| hypothetical protein PNI0199_01872 [Streptococcus pneumoniae
           PNI0199]
 gi|444282561|gb|ELU87815.1| hypothetical protein PNI0360_01205 [Streptococcus pneumoniae
           PNI0360]
 gi|444286393|gb|ELU91377.1| hypothetical protein PNI0427_00501 [Streptococcus pneumoniae
           PNI0427]
          Length = 764

 Score =  277 bits (708), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 190/593 (32%), Positives = 300/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LPR   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547


>gi|149012024|ref|ZP_01833172.1| hypothetical protein CGSSp19BS75_03168 [Streptococcus pneumoniae
           SP19-BS75]
 gi|418077389|ref|ZP_12714618.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47502]
 gi|147763979|gb|EDK70912.1| hypothetical protein CGSSp19BS75_03168 [Streptococcus pneumoniae
           SP19-BS75]
 gi|353745563|gb|EHD26232.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47502]
          Length = 764

 Score =  277 bits (708), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 190/593 (32%), Positives = 300/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LPR   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547


>gi|387760237|ref|YP_006067215.1| hypothetical protein SPNINV200_19710 [Streptococcus pneumoniae
           INV200]
 gi|419515658|ref|ZP_14055280.1| putative alpha-L-fucosidase [Streptococcus pneumoniae England14-9]
 gi|301802826|emb|CBW35604.1| conserved hypothetical protein [Streptococcus pneumoniae INV200]
 gi|379633974|gb|EHZ98540.1| putative alpha-L-fucosidase [Streptococcus pneumoniae England14-9]
          Length = 764

 Score =  277 bits (708), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 190/593 (32%), Positives = 300/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LPR   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547


>gi|225861978|ref|YP_002743487.1| large secreted protein [Streptococcus pneumoniae Taiwan19F-14]
 gi|298229408|ref|ZP_06963089.1| large secreted protein [Streptococcus pneumoniae str. Canada
           MDR_19F]
 gi|298255588|ref|ZP_06979174.1| large secreted protein [Streptococcus pneumoniae str. Canada
           MDR_19A]
 gi|298501665|ref|YP_003723605.1| alpha-L-fucosidase [Streptococcus pneumoniae TCH8431/19A]
 gi|387789197|ref|YP_006254265.1| large secreted protein [Streptococcus pneumoniae ST556]
 gi|417313623|ref|ZP_12100332.1| alpha-L-fucosidase [Streptococcus pneumoniae GA04375]
 gi|418083982|ref|ZP_12721174.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44288]
 gi|418086144|ref|ZP_12723319.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47281]
 gi|418094961|ref|ZP_12732084.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49138]
 gi|418119732|ref|ZP_12756683.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA18523]
 gi|418142694|ref|ZP_12779502.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13455]
 gi|418151670|ref|ZP_12788412.1| alpha-L-fucosidase [Streptococcus pneumoniae GA14798]
 gi|418153939|ref|ZP_12790673.1| alpha-L-fucosidase [Streptococcus pneumoniae GA16121]
 gi|418199016|ref|ZP_12835468.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47778]
 gi|418224372|ref|ZP_12851007.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5185-06]
 gi|418228657|ref|ZP_12855270.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 3063-00]
 gi|419430394|ref|ZP_13970551.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA11856]
 gi|419439146|ref|ZP_13979210.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13499]
 gi|419502823|ref|ZP_14042501.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47628]
 gi|419529128|ref|ZP_14068665.1| hypothetical protein SPAR51_2171 [Streptococcus pneumoniae GA17719]
 gi|225728210|gb|ACO24061.1| large secreted protein [Streptococcus pneumoniae Taiwan19F-14]
 gi|298237260|gb|ADI68391.1| possible alpha-L-fucosidase [Streptococcus pneumoniae TCH8431/19A]
 gi|327388899|gb|EGE87247.1| alpha-L-fucosidase [Streptococcus pneumoniae GA04375]
 gi|353753506|gb|EHD34129.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44288]
 gi|353754984|gb|EHD35594.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47281]
 gi|353762498|gb|EHD43057.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49138]
 gi|353788845|gb|EHD69241.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA18523]
 gi|353803816|gb|EHD84107.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13455]
 gi|353811993|gb|EHD92229.1| alpha-L-fucosidase [Streptococcus pneumoniae GA14798]
 gi|353815265|gb|EHD95485.1| alpha-L-fucosidase [Streptococcus pneumoniae GA16121]
 gi|353859431|gb|EHE39382.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47778]
 gi|353876904|gb|EHE56749.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5185-06]
 gi|353878966|gb|EHE58794.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 3063-00]
 gi|379138939|gb|AFC95730.1| large secreted protein [Streptococcus pneumoniae ST556]
 gi|379535583|gb|EHZ00782.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13499]
 gi|379548700|gb|EHZ13818.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA11856]
 gi|379562772|gb|EHZ27781.1| hypothetical protein SPAR51_2171 [Streptococcus pneumoniae GA17719]
 gi|379598038|gb|EHZ62833.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47628]
          Length = 764

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 189/593 (31%), Positives = 301/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFDPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + +++ G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTNYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547


>gi|418101640|ref|ZP_12738719.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7286-06]
 gi|353768739|gb|EHD49262.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7286-06]
          Length = 764

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 190/592 (32%), Positives = 299/592 (50%), Gaps = 54/592 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFDPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            +    L L + +++ G    PS               SI   +  D    H+  YQ+ F
Sbjct: 225 NATEVFLYLKSMTNYWGNIDIPS---------LQGEFSSIDYFTEKD---EHVKKYQEQF 272

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
           +RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISSS
Sbjct: 273 NRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSS 320

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           +P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +   
Sbjct: 321 QPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREP 380

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D  
Sbjct: 381 GRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDER 440

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
            L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D  
Sbjct: 441 ILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQ 498

Query: 546 IIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           I+R    + I  A+ L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 499 ILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547


>gi|168484015|ref|ZP_02708967.1| large secreted protein [Streptococcus pneumoniae CDC1873-00]
 gi|417697350|ref|ZP_12346525.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47368]
 gi|418108816|ref|ZP_12745849.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA41410]
 gi|418111150|ref|ZP_12748165.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49447]
 gi|418163224|ref|ZP_12799902.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17328]
 gi|418168087|ref|ZP_12804735.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19077]
 gi|418176974|ref|ZP_12813561.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41437]
 gi|418219924|ref|ZP_12846585.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP127]
 gi|419423904|ref|ZP_13964112.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43264]
 gi|419461001|ref|ZP_14000923.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02270]
 gi|419463323|ref|ZP_14003222.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02714]
 gi|421273944|ref|ZP_15724780.1| alpha-L-fucosidase [Streptococcus pneumoniae SPAR55]
 gi|172042696|gb|EDT50742.1| large secreted protein [Streptococcus pneumoniae CDC1873-00]
 gi|332198777|gb|EGJ12859.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47368]
 gi|353775273|gb|EHD55754.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA41410]
 gi|353780261|gb|EHD60720.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49447]
 gi|353825359|gb|EHE05524.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17328]
 gi|353837695|gb|EHE17777.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19077]
 gi|353838933|gb|EHE19009.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41437]
 gi|353871990|gb|EHE51859.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP127]
 gi|379528874|gb|EHY94127.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02270]
 gi|379529046|gb|EHY94298.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02714]
 gi|379584326|gb|EHZ49194.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43264]
 gi|395872020|gb|EJG83121.1| alpha-L-fucosidase [Streptococcus pneumoniae SPAR55]
          Length = 764

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 189/593 (31%), Positives = 301/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD ++ ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFSDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547


>gi|345517561|ref|ZP_08797030.1| hypothetical protein BSFG_03806 [Bacteroides sp. 4_3_47FAA]
 gi|254837350|gb|EET17659.1| hypothetical protein BSFG_03806 [Bacteroides sp. 4_3_47FAA]
          Length = 828

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 186/601 (30%), Positives = 294/601 (48%), Gaps = 74/601 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P            N  +   L ++R
Sbjct: 72  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTSAGAAAYWNVNKQSAHILDEIR 131

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
               +G    A   + K F                  +  +G+  +E   S +  ++  Y
Sbjct: 132 QAFINGDEKRAMLLTQKNFNSEVPYESWKEKPFRFGNFTTMGEFYIETGLSTIGMSD--Y 189

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V+++   V + R +F S P+ V+  +   ++ G  +L F+   + +  
Sbjct: 190 KRILSLDSALAIVQFNKDGVAYERNYFISYPNNVMTIRFKANKPGKQNLVFSYEPNPVST 249

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                NGNN ++   R                   Q   ++ I  +   GT+S  +  KL
Sbjct: 250 GKMETNGNNGLVYTARLDNN---------------QMEYVIRIHATAKGGTLSN-QSGKL 293

Query: 243 KVEGSDWAVLLLVASSSFDGPFINP--SDSKK----DPTSESMSALQSIRNLSYSDLYTR 296
            V G+D  + L+ A + +   F NP  +D K     +P+  + + ++    L Y  L+  
Sbjct: 294 SVNGADEVIFLVTADTDYQINF-NPDFNDPKAYVGVNPSETTATWMKDAAALGYDALFDA 352

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQ 355
           H  DY  LF+RVS+ L+ S K            D +P+ +R+K+++  + D  L EL +Q
Sbjct: 353 HYKDYASLFNRVSLSLNGSGK-----------TDNIPTPQRLKNYRKGKPDFYLEELYYQ 401

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+EC  PL
Sbjct: 402 FGRYLLIASSRPGNLPANLQGIWHNNVDGPWRVDYHNNINVQMNYWPAGSTNLAECTLPL 461

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHL 474
            DF+  L   G KTAQ  + A GW      +I+  ++  +   + W   PM G WL TH+
Sbjct: 462 IDFIKTLVKPGEKTAQAYFGARGWTASISGNIFGFTAPLESENMSWNFNPMAGPWLATHV 521

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
           W++Y+YT D+ FL+K  Y L++  A F +D+L +  DG     PSTSPEH          
Sbjct: 522 WDYYDYTRDKQFLKKTGYGLIKSSAQFAVDYLWKKPDGTYTAAPSTSPEH---------G 572

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            +   +T   A++RE+    I A+++L  +K E    E+VL+   +L P +I   G +ME
Sbjct: 573 PIDQGATFIHAVVREILLNAIDASKILGVDKKERKQWEEVLE---KLAPYQIGRYGQLME 629

Query: 593 W 593
           W
Sbjct: 630 W 630


>gi|67541006|ref|XP_664277.1| hypothetical protein AN6673.2 [Aspergillus nidulans FGSC A4]
 gi|40738426|gb|EAA57616.1| hypothetical protein AN6673.2 [Aspergillus nidulans FGSC A4]
 gi|259480257|tpe|CBF71222.1| TPA: alpha-fucosidase (Eurofung) [Aspergillus nidulans FGSC A4]
          Length = 831

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 197/585 (33%), Positives = 278/585 (47%), Gaps = 46/585 (7%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A+PIGNGRL A ++GGV +E + LNE+T+W+G   + T  +A  AL   R L+ +G   E
Sbjct: 45  ALPIGNGRLAATIYGGVRAEVITLNENTIWSGPFQERTPENALAALPIARELLLNGSITE 104

Query: 87  ATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
           A     +   H  D    Y   G++EL F   H +   E YRR LD     A V+Y V  
Sbjct: 105 AGEFIQREMMHEIDSMRAYSYFGNLELGF--GHDEAKVEGYRRWLDTRKGDAGVEYVVEG 162

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
           V++TRE+ +S P  V+  + + SE G+L+ N +   + D  S      Q  +  R P  R
Sbjct: 163 VKYTREYIASFPAGVLAARFTASEKGALTLNATFCRVSDATSL-----QASVSDRAPWIR 217

Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGP 263
           +   +    +   I FS           G  S + +  L    +    L LV +++ D  
Sbjct: 218 LSGTSGQPAEEYPIVFS-----------GQASFVAEGALFTSSN--GTLTLVNATTVD-I 263

Query: 264 FINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
           F +   + + P+ E++ A     L    N  Y  +    L D   L  R SI    S  D
Sbjct: 264 FFDAETNYRYPSQEAIDAEIAHKLTDALNKGYDRIRDEALADSSSLLDRASIDFGIS-TD 322

Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV----ANL 374
             +D  ++E I  V SA  +     D D  L  L + +GR+LL++SSR  T+     ANL
Sbjct: 323 ETSDLATDERIALVRSAGGL-----DGDLELATLAWNYGRHLLVASSRNTTEAIDLPANL 377

Query: 375 QGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY 434
           QGIWN   +  W     +NIN EMNYW + P NL E QEPLFD        G K A+  Y
Sbjct: 378 QGIWNNQTTAAWGGKYTININTEMNYWPAGPTNLIETQEPLFDLFAVAYPRGQKLARDMY 437

Query: 435 LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPL 494
             SG V HH  D+W   +        ++WPMG AWL THL++ Y +T D+  L    YP 
Sbjct: 438 NCSGVVFHHNLDVWGDPAPVDNYTSSSMWPMGAAWLATHLYDQYRFTGDKALLADTIYPY 497

Query: 495 LEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMDMAIIRE 549
           L   A F   +  E H+GY  T PS SPE+ FI P+     G  A +  +  MD  II E
Sbjct: 498 LVDVAKFYQCYTFE-HEGYKVTGPSLSPENTFIIPENWTVAGNKAAMDVAIPMDDQIIWE 556

Query: 550 VFSAIISAA-EVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           V   ++ AA E+   ++D  V      L ++ P +I   G I EW
Sbjct: 557 VLHNLLDAASELGIADDDHTVSAAKSFLHKIHPPRIGFQGQIQEW 601


>gi|302540737|ref|ZP_07293079.1| alpha-L-fucosidase 2 [Streptomyces hygroscopicus ATCC 53653]
 gi|302458355|gb|EFL21448.1| alpha-L-fucosidase 2 [Streptomyces himastatinicus ATCC 53653]
          Length = 775

 Score =  276 bits (705), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 193/587 (32%), Positives = 294/587 (50%), Gaps = 63/587 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV----------PGDYTNPDAPKALSDV 75
           +A+PIGNG LGAMV+GGV  E ++ NE +LWTG            G++  P  P AL+ V
Sbjct: 18  EALPIGNGTLGAMVFGGVARERIQFNEKSLWTGGPGGPGSAPYDSGNWREPR-PGALAAV 76

Query: 76  RSLVDSGQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
           + L+D    A     + +L G P      YQ  GD+ LE   +    + ++YRR L++  
Sbjct: 77  QRLIDEHGAAAPEDVAARL-GQPRSRYGAYQPFGDLWLEIPGA--PESPDSYRRLLEIRK 133

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
             A VKY+   V   RE F+S PD+VIV +   +  G++ F +   S      +V  ++ 
Sbjct: 134 GVALVKYTAQGVRHRREFFASYPDRVIVGRFDAA-PGTVGFTLRHTSPRPGDHHVTAHD- 191

Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
                     R+  +    D+  G++F A  ++++  D GT+++ ED  L V G+  A  
Sbjct: 192 ---------GRLTIRGALEDN--GLRFEA--QVRVMADGGTVTSGEDGTLTVTGAHSAWF 238

Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
           +L A + +     +P    +DP       + +  +  Y  L +RH+ D++ LF R ++ L
Sbjct: 239 VLAAGTDYAD--THPHYRGEDPHRTVTGTVDAAADRGYLTLLSRHVRDHRALFDRTALDL 296

Query: 313 S-RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
             R+P    TD            A+R          +L EL F +GRYLLI+SSRPG  +
Sbjct: 297 GGRTPPRTPTDRQRAAYTGGESPADR----------ALEELFFDYGRYLLIASSRPGAPL 346

Query: 372 -ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQGIWN+ + P W +  H NINL+M YW +   +L+E  EPL  F+T L   G  TA
Sbjct: 347 PANLQGIWNDSVRPAWSADYHTNINLQMAYWPAHALHLAETAEPLHRFITALRAPGRITA 406

Query: 431 QVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           +  + A GWV+H++T+ +  +   D     W  +P   AWL  HL+EHY +T+D  FL  
Sbjct: 407 REMFGARGWVVHNETNAYGFTGVHDWSTAFW--FPEAAAWLVHHLYEHYRFTLDTGFLRD 464

Query: 490 RAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAII 547
            AYP +   A+F LD L  +  DG L  +P  SPEH +F A             M   I+
Sbjct: 465 TAYPAMREAAAFWLDTLRPDPRDGTLVVSPGYSPEHGDFTA----------GPAMSQQIV 514

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
            ++ +A + AA  L  ++ AL   + ++L  L P  +I   G + EW
Sbjct: 515 HDLLTATLEAARTL-GDDPALQAGLRRALDALDPGLRIGSWGQLQEW 560


>gi|15904007|ref|NP_359557.1| hypothetical protein spr1966 [Streptococcus pneumoniae R6]
 gi|116517212|ref|YP_817374.1| hypothetical protein SPD_1988 [Streptococcus pneumoniae D39]
 gi|148988800|ref|ZP_01820215.1| hypothetical protein CGSSp6BS73_06978 [Streptococcus pneumoniae
           SP6-BS73]
 gi|148991988|ref|ZP_01821762.1| hypothetical protein CGSSp9BS68_10895 [Streptococcus pneumoniae
           SP9-BS68]
 gi|149020072|ref|ZP_01835046.1| hypothetical protein CGSSp23BS72_08554 [Streptococcus pneumoniae
           SP23-BS72]
 gi|168494084|ref|ZP_02718227.1| large secreted protein [Streptococcus pneumoniae CDC3059-06]
 gi|387627290|ref|YP_006063466.1| hypothetical protein INV104_18640 [Streptococcus pneumoniae INV104]
 gi|417687620|ref|ZP_12336887.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41301]
 gi|418075010|ref|ZP_12712256.1| alpha-L-fucosidase [Streptococcus pneumoniae GA11184]
 gi|418081811|ref|ZP_12719017.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6735-05]
 gi|418090533|ref|ZP_12727683.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43265]
 gi|418099496|ref|ZP_12736589.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6901-05]
 gi|418103895|ref|ZP_12740963.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP070]
 gi|418106297|ref|ZP_12743347.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44500]
 gi|418115675|ref|ZP_12752658.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5787-06]
 gi|418117845|ref|ZP_12754811.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6963-05]
 gi|418135939|ref|ZP_12772788.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA11426]
 gi|418160899|ref|ZP_12797595.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17227]
 gi|418174588|ref|ZP_12811195.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41277]
 gi|418183706|ref|ZP_12820260.1| alpha-L-fucosidase [Streptococcus pneumoniae GA43380]
 gi|418203403|ref|ZP_12839826.1| alpha-L-fucosidase [Streptococcus pneumoniae GA52306]
 gi|418217614|ref|ZP_12844290.1| alpha-L-fucosidase [Streptococcus pneumoniae Netherlands15B-37]
 gi|419432556|ref|ZP_13972681.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP05]
 gi|419434785|ref|ZP_13974899.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA40183]
 gi|419456417|ref|ZP_13996371.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP04]
 gi|419465661|ref|ZP_14005549.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA04175]
 gi|419467835|ref|ZP_14007713.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA05248]
 gi|419469963|ref|ZP_14009827.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA06083]
 gi|419476555|ref|ZP_14016386.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA14688]
 gi|419480975|ref|ZP_14020776.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA19101]
 gi|419487705|ref|ZP_14027464.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44128]
 gi|419498536|ref|ZP_14038238.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47522]
 gi|419500675|ref|ZP_14040366.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47597]
 gi|419513550|ref|ZP_14053180.1| hypothetical protein SPAR149_2151 [Streptococcus pneumoniae
           GA05578]
 gi|419517761|ref|ZP_14057373.1| hypothetical protein SPAR154_2086 [Streptococcus pneumoniae
           GA02506]
 gi|419522113|ref|ZP_14061704.1| hypothetical protein SPAR7_2189 [Streptococcus pneumoniae GA05245]
 gi|421207672|ref|ZP_15664716.1| large secreted protein [Streptococcus pneumoniae 2090008]
 gi|421209866|ref|ZP_15666875.1| large secreted protein [Streptococcus pneumoniae 2070005]
 gi|421221343|ref|ZP_15678174.1| large secreted protein [Streptococcus pneumoniae 2070425]
 gi|421223600|ref|ZP_15680377.1| large secreted protein [Streptococcus pneumoniae 2070531]
 gi|421226019|ref|ZP_15682753.1| large secreted protein [Streptococcus pneumoniae 2070768]
 gi|421230717|ref|ZP_15687375.1| large secreted protein [Streptococcus pneumoniae 2061376]
 gi|421241635|ref|ZP_15698176.1| large secreted protein [Streptococcus pneumoniae 2080913]
 gi|421267146|ref|ZP_15718023.1| hypothetical protein SPAR27_2104 [Streptococcus pneumoniae SPAR27]
 gi|421284302|ref|ZP_15735084.1| hypothetical protein SPAR151_2120 [Streptococcus pneumoniae
           GA04216]
 gi|421292975|ref|ZP_15743706.1| hypothetical protein SPAR159_2221 [Streptococcus pneumoniae
           GA56348]
 gi|444381684|ref|ZP_21179890.1| hypothetical protein PCS8106_00075 [Streptococcus pneumoniae
           PCS8106]
 gi|444384154|ref|ZP_21182250.1| hypothetical protein PCS8203_00020 [Streptococcus pneumoniae
           PCS8203]
 gi|15459667|gb|AAL00768.1| Conserved hypothetical protein [Streptococcus pneumoniae R6]
 gi|116077788|gb|ABJ55508.1| conserved hypothetical protein [Streptococcus pneumoniae D39]
 gi|147925611|gb|EDK76687.1| hypothetical protein CGSSp6BS73_06978 [Streptococcus pneumoniae
           SP6-BS73]
 gi|147929037|gb|EDK80048.1| hypothetical protein CGSSp9BS68_10895 [Streptococcus pneumoniae
           SP9-BS68]
 gi|147930750|gb|EDK81731.1| hypothetical protein CGSSp23BS72_08554 [Streptococcus pneumoniae
           SP23-BS72]
 gi|183575953|gb|EDT96481.1| large secreted protein [Streptococcus pneumoniae CDC3059-06]
 gi|301795076|emb|CBW37545.1| conserved hypothetical protein [Streptococcus pneumoniae INV104]
 gi|332071430|gb|EGI81924.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41301]
 gi|353745184|gb|EHD25855.1| alpha-L-fucosidase [Streptococcus pneumoniae GA11184]
 gi|353750133|gb|EHD30775.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6735-05]
 gi|353759533|gb|EHD40117.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43265]
 gi|353767716|gb|EHD48248.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6901-05]
 gi|353773458|gb|EHD53955.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP070]
 gi|353774259|gb|EHD54752.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44500]
 gi|353783638|gb|EHD64065.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5787-06]
 gi|353787046|gb|EHD67455.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6963-05]
 gi|353820164|gb|EHE00352.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17227]
 gi|353835112|gb|EHE15207.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41277]
 gi|353846724|gb|EHE26752.1| alpha-L-fucosidase [Streptococcus pneumoniae GA43380]
 gi|353864851|gb|EHE44761.1| alpha-L-fucosidase [Streptococcus pneumoniae GA52306]
 gi|353868852|gb|EHE48736.1| alpha-L-fucosidase [Streptococcus pneumoniae Netherlands15B-37]
 gi|353899786|gb|EHE75353.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA11426]
 gi|379535787|gb|EHZ00985.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA04175]
 gi|379536100|gb|EHZ01291.1| hypothetical protein SPAR7_2189 [Streptococcus pneumoniae GA05245]
 gi|379542257|gb|EHZ07415.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA05248]
 gi|379542673|gb|EHZ07828.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA06083]
 gi|379557271|gb|EHZ22317.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA14688]
 gi|379569141|gb|EHZ34115.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA19101]
 gi|379575027|gb|EHZ39964.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA40183]
 gi|379584597|gb|EHZ49463.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44128]
 gi|379597600|gb|EHZ62398.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47522]
 gi|379597787|gb|EHZ62584.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47597]
 gi|379626380|gb|EHZ90998.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP04]
 gi|379626589|gb|EHZ91206.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP05]
 gi|379632837|gb|EHZ97407.1| hypothetical protein SPAR149_2151 [Streptococcus pneumoniae
           GA05578]
 gi|379637411|gb|EIA01967.1| hypothetical protein SPAR154_2086 [Streptococcus pneumoniae
           GA02506]
 gi|395571912|gb|EJG32514.1| large secreted protein [Streptococcus pneumoniae 2090008]
 gi|395572036|gb|EJG32637.1| large secreted protein [Streptococcus pneumoniae 2070005]
 gi|395584331|gb|EJG44724.1| large secreted protein [Streptococcus pneumoniae 2070425]
 gi|395586059|gb|EJG46437.1| large secreted protein [Streptococcus pneumoniae 2070531]
 gi|395588107|gb|EJG48442.1| large secreted protein [Streptococcus pneumoniae 2070768]
 gi|395592519|gb|EJG52784.1| large secreted protein [Streptococcus pneumoniae 2061376]
 gi|395605911|gb|EJG66022.1| large secreted protein [Streptococcus pneumoniae 2080913]
 gi|395865531|gb|EJG76670.1| hypothetical protein SPAR27_2104 [Streptococcus pneumoniae SPAR27]
 gi|395879316|gb|EJG90376.1| hypothetical protein SPAR151_2120 [Streptococcus pneumoniae
           GA04216]
 gi|395891223|gb|EJH02225.1| hypothetical protein SPAR159_2221 [Streptococcus pneumoniae
           GA56348]
 gi|429316926|emb|CCP36654.1| putative alpha-L-fucosidase [Streptococcus pneumoniae SPN034156]
 gi|444252808|gb|ELU59268.1| hypothetical protein PCS8203_00020 [Streptococcus pneumoniae
           PCS8203]
 gi|444253936|gb|ELU60383.1| hypothetical protein PCS8106_00075 [Streptococcus pneumoniae
           PCS8106]
          Length = 764

 Score =  276 bits (705), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 189/593 (31%), Positives = 300/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547


>gi|168491689|ref|ZP_02715832.1| large secreted protein [Streptococcus pneumoniae CDC0288-04]
 gi|183574053|gb|EDT94581.1| large secreted protein [Streptococcus pneumoniae CDC0288-04]
          Length = 764

 Score =  275 bits (704), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 188/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S   +        +I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LPR   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFINRVKELKKKLPR---TKIGSNGQIQEWLE 547


>gi|421212007|ref|ZP_15668985.1| large secreted protein [Streptococcus pneumoniae 2070035]
 gi|421232851|ref|ZP_15689488.1| large secreted protein [Streptococcus pneumoniae 2080076]
 gi|395571698|gb|EJG32309.1| large secreted protein [Streptococcus pneumoniae 2070035]
 gi|395593380|gb|EJG53629.1| large secreted protein [Streptococcus pneumoniae 2080076]
          Length = 764

 Score =  275 bits (704), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 188/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S   +        +I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LPR   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547


>gi|255035225|ref|YP_003085846.1| hypothetical protein Dfer_1435 [Dyadobacter fermentans DSM 18053]
 gi|254947981|gb|ACT92681.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 790

 Score =  275 bits (704), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 185/610 (30%), Positives = 297/610 (48%), Gaps = 67/610 (10%)

Query: 4   AESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG- 61
           A+ TS T PL + ++ PAK + T A+PIGNG +GAM +GG   E ++ +E +LW G  G 
Sbjct: 24  AQPTSKTAPLSLWYDQPAKEWMTQALPIGNGHVGAMFFGGTDEERIQFSEGSLWAGGKGA 83

Query: 62  --DYT---NPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGH--------PADVY---QL 104
             DY      +A K L +VR L+ +G+  EA A A+ +L G         P+  +   Q 
Sbjct: 84  NADYNFGIKKEAHKHLPEVRELLAAGKLKEAHALANKELTGAIHEKKENTPSSDFGAQQT 143

Query: 105 LGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS 164
           +GD+ ++      K A + YRREL+++ A  +V+Y  G   F R +F + P +V+V + +
Sbjct: 144 VGDLFIKMPS---KGAAQNYRRELNISDALGKVQYEAGGTRFERSYFGNYPAKVMVYRFT 200

Query: 165 GSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
            S   + S         D               R  GK+     +  D+ +  +F  +  
Sbjct: 201 SSTPETYSIRFETPHAKDYE-------------RFEGKQYTFGGHLKDNHQ--EFETVYR 245

Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
           I    D    +A  D  L V G+   VL+   ++ +   F  P     D    + + +  
Sbjct: 246 I----DTDGKTAFSDGVLTVTGARSIVLIHTVATDYVMKF--PDYKGNDYKKANAATMAG 299

Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD 344
           +   +Y+ L      DY  LF RV++ L  +            +   +P+ +R K++   
Sbjct: 300 VAGKNYASLVAAQQKDYHSLFDRVALTLGNA------------DAPAIPTDQRQKAYSAG 347

Query: 345 E-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
           + D  L EL FQ+GRYL+ISS+RPGT   +LQG WN+  +P W +  H NIN++M YW +
Sbjct: 348 QADGRLEELYFQYGRYLMISSTRPGTMPMSLQGKWNDSTNPPWANDYHTNINIQMLYWPA 407

Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
              NLSEC  PL DF   +   G   A+  + A GW+++   + +  +S       W  +
Sbjct: 408 EVTNLSECHVPLMDFTQSIVAPGRLAAKEFFNAKGWIVNTMLNAYGYTSPGW-DFPWGFF 466

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
           P G AWL  HLWEHY +T D+ FL+  AYP+++  + F +D+L +   G L ++PS SPE
Sbjct: 467 PGGAAWLSQHLWEHYAFTNDKAFLKNTAYPIMKEASEFWMDYLTDDGRGRLVSSPSYSPE 526

Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
           H           +S  +TMD  +  +V +    AA +L  ++D   +K   +  ++ P +
Sbjct: 527 H---------GGISTGATMDHEMAWDVLNNTAEAAAILGVDQD-FAQKARSTRDKILPLQ 576

Query: 584 IAEDGSIMEW 593
           I     + EW
Sbjct: 577 IGRWKQLQEW 586


>gi|419436976|ref|ZP_13977057.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 8190-05]
 gi|379611263|gb|EHZ75990.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 8190-05]
          Length = 764

 Score =  275 bits (703), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 189/593 (31%), Positives = 300/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTVFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547


>gi|149003007|ref|ZP_01827918.1| hypothetical protein CGSSp14BS69_00740 [Streptococcus pneumoniae
           SP14-BS69]
 gi|168489226|ref|ZP_02713425.1| large secreted protein [Streptococcus pneumoniae SP195]
 gi|221232865|ref|YP_002512019.1| hypothetical protein SPN23F_21920 [Streptococcus pneumoniae ATCC
           700669]
 gi|225855653|ref|YP_002737165.1| large secreted protein [Streptococcus pneumoniae JJA]
 gi|237650653|ref|ZP_04524905.1| large secreted protein [Streptococcus pneumoniae CCRI 1974]
 gi|237822208|ref|ZP_04598053.1| large secreted protein [Streptococcus pneumoniae CCRI 1974M2]
 gi|415701401|ref|ZP_11458355.1| large secreted protein [Streptococcus pneumoniae 459-5]
 gi|415750467|ref|ZP_11478309.1| large secreted protein [Streptococcus pneumoniae SV35]
 gi|415753360|ref|ZP_11480342.1| large secreted protein [Streptococcus pneumoniae SV36]
 gi|417680132|ref|ZP_12329525.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17570]
 gi|418124532|ref|ZP_12761459.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44378]
 gi|418126808|ref|ZP_12763710.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44511]
 gi|418129072|ref|ZP_12765961.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP170]
 gi|418138272|ref|ZP_12775106.1| alpha-L-fucosidase [Streptococcus pneumoniae GA11663]
 gi|418144761|ref|ZP_12781556.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13494]
 gi|418179304|ref|ZP_12815881.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41565]
 gi|418192602|ref|ZP_12829101.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47388]
 gi|418215362|ref|ZP_12842093.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA54644]
 gi|418235345|ref|ZP_12861918.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA08780]
 gi|419458700|ref|ZP_13998639.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02254]
 gi|419474246|ref|ZP_14014091.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13430]
 gi|419485376|ref|ZP_14025147.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43257]
 gi|419494282|ref|ZP_14034004.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47210]
 gi|419509243|ref|ZP_14048891.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49542]
 gi|421279922|ref|ZP_15730725.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17301]
 gi|421300242|ref|ZP_15750913.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19998]
 gi|147759010|gb|EDK66005.1| hypothetical protein CGSSp14BS69_00740 [Streptococcus pneumoniae
           SP14-BS69]
 gi|183572159|gb|EDT92687.1| large secreted protein [Streptococcus pneumoniae SP195]
 gi|220675327|emb|CAR69925.1| conserved hypothetical protein [Streptococcus pneumoniae ATCC
           700669]
 gi|225723250|gb|ACO19103.1| large secreted protein [Streptococcus pneumoniae JJA]
 gi|332071597|gb|EGI82090.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17570]
 gi|353794144|gb|EHD74502.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44378]
 gi|353794344|gb|EHD74701.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44511]
 gi|353797122|gb|EHD77459.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP170]
 gi|353807227|gb|EHD87499.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13494]
 gi|353840818|gb|EHE20880.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41565]
 gi|353854436|gb|EHE34414.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47388]
 gi|353867652|gb|EHE47543.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA54644]
 gi|353885068|gb|EHE64858.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA08780]
 gi|353899629|gb|EHE75198.1| alpha-L-fucosidase [Streptococcus pneumoniae GA11663]
 gi|379528696|gb|EHY93950.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02254]
 gi|379549315|gb|EHZ14425.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13430]
 gi|379580149|gb|EHZ45044.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43257]
 gi|379591544|gb|EHZ56368.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47210]
 gi|379609534|gb|EHZ74272.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49542]
 gi|381309007|gb|EIC49850.1| large secreted protein [Streptococcus pneumoniae SV36]
 gi|381313067|gb|EIC53859.1| large secreted protein [Streptococcus pneumoniae 459-5]
 gi|381316317|gb|EIC57067.1| large secreted protein [Streptococcus pneumoniae SV35]
 gi|395877150|gb|EJG88220.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17301]
 gi|395899666|gb|EJH10605.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19998]
          Length = 764

 Score =  275 bits (703), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 188/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELCIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S   +        +I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LPR   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547


>gi|213963750|ref|ZP_03392000.1| alpha-L-fucosidase 2 [Capnocytophaga sputigena Capno]
 gi|213953630|gb|EEB64962.1| alpha-L-fucosidase 2 [Capnocytophaga sputigena Capno]
          Length = 806

 Score =  275 bits (703), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 195/603 (32%), Positives = 315/603 (52%), Gaps = 59/603 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + F+ PA+HFT+++PIGNGRLGAM +G    + + LNE +LW+G   D  +P+A   L
Sbjct: 23  VSVVFHKPAEHFTESLPIGNGRLGAMFFGKTDVDRIVLNEISLWSGGTQDADDPNAHIHL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
             ++ L+  G+  EA A   K F         G+ A+     YQ+LG++ L++  +    
Sbjct: 83  KTIQQLLLEGKNLEAQALLQKHFIAKGEGSCKGNGANCSYGCYQILGELLLDWKST---L 139

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
             E Y+R L L+ ATA   +  GN    +  F+   + +I  +I+ S+   L  ++SL  
Sbjct: 140 PTENYQRILRLDQATAFTSFKRGNNYIQQIAFADFKNDLIWIRITASQP--LDIDISLHR 197

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALE 238
             +N +    +N+I + G  P          N++ +G+QF++ ++++   + + T +A  
Sbjct: 198 R-ENATTSYKSNKITLSGVLP----------NENTEGMQFASEIDVQTDGNLQNTTNATS 246

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
            +K K       VL + A+++++  F     ++ D   ++   LQ    + + +      
Sbjct: 247 IQKAKE-----IVLKISAATNYN--FTKGGLTQNDVLQKANDYLQKA-TIPFENAIIESQ 298

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-QFG 357
             YQ  F+R     +R   +  TDT S      + + ER++ F   +  +L+ +L+  FG
Sbjct: 299 KAYQVFFNR-----NRWYSEANTDTSS------LSTFERLQRFYKGKKDALLPVLYYNFG 347

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLISSSR G   ANLQG+W E+    W+   H+NINL+MNYW +   NLSE   PL  
Sbjct: 348 RYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINLQMNYWLAESTNLSELTTPLHK 407

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           F   L  NG KTA+  Y A+GW+ H  ++ W  +S       W     GGAWLC H+W+H
Sbjct: 408 FTKNLVANGRKTARAYYNANGWMAHVISNPWFYTSPGE-SAEWGSTLTGGAWLCEHIWQH 466

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DGK- 532
           Y YT++ DFL +  YP+L+  A F    LI+    GY  T PS SPE+ +I P   DGK 
Sbjct: 467 YLYTLNTDFL-REYYPVLKEAADFFQSLLIKDPKTGYWVTAPSNSPENAYIMPQLKDGKK 525

Query: 533 -LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
            +     + TMDM I+RE+FS  + AA++L  + + L  +  + +    P +I + G + 
Sbjct: 526 QIGNTCIAPTMDMQIVRELFSNTLQAAKILGVDNE-LYSQWQEIITHTVPNRIGKKGDLN 584

Query: 592 EWV 594
           EW+
Sbjct: 585 EWL 587


>gi|418147412|ref|ZP_12784184.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13637]
 gi|353810492|gb|EHD90743.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13637]
          Length = 764

 Score =  275 bits (702), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 188/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTVFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S   +        +I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LPR   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547


>gi|417695030|ref|ZP_12344214.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47901]
 gi|332198979|gb|EGJ13060.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47901]
          Length = 764

 Score =  274 bits (701), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 188/593 (31%), Positives = 299/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P     NLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPVNLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547


>gi|307705834|ref|ZP_07642675.1| alpha-fucosidase [Streptococcus mitis SK597]
 gi|307620620|gb|EFN99715.1| alpha-fucosidase [Streptococcus mitis SK597]
          Length = 764

 Score =  274 bits (701), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 189/593 (31%), Positives = 299/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEVQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SSALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGDI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA   Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTATKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RVLTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547


>gi|373958328|ref|ZP_09618288.1| glycoside hydrolase family 2 sugar binding [Mucilaginibacter
           paludis DSM 18603]
 gi|373894928|gb|EHQ30825.1| glycoside hydrolase family 2 sugar binding [Mucilaginibacter
           paludis DSM 18603]
          Length = 960

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 165/502 (32%), Positives = 266/502 (52%), Gaps = 49/502 (9%)

Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
           Y   GD+ L F  S        Y+R+LD+  A A   Y+   V FTRE+ +S+P + I+ 
Sbjct: 311 YLPFGDLILNFKTSS---QVMDYKRDLDIGKAVATTTYNSNGVNFTREYLASDPAKAIII 367

Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
            +  S+ G     +++ +LL     ++  +Q+         ++          KG+   A
Sbjct: 368 HLKASKPG----QINMVALLQTSHKISSVHQVDANTIALDVKVQ---------KGV-LKA 413

Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
           +  + I    GT+  + ++ + +  +D   + L A++SF     N  D    P      A
Sbjct: 414 VSYLYIKALSGTVKVINNQ-ISISKADDVTIYLTAATSFK----NYKDVSGKPDEICKQA 468

Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
           LQ+ +  +++ L  + + DYQ+ F+  S+ L     D+ TD             ER+K++
Sbjct: 469 LQAAKTKTFAQLKAQSITDYQQYFNTFSVNLGPGKVDVPTD-------------ERIKTY 515

Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
               DP L+ L  Q+GRYLLIS SRP +++ ANLQGIWN+ + P+W S    NINL+MNY
Sbjct: 516 SVAFDPGLLALYMQYGRYLLISCSRPNSKLPANLQGIWNDQMVPSWGSKFTTNINLQMNY 575

Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
           W +   NL+ C++PLF  ++ L++ G++TA+++Y A GW++HH TDIW   +A       
Sbjct: 576 WPAEELNLTPCEKPLFKMISQLAVTGAQTAKIHYDAPGWILHHNTDIWL-GTAPINASNH 634

Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPS 519
            +W  G AWLC  LWEHY YT D DFL+K  Y  ++G A F +  L++    G+L + PS
Sbjct: 635 GIWQGGAAWLCHQLWEHYLYTGDIDFLKKH-YAEMKGAAEFFVSTLVKDPVTGFLISTPS 693

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH      G L       TMD  IIR++F   ISA+E+L K +DA  + + +   ++
Sbjct: 694 NSPEH------GGLVA---GPTMDRQIIRDLFKNCISASEIL-KTDDAFRKTLQEKYAQI 743

Query: 580 RPTKIAEDGSIMEWVQRRLNTS 601
            P K+ + G + EW++ + +T+
Sbjct: 744 APNKVGKFGQLQEWMEDKDDTA 765



 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 39/95 (41%), Positives = 60/95 (63%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           ++ A     +  LK+ +  PA+ +TDA+PIGNG LGAM +GG+ S+ ++ NE TLW+G P
Sbjct: 14  LLAAAQNVFSQDLKLWYKKPAEKWTDALPIGNGTLGAMFYGGISSDRIQFNEQTLWSGSP 73

Query: 61  GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF 95
             Y    A   L ++R+L+ +G+ AEA A + K F
Sbjct: 74  RKYQRDGAATYLPEIRNLLFAGKQAEAEALAEKHF 108


>gi|225857727|ref|YP_002739238.1| large secreted protein [Streptococcus pneumoniae P1031]
 gi|444410728|ref|ZP_21207248.1| hypothetical protein PNI0076_01706 [Streptococcus pneumoniae
           PNI0076]
 gi|444412459|ref|ZP_21208780.1| hypothetical protein PNI0153_00823 [Streptococcus pneumoniae
           PNI0153]
 gi|444422182|ref|ZP_21217843.1| hypothetical protein PNI0446_00530 [Streptococcus pneumoniae
           PNI0446]
 gi|225724930|gb|ACO20782.1| large secreted protein [Streptococcus pneumoniae P1031]
 gi|444274421|gb|ELU80068.1| hypothetical protein PNI0153_00823 [Streptococcus pneumoniae
           PNI0153]
 gi|444276759|gb|ELU82299.1| hypothetical protein PNI0076_01706 [Streptococcus pneumoniae
           PNI0076]
 gi|444288455|gb|ELU93349.1| hypothetical protein PNI0446_00530 [Streptococcus pneumoniae
           PNI0446]
          Length = 764

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 187/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S   +        +I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547


>gi|393788805|ref|ZP_10376931.1| hypothetical protein HMPREF1068_03211 [Bacteroides nordii
           CL02T12C05]
 gi|392653911|gb|EIY47561.1| hypothetical protein HMPREF1068_03211 [Bacteroides nordii
           CL02T12C05]
          Length = 814

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 191/599 (31%), Positives = 299/599 (49%), Gaps = 70/599 (11%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           T ++P+GNG LGA + G + +E + LNE TLW G P      DY    N  +   L ++R
Sbjct: 60  TSSLPLGNGSLGANIMGSIAAERITLNEKTLWKGGPNTSGGADYYWNVNKQSAPILKEIR 119

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
               +G    A   + K F   A              +  +G++ +E   S +  ++  Y
Sbjct: 120 QAFTAGDQKRAETLTRKNFNGLAAYEEKDETPFRFGSFTTMGEVYVETGLSEIGMSD--Y 177

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++    +++ R +F S PD V+V + +  + G  +L+F+ S ++   
Sbjct: 178 KRILSLDSAMATVRFLKDGIKYQRNYFISYPDSVMVMRFTADKPGMQNLTFSYSPNTEAQ 237

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G N +   G         K N N     ++F AI       ++G    +E+ KL
Sbjct: 238 GKIEADGTNGLYYAG---------KLNNNQMKFALRFRAI-------NKGGTVRVENGKL 281

Query: 243 KVEGSDWAVLLLVASSSFD---GPFINPSDS--KKDPTSESMSALQSIRNLSYSDLYTRH 297
            ++ ++  V LL A + +     P  N  ++    +P+  + + ++     +Y  LY RH
Sbjct: 282 VIKDANEVVFLLTADTDYKMNYNPDFNSPETYVGNNPSETTRNMMKQAEAKTYEVLYLRH 341

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQF 356
            +DY  LF+RV  +LS +P+  + D         +P+ +R+K + Q   D  L +L +Q+
Sbjct: 342 QNDYTALFNRV--KLSLNPQVPIAD---------LPTDQRLKHYRQGTPDYYLEQLYYQY 390

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ +L   W    H NIN++MNYW +   NL EC  PL 
Sbjct: 391 GRYLLIASSRPGNMPANLQGIWHNNLDGPWRVDYHNNINIQMNYWPACSTNLDECMIPLI 450

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTA+  + A GW      +I+  ++     ++ W   PM G WL TH+W
Sbjct: 451 DFIRGLVKPGEKTAKAYFNARGWTASISANIFGFTAPLSSEQMEWNFNPMAGPWLATHIW 510

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D+ FL +  YPL++  A F +D+L    DG     PSTSPEH           
Sbjct: 511 EYYDYTRDKKFLSEIGYPLIKSSAQFTVDYLWHKPDGTYTAAPSTSPEH---------GP 561

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
           V   +T   A++RE+ S  ISA+++L    DA   K  K  L  L P +I   G +MEW
Sbjct: 562 VDQGATFVHAVVREILSDAISASKIL--GVDAKERKQWKDILKNLVPYQIGRYGQLMEW 618


>gi|15901970|ref|NP_346574.1| hypothetical protein SP_2160 [Streptococcus pneumoniae TIGR4]
 gi|418131327|ref|ZP_12768207.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA07643]
 gi|418230992|ref|ZP_12857587.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP01]
 gi|419478817|ref|ZP_14018636.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA18068]
 gi|421243935|ref|ZP_15700445.1| large secreted protein [Streptococcus pneumoniae 2081074]
 gi|421248340|ref|ZP_15704814.1| large secreted protein [Streptococcus pneumoniae 2082170]
 gi|14973671|gb|AAK76214.1| conserved hypothetical protein [Streptococcus pneumoniae TIGR4]
 gi|353800742|gb|EHD81051.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA07643]
 gi|353884503|gb|EHE64302.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP01]
 gi|379563089|gb|EHZ28094.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA18068]
 gi|395605861|gb|EJG65975.1| large secreted protein [Streptococcus pneumoniae 2081074]
 gi|395612201|gb|EJG72246.1| large secreted protein [Streptococcus pneumoniae 2082170]
          Length = 764

 Score =  273 bits (699), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 187/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTVFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S   +        +I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547


>gi|419443562|ref|ZP_13983582.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13224]
 gi|379549113|gb|EHZ14224.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13224]
          Length = 764

 Score =  273 bits (699), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 187/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S   +        +I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L + + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPKVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LPR   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547


>gi|429847882|gb|ELA23431.1| alpha-l-fucosidase [Colletotrichum gloeosporioides Nara gc5]
          Length = 798

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 193/592 (32%), Positives = 282/592 (47%), Gaps = 65/592 (10%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A+PIGNGRLG  VWGG  +ETL +NEDT+W+G   D T P+A   L   R L  SG+  E
Sbjct: 42  ALPIGNGRLGGTVWGGA-NETLTINEDTIWSGPIQDRTPPNALATLPVARKLFLSGKITE 100

Query: 87  ATAASVKLFGHPAD----VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
                ++    PA+     +   G+++L+F  S      E Y R LD     +   Y+  
Sbjct: 101 GGQLVLREM-TPAEKSERQFGYFGNLDLDFGHSG---NLENYVRWLDTKQGNSGSSYAFD 156

Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---LDSLLDNHSYVNGNNQIIMEGRC 199
            V FTRE  +S P  V+  + + SE G+L+   S   L ++L N +   G    +     
Sbjct: 157 GVNFTREFVASYPAGVLAARFTSSEEGALNLKASFSRLANILVNVASTAGGVNSVTLMSS 216

Query: 200 PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSS 259
            G+ +        D   I F+                    K + +GS   VL +  +++
Sbjct: 217 SGQPL--------DENPILFTGQARF----------VAPGAKFENDGS---VLRITGATA 255

Query: 260 FDGPFINPSD----SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
            D  F   ++    S+ +  +E    L +     YSDL    L D   L  R SI L +S
Sbjct: 256 IDLFFDAETNYRFASQDEWEAEIDRKLNAALTKGYSDLRDEALKDSSSLLGRASIDLGKS 315

Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQV--- 371
           P+           +  +P+ ERV   + +  D  L  L +  GR++L+ +SR  T+    
Sbjct: 316 PR----------GLSALPTDERVAIARNNSSDVELSTLTWNLGRHMLVGASR-NTEADID 364

Query: 372 --ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
             ANLQGIWN   +  W     +NIN EMNYW + P NL E QEPLFD +   +  G   
Sbjct: 365 MPANLQGIWNNKTTAAWGGKYTININTEMNYWSAGPTNLIETQEPLFDLMKVANPRGKAM 424

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A+  Y   G + HH  D+W    A        +WPMG AWL  H+ +HY++T D+ FL  
Sbjct: 425 AKAMYGCDGTMFHHNLDVWGDPGATDNYTSSTMWPMGAAWLVQHMVDHYHFTGDKTFLAD 484

Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMDM 544
            AYP L   A+F   +  E H+GY  T PS SPE+ F+ P      G+   +     MD 
Sbjct: 485 VAYPFLIDVATFYECYTFE-HEGYRITGPSLSPENTFVVPSNFSVAGRSEPMDIDIPMDN 543

Query: 545 AIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            ++ +VFSAII AA++L   + N+D  ++K    LPR++P +I   G I+EW
Sbjct: 544 QLMHDVFSAIIEAADILGIDDTNQD--LKKAKDFLPRIKPAQIGSKGQILEW 593


>gi|238482887|ref|XP_002372682.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|220700732|gb|EED57070.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 608

 Score =  273 bits (697), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 181/592 (30%), Positives = 293/592 (49%), Gaps = 60/592 (10%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ P   F  ++P+GNGRLG  ++  +P+E +  NED++W+G   D  N +A      VR
Sbjct: 34  YDTPGTRFNASLPVGNGRLGGTLYY-LPTEIVTWNEDSVWSGTFQDRVNSNALDGFPKVR 92

Query: 77  SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           +L+ +G    A   ++  + G   D   YQ+L ++ ++      +       R LD    
Sbjct: 93  NLLVNGNITAAGELALSDMTGSSVDQREYQVLSNLYVDLGQ---RGDATNLVRYLDTLEG 149

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
               +Y    V +TRE  +S P  V+  +I  + S +++ N          +  NG   I
Sbjct: 150 YTACEYGFDGVSYTRELIASAPSGVLGFRIQANTSRAINLN----------AVANGIASI 199

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
           +M+ R              +     F+A + + +  D G ++A  DK L V G+   V  
Sbjct: 200 VMKAR------------TGEADYSTFTAGVRVVV--DGGNVTANGDK-LYVTGATTVVFF 244

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
           L A SS+         +  D  +E    L +   L Y  L    + D++ L  RV++ L 
Sbjct: 245 LDAESSYR------YATDSDQETELNRKLDAATELGYEALRKEAITDHKDLAGRVTLDLG 298

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSSRPGTQV 371
            S  D  +          +P  ER+ ++++  D D     L+F +GR+LLI+SSR   + 
Sbjct: 299 SSTDDAAS----------LPPNERMTNYRSSPDHDVQFATLVFNYGRHLLIASSRRTRER 348

Query: 372 A---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           +    LQGIWN+D SP+W +   VNINLEMNYW +   NL+E   PL+D L  +   G  
Sbjct: 349 SLSPGLQGIWNQDYSPSWGAKYTVNINLEMNYWPAETTNLNELTSPLWDLLALIQERGGD 408

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
            A+  +   G+V+HH TD+W  S        +++WPMGGAWL  H+ EHY +T D+ FL+
Sbjct: 409 VAEKMHGCPGFVLHHNTDLWGDSVPVHNGTKYSIWPMGGAWLALHMMEHYRFTGDKTFLK 468

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMD 543
           ++A P+ +    F   +L +  DGYL T PS SPE+ F  P      GK   ++ S T+D
Sbjct: 469 EQACPIFKSAFEFFECYLFD-VDGYLTTGPSCSPENAFQIPSDMTVAGKEEALTMSPTLD 527

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +++ E+ +A+    ++LE + D L   V   L ++RP +I  DG I+EW++
Sbjct: 528 NSMLFELLTALNETHQILEIDND-LSGSVQTYLGKIRPPRIGSDGQILEWIE 578


>gi|169834518|ref|YP_001695515.1| large secreted protein [Streptococcus pneumoniae Hungary19A-6]
 gi|168997020|gb|ACA37632.1| large secreted protein [Streptococcus pneumoniae Hungary19A-6]
          Length = 764

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 187/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S   +        +I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWIVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547


>gi|223932290|ref|ZP_03624293.1| conserved hypothetical protein [Streptococcus suis 89/1591]
 gi|386584235|ref|YP_006080638.1| hypothetical protein SSUD9_1198 [Streptococcus suis D9]
 gi|223898971|gb|EEF65329.1| conserved hypothetical protein [Streptococcus suis 89/1591]
 gi|353736381|gb|AER17390.1| conserved hypothetical protein [Streptococcus suis D9]
          Length = 763

 Score =  272 bits (696), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 188/593 (31%), Positives = 300/593 (50%), Gaps = 56/593 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAASNWNEALPIGNGHLGGMIYGSAVKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             VR  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKVREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-PSALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVIFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG++F  +   K++D  G ++ L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVRFKVVCHSKVTD--GEVNVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNL-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLEDTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RIL-REHFEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            I+R    + I  A+ L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLVDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547


>gi|374992668|ref|YP_004968163.1| alpha-L-fucosidase [Streptomyces bingchenggensis BCW-1]
 gi|297163320|gb|ADI13032.1| Alpha-L-fucosidase [Streptomyces bingchenggensis BCW-1]
          Length = 789

 Score =  272 bits (695), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 193/593 (32%), Positives = 278/593 (46%), Gaps = 46/593 (7%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL-S 73
           I    PA  F D+  IGNG LG  + G V +E + LN D+LW+G P    +  +P  L  
Sbjct: 6   IQLTEPATAFHDSFLIGNGSLGGTLRGAVGTERIDLNLDSLWSGGPVTAEDTGSPAGLLP 65

Query: 74  DVRSLVDSGQYAEATAASVKLFGHP-ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
            +R+ + +         +  + G    + YQ LG +E  + D+        Y+R L+L  
Sbjct: 66  QLRAAIRAEDNVRVEKLAQAMMGPGWTESYQPLGWLEWHYADTSDATG---YQRRLNLAD 122

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           A A   Y     E     F S PD V+V  ++G   G+ S  V L + +  H       +
Sbjct: 123 AVATTGYGPAGAEVEMSSFVSAPDNVLVVTVTGP--GAASHPV-LPTFVSPHPVTTAAPR 179

Query: 193 ---IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
              ++  GR P + +P   N  D+   + +                      ++  G + 
Sbjct: 180 PGLLVATGRVPARVLP---NYVDEEPAVVYGEDEPDGAGTVAAGAGFAVAVAVERTGPEA 236

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSI-RNLSYS--DLYTRHLDDYQKLFH 306
             L+  A+S F G    PS    D  + + SA +++ R L+ +   L  RH+ DY+  F 
Sbjct: 237 LRLIAAAASGFRGYDRRPS---ADLAALARSAEETVTRALTRTAEQLVQRHVQDYRSYFD 293

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV + LS SP                             DP+  ELLF FGRYLLISSSR
Sbjct: 294 RVDLDLSASPA------------------------ADHGDPARAELLFHFGRYLLISSSR 329

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PGT+ ANLQGIWN D+ P W +    NIN+EMNYW +    L +   P+      L+ +G
Sbjct: 330 PGTEAANLQGIWNIDVRPGWSANYTTNINVEMNYWAAESTALEDVHGPMLTLADDLAESG 389

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
           + TA   Y A+G V+HH TDIW  S+  +G   WA WP G  WL  H+W+HY Y  + DF
Sbjct: 390 TATAARYYGAAGAVVHHNTDIWRFSTPVKGDTQWATWPTGLYWLAAHVWDHYEYGGNDDF 449

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG-KLACVSYSSTMDMA 545
               A  +    A F LD L+   DG L T+PSTSPEH F+ P   + A VS  +TMD  
Sbjct: 450 GAGPALRVHRSAALFALDMLVPDDDGLLVTSPSTSPEHRFVLPPAPRGAAVSEGTTMDQE 509

Query: 546 IIREVFSAIISAAEVLEK-NEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRR 597
           ++ EV S  ++ AE   + ++D L+ +   +L  LR   I   G ++EW   R
Sbjct: 510 LVHEVLSRYVTLAERFGRGDDDVLLARARHALGALRLPGIGASGELLEWKDER 562


>gi|393784536|ref|ZP_10372699.1| hypothetical protein HMPREF1071_03567 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665517|gb|EIY59041.1| hypothetical protein HMPREF1071_03567 [Bacteroides salyersiae
           CL02T12C01]
          Length = 818

 Score =  272 bits (695), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 190/604 (31%), Positives = 286/604 (47%), Gaps = 84/604 (13%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
           ++PIGNG LGA + G V +E + LNE TLW G P      DY    N  +   + ++R  
Sbjct: 63  SLPIGNGSLGANILGSVAAERITLNEKTLWKGGPNTAGGADYYWKVNKQSASVMEEIRQA 122

Query: 79  VDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETYRR 126
              G Y +A   + K F   A              +  +G+I +E   S +  ++  Y R
Sbjct: 123 FTDGDYEKAELLTRKNFNGLAHYEEGDETPFRFGSFTTMGEIYVETGLSEIGMSD--YYR 180

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
            L L++A A V +   N  + R++F S PD V+  K + +++G                 
Sbjct: 181 ALSLDSAMAVVSFKKDNTRYMRKYFISYPDSVMAMKFTANKTGK---------------- 224

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE-------IKISD-DRGTISALE 238
                Q ++   CP         A DD  G+ ++ +LE       I+I    +G  + +E
Sbjct: 225 -----QNLVLRYCPNSEAKSSLCA-DDTDGLLYTGVLENNGMKFAIRIKAITKGGTTTVE 278

Query: 239 DKKLKVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDL 293
             +L V+ +D  V LL A +    +F   F +P      DP   +   ++      Y +L
Sbjct: 279 QDRLIVKDADEVVFLLTADTDYKMNFQPDFKDPKTYVGSDPEQTTRKTMEGAIRKGYDEL 338

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVEL 352
           Y  H  DY  LF+RV +QL+            E     +P+  R+ +++  + D  L EL
Sbjct: 339 YRAHEADYTSLFNRVKLQLN-----------PEVTARNLPTNLRLANYRKGQADYRLEEL 387

Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
            +Q+GRYLLI+ SR G   ANLQG+W+ +L+  W    H NIN++MNYW +   NL EC 
Sbjct: 388 YYQYGRYLLIACSRSGNMPANLQGMWHNNLNGPWRVDYHNNINIQMNYWPACSTNLGECT 447

Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLC 471
            PL DF+  L   G++TA+  + A GW      +I+  +S    + + W   PM G WL 
Sbjct: 448 RPLVDFIRSLVKPGAETAKAYFNARGWTASISANIFGFTSPLSSEDMSWNFNPMAGPWLA 507

Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
           TH+WE+Y+YT D++FL+   Y LL+  A F +D+L    DG     PSTSPEH       
Sbjct: 508 THIWEYYDYTRDKEFLKSTGYDLLKSSAQFTVDYLWHKPDGTYTAAPSTSPEH------- 560

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGS 589
               V   +T   A++RE+    I A++VL  +K E    E VL  L    P KI   G 
Sbjct: 561 --GPVDEGTTFVHAVVREILLNAIEASKVLGVDKKERKEWEYVLAHLA---PYKIGRYGQ 615

Query: 590 IMEW 593
           +MEW
Sbjct: 616 LMEW 619


>gi|121719440|ref|XP_001276419.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
 gi|119404617|gb|EAW14993.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
          Length = 781

 Score =  271 bits (694), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 197/600 (32%), Positives = 299/600 (49%), Gaps = 64/600 (10%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PAK F+  +PIGN RL A +WG + ++ + LNE+++W+G   D  NP + +  + VR
Sbjct: 29  YTSPAKDFSSTLPIGNSRLAAAIWGSL-TDNITLNENSIWSGPFQDRVNPRSYEGFTQVR 87

Query: 77  SLVDSGQYAEATAAS-VKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           S++  G+ + A   + V + G P     Y  LG ++L+F    +      Y R LDL   
Sbjct: 88  SMLQDGKISAANQLTLVDMAGIPTSPRAYNPLGALKLDFGHDTVN----NYTRFLDLGMG 143

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A V+Y   NV ++RE+ +S+PD ++  ++  S  GSL+   SL+       YV  N   
Sbjct: 144 VAGVEYEYDNVTYSREYVASHPDGILAVRLRASTPGSLNVACSLE----RSRYVKSNTAN 199

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
           +   R     +  KAN       I F A  E +I    G +S+ +   + + G+    + 
Sbjct: 200 V---RKSWGTLTLKANTGQANDPISFVA--EAQIVSVGGHMSS-DGSSVVINGASTIDIF 253

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL- 312
             A +S+   F    DS+    S+ + A       +     TR   DY  L  RV + L 
Sbjct: 254 FDAQTSYR--FFE-EDSRAAQLSKQLDAAVKQGYPAVKKAATR---DYASLTSRVRLNLG 307

Query: 313 -SRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSRPGT 369
            S +     TD              R+ +++ D   DP L  L+F FGR+LLI+SSR G 
Sbjct: 308 SSGAAGGFSTDV-------------RLFNYKKDANSDPELATLMFNFGRHLLIASSRGGD 354

Query: 370 QV---ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
                ANLQGIWNED  P W     V++NLEMNYW +   NL+E   P+ D +  +  +G
Sbjct: 355 TPGLPANLQGIWNEDYEPAWGGKYTVDVNLEMNYWPAQVTNLAETFGPVVDLMDTVVPHG 414

Query: 427 SKTAQVNYLA-SGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
              AQ  Y   +G+V+HH TD+W  ++  D G           AW+  +L E Y +T D+
Sbjct: 415 KDVAQRMYHCDAGYVLHHNTDLWGDAAPVDNGT----------AWMSMNLIEQYRFTQDK 464

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYS 539
             L++R +PLL+  A+F   +L E H+G+  + PS SPEH FI PD     GK A +  S
Sbjct: 465 SLLKERIWPLLKEAANFYYCYLFE-HEGHYISGPSISPEHAFIVPDEMSVPGKEAGIDLS 523

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRLN 599
            TMD ++++E+F+A+I A   L    D  ++K  K L +L P  I   G I+EW +R  N
Sbjct: 524 PTMDNSLLQELFAAVIEACTTLGITGDD-IDKAQKYLSKLPPPPIGSYGQILEW-RREYN 581


>gi|168071227|ref|XP_001787102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162659703|gb|EDQ48084.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 319

 Score =  271 bits (693), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 130/322 (40%), Positives = 191/322 (59%), Gaps = 9/322 (2%)

Query: 215 KGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDP 274
           +G+  S  +++    + GT    E  +L V G+    LL+ A++ F G    P     +P
Sbjct: 6   EGLGLSFEVQLLALTEGGTAKVDESGRLIVRGAQSVTLLVAAATDFAGYEKAPGSGGVNP 65

Query: 275 TSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPS 334
               ++AL       Y  L  RH++D+++LF RV ++L        + T + E   + P+
Sbjct: 66  AERCLAALTKAAEFGYERLRERHVEDHRRLFERVELRLG-------SATAAAERA-SRPT 117

Query: 335 AERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
            ER+++++   ED +L  L F +GRYLL++SSRPGT+ A+LQGIWN  + P W+     N
Sbjct: 118 DERLEAYRNGAEDLALEALYFHYGRYLLMASSRPGTEAAHLQGIWNPHVQPPWNCGYTTN 177

Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
           IN +MNYW +    L EC EPLF+ +  LS+ GS+TA+++Y A GWV HH  D+W +S+ 
Sbjct: 178 INTQMNYWHAEVAGLPECHEPLFELIRDLSVTGSRTARIHYGARGWVAHHNVDLWRQSTP 237

Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
             G+  WA WP+GG WLC HLWEHY +  +  FL + AYPL++G A F  DWL+ G DG 
Sbjct: 238 SDGESSWAFWPLGGVWLCRHLWEHYQFAPNESFLLETAYPLMKGAAEFSQDWLVAGPDGR 297

Query: 514 LETNPSTSPEHEFIAPDGKLAC 535
           L T PSTSPE++F+ PD    C
Sbjct: 298 LVTAPSTSPENKFLTPDRGEPC 319


>gi|326799708|ref|YP_004317527.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
 gi|326550472|gb|ADZ78857.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
          Length = 943

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 163/501 (32%), Positives = 260/501 (51%), Gaps = 45/501 (8%)

Query: 96  GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNP 155
           G   + YQ  GD+ L+F     +     Y+R LD+  A  +  Y    V F R +FSS P
Sbjct: 287 GKYQESYQPFGDLLLDF---RAQAPFSNYKRTLDVEQAICKTSYVQNGVSFERTYFSSAP 343

Query: 156 DQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK 215
           D  +   ++      +SF+ SL S    ++    ++  I        RI  +       +
Sbjct: 344 DACLAIHLTADRPRQISFDASLASPHKTYNVEKVDDSTI--------RISVQVKQGV-LR 394

Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
           G+ F     + +  + G +  + D K+K+ G++ A L L A++++     + +D   D  
Sbjct: 395 GVGF-----LHVRHEGGELH-VGDGKIKILGANQATLFLTAATNYK----SYNDVSGDAE 444

Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
             + S L  ++N  Y  +   H+ DYQ+ F + S++             ++E  +++P+ 
Sbjct: 445 EIAKSQLNKVKNKPYDVIRLAHIQDYQQYFTKFSLKFE-----------ADEASNSLPTD 493

Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNIN 395
           +R+  F    DP+L+ L  Q+GRYLLISSSR G    NLQGIWN+ L+P W S    NIN
Sbjct: 494 QRIAQFVKSRDPNLLALFVQYGRYLLISSSRSGGLAPNLQGIWNDLLTPPWGSKYTTNIN 553

Query: 396 LEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADR 455
            EMNYW +   NLSE QEPLF  +  LS+ G +TA+  Y A GWV+HH TD+W + +A  
Sbjct: 554 AEMNYWLAENTNLSELQEPLFQMIKELSVVGQETAKTYYDAPGWVLHHNTDLW-RGTAPI 612

Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYL 514
                 +W  GGAWLC HLWEH+ YT D  FL ++AYP+++  A F   +L+ +   G+L
Sbjct: 613 NNPNHGIWVTGGAWLCQHLWEHFLYTQDESFLREQAYPIMKASALFFDHFLVSDPKTGWL 672

Query: 515 ETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK 574
            + PS SPE       G L       TMD  +IR++F  + +AA +L+ +++   + +L 
Sbjct: 673 ISTPSNSPEQ------GGLVA---GPTMDHQLIRQLFRNVAAAATILKLDKE-FAQHILD 722

Query: 575 SLPRLRPTKIAEDGSIMEWVQ 595
              ++ P +I + G + EW++
Sbjct: 723 KGAKIAPNQIGKYGQLQEWLE 743



 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 34/83 (40%), Positives = 53/83 (63%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           L + +  PA  +T+A+PIGNG+LGAMV+GGV ++ ++ NE +LWTG P +Y  P A   L
Sbjct: 28  LTLWYQHPANTWTEALPIGNGKLGAMVFGGVQADRIQFNESSLWTGGPRNYNQPGAKNYL 87

Query: 73  SDVRSLVDSGQYAEATAASVKLF 95
            ++R L+  G+   A   + + F
Sbjct: 88  GEIRKLLSEGKQQAAEELAGRHF 110


>gi|319902716|ref|YP_004162444.1| alpha-L-fucosidase [Bacteroides helcogenes P 36-108]
 gi|319417747|gb|ADV44858.1| Alpha-L-fucosidase [Bacteroides helcogenes P 36-108]
          Length = 832

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 193/600 (32%), Positives = 300/600 (50%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG +GA + G V +E +  NE TLW G P      DY    N  +   L  +R
Sbjct: 75  SQSLPIGNGSIGASIMGSVEAERITFNEKTLWRGGPNTSKGADYYWNVNKQSAHVLEQIR 134

Query: 77  SLVDSGQYAEATAASVKLFGHPADV-YQLLGDIELEFDD--SHLKYAEET---------Y 124
                G  A+A   + + F   +DV Y+   +    F +  +  ++  ET         Y
Sbjct: 135 KAFVEGDQAKAEKLTRENFN--SDVPYEAARENPFRFGNFTTMGEFYVETGLNIIGMSGY 192

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V+++   V++ R +F S P  V+V + + S +G  +L F+ + + +  
Sbjct: 193 KRALSLDSAMAVVQFTKDKVDYQRTYFISYPANVMVMRYTASRAGMQNLVFSYAPNPVST 252

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G + ++              +A  D  G+++  ++ I    + G +S   D KL
Sbjct: 253 GSISADGMDGLVY-------------SAVLDNNGMKY--VVRIHAVVNGGKLSN-ADGKL 296

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRH 297
            V+G+D  V  + A +    +FD  F NP+     +P   +   + S     Y  L   H
Sbjct: 297 TVKGADEVVFYVTADTDYQINFDPDFANPATYVGVNPAETTRKWMDSAVAKGYDLLRKEH 356

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            +DY  LF+RV + L+  P    TD         +P+++R+K++++ + D  L EL +QF
Sbjct: 357 YEDYATLFNRVKLVLN--PDAKATD---------LPTSQRLKNYRSGKPDYYLEELYYQF 405

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL EC EPL 
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINVQMNYWPACSTNLDECMEPLI 465

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G +TAQ  + A GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 466 DFIRTLVKPGKRTAQAYFGARGWTASISGNIFGFTAPLESQDMSWNFNPMAGPWLATHIW 525

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D+ FL++  Y L++  A F +D+L    DG     PSTSPEH           
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSADFAVDYLWHKPDGTFTAAPSTSPEH---------GP 576

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           V   +T   A+IRE+    I A+ VL  +K E    E+VL    RL P +I   G +MEW
Sbjct: 577 VDQGTTFVHAVIREILLDAIEASRVLGVDKAERRQWEQVLA---RLLPYRIGRYGQLMEW 633


>gi|383115161|ref|ZP_09935919.1| hypothetical protein BSGG_2959 [Bacteroides sp. D2]
 gi|313695424|gb|EFS32259.1| hypothetical protein BSGG_2959 [Bacteroides sp. D2]
          Length = 829

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 188/600 (31%), Positives = 297/600 (49%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 74  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133

Query: 77  SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + + F     + AD         +  +G+  +E   + +  ++  Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++    V + R  F S P  V+V + S  +SG  +L F+ + + L  
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +GN  ++               A+ D  G+++  ++ I+     GT+S   D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D  V  + A +    +FD  F +P      +P   +   + +     Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            +DY  LF+RV + L+ + K +            +P+++R+KS++  + D  L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKSYRKGQPDYYLEELYYQF 404

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+EC  PL 
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  + A GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D  FL++  Y L++  A F +D+L    DG     PSTSPEH           
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A++RE+    I A++VL  +K E    E VL +   L P +I   G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632


>gi|288801450|ref|ZP_06406903.1| fibronectin type III domain protein [Prevotella sp. oral taxon 299
           str. F0039]
 gi|288331661|gb|EFC70146.1| fibronectin type III domain protein [Prevotella sp. oral taxon 299
           str. F0039]
          Length = 827

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 186/600 (31%), Positives = 295/600 (49%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
           + ++P+GNG LGA V G + +E +  NE TLW G P      DA           L ++R
Sbjct: 72  SQSLPLGNGSLGANVMGSIEAERITFNEKTLWRGGPNTSAGADAYWNVNKQSAHYLKEIR 131

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + K F                  +  +G+  +E   S +  +E  Y
Sbjct: 132 QAFIEGNEKKAALLTRKNFNSTVPYESWKDKPFRFGNFTTMGEFYIETGLSSIGMSE--Y 189

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++    V + R +F S P+ ++V +    + G  +L F+   + +  
Sbjct: 190 KRALSLDSALAVVQFKKDGVRYERNYFISYPNNIMVVRFKADQPGKQNLVFSYETNPVST 249

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G+N ++            KA+ +++    Q   ++ IK  +  GTI+  +  KL
Sbjct: 250 GKMEADGSNGLVF-----------KAHLDNN----QMEYVVRIKALNQGGTINN-DKGKL 293

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKKDPTSESMSA-LQSIRNLSYSDLYTRH 297
            + G++  V L+ A +    +F+  + NP        SE+ +A ++      Y+ L   H
Sbjct: 294 TINGANEVVFLITADTEYKVNFNPDYKNPRTYVGVNPSETTAAWMKKAVAQGYNALLEAH 353

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQF 356
             DY  LF+RVS+ L+           SE+    +P+ +R+ +++   ED  L EL +QF
Sbjct: 354 YKDYSSLFNRVSLTLN-----------SEQRTSDIPTPQRLINYRKGKEDFYLEELYYQF 402

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NLSEC  PL 
Sbjct: 403 GRYLLIASSRPGNLPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPAGSTNLSECTLPLI 462

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  + A GW      +I+  ++      + W   PM G WL TH+W
Sbjct: 463 DFIRTLVKPGEKTAQAYFDARGWTASISGNIFGFTAPLGSEDMSWNFNPMAGPWLATHVW 522

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           ++Y+YT D+ FL++  Y L++  A F +D+L +  DG     PSTSPEH           
Sbjct: 523 DYYDYTRDKKFLKEVGYDLIKSSAIFAVDFLWKKPDGTYTAAPSTSPEH---------GP 573

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A+IRE+    I A++VL  +K E    E+VLK   R+ P K+   G ++EW
Sbjct: 574 IDEGTTFVHAVIREILMNAIDASKVLDVDKKERKQWEEVLK---RIAPYKVGRYGQLLEW 630


>gi|419535657|ref|ZP_14075151.1| hypothetical protein SPAR46_2252 [Streptococcus pneumoniae GA17457]
 gi|379561797|gb|EHZ26812.1| hypothetical protein SPAR46_2252 [Streptococcus pneumoniae GA17457]
          Length = 746

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 186/578 (32%), Positives = 291/578 (50%), Gaps = 56/578 (9%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEA 87
           +PIGNG LG M++G    E ++LN++T+W     D  NPD+   L  +R  +  G+  +A
Sbjct: 1   MPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKA 60

Query: 88  TA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVG 142
                + +F  P D   Y+LLG++ +E  D     A   Y RELDL+TA + V +  +  
Sbjct: 61  EELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSC 119

Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCP 200
           N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++      ++ I+M     
Sbjct: 120 NLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAG 179

Query: 201 GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
           G+            KG+QF  +   K++D  G +S L  + + +  +    L L + + +
Sbjct: 180 GR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDY 224

Query: 261 DGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
            G                +S+LQ    ++ Y      H+  YQ+ F+RV  +L  S KD 
Sbjct: 225 WGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDC 270

Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
           ++       I T    E  K +       L  LLF +GRYLLISSS+P    ANLQGIW 
Sbjct: 271 LS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWC 319

Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
           ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+
Sbjct: 320 DELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGF 379

Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
             HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D   L +  + +++   
Sbjct: 380 TAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAF 438

Query: 500 SFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
            F  D+L E  DGYL T PS SPE+++   +G       SST+D  I+R    + I  A+
Sbjct: 439 LFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAK 497

Query: 560 VLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 498 QLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 532


>gi|90022148|ref|YP_527975.1| hypothetical protein Sde_2503 [Saccharophagus degradans 2-40]
 gi|89951748|gb|ABD81763.1| a-L-fucosidase-like protein [Saccharophagus degradans 2-40]
          Length = 803

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 194/614 (31%), Positives = 310/614 (50%), Gaps = 72/614 (11%)

Query: 6   STSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG--- 61
           ST     L I F  PA  + ++ +P+GNG +G +V G V  ETL+LNE TLWTG PG   
Sbjct: 26  STVAAKSLPIWFGAPALDWESEGLPMGNGAMGIVVTGEVARETLQLNEKTLWTGGPGAKG 85

Query: 62  -------DYTNPDAPKALSDV--RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEF 112
                  D    D       +   + +D    A+    ++  +GH    YQ  G++++++
Sbjct: 86  YNFGLPTDSIKQDVAHVRQQITLHNGIDPQTAADKLGQNMHGYGH----YQSFGELDIQY 141

Query: 113 DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
           +D     A   Y R LDL    A V Y+  N  + RE+F S P Q  + K+S S   S+S
Sbjct: 142 NDQ--TGAVSNYVRSLDLTQGVATVAYTRNNTHYKREYFVSYPQQAAIVKLSASNKQSIS 199

Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
           F++ +         V+ N  I  + +        K   N+    +Q+  I +++I  D G
Sbjct: 200 FDLGVR--------VHPNRTIETQVKRGVLTFSGKLFDNN----LQY--IGKVQIVVDGG 245

Query: 233 TISALEDK-KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
            ++  E   +++V  ++ AV+ +VA +++   +  P    + P       L+ I+   YS
Sbjct: 246 ELTENEKTGRIQVSRANSAVISIVAGTNYAQAY--PHYRGRLPVKTLDKNLEKIKASEYS 303

Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD---EDPS 348
            L   HL DY  LF RV + L  +         +E  +   P+ E +K ++ +    + +
Sbjct: 304 ALLAEHLTDYTALFGRVELSLIEN---------AESYLLAKPTPELLKQYKGEGSAPERA 354

Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
           L +L FQFGRYLLI+SSR G+  ANLQG+WN   +P W++  HVNINL+MNYW +   NL
Sbjct: 355 LEQLYFQFGRYLLIASSRNGSLPANLQGVWNNSATPPWNADYHVNINLQMNYWPAQVTNL 414

Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW--ALW-PM 465
            E   P FDF+  L   G ++AQ  + A GW +   T+I+  +    G + W  A W P 
Sbjct: 415 GETALPFFDFIDSLVEPGKQSAQKVFGARGWTLFLNTNIFGYT----GLIEWPTAFWQPE 470

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH 524
             AWL  H +EHY +  D  FL++RAYP+++  A F +D L+ + + G L  +PS SPE 
Sbjct: 471 AAAWLAQHYFEHYQFYQDNTFLKERAYPVMKEAALFWVDVLVADPNTGLLVVSPSFSPEQ 530

Query: 525 -EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRP- 581
             F++           + M   I+ ++F+ ++ AA ++    DA  +K++++ L +L P 
Sbjct: 531 GPFVS----------GAAMSQQIVFDLFTNVVEAANLV---GDAEFKKLIQAKLAKLDPG 577

Query: 582 TKIAEDGSIMEWVQ 595
           T+I   G + EW Q
Sbjct: 578 TRIGSWGQLQEWQQ 591


>gi|423280895|ref|ZP_17259807.1| hypothetical protein HMPREF1203_04024 [Bacteroides fragilis HMW
           610]
 gi|404583536|gb|EKA88214.1| hypothetical protein HMPREF1203_04024 [Bacteroides fragilis HMW
           610]
          Length = 829

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 182/600 (30%), Positives = 292/600 (48%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVR 76
           + ++PIGNG +GA + G + +E +  NE TLW G P            N  +   L ++R
Sbjct: 75  SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSKGAEAYWNVNKQSAHVLDEIR 134

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + K F                  +  +G+  +E   S +  +   Y
Sbjct: 135 QAFSEGDQKKAAMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYVETGLSAVNMS--GY 192

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++   +V + R +F S P  V+  +      G  +L+F+ + + +  
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERSYFISYPANVMAIRFKADRPGKQNLTFSYAPNPVST 252

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G+N +                A+ D  G+Q+  ++ I  +   GT+S   D K+
Sbjct: 253 GSMTTDGSNGLTY-------------TAHLDNNGMQY--VVRIHATTKGGTLSN-ADGKI 296

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D AV L+ A +    +FD  F +P      +P   +   + +  ++ Y  L+ +H
Sbjct: 297 TVKDADEAVFLITADTDYKINFDPDFKDPKTYVGVNPAETTRQWMDNAVSMGYDVLFKQH 356

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            DDY  LF+RV +QL+            ++    +P+A+R+++++  + D  L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLN-----------PDQQSANLPTAKRLQNYRKGQPDFYLEELYYQF 405

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW + P NL+EC  PL 
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACPTNLNECTLPLV 465

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  +   GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTAPLESEDMSWNFNPMAGPWLATHIW 525

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D+ FL++  Y L++  A F  D+L    DG     PSTSPEH           
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A+IRE+    I A++VL  +  E    ++VL     L P KI   G +MEW
Sbjct: 577 IDEGTTFVHAVIREILLDAIEASKVLGVDSKERKQWQEVLA---HLAPYKIGRYGQLMEW 633


>gi|419441357|ref|ZP_13981397.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA40410]
 gi|421282151|ref|ZP_15732944.1| hypothetical protein SPAR155_2159 [Streptococcus pneumoniae
           GA04672]
 gi|421308367|ref|ZP_15759005.1| hypothetical protein SPAR166_2207 [Streptococcus pneumoniae
           GA60132]
 gi|421310565|ref|ZP_15761187.1| hypothetical protein SPAR168_2146 [Streptococcus pneumoniae
           GA62681]
 gi|421312927|ref|ZP_15763524.1| hypothetical protein SPAR167_2288 [Streptococcus pneumoniae
           GA58981]
 gi|379576014|gb|EHZ40943.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA40410]
 gi|395878598|gb|EJG89661.1| hypothetical protein SPAR155_2159 [Streptococcus pneumoniae
           GA04672]
 gi|395905170|gb|EJH16076.1| hypothetical protein SPAR166_2207 [Streptococcus pneumoniae
           GA60132]
 gi|395907679|gb|EJH18569.1| hypothetical protein SPAR167_2288 [Streptococcus pneumoniae
           GA58981]
 gi|395908180|gb|EJH19063.1| hypothetical protein SPAR168_2146 [Streptococcus pneumoniae
           GA62681]
          Length = 749

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 186/578 (32%), Positives = 291/578 (50%), Gaps = 56/578 (9%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEA 87
           +PIGNG LG M++G    E ++LN++T+W     D  NPD+   L  +R  +  G+  +A
Sbjct: 1   MPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKA 60

Query: 88  TA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVG 142
                + +F  P D   Y+LLG++ +E  D     A   Y RELDL+TA + V +  +  
Sbjct: 61  EELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSC 119

Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCP 200
           N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++      ++ I+M     
Sbjct: 120 NLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAG 179

Query: 201 GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
           G+            KG+QF  +   K++D  G +S L  + + +  +    L L + + +
Sbjct: 180 GR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDY 224

Query: 261 DGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
            G                +S+LQ    ++ Y      H+  YQ+ F+RV  +L  S KD 
Sbjct: 225 WGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDC 270

Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
           ++       I T    E  K +       L  LLF +GRYLLISSS+P    ANLQGIW 
Sbjct: 271 LS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWC 319

Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
           ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+
Sbjct: 320 DELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGF 379

Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
             HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D   L +  + +++   
Sbjct: 380 TAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAF 438

Query: 500 SFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
            F  D+L E  DGYL T PS SPE+++   +G       SST+D  I+R    + I  A+
Sbjct: 439 LFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAK 497

Query: 560 VLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 498 QLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 532


>gi|251795324|ref|YP_003010055.1| alpha-L-fucosidase [Paenibacillus sp. JDR-2]
 gi|247542950|gb|ACS99968.1| Alpha-L-fucosidase [Paenibacillus sp. JDR-2]
          Length = 775

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 184/599 (30%), Positives = 307/599 (51%), Gaps = 52/599 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKA 71
           +K+ +  PA  +   +P+GNG+LGA++ GG+ SET  + E T W+G P  +  +PDA + 
Sbjct: 4   MKMIYTQPAAGWKQGLPLGNGQLGAVLHGGINSETWNMTEITFWSGKPERFGGSPDAKEK 63

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR--RELD 129
           L  +R    +G Y        KL G   +  +      L   D  + Y +E  +  RELD
Sbjct: 64  LKTMREAFFNGNYVLGD----KLAGEQLEPVKGNFGTNLSLCDVLISYNDEGSQLVRELD 119

Query: 130 LNTATARVKYSVGN-VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH-SYV 187
           L  A A V Y  G+     RE F S+PD V+V++I G ++GS+S ++ ++       + +
Sbjct: 120 LEKAVAAVSYRSGSGAAMRRETFVSHPDGVLVSRIKGDQAGSVSLSLRIEGRTTTFDARL 179

Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           +G ++++        R     N + D   G+     L+  ++  R      E   + +E 
Sbjct: 180 DGPDKLVF-------RTQATENIHSDGTCGVWSEGALKAVVTGGR---VFGEAGTVIIEQ 229

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPT--SESMSALQSIRNLSYSDLYTRHLDDYQKL 304
           +D  VL L  ++ +          + D T   ES   L++     +  L   H+ DY+ L
Sbjct: 230 ADEVVLYLAVATDY---------GRMDDTWKVESTERLEAAEAKGFERLLRDHIADYRSL 280

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQFGRYLLI 362
           + RV + L  S           +  D +P+ ER++  +  E  D  L+ L +Q+GRYL I
Sbjct: 281 YGRVDLDLGGS-----------KAFDLLPTDERIRKLRAGEQTDNGLIALFYQYGRYLTI 329

Query: 363 SSSRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           + +R  +++  +LQG+WN  E  +  W    H+++N EMNY+ +   NL+EC  PL +++
Sbjct: 330 AGTRADSRLPLHLQGLWNDGEANAMAWSCDYHLDVNTEMNYYPTEISNLAECHIPLMNYI 389

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
             LS  G   A+  Y   GWV H  ++ W  +S   G+  W L   GG W+ THL EHY 
Sbjct: 390 EQLSFAGRTAAEDFYGCEGWVAHVFSNAWGFASPGWGRS-WGLNVTGGLWIATHLKEHYE 448

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFI-APDGK-LACV 536
           Y+ DR FL ++AYP+++  A F LD++ I    G+L T PSTSPE+ F   P+ +    +
Sbjct: 449 YSRDRGFLTRQAYPVMKEAALFFLDYMTIHPKYGWLVTGPSTSPENSFYPGPEEQGEQQL 508

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  STMD  ++R++F  ++ AAE+L  +E+ L  ++  ++  L P +I + G + EW++
Sbjct: 509 SMGSTMDQMLVRDLFGFVLEAAEMLAVDEE-LQHRLKDAMELLPPLQIGKRGQLQEWLE 566


>gi|299144684|ref|ZP_07037752.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
 gi|298515175|gb|EFI39056.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
          Length = 829

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 187/600 (31%), Positives = 298/600 (49%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 74  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133

Query: 77  SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + + F     + AD         +  +G+  +E   + +  ++  Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++    V + R  F S P  V+V + S  +SG  +L F+ + + L  
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +GN  ++               A+ D  G+++  ++ I+     GT+S   D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D  V  + A +    +FD  F +P      +P   +   + +     Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            +DY  LF+RV + L+ + K +            +P+++R+K+++  + D  L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+EC  PL 
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  + A GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D  FL++  Y L++  A F++D+L    DG     PSTSPEH           
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFVVDYLWHKPDGTYTAAPSTSPEH---------GP 575

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A++RE+    I A++VL  +K E    E VL +   L P +I   G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632


>gi|332671290|ref|YP_004454298.1| alpha-L-fucosidase [Cellulomonas fimi ATCC 484]
 gi|332340328|gb|AEE46911.1| Alpha-L-fucosidase [Cellulomonas fimi ATCC 484]
          Length = 820

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 190/610 (31%), Positives = 290/610 (47%), Gaps = 54/610 (8%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPG-------DYTNP 66
           ++F+GPA+ + +A P+GNGRLGAM+ GG     +++N+ T W+G V G            
Sbjct: 30  LSFDGPARRWVEAFPVGNGRLGAMLHGGTERALVQVNDATAWSGRVDGPARALAAVRAAG 89

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR- 125
             P  L+  R  + +G++ EA        G     +Q   D+ +    S  + A+  +R 
Sbjct: 90  AGPDRLARARDALAAGRHDEAADLLAVFQGPWTQAFQPFVDLHVTVA-SAPRPAQVRHRD 148

Query: 126 ---RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
              R LDL     R +   G VE   E F+S  D  +  + S +E   +   +S    + 
Sbjct: 149 DSPRTLDLRDGVVRERLPAG-VEV--EWFASAVDGALHGRWSAAEPFDVHVELSTPHHVR 205

Query: 183 NHSYVNGNNQIIME---GRCPGKRI-PPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
              +  G   +++E      PG     P     DD   +   A+L +   D  G +    
Sbjct: 206 TDHHAPGGRVLVLELPDDVAPGHEPDAPAVTRTDDGASLTGVAVL-LACGD--GEVGGTP 262

Query: 239 DKKLKVEGSDWAVLLLVASSSF----DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
              L+VE + W  ++L   ++     DGP  +  +   D  + +  AL   R    +   
Sbjct: 263 GGALRVERATWVEVVLATGTTSPWPQDGPLRDREEVVADVLACARRALPGDRGTGDA-TR 321

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
            RH+ D++++     + L   P D+  D    + I T P A            +L + +F
Sbjct: 322 ARHVADHRRIADATVLALV--PHDL--DLRLPDAIGTTPHA------------ALAQAVF 365

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
             GRYLLI+SSRPG+  ANLQG+WN D  P W S   +N+NLEM YW +    L EC EP
Sbjct: 366 DHGRYLLIASSRPGSPPANLQGVWNADPRPPWSSNYTLNVNLEMAYWGAEAVGLGECHEP 425

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS---SADRGKVVWALWPMGGAWLC 471
           L   +  L+ +G+  A+  Y   GWV HH +D+W  +    A  G   WA W MGG WLC
Sbjct: 426 LLAHVGLLARHGAHVARELYGCQGWVAHHNSDVWGWALPVGAGHGDPSWAQWWMGGVWLC 485

Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP-- 529
            HLW+H +   D  FL   A+PLL G A F LDWL+E  DG L T+PSTSPE++F  P  
Sbjct: 486 RHLWDHADVGGDDAFLRDEAWPLLRGAALFCLDWLVEAPDGSLTTSPSTSPENQFRLPSS 545

Query: 530 ----DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                G +  ++  STMD+A++R++    +   + L+  +D L  ++  +L RL    + 
Sbjct: 546 ADGTGGGVGALATGSTMDLALVRDLLERCLDTIDRLDL-DDPLEGRLRSALARLARPVVG 604

Query: 586 EDGSIMEWVQ 595
            DG + EW  
Sbjct: 605 PDGLLREWAH 614


>gi|313149260|ref|ZP_07811453.1| glycoside hydrolase family 95 [Bacteroides fragilis 3_1_12]
 gi|313138027|gb|EFR55387.1| glycoside hydrolase family 95 [Bacteroides fragilis 3_1_12]
          Length = 829

 Score =  268 bits (686), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 182/600 (30%), Positives = 292/600 (48%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVR 76
           + ++PIGNG +GA + G + +E +  NE TLW G P            N  +   L ++R
Sbjct: 75  SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSKGAEAYWNVNKQSAHVLDEIR 134

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + K F                  +  +G+  +E   S +  +   Y
Sbjct: 135 QAFSEGDQKKAAMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYVETGLSAVNMS--GY 192

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++   +V + R +F S P  V+  +      G  +L+F+ + + +  
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERSYFISYPANVMAIRFKADRPGKQNLTFSYAPNPVST 252

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G+N +                A+ D  G+Q+  ++ I  +   GT+S   D K+
Sbjct: 253 GSMTTDGSNGLTY-------------TAHLDNNGMQY--VVRIYATTKGGTLSN-ADGKI 296

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D AV L+ A +    +FD  F +P      +P   +   + +  ++ Y  L+ +H
Sbjct: 297 TVKDADEAVFLITADTDYKINFDPDFKDPKTYVGVNPAETTRQWMDNAVSMGYDVLFKQH 356

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            DDY  LF+RV +QL+            ++    +P+A+R+++++  + D  L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLN-----------PDQQSTNLPTAKRLQNYRKGQPDFYLEELYYQF 405

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW + P NL+EC  PL 
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACPTNLNECTLPLV 465

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  +   GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTAPLESEDMSWNFNPMAGPWLATHIW 525

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D+ FL++  Y L++  A F  D+L    DG     PSTSPEH           
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A+IRE+    I A++VL  +  E    ++VL     L P KI   G +MEW
Sbjct: 577 IDEGTTFVHAVIREILLDAIEASKVLGVDSKERKQWQEVLA---HLAPYKIGRYGQLMEW 633


>gi|421295152|ref|ZP_15745870.1| alpha-L-fucosidase [Streptococcus pneumoniae GA56113]
 gi|395891509|gb|EJH02504.1| alpha-L-fucosidase [Streptococcus pneumoniae GA56113]
          Length = 749

 Score =  268 bits (686), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 185/578 (32%), Positives = 289/578 (50%), Gaps = 56/578 (9%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEA 87
           +PIGNG LG M++G    E ++LN++T+W     D  NPD+   L  +R  +  G+  +A
Sbjct: 1   MPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKA 60

Query: 88  TA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVG 142
                + +F  P D   Y+LLG++ +E  D     A   Y RELDL+TA + V +  +  
Sbjct: 61  EELIKLTMFATPRDQSHYELLGELCIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSC 119

Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCP 200
           N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++      ++ I+M     
Sbjct: 120 NLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAG 179

Query: 201 GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
           G+            KG+QF  +   K++D  G +S L  + + +  +    L L + + +
Sbjct: 180 GR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDY 224

Query: 261 DGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
            G                +S+LQ    ++ Y      H+  YQ+ F+RV  +L  S   +
Sbjct: 225 WGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCL 271

Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
                   +I T    E  K +       L  LLF +GRYLLISSS+P    ANLQGIW 
Sbjct: 272 --------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWC 319

Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
           ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+
Sbjct: 320 DELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGF 379

Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
             HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D   L +  + +++   
Sbjct: 380 TAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAF 438

Query: 500 SFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
            F  D+L E  DGYL T PS SPE+++   +G       SST+D  I+R    + I  A+
Sbjct: 439 LFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAK 497

Query: 560 VLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            L  N D +  V+++ K LPR   TKI  +G I EW++
Sbjct: 498 QLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 532


>gi|336404392|ref|ZP_08585089.1| hypothetical protein HMPREF0127_02402 [Bacteroides sp. 1_1_30]
 gi|335943224|gb|EGN05065.1| hypothetical protein HMPREF0127_02402 [Bacteroides sp. 1_1_30]
          Length = 850

 Score =  268 bits (686), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 191/606 (31%), Positives = 299/606 (49%), Gaps = 84/606 (13%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 95  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 154

Query: 77  SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
                G   +A   + + F      Y   G+    F    +  ++  ET         Y+
Sbjct: 155 QAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 213

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
           R L L++A A V++   +V + R +F S P  V+V + S  + G  +L F+ + + +   
Sbjct: 214 RILSLDSAMAVVQFKKDHVAYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 273

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
           +   + N  ++              +A+ D  GI++  ++ I+     GT+S   D KL 
Sbjct: 274 NMASDSNKGLVY-------------SASLDNNGIKY--VVRIQAETKGGTLSN-ADGKLT 317

Query: 244 VEGSDWAVLLLVASS----SFDGPF--------INPSDSKKDPTSESMSALQSIRNLSYS 291
           V+G+D  V  + A +    +FD  F        +NP ++ K+  + ++S         Y+
Sbjct: 318 VKGADEVVFYITADTDYKPNFDPDFKEPKTYVGVNPEETTKEWMNNAVSQ-------GYT 370

Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLV 350
            L+++H +DY  LF+RV + L+ + K              +P+ +R+K+++  + D  L 
Sbjct: 371 ALFSQHYNDYAALFNRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYDLE 419

Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
           EL FQFGRYLLISSSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+E
Sbjct: 420 ELYFQFGRYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNE 479

Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAW 469
           C  PL DF+  L   G KTA+  + A GW      +I+  ++  +   + W   PM G W
Sbjct: 480 CMLPLVDFIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPW 539

Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
           L TH+WE+Y+YT D  FL++  Y L++  A F +D+L    DG     PSTSPEH     
Sbjct: 540 LATHIWEYYDYTRDLTFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH----- 594

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAED 587
                 +   +T   A++RE+    I A++VL  +K E    E VL +   L P KI   
Sbjct: 595 ----GPIDQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRY 647

Query: 588 GSIMEW 593
           G +MEW
Sbjct: 648 GQLMEW 653


>gi|293371889|ref|ZP_06618293.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
 gi|292633135|gb|EFF51712.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
          Length = 829

 Score =  268 bits (685), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 187/600 (31%), Positives = 297/600 (49%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 74  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133

Query: 77  SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + + F     + AD         +  +G+  +E   + +  ++  Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFSSFTTMGEFYVETGLNMIGMSD--Y 191

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++    V + R  F S P  V+V + S  +SG  +L F+ + + L  
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +GN  ++               A+ D  G+++  ++ I+     GT+S   D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D  V  + A +    +FD  F +P      +P   +   + +     Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            +DY  LF+RV + L+ + K +            +P+++R+K+++  + D  L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+EC  PL 
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  + A GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D  FL++  Y L++  A F +D+L    DG     PSTSPEH           
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A++RE+    I A++VL  +K E    E VL +   L P +I   G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632


>gi|237718842|ref|ZP_04549323.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
 gi|229451974|gb|EEO57765.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
          Length = 829

 Score =  268 bits (685), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 187/600 (31%), Positives = 297/600 (49%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 74  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133

Query: 77  SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + + F     + AD         +  +G+  +E   + +  ++  Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++    V + R  F S P  V+V + S  +SG  +L F+ + + L  
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +GN  ++               A+ D  G+++  ++ I+     GT+S   D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D  V  + A +    +FD  F +P      +P   +   + +     Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            +DY  LF+RV + L+ + K +            +P+++R+K+++  + D  L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+EC  PL 
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  + A GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D  FL++  Y L++  A F +D+L    DG     PSTSPEH           
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A++RE+    I A++VL  +K E    E VL +   L P +I   G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632


>gi|423293334|ref|ZP_17271461.1| hypothetical protein HMPREF1070_00126 [Bacteroides ovatus
           CL03T12C18]
 gi|392678277|gb|EIY71685.1| hypothetical protein HMPREF1070_00126 [Bacteroides ovatus
           CL03T12C18]
          Length = 829

 Score =  268 bits (685), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 187/600 (31%), Positives = 297/600 (49%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 74  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133

Query: 77  SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + + F     + AD         +  +G+  +E   + +  ++  Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++    V + R  F S P  V+V + S  +SG  +L F+ + + L  
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +GN  ++               A+ D  G+++  ++ I+     GT+S   D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D  V  + A +    +FD  F +P      +P   +   + +     Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            +DY  LF+RV + L+ + K +            +P+++R+K+++  + D  L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+EC  PL 
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  + A GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D  FL++  Y L++  A F +D+L    DG     PSTSPEH           
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A++RE+    I A++VL  +K E    E VL +   L P +I   G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632


>gi|150003836|ref|YP_001298580.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
 gi|149932260|gb|ABR38958.1| glycoside hydrolase family 95 [Bacteroides vulgatus ATCC 8482]
          Length = 818

 Score =  268 bits (685), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 198/639 (30%), Positives = 301/639 (47%), Gaps = 82/639 (12%)

Query: 2   MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
           + A+ST  T  L I F+ P      A  ++ A         +PIGNG LG  V G + +E
Sbjct: 17  LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76

Query: 47  TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
            + LNE TLW G P            N ++   LS++R     G   +A   + K F   
Sbjct: 77  RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLSEIRQAFTDGNQKKAEELTCKNFNGL 136

Query: 99  ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
           AD             +  LG+  +E   S +   +  Y+R L L++A A V +    V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGMTD--YKRILSLDSAMAIVSFRKDEVNY 194

Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
            R++F S PD V+V K +    G  +L F+   +         +G N+++  GR      
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKTDGPNRLLYTGRL----- 249

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
                     K  Q    L I+  +  G+++   D K  V  +D  + LL A +    +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298

Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
           +  F +P      DP   +++ L +    +Y++L  RH  DY +LF RV +QL+  +P  
Sbjct: 299 NPDFKDPKTYVGPDPDQTTLAMLDAAAAKNYNELCERHKTDYTQLFGRVKLQLNPHAPM- 357

Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
               T     +  +P+ +R+  ++  + D  L E+ +QFGRYLLI+SSRPG   ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTHQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413

Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
           W   +   W    H NIN++MNYW +   NL+EC  PL DF+  L   G KTAQ  + A 
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473

Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
           GW      +I+  +S      + W   PM G WL TH+WE+Y+YT D+ FL++  Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533

Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
             A+F +D+L    DG     PSTSPEH           V   +T   A++RE+    I 
Sbjct: 534 SSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584

Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A++ L  +  +    + VLK    L P +I   G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEW 620


>gi|336412577|ref|ZP_08592930.1| hypothetical protein HMPREF1017_00038 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942623|gb|EGN04465.1| hypothetical protein HMPREF1017_00038 [Bacteroides ovatus
           3_8_47FAA]
          Length = 799

 Score =  268 bits (684), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 187/600 (31%), Positives = 297/600 (49%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 74  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTEKGADYYWNVNKQSAHLLDEIR 133

Query: 77  SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + + F     + AD         +  +G+  +E   + +  ++  Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADRENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++    V + R  F S P  V+V + S  +SG  +L F+ + + L  
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +GN  ++               A+ D  G+++  ++ I+     GT+S   D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D  V  + A +    +FD  F +P      +P   +   + +     Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            +DY  LF+RV + L+ + K +            +P+++R+K+++  + D  L EL +QF
Sbjct: 356 YNDYAALFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+EC  PL 
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  + A GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D  FL++  Y L++  A F +D+L    DG     PSTSPEH           
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A++RE+    I A++VL  +K E    E VL +   L P +I   G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632


>gi|255693316|ref|ZP_05416991.1| fibronectin type III domain protein [Bacteroides finegoldii DSM
           17565]
 gi|260620891|gb|EEX43762.1| hypothetical protein BACFIN_08516 [Bacteroides finegoldii DSM
           17565]
          Length = 861

 Score =  268 bits (684), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 193/647 (29%), Positives = 316/647 (48%), Gaps = 78/647 (12%)

Query: 1   MMNAESTSTT--NPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
           ++NA++T      PL+ T++ PAK + ++A+PIGNG +GAM++GGV  + ++ NE TLW+
Sbjct: 21  VVNAKTTDRNFPPPLRATYDTPAKIWESEALPIGNGYMGAMIFGGVYVDVIQTNEHTLWS 80

Query: 58  GVP--------GDYTNPDAPK-ALSDVRSLV----------------------------- 79
           G P        G    P+  K  L   R+L+                             
Sbjct: 81  GGPSENPGYNGGHLRTPEINKDNLQKARNLLQQKMIDFMADKAAHFDANGKLITYDYEGD 140

Query: 80  ----DSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHL-KYAEETYRRELDLNTAT 134
               D  +Y +  A + + FG     YQ L +I +  +++     A   Y R LD++ + 
Sbjct: 141 GEETDLRRYIDNIAGTKEHFGS----YQTLSNIVITSNNTKCPDCAYSDYNRTLDIDNSI 196

Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQII 194
             V Y    + + RE+F S PD V+V +++      +S  ++L+SL    + ++  N I 
Sbjct: 197 HTVSYKESGITYKREYFMSYPDNVMVIRLTSDSKDGISRTIALESLHKTKNIISEGNTIT 256

Query: 195 MEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLL 254
           M G  P      K   +    G++++   ++ + +D G ISA+ D  +KV G+   V+L+
Sbjct: 257 MTGY-PTPVGGDKRVGDHWKNGLRYAQ--QVMVRNDGGKISAV-DGMIKVAGAKEIVILM 312

Query: 255 VASSSFDGPFINPSD--SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
            A++++     +  +  SK+DP  +  + L+     SY  L   H  DY+ L+ R+ I L
Sbjct: 313 SAATNYVQCMDDSYNFFSKEDPLDKVKAILKKASAKSYKKLLIAHQKDYRSLYDRMKINL 372

Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVA 372
               +  V  T      D +      ++    ++  L  L +QFGRYLLISSSR G+  A
Sbjct: 373 GNVKEAPVMTT------DKLLKGMDERTNLQADNLYLEMLYYQFGRYLLISSSREGSLPA 426

Query: 373 NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQV 432
           NLQG+W + L   W+S  H NIN++MNYW + P NLS C  P+ +++  L   G  TAQ 
Sbjct: 427 NLQGVWADRLQNAWNSDYHTNINVQMNYWPAQPTNLSPCHLPMVEYVKSLVPRGRYTAQH 486

Query: 433 NYL------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            Y         GWV HH+ +IW  ++  + K     +P G  W+C  +WE+Y +  DR F
Sbjct: 487 YYCRPDGKPVRGWVTHHENNIWGNTAPAK-KDTPHHFPAGAIWMCQDIWEYYQFNQDRKF 545

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           LE+    +L+    ++ +   +  DG L  NPS SPEH     +  L C     +   A+
Sbjct: 546 LEEYYDTMLQAALFWVDNLWTDKRDGMLVANPSHSPEH----GEYSLGC-----STSQAM 596

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           I E+F+ +I A++ L +  D  ++++  SL +L   KI   G  MEW
Sbjct: 597 IWEIFNIMIKASKELGRENDPEIKEISASLAKLSGPKIGLGGQFMEW 643


>gi|294775002|ref|ZP_06740531.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
 gi|294451046|gb|EFG19517.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
          Length = 818

 Score =  267 bits (682), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 198/639 (30%), Positives = 300/639 (46%), Gaps = 82/639 (12%)

Query: 2   MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
           + A+ST  T  L I F+ P      A  ++ A         +PIGNG LG  V G + +E
Sbjct: 17  LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76

Query: 47  TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
            + LNE TLW G P            N ++   L ++R     G   +A   + K F   
Sbjct: 77  RITLNEKTLWRGGPNTEKGAAYYWKVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136

Query: 99  ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
           AD             +  LG+  +E   S +   +  Y+R L L++A A V +    V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGMTD--YKRILSLDSAMAIVSFRKDEVNY 194

Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
            R++F S PD V+V K +    G  +L F+   +         +G N+++  GR      
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKTDGPNRLLYTGRL----- 249

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
                     K  Q    L I+  +  G+++   D K  V  +D  + LL A +    +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298

Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
           +  F +P      DP   +++ L +    +Y++L  RH  DY +LF RV +QL+  +P  
Sbjct: 299 NPDFKDPKTYVGPDPDQTTLAMLDAAAAKNYNELCERHKTDYTQLFGRVKLQLNPHAPM- 357

Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
               T     +  +P+ +R+  ++  + D  L E+ +QFGRYLLI+SSRPG   ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTHQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413

Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
           W   +   W    H NIN++MNYW +   NL+EC  PL DF+  L   G KTAQ  + A 
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473

Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
           GW      +I+  +S      + W   PM G WL TH+WE+Y+YT D+ FL++  Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533

Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
             A+F +D+L    DG     PSTSPEH           V   +T   A+IRE+    I 
Sbjct: 534 SSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVIREILLNAID 584

Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A++ L  +  +    + VLK    L P +I   G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEW 620


>gi|298480149|ref|ZP_06998348.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
 gi|298273958|gb|EFI15520.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
          Length = 837

 Score =  267 bits (682), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 188/599 (31%), Positives = 296/599 (49%), Gaps = 70/599 (11%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 82  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 141

Query: 77  SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
                G   +A   + + F      Y   G+    F    +  ++  ET         Y+
Sbjct: 142 KAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 200

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
           R L L++A A V++   +V + R +F S P  V+V + S  + G  +L F+ + + +   
Sbjct: 201 RILSLDSAMAVVQFKKDHVVYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 260

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
           +   +GN  ++              +A+ D  G+++  ++ I+     GT+    D KL 
Sbjct: 261 NMASDGNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLFN-ADGKLT 304

Query: 244 VEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRHL 298
           V+G+D  V  + A +    +FD  F +P      +P   +   + +  +  Y+ L+++H 
Sbjct: 305 VKGADEVVFYITADTDYKPNFDPDFKDPKTYVGVNPEETTKEWMNNAVSQGYTALFSQHY 364

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
           +DY  LF+RV + L+ + K              +P+ +R+K+++  + D  L EL FQFG
Sbjct: 365 NDYAALFNRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYDLEELYFQFG 413

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLISSSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+EC  PL D
Sbjct: 414 RYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECMLPLVD 473

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWE 476
           F+  L   G KTA+  + A GW      +I+  ++  +   + W   PM G WL TH+WE
Sbjct: 474 FIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIWE 533

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           +Y+YT D  FL++  Y L++  A F +D+L    DG     PSTSPEH           +
Sbjct: 534 YYDYTRDLTFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GPI 584

Query: 537 SYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
              +T   A++RE+    I A++VL  +K E    E VL +   L P KI   G +MEW
Sbjct: 585 DQGATFVHAVVREILLDAIEASKVLGIDKKERKQWEHVLAN---LVPYKIGRYGQLMEW 640


>gi|213693185|ref|YP_002323771.1| hypothetical protein Blon_2335 [Bifidobacterium longum subsp.
           infantis ATCC 15697 = JCM 1222]
 gi|384200416|ref|YP_005586159.1| glycosyl hydrolase [Bifidobacterium longum subsp. infantis ATCC
           15697 = JCM 1222]
 gi|213524646|gb|ACJ53393.1| conserved hypothetical protein [Bifidobacterium longum subsp.
           infantis ATCC 15697 = JCM 1222]
 gi|320459368|dbj|BAJ69989.1| glycosyl hydrolase [Bifidobacterium longum subsp. infantis ATCC
           15697 = JCM 1222]
          Length = 782

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 189/609 (31%), Positives = 297/609 (48%), Gaps = 56/609 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+TF+G + H+ + IP GNGR+GA++     ++ L LN+DTLW+G P   T+P  P+ +
Sbjct: 1   MKLTFDGISSHWEEGIPFGNGRMGAVLCSEPDADVLYLNDDTLWSGYPHAETSPLTPEIV 60

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL--EFDDSHLKYAEET-----YR 125
           +  R     G Y  AT           D  Q   D ++   F  + ++Y+ E       +
Sbjct: 61  AKARQASSRGDYVSATRII-------QDATQREKDEQIYEPFGTACIRYSSEAGERKHVK 113

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
           R LDL  A A   + +G  +   + + S PD ++V ++S S     S +V+  + L    
Sbjct: 114 RSLDLARALAGESFRLGAADVHVDAWCSAPDDLLVYEMSSSAPVDASVSVT-GTFLKQTR 172

Query: 186 YVNGNNQ------IIMEGRCPGKRIPPKANANDDP-----KGIQFSAILEIKISDDRGTI 234
             +G++       +++ G+ PG  +   A+  D+P      GI  +      ++   G I
Sbjct: 173 ISSGSDSDARQATLVVMGQMPGLNVGSLAHVTDNPWEDERDGIGMAYAGAFSLTVTGGEI 232

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSK---KDPTSESMSALQSIRNLSYS 291
           + ++D  L+  G     L   + S F G    P        D   E+++A  S       
Sbjct: 233 TVIDDV-LQCSGVTGLSLRFRSLSGFKGSAEQPERDMTVLADRLGETIAAWPS----DSR 287

Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP---- 347
            +  RH+ DY++ F RV ++L  +  D       EE    VP AE ++S   ++ P    
Sbjct: 288 AMLDRHVADYRRFFDRVGVRLGPAHDD------DEE----VPFAEILRS--KEDTPHRLE 335

Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
           +L E +F FGRYLLISSSRP TQ +NLQGIWN    P W SA   NIN+EMNYW + PC 
Sbjct: 336 TLSEAMFDFGRYLLISSSRPHTQPSNLQGIWNHKDFPNWYSAYTTNINIEMNYWMTGPCA 395

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           L E  EPL      L   G   A       G  + H  DIW ++    G+  WA WP G 
Sbjct: 396 LKELIEPLVAMNRELLEPGHDAAGAILGCGGSAVFHNVDIWRRALPANGEPTWAFWPFGQ 455

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
           AW+C +L++ Y +  D  +L    +P++   A F +D+L +   G L   P+TSPE+ F+
Sbjct: 456 AWMCRNLFDEYLFNQDESYLAS-IWPIMRDSARFCMDFLSDTEHG-LAPAPATSPENYFV 513

Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEV---LEKNEDALVEKVLKSLPRLRPTKI 584
             DG+   V+++S    AI+R +   +I AA+    L+  + ALV +   +  +L   ++
Sbjct: 514 V-DGETIAVAHTSENTTAIVRNLLDDLIHAAQTMPDLDDGDKALVREAESTRAKLAAVRV 572

Query: 585 AEDGSIMEW 593
             DG I+EW
Sbjct: 573 GSDGRILEW 581


>gi|345510592|ref|ZP_08790159.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
 gi|345454467|gb|EEO49096.2| glycoside hydrolase family 95 [Bacteroides sp. D1]
          Length = 850

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 192/602 (31%), Positives = 297/602 (49%), Gaps = 76/602 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 95  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 154

Query: 77  SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
                G   +A   + + F      Y   G+    F    +  ++  ET         Y+
Sbjct: 155 KAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 213

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
           R L L++A A V++   +V + R +F S P  V+V + S  + G  +L F+ + + +   
Sbjct: 214 RILSLDSAMAVVQFKKDHVVYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 273

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
           +   +GN  ++              +A+ D  G+++  ++ I+     GT+    D KL 
Sbjct: 274 NMASDGNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLFN-ADGKLT 317

Query: 244 VEGSDWAVLLLVASS----SFDGPFINPSD----SKKDPTSESMSALQSIRNLSYSDLYT 295
           V+G+D  V  + A +    +FD  F +P      + ++ T E M+   S R   Y+ L++
Sbjct: 318 VKGADEVVFYITADTDYKPNFDPDFKDPKTYVGVNPEETTKEWMNNAVSQR---YTALFS 374

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLF 354
           +H +DY  LF RV + L+ + K              +P+ +R+K+++  + D  L EL F
Sbjct: 375 QHYNDYAALFDRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYYLEELYF 423

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           QFGRYLLISSSRPG   ANLQGIW+ ++   W    H NIN++MNYW     NL+EC  P
Sbjct: 424 QFGRYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWSVCSTNLNECMLP 483

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTH 473
           L DF+  L   G KTA+  + A GW      +I+  ++  +   + W   PM G WL TH
Sbjct: 484 LVDFIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATH 543

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           +WE+Y+YT D  FL++  Y L++  A F +D+L    DG     PSTSPEH         
Sbjct: 544 IWEYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH--------- 594

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
             +   +T   A++RE+    I A++VL  +K E    E VL +   L P KI   G +M
Sbjct: 595 GPIDQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRYGQLM 651

Query: 592 EW 593
           EW
Sbjct: 652 EW 653


>gi|298483252|ref|ZP_07001431.1| fibronectin type III domain protein [Bacteroides sp. D22]
 gi|298270569|gb|EFI12151.1| fibronectin type III domain protein [Bacteroides sp. D22]
          Length = 815

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 184/597 (30%), Positives = 289/597 (48%), Gaps = 70/597 (11%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
           ++PIGNG LGA + G + +E + LNE TLW G P      +Y    N  +   L ++R  
Sbjct: 63  SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122

Query: 79  VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
             +G   +A   + + F                  +  +G++ +E   S +  +   YRR
Sbjct: 123 FLNGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
            L L++A A V++    + + R++F S PD V+V K +  + G  +  +S   ++   +H
Sbjct: 181 ILSLDSAMAVVQFDKDGIRYQRKYFISYPDSVMVMKFAADKGGKQNLVLSYCPNNEAKSH 240

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
              +GN+ ++  G               +  G++F+    IK     GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
           + +D  V LL A + +   F       K     DP+  +++ + +     Y +LY  H  
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
           DY  LF+RV  +++            E     +P+ +R+ S++    D  L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI+SSRPG   ANLQG+W+ +    W    H NIN++MNYW + P NLSEC  PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
           +  L   G KTAQ  + A GW      +I+  ++    K + W L P  G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y+YT D  FL++  Y L++  A F +D L    DG     PSTSPEH           V 
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
              T   A++RE+    I A++VL    DA   K  ++ L +L P +I   G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLAKLVPYRIGRYGQLLEW 619


>gi|319639947|ref|ZP_07994674.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
 gi|345516953|ref|ZP_08796433.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
 gi|254833732|gb|EET14041.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
 gi|317388225|gb|EFV69077.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
          Length = 818

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 197/639 (30%), Positives = 300/639 (46%), Gaps = 82/639 (12%)

Query: 2   MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
           + A+ST  T  L I F+ P      A  ++ A         +PIGNG LG  V G + +E
Sbjct: 17  LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76

Query: 47  TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
            + LNE TLW G P            N ++   L ++R     G   +A   + K F   
Sbjct: 77  RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136

Query: 99  ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
           AD             +  LG+  +E   S +   +  Y+R L L++A A V +    V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGMTD--YKRILSLDSAMAIVSFRKDEVNY 194

Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
            R++F S PD V+V K +    G  +L F+   +         +G N+++  GR      
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKTDGPNRLLYTGRL----- 249

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
                     K  Q    L I+  +  G+++   D K  V  +D  + LL A +    +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298

Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
           +  F +P      DP   +++ L +    +Y++L  RH  DY +LF RV +QL+  +P  
Sbjct: 299 NPDFKDPKTYVGPDPDQTTLAMLDAAAAKNYNELCERHKTDYTQLFGRVKLQLNPHAPM- 357

Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
               T     +  +P+ +R+  ++  + D  L E+ +QFGRYLLI+SSRPG   ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTHQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413

Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
           W   +   W    H NIN++MNYW +   NL+EC  PL DF+  L   G KTAQ  + A 
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473

Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
           GW      +I+  +S      + W   PM G WL TH+WE+Y+YT D+ FL++  Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533

Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
             A+F +D+L    DG     PSTSPEH           V   +T   A++RE+    I 
Sbjct: 534 SSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584

Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A++ L  +  +    + VLK    L P +I   G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEW 620


>gi|443630249|ref|ZP_21114539.1| putative Fibronectin type III domain-containing protein
           [Streptomyces viridochromogenes Tue57]
 gi|443336258|gb|ELS50610.1| putative Fibronectin type III domain-containing protein
           [Streptomyces viridochromogenes Tue57]
          Length = 744

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 186/586 (31%), Positives = 290/586 (49%), Gaps = 64/586 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-----DYTN-----PDAPKALSD- 74
           +A+PIGNG LGAMV+G + SE L+ NE TLWTG PG     D+ N     PDA  A+ D 
Sbjct: 14  EALPIGNGALGAMVFGTLASERLQFNEKTLWTGGPGSAQGYDHGNWRTPRPDAITAVQDD 73

Query: 75  --VRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
              R+ +D  + A+        +G     +Q  GD+ L+   +      + YRRELDL+ 
Sbjct: 74  LDARTTLDPEEVADRLGQPRIGYG----AHQTFGDLHLDIPGAPTTPPAD-YRRELDLDK 128

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           A A V Y+   V   R+  +S PD VI  ++     GS++F +   S   + +    +  
Sbjct: 129 AVASVGYTYQGVRHQRDFLASYPDGVIAGRLHADRPGSVTFTLRYTSPRADFTATAADGT 188

Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
           + + G          A A++   G++F A  ++++    GT+++  +  + V G+D A  
Sbjct: 189 LTVRG----------ALADN---GLRFEA--QVRVRSRGGTVTSDANGTITVTGADSAWF 233

Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
           +L A + +   +  P     DP +    A++   +  Y  L  RH+ D++ LF RV++ +
Sbjct: 234 VLAAGTDYADTY--PDYRGPDPHAAVGRAVRQAGD-RYEALLARHVRDHRALFRRVALDI 290

Query: 313 SRS-PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
            +S P D+ TD           +A+R              L F++GRYLLI+SSRPG+  
Sbjct: 291 GQSLPADVPTDRLLAAYAGGAGAADRALE----------ALYFEYGRYLLIASSRPGSLP 340

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
           ANLQG+WN   +P W +  H NIN++MNYW +   NL+E   P   F+  L   G +TAQ
Sbjct: 341 ANLQGVWNNSTTPPWSADYHTNINIQMNYWPAEAANLAETTPPYDRFVEALRAPGRRTAQ 400

Query: 432 VNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
             + + GWV+H++T+ +  +   D     W  +P   AWL   L+EHY +    D+L   
Sbjct: 401 EMFGSRGWVVHNETNPYGFTGVHDWATAFW--FPEAAAWLTQQLYEHYRFAGSTDYLRTT 458

Query: 491 AYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIR 548
           AYP ++    F LD L  +  DG L   PS SPEH +F A           + M   I+ 
Sbjct: 459 AYPAMKEATEFWLDNLRTDPRDGTLVVTPSYSPEHGDFTA----------GAAMSQQIVH 508

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
           ++F++ + AA +L    D    +V  +L RL P  +I   G + EW
Sbjct: 509 DLFTSTLEAARILGDAPD-FRRRVEAALNRLDPGLRIGSWGQLQEW 553


>gi|307565695|ref|ZP_07628164.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A]
 gi|307345521|gb|EFN90889.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A]
          Length = 771

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 187/598 (31%), Positives = 289/598 (48%), Gaps = 68/598 (11%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVRSL 78
           ++PIGNG LGA + GG+  +   LNE +LW G PG           N  +   L  +R  
Sbjct: 64  SLPIGNGSLGANIMGGIACDRFTLNEKSLWRGGPGVKGGAAYYWDQNKQSAHFLKAIRKA 123

Query: 79  VDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD-----------SHLKYAEETYRRE 127
              G    A   +   F   A  Y +  +    F +            H +     Y+R 
Sbjct: 124 FLQGNTKLAAKLTQDNFNGKA-AYSIATEPHFRFGNFTTMGEVTIQTGHKEQDISGYKRC 182

Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS--GSESGSLSFNVSLDSLLDNHS 185
           L L++A A V Y      + R +F S PD V+V K +  G++  +L+   +   +     
Sbjct: 183 LSLDSAIASVSYHTNTTYYKRSYFISYPDNVMVIKYTAKGADLLNLTLTYTPSPIAQGQV 242

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
             +  + I  +G+            ND+   ++F+  + IK + D GT S + D KL + 
Sbjct: 243 VNDSTDGITYKGKL-----------NDN--NMRFT--IRIKANIDSGT-SKVIDGKLHIL 286

Query: 246 GSDWAVLLLVASSSFDGPFINPS--DSKK----DPTSESMSALQSIRNLSYSDLYTRHLD 299
            +      L A + +     NPS  D K     +P   +   ++      Y++L   HL 
Sbjct: 287 KAKTVTFFLTADTDYKQN-TNPSFTDPKTYIGVNPDKTTKKWIKHALQKGYNNLLNNHLA 345

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
           DY  LF RV + ++   KD     C       +P+ +R++ ++T + D  L  L FQ+GR
Sbjct: 346 DYTPLFKRVKLIINPDDKDTKEALC-------LPTNKRLQRYRTGKADYDLEALYFQYGR 398

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI+SSRPGT  ANLQG+W+ ++   W    H NINL+MNYW +L  NL+EC  PL +F
Sbjct: 399 YLLIASSRPGTLPANLQGLWHNNVDGPWRVDYHNNINLQMNYWHALTTNLAECALPLNNF 458

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
           +  L   G +TA+  Y A GW     ++I+  ++    K + W L P+ G WL THLWE+
Sbjct: 459 ICMLEKPGRRTAKAYYNARGWTTSISSNIFGFTAPLIDKDMTWNLSPISGPWLSTHLWEY 518

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y++T ++ +L   AYP+L+G A F +D+L    DG     PSTSPEH           + 
Sbjct: 519 YDFTRNKTYLRNTAYPILKGSAQFAVDFLWHKPDGTYTAAPSTSPEH---------GSID 569

Query: 538 YSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             +T   A++RE+ +  I+A++VL  ++ E    EKVL    +L P +I   G +MEW
Sbjct: 570 QGATFVHAVVREILTDAIAASKVLDIDRKERKQWEKVLL---KLSPYRIGRYGQLMEW 624


>gi|423313025|ref|ZP_17290961.1| hypothetical protein HMPREF1058_01573 [Bacteroides vulgatus
           CL09T03C04]
 gi|392686239|gb|EIY79545.1| hypothetical protein HMPREF1058_01573 [Bacteroides vulgatus
           CL09T03C04]
          Length = 818

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 197/639 (30%), Positives = 300/639 (46%), Gaps = 82/639 (12%)

Query: 2   MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
           + A+ST  T  L I F+ P      A  ++ A         +PIGNG LG  V G + +E
Sbjct: 17  LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76

Query: 47  TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
            + LNE TLW G P            N ++   L ++R     G   +A   + K F   
Sbjct: 77  RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136

Query: 99  ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
           AD             +  LG+  +E   S +   +  Y+R L L++A A V +    V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGMTD--YKRILSLDSAMAIVSFRKDEVNY 194

Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
            R++F S PD V+V K +    G  +L F+   +         +G N+++  GR      
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKTDGPNRLLYTGRL----- 249

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
                     K  Q    L I+  +  G+++   D K  V  +D  + LL A +    +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298

Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
           +  F +P      DP   +++ L +    +Y++L  RH  DY +LF RV +QL+  +P  
Sbjct: 299 NPDFKDPKTYVGPDPDQTTLAMLDAAAAKNYNELCERHKTDYTQLFGRVKLQLNPHAPM- 357

Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
               T     +  +P+ +R+  ++  + D  L E+ +QFGRYLLI+SSRPG   ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTHQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413

Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
           W   +   W    H NIN++MNYW +   NL+EC  PL DF+  L   G KTAQ  + A 
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473

Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
           GW      +I+  +S      + W   PM G WL TH+WE+Y+YT D+ FL++  Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533

Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
             A+F +D+L    DG     PSTSPEH           V   +T   A++RE+    I 
Sbjct: 534 SSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584

Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A++ L  +  +    + VLK    L P +I   G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEW 620


>gi|262406087|ref|ZP_06082637.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
 gi|294648155|ref|ZP_06725698.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
 gi|294809712|ref|ZP_06768400.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
 gi|262356962|gb|EEZ06052.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
 gi|292636539|gb|EFF55014.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
 gi|294443087|gb|EFG11866.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 830

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 192/602 (31%), Positives = 297/602 (49%), Gaps = 76/602 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 75  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 134

Query: 77  SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
                G   +A   + + F      Y   G+    F    +  ++  ET         Y+
Sbjct: 135 KAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 193

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
           R L L++A A V++   +V + R +F S P  V+V + S  + G  +L F+ + + +   
Sbjct: 194 RILSLDSAMAVVQFKKDHVVYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 253

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
           +   +GN  ++              +A+ D  G+++  ++ I+     GT+    D KL 
Sbjct: 254 NMASDGNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLFN-ADGKLT 297

Query: 244 VEGSDWAVLLLVASS----SFDGPFINPSD----SKKDPTSESMSALQSIRNLSYSDLYT 295
           V+G+D  V  + A +    +FD  F +P      + ++ T E M+   S R   Y+ L++
Sbjct: 298 VKGADEVVFYITADTDYKPNFDPDFKDPKTYVGVNPEETTKEWMNNAVSQR---YTALFS 354

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLF 354
           +H +DY  LF RV + L+ + K              +P+ +R+K+++  + D  L EL F
Sbjct: 355 QHYNDYAALFDRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYYLEELYF 403

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           QFGRYLLISSSRPG   ANLQGIW+ ++   W    H NIN++MNYW     NL+EC  P
Sbjct: 404 QFGRYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWSVCSTNLNECMLP 463

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTH 473
           L DF+  L   G KTA+  + A GW      +I+  ++  +   + W   PM G WL TH
Sbjct: 464 LVDFIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATH 523

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           +WE+Y+YT D  FL++  Y L++  A F +D+L    DG     PSTSPEH         
Sbjct: 524 IWEYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH--------- 574

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
             +   +T   A++RE+    I A++VL  +K E    E VL +   L P KI   G +M
Sbjct: 575 GPIDQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRYGQLM 631

Query: 592 EW 593
           EW
Sbjct: 632 EW 633


>gi|299145135|ref|ZP_07038203.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
 gi|298515626|gb|EFI39507.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
          Length = 815

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 182/597 (30%), Positives = 290/597 (48%), Gaps = 70/597 (11%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
           ++PIGNG LGA + G + +E + LNE TLW G P      +Y    N  +   L ++R  
Sbjct: 63  SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSAGVLKEIRQA 122

Query: 79  VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
              G   +A   + + F                  +  +G++ +E   S +  +  +YRR
Sbjct: 123 FLDGDSQKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--SYRR 180

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
            L L++A A V++    + + R++F S PD V+V K +  + G  +  +S   ++   +H
Sbjct: 181 ILSLDSAMAVVQFDKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
              +GN+ ++  G               +  G++F+    IK     GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
           + +D  V LL A + +   F       K     DP+  +++ + ++    Y +LY  H  
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMNNVLKKGYDELYRNHEA 344

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
           DY  LF+RV  ++++           E     +P+ +R+ +++    D  L +L +QFGR
Sbjct: 345 DYTALFNRVRFEINQ-----------EIGSPNLPTYKRLANYKKGTPDYQLEQLYYQFGR 393

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI+SSRPG   ANLQG+W+ +    W    H NIN++MNYW + P NL EC  PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLPECTWPLIDF 453

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
           +  L   G KTAQ  + A GW      +I+  ++    K + W L P  G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y+YT D  FL++  Y L++  A F +D L    DG     PSTSPEH           V 
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
              T   A++RE+    I A++VL    DA   K  ++ L +L P +I   G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEW 619


>gi|242815487|ref|XP_002486578.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
 gi|218714917|gb|EED14340.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
          Length = 787

 Score =  266 bits (679), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 180/596 (30%), Positives = 288/596 (48%), Gaps = 56/596 (9%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +   A  F  A+P+GNGRLG +++   P+E + LNE+++W+G   +  NP+A   L++VR
Sbjct: 29  YTSAATDFNSALPVGNGRLGGLMYC-TPTERVSLNENSIWSGPFLNRLNPNAKSVLTEVR 87

Query: 77  SLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           S+++SG    A   ++  + G+P     Y  LG + L+F  S    ++ +  R LD    
Sbjct: 88  SMLESGNITGAGQVALPNMAGNPNSPQHYTPLGQLNLDFGHS----SQGSLNRWLDTYQG 143

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD----NHSYVNG 189
            +   Y    V +TRE  ++ P  V+  ++  S++G L+  +SL  L +      S   G
Sbjct: 144 NSGCSYIYNGVNYTREIIANYPTGVLAMRLQASQAGQLNIKISLSRLQNVISNTASTSGG 203

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
            N I+M+G   G           +P    F+A  ++  S       +     L V G+  
Sbjct: 204 ANSIVMKGNSGGS----------NPY---FAAEAQVIASGGS---VSASGSTLSVSGATT 247

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
             +   A +S+         ++    +E    L S  +  Y  L T  + D   L  RVS
Sbjct: 248 VDIFFDAEASYR------YSTEAAAETELTRKLSSATSQGYQALRTAAIADNTALVGRVS 301

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR- 366
           + L  S                 P+ +R+ +++++   D  LV L++  GR+LL++SSR 
Sbjct: 302 LNLGSSSGSAANQ----------PTDKRLSNYKSNPGNDVQLVTLMYNMGRHLLVASSRD 351

Query: 367 --PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
             P +  ANLQGIWNED +P W S   +NINLEMNYW +   NL+E  +P +D L     
Sbjct: 352 TGPLSLPANLQGIWNEDFNPAWGSKYTININLEMNYWHAETTNLAETTKPFWDLLAVAKT 411

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G   A   Y  SG+V+HH  D W   +       + +WP+GG WL THL EHY +T ++
Sbjct: 412 RGELAASSMYGCSGFVLHHNIDCWGDPAPVDYGTPYTIWPLGGVWLSTHLMEHYRFTGNK 471

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYS 539
            FL++ A+P+L+  A F   +     +GY  T PS SPE+ FI P      G    +  S
Sbjct: 472 TFLQETAWPILQSAADFCFCYTFL-WNGYYTTGPSLSPENSFIVPSNESKAGNAEGIDIS 530

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            TMD +++ ++FS +I A ++L              L +++P +    G I+EW Q
Sbjct: 531 PTMDNSLLYQLFSDVIEACQILGLTSSE-CSNAKNYLSKIKPPQTGSYGQILEWRQ 585


>gi|345514340|ref|ZP_08793853.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
 gi|229437320|gb|EEO47397.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
          Length = 818

 Score =  266 bits (679), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 197/639 (30%), Positives = 299/639 (46%), Gaps = 82/639 (12%)

Query: 2   MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
           + A+ST  T  L I F+ P      A  ++ A         +PIGNG LG  V G + +E
Sbjct: 17  LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76

Query: 47  TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
            + LNE TLW G P            N ++   L ++R     G   +A   + K F   
Sbjct: 77  RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136

Query: 99  ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
           AD             +  LG+  +E   S +      Y+R L L++A A V +    V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEI--GMTNYKRILSLDSAMAVVSFRKDEVNY 194

Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
            R++F S PD V+V K +    G  +L F+   +        V+G N+++  G       
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKVDGPNRLLYTGCL----- 249

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
                     K  Q    L I+  +  G+++   D K  V  +D  + LL A +    +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298

Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
           +  F +P      DP   +++ + +    SY++L  RH  DY +LF RV +QL+ R+P  
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357

Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
               T     +  +P+ +R+  ++  + D  L E+ +QFGRYLLI+SSRPG   ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413

Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
           W   +   W    H NIN++MNYW +   NL+EC  PL DF+  L   G KTAQ  + A 
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473

Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
           GW      +I+  +S      + W   PM G WL TH+WE+Y+YT D+ FL++  Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533

Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
             A+F +D+L    DG     PSTSPEH           V   +T   A++RE+    I 
Sbjct: 534 SSANFAIDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584

Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A++ L  +  +    + VL     L P +I   G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLN---HLVPYQIGRYGQLMEW 620


>gi|423301304|ref|ZP_17279328.1| hypothetical protein HMPREF1057_02469 [Bacteroides finegoldii
           CL09T03C10]
 gi|408471905|gb|EKJ90434.1| hypothetical protein HMPREF1057_02469 [Bacteroides finegoldii
           CL09T03C10]
          Length = 802

 Score =  266 bits (679), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 200/637 (31%), Positives = 304/637 (47%), Gaps = 98/637 (15%)

Query: 4   AESTSTTNPLKITFNGP---AKHF---TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
           A  T  T  L I F+ P    +H    + ++PIGNG LGA + G V +E +  NE TLW 
Sbjct: 20  AGETEYTKGLSIWFDTPNVMEEHTAWESRSLPIGNGSLGANIIGSVDTERITFNEKTLWR 79

Query: 58  GVP-----GDY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGH--PADV------ 101
           G P      +Y    N  +   L ++R     G   +A   + + F    P +       
Sbjct: 80  GGPNTAKGAEYYWNVNKQSAHVLDEIRKAFTEGDQQKAEMLTRQNFNSEVPYEANREKPF 139

Query: 102 ----YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
               + ++G+  +E     L  ++  Y+R L L++A A V++   NV + R +F S P  
Sbjct: 140 RFGNFTIMGEFYVETGLDTLGISD--YKRILSLDSALAVVQFKKNNVAYQRSYFISYPAN 197

Query: 158 VIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK 215
           V+V + S   +G  +L F+ + +S              I +G   G          D  K
Sbjct: 198 VMVMRFSADRAGMQNLVFSYAPNS--------------ISQGSLSG----------DGDK 233

Query: 216 GIQFSA---------ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFIN 266
           G+ FSA         ++ I+     GT+S     +L V+G+D  V  + A + +   F N
Sbjct: 234 GLVFSASLNNNGMKYVVRIQAETKGGTLSN-AGCRLTVKGADEVVFYVTADTDYKMNF-N 291

Query: 267 P--SDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
           P   D K     DP   +   + +     Y+ L+ +H  DY  LF+R+ + L+ + K   
Sbjct: 292 PDFKDPKTYVGVDPAETTCQWINNAVMQGYTALFQQHYSDYAALFNRLRLNLNPTVK--- 348

Query: 321 TDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
                      +P+ +R+K+++  + D  L EL +QFGRYLLI+SSR G   ANLQGIW+
Sbjct: 349 --------TSDIPTPQRLKNYRNGQPDYYLEELYYQFGRYLLIASSRAGNMPANLQGIWH 400

Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
            D+   W    H NIN++MNYW + P NLSEC  PL DF+  L   G KTAQ  + A GW
Sbjct: 401 NDVDGPWRVDYHNNINVQMNYWPACPTNLSECMLPLVDFIRTLVKPGEKTAQSYFGARGW 460

Query: 440 VIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
                ++I+  ++  +   + W   PM G WL TH+WE+Y+YT D +FL++  Y L++  
Sbjct: 461 TASISSNIFGFTTPLESQDMSWNFNPMAGPWLATHIWEYYDYTRDLNFLKETGYELIKSS 520

Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
           A F +D+L    DG     PSTSPEH           V   +T   A++RE+    I A+
Sbjct: 521 ADFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLDAIEAS 571

Query: 559 EVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +VL  +K +      VL    +L P KI   G +MEW
Sbjct: 572 KVLGVDKKKRKQWNDVLS---KLVPYKIGRYGQLMEW 605


>gi|224026224|ref|ZP_03644590.1| hypothetical protein BACCOPRO_02980 [Bacteroides coprophilus DSM
           18228]
 gi|224019460|gb|EEF77458.1| hypothetical protein BACCOPRO_02980 [Bacteroides coprophilus DSM
           18228]
          Length = 825

 Score =  266 bits (679), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 183/598 (30%), Positives = 285/598 (47%), Gaps = 70/598 (11%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP------GDY--TNPDAPKALSDVRSL 78
           ++P+GNG +GA + G V  E    NE TLW G P        Y   N ++   L D+R  
Sbjct: 70  SLPVGNGSIGANIMGSVSVERFTFNEKTLWRGGPRTVKNAASYWNVNKESAHVLKDIRQA 129

Query: 79  VDSGQYAEATAASVKLFG----HPADV--------YQLLGDIELEFDDSHLKYAEETYRR 126
              G   +AT  +   F     + AD         +   G+  ++      KY+   Y R
Sbjct: 130 FADGNVEKATQLTQDNFNSEVPYEADAEEPFRFGSFTSCGEFRIQTGLDEQKYS--GYSR 187

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNH 184
            L L++A   V++    V + R+ F+S P  V+V + +  +    +L  N + + L  +H
Sbjct: 188 SLLLDSALVTVRFEQEGVHYRRDFFTSYPHNVMVVRFTADQEKRQNLVLNYTPNPL--SH 245

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
                 N+   +G C   R+             Q   ++  K   + G +       + V
Sbjct: 246 GKFKAENR---DGFCFDARLDNN----------QMHYVVRAKAVAEGGKVWTDRQGNIHV 292

Query: 245 EGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRHLD 299
           EG+D    L+ A +    +FD  F +P      DP   +   ++   +LSY++L   H  
Sbjct: 293 EGADEVYFLITADTDYQINFDPDFKDPKTYVGVDPLRTTREWMKQAASLSYAELLGEHYT 352

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
           DY  LF R  ++L+   K  +T          +P+  R++ ++T   D SL  L +QFGR
Sbjct: 353 DYAALFGRTQLELNPDQKGGMT----------LPTPRRLERYRTGAPDYSLESLYYQFGR 402

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI+SSRPG   ANLQG+W+ ++   W    H NIN++MNYW + P NLSEC++PL DF
Sbjct: 403 YLLIASSRPGNLPANLQGMWHNNVDGPWRVDYHNNINVQMNYWPACPTNLSECEQPLIDF 462

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
           +      G +TA+  + A GW     ++I+  ++  R K + W   P+ G WL TH+W +
Sbjct: 463 IRMQVKPGKETARAYFGARGWTTSISSNIFGFTTPLRDKDMSWNFSPVAGPWLATHVWNY 522

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y+YT D +FL    Y L++G A F +D+L    DG     PSTSPEH           + 
Sbjct: 523 YDYTRDLEFLRTVGYDLIKGAADFSVDYLWHKPDGTYTAAPSTSPEH---------GPID 573

Query: 538 YSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             +T   A+IRE+    I A+  L  ++ E A  E+VL+ +P   P +I   G +MEW
Sbjct: 574 QGATFSHAVIREILLDAIEASRTLNVDEQERARWEEVLQGMP---PYQIGRYGQLMEW 628


>gi|293373575|ref|ZP_06619926.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
 gi|292631473|gb|EFF50100.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
          Length = 815

 Score =  266 bits (679), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 184/597 (30%), Positives = 288/597 (48%), Gaps = 70/597 (11%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
           ++PIGNG LGA + G + +E + LNE TLW G P      +Y    N  +   L ++R  
Sbjct: 63  SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122

Query: 79  VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
              G   +A   + + F                  +  +G++ +E   S +  +   YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
            L L++A A V++    + + R++F S PD V+V K +  + G  +  +S   ++   +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
              +GN+ ++  G               +  G++F+    IK     GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
           + +D  V LL A + +   F       K     DP+  +++ + +     Y +LY  H  
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
           DY  LF+RV  +++            E     +P+ +R+ S++    D  L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI+SSRPG   ANLQG+W+ +    W    H NIN++MNYW + P NLSEC  PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
           +  L   G KTAQ  + A GW      +I+  ++    K + W L P  G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y+YT D  FL++  Y L++  A F +D L    DG     PSTSPEH           V 
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
              T   A++RE+    I A++VL    DA   K  ++ L +L P +I   G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEW 619


>gi|423214546|ref|ZP_17201074.1| hypothetical protein HMPREF1074_02606 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692961|gb|EIY86197.1| hypothetical protein HMPREF1074_02606 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 815

 Score =  266 bits (679), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 184/597 (30%), Positives = 288/597 (48%), Gaps = 70/597 (11%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
           ++PIGNG LGA + G + +E + LNE TLW G P      +Y    N  +   L ++R  
Sbjct: 63  SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122

Query: 79  VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
              G   +A   + + F                  +  +G++ +E   S +  +   YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
            L L++A A V++    + + R++F S PD V+V K +  + G  +  +S   ++   +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
              +GN+ ++  G               +  G++F+    IK     GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
           + +D  V LL A + +   F       K     DP+  +++ + +     Y +LY  H  
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
           DY  LF+RV  +++            E     +P+ +R+ S++    D  L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI+SSRPG   ANLQG+W+ +    W    H NIN++MNYW + P NLSEC  PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
           +  L   G KTAQ  + A GW      +I+  ++    K + W L P  G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y+YT D  FL++  Y L++  A F +D L    DG     PSTSPEH           V 
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
              T   A++RE+    I A++VL    DA   K  ++ L +L P +I   G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEW 619


>gi|423214184|ref|ZP_17200712.1| hypothetical protein HMPREF1074_02244 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693129|gb|EIY86364.1| hypothetical protein HMPREF1074_02244 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 850

 Score =  265 bits (678), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 188/599 (31%), Positives = 294/599 (49%), Gaps = 70/599 (11%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 95  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 154

Query: 77  SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
                G   +A   + + F      Y   G+    F    +  ++  ET         Y+
Sbjct: 155 QAFMEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 213

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
           R L L++A A V++    V + R +F S P  V+V + S  + G  +L F+ + + +   
Sbjct: 214 RILSLDSAMAVVQFKKDYVAYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 273

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
           +   + N  ++              +A+ D  G+++  ++ I+     GT+S   D KL 
Sbjct: 274 NMASDSNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLSN-ADGKLT 317

Query: 244 VEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRHL 298
           V+G+D  V  + A +    +FD  F +P       P   +   + +  +  Y+ L+++H 
Sbjct: 318 VKGADEVVFYITADTDYKPNFDPDFKDPKTYVGVKPEETTKEWMNNAVSQGYTALFSQHY 377

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
           +DY  LF+RV + L+ + K              +P+ +R+K+++  + D  L EL FQFG
Sbjct: 378 NDYAALFNRVKLNLNPAIKG-----------KNMPTPQRLKNYRAGQPDYDLEELYFQFG 426

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLISSSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+EC  PL D
Sbjct: 427 RYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECMLPLVD 486

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWE 476
           F+  L   G KTA+  + A GW      +I+  ++  +   + W   PM G WL TH+WE
Sbjct: 487 FIHTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIWE 546

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           +Y+YT D  FL++  Y L++  A F +D+L    DG     PSTSPEH           +
Sbjct: 547 YYDYTRDLTFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GPI 597

Query: 537 SYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
              +T   A++RE+    I A++VL  +K E    E VL +   L P KI   G +MEW
Sbjct: 598 DQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRYGQLMEW 653


>gi|237722074|ref|ZP_04552555.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
 gi|229448943|gb|EEO54734.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
          Length = 815

 Score =  265 bits (678), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 182/597 (30%), Positives = 289/597 (48%), Gaps = 70/597 (11%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
           ++PIGNG LGA + G + +E + LNE TLW G P      +Y    N  +   L ++R  
Sbjct: 63  SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122

Query: 79  VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
              G   +A   + + F                  +  +G++ +E   S +  +   YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
            L L++A A V++    + + R++F S PD V+V K +  + G  +  +S   ++   +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
              +GN+ ++  G               +  G++F+    IK     GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFT--FRIKAIHKGGTLKA-ENDRIIV 284

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
           + +D  V LL A + +   F       K     DP+  +++ + ++    Y +LY  H  
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMNNVLKKGYDELYRNHEA 344

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
           DY  LF+RV  ++++           E     +P+ +R+ +++    D  L +L +QFGR
Sbjct: 345 DYTALFNRVRFEINQ-----------EIGSPNLPTYKRLANYKKGTPDYQLEQLYYQFGR 393

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI+SSRPG   ANLQG+W+ +    W    H NIN++MNYW + P NL EC  PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLPECTWPLIDF 453

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
           +  L   G KTAQ  + A GW      +I+  ++    K + W L P  G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y+YT D  FL++  Y L++  A F +D L    DG     PSTSPEH           V 
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
              T   A++RE+    I A++VL    DA   K  ++ L +L P +I   G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEW 619


>gi|255693982|ref|ZP_05417657.1| fibronectin type III domain protein [Bacteroides finegoldii DSM
           17565]
 gi|260620198|gb|EEX43069.1| hypothetical protein BACFIN_09249 [Bacteroides finegoldii DSM
           17565]
          Length = 820

 Score =  265 bits (678), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 191/608 (31%), Positives = 294/608 (48%), Gaps = 73/608 (12%)

Query: 18  NGPAKHFTDA-IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP------GDYTNPDAPK 70
           N P K + ++ +PIGNG LGA + G + +E + LNE TLW G P      G Y N +   
Sbjct: 53  NNPDKAWENSSLPIGNGSLGANILGSISAERITLNEKTLWKGGPNTAKGAGYYWNVNKQS 112

Query: 71  A--LSDVRSLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSH 116
           A  L D+R     G   +A   + + F   A+             +  +G++ +E   S 
Sbjct: 113 ANILKDIRQAFLDGNKEKAARLTQENFNGLAEYEERDETPFRFGSFTTMGELYIETGLSE 172

Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
           +    + Y R L L++A A V++     E+ R++F S PD V+V K + ++ G  +  +S
Sbjct: 173 INM--KNYHRILSLDSAMAVVQFDKDGTEYQRKYFISYPDSVMVMKFTANKKGKQNLVLS 230

Query: 177 LDSLLDNHSYV--NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
                +  SY+  +GNN +   G           N N      +  A+        +G I
Sbjct: 231 YCPNSEAESYLSADGNNGLGYTGVL---------NNNKMKFAFRIKAL-------HKGGI 274

Query: 235 SALEDKKLKVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLS 289
              E+ ++ V+ +D  V LL A +    +F+  F +P     KDP   +++ + +     
Sbjct: 275 LKTENSRIIVKDADEVVFLLTADTDYKINFNPDFNDPQTYVGKDPEQTTLAMMNNALEKG 334

Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPS 348
           Y  L   H  DY  LF+RV +Q++            E     +P+ +R+ +++    D  
Sbjct: 335 YDKLIRNHKTDYTALFNRVQLQIN-----------PEAGTPDLPTYKRLDNYRKGVPDYQ 383

Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
           L +L +QFGRYLLI+SSRPG   ANLQG+W+ +L   W    H NIN++MNYW +   NL
Sbjct: 384 LEQLYYQFGRYLLIASSRPGNMPANLQGLWHNNLDGPWRVDYHNNINIQMNYWPACSANL 443

Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV-WALWPMGG 467
           SEC  PL DF+  L   G KTAQ  + A GW      +I+  ++    K + W L P+ G
Sbjct: 444 SECTWPLIDFIRSLVKPGEKTAQSYFNARGWTASISANIFGFTAPLSSKSMEWNLNPIVG 503

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
            WL TH+WE+Y+YT D+ FL +  Y L++  A F +D L    DG     PSTSPEH   
Sbjct: 504 PWLATHIWEYYDYTRDKRFLSEIGYELIKSSAQFTVDHLWHKPDGTYTAAPSTSPEH--- 560

Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIA 585
                   V    T   A++RE+    I A++VL  ++ E    E +   L +L P +I 
Sbjct: 561 ------GPVDEGVTFAHAVVREILLDAIQASKVLGVDRKERRQWENI---LAKLVPYRIG 611

Query: 586 EDGSIMEW 593
             G ++EW
Sbjct: 612 RYGQLLEW 619


>gi|220911208|ref|YP_002486517.1| twin-arginine translocation pathway signal [Arthrobacter
           chlorophenolicus A6]
 gi|219858086|gb|ACL38428.1| twin-arginine translocation pathway signal [Arthrobacter
           chlorophenolicus A6]
          Length = 781

 Score =  265 bits (677), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 189/605 (31%), Positives = 287/605 (47%), Gaps = 56/605 (9%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP-----DAPKA 71
           + GPA+ F +++P+GNG  GA + G    E +++NE + W+G P D + P     +    
Sbjct: 4   YRGPAEKFVESLPVGNGLAGATLRGLAGGERIQINEGSAWSG-PTDRSAPPLDPAEGTAR 62

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
           L  VR  VD+G    A    +   G  +  Y  L    L  D        +   R LDL 
Sbjct: 63  LHAVREAVDAGDVRRAEELLLAFQGTHSQAY--LPFAVLSVDAEGTAAPADGPARWLDLR 120

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           T  A  +Y +   E     F+S+PD VIV  I+ S    L   ++ D +        G +
Sbjct: 121 TGVAGHRYLLDGAEARHRTFASHPDAVIVHDIAFSAPADLRIGIAPDKI-----TATGMD 175

Query: 192 QIIME-------GRCPGKRIPPKANANDDP----KGIQFSAILEIKISDDRGTISALEDK 240
            +  +       G      + P     D P     G +  A+     +D     +     
Sbjct: 176 AVTRDWGTELRLGLLLPADVAPAHEQADHPVVYGHGSRAGAVHAGVATDGD---AGFARG 232

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR------NLSYSDLY 294
            L + G+ +  +++   +  + PF   +++  D  +++++ L S R        +     
Sbjct: 233 VLAIRGATFVRIVVATGTVLNHPFARHANTADD--ADALAGLLSARIAGVLEEEAVEPAL 290

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELL 353
            RHL D+ +L+ RV+++L                    P+ ER+++F+TD+ D +L+ LL
Sbjct: 291 QRHLADHARLYSRVTLELG----------GGPAAAAGKPTDERIRAFETDKSDSALMALL 340

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           F +GRYLLI+SSR G   ANLQGIWNE+L   W S   +NIN +MNYW +L  +L+EC E
Sbjct: 341 FHYGRYLLIASSREGGFPANLQGIWNEELQAPWSSNYTININTQMNYWPALTTSLAECHE 400

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS---SADRGKVVWALWPMGGAWL 470
           PL   +  L+      A   Y A GWV HH TD W       A +G  +WA W MGG WL
Sbjct: 401 PLLRLVDTLART-GAAAAGLYGARGWVAHHNTDPWGHPFAVGAGKGNAMWASWAMGGTWL 459

Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
              +W HY +T D   LEK ++P LEG   F LDW+         T+PSTSPE+ F+A D
Sbjct: 460 AEAVWRHYAFTGDLARLEK-SWPALEGACLFALDWITGEPGSGTHTSPSTSPENRFVADD 518

Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE--KVLKSLPRLRPTKIAEDG 588
           G  A V  S+TMD++++R +  +   AA VL      L E  + + +LP+     I   G
Sbjct: 519 GGPAAVGRSATMDVSLLRALCGSARQAAAVLGAPVPWLDEFTRKVAALPQ---PAIGSRG 575

Query: 589 SIMEW 593
            ++EW
Sbjct: 576 EVLEW 580


>gi|317505590|ref|ZP_07963500.1| alpha-L-fucosidase [Prevotella salivae DSM 15606]
 gi|315663302|gb|EFV03059.1| alpha-L-fucosidase [Prevotella salivae DSM 15606]
          Length = 828

 Score =  265 bits (677), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 181/600 (30%), Positives = 296/600 (49%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
           + ++P+GNG LGA + G + +E +  NE TLW G P      DA           L+++R
Sbjct: 72  SQSLPLGNGSLGANIMGSIEAERITFNEKTLWRGGPNTSAGADAYWNVNKQSAHYLNEIR 131

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + K F                  +  +G+  +E   S +  +E  Y
Sbjct: 132 QAFIEGDEKKAALLTRKNFNSTVPYESWKENPFRFGNFTTMGEFYIETGLSSIGMSE--Y 189

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++    V + R +F S P+ V+V +    + G  +L F+   + +  
Sbjct: 190 KRALSLDSALATVQFKKDGVRYERNYFISYPNNVMVVRFKADQPGKQNLVFSYESNPVST 249

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G+N ++            KA+ +++    Q   ++ I+  +  GTIS  ++ KL
Sbjct: 250 GKMEADGSNGLVF-----------KAHLDNN----QMEYVVRIQALNQGGTISN-DNGKL 293

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            + G++  V L+ A +    +F+  F NP +    +P+  + + ++      Y  L   H
Sbjct: 294 SINGANEVVFLITADTDYKVNFNPDFKNPRAYVGVNPSETTAAWMKKAVAQGYDALLQVH 353

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQF 356
             DY  LF+RVS+ L+   K              +P+ +R+ +++   ED  L EL +QF
Sbjct: 354 YKDYASLFNRVSLTLNDGQK-----------TQDIPTPQRLINYRKGKEDYYLEELYYQF 402

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NLSEC  PL 
Sbjct: 403 GRYLLIASSRPGNLPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPAGSTNLSECTLPLI 462

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTA+  + A GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 463 DFIRTLVKPGEKTAKAYFGARGWTASISGNIFGFTAPLESEDMSWNFNPMAGPWLATHVW 522

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           ++Y+YT D+ FL++  Y L++  A F +D+L +  DG     PSTSPEH           
Sbjct: 523 DYYDYTRDKKFLKEVGYDLIKSSAIFAVDYLWKKPDGTYTAAPSTSPEH---------GP 573

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A+IRE+    I A++VL  +K E    E+VL+   ++ P K+   G ++EW
Sbjct: 574 IDEGTTFVHAVIREILMNAIDASKVLNVDKKERKQWEEVLR---KIAPYKVGRYGQLLEW 630


>gi|429749280|ref|ZP_19282413.1| hypothetical protein HMPREF9075_01080 [Capnocytophaga sp. oral
           taxon 332 str. F0381]
 gi|429168711|gb|EKY10529.1| hypothetical protein HMPREF9075_01080 [Capnocytophaga sp. oral
           taxon 332 str. F0381]
          Length = 805

 Score =  265 bits (676), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 191/608 (31%), Positives = 307/608 (50%), Gaps = 70/608 (11%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + F+ PA HFT++ PIGNGR+GAM++GG  ++ + LNE +LW+G   +   P A + L
Sbjct: 23  VSVVFHNPATHFTESAPIGNGRIGAMLYGGTSTDRIVLNEISLWSGGAQESDEPQAYEYL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
             ++ L+   +  EA A   + F         G+ A+     YQ+ GD+ +++ D+    
Sbjct: 83  PHIQQLLLERKNIEAEALLQQHFIAKGEGSCRGNGANCSYGCYQIFGDLLIKWKDTS--- 139

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
             + Y R L L+ ATA   Y       T+  F+   + +I  KIS  +     F V++  
Sbjct: 140 PVQNYSRILRLDEATAVTTYQRNGNTITQTAFADFKNDIIWVKISAQKP----FEVAVSL 195

Query: 180 LLDNHSYVNG-NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK----ISDDRGTI 234
               ++ V+   ++II+ G  P          N + +G+ F+ I+ ++    +  D   I
Sbjct: 196 TRKENAIVSYLPDRIILTGVLP----------NKEQQGMHFAGIVALESDGNMQKDEAAI 245

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
           +    ++L          LL  S S +  + N   +   P   + + LQ+  N  +    
Sbjct: 246 TVQNAREL----------LLKVSMSTNYNYTNSGLTAVSPLETTKAYLQTA-NSDFESAL 294

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
           T+    YQ+LF+R     +R       DT S      + + +R+++F   +  +L+ +L+
Sbjct: 295 TKSKSAYQELFNR-----NRWYAKANADTQS------LSTLQRLENFSKGKKDALLPILY 343

Query: 355 -QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
             FGRYLLI SSR G   ANLQG+W E+    W+   H+NINL+MNYW +   NLS   E
Sbjct: 344 YNFGRYLLICSSREGLLPANLQGLWAEEYQTPWNGDYHLNINLQMNYWLAEISNLSNLTE 403

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           PL  F   L  NG KTA+  Y A GWV H  ++ W  +S      VW     GGAWLC H
Sbjct: 404 PLHRFTKNLMPNGRKTAKSYYKAEGWVAHVISNPWFFTSPGES-AVWGSTLTGGAWLCQH 462

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP--D 530
           +W+HY +T D DFL K  YP+++   +F   +LI+     Y  T PS SPE+ ++ P   
Sbjct: 463 IWQHYLFTHDLDFL-KNYYPVMKEATAFFQSFLIKDPTTDYWVTAPSNSPENAYLFPIDS 521

Query: 531 GK--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE--KVLKSLPRLRPTKIAE 586
           GK   A    + TMDM I+RE+ +  I AA +L+ +++ + E  K++++ P   P +I +
Sbjct: 522 GKKVAAHTCIAPTMDMQIVRELLNNTIKAATILKVDDEKITEWKKIVENTP---PNRIGK 578

Query: 587 DGSIMEWV 594
            G + EW+
Sbjct: 579 KGDLNEWL 586


>gi|29348582|ref|NP_812085.1| hypothetical protein BT_3173 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340487|gb|AAO78279.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 815

 Score =  265 bits (676), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 190/599 (31%), Positives = 294/599 (49%), Gaps = 74/599 (12%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR-S 77
           ++PIGNG LGA + G V +E + LNE TLW G P      +Y    N  +   L ++R +
Sbjct: 63  SLPIGNGSLGANILGSVAAERITLNEKTLWKGGPNTSKGAEYYWDVNKQSAGVLKEIRQA 122

Query: 78  LVDSGQYAEATAASVKLFGHPA-----------DVYQLLGDIELEFDDSHLKYAEETYRR 126
            +D  +   A        G  A             +  +G++ +E   + L+ +   YRR
Sbjct: 123 FLDEDKEKAAQLTRNNFNGLAAYEEKDETPFRFGSFTTMGELYVETGLNELRMS--NYRR 180

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
            L L++A   V++    V++ R++F S PD V+V K + ++SG  +  +S   +S   ++
Sbjct: 181 ILSLDSAMVVVQFDKDGVQYQRKYFISYPDSVMVMKFTANQSGKQNLILSYCPNSEAKSN 240

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
              +G + ++  G               D  G++F+    IK     GT+ A E+ +L V
Sbjct: 241 LRADGKDGLVYTGVL-------------DNNGMKFA--FRIKAIHKGGTLEA-ENDRLIV 284

Query: 245 EGSDWAVLLLVASSSFDGPFINP--SDSK----KDPTSESMSALQSIRNLSYSDLYTRHL 298
           +G+D  V LL A + +   F NP   D K     DP   +   +       Y +LY  H 
Sbjct: 285 KGADEVVFLLTADTDYKMNF-NPDFKDPKTYVGNDPEQTTRIMMDQAVQKGYDELYRNHE 343

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
            D+  LF+RV +QL+    DI +          +P+ +R+ +++    D  L +L +QFG
Sbjct: 344 ADHTALFNRVRLQLN---PDISSPN--------LPTYQRLANYKKGTPDYQLEQLYYQFG 392

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI+SSRPG   ANLQG+W+ +L   W    H NIN++MNYW +   NLSEC  PL D
Sbjct: 393 RYLLIASSRPGNMPANLQGMWHNNLDGPWRVDYHNNINIQMNYWPACSANLSECTWPLID 452

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV-WALWPMGGAWLCTHLWE 476
           F+  L   G +TAQ  + A GW      +I+  ++     ++ W L P  G WL TH+WE
Sbjct: 453 FIRSLVKPGEQTAQAYFNARGWTASISANIFGFTAPLSSNMMSWNLNPTAGPWLATHIWE 512

Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
           +Y+YT D+ FL++  Y L++  A F +D L    DG     PSTSPEH           +
Sbjct: 513 YYDYTRDKKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPI 563

Query: 537 SYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
               T   A++RE+    I A++ L  +  E    EK+L    +L P +I   G +MEW
Sbjct: 564 DEGVTFAHAVVREILLDAIQASKELGIDSKERKQWEKILD---KLVPYRIGRYGQLMEW 619


>gi|262405728|ref|ZP_06082278.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
 gi|294644470|ref|ZP_06722231.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
 gi|294806820|ref|ZP_06765646.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
 gi|345510919|ref|ZP_08790478.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
 gi|229442942|gb|EEO48733.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
 gi|262356603|gb|EEZ05693.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
 gi|292640192|gb|EFF58449.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
 gi|294445990|gb|EFG14631.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 815

 Score =  265 bits (676), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 184/597 (30%), Positives = 288/597 (48%), Gaps = 70/597 (11%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
           ++PIGNG LGA + G + +E + LNE TLW G P      +Y    N  +   L ++R  
Sbjct: 63  SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122

Query: 79  VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
              G   +A   + + F                  +  +G++ +E   S +  +   YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
            L L++A A V++    + + R++F S PD V+V K +  + G  +  +S   ++   +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
              +GN+ ++  G               +  G++F+    IK     GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
           + +D  V LL A + +   F       K     DP+  +++ + +     Y +LY  H  
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
           DY  LF+RV  +++            E     +P+ +R+ S++    D  L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI+SSRPG   ANLQG+W+ +    W    H NIN++MNYW + P NLSEC  PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
           +  L   G KTAQ  + A GW      +I+  ++    K + W L P  G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTVGPWLATHIWEY 513

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y+YT D  FL++  Y L++  A F +D L    DG     PSTSPEH           V 
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
              T   A++RE+    I A++VL    DA   K  ++ L +L P +I   G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEW 619


>gi|295085494|emb|CBK67017.1| Trehalose and maltose hydrolases (possible phosphorylases)
           [Bacteroides xylanisolvens XB1A]
          Length = 782

 Score =  265 bits (676), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 188/606 (31%), Positives = 296/606 (48%), Gaps = 84/606 (13%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 75  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 134

Query: 77  SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
                G   +A   + + F      Y   G+    F    +  ++  ET         Y+
Sbjct: 135 KAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 193

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
           R L L++A A V++   +V + R +F S P  V+V + S  + G  +L F+ + + +   
Sbjct: 194 RILSLDSAMAVVQFKKDHVVYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 253

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
           +   + N  ++              +A+ D  G+++  ++ I+     GT+S   D KL 
Sbjct: 254 NMASDSNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLSN-ADGKLM 297

Query: 244 VEGSDWAVLLLVASSSFDGPF------------INPSDSKKDPTSESMSALQSIRNLSYS 291
           V+G+D  V  + A + +   F            +NP ++ K+  + ++S         Y+
Sbjct: 298 VKGADEVVFYITADTDYKPDFDPDFKDPKTYVGVNPEETTKEWMNNAVSQ-------GYT 350

Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLV 350
            L+++H +DY  LF RV + L+ + K              +P+ +R+K+++  + D  L 
Sbjct: 351 ALFSQHYNDYAALFDRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYDLE 399

Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
           EL FQFGRYLLISSSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+E
Sbjct: 400 ELYFQFGRYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNE 459

Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAW 469
           C  PL DF+  L   G KTA+  + A GW      +I+  ++  +   + W   PM G W
Sbjct: 460 CMLPLVDFIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPW 519

Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
           L TH+WE+Y+YT D  FL++  Y L++  A F +D+L    DG     PSTSPEH     
Sbjct: 520 LATHIWEYYDYTRDLTFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH----- 574

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAED 587
                 +   +T   A++RE+    I A++VL  +K E    E VL +   L P KI   
Sbjct: 575 ----GPIDQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRY 627

Query: 588 GSIMEW 593
           G +MEW
Sbjct: 628 GQLMEW 633


>gi|336403471|ref|ZP_08584186.1| hypothetical protein HMPREF0127_01499 [Bacteroides sp. 1_1_30]
 gi|335945801|gb|EGN07608.1| hypothetical protein HMPREF0127_01499 [Bacteroides sp. 1_1_30]
          Length = 815

 Score =  264 bits (675), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 184/597 (30%), Positives = 288/597 (48%), Gaps = 70/597 (11%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
           ++PIGNG LGA + G + +E + LNE TLW G P      +Y    N  +   L ++R  
Sbjct: 63  SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122

Query: 79  VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
              G   +A   + + F                  +  +G++ +E   S +  +   YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180

Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
            L L++A A V++    + + R++F S PD V+V K +  + G  +  +S   ++   +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
              +GN+ ++  G               +  G++F+    IK     GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
           + +D  V LL A + +   F       K     DP+  +++ + +     Y +LY  H  
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
           DY  LF+RV  +++            E     +P+ +R+ S++    D  L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLI+SSRPG   ANLQG+W+ +    W    H NIN++MNYW + P NLSEC  PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
           +  L   G KTAQ  + A GW      +I+  ++    K + W L P  G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTVGPWLATHIWEY 513

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
           Y+YT D  FL++  Y L++  A F +D L    DG     PSTSPEH           V 
Sbjct: 514 YDYTRDTRFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
              T   A++RE+    I A++VL    DA   K  ++ L +L P +I   G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEW 619


>gi|423230473|ref|ZP_17216877.1| hypothetical protein HMPREF1063_02697 [Bacteroides dorei
           CL02T00C15]
 gi|423240882|ref|ZP_17221996.1| hypothetical protein HMPREF1065_02619 [Bacteroides dorei
           CL03T12C01]
 gi|423244182|ref|ZP_17225257.1| hypothetical protein HMPREF1064_01463 [Bacteroides dorei
           CL02T12C06]
 gi|392630838|gb|EIY24820.1| hypothetical protein HMPREF1063_02697 [Bacteroides dorei
           CL02T00C15]
 gi|392642736|gb|EIY36499.1| hypothetical protein HMPREF1064_01463 [Bacteroides dorei
           CL02T12C06]
 gi|392643844|gb|EIY37593.1| hypothetical protein HMPREF1065_02619 [Bacteroides dorei
           CL03T12C01]
          Length = 818

 Score =  264 bits (675), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 196/639 (30%), Positives = 298/639 (46%), Gaps = 82/639 (12%)

Query: 2   MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
           + A+ST  T  L I F+ P      A  ++ A         +PIGNG LG  V G + +E
Sbjct: 17  LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76

Query: 47  TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
            + LNE TLW G P            N ++   L ++R     G   +A   + K F   
Sbjct: 77  RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136

Query: 99  ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
           AD             +  LG+  +E   S +      Y+R L L++A A V +    V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEI--GMTNYKRILSLDSAMAVVSFRKDEVNY 194

Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
            R++F S PD V+V K +    G  +L F+   +         +G N+++  G       
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKADGPNRLLYTGCL----- 249

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
                     K  Q    L I+  +  G+++   D K  V  +D  + LL A +    +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298

Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
           +  F +P      DP   +++ + +    SY++L  RH  DY +LF RV +QL+ R+P  
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357

Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
               T     +  +P+ +R+  ++  + D  L E+ +QFGRYLLI+SSRPG   ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413

Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
           W   +   W    H NIN++MNYW +   NL+EC  PL DF+  L   G KTAQ  + A 
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473

Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
           GW      +I+  +S      + W   PM G WL TH+WE+Y+YT D+ FL++  Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533

Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
             A+F +D+L    DG     PSTSPEH           V   +T   A++RE+    I 
Sbjct: 534 SSANFAIDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584

Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A++ L  +  +    + VL     L P +I   G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLN---HLVPYQIGRYGQLMEW 620


>gi|318078709|ref|ZP_07986041.1| alpha-L-fucosidase [Streptomyces sp. SA3_actF]
          Length = 769

 Score =  264 bits (675), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 188/594 (31%), Positives = 290/594 (48%), Gaps = 59/594 (9%)

Query: 15  ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNP 66
           + +  PA  + T+++P+GNG LGA V+G +P+E ++  E TLWTG PG       ++ NP
Sbjct: 32  LRYGAPATDWETESLPVGNGALGASVFGTLPTEHIQFAEKTLWTGGPGTPGYRYGNWENP 91

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
             P AL+ VR+ +++        A+ +L G P   Y   Q  GD+ ++ D +    + E 
Sbjct: 92  R-PDALASVRADIEARTKITPEDAAARL-GQPRIGYGGHQTFGDLLIDVDGA--PGSAEG 147

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           Y R LDL  A A V Y      F R  F+S PD+V+V   +    GS+  N+   S   +
Sbjct: 148 YTRTLDLAQALATVSYPHDGTTFHRTVFASCPDKVLVGHFTADRGGSVGLNLRYTSPRQD 207

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
            +     +++ + G                  G++F A  +I++  + GT++A  D+ L 
Sbjct: 208 FTATTNGDRLTVRGAL-------------QDNGMRFEA--QIRLLSEGGTVTANGDR-LT 251

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V G+D A  +L A + +   +  P     DP     +A+       Y +L  RH  D+  
Sbjct: 252 VSGADSAWFVLSAGTDYADTY--PDYRGADPHDRVTTAVDQAAARPYRELLDRHTSDHAA 309

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           LF RV + L +       D+  +   D +  A       + +D +L  L FQ+GRYLLI+
Sbjct: 310 LFSRVVLDLGQ-------DSAPDRTTDALLKA--YTGGNSADDRALEALFFQYGRYLLIA 360

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSR G+  ANLQG WN   +P W +  HVNINL+MNYW +   NL+E   P   F+  L 
Sbjct: 361 SSRAGSLPANLQGAWNNSTAPPWSADYHVNINLQMNYWPAEATNLAETTAPYDRFVEALR 420

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
             G  TA+  + A GWV+H +T  +  +   D     W  +P   AWL + L+EHY +  
Sbjct: 421 APGRTTARSMFDARGWVVHDETTPFGFTGVHDWPTSFW--FPEAAAWLTSQLYEHYRFDG 478

Query: 483 DRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSS 540
             D+L   AYP ++  A F +D L  +  D  L   PS SPEH +F A           +
Sbjct: 479 STDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPSFSPEHGDFTA----------GA 528

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
            M   I+RE+F   + AA+ L  ++ A    + ++L R+ P  +I   G +MEW
Sbjct: 529 AMSQQIVRELFLNTLEAAQTL-GDDPAFRATLKETLDRIDPGLRIGSWGQLMEW 581


>gi|265752589|ref|ZP_06088158.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
 gi|263235775|gb|EEZ21270.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
          Length = 818

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 195/639 (30%), Positives = 300/639 (46%), Gaps = 82/639 (12%)

Query: 2   MNAESTSTTNPLKITFNGP------AKHFT---------DAIPIGNGRLGAMVWGGVPSE 46
           + A+ST  T  L I F+ P      A  ++         +++PIGNG LG  V G + +E
Sbjct: 17  LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPVTDKAWENNSLPIGNGSLGGNVMGSIAAE 76

Query: 47  TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
            + LNE TLW G P            N ++   L ++R     G   +A   + K F   
Sbjct: 77  RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136

Query: 99  ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
           AD             +  LG+  +E   S +      Y+R L L++A A V +    V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGIT--NYKRILSLDSAMAVVSFRKDEVNY 194

Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
            R++F S PD V+V K +    G  +L F+   +         +G N+++  G       
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKADGPNRLLYTGCL----- 249

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
                     K  Q    L I+  +  G+++   D K  V  +D  + LL A +    +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298

Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
           +  F +P      DP   +++ + +    SY++L  RH  DY +LF RV +QL+ R+P  
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357

Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
               T     +  +P+ +R+  ++  + D  L E+ +QFGRYLLI+SSRPG   ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413

Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
           W   +   W    H NIN++MNYW +   NL+EC  PL DF+  L   G KTAQ  + A 
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473

Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
           GW      +I+  +S      + W   PM G WL TH+WE+Y+YT D+ FL++  Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533

Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
             A+F +D+L    +G     PSTSPEH           V   +T   A++RE+    I 
Sbjct: 534 SSANFAIDYLWHKPEGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584

Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A++ L  +  +    + VLK    L P +I   G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEW 620


>gi|317505420|ref|ZP_07963340.1| alpha-L-fucosidase [Prevotella salivae DSM 15606]
 gi|315663460|gb|EFV03207.1| alpha-L-fucosidase [Prevotella salivae DSM 15606]
          Length = 861

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 186/616 (30%), Positives = 295/616 (47%), Gaps = 85/616 (13%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG------------------------- 61
           ++PIGNG +GA ++G + +E + LNE +LW G PG                         
Sbjct: 79  SLPIGNGSVGANIFGSISAERITLNEKSLWRGGPGVSHDASYYWNVNDNNVFPVNIDDGH 138

Query: 62  --DY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQL------LGDI-- 108
              Y    N  +   L D+R+   +G  A+A + + K F   A   Q        G+   
Sbjct: 139 DASYYWNVNKRSVSVLKDIRAAFLAGDKAKADSLTRKNFNGWASYEQRDEKPFRFGNFTT 198

Query: 109 --ELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGS 166
             EL  +    +     YRREL L++A   V+++   V + R  F S PD V+V +   +
Sbjct: 199 MGELFIETGLTEEGISHYRRELSLDSARTLVQFNQNGVCYQRTAFVSYPDNVLVLRFKAN 258

Query: 167 ESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
             G  +L+F+ + + +       +G N ++  G               D  G+Q+  ++ 
Sbjct: 259 AEGRQNLNFSYAPNPVSTGQMQADGANGLVYRGAL-------------DDNGMQY--VVR 303

Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESM 279
           I+     G+++   D  LK+  +D  + L+ A +    +F+  F NP       P   + 
Sbjct: 304 IQAVTKGGSVTNEHDT-LKIRHADEVMFLITADTDYRINFNPDFTNPKTYVGVQPEVTTQ 362

Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
           + +Q      Y+ L++RH  DY  LF RV ++L+           S    D  P+A+R++
Sbjct: 363 AWMQQAEKKDYNQLFSRHYRDYSALFQRVKLRLN----------PSNHAADDKPTAQRLE 412

Query: 340 SFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
           +++    D +L EL +QFGRYLLI+SSRPGT  ANLQG+W+ ++   W    H NINL+M
Sbjct: 413 AYRNGTTDNALEELYYQFGRYLLIASSRPGTLPANLQGLWHNNVDGPWHVDYHNNINLQM 472

Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK- 457
           NYW     +L EC  PL DF+  L   G++TA+  Y A GW     ++I+  ++    + 
Sbjct: 473 NYWPVHTTHLDECALPLIDFVRSLVKPGAETAKAYYGARGWTTSVSSNIFGFTAPLSSED 532

Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
           + W L PMGG WL THLWE+Y++T D+  L    Y L++  A F +D+L    DG     
Sbjct: 533 MSWNLCPMGGPWLATHLWEYYDFTRDKQLLRSTLYDLIKQSADFAVDYLWRKPDGTYTAA 592

Query: 518 PSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP 577
           PSTSPEH           +    T   A+IRE+    I+A++VL  + +A  ++  + L 
Sbjct: 593 PSTSPEH---------GPIDEGVTFVHAVIREILLDAIAASKVLGVDVEAR-KQWQQVLN 642

Query: 578 RLRPTKIAEDGSIMEW 593
            L P +I   G + EW
Sbjct: 643 HLAPYRIGRYGQLQEW 658


>gi|424665546|ref|ZP_18102582.1| hypothetical protein HMPREF1205_01421 [Bacteroides fragilis HMW
           616]
 gi|404574619|gb|EKA79368.1| hypothetical protein HMPREF1205_01421 [Bacteroides fragilis HMW
           616]
          Length = 829

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 179/600 (29%), Positives = 291/600 (48%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
           + ++PIGNG +GA + G + +E +  NE TLW G P      DA           L ++R
Sbjct: 75  SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + K F                  +  +G+  +E   S +  ++  Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESSREKPFRFGNFTTMGEFYIETGLSAVNMSD--Y 192

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++   +V + R++F S P  V+  +      G  +L+F+ + + +  
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYAPNPVST 252

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G N +                A+ D  G+Q+  ++ I  +   GT+S   D K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHATAKGGTLSN-ADGKI 296

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRH 297
            ++ +D  V L+ A +    +FD  F +P      +P   +   + +   + Y  L+ +H
Sbjct: 297 TIKDADEVVFLVTADTDYKINFDPDFKDPKTYVGVNPAETTRQWMDNAVTMGYDVLFKQH 356

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            DDY  LF+RV +QL+            ++   ++P+A+R+++++  + D  L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLN-----------PDQQSPSLPTAKRLQNYRKGQPDFYLEELYYQF 405

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+EC  PL 
Sbjct: 406 GRYLLITSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECTLPLV 465

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  +   GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTAPLESEDMSWNFNPMAGPWLATHIW 525

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D+ FL++  Y L++  A F  D+L    DG     PSTSPEH           
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A+IRE+    I A++VL  +  E    ++VL     L P K+   G +MEW
Sbjct: 577 IDEGTTFVHAVIREILLDAIEASKVLGVDSKERKQWQEVLA---HLAPYKVGRYGQLMEW 633


>gi|160884032|ref|ZP_02065035.1| hypothetical protein BACOVA_02006 [Bacteroides ovatus ATCC 8483]
 gi|423291498|ref|ZP_17270346.1| hypothetical protein HMPREF1069_05389 [Bacteroides ovatus
           CL02T12C04]
 gi|156110374|gb|EDO12119.1| hypothetical protein BACOVA_02006 [Bacteroides ovatus ATCC 8483]
 gi|392663498|gb|EIY57048.1| hypothetical protein HMPREF1069_05389 [Bacteroides ovatus
           CL02T12C04]
          Length = 829

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 186/600 (31%), Positives = 296/600 (49%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-----DY---TNPDAPKALSDVR 76
           + ++PIGNG LGA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 74  SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGVDYYWNVNKQSAHLLDEIR 133

Query: 77  SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + + F     + AD         +  +G+  +E   + +  ++  Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++    V + R  F S P  V+V + S  +SG  +L F+ + + L  
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +GN  ++               A+ D  G+++  ++ I+     GT+S   D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVCIQAETKGGTLSN-ADGKL 295

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D  V  + A +    +FD  F +P      +P   +   + +     Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            +DY  LF+RV + L+ + K +            +P+++R+K+++  + D  L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLGELYYQF 404

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL+EC  PL 
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  + A GW      +I+  ++    + + W   PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESRDMSWNFNPMAGPWLATHIW 524

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D  FL++  Y L++  A F +D+L    DG     PSTSPEH           
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A++RE+    I A++VL  +K      E VL +   L P +I   G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKGRKQWEHVLAN---LVPYQIGRYGQLMEW 632


>gi|256818918|ref|YP_003140197.1| glycoside hydrolase [Capnocytophaga ochracea DSM 7271]
 gi|256580501|gb|ACU91636.1| glycoside hydrolase family protein [Capnocytophaga ochracea DSM
           7271]
          Length = 835

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 191/605 (31%), Positives = 309/605 (51%), Gaps = 63/605 (10%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + F+ PA HFT++IPIGNGRLGAM++G    + + LNE +LW+G   D  +P+A   L
Sbjct: 52  VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQDADDPNAHNYL 111

Query: 73  SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
            +++ L+  G+  EA A   + F         G  A+     YQ+L ++ L++  +    
Sbjct: 112 KEIQKLLLEGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 168

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
             + Y+R L L+ ATA   +   N    +  F+   + VI  KI  +    L+ ++SL  
Sbjct: 169 PIQDYKRVLRLDEATAVTSFKRDNNSIEQTAFADFKNDVIWVKIKATSP--LNLDISLFR 226

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
             +N +    NN+I + G  P          ND  +G+ F+++++++     G I +   
Sbjct: 227 K-ENATITYQNNKISLNGVLP----------NDGKEGMHFASVVDVQTD---GKIESTH- 271

Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
           K + ++ +    L + A ++++   G  ++ S +KK     +   LQ    +S+      
Sbjct: 272 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 325

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-Q 355
               +Q+LF+R                 +  N + + + ER++ F   E  +L+ +L+  
Sbjct: 326 SSIVFQRLFNRNRWYGK-----------ANANTEGLTTFERLERFYKGEQDALLPILYYN 374

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRYLLISSSR G   ANLQG+W E+    W+   H+NIN++MNYW + P NLS+  EPL
Sbjct: 375 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 434

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
             F   L  NGSKTA+  Y A+GWV H  ++ W  +S       W     GGAWLC H+W
Sbjct: 435 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 493

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
           +HY +T D +FL +  YP+L+   +F    LI+    GY  T PS SPE+ ++ P   DG
Sbjct: 494 QHYLFTKDINFL-REYYPVLKEATTFFESLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 552

Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
           K  +     + TMDM I+RE+F+    AA++L  +     E    S   + P +I ++G 
Sbjct: 553 KKQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKEGD 611

Query: 590 IMEWV 594
           + EW+
Sbjct: 612 LNEWL 616


>gi|212692624|ref|ZP_03300752.1| hypothetical protein BACDOR_02121 [Bacteroides dorei DSM 17855]
 gi|212664909|gb|EEB25481.1| hypothetical protein BACDOR_02121 [Bacteroides dorei DSM 17855]
          Length = 818

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 195/639 (30%), Positives = 299/639 (46%), Gaps = 82/639 (12%)

Query: 2   MNAESTSTTNPLKITFNGP------AKHFT---------DAIPIGNGRLGAMVWGGVPSE 46
           + A+ST  T  L I F+ P      A  ++         +++PIGNG LG  V G + +E
Sbjct: 17  LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPVTDKAWENNSLPIGNGSLGGNVMGSIAAE 76

Query: 47  TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
            + LNE TLW G P            N ++   L ++R     G   +A   + K F   
Sbjct: 77  RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136

Query: 99  ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
           AD             +  LG+  +E   S +      Y+R L L++A A V +    V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEI--GMTNYKRILSLDSAMAVVSFRKDEVNY 194

Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
            R++F S PD V+V K +    G  +L F+   +         +G N+++  G       
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKADGPNRLLYTGCL----- 249

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
                     K  Q    L I+  +  G+++   D K  V  +D  + LL A +    +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298

Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
           +  F +P      DP   +++ + +    SY++L  RH  DY +LF RV +QL+ R+P  
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357

Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
               T     +  +P+ +R+  ++  + D  L E+ +QFGRYLLI+SSRPG   ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413

Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
           W   +   W    H NIN++MNYW +   NL+EC  PL DF+  L   G KTAQ  + A 
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473

Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
           GW      +I+  +S      + W   PM G WL TH+WE+Y+YT D+ FL++  Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533

Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
             A+F +D+L    DG     PSTSPEH           V   +T   A++RE+    I 
Sbjct: 534 SSANFAIDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584

Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A++ L  +  +    + VL     L P +I   G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLN---HLVPYQIGRYGQLMEW 620


>gi|318059330|ref|ZP_07978053.1| alpha-L-fucosidase [Streptomyces sp. SA3_actG]
          Length = 783

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 188/594 (31%), Positives = 290/594 (48%), Gaps = 59/594 (9%)

Query: 15  ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNP 66
           + +  PA  + T+++P+GNG LGA V+G +P+E ++  E TLWTG PG       ++ NP
Sbjct: 46  LRYGAPATDWETESLPVGNGALGASVFGTLPTEHIQFAEKTLWTGGPGTPGYRYGNWENP 105

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
             P AL+ VR+ +++        A+ +L G P   Y   Q  GD+ ++ D +    + E 
Sbjct: 106 R-PDALASVRADIEARTKITPEDAAARL-GQPRIGYGGHQTFGDLLIDVDGA--PGSAEG 161

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           Y R LDL  A A V Y      F R  F+S PD+V+V   +    GS+  N+   S   +
Sbjct: 162 YTRTLDLAQALATVSYPHDGTTFHRTVFASCPDKVLVGHFTADRGGSVGLNLRYTSPRQD 221

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
            +     +++ + G                  G++F A  +I++  + GT++A  D+ L 
Sbjct: 222 FTATTNGDRLTVRGAL-------------QDNGMRFEA--QIRLLSEGGTVTANGDR-LT 265

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V G+D A  +L A + +   +  P     DP     +A+       Y +L  RH  D+  
Sbjct: 266 VSGADSAWFVLSAGTDYADTY--PDYRGADPHDRVTTAVDQAAARPYRELLDRHTSDHAA 323

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           LF RV + L +       D+  +   D +  A       + +D +L  L FQ+GRYLLI+
Sbjct: 324 LFSRVVLDLGQ-------DSAPDRTTDALLKA--YTGGNSADDRALEALFFQYGRYLLIA 374

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSR G+  ANLQG WN   +P W +  HVNINL+MNYW +   NL+E   P   F+  L 
Sbjct: 375 SSRAGSLPANLQGAWNNSTAPPWSADYHVNINLQMNYWPAEATNLAETTAPYDRFVEALR 434

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
             G  TA+  + A GWV+H +T  +  +   D     W  +P   AWL + L+EHY +  
Sbjct: 435 APGRTTARSMFDARGWVVHDETTPFGFTGVHDWPTSFW--FPEAAAWLTSQLYEHYRFDG 492

Query: 483 DRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSS 540
             D+L   AYP ++  A F +D L  +  D  L   PS SPEH +F A           +
Sbjct: 493 STDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPSFSPEHGDFTA----------GA 542

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
            M   I+RE+F   + AA+ L  ++ A    + ++L R+ P  +I   G +MEW
Sbjct: 543 AMSQQIVRELFLNTLEAAQTL-GDDPAFRATLKETLDRIDPGLRIGSWGQLMEW 595


>gi|332882772|ref|ZP_08450383.1| hypothetical protein HMPREF9074_06194 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|332679274|gb|EGJ52260.1| hypothetical protein HMPREF9074_06194 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
          Length = 805

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 191/601 (31%), Positives = 300/601 (49%), Gaps = 56/601 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + F  PAKHFT+++PIGNGRLGA+++G   ++ + LNE +LW+G   +  +P+A   L
Sbjct: 23  VSVVFKQPAKHFTESLPIGNGRLGAILFGKTDTDRIVLNEISLWSGGYQEADDPEAHTYL 82

Query: 73  SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
            +++ L+  G+  EA A   K F         G  A+     YQ+  D+ L++ +   + 
Sbjct: 83  KEIQQLLLEGKNLEAQALLQKHFIARGKGSCHGQGANCSYGCYQVFADLLLDWKN---QT 139

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
             + Y+R L L+ ATA   Y+       +  F+   + ++  KI+G++      N+SL  
Sbjct: 140 PVKDYKRVLRLDEATAITTYTRDENSIEQVAFADFKNDLLWIKITGTKP--FDLNISLFR 197

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
             +N +    NN I + G  P          +D  +G+ F++ ++++      T    E+
Sbjct: 198 K-ENATISYQNNHITLTGVLP----------DDKKEGMHFASAIDVQ------TDGKAEN 240

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
           K+  +E      L+L  S + +  + N   S      ++ S LQ   + S+         
Sbjct: 241 KEKAIEIQAAKELILKISMATNYQYKNGGLSNVSVKEKAESYLQRCTS-SFEAALAESKT 299

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGR 358
            YQ LF++     +R   +      +  N   + + ER++ F + D+D  L  L + FGR
Sbjct: 300 IYQGLFNK-----NRWYGN------ANSNTSHLSTYERLEGFYKGDKDALLPILYYNFGR 348

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSSR G   ANLQG+W E+    W+   H+NIN++MNYW +   NLSE  EPL  F
Sbjct: 349 YLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEATNLSELTEPLNRF 408

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
              L  NG KTA+  Y A GWV H  ++ W  +S      VW     GGAWLC H+W+HY
Sbjct: 409 TKNLVPNGYKTAKAYYNADGWVAHVISNPWFYTSPGE-SAVWGSTLTGGAWLCEHIWQHY 467

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAP----DGKL 533
            +T D DFL K  YP+L+    F    LI E   GY  T PS SPE+ ++ P      ++
Sbjct: 468 LFTHDIDFL-KEYYPVLKQATDFFKSLLIKEPKKGYWITAPSNSPENAYLLPSKDNKKQV 526

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
                + TMDM I+RE+FS  + AA +L  + D   +     +    P +I + G + EW
Sbjct: 527 GNTCIAPTMDMQIVRELFSNTMQAATILGVDSDKFSQWT-DIIKHTAPNRIGKKGDLNEW 585

Query: 594 V 594
           +
Sbjct: 586 L 586


>gi|423219674|ref|ZP_17206170.1| hypothetical protein HMPREF1061_02943 [Bacteroides caccae
           CL03T12C61]
 gi|392624879|gb|EIY18957.1| hypothetical protein HMPREF1061_02943 [Bacteroides caccae
           CL03T12C61]
          Length = 831

 Score =  263 bits (673), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 184/600 (30%), Positives = 288/600 (48%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG +GA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 75  SQSLPIGNGSIGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIR 134

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G  A+A   + + F                  +  +G+  +E   + +  ++  Y
Sbjct: 135 KAFTEGDQAKAERLTRQNFNSEVPYEGSREKPFRFGSFTTMGEFYVETGLNIIGMSD--Y 192

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++   +V + R +F S P  V+V + S    G  +L F+ + + +  
Sbjct: 193 KRILSLDSAMAVVQFKKDDVAYQRNYFISYPANVLVMRFSADRPGKQNLIFSYAPNPVST 252

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                 G+N ++              +A  D  G+++  ++ I+     GT+    + KL
Sbjct: 253 GSMVAQGDNGLVY-------------SAALDNNGMKY--VVRIQAETKGGTLVN-RNGKL 296

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRH 297
            V+G+D  V  + A + +   F     + K     +P   +   L +     YS L   H
Sbjct: 297 TVKGADEVVFYVTADTDYKANFAPDFKNPKTYVGVNPVETTGQWLANAVAKGYSALLNEH 356

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
             DY  LF+RV + L+ + K              +P+ +R+K+++  + D  L EL FQF
Sbjct: 357 YQDYAALFNRVKLNLNPTVK-----------TGNLPTGQRLKNYRKGQPDYYLEELYFQF 405

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL EC  PL 
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLEECMLPLI 465

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  + A GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 466 DFIRTLVKPGEKTAQSYFGARGWTASISANIFGFTAPLESQDMSWNFNPMAGPWLATHIW 525

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D  FL++  Y L++  A+F +D+L    DG     PSTSPEH           
Sbjct: 526 EYYDYTRDLKFLKETGYELIKSSANFAVDYLWHKPDGTYTAAPSTSPEH---------GP 576

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A++RE+    I A+E L  +K E    E+VL +   L P KI   G +MEW
Sbjct: 577 IDQGATFVHAVVREILLDAIKASEELGVDKKERKEWEQVLAN---LVPYKIGRYGQLMEW 633


>gi|393782601|ref|ZP_10370784.1| hypothetical protein HMPREF1071_01652 [Bacteroides salyersiae
           CL02T12C01]
 gi|392672828|gb|EIY66294.1| hypothetical protein HMPREF1071_01652 [Bacteroides salyersiae
           CL02T12C01]
          Length = 804

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 177/607 (29%), Positives = 297/607 (48%), Gaps = 67/607 (11%)

Query: 17  FNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD-------- 67
           F  PA+++++ A+ IGNG +GA  +G V  E   + E T WTG P  ++ PD        
Sbjct: 35  FTYPARNWSEQALHIGNGYMGASFYGDVEKERFDIAEKTFWTGGP--HSVPDFNYGVVKG 92

Query: 68  APKALSDVRSLVDSGQYAEATAAS-VKLFGHPADV--YQLLGDIELEFDDSHLKYAEETY 124
               ++ +R  +   ++AEA + S + + G   +   + ++G++ ++F   +     + Y
Sbjct: 93  GKDKIAAIRRSITDRRFAEADSLSRLYMVGDYTNYGYFSMVGNLFVDFGKKNQPV--QNY 150

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
            R +DL+T+   V+Y+ G+V F RE+F S PD+++    +  + G +SF++S   +    
Sbjct: 151 LRGIDLSTSRGFVEYTQGDVRFNREYFCSYPDKLMALHFTADQKGKISFSLSHSLVYQPE 210

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
               G +++I  G   G              G+ ++  + +K+    G+I  +  +++ V
Sbjct: 211 KVTEGKDELIFNGIIQGN-------------GLGYT--IRMKVLHQGGSIK-VGHQQITV 254

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
           EG+D A +     + +    + P    + P   +   ++S     Y  +   H+ DYQ L
Sbjct: 255 EGADEATVFYTVDTEYSP--VYPLYKGEKPRQTTEKIIKSAITKGYETVKHTHISDYQTL 312

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLI 362
           ++RV   LS        DT SE+    +P+  RVK  Q    +D SL  L F   RYLLI
Sbjct: 313 YNRVKFTLS-------GDTASEK----LPTDIRVKQLQQGFTDDASLKVLWFNLSRYLLI 361

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           S+SRPGT  +NLQG+WN      W+     NINL+  YW   P  L EC+E   +++  L
Sbjct: 362 SASRPGTLPSNLQGVWNTFEKAPWNGNFQSNINLQEMYWGCGPTQLPECEEAYLEWIEGL 421

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
              G KTA   Y   GWV H   +IW  +      ++W L+P G AW C HLWEHY +  
Sbjct: 422 VEPGRKTAGEYYGTKGWVSHSTGNIWGHTVPGD-DILWGLYPSGAAWHCRHLWEHYAFGG 480

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST- 541
           D+ +LE + YP+++  A F L+ ++E +  +    PS S EH     +G  + V YS+  
Sbjct: 481 DKSYLETKGYPIMKEAAEFWLENMVE-YQKHFIIAPSVSAEHGIEMKNG--SPVDYSTAN 537

Query: 542 --------------MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
                          D+ ++ ++++ +I A+E L   + A  EKV  +  +L P KI   
Sbjct: 538 GEQTAGRIFTLPAYQDIEMVYDLYTHVIKASECL-GIDSAFREKVTIARNKLLPLKIGRY 596

Query: 588 GSIMEWV 594
           G + EW+
Sbjct: 597 GQLQEWI 603


>gi|210613381|ref|ZP_03289701.1| hypothetical protein CLONEX_01908 [Clostridium nexile DSM 1787]
 gi|210151223|gb|EEA82231.1| hypothetical protein CLONEX_01908 [Clostridium nexile DSM 1787]
          Length = 1549

 Score =  262 bits (669), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 192/611 (31%), Positives = 305/611 (49%), Gaps = 88/611 (14%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG----DYTNPDAPK------ALSDVRS 77
           +PIGNG +GA V+G + SE L  NE TLWTG P     DY   ++ +      +L +++ 
Sbjct: 73  LPIGNGDMGANVYGEIASEHLTFNEKTLWTGGPSESRKDYMGGNSTEKGQDGASLKNIQK 132

Query: 78  LVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
           L   G+ +EATAA   L      G+ A  YQ  GDI  ++ D   K A E Y+R+LDL T
Sbjct: 133 LFAEGKTSEATAACNNLLVGISNGYGA--YQPWGDIYFDYKDITEKNATE-YQRDLDLKT 189

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           A + V +     ++TRE F S+ D V+V ++    S  L+ +V   S     +   GN+ 
Sbjct: 190 AISTVSFKEDGTQYTREFFMSHDDDVLVARLEAKGSEKLNLDVRFPSKQGGKTVAEGNDT 249

Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
           + + G     ++             ++++ L +K   D G+++   DK L V+ +    +
Sbjct: 250 LKLCGALTDNQM-------------KYASYLTVKA--DNGSVTGSGDK-LTVKDASAVTV 293

Query: 253 LLVASSSFDGPFINPSDSKKD---PTSESMSAL-QSIRNL-------SYSDLYTRHLDDY 301
            L A++ +   F N  D  +D    T E+  AL + ++          Y ++   HL+DY
Sbjct: 294 YLSAATDYKNAFYN-EDKTEDYYYRTGETDEALAKRVKETVDKAVEKGYKEVKATHLEDY 352

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q+LF+RVS+ + +        T SE+  D +    +  S    E   L  +LFQ+GRYL 
Sbjct: 353 QELFNRVSLNIGQ--------TVSEKTTDDLLKTYKDGSASESEKRQLENMLFQYGRYLT 404

Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           I+SSR  +Q+ +NLQG+WN   +P W S  H+N+NL+MNYW +   NLSEC  PL D++ 
Sbjct: 405 IASSREDSQLPSNLQGVWNSLTNPPWSSDYHMNVNLQMNYWPTYSTNLSECALPLIDYVD 464

Query: 421 YLSINGSKTAQV-------NYLASGWVIHHKTD-------IWAKSSADRGKVVWALWPMG 466
            L   G  TA+V       +  A+G++ H +          WA S        W   P  
Sbjct: 465 SLREPGRVTAKVYAGVESKDGEANGFMAHTQNTPFGWTCPGWAFS--------WGWSPAA 516

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEF 526
             W+  + WE+Y +T D +F+E+  YP+L+  A+F    L E  DG L ++PS SPEH  
Sbjct: 517 VPWILQNCWEYYEFTGDTEFMEENIYPMLKEEATFYNQILTEDKDGKLVSSPSYSPEH-- 574

Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PTKIA 585
                     +  +T +  +I +++     AAEVL ++ + L  K  ++  +L+ P +I 
Sbjct: 575 -------GPYTAGNTYEHTLIWQLYEDAAKAAEVLGQDTE-LAAKWKENQSKLKGPIEIG 626

Query: 586 EDGSIMEWVQR 596
           +DG I EW + 
Sbjct: 627 DDGQIKEWYEE 637


>gi|153805874|ref|ZP_01958542.1| hypothetical protein BACCAC_00113 [Bacteroides caccae ATCC 43185]
 gi|149130551|gb|EDM21757.1| hypothetical protein BACCAC_00113 [Bacteroides caccae ATCC 43185]
          Length = 833

 Score =  262 bits (669), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 183/600 (30%), Positives = 287/600 (47%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG +GA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 77  SQSLPIGNGSIGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIR 136

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + + F                  +  +G+  +E   + +  ++  Y
Sbjct: 137 KAFTEGDQVKAERLTRQNFNSEVPYEGSREKPFRFGSFTTMGEFYVETGLNIIGMSD--Y 194

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++   +V + R +F S P  V+V + S    G  +L F+ + + +  
Sbjct: 195 KRILSLDSAMAVVQFKKDDVAYQRNYFISYPANVLVMRFSADRPGKQNLIFSYAPNPVST 254

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                 G+N ++              +A  D  G+++  ++ I+     GT+    + KL
Sbjct: 255 GSMVAQGDNGLVY-------------SAALDNNGMKY--VVRIQAETKGGTLVN-RNGKL 298

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRH 297
            V+G+D  V  + A + +   F     + K     +P   +   L +     YS L   H
Sbjct: 299 TVKGADEVVFYVTADTDYKANFAPDFKNPKTYVGVNPVETTGQWLANAVAKGYSALLNEH 358

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
             DY  LF+RV + L+ + K              +P+ +R+K+++  + D  L EL FQF
Sbjct: 359 YQDYAALFNRVKLNLNPTVK-----------TGNLPTGQRLKNYRKGQPDYYLEELYFQF 407

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL EC  PL 
Sbjct: 408 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLEECMLPLI 467

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  + A GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 468 DFIRTLVKPGEKTAQSYFGARGWTASISANIFGFTAPLESQDMSWNFNPMAGPWLATHIW 527

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D  FL++  Y L++  A+F +D+L    DG     PSTSPEH           
Sbjct: 528 EYYDYTRDLKFLKETGYELIKSSANFAVDYLWHKPDGTYTAAPSTSPEH---------GP 578

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A++RE+    I A+E L  +K E    E+VL +   L P KI   G +MEW
Sbjct: 579 IDQGATFVHAVVREILLDAIKASEELGVDKKERKEWEQVLAN---LVPYKIGRYGQLMEW 635


>gi|418113491|ref|ZP_12750487.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA41538]
 gi|353781702|gb|EHD62143.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA41538]
          Length = 535

 Score =  261 bits (668), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 181/574 (31%), Positives = 286/574 (49%), Gaps = 53/574 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ +   A ++ +A+PIGNG LG M++G    E ++LN++T+W     D  NPD+   L
Sbjct: 1   MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y RELD
Sbjct: 61  KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119

Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
           L+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++  
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179

Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
               ++ I+M     G+            KG+QF  +   K++D  G +S L  + + + 
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
            +    L L + + + G                +S+LQ    ++ Y      H+  YQ+ 
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           S+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  +  
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D 
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
             L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D 
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
            I+R    + I  A+ L  N D +    +K L R
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISR--VKELKR 529


>gi|237709067|ref|ZP_04539548.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
 gi|229456763|gb|EEO62484.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
          Length = 818

 Score =  261 bits (668), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 195/639 (30%), Positives = 298/639 (46%), Gaps = 82/639 (12%)

Query: 2   MNAESTSTTNPLKITFNGP------AKHFT---------DAIPIGNGRLGAMVWGGVPSE 46
           + A+ST  T  L I F+ P      A  ++         +++PIGNG LG  V G + +E
Sbjct: 17  LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPVTDKAWENNSLPIGNGSLGGNVMGSIAAE 76

Query: 47  TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
            + LNE TLW G P            N ++   L ++R     G   +A   + K F   
Sbjct: 77  RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136

Query: 99  ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
           AD             +  LG+  +E   S +      Y+R L L++A A V +    V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEI--GMTNYKRILSLDSAMAVVSFRKDEVNY 194

Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
            R++F S PD V+V K +    G  +L F+   +         +G N ++  G       
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKADGPNCLLYTGCL----- 249

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
                     K  Q    L I+  +  G+++   D K  V  +D  + LL A +    +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298

Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
           +  F +P      DP   +++ + +    SY++L  RH  DY +LF RV +QL+ R+P  
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357

Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
               T     +  +P+ +R+  ++  + D  L E+ +QFGRYLLI+SSRPG   ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413

Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
           W   +   W    H NIN++MNYW +   NL+EC  PL DF+  L   G KTAQ  + A 
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473

Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
           GW      +I+  +S      + W   PM G WL TH+WE+Y+YT D+ FL++  Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533

Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
             A+F +D+L    DG     PSTSPEH           V   +T   A++RE+    I 
Sbjct: 534 SSANFAVDYLWYKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584

Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           A++ L  +  +    + VL     L P +I   G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLN---HLVPYQIGRYGQLMEW 620


>gi|393785852|ref|ZP_10373996.1| hypothetical protein HMPREF1068_00276 [Bacteroides nordii
           CL02T12C05]
 gi|392660966|gb|EIY54563.1| hypothetical protein HMPREF1068_00276 [Bacteroides nordii
           CL02T12C05]
          Length = 810

 Score =  261 bits (668), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 178/610 (29%), Positives = 300/610 (49%), Gaps = 69/610 (11%)

Query: 15  ITFNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA----- 68
           + F  PAK +++ A+ IGNG +GA  +G V  E L + E T W G P  +  PD      
Sbjct: 35  VWFRYPAKSWSEQALHIGNGYMGASFYGEVEKERLDIAEKTFWAGGP--HAAPDFNYGII 92

Query: 69  ---PKALSDVRSLVDSGQYAEATAAS-VKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
                 ++ +R L+   ++AEA + S + + G   +   + ++G++ ++F  +  K   +
Sbjct: 93  KGDKDKIATIRQLIVERRFAEADSLSRIYMTGDYTNYGYFSMVGNLWIDFGKN--KQPVQ 150

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            Y R +DL+T+   V+Y+ G V+F RE+F S PD+++    +  ++G +SF++S   +  
Sbjct: 151 NYLRGIDLSTSRGFVEYTQGGVQFNREYFCSYPDKLMALHFTADKAGKISFSLSHSLVYP 210

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
               +   N +   G         + N          S  + IKI    G++  +  +++
Sbjct: 211 PEEVIESENGLTFNGII-------RKNG--------LSYTIRIKIVQQGGSVK-VAHQRI 254

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            VE ++ A +     + +  P + P    ++P   +   +       Y  +   H+ DYQ
Sbjct: 255 VVEKANEATVFYAVDTEY-AP-VYPLYKGENPQQNTGKVITKAITKGYETVKNTHISDYQ 312

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYL 360
            L++RV   L+        DT SE+    +P+  RVK  Q    +D SL  L F   RYL
Sbjct: 313 TLYNRVRFTLT-------GDTASEQ----LPTNMRVKQLQKGFTDDASLKVLGFNLSRYL 361

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           LIS+SRPGT  + LQG+WN      W+     NINL+  YW   P +L EC+E   +++ 
Sbjct: 362 LISASRPGTLPSTLQGVWNTFEKAPWNGNFQSNINLQEMYWGCGPTHLPECEEAYLEWIE 421

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            L   G +TA+  Y   GWV H   +IW  +      ++W L+P G AW C HLWEHY +
Sbjct: 422 GLVEPGRQTAREYYGTKGWVSHSTGNIWGHTVPGD-DILWGLYPSGAAWHCRHLWEHYAF 480

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
             D+++L  + YP+++  A F L+ ++E + G+    PS S EH     +G  + V YS+
Sbjct: 481 NGDKEYLRTKGYPIMKEAAEFWLENMVE-YQGHFIIAPSVSAEHGIEMKNG--SPVEYST 537

Query: 541 T---------------MDMAIIREVFSAIISAAEVLEKNEDALV-EKVLKSLPRLRPTKI 584
           T                D+ ++ +++S +I AAE L  N D++  +K+L +  +L P KI
Sbjct: 538 TNGEQTEGRLFTVPAYQDIEMVYDLYSHVIKAAECL--NTDSVFRQKLLIAKNKLLPLKI 595

Query: 585 AEDGSIMEWV 594
              G + EW+
Sbjct: 596 GRYGQLQEWI 605


>gi|429755750|ref|ZP_19288384.1| hypothetical protein HMPREF9072_01113 [Capnocytophaga sp. oral
           taxon 324 str. F0483]
 gi|429173108|gb|EKY14643.1| hypothetical protein HMPREF9072_01113 [Capnocytophaga sp. oral
           taxon 324 str. F0483]
          Length = 799

 Score =  261 bits (667), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 190/605 (31%), Positives = 306/605 (50%), Gaps = 63/605 (10%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + F+ PA HFT++IPIGNGRLGAM++G    + + LNE +LW+G   +  +P+A   L
Sbjct: 16  VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75

Query: 73  SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
            D++ L+  G+  EA A   + F         G  A+     YQ+L ++ L++  +    
Sbjct: 76  KDIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
             + Y+R L L+ A A   +   N    +  F+   + VI  +I  +    L+ ++SL  
Sbjct: 133 PIQDYKRVLRLDEAIAVTSFKRDNNSIEQTAFADFKNDVIWVRIKATSP--LNLDISLFR 190

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
             +N +    NN+I + G  P          ND  +G+ F++I++++     G I +   
Sbjct: 191 K-ENATITYQNNKITLNGVLP----------NDGKEGMHFASIVDVQTD---GKIESTH- 235

Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
           K + ++ +    L + A ++++   G  ++ S +KK     +   LQ    +S+      
Sbjct: 236 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 289

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-Q 355
               +Q+LF+R                 +  N + + + ER+  F   E  +L+ +L+  
Sbjct: 290 SSIVFQRLFNRNRWYGK-----------ANANTEGLTTFERLGRFYKGEQDALLPILYYN 338

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRYLLISSSR G   ANLQG+W E+    W+   H+NIN++MNYW + P NLS+  EPL
Sbjct: 339 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 398

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
             F   L  NGSKTA+  Y A+GWV H  ++ W  +S       W     GGAWLC H+W
Sbjct: 399 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 457

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
           +HY +T D +FL +  YP+L+   +F    LI+    GY  T PS SPE+ ++ P   DG
Sbjct: 458 QHYLFTKDINFL-REYYPVLKEATTFFESLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 516

Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
           K  +     + TMDM I+RE+F+    AA++L  +     E    S   + P +I + G 
Sbjct: 517 KKQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGD 575

Query: 590 IMEWV 594
           + EW+
Sbjct: 576 LNEWL 580


>gi|383123942|ref|ZP_09944612.1| hypothetical protein BSIG_4038 [Bacteroides sp. 1_1_6]
 gi|251838825|gb|EES66910.1| hypothetical protein BSIG_4038 [Bacteroides sp. 1_1_6]
          Length = 812

 Score =  261 bits (667), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 190/637 (29%), Positives = 309/637 (48%), Gaps = 87/637 (13%)

Query: 3   NAESTSTTNPLKITFNGP---------------AKHFTDAIPIGNGRLGAMVWGGVPSET 47
           +AEST  T  L I F+ P               A   + ++PIGNG +GA + G V +E 
Sbjct: 19  HAESTDYTKGLSIWFDSPNTLQGKEVWHSAQQDASWESQSLPIGNGSIGANILGSVEAER 78

Query: 48  LKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGH-- 97
           +  NE TLW G P      DY    N  +   L ++R     G   +A   + + F    
Sbjct: 79  ITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIRKAFVEGDQKKAEKLTRENFNSEV 138

Query: 98  PADV----------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFT 147
           P +           +  +G+  +E   S +  ++  Y+R L L++A A V++   +V + 
Sbjct: 139 PYEFSREKPFRFGNFTTMGEFYVETGLSTIGMSD--YKRILSLDSAMAVVQFKKDDVAYQ 196

Query: 148 REHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
           R +F S P  V+V + S  +    +L+F  + + +       +GNN ++           
Sbjct: 197 RNYFISYPANVMVMRFSADQPSKQNLTFRYAPNPVSTGQFSTDGNNGLVY---------- 246

Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
               A+ D  G++++  + I+ + + GT++   D ++ V+ +D  +  + A + +   F 
Sbjct: 247 ---TASLDNNGMKYA--VRIQATVNGGTLNN-ADGRITVKEADEVIFYVTADTDYKMNFA 300

Query: 266 -NPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
            + +D K     +P   +   ++      Y++L   H  DY  LF+RV ++L+ + K   
Sbjct: 301 PDFTDPKTYVGVNPLETTQQWMKDAVAKGYANLLNEHYKDYASLFNRVKLELNPTVK--- 357

Query: 321 TDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
                   I  +P+A+R+K+++  + D  L +L +QFGRYLLI+SSRPG   ANLQGIW+
Sbjct: 358 --------IANLPTAQRLKNYRKGQPDYYLEKLYYQFGRYLLIASSRPGNMPANLQGIWH 409

Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
            ++   W    H NIN++MNYW +   NL EC  PL DF+  L   G KTAQ  + A GW
Sbjct: 410 NNIDGPWRVDYHNNINIQMNYWPACSTNLDECMLPLIDFIRTLVKPGEKTAQSYFGARGW 469

Query: 440 VIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
                 +I+  ++  +   + W   PM G WL TH+WE+Y+YT D  FL++  Y L++  
Sbjct: 470 TASISANIFGFTTPLESQDMSWNFNPMAGPWLATHVWEYYDYTKDLKFLKETGYELIKSS 529

Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
           A+F +D+L    DG     PSTSPEH           V   +T   A++RE+    I A+
Sbjct: 530 ANFTVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLDAIQAS 580

Query: 559 EVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           + L  +K E    E VL +   L P KI   G ++EW
Sbjct: 581 KELGIDKKERKQWEHVLAN---LVPYKIGRYGQLLEW 614


>gi|336436142|ref|ZP_08615855.1| hypothetical protein HMPREF0988_01440 [Lachnospiraceae bacterium
           1_4_56FAA]
 gi|336008182|gb|EGN38201.1| hypothetical protein HMPREF0988_01440 [Lachnospiraceae bacterium
           1_4_56FAA]
          Length = 473

 Score =  261 bits (667), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 157/489 (32%), Positives = 250/489 (51%), Gaps = 35/489 (7%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           K+ +  PAK FT+A+P+GNG LGAMV+GGVP E + LN DT W+G           K L 
Sbjct: 4   KLKYITPAKSFTEALPLGNGSLGAMVYGGVPEEHITLNHDTFWSGTGRRPEKEIDAKILG 63

Query: 74  DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
             R L+   ++ EA       + G   + Y  LG++   F++       E Y R LDL  
Sbjct: 64  HARELLFEEKFWEAEQFIKEHMLGFYNESYMPLGELNYRFEEIG---EIEQYSRNLDLEN 120

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
           A    ++   N  +  E F S P + ++ ++  S S  L+ +V+L+S + +      +  
Sbjct: 121 AIFSSEFCSKNTLYQTEVFISYPAKALILRMKVSGSEKLNLSVNLNSKVRHDMKAEVSQD 180

Query: 193 IIMEGRCPGKRIPPKANANDDP-------KGIQFSAILEIKISDDRGTISALEDKKLKVE 245
           + + G  P   + P     D P        G+ F   L I+  +  G ++AL+D+ LKV+
Sbjct: 181 LYIFGNAPSN-VQPNYLTCDHPITYDEQNPGMAFGCYLHIE--NTGGEVTALKDE-LKVK 236

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDP---TSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            +D  +  L A   + G        +KDP    ++   +L+ ++N  Y  L   H+ DY+
Sbjct: 237 NADEVLFYLTAEDGYRG---YKKRIEKDPEVCITQCRKSLEILKNRDYESLKQEHIIDYK 293

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLL 361
            ++  V ++L +   D+             P  +R+  F+   +D  L+ L F + RYL+
Sbjct: 294 SVYKDVRLELEKEESDM-------------PLDQRLAEFRNGKQDLGLLCLFFHYNRYLM 340

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           ++SSR G+Q ANLQGIWNE + P W S   VNIN EMNYW +  CNL +   P  +F++ 
Sbjct: 341 VASSRKGSQPANLQGIWNESIRPVWSSNWTVNINTEMNYWMNGSCNLLDSYLPFVEFVSE 400

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
           LS  G +TA+  Y  SGW  +H  DIW ++    G+  +A WPMGG WLC   +E++ Y+
Sbjct: 401 LSDAGKETARKQYHCSGWTANHNVDIWRQTGPVAGEPKYAYWPMGGIWLCAQSYEYFKYS 460

Query: 482 MDRDFLEKR 490
            D ++L+++
Sbjct: 461 KDIEYLKQK 469


>gi|265767320|ref|ZP_06094986.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
 gi|263252625|gb|EEZ24137.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
          Length = 829

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 179/600 (29%), Positives = 287/600 (47%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
           + ++PIGNG +GA + G + +E +  NE TLW G P      DA           L ++R
Sbjct: 75  SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + K F                  +  +G+  +E   S +  ++  Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++   +V + R++F S P  V+  +      G  +L+F+ S + +  
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G N +                A+ D  G+Q+  ++ I      GT+S   + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D  V L+ A +    +FD  F +P      +P   +   + +   + Y  L+ +H
Sbjct: 297 TVKNADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            DDY  LF+RV +QL+   +              +P+ +R+++++  + D  L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL EC  PL 
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  +   GW      +I+  ++  +  ++ W   PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D+ FL++  Y L++  A F  D+L    DG     PSTSPEH           
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A+IRE+    I A++VL  +  E    ++VL     L P K+   G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDSKERKQWQEVLT---HLAPYKVGRYGQLMEW 633


>gi|423259841|ref|ZP_17240764.1| hypothetical protein HMPREF1055_03041 [Bacteroides fragilis
           CL07T00C01]
 gi|423267496|ref|ZP_17246477.1| hypothetical protein HMPREF1056_04164 [Bacteroides fragilis
           CL07T12C05]
 gi|387775879|gb|EIK37983.1| hypothetical protein HMPREF1055_03041 [Bacteroides fragilis
           CL07T00C01]
 gi|392696970|gb|EIY90157.1| hypothetical protein HMPREF1056_04164 [Bacteroides fragilis
           CL07T12C05]
          Length = 829

 Score =  260 bits (665), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 179/600 (29%), Positives = 288/600 (48%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
           + ++PIGNG +GA + G + +E +  NE TLW G P      DA           L ++R
Sbjct: 75  SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + K F                  +  +G+  +E   S +  ++  Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++   +V + R++F S P  V+  +      G  +L+F+ S + +  
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G N +                A+ D  G+Q+  ++ I      GT+S   + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D  V L+ A +    +FD  F +P +    +P   +   + +   + Y  L+ +H
Sbjct: 297 TVKDADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            DDY  LF+RV +QL+   +              +P+ +R+++++  + D  L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL EC  PL 
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  +   GW      +I+  ++  +  ++ W   PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D+ FL++  Y L++  A F  D+L    DG     PSTSPEH           
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A+IRE+    I A++VL  +  E    ++VL     L P K+   G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDSKERKQWQEVLT---HLAPYKVGRYGQLMEW 633


>gi|53715738|ref|YP_101730.1| hypothetical protein BF4459 [Bacteroides fragilis YCH46]
 gi|60683673|ref|YP_213817.1| hypothetical protein BF4255 [Bacteroides fragilis NCTC 9343]
 gi|336411650|ref|ZP_08592113.1| hypothetical protein HMPREF1018_04131 [Bacteroides sp. 2_1_56FAA]
 gi|375360504|ref|YP_005113276.1| hypothetical protein BF638R_4337 [Bacteroides fragilis 638R]
 gi|383119758|ref|ZP_09940496.1| hypothetical protein BSHG_3421 [Bacteroides sp. 3_2_5]
 gi|423252289|ref|ZP_17233283.1| hypothetical protein HMPREF1066_04293 [Bacteroides fragilis
           CL03T00C08]
 gi|423252862|ref|ZP_17233793.1| hypothetical protein HMPREF1067_00437 [Bacteroides fragilis
           CL03T12C07]
 gi|52218603|dbj|BAD51196.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
 gi|60495107|emb|CAH09926.1| conserved hypothetical exported protein [Bacteroides fragilis NCTC
           9343]
 gi|251944620|gb|EES85095.1| hypothetical protein BSHG_3421 [Bacteroides sp. 3_2_5]
 gi|301165185|emb|CBW24755.1| conserved hypothetical exported protein [Bacteroides fragilis 638R]
 gi|335941084|gb|EGN02944.1| hypothetical protein HMPREF1018_04131 [Bacteroides sp. 2_1_56FAA]
 gi|392647562|gb|EIY41261.1| hypothetical protein HMPREF1066_04293 [Bacteroides fragilis
           CL03T00C08]
 gi|392659231|gb|EIY52857.1| hypothetical protein HMPREF1067_00437 [Bacteroides fragilis
           CL03T12C07]
          Length = 829

 Score =  260 bits (665), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 179/600 (29%), Positives = 287/600 (47%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
           + ++PIGNG +GA + G + +E +  NE TLW G P      DA           L ++R
Sbjct: 75  SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + K F                  +  +G+  +E   S +  ++  Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++   +V + R++F S P  V+  +      G  +L+F+ S + +  
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G N +                A+ D  G+Q+  ++ I      GT+S   + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D  V L+ A +    +FD  F +P      +P   +   + +   + Y  L+ +H
Sbjct: 297 TVKDADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            DDY  LF+RV +QL+   +              +P+ +R+++++  + D  L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL EC  PL 
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  +   GW      +I+  ++  +  ++ W   PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D+ FL++  Y L++  A F  D+L    DG     PSTSPEH           
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A+IRE+    I A++VL  +  E    ++VL     L P K+   G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDSKERKQWQEVLT---HLAPYKVGRYGQLMEW 633


>gi|339479496|gb|ABE95964.1| Conserved hypothetical protein (Glycosyl hydrolases family 95)
           [Bifidobacterium breve UCC2003]
          Length = 783

 Score =  260 bits (665), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 183/606 (30%), Positives = 294/606 (48%), Gaps = 49/606 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+TF+G +  + + IP+GNGR+GA++     ++ L LN+DTLW+G P   T+P  P+ +
Sbjct: 1   MKLTFDGISSCWEEGIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60

Query: 73  SDVR--SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           +  R  SL D    A        L      +Y+  G   +++  S      E+ +R+LDL
Sbjct: 61  AKARQASLQDDYNAATRIIKDATLQEKDEQIYEPFGTARIQYSTS--ADGRESMKRQLDL 118

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
             A A   + +G+     + + S PD ++V ++S   S  ++ +VS          ++++
Sbjct: 119 ARALAGETFQMGDANVHVDAWCSEPDDLLVYRMSSDASIDVNISVSGTFLKQSRASMETV 178

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIP----PKANANDDPK---GIQFSAILEIKISDDRGT 233
            D H        +++ GR PG  I     P  N  +D +   G+ ++    + ++   G 
Sbjct: 179 FDGHRAT-----LVVMGRMPGLNIGLLPHPSENPWEDEQDGTGMAYAGAFSLTVT---GG 230

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
              + D  L+        L   + S F G    P  S     ++ +       +     +
Sbjct: 231 DVNVGDNSLQCSNITGLSLRFRSMSGFRGSDQQPERSMT-VIADHLEKTIDEWSTDLRTM 289

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL---V 350
           + RH+ DY++ F RV+I L  +  D   DT        +P +  ++S +  E   L    
Sbjct: 290 FDRHIADYRRYFDRVAIHLGSAHDD---DT-------ELPFSAILRSDENKEPHRLEMLA 339

Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
           E +F FGRYLLISSSRP TQ ANLQGIWN    P W SA   NIN+EMNYW + PC L E
Sbjct: 340 EAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCALQE 399

Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
             EPL      L + G   A       G  + H  D+W ++    G  +W+ WP G AW+
Sbjct: 400 LIEPLVSMNEELLVPGHDAADRILGCRGSAVFHNVDLWRRALPANGDPMWSFWPFGQAWM 459

Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
           C +L++ Y +  D  +L  R +P++   A F +D+L E   G L  +P+TSPE+ F+  +
Sbjct: 460 CRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETKHG-LAPSPATSPENCFLV-N 516

Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKIAED 587
           G+L  V+ SS    AI+R +   +I A+   E L++ +  LV +       L  T++  D
Sbjct: 517 GELVSVAQSSENATAIVRNLLDDLIQASHDLENLDEEDRDLVHEAESVRSPLAETRLGAD 576

Query: 588 GSIMEW 593
           G I+EW
Sbjct: 577 GRILEW 582


>gi|253574361|ref|ZP_04851702.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
           str. D14]
 gi|251846066|gb|EES74073.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
           str. D14]
          Length = 793

 Score =  260 bits (665), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 187/604 (30%), Positives = 301/604 (49%), Gaps = 51/604 (8%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--YTNPDAPKA 71
           K+ ++ PA  +++ +P+GNGR+GA+V      E   L E T W+G   +          A
Sbjct: 12  KLWYDKPAAGWSEGLPVGNGRIGAIVMAAPEREVWNLTESTYWSGQADETASAASGGKAA 71

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPA---DVYQLLGDIELEFDDSHLKYAEET----- 123
           L+ +R  + +G YA     + +    P      +  + D+ +EF  S      ET     
Sbjct: 72  LAAIRERLFAGDYAGGDRLAKQALQPPKRNFGTHLAMCDVVIEFAPSGEPSETETGAVNG 131

Query: 124 ----YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
               +RRELDL+TA              RE F+S+ D V+V++I    +G +SF + L  
Sbjct: 132 ACSPFRRELDLSTALLTTTSGQPGSTLVRELFASHADDVLVSRIWSEAAGGVSFTLGLAG 191

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
           L      V+ +    +E R  GK    +   +D   G++    +E+   D RG    +++
Sbjct: 192 LTPEFE-VSASGMAALEFR--GKAT--ETVHSDGACGVRCRGRIEL---DTRGGSLYVQN 243

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
            +L V G+D A + L  ++ +        +S+    +  + A  ++    Y  L   HL 
Sbjct: 244 DRLVVRGADEACIYLTVATDYR------CESRSWELAPRLQASLALSK-GYDQLKADHLA 296

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGR 358
           DY+ LF RVSI+L  S           E    +P+ +R++   Q   DP L  L  Q+GR
Sbjct: 297 DYEPLFRRVSIELGPS-----------EEAAKLPTDQRIRLLRQGYSDPQLFALFLQYGR 345

Query: 359 YLLISSSRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           YL ++ SR  + +  +LQGIWN  E     W    H+++N EMNY+ +   +L E Q+PL
Sbjct: 346 YLTLAGSREDSPLPLHLQGIWNDGEACRMGWSCDYHLDVNTEMNYYPTEVVHLGESQQPL 405

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCTHL 474
             +L  L+  G KTA+  Y + GWV H  +++W  +  D G    W L   GG WL   +
Sbjct: 406 MRYLEDLARAGQKTARDVYGSPGWVAHVFSNVWGFT--DPGWDTSWGLNVTGGLWLAMQM 463

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKL 533
            EHY + +DR FLEK+AYP+L   A F LD++ +    G+L T PS SPE+ F     + 
Sbjct: 464 IEHYRFGLDRVFLEKQAYPVLREAALFFLDYMTVHPKYGWLVTGPSNSPENHFYPGRPEE 523

Query: 534 AC--VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
            C  +S  STMD A++RE+F+  + AAE+LE++ + L  ++  ++P L P +I + G + 
Sbjct: 524 GCWQLSMGSTMDQALVRELFTFCLEAAELLEEDVE-LRSRLSSAIPLLPPLQIGKKGQLQ 582

Query: 592 EWVQ 595
           EW++
Sbjct: 583 EWLE 586


>gi|29350090|ref|NP_813593.1| hypothetical protein BT_4682 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29342002|gb|AAO79787.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 812

 Score =  260 bits (665), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 189/637 (29%), Positives = 308/637 (48%), Gaps = 87/637 (13%)

Query: 3   NAESTSTTNPLKITFNGP---------------AKHFTDAIPIGNGRLGAMVWGGVPSET 47
           +AE T  T  L I F+ P               A   + ++PIGNG +GA + G + +E 
Sbjct: 19  HAEDTDYTKGLSIWFDSPNTLQGKEVWHSSKQDASWESQSLPIGNGSIGANILGSIEAER 78

Query: 48  LKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGH-- 97
           +  NE TLW G P      DY    N  +   L ++R     G   +A   + + F    
Sbjct: 79  ITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIRKAFVEGDQKKAEKLTRENFNSEV 138

Query: 98  PADV----------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFT 147
           P +           +  +G+  +E   S +  ++  Y+R L L++A A V++   +V + 
Sbjct: 139 PYEFSREKPFRFGNFTTMGEFYVETGLSTIGMSD--YKRILSLDSAMAVVQFKKDDVAYQ 196

Query: 148 REHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
           R +F S P  V+V + S  + G  +L+F  + + +       +GNN ++           
Sbjct: 197 RNYFISYPANVMVMRFSADQPGKQNLTFRYAPNPVSTGQFSADGNNGLVY---------- 246

Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
               A+ D  G++++  + I+ +   GT++   D ++ V+ +D  V  + A + +   F 
Sbjct: 247 ---TASLDNNGMKYA--VRIQATVKGGTLNN-TDGRITVKEADEVVFYVTADTDYKMNFA 300

Query: 266 -NPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
            + +D K     +P   +   ++   +  YS+L   H  DY  LF+RV ++L+ + K   
Sbjct: 301 PDFTDPKTYVGVNPLETTQQWMKDAVSKGYSNLLDEHYKDYASLFNRVKLELNPTVK--- 357

Query: 321 TDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
                      +P+A+R+K+++  + D  L +L +QFGRYLLI+SSRPG   ANLQGIW+
Sbjct: 358 --------TSNLPTAQRLKNYRNGQPDYYLEKLYYQFGRYLLIASSRPGNMPANLQGIWH 409

Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
            ++   W    H NIN++MNYW +   NL EC  PL DF+  L   G KTAQ  + A GW
Sbjct: 410 NNIDGPWRVDYHNNINIQMNYWPACSTNLDECMLPLIDFIRTLVKPGEKTAQSYFGARGW 469

Query: 440 VIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
                 +I+  ++  +   + W   PM G WL TH+WE+Y+YT D  FL++  Y L++  
Sbjct: 470 TASISANIFGFTTPLESQDMSWNFNPMAGPWLATHVWEYYDYTKDLKFLKETGYELIKSS 529

Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
           A+F +D+L    DG     PSTSPEH           +   +T   A++RE+    I A+
Sbjct: 530 ANFTVDYLWHKPDGTYTAAPSTSPEH---------GPIDQGATFVHAVVREILLDAIQAS 580

Query: 559 EVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           + L  +K E    E VL +   L P KI   G ++EW
Sbjct: 581 KELGIDKKERKQWEHVLAN---LVPYKIGRYGQLLEW 614


>gi|298384410|ref|ZP_06993970.1| fibronectin type III domain protein [Bacteroides sp. 1_1_14]
 gi|298262689|gb|EFI05553.1| fibronectin type III domain protein [Bacteroides sp. 1_1_14]
          Length = 812

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 189/637 (29%), Positives = 308/637 (48%), Gaps = 87/637 (13%)

Query: 3   NAESTSTTNPLKITFNGP---------------AKHFTDAIPIGNGRLGAMVWGGVPSET 47
           +AE T  T  L I F+ P               A   + ++PIGNG +GA + G + +E 
Sbjct: 19  HAEDTDYTKGLSIWFDSPNTLQGKEVWHSSKQDASWESQSLPIGNGSIGANILGSIEAER 78

Query: 48  LKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGH-- 97
           +  NE TLW G P      DY    N  +   L ++R     G   +A   + + F    
Sbjct: 79  ITFNEKTLWRGGPNTTKGADYYWNVNKQSAHILDEIRKAFVEGDQKKAEKLTRENFNSEV 138

Query: 98  PADV----------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFT 147
           P +           +  +G+  +E   S +  ++  Y+R L L++A A V++   +V + 
Sbjct: 139 PYEFSREKPFRFGNFTTMGEFYVETGLSTIGMSD--YKRILSLDSAMAVVQFKKDDVAYQ 196

Query: 148 REHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
           R +F S P  V+V + S  + G  +L+F  + + +       +GNN ++           
Sbjct: 197 RNYFISYPANVMVMRFSADQPGKQNLTFRYAPNPVSTGQFSADGNNGLVY---------- 246

Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
               A+ D  G++++  + I+ +   GT++   D ++ V+ +D  V  + A + +   F 
Sbjct: 247 ---TASLDNNGMKYA--VRIQATVKGGTLNN-TDGRITVKEADEVVFYVTADTDYKMNFA 300

Query: 266 -NPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
            + +D K     +P   +   ++   +  YS+L   H  DY  LF+RV ++L+ + K   
Sbjct: 301 PDFTDPKTYVGVNPLETTQQWMKDAVSKGYSNLLDEHYKDYASLFNRVKLELNPTVK--- 357

Query: 321 TDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
                      +P+A+R+K+++  + D  L +L +QFGRYLLI+SSRPG   ANLQGIW+
Sbjct: 358 --------TSNLPTAQRLKNYRNGQPDYYLEKLYYQFGRYLLIASSRPGNMPANLQGIWH 409

Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
            ++   W    H NIN++MNYW +   NL EC  PL DF+  L   G KTAQ  + A GW
Sbjct: 410 NNIDGPWRVDYHNNINIQMNYWPACSTNLDECMLPLIDFIRTLVKPGEKTAQSYFGARGW 469

Query: 440 VIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
                 +I+  ++  +   + W   PM G WL TH+WE+Y+YT D  FL++  Y L++  
Sbjct: 470 TASISANIFGFTTPLESQDMSWNFNPMAGPWLATHVWEYYDYTKDLKFLKETGYELIKSS 529

Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
           A+F +D+L    DG     PSTSPEH           +   +T   A++RE+    I A+
Sbjct: 530 ANFTVDYLWHKPDGTYTAAPSTSPEH---------GPIDQGATFVHAVVREILLDAIQAS 580

Query: 559 EVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           + L  +K E    E VL +   L P KI   G ++EW
Sbjct: 581 KELGIDKKERKQWEHVLAN---LVPYKIGRYGQLLEW 614


>gi|423282784|ref|ZP_17261669.1| hypothetical protein HMPREF1204_01207 [Bacteroides fragilis HMW
           615]
 gi|404581655|gb|EKA86351.1| hypothetical protein HMPREF1204_01207 [Bacteroides fragilis HMW
           615]
          Length = 829

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 179/600 (29%), Positives = 288/600 (48%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
           + ++PIGNG +GA + G + +E +  NE TLW G P      DA           L ++R
Sbjct: 75  SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + K F                  +  +G+  +E   S +  ++  Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++   +V + R++F S P  V+  +      G  +L+F+ S + +  
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G N +                A+ D  G+Q+  ++ I      GT+S   + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D  V L+ A +    +FD  F +P +    +P   +   + +   + Y  L+ +H
Sbjct: 297 TVKDADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            DDY  LF+RV +QL+   +              +P+ +R+++++  + D  L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL EC  PL 
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  +   GW      +I+  ++  +  ++ W   PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D+ FL++  Y L++  A F  D+L    DG     PSTSPEH           
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A+IRE+    I A++VL  +  E    ++VL     L P K+   G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDGKERKQWQEVLT---HLAPYKVGRYGQLMEW 633


>gi|423271952|ref|ZP_17250921.1| hypothetical protein HMPREF1079_04003 [Bacteroides fragilis
           CL05T00C42]
 gi|423276043|ref|ZP_17254986.1| hypothetical protein HMPREF1080_03639 [Bacteroides fragilis
           CL05T12C13]
 gi|392696307|gb|EIY89503.1| hypothetical protein HMPREF1079_04003 [Bacteroides fragilis
           CL05T00C42]
 gi|392699548|gb|EIY92724.1| hypothetical protein HMPREF1080_03639 [Bacteroides fragilis
           CL05T12C13]
          Length = 829

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 179/600 (29%), Positives = 288/600 (48%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
           + ++PIGNG +GA + G + +E +  NE TLW G P      DA           L ++R
Sbjct: 75  SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + K F                  +  +G+  +E   S +  ++  Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++   +V + R++F S P  V+  +      G  +L+F+ S + +  
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G N +                A+ D  G+Q+  ++ I      GT+S   + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296

Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            V+ +D  V L+ A +    +FD  F +P +    +P   +   + +   + Y  L+ +H
Sbjct: 297 TVKDADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
            DDY  LF+RV +QL+   +              +P+ +R+++++  + D  L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL EC  PL 
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  +   GW      +I+  ++  +  ++ W   PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT D+ FL++  Y L++  A F  D+L    DG     PSTSPEH           
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A+IRE+    I A++VL  +  E    ++VL     L P K+   G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDGKERKQWQEVLT---HLAPYKVGRYGQLMEW 633


>gi|116624427|ref|YP_826583.1| hypothetical protein Acid_5349 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116227589|gb|ABJ86298.1| conserved hypothetical protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 718

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 190/591 (32%), Positives = 281/591 (47%), Gaps = 98/591 (16%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           L + +  PA+ + + A+PIGNGRLGAM++G    E L+LNE +LWTG             
Sbjct: 23  LALWYQQPAEDWQSQALPIGNGRLGAMIFGDARREHLQLNEISLWTG------------- 69

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
                   D+G+                  YQ LGD+ L+          + YRR LD++
Sbjct: 70  -----DEKDTGR------------------YQNLGDLFLDLTHG----PPQNYRRSLDID 102

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
           TA   V YS G   + RE+F+S P QVIV + +  + G+ +  + L    D H      +
Sbjct: 103 TAIHTVDYSAGGAAWRREYFASAPRQVIVLRCTADKRGAYTGTLRLT---DAHG-----S 154

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
            +  E    G R+   ++A     G++F   +++  +  R T S      L +E +D A+
Sbjct: 155 PVSAE----GTRL---SSAGKLENGLEFETQIQVMATGGRITASG---DALHIENAD-AL 203

Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
            + +A+ +   P    +     P +     L +   + Y+ +   H+ DYQ+LF RV++ 
Sbjct: 204 TIFIAAGTNYVPDRARAWRGDSPHARITRQLAAAAAMDYAGMRAAHIADYQQLFRRVTLN 263

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQ 370
           L  +P ++ TD             ER+  ++    DP L  L FQ+GRYLLISSSRPG+ 
Sbjct: 264 LGSTPGEMPTD-------------ERLLRYRDGSPDPELEALFFQYGRYLLISSSRPGSL 310

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            ANLQG+WN   +P W S  H NIN++MNYW +   NL+EC  P FD++   S+ G +T 
Sbjct: 311 PANLQGLWNNSNNPPWRSDYHSNINIQMNYWPAEVTNLAECALPFFDYVN--SLRGVRTE 368

Query: 431 QVNYL---ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
             +       GW +  + +I+       G   W   P G AW   H WEHY +T DRDFL
Sbjct: 369 ATHKYYPNVRGWTVQTENNIFGA-----GSFKWN--PPGSAWYAQHFWEHYAFTHDRDFL 421

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
            K AYP+L+    F  D L+   DG L T    SPEH    P           T D  ++
Sbjct: 422 SKMAYPVLKEITQFWEDHLVARPDGALVTPDGWSPEHGPEEP---------GVTYDQELV 472

Query: 548 REVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWVQRR 597
            ++F+  + AA VL  N DA    KV +   RL   K+   G + EW + R
Sbjct: 473 WDLFTNYLEAAAVL--NVDAGYRIKVTQLRQRLLKPKVGAWGQLQEWPEDR 521


>gi|380694581|ref|ZP_09859440.1| hypothetical protein BfaeM_11488 [Bacteroides faecis MAJ27]
          Length = 812

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 181/600 (30%), Positives = 296/600 (49%), Gaps = 72/600 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
           + ++PIGNG +GA + G V +E +  NE TLW G P      DY    N  +   L ++R
Sbjct: 56  SQSLPIGNGSIGANILGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIR 115

Query: 77  SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
                G   +A   + + F                  +  +G+  +E   + +K +E  Y
Sbjct: 116 KAFIEGDQQKAEKLTRENFNSEVPYEYSGEKPFRFGNFTTMGEFYIETGLNTVKMSE--Y 173

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           +R L L++A A V++   NV + R +F S P  V+V + S  + G  +L F+ + + +  
Sbjct: 174 KRILSLDSAMAVVQFKKDNVAYQRNYFISYPANVMVMRFSADQPGKQNLIFSYAPNPMST 233

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
               ++G+N ++              +A  +  G++++  + I+ +   GT++   D KL
Sbjct: 234 GQIAIDGSNGLVY-------------SAFLENNGMKYA--VRIQATVKGGTLNN-SDGKL 277

Query: 243 KVEGSDWAVLLLVASSSFDGPFI-NPSDSKK----DPTSESMSALQSIRNLSYSDLYTRH 297
            ++ +D AV  + A + +   F  + +D K     +P   +   ++      Y++L   H
Sbjct: 278 TIKDADEAVFYVTADTDYKMNFAPDFTDPKTYVGVNPLETTQQWMEDAVAKGYTNLLDEH 337

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
             DY  LF+RV ++L+ + K              +P+ +R+K+++  + D  L +L +QF
Sbjct: 338 YKDYAALFNRVKLELNPTVKTA-----------NLPTEQRLKNYRKGQPDYYLEKLYYQF 386

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYLLI+SSRPG   ANLQGIW+ ++   W    H NIN++MNYW +   NL EC  PL 
Sbjct: 387 GRYLLIASSRPGNMPANLQGIWHNNIDGPWRVDYHNNINIQMNYWPACSTNLDECMLPLI 446

Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
           DF+  L   G KTAQ  + A GW      +I+  ++  +   + W   PM G WL TH+W
Sbjct: 447 DFIRTLVKPGEKTAQSYFGARGWTASISANIFGFTTPLESQDMSWNFNPMAGPWLATHVW 506

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
           E+Y+YT +  FL++  Y L++  A+F +D+L    DG     PSTSPEH           
Sbjct: 507 EYYDYTQNLKFLKETGYELIKSSANFAVDYLWHKPDGTYTAAPSTSPEH---------GP 557

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +   +T   A+IRE+    I A++ L  +K E    E VL +   L P KI   G +MEW
Sbjct: 558 IDQGATFVHAVIREILLDAIKASKELGIDKKERKQWEHVLAN---LTPYKIGRYGQLMEW 614


>gi|261408195|ref|YP_003244436.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
 gi|261284658|gb|ACX66629.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
          Length = 779

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 189/611 (30%), Positives = 297/611 (48%), Gaps = 64/611 (10%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
           +K+ +  PA+ ++  +PIGNGR+G +V      E   + E T W+G P         KA 
Sbjct: 4   MKLWYTKPAQGWSQGLPIGNGRMGNVVISAPDREIWNITETTYWSGQPEPAQGRSNSKAD 63

Query: 72  LSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDS--------H 116
           L  +R     G Y E    + K        FG    + Q++    LEFD +         
Sbjct: 64  LERMRQHFFQGDYREGDRLAKKHLEPEKLNFGTNLGLCQVV----LEFDHNVKPSEGGRQ 119

Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS-LSFNV 175
              AE  + RELDL  A AR    +   E TRE F+S+ DQVIV++I  S   S +SF +
Sbjct: 120 EAAAEPLFYRELDLQEAVARSFCEIDGAEMTREVFASHADQVIVSRIRSSHGSSGVSFRI 179

Query: 176 SLDSLLDN---HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
           S+    +N   H+ V G + I   G+          ++N +      S   +++++ + G
Sbjct: 180 SIRG--ENGPFHANVTGKDTIEFRGQAL-----EDVHSNGE---CGVSCQGQLRVAAEGG 229

Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
            +S   D  + V G+D A +    ++ +           +    +S   L+    L Y  
Sbjct: 230 KVSCTADT-ISVSGADEAAIYFAVNTDY-------RQEGESWREKSAFQLEQAVLLGYDA 281

Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLV 350
           L  +HL DYQ L+ RV + L  S               ++P+ ER+  F+    +DP+L 
Sbjct: 282 LRAKHLADYQPLYARVRLDLGSSEHA------------SLPTDERIGRFKQGKQDDPALF 329

Query: 351 ELLFQFGRYLLISSSRPGTQV-ANLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCN 407
            L +Q+GRYL IS SRP + +  +LQGIWN  E     W    H++ N +MNY+ +   N
Sbjct: 330 ALFYQYGRYLTISGSRPDSILPMHLQGIWNDGEANKMAWSCDYHLDTNTQMNYFPTEAAN 389

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           LSE  EPL  ++  LS+ G   A+  Y A GWV H  ++ W  +S    +  W L   GG
Sbjct: 390 LSESHEPLMRYIQQLSVAGRSAARHYYDAEGWVAHVFSNAWGFASPGW-ETSWGLNVTGG 448

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEF 526
            W+ TH+ EHY Y  D+ FLE+ AYP+L+  A+F +D++ +    G+L T PS SPE+ F
Sbjct: 449 LWIATHMMEHYAYNQDQAFLEELAYPVLKEAAAFFMDYMTVHPKYGWLVTGPSNSPENSF 508

Query: 527 IA--PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
               P+     +S   TMD  ++R++ +  + AA+ L  +E+ L +K   +L +L P  I
Sbjct: 509 YTGNPEDGHQQLSMGPTMDQVLVRDLLAFCVKAAQTLGVDEE-LRQKWQTALDQLPPLMI 567

Query: 585 AEDGSIMEWVQ 595
            + G + EW++
Sbjct: 568 GKKGQLQEWLE 578


>gi|296453497|ref|YP_003660640.1| hypothetical protein BLJ_0327 [Bifidobacterium longum subsp. longum
           JDM301]
 gi|296182928|gb|ADG99809.1| hypothetical protein BLJ_0327 [Bifidobacterium longum subsp. longum
           JDM301]
          Length = 783

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 187/607 (30%), Positives = 294/607 (48%), Gaps = 51/607 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+TF+G +  + + IP+GNGR+GA++     ++ L LN+DTLW+G P   T+P  P+ +
Sbjct: 1   MKLTFDGISSCWEEGIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60

Query: 73  SDVRSLVDSGQYAEATA--ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           +  R       Y  AT       L      +Y+  G   +++  S      E+ +R+LDL
Sbjct: 61  AKARQAASGDDYTAATRIIKEATLQEKDEQIYEPFGTARIQY--STPADGRESMKRQLDL 118

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
             A A   + +G+     + + S PD ++V ++S      ++ +VS          L+++
Sbjct: 119 ARALAGETFQMGDANVHVDAWCSEPDDLLVYRMSSDAPIDVNISVSGTFLKQSRASLETV 178

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRI-----PPKANANDDPKGIQFSAILEIKISDDRGTIS 235
            D H        +I+ GR PG  +     P +    D+  G   +      ++   G I+
Sbjct: 179 SDGHRAT-----LIVMGRMPGLNVGLLPHPSEHPWEDEQDGTGMAYAGAFSLTATGGDIN 233

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
            ++D  L+        L   + S F G    P  S     +     L+   +   +DL T
Sbjct: 234 -VDDNSLQCSHITGLSLRFRSMSGFKGSDQQPERS----MTVIADHLEKTIDEWSTDLQT 288

Query: 296 ---RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL--- 349
              RH+ DY++ F RV+I L  +  D   DT        +P +  ++S +  E   L   
Sbjct: 289 MLDRHIADYRRYFDRVAIHLGSAHDD---DT-------ELPFSAILRSDENKEPHRLEML 338

Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
            E +F FGRYLLISSSRP TQ ANLQGIWN    P W SA   NIN+EMNYW + PC L 
Sbjct: 339 AEAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCALK 398

Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
           E  EPL      L   G   A       G  + H  D+W ++    G+ +WA WP G AW
Sbjct: 399 ELIEPLVSMNEELLAPGHDAADKILGCRGSAVFHNVDLWRRALPANGEPMWAFWPFGQAW 458

Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
           +C +L++ Y +  D  +L  R +P++   A F +D+L E   G L  +P+TSPE+ F+  
Sbjct: 459 MCRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETEHG-LAPSPATSPENCFLV- 515

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKIAE 586
           +G+   V+ SS    AI+R +   +I A+   E L++ + ALV +      +L  T++  
Sbjct: 516 NGEPVSVAQSSENATAIVRNLLDDLIQASHDLENLDEEDSALVREAESVRSQLAETRLGA 575

Query: 587 DGSIMEW 593
           DG I+EW
Sbjct: 576 DGRILEW 582


>gi|429746943|ref|ZP_19280255.1| hypothetical protein HMPREF9078_01397 [Capnocytophaga sp. oral
           taxon 380 str. F0488]
 gi|429164651|gb|EKY06768.1| hypothetical protein HMPREF9078_01397 [Capnocytophaga sp. oral
           taxon 380 str. F0488]
          Length = 799

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 188/605 (31%), Positives = 308/605 (50%), Gaps = 63/605 (10%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + F+ PA HFT++IPIGNGRLGAM++G    + + LNE +LW+G   +  +P+A   L
Sbjct: 16  VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75

Query: 73  SDVRSLVDSGQYAEATAASVKLF---------GHPAD----VYQLLGDIELEFDDSHLKY 119
            +++ L+  G+  EA A   + F         G  A+     YQ+L ++ L++  +    
Sbjct: 76  KEIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
             + Y+R L L+ ATA   +   N    +  F+   + VI  +I  +    L+ ++SL  
Sbjct: 133 PIQDYKRVLRLDEATAVTSFKRDNNSIEQTAFADFKNDVIWVRIKATS--PLNLDISLFR 190

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
             +N +    NN+I + G  P          ND  +G+ F+++++++     G I +   
Sbjct: 191 -KENATITYQNNKITLNGVLP----------NDGKEGMHFASVVDVQTD---GKIESTH- 235

Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
           K + ++ +    L + A ++++   G  ++ S +KK     +   LQ    +S+      
Sbjct: 236 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 289

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL-FQ 355
               +Q LF+R                 +  N + + + ER++ F   E  +L+ +L + 
Sbjct: 290 SSIVFQGLFNRNRWYGK-----------ANANTEGLTTFERLERFYKGEQDALLPILYYN 338

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRYLLISSSR G   ANLQG+W E+    W+   H+NIN++MNYW + P NLS+  EPL
Sbjct: 339 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 398

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
             F   L  NGSKTA+  Y A+GWV H  ++ W  +S       W     GGAWLC H+W
Sbjct: 399 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 457

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
           +HY +T + +FL +  YP+L+   +F  + LI+    GY  T PS SPE+ ++ P   DG
Sbjct: 458 QHYLFTKNINFL-REYYPVLKEATTFFENLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 516

Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
           K  +     + TMDM I+RE+F+    AA++L  +     E    S   + P +I + G 
Sbjct: 517 KKQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGD 575

Query: 590 IMEWV 594
           + EW+
Sbjct: 576 LNEWL 580


>gi|295835067|ref|ZP_06822000.1| fibronectin type III domain-containing protein [Streptomyces sp.
           SPB74]
 gi|197698025|gb|EDY44958.1| fibronectin type III domain-containing protein [Streptomyces sp.
           SPB74]
          Length = 790

 Score =  259 bits (662), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 189/595 (31%), Positives = 286/595 (48%), Gaps = 61/595 (10%)

Query: 15  ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNP 66
           + +  PA  + T ++P+GNG LGA V+G +P+E ++  E TLWTG PG       ++ NP
Sbjct: 53  LRYTAPATDWETQSLPVGNGALGASVFGTLPTEHVQFAEKTLWTGGPGTPGYRYGNWENP 112

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
             P ALS VR+ +++        A+ +L G P   Y   Q  GD  L  D +    +   
Sbjct: 113 R-PDALSSVRADIEARTKITPEDAAARL-GQPRIGYGGHQTFGD--LLIDVAGAPASANG 168

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           Y R LDL    A V Y      F R  F+S PD+V+V   +    GS+  ++   S   +
Sbjct: 169 YSRTLDLAQGLAGVTYPHDGTTFRRTVFASYPDKVLVGHFTADRGGSVELSLRYTSPRQD 228

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
            +     +++ + G                  G++F A  +I++  + GT+SA  D+ L 
Sbjct: 229 FTATASGDRLTLRGAL-------------QDNGMRFEA--QIRLLSEGGTVSANGDR-LT 272

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V G+D A  +L A + +   +  P     DP      A+       Y +L  RH  D+  
Sbjct: 273 VSGADSAWFVLSAGTDYADTY--PGYRGADPHDRVTGAVNQAAARPYRELLDRHTSDHGG 330

Query: 304 LFHRVSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           LF RV + L + S  D  TD   +       +A+R          +L  L FQ+GRYLLI
Sbjct: 331 LFSRVVLDLGQQSAPDQSTDALLKAYTGGNSAADR----------ALEALFFQYGRYLLI 380

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           +SSR G+  ANLQG WN   +P W +  HVNINL+MNYW +   NL+E   P   F+  L
Sbjct: 381 ASSRAGSLPANLQGAWNNSTTPPWSADYHVNINLQMNYWPAEATNLAETTAPYDRFVEAL 440

Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYT 481
            + G  TAQ  + A GWV+H +T  +  +   D     W  +P   AWL + L+EHY + 
Sbjct: 441 RVPGRTTAQSMFGARGWVVHDETTPFGFTGVHDWPTSFW--FPEAAAWLTSQLYEHYRFD 498

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYS 539
              D+L   AYP ++  A F +D L  +  D  L   PS SPEH +F A           
Sbjct: 499 GSTDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPSFSPEHGDFTA----------G 548

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
           + M   I+ E+F+  + AA+ L  ++ A   ++ ++L R+ P  ++   G +MEW
Sbjct: 549 AAMSQQIVHELFTNTLEAAQTL-GDDPAFRGRLKETLDRIDPGLRVGSWGQLMEW 602


>gi|315224299|ref|ZP_07866133.1| possible alpha-L-fucosidase [Capnocytophaga ochracea F0287]
 gi|420159534|ref|ZP_14666333.1| hypothetical protein HMPREF1319_1323 [Capnocytophaga ochracea str.
           Holt 25]
 gi|314945689|gb|EFS97704.1| possible alpha-L-fucosidase [Capnocytophaga ochracea F0287]
 gi|394761875|gb|EJF44190.1| hypothetical protein HMPREF1319_1323 [Capnocytophaga ochracea str.
           Holt 25]
          Length = 799

 Score =  259 bits (662), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 187/602 (31%), Positives = 305/602 (50%), Gaps = 57/602 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + F+ PA HFT++IPIGNGRLGAM++G    + + LNE +LW+G   +  +P+A   L
Sbjct: 16  VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75

Query: 73  SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
            +++ L+  G+  EA A   + F         G  A+     YQ+L ++ L++  +    
Sbjct: 76  KEIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
             + Y+R L L+ ATA   +   N    +  F+   + VI  KI  +    L+ ++SL  
Sbjct: 133 PIQDYKRVLRLDEATAVTSFKRDNNSIEQTAFADFKNDVIWVKIKATSP--LNLDISLFR 190

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
             +N +    NN+I + G  P          N   +G+ F+++++++     G I +   
Sbjct: 191 K-ENATITYQNNKITLNGVLP----------NGGKEGMHFASVVDVQTD---GKIESTH- 235

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
           K + ++ +    L + A ++++  F     S    T ++   LQ    +S+         
Sbjct: 236 KAIAIQSAKEITLRISAVTNYN--FDKGGLSDISVTKKANEYLQKAP-MSFDKAKAESSI 292

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-QFGR 358
            +Q+LF+R                 +  N + + + ER++ F   E  +L+ +L+  FGR
Sbjct: 293 VFQRLFNRNRWYGK-----------ANANTEGLTTFERLERFYKGEQDALLPILYYNFGR 341

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLLISSSR G   ANLQG+W E+    W+   H+NIN++MNYW + P NLS+  EPL  F
Sbjct: 342 YLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPLQRF 401

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
              L  NGSKTA+  Y A+GWV H  ++ W  +S       W     GGAWLC H+W+HY
Sbjct: 402 TKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIWQHY 460

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DGK-- 532
            +T + +FL +  YP+L+   +F  + LI+    GY  T PS SPE+ ++ P   DGK  
Sbjct: 461 LFTKNINFL-REYYPVLKEATTFFENLLIKDPKTGYWVTAPSNSPENAYVLPELKDGKKQ 519

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
           +     + TMDM I+RE+F+    AA++L  +     E    S   + P +I + G + E
Sbjct: 520 IGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGDLNE 578

Query: 593 WV 594
           W+
Sbjct: 579 WL 580


>gi|333022556|ref|ZP_08450620.1| putative fibronectin type III domain-containing protein
           [Streptomyces sp. Tu6071]
 gi|332742408|gb|EGJ72849.1| putative fibronectin type III domain-containing protein
           [Streptomyces sp. Tu6071]
          Length = 783

 Score =  259 bits (662), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 188/598 (31%), Positives = 292/598 (48%), Gaps = 67/598 (11%)

Query: 15  ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNP 66
           + +  PA  + T+++P+GNG LGA V+G +P+E ++  E TLWTG PG       ++ NP
Sbjct: 46  LRYGAPATDWETESLPVGNGALGASVFGTLPTEHIQFAEKTLWTGGPGTSGYRYGNWENP 105

Query: 67  DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
             P AL+ VR+ +++        A+ +L G P   Y   Q  GD+ ++ D +    + + 
Sbjct: 106 R-PDALASVRADIEARTKITPEDAAARL-GQPRIGYGGHQTFGDLLIDVDGA--PGSADG 161

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           Y R LDL  A A V Y      F R  F+S PD+V+V   +    GS+  N+   S   +
Sbjct: 162 YTRTLDLAQALATVSYPHDGTTFRRTVFASCPDKVLVGHFTADRGGSVGLNLRYTSPRQD 221

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
            +     +++ + G                  G++F A  +I++  + G+++A  D+ L 
Sbjct: 222 FTATTDGDRLTVRGAL-------------QDNGMRFEA--QIRLLSEGGSVTANGDR-LT 265

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           V G+D A  +L A + +   +  P     DP     +A+       Y +L  RH  D+  
Sbjct: 266 VSGADSAWFVLSAGTDYADTY--PDYRGADPHDRVTTAVDQAAARPYRELLDRHTSDHAA 323

Query: 304 LFHRVSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSF---QTDEDPSLVELLFQFGRY 359
           LF RV + L + S  D  TD               +K++    + +D +L  L FQ+GRY
Sbjct: 324 LFSRVVLDLGQGSAPDRTTDAL-------------LKAYTGGNSADDRALEALFFQYGRY 370

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LLI+SSR G+  ANLQG WN   +P W +  HVNINL+MNYW +   NL+E   P   F+
Sbjct: 371 LLIASSRAGSLPANLQGAWNNSTAPPWSADYHVNINLQMNYWPAEATNLAETTAPYDRFV 430

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHY 478
             L   G  TA+  + A GWV+H +T  +  +   D     W  +P   AWL + L+EHY
Sbjct: 431 EALRAPGRTTARSMFDARGWVVHDETTPFGFTGVHDWPTSFW--FPEAAAWLTSQLYEHY 488

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACV 536
            +    D+L   AYP ++  A F +D L  +  D  L   PS SPEH +F A        
Sbjct: 489 RFDGSTDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPSFSPEHGDFTA-------- 540

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
              + M   I+RE+F   + AA+ L  ++ A    + ++L R+ P  +I   G +MEW
Sbjct: 541 --GAAMSQQIVRELFLNTLEAAQTL-GDDPAFRTTLKETLDRIDPGLRIGSWGQLMEW 595


>gi|393778744|ref|ZP_10367005.1| hypothetical protein HMPREF1321_0421 [Capnocytophaga sp. oral taxon
           412 str. F0487]
 gi|392611313|gb|EIW94052.1| hypothetical protein HMPREF1321_0421 [Capnocytophaga sp. oral taxon
           412 str. F0487]
          Length = 799

 Score =  259 bits (661), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 188/605 (31%), Positives = 307/605 (50%), Gaps = 63/605 (10%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + F+ PA HFT++IPIGNGRLGAM++G    + + LNE +LW+G   +  +P+A   L
Sbjct: 16  VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75

Query: 73  SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
            +++ L+  G+  EA A   + F         G  A+     YQ+L ++ L++  +    
Sbjct: 76  KEIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
             + Y+R L L+ ATA   +   N    +  F+   + VI  KI  +    L+ ++SL  
Sbjct: 133 PIQDYKRVLRLDEATAVTSFKRDNNSIGQTAFADFKNDVIWVKIKATSP--LNLDISLFR 190

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
             +N +    NN+I + G  P          N   +G+ F+++++++     G I +   
Sbjct: 191 K-ENATITYQNNKITLNGVLP----------NGGKEGMHFASVVDVQTD---GKIESTH- 235

Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
           K + ++ +    L + A ++++   G  ++ S +KK     +   LQ    +S+      
Sbjct: 236 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 289

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-Q 355
               +Q+LF+R                 +  N + + + ER++ F   E  +L+ +L+  
Sbjct: 290 SSIVFQRLFNRNRWYGK-----------ANANTEGLTTFERLERFYKGEQDALLPILYYN 338

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRYLLISSSR G   ANLQG+W E+    W+   H+NIN++MNYW + P NLS+  EPL
Sbjct: 339 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 398

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
             F   L  NGSKTA+  Y A+GWV H  ++ W  +S       W     GGAWLC H+W
Sbjct: 399 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 457

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
           +HY +T + +FL +  YP+L+   +F    LI+    GY  T PS SPE+ ++ P   DG
Sbjct: 458 QHYLFTKNINFL-REYYPVLKEATTFFESLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 516

Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
           K  +     + TMDM I+RE+F+    AA++L  +     E    S   + P +I + G 
Sbjct: 517 KKQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGD 575

Query: 590 IMEWV 594
           + EW+
Sbjct: 576 LNEWL 580


>gi|329923050|ref|ZP_08278566.1| hypothetical protein HMPREF9412_5028 [Paenibacillus sp. HGF5]
 gi|328941823|gb|EGG38108.1| hypothetical protein HMPREF9412_5028 [Paenibacillus sp. HGF5]
          Length = 767

 Score =  258 bits (659), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 193/614 (31%), Positives = 300/614 (48%), Gaps = 70/614 (11%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
           +K+ +  PA+ ++  +PIGNGR+G +V      E   + E T W+G P         KA 
Sbjct: 4   MKLWYTKPAQGWSQGLPIGNGRMGNVVVSTPDREIWNITETTYWSGQPEPAQGRSNSKAD 63

Query: 72  LSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK------ 118
           L  +R     G Y E    + K        FG    + Q++    LEFD  H+K      
Sbjct: 64  LERMRQHFFQGDYREGDRLAKKHLEPEKLNFGTNLGLCQVV----LEFD-HHVKPSEGGR 118

Query: 119 ---YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS-LSFN 174
               AE  + RELDL  A AR    +   E  RE F+S+ DQVIV +I  S   S +SF 
Sbjct: 119 QDAAAEPLFHRELDLQEAVARSLCEIDGAEMAREVFASHADQVIVARIRSSHGSSGVSFR 178

Query: 175 VSLDSLLDN---HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
           +S+    +N   H+ V G + I  +G+   + I          +G+       +++  + 
Sbjct: 179 ISIRG--ENGPFHAVVTGKDTIDFQGQA-WEGIHSNGECGVSCQGL-------LRVVTEG 228

Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN--LS 289
           G +S ++D  + V G+D A +            +N    ++  +    SALQ  +   L 
Sbjct: 229 GQVSCMDDTII-VSGADEAAIYFA---------VNTDYRQEGESWREKSALQLEQAVLLG 278

Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDP 347
           Y +L  +HL DYQ L+ RV + L  S               ++P+ ER+  F+    +D 
Sbjct: 279 YDELKAKHLADYQPLYARVRLDLGSSEHA------------SLPTDERIGRFKQGKRDDQ 326

Query: 348 SLVELLFQFGRYLLISSSRPGTQV-ANLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSL 404
           +L  L +Q+GRYL IS SR  + +  +LQGIWN  E     W    H+++N +MNY+ + 
Sbjct: 327 ALFALFYQYGRYLTISGSRQDSILPMHLQGIWNDGEANKMAWSCDYHLDVNTQMNYFPTE 386

Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
             NLSE  EPL  ++  LS+ G   A+  Y A GWV H  ++ W  +S   G   W L  
Sbjct: 387 AANLSESHEPLMRYIQQLSVAGCSAARHYYDAEGWVAHVFSNAWGFASPGWG-TSWGLNV 445

Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPE 523
            GG W+ THL EHY Y  D+ FLE+ AYP+L+  A+F +D++ +    G+L T PS SPE
Sbjct: 446 TGGLWIATHLIEHYAYNRDQAFLEELAYPVLKEAAAFFMDYMTVHPQYGWLVTGPSNSPE 505

Query: 524 HEFIA--PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
           + F    P+     +S   TMD  ++R++ +  + AA+ L  +E+ L +K   +L +L P
Sbjct: 506 NSFYTSKPEDGHQQLSMGPTMDQVLVRDLLAFCVKAAQTLGVDEE-LQQKWQTALDQLPP 564

Query: 582 TKIAEDGSIMEWVQ 595
             I + G + EW++
Sbjct: 565 LIIGKKGQLQEWLE 578


>gi|375088282|ref|ZP_09734622.1| hypothetical protein HMPREF9703_00704 [Dolosigranulum pigrum ATCC
           51524]
 gi|374562320|gb|EHR33650.1| hypothetical protein HMPREF9703_00704 [Dolosigranulum pigrum ATCC
           51524]
          Length = 820

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 185/611 (30%), Positives = 302/611 (49%), Gaps = 74/611 (12%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG--VPGD--YTNPD---APKALSDVRSL 78
           +A+P+GNG +G+ V+G V  E ++ NE TLW+G   PGD  Y   +       L ++R  
Sbjct: 22  EALPVGNGTMGSKVFGWVGRERIQFNEKTLWSGGPKPGDDSYNGGNLEGKHSVLPEIRQA 81

Query: 79  VDSGQYAEATAASVKLFGHPAD----VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++ G   +A   + +    P       Y   GDI L+F +   +    T Y+R LD++TA
Sbjct: 82  LEDGNTEKAKQLAEEHLVGPNSPEYGRYLSFGDIYLDFTNQSKELESVTDYKRVLDMDTA 141

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL---DSLLDNHS-YVN- 188
           T  V+Y      F R+ F S+PD+V+VT +S      L FN  L     L+D  S +VN 
Sbjct: 142 TTSVRYKEDGTTFKRDTFISHPDKVMVTHLSKEGDKPLEFNAGLYLTKELVDGGSNHVNH 201

Query: 189 ------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                    Q  +E    G  +  K    D+  G++F++ +EI   D  G I  L D  L
Sbjct: 202 YAEKESDYKQATVEYTEKGALL--KGTVRDN--GLEFASYMEI---DTDGVIEVL-DGYL 253

Query: 243 KVEGSDWAVLLLVASSSF-DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           +V G+ +A L+  A +++   P  N  D+  D    + S +Q   + +Y  +   H++D+
Sbjct: 254 RVTGATYATLMTHAVTNYAQNPETNYRDTTMDVAEVAQSTVQQAIDKTYEQVKVDHINDH 313

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q LFHRV + L      + TD               + ++   +  +L EL +Q+GRYLL
Sbjct: 314 QDLFHRVQLDLGAKTSALFTDDL-------------LATYDKQDGRALEELFYQYGRYLL 360

Query: 362 ISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           I+SSRPG     ANLQG+WN   +P W+S  H+N+NL+MNYW +   N++E   PL +F+
Sbjct: 361 ITSSRPGKNALPANLQGVWNAVDNPAWNSDYHMNVNLQMNYWPAYSANMAETALPLINFV 420

Query: 420 TYLSINGSKTAQVNYL--------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLC 471
             L   G + A   Y          +GW+ H +   +  ++       W   P   AW+ 
Sbjct: 421 DDLRYYG-RVAASEYANITSKEGEENGWLAHTQVTPFGWTTPGW-NYYWGWSPAANAWIM 478

Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAP 529
            +++E+Y YT D++FL+++ YP+L+  A F   +L   E  D ++ ++PS SPEH     
Sbjct: 479 QNVYEYYRYTQDKEFLQEKIYPMLKETAKFWNQFLHYDEASDRWV-SSPSYSPEH----- 532

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLE-----KNEDALVEKVLKSLPRLRPTKI 584
                 ++  +T D +++ ++F     A EVL      + +D L+ ++ +   +L+P  I
Sbjct: 533 ----GTITIGNTFDQSLVWQLFHDFKEATEVLRDVEGFRPDDTLLAEISEKFAKLKPLHI 588

Query: 585 AEDGSIMEWVQ 595
             DG I EW +
Sbjct: 589 NNDGHIKEWYE 599


>gi|420150260|ref|ZP_14657420.1| hypothetical protein HMPREF1320_0993 [Capnocytophaga sp. oral taxon
           335 str. F0486]
 gi|394752319|gb|EJF36021.1| hypothetical protein HMPREF1320_0993 [Capnocytophaga sp. oral taxon
           335 str. F0486]
          Length = 799

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 188/605 (31%), Positives = 305/605 (50%), Gaps = 63/605 (10%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           + + F+ PA HFT++IPIGNGRLGAM++G    + + LNE +LW+G   +  +P+A   L
Sbjct: 16  VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75

Query: 73  SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
            +++ L+  G+  EA A   + F         G  A+     YQ+L ++ L++  +    
Sbjct: 76  KEIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
             + Y+R L L+ A A   +   N    +  F+   + VI  KI  +    L+ ++SL  
Sbjct: 133 PIQDYKRVLRLDEAIAVTLFKRDNNSIEQTAFADFKNDVIWVKIKATSP--LNLDISLFR 190

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
             +N +    NN+I + G  P          ND  +G+ F+++++++     G I +   
Sbjct: 191 K-ENATITYQNNKITLNGALP----------NDGKEGMHFASVVDVQTD---GKIESTH- 235

Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
           K + ++ +    L + A ++++   G  ++ S +KK     +   LQ    +S+      
Sbjct: 236 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 289

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-Q 355
               +Q LF+R                 +  N + + + ER+  F   E  +L+ +L+  
Sbjct: 290 SSIVFQGLFNRNRWYGK-----------ANANTEGLTTFERLGRFYKGEQDALLPILYYN 338

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRYLLISSSR G   ANLQG+W E+    W+   H+NIN++MNYW + P NLS+  EPL
Sbjct: 339 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 398

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
             F   L  NGSKTA+  Y A+GWV H  ++ W  +S       W     GGAWLC H+W
Sbjct: 399 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 457

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
           +HY +T + +FL +  YP+L+   +F    LI+    GY  T PS SPE+ ++ P   DG
Sbjct: 458 QHYLFTKNINFL-REYYPVLKEATTFFESLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 516

Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
           K  +     + TMDM I+RE+F+    AA++L  +     E    S   + P +I + G 
Sbjct: 517 KRQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGD 575

Query: 590 IMEWV 594
           + EW+
Sbjct: 576 LNEWL 580


>gi|384196720|ref|YP_005582464.1| hypothetical protein HMPREF9228_0580 [Bifidobacterium breve
           ACS-071-V-Sch8b]
 gi|333110104|gb|AEF27120.1| conserved hypothetical protein [Bifidobacterium breve
           ACS-071-V-Sch8b]
          Length = 783

 Score =  255 bits (651), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 185/609 (30%), Positives = 296/609 (48%), Gaps = 55/609 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+TF+G +  + ++IP+GNGR+GA++     ++ L LN+DTLW+G P   T+P  P+ +
Sbjct: 1   MKLTFDGISSCWEESIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60

Query: 73  SDVR--SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           +  R  SL D    A        L      +Y+  G   +++  S      E+ +R+LDL
Sbjct: 61  AKARQASLQDDYNAATRIIKDATLQEKDEQIYEPFGTARIQYSTS--ADGRESMKRQLDL 118

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
             A A   + +G+     + + S PD ++V ++S      ++ +VS          ++++
Sbjct: 119 ARALAGETFQMGDANVHVDAWCSEPDDLLVYRMSSDAPIDVNISVSGTFLKQSRASMETV 178

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIP----PKANANDDPK---GIQFSAILEIKISDDRGT 233
            D H        +++ GR PG  I     P  N  +D +   G+ ++    + ++   G 
Sbjct: 179 SDGHRAT-----LVVMGRMPGLNIGLLPHPSENPWEDEQDGTGMTYAGAFSLTVT---GG 230

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
              + D  L+        L   + S F G    P  S     +     L+   +   +DL
Sbjct: 231 DVNVGDNSLQCSNITGLSLRFRSMSGFRGSDQQPERS----MTVIADHLEKTIDEWSTDL 286

Query: 294 YT---RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL- 349
            T   RH+ DY++ F RV+I L  +  D   DT        +P +  ++S +  E   L 
Sbjct: 287 RTMLDRHIADYRRYFDRVAIHLGSAHDD---DT-------ELPFSAILRSDEKKEPHRLE 336

Query: 350 --VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
              E +F FGRYLLISSSRP TQ ANLQGIWN    P W SA   NIN+EMNYW + PC 
Sbjct: 337 MLAEAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCA 396

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           L E  EPL      L + G   A       G  + H  D+W ++    G  +W+ WP G 
Sbjct: 397 LQELIEPLVSMNEELLVPGHDAADRILGCRGSAVFHNVDLWRRALPANGDPMWSFWPFGQ 456

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
           AW+C +L++ Y +  D  +L  R +P++   A F +D+L E   G L  +P+TSPE+ F+
Sbjct: 457 AWMCRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETKHG-LAPSPATSPENCFL 514

Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKI 584
             +G+   V+ SS    AI+R +   +I A+   E L++ +  LV +      +L  T++
Sbjct: 515 V-NGEPVSVAQSSENATAIVRNLLDDLIQASHDLENLDEEDRDLVHEAESVRSQLAETRL 573

Query: 585 AEDGSIMEW 593
             DG I+EW
Sbjct: 574 GADGRILEW 582


>gi|225351622|ref|ZP_03742645.1| hypothetical protein BIFPSEUDO_03219 [Bifidobacterium
           pseudocatenulatum DSM 20438 = JCM 1200]
 gi|225157966|gb|EEG71249.1| hypothetical protein BIFPSEUDO_03219 [Bifidobacterium
           pseudocatenulatum DSM 20438 = JCM 1200]
          Length = 783

 Score =  255 bits (651), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 187/606 (30%), Positives = 294/606 (48%), Gaps = 49/606 (8%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+TF+G +  + + IP+GNGR+GA++     ++ L LN+DTLW+G P   T+P  P+ +
Sbjct: 1   MKLTFDGISSCWEEGIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60

Query: 73  SDVRSLVDSGQYAEATA--ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           +  R       YA AT       L      +Y+  G   +++  S      E+ +R+LDL
Sbjct: 61  AKARQASLHDDYATATRIIKEATLQEKDEQIYEPFGTARIQYSTS--ADGRESMKRQLDL 118

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
             A A   + +G+     + + S PD ++V ++S      ++ +VS          L+++
Sbjct: 119 ARALAGETFQMGDANVHVDAWCSEPDDLLVYRMSSDAPIDVNISVSGTFLKQSRASLETV 178

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIP----PKANANDDPK---GIQFSAILEIKISDDRGT 233
            D H        +I+ GR PG  I     P  N  +D +   G+ ++    + ++   G 
Sbjct: 179 SDGHRAT-----LIVMGRMPGLNIGLLPHPSENPWEDEQDGTGMAYAGAFSLTVTG--GD 231

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
           I+ + D  L+        L   + S F G    P  S     +     L+   +   +DL
Sbjct: 232 IN-VGDNSLQCSNITGLSLRFRSMSGFKGSDQQPERS----MTVIADHLEKTIDEWSTDL 286

Query: 294 YT---RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV 350
            T   RH+ DY++ F RV+I L  +  D      S      + S E  +S + +    L 
Sbjct: 287 QTMLDRHIADYRRYFDRVAIHLGSAHADDAELLFSA----ILRSDENKESHRLE---MLA 339

Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
           E +F FGRYLLISSSRP TQ ANLQGIWN    P W SA   NIN+EMNYW + PC L E
Sbjct: 340 EAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCALQE 399

Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
             EPL      L   G   A       G  + H  D+W ++    G  +W+ WP G AW+
Sbjct: 400 LIEPLVSMNEELLAPGHDAADRILGCRGSAVFHNVDLWRRALPANGDPMWSFWPFGQAWM 459

Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
           C +L++ Y +  D  +L  R +P++   A F +D+L E   G L  +P+TSPE+ F+  +
Sbjct: 460 CRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETEHG-LAPSPATSPENCFLV-N 516

Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKIAED 587
           G+   V+ SS    AI+R +   +I A+   E L++ +  LV +      +L  T++  D
Sbjct: 517 GEPVSVAQSSENATAIVRNLLDDLIQASHDLENLDEEDRDLVREAEAVRSQLAETRLGAD 576

Query: 588 GSIMEW 593
           G I+EW
Sbjct: 577 GRILEW 582


>gi|367026916|ref|XP_003662742.1| glycoside hydrolase family 95 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347010011|gb|AEO57497.1| glycoside hydrolase family 95 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 834

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 186/595 (31%), Positives = 286/595 (48%), Gaps = 54/595 (9%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
           F   +PIGNGRL A V+G   +E L LNE+++W+G   D  NP++  A+  +R ++ SG 
Sbjct: 36  FKSTLPIGNGRLAAAVYG-TGTEKLVLNENSVWSGPWLDRANPNSKDAVPKIREMLISGN 94

Query: 84  YAEATAASV-KLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
              A  A++  + G+P         + L  D  H     + Y R LD    TA V Y+  
Sbjct: 95  ITGAGQAALDNMAGNPISPRAYHPLVNLGIDFGHGSGISD-YTRWLDTFQGTAAVNYTYH 153

Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGK 202
              ++RE+ +S P  V+  ++S  + G L+ N SL        +V      + +G   G 
Sbjct: 154 GTSYSREYVASYPHGVLAFRLSADQPGKLNANFSLS----RSQWVLSRRASVSDGEG-GH 208

Query: 203 RIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
            +   A++      I F +  E +I +  G  ++ +   + + G+D   +   A +S+  
Sbjct: 209 TVALSADSGQPSDAITFWS--EARIVNSGGNATS-DGTTVFITGADTVDVFFDAETSYRH 265

Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
           P    +D+ +    E    L +     Y  +    ++D+  L  RV + L  S       
Sbjct: 266 P---DADAAQ---RELKRKLDAAVAAGYPAVRDGAVEDFSSLMGRVRLDLGSS------G 313

Query: 323 TCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR---PGTQVANLQGI 377
           +  E+ + T     R+ +F+ D   DP L+ L+F FGR+LL +SSR   P +  ANLQGI
Sbjct: 314 SAGEQPVPT-----RLSNFRQDPDADPELMTLVFNFGRHLLAASSRDTGPRSLPANLQGI 368

Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY-LA 436
           WN+D  P W S   +NIN+EMNYW +L  NL+E  +PLFD +      G   A+  Y   
Sbjct: 369 WNDDYDPPWQSKYTININIEMNYWPALVTNLAETHKPLFDLIDMAIPRGRDVARTMYGCE 428

Query: 437 SGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
            G+V+HH TD+W  ++  DRG   + +WPMG AWL TH  EHY +T +R FL + A+P+L
Sbjct: 429 RGFVLHHNTDLWGDAAPVDRG-TPYTVWPMGAAWLATHAMEHYRFTRNRTFLAEVAWPVL 487

Query: 496 EGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC-----VSYSSTMDMAIIREV 550
              A F   +L E  D Y  T PS SPEH FI P G         +  S  MD  ++ ++
Sbjct: 488 RETARFYHCYLFE-WDSYWTTGPSLSPEHSFIVPPGMTTAGAAEGLDISPEMDNQLLHQL 546

Query: 551 FSAIISAAEVL-----------EKNEDALVEKVLKSLPRLRPTKI-AEDGSIMEW 593
           F+ +  A   L           + + +         LPR+RP  +    G I EW
Sbjct: 547 FTDVTEACARLGLFSSSSSDDDDDDAETCTTTAETYLPRIRPPAVHPTTGRIQEW 601


>gi|325964568|ref|YP_004242474.1| hypothetical protein Asphe3_32320 [Arthrobacter phenanthrenivorans
           Sphe3]
 gi|323470655|gb|ADX74340.1| hypothetical protein Asphe3_32320 [Arthrobacter phenanthrenivorans
           Sphe3]
          Length = 863

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 197/641 (30%), Positives = 288/641 (44%), Gaps = 76/641 (11%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVP-----SETLKLNEDTLWTGVPGD------ 62
           ++ ++ PA  + +A+P+GNGR GAMV+GG P     S   +LN+ + W+G P        
Sbjct: 6   RLAYDAPAAEWLEALPLGNGRHGAMVFGGSPANGGMSHRFQLNDSSAWSGSPHSQDREPV 65

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE 122
           ++  +A + LS  R L+ SG +A A      L    +  Y       L F D HL  A  
Sbjct: 66  FSREEADRILSGSRRLISSGDFAGAAETLKGLQHRHSQAY-------LPFVDLHLTAAPA 118

Query: 123 T-------------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
                         Y R LDL TA +   Y +       E F S+   V+V  +      
Sbjct: 119 ATPTAGPAAGRPSDYHRGLDLATAVSTNTYCLEGHAVRVEAFISHDPSVLVISLLADAPE 178

Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA-----NDDPKGIQFSAILE 224
            ++ ++ LDS L             +E + P    P           D+   +Q +A + 
Sbjct: 179 GVNLSLRLDSPLRVLRRTEDRGTCSLELKLPSDAAPAHDGGLVEYSEDESLSLQGAAAVS 238

Query: 225 IKISD---DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
                   D    +A     L   G   A + + A+++F G   +P+       +E+   
Sbjct: 239 WAHDGQDVDAPGGTAGHYGGLAATGVRRADVFVTAATTFAGLGRHPAGDAASAAAEARGV 298

Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDT---VPSAERV 338
           L+     S S L  RH + + +L+    I+L         D  + E  DT   + +A   
Sbjct: 299 LELAHAASPSTLKERHQESHSRLYRAAQIEL---------DVPAWEGTDTGRRLLAANAH 349

Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ-----------VANLQGIWNEDLSPTWD 387
                  D  L  LLF +GRYLLISSSRPG              ANLQG+WN +L   W 
Sbjct: 350 PGGPLAADAGLAALLFNYGRYLLISSSRPGPAGSGKGSAWRGVPANLQGLWNAELPAPWS 409

Query: 388 SAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDI 447
           S    NINL+MNYW + P  L+EC  PLF  +  + + G+  A+  Y A GW +HH +DI
Sbjct: 410 SNYTTNINLQMNYWGAEPTGLAECVVPLFALIEAMQVTGAAVAREYYGARGWTVHHNSDI 469

Query: 448 WAKSSA---DRGKVVWALWPMGGAWLCTHLWEHYNY---TMDRD---FLEKRAYPLLEGC 498
           WA +           W+ WPM G WL  HLWEH  +   T+DRD   F    A+P + G 
Sbjct: 470 WAYAKPVGHGAHSPEWSYWPMAGLWLVRHLWEHLQFGAATVDRDKAGFARDAAWPAIRGA 529

Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD---GKL--ACVSYSSTMDMAIIREVFSA 553
           A F LD L E  DG L T PSTSPE+ F A D   G+     V+ SSTMD+ +  +VF  
Sbjct: 530 AEFALDLLAELPDGSLGTGPSTSPENTFAAVDPSSGRRIQGSVAQSSTMDLTLTGDVFRM 589

Query: 554 IISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
           + +    L  + D ++++  ++LPRL   +   DG + EW+
Sbjct: 590 LDALGRDLGMDADPVLDEARRALPRLPAPEPGRDGKLREWL 630


>gi|418172315|ref|ZP_12808932.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19451]
 gi|418196823|ref|ZP_12833294.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47688]
 gi|419426112|ref|ZP_13966303.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7533-05]
 gi|419445683|ref|ZP_13985694.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA19923]
 gi|419447843|ref|ZP_13987844.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7879-04]
 gi|419449944|ref|ZP_13989937.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 4075-00]
 gi|419452089|ref|ZP_13992069.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP02]
 gi|419519881|ref|ZP_14059484.1| alpha-L-fucosidase [Streptococcus pneumoniae GA08825]
 gi|421288567|ref|ZP_15739325.1| alpha-L-fucosidase [Streptococcus pneumoniae GA58771]
 gi|353833518|gb|EHE13628.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19451]
 gi|353858855|gb|EHE38814.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47688]
 gi|379569503|gb|EHZ34473.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA19923]
 gi|379611583|gb|EHZ76306.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7879-04]
 gi|379616518|gb|EHZ81213.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7533-05]
 gi|379620888|gb|EHZ85538.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 4075-00]
 gi|379621308|gb|EHZ85956.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP02]
 gi|379638035|gb|EIA02581.1| alpha-L-fucosidase [Streptococcus pneumoniae GA08825]
 gi|395885199|gb|EJG96226.1| alpha-L-fucosidase [Streptococcus pneumoniae GA58771]
          Length = 739

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 179/568 (31%), Positives = 284/568 (50%), Gaps = 56/568 (9%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
           M++G    E ++LN++T+W     D  NPD+   L  +R  +  G+  +A     + +F 
Sbjct: 1   MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60

Query: 97  HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
            P D   Y+LLG++ +E  D     A   Y RELDL+TA + V +  +  N++  RE+F+
Sbjct: 61  TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFDPNSCNLQIKREYFT 119

Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
           S    ++  +I  S   +L+ N++L  +   ++      ++ I+M     G+        
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171

Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
               KG+QF  +   K++D  G +S L  + + +  +    L L + +++ G        
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTNYWGNI------ 218

Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
                   +S+LQ    ++ Y      H+  YQ+ F+RV  +L  S KD ++       I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263

Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
            T    E  K +       L  LLF +GRYLLISSS+P    ANLQGIW ++L+P W S 
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319

Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
             +NIN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+  HH TD + 
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFG 379

Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
            ++     +  A+W +   WLCTH+WEHY Y  D   L +  + +++    F  D+L E 
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438

Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
            DGYL T PS SPE+++   +G       SST+D  I+R    + I  A+ L  N D + 
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497

Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            V+++ K LP+   TKI  +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522


>gi|418092776|ref|ZP_12729912.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44452]
 gi|353761446|gb|EHD42013.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44452]
          Length = 739

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 179/568 (31%), Positives = 285/568 (50%), Gaps = 56/568 (9%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
           M++G    E ++LN++T+W     D  NPD+   L  +R  +  G+  +A     + +F 
Sbjct: 1   MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60

Query: 97  HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
            P D   Y+LLG++ +E  D     A   Y RELDL+TA + V +  +  N++  RE+F+
Sbjct: 61  TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFDPNSCNLQIKREYFT 119

Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
           S    ++  +I  S   +L+ N++L  +   ++      ++ I+M     G+        
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171

Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
               KG+QF  +   K++D  G +S L  + + +  +    L L + +++ G        
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTNYWGNI------ 218

Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
                   +S+LQ    ++ Y      H+  YQ+ F+RV  +L  S KD ++       I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263

Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
            T    E  K +       L  LLF +GRYLLISSS+P    ANLQGIW ++L+P W S 
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319

Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
             +NIN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+  HH TD ++
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFS 379

Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
            ++     +  A+W +   WLCTH+WEHY Y  D   L +  + +++    F  D+L E 
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438

Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
            DGYL T PS SPE+++   +G       SST+D  I+R    + I  A+ L  N D + 
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497

Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            V+++ K LP+   TKI  +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522


>gi|418079608|ref|ZP_12716827.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 4027-06]
 gi|418087855|ref|ZP_12725020.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47033]
 gi|421286404|ref|ZP_15737176.1| hypothetical protein SPAR162_2099 [Streptococcus pneumoniae
           GA60190]
 gi|353745351|gb|EHD26021.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 4027-06]
 gi|353755532|gb|EHD36135.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47033]
 gi|395884860|gb|EJG95894.1| hypothetical protein SPAR162_2099 [Streptococcus pneumoniae
           GA60190]
          Length = 739

 Score =  253 bits (645), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 179/568 (31%), Positives = 283/568 (49%), Gaps = 56/568 (9%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
           M++G    E ++LN++T+W     D  NPD+   L  +R  +  G+  +A     + +F 
Sbjct: 1   MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60

Query: 97  HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
            P D   Y+LLG++ +E  D     A   Y RELDL+TA + V +  +  N++  RE+F+
Sbjct: 61  TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119

Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
           S    ++  +I  S   +L+ N++L  +   ++      ++ I+M     G+        
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171

Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
               KG+QF  +   K++D  G +S L  + + +  +    L L + + + G        
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218

Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
                   +S+LQ    ++ Y      H+  YQ+ F+RV  +L  S KD ++       I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263

Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
            T    E  K +       L  LLF +GRYLLISSS+P    ANLQGIW ++L+P W S 
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319

Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
             +NIN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+  HH TD + 
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFG 379

Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
            ++     +  A+W +   WLCTH+WEHY Y  D   L +  + +++    F  D+L E 
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438

Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
            DGYL T PS SPE+++   +G       SST+D  I+R    + I  A+ L  N D + 
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497

Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            V+++ K LP+   TKI  +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522


>gi|418239710|ref|ZP_12866256.1| putative alpha-L-fucosidase [Streptococcus pneumoniae
           NorthCarolina6A-23]
 gi|419489948|ref|ZP_14029693.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44386]
 gi|419526922|ref|ZP_14066473.1| hypothetical protein SPAR35_2240 [Streptococcus pneumoniae GA14373]
 gi|353890745|gb|EHE70505.1| putative alpha-L-fucosidase [Streptococcus pneumoniae
           NorthCarolina6A-23]
 gi|379555528|gb|EHZ20595.1| hypothetical protein SPAR35_2240 [Streptococcus pneumoniae GA14373]
 gi|379584934|gb|EHZ49797.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44386]
          Length = 739

 Score =  253 bits (645), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 179/568 (31%), Positives = 284/568 (50%), Gaps = 56/568 (9%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
           M++G    E ++LN++T+W     D  NPD+   L  +R  +  G+  +A     + +F 
Sbjct: 1   MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60

Query: 97  HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
            P D   Y+LLG++ +E  D     A   Y RELDL+TA + V +  +  N++  RE+F+
Sbjct: 61  TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119

Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
           S    ++  +I  S   +L+ N++L  +   ++      ++ I+M     G+        
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171

Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
               KG+QF  +   K++D  G +S L  + + +  +    L L + + + G        
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218

Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
                   +S+LQ    ++ Y      H+  YQ+ F+RV  +L  S KD ++       I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263

Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
            T    E  K +       L  LLF +GRYLLISSS+P    ANLQGIW ++L+P W S 
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319

Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
             +NIN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+  HH TD ++
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFS 379

Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
            ++     +  A+W +   WLCTH+WEHY Y  D   L +  + +++    F  D+L E 
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438

Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
            DGYL T PS SPE+++   +G       SST+D  I+R    + I  A+ L  N D + 
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497

Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            V+++ K LP+   TKI  +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522


>gi|291457532|ref|ZP_06596922.1| putative alpha-L-fucosidase 2 [Bifidobacterium breve DSM 20213 =
           JCM 1192]
 gi|291380585|gb|EFE88103.1| putative alpha-L-fucosidase 2 [Bifidobacterium breve DSM 20213 =
           JCM 1192]
          Length = 783

 Score =  252 bits (644), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 184/609 (30%), Positives = 296/609 (48%), Gaps = 55/609 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+TF+G +  + ++IP+GNGR+GA++     ++ L LN+DTLW+G P   T+P  P+ +
Sbjct: 1   MKLTFDGISSCWEESIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60

Query: 73  SDVR--SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           +  R  SL D    A        L      +Y+  G   +++  S      E+ +R+LDL
Sbjct: 61  AKARQASLQDDYNAATRIIKDATLQEKDEQIYEPFGTARIQYSTS--ADGRESMKRQLDL 118

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
             A A   + +G+     + + S PD ++V ++S      ++ +VS          ++++
Sbjct: 119 ARALAGETFRMGDANVHVDAWCSEPDDLLVYRMSSDAPIDVNISVSGTFLKQSRASMETV 178

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIP----PKANANDDPK---GIQFSAILEIKISDDRGT 233
            D H        +++ GR PG  I     P  N  +D +   G+ ++    + ++   G 
Sbjct: 179 SDGHRAT-----LVVMGRMPGLNIGLLPHPSENPWEDEQDGTGMAYAGAFSLTVT---GG 230

Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
              + D  L+        L   + S F G    P  S     +     L+   +   +DL
Sbjct: 231 DVNVGDNSLQCSNITGLSLRFRSMSGFRGSDQQPERS----MTVIADHLEKTIDEWSTDL 286

Query: 294 YT---RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL- 349
            T   R + DY++ F RV+I L  +  D   DT        +P +  ++S +  E   L 
Sbjct: 287 RTMLDRRIADYRRYFDRVAIHLGSAHDD---DT-------ELPFSAILRSDEKKEPHRLE 336

Query: 350 --VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
              E +F FGRYLLISSSRP TQ ANLQGIWN    P W SA   NIN+EMNYW + PC 
Sbjct: 337 MLAEAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCA 396

Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           L E  EPL      L + G   A       G  + H  D+W ++    G+ +W+ WP G 
Sbjct: 397 LQELIEPLVSMNEELLVPGHDAADRILGCRGSAVFHNVDLWRRALPANGEPMWSFWPFGQ 456

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
           AW+C +L++ Y +  D  +L  R +P++   A F +D+L E   G L  +P+TSPE+ F+
Sbjct: 457 AWMCRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETKHG-LAPSPATSPENCFL 514

Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKI 584
             +G+   V+ SS    AI+R +   +I A+   E L++ +  LV +      +L  T++
Sbjct: 515 V-NGEPVSVAQSSENATAIVRNLLDDLIQASHDLEDLDEEDRDLVHEAESVRSQLAETRL 573

Query: 585 AEDGSIMEW 593
             DG I+EW
Sbjct: 574 GADGRILEW 582


>gi|421290728|ref|ZP_15741475.1| alpha-L-fucosidase [Streptococcus pneumoniae GA54354]
 gi|421306123|ref|ZP_15756774.1| alpha-L-fucosidase [Streptococcus pneumoniae GA62331]
 gi|395885632|gb|EJG96654.1| alpha-L-fucosidase [Streptococcus pneumoniae GA54354]
 gi|395903807|gb|EJH14730.1| alpha-L-fucosidase [Streptococcus pneumoniae GA62331]
          Length = 739

 Score =  251 bits (641), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 178/568 (31%), Positives = 282/568 (49%), Gaps = 56/568 (9%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
           M++G    E ++LN++T+W     D  NPD+   L  +R  +  G+  +A     + +F 
Sbjct: 1   MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60

Query: 97  HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
            P D   Y+LLG++ +E  D     A   Y RELDL+TA + V +  +  N++  RE+F+
Sbjct: 61  TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119

Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
           S    ++  +I  S   +L+ N++L  +   ++      ++ I+M     G+        
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171

Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
               KG+QF  +   K++D  G +S L  + + +  +    L L + + + G        
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218

Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
                   +S+LQ    ++ Y      H+  YQ+ F+RV  +L  S KD ++       I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263

Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
            T    E  K +       L  LLF +GRYLLISSS+P     NLQGIW ++L+P W S 
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPVNLQGIWCDELNPIWGSK 319

Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
             +NIN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+  HH TD + 
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFG 379

Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
            ++     +  A+W +   WLCTH+WEHY Y  D   L +  + +++    F  D+L E 
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438

Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
            DGYL T PS SPE+++   +G       SST+D  I+R    + I  A+ L  N D + 
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497

Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            V+++ K LP+   TKI  +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522


>gi|418188158|ref|ZP_12824676.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47360]
 gi|421271597|ref|ZP_15722447.1| hypothetical protein SPAR48_2212 [Streptococcus pneumoniae SPAR48]
 gi|353847967|gb|EHE27986.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47360]
 gi|395865736|gb|EJG76874.1| hypothetical protein SPAR48_2212 [Streptococcus pneumoniae SPAR48]
          Length = 739

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 177/568 (31%), Positives = 281/568 (49%), Gaps = 56/568 (9%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
           M++G    E ++LN++T+W     D  NPD+   L  +R  +  G+  +A     + +F 
Sbjct: 1   MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTVFA 60

Query: 97  HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
            P D   Y+LLG++ +E  D     A   Y RELDL+TA + V +  +  N++  RE+F+
Sbjct: 61  TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119

Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
           S    ++  +I  S   +L+ N++L  +   ++      ++ I+M     G+        
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171

Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
               KG+QF  +   K++D  G +S L  + + +  +    L L + + + G        
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218

Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
                   +S+LQ    ++ Y      H+  YQ+ F+RV  +L  S   +        +I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCL--------SI 263

Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
            T    E  K +       L  LLF +GRYLLISSS+P    ANLQGIW ++L+P W S 
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319

Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
             +NIN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+  HH TD + 
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFG 379

Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
            ++     +  A+W +   WLCTH+WEHY Y  D   L +  + +++    F  D+L E 
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438

Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
            DGYL T PS SPE+++   +G       SST+D  I+R    + I  A+ L  N D + 
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497

Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            V+++ K LP+   TKI  +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522


>gi|332880351|ref|ZP_08448029.1| hypothetical protein HMPREF9074_03804 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|357047449|ref|ZP_09109054.1| hypothetical protein HMPREF9441_03090 [Paraprevotella clara YIT
           11840]
 gi|332681796|gb|EGJ54715.1| hypothetical protein HMPREF9074_03804 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|355529520|gb|EHG98947.1| hypothetical protein HMPREF9441_03090 [Paraprevotella clara YIT
           11840]
          Length = 746

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 187/595 (31%), Positives = 277/595 (46%), Gaps = 109/595 (18%)

Query: 12  PLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           P+K+ ++ PAK + T A+P+GNG +GAM +GGV  E L+ N+ TLW G            
Sbjct: 25  PMKLWYDEPAKVWMTSALPVGNGGIGAMFFGGVAKEQLQFNDKTLWAG------------ 72

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             S  R                         YQ +GD+  EFD          YRREL L
Sbjct: 73  --STTRR----------------------GAYQNMGDLFFEFDTPE---TCTNYRRELSL 105

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISG-SESGSLSFNVSLDSLLDNHSYVNG 189
           + A  RV Y++  V++ RE+F+SNPD VIV +++     G L+F++ +       + V+G
Sbjct: 106 DDAIGRVSYTIDGVDYLREYFASNPDSVIVVRLTTPGHKGKLNFSLRMQDGRQGMTRVDG 165

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
           +   I                  D    +  A+L+     D G +    D+ L+V+G+D 
Sbjct: 166 HTMTI--------------KGTLDLLSYEAQALLQA----DGGMVETKSDR-LEVKGADA 206

Query: 250 AVLLLVASSSFD--GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
             ++L  +++FD   P     D+ +     S    ++ R  SY  L   HL DYQ LF R
Sbjct: 207 VTVVLTGATNFDLASPTYTRGDAYEIHRRVSARMDKATRK-SYKKLKAAHLADYQPLFAR 265

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
           V + L     D  TD    E+ D               +  L  L FQ+GRYL++ SSR 
Sbjct: 266 VELDLDAEQPDYTTDVLVREHKD---------------NAYLDMLYFQYGRYLMLGSSRG 310

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI--- 424
           G   +NLQG+WN   +P W+   H NIN++MNYW +   NLSEC  P   F+TY+S    
Sbjct: 311 GQLPSNLQGLWNNVNNPAWECDYHSNINVQMNYWPAEVTNLSECYAP---FITYVSTEAL 367

Query: 425 -NGSKTAQVNYL--ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
            +G    QV       GW +H + +I+       G   W +     AW CTHLW+HY YT
Sbjct: 368 KDGGAWQQVARKENCRGWAVHTQNNIF-------GYTDWLINRPANAWYCTHLWQHYAYT 420

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYS 539
           +D+++L   A+P+++    +  D L E  +G L      SPEH    P  DG    V+Y+
Sbjct: 421 LDKEYLRDTAWPVMKVTCQYWFDRLKENAEGRLVAPNEWSPEH---GPWEDG----VAYA 473

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
             +  A+  E     ++AA+VL   +DA V ++ +   RL     I   G I EW
Sbjct: 474 QQLVYALFEET----LAAADVLAV-DDAFVSELKEKFSRLDNGLHIGSWGQIKEW 523


>gi|402847334|ref|ZP_10895629.1| hypothetical protein HMPREF1323_1685 [Porphyromonas sp. oral taxon
           279 str. F0450]
 gi|402266647|gb|EJU16068.1| hypothetical protein HMPREF1323_1685 [Porphyromonas sp. oral taxon
           279 str. F0450]
          Length = 838

 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 180/603 (29%), Positives = 286/603 (47%), Gaps = 55/603 (9%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKA 71
           L   F+ PA    +A+P+GNGRLG +  GGV  + + LNE ++W+G V     N +A K 
Sbjct: 46  LTYFFDRPATSMMEALPLGNGRLGMLSDGGVQHQRITLNESSMWSGSVDSTAWNAEAYKQ 105

Query: 72  LSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLK 118
           L  +R L+ +G+  EA     + F               P   YQ+ G + L +D +   
Sbjct: 106 LPAIRKLLLAGRAKEAEDLIYRTFVCGGVGSGRGQGANTPYGSYQVGGFLHLNWDKAP-- 163

Query: 119 YAEETYRRELDLNTATARVKYSV-GNVEFTREHFSSNPDQVIVTKIS--GSESGSLSFNV 175
                Y R L L+   +R  + V G    T+  +S    +V V  ++    E+   +  +
Sbjct: 164 -ELSGYYRGLSLSEGVSRESFVVDGQAYRTKRLYSVLGREVQVVHLTNHSEEARRDTLRL 222

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
           SL    + H        + + G+ P  +           +G+ + AI+   +    GT+ 
Sbjct: 223 SLSRPENGHPAAEAGF-LTLSGQLPDGK---------GGRGMSY-AIVVRPVLPQGGTLI 271

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
              D+ L V      V L +A ++      N  D +    + S+      + +  ++L+ 
Sbjct: 272 TRGDELLIVNAP--TVELYIAHNT------NYYDKRLPVMARSIEQTLQAKAVGEANLFA 323

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF--QTDEDPSLVELL 353
            H+  +     RV  +             S+  + ++P   R+ ++    + DP+L  L 
Sbjct: 324 EHVQRFTAQMDRVQARF----------LGSDPALSSLPIQRRLIAYYEHPERDPALAALY 373

Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
            Q GRYLLISS+RPG    NLQGIW E +   W+   H+NINL+MNYW +    L E   
Sbjct: 374 MQLGRYLLISSTRPGALPPNLQGIWTETIQAPWNGDYHLNINLQMNYWPAEKGALPETVG 433

Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            L D++  +  +G +TA+  Y A GWV H   ++W + +A      W       AWLC H
Sbjct: 434 ALTDWVESIVPSGERTARTFYRAKGWVTHVLGNVW-QFTAPGEHPSWGATNTSAAWLCEH 492

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGK 532
           L+ HY Y+ DR +LE R YP+++G A F L  L++    GYL   P+TSPE+ +  P GK
Sbjct: 493 LYNHYRYSQDRAYLE-RIYPVMQGAARFFLTTLVKDPKSGYLVNVPTTSPENSYYTPQGK 551

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
              V+  STMD  I+RE+FS    AA  L ++    V+ +  +L +L+PT +  DG IME
Sbjct: 552 AVAVAAGSTMDNQILRELFSTTREAAMTLGRDR-TFVDSLSTALRQLKPTTLGPDGRIME 610

Query: 593 WVQ 595
           W++
Sbjct: 611 WME 613


>gi|317141175|ref|XP_001817567.2| hypothetical protein AOR_1_3054174 [Aspergillus oryzae RIB40]
          Length = 770

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 171/563 (30%), Positives = 277/563 (49%), Gaps = 59/563 (10%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ P   F  ++P+GNGRLG  ++  +P+E +  NED++W+G   D  N +A      VR
Sbjct: 34  YDTPGTRFNASLPVGNGRLGGTLYC-LPTEIVTWNEDSVWSGTFQDRVNSNALDGFPKVR 92

Query: 77  SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           +L+ +G    A   ++  + G   D   YQ+L ++ ++            Y   L+  TA
Sbjct: 93  NLLVNGNITAAGELALSDMTGSSVDQREYQVLSNLYVDLGQRGDATNLVWYLDTLEGYTA 152

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
               +Y    V +TRE  +S P  V+  +I  + S +++ N          +  NG   I
Sbjct: 153 ---CEYGFDGVSYTRELIASAPSGVLGFRIQTNTSRAINLN----------AVANGIASI 199

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
           +M+ R              +     F+A + + +  D G ++A  DK L V G+   V  
Sbjct: 200 VMKART------------GEADYSTFTAGVRVVV--DGGNVTANGDK-LYVTGATTVVFF 244

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
           L A SS+         +  D  +E    L +   L Y  L    + D++ L  RV++ L 
Sbjct: 245 LDAESSYR------YATDSDQETELNRKLDAATELGYEALRKEAITDHKDLAGRVTLDLG 298

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSSRPGTQV 371
            S  D  +          +P  ER+ ++++  D D     L+F +GR+LLI+SSR   + 
Sbjct: 299 SSTDDAAS----------LPPNERMTNYRSSPDHDVQFATLVFNYGRHLLIASSRRTRER 348

Query: 372 A---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           +    LQGIWN+D SP+W +   VNINLEMNYW +   NL+E   PL+D L  +   G  
Sbjct: 349 SLSPGLQGIWNQDYSPSWGAKYTVNINLEMNYWPAETTNLNELTSPLWDLLALIQERGGD 408

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
            A+  +   G+V+HH TD+W  S        +++WPMGGAWL  H+ EHY +T D+ FL+
Sbjct: 409 VAEKMHGCPGFVLHHNTDLWGDSVPVHNGTKYSIWPMGGAWLALHMMEHYRFTGDKTFLK 468

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMD 543
           ++A P+ +    F   +L +  DGYL T PS SPE+ F  P      GK   ++ S T+D
Sbjct: 469 EQACPIFKSAFEFFECYLFD-VDGYLTTGPSCSPENAFQIPSDMTVAGKEEALTMSPTLD 527

Query: 544 MAIIREVFSAIISAAEVLEKNED 566
            +++ E+ +A+    ++LE + D
Sbjct: 528 NSMLFELLTALNETHQILEIDND 550


>gi|405761776|ref|YP_006702372.1| hypothetical protein SPNA45_02013 [Streptococcus pneumoniae SPNA45]
 gi|404278665|emb|CCM09296.1| conserved hypothetical protein [Streptococcus pneumoniae SPNA45]
          Length = 739

 Score =  249 bits (636), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 177/568 (31%), Positives = 282/568 (49%), Gaps = 56/568 (9%)

Query: 38  MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
           M++G    E ++LN++T+W     +  NPD+   L  +R  +  G+  +A     + +F 
Sbjct: 1   MIYGSATKECIQLNDETIWYRGKSNRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60

Query: 97  HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
            P D   Y+LLG++ +E  D     A   Y RELDL+TA + V +  +  N++  RE+F+
Sbjct: 61  TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119

Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
           S    ++  +I  S   +L+ N++L  +   ++      ++ I+M     G+        
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171

Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
               KG+QF  +   K++D  G +S L  + + +  +    L L + + + G        
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218

Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
                   +S+LQ    ++ Y      H+  YQ+ F+RV  +L  S KD ++       I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263

Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
            T    E  K +       L  LLF +GRYLLISSS+P    ANLQGIW ++L+P W S 
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319

Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
             +NIN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+  HH TD + 
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERIREPGRLTAKKMYGARGFTAHHNTDGFG 379

Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
            ++     +  A+W +   WLCTH+WEHY Y  D   L +  + +++    F  D+L E 
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438

Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
            DGYL   PS SPE+++   +G       SST+D  I+R    + I  A+ L  N D + 
Sbjct: 439 -DGYLMIGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497

Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            V+++ K LP+   TKI  +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522


>gi|330998117|ref|ZP_08321945.1| hypothetical protein HMPREF9442_03053 [Paraprevotella xylaniphila
           YIT 11841]
 gi|329569206|gb|EGG50997.1| hypothetical protein HMPREF9442_03053 [Paraprevotella xylaniphila
           YIT 11841]
          Length = 746

 Score =  249 bits (635), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 185/596 (31%), Positives = 274/596 (45%), Gaps = 111/596 (18%)

Query: 12  PLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           P+K+ ++ PAK + T A+P+GNG +GAM +GGV  E L+ N+ TLW G            
Sbjct: 25  PMKLWYDEPAKVWMTSALPVGNGGIGAMFFGGVAKERLQFNDKTLWAG------------ 72

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
             S  R                         YQ +GD+  EFD          YRREL L
Sbjct: 73  --STTRR----------------------GAYQNMGDLFFEFDTPE---TCTNYRRELSL 105

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISG-SESGSLSFNVSLDSLLDNHSYVNG 189
           + A  RV Y++  V++ RE+F+SNPD VIV +++     G L+F++ +       + V+G
Sbjct: 106 DDAIGRVSYTIDGVDYLREYFASNPDSVIVVRLTTPRHKGKLNFSLRMQDGRQGMTRVDG 165

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQ--FSAILEIKISDDRGTISALEDKKLKVEGS 247
           +   I                    KG     S   + ++  D G +    D+ L+V+G+
Sbjct: 166 HTMTI--------------------KGTLDLLSYEAQARLQADGGMVETKSDR-LEVKGA 204

Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA-LQSIRNLSYSDLYTRHLDDYQKLFH 306
           D   ++L  +++FD      +    D     +SA +      SY  L   HL DYQ LF 
Sbjct: 205 DAVTVVLTGATNFDLASPTYTRGDADEIHRRVSARMDKAARKSYKKLKAVHLADYQPLFA 264

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           RV + L     D  TD    E+ D               +  L  L FQ+GRYL++ SSR
Sbjct: 265 RVELDLDAEQPDYTTDVLVREHKD---------------NAYLDMLYFQYGRYLMLGSSR 309

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-- 424
            G   +NLQG+WN   +P W+   H NIN++MNYW +   NLSEC  P   F+TY+S   
Sbjct: 310 GGQLPSNLQGLWNNVNNPAWECDYHSNINVQMNYWPAEVANLSECYAP---FITYVSTEA 366

Query: 425 --NGSKTAQVNYL--ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
             +G    QV       GW +H + +I+       G   W +     AW CTHLW+HY Y
Sbjct: 367 LKDGGSWQQVARKENCRGWAVHTQNNIF-------GYTDWLINRPANAWYCTHLWQHYAY 419

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSY 538
           T+D+++L   A+P+++    +  D L E  +G L      SPEH    P  DG    V+Y
Sbjct: 420 TLDKEYLRDTAWPVMKVTCQYWFDRLKENTEGRLVAPNEWSPEH---GPWEDG----VAY 472

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
           +  +  A+  E     ++AA VL   +DA V ++ +   RL     +   G I EW
Sbjct: 473 AQQLVYALFEET----LAAAGVLAV-DDAFVSELKEKFSRLDNGLHVGSWGQIKEW 523


>gi|29347187|ref|NP_810690.1| hypothetical protein BT_1777 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29339086|gb|AAO76884.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 1019

 Score =  249 bits (635), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 166/497 (33%), Positives = 261/497 (52%), Gaps = 35/497 (7%)

Query: 109 ELEFDDS-HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI-SGS 166
           EL  D S  L Y++  Y R LD++ A   V Y    + F RE+F S PD V+V ++ S S
Sbjct: 324 ELSIDASTELPYSD--YTRTLDIDNAIHTVMYKENGITFKREYFMSYPDNVMVMRLTSDS 381

Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
           + G LS  +SL+SL  + +     + I M G  P      K   +    G++++  L +K
Sbjct: 382 KKGKLSRIISLESLHTDKTITADGHTITMTGY-PTPVSGDKRVGDAWKNGLKYAQQLVVK 440

Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKKDPTSESMSALQS 284
             +  G IS ++  KLKVE +D  ++L+ A++++     +  +  S++DP  +  + L  
Sbjct: 441 --NKGGKISVVDGTKLKVEDADEIIVLMSAATNYVQCMDDSYNYFSQEDPLEKVQATLHK 498

Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE-ENIDTVPSAERVKSFQT 343
           + +  Y+ L   H  DY  L+ R+ + L   P+  V  T S  + +D   ++E+      
Sbjct: 499 VADKKYTALLATHQKDYHSLYDRMRLNLGNLPEAPVAPTDSLLKGMDENTNSEQ------ 552

Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
            E+  L  L FQFGRYLLISSSR G+  ANLQG+W E LS  W++  H NIN++MNYW +
Sbjct: 553 -ENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNADYHTNINIQMNYWPT 611

Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADRGK 457
            P NLS C  P+ +++  L   G  TAQ  Y         GWV HH+ +IW  ++A   K
Sbjct: 612 QPTNLSPCHLPMVEYVRSLVPRGKYTAQQYYCKPDGGNVRGWVTHHENNIW-DNTAPAKK 670

Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
                +P G  W+C  +WE+Y + +D+DFL+K    +L+    ++ +   +  DG L  N
Sbjct: 671 STPHHFPAGAIWMCQDIWEYYQFNLDKDFLKKYYDTMLDAALFWVDNLWTDERDGTLVAN 730

Query: 518 PSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
           PS SPEH EF      L C     +   A+I E+F  +I A++ L +++D  + ++  ++
Sbjct: 731 PSHSPEHGEF-----SLGC-----STSQAMICEMFDMMIKASKELGRDKDPEIIEIATAM 780

Query: 577 PRLRPTKIAEDGSIMEW 593
            +L   KI   G  MEW
Sbjct: 781 SKLSGPKIGLGGQFMEW 797



 Score = 58.5 bits (140), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 42/63 (66%), Gaps = 1/63 (1%)

Query: 1  MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
          MM          LK T+N PAK++ ++A+PIGNG +GAM++G V  + ++ NE TLW+G 
Sbjct: 23 MMACSEQPHQPTLKATYNKPAKNWESEALPIGNGYMGAMIFGDVYVDVIQTNEHTLWSGG 82

Query: 60 PGD 62
          PG+
Sbjct: 83 PGE 85


>gi|225016900|ref|ZP_03706092.1| hypothetical protein CLOSTMETH_00813 [Clostridium methylpentosum
           DSM 5476]
 gi|224950294|gb|EEG31503.1| hypothetical protein CLOSTMETH_00813 [Clostridium methylpentosum
           DSM 5476]
          Length = 1565

 Score =  248 bits (634), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 188/640 (29%), Positives = 303/640 (47%), Gaps = 89/640 (13%)

Query: 6   STSTTNPLKITFNGPAKHFTDA-------IPIGNGRLGAMVWGGVPSETLKLNEDTLWTG 58
           S   TNPL++ +  PA   TD+       +P+GNG +G MV+GG+  E +  NE ++WTG
Sbjct: 38  SVRNTNPLRLWYTKPAPVNTDSKQWQYTVLPLGNGYMGGMVFGGISKERVHFNEKSMWTG 97

Query: 59  VPG---------DYTNPDAPKALSDVRSLVDSGQY----AEATAASVKLF----GHPAD- 100
            P          + T P   + L + R+ +D          ++A + KL     G   D 
Sbjct: 98  GPSASRPNHNGSNRTEPVTTEWLDEFRAELDDKTNDVWGLSSSAGNNKLLDLIRGPKRDN 157

Query: 101 ------VYQLLGDIELEFDDSHLKY-AEETYRRELDLNTATARVKYSVGNVEFTREHFSS 153
                 +YQ  GDI ++F  + +     E Y R+LDL TA + V Y +G V +TRE+F+S
Sbjct: 158 WDNGMGMYQDFGDIYMDFARAGITDDMAENYVRDLDLTTALSTVSYDIGGVHYTREYFNS 217

Query: 154 NPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDD 213
            PD V+  +++ SE+G L+F+ S+       S  + N  +  EG     R   + N    
Sbjct: 218 YPDNVLAMRLNASEAGKLTFDASITPA---SSTSSTNRTVTAEGDIITLRGQIRDNQ--- 271

Query: 214 PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
              +Q+ A  ++K+ ++ GT+ A ED  + ++G+D   L+L   + +   +  P    +D
Sbjct: 272 ---LQYEA--QLKVLNEGGTLKANEDGTISIDGADSVTLILACGTDYKNEW--PKYRGED 324

Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
           P     + + +  +  +  LY  HL+DYQ+LF RV + L              E +  +P
Sbjct: 325 PHEAISARIDNAADKGFDALYQTHLEDYQELFSRVDLDLG-------------EELPNIP 371

Query: 334 SAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN-EDLSPTWDSAPH 391
           + E +++++  E + SL  L +Q GRYL I+ SR  T   NL G+W     S  W++  H
Sbjct: 372 TDELIQNYRDGEHNKSLEVLTYQMGRYLTIAGSRENTLPTNLNGVWMIGSASQFWNADYH 431

Query: 392 VNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS-----------GWV 440
            N+N +MNYW ++  NL+EC  P  D++  L   G  TA      S           G+ 
Sbjct: 432 FNVNFQMNYWPTMAANLAECMLPYNDYMESLVEPGRVTAGATAGLSTEPGTPIGEGNGFN 491

Query: 441 IHHKTDIWAKSSADRGKVVWALWPMGGA-WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
            H   +I+  +     +V    W +GGA W   + +++Y YT D D+L  + YP+L+  A
Sbjct: 492 AHTVNNIFGTTGP--YQVQEFGWTLGGASWALENSYDYYAYTQDEDYLRDKIYPMLKEQA 549

Query: 500 SFLLDWLIEGHDGY---LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
           +F   +L   H  Y   L   PS SPE             +  ST D +I  E F   I+
Sbjct: 550 TFYSKFLW--HSDYQNRLVVGPSVSPEQ---------GPTTNGSTFDQSIAWEAFEEAIN 598

Query: 557 AAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQR 596
           A+E L  +ED L     +   +L P  + ++G I EW + 
Sbjct: 599 ASEALGVDED-LRATWAEMQSQLNPIIVGDEGQIKEWYEE 637


>gi|383125191|ref|ZP_09945845.1| hypothetical protein BSIG_4345 [Bacteroides sp. 1_1_6]
 gi|382983436|gb|EES66608.2| hypothetical protein BSIG_4345 [Bacteroides sp. 1_1_6]
          Length = 1019

 Score =  248 bits (633), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 165/497 (33%), Positives = 260/497 (52%), Gaps = 35/497 (7%)

Query: 109 ELEFDDS-HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI-SGS 166
           EL  D S  L Y++  Y R LD++ A   V Y    + F RE+F S PD V+V ++ S S
Sbjct: 324 ELSIDASTELPYSD--YTRTLDIDNAIHTVMYKENGITFKREYFMSYPDNVMVMRLTSDS 381

Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
           + G LS  +SL+SL  + +     + I M G  P      K   +    G+ ++  L +K
Sbjct: 382 KKGKLSRIISLESLHTDKTITADGHTITMTGY-PTPVSGDKRVGDAWKNGLIYAQQLVVK 440

Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKKDPTSESMSALQS 284
             +  G IS ++  KLKVE +D  ++L+ A++++     +  +  S++DP  +  + L  
Sbjct: 441 --NKGGKISVVDGTKLKVEDADEIIVLMSAATNYVQCMDDSYNYFSQEDPLEKVQATLHK 498

Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE-ENIDTVPSAERVKSFQT 343
           + +  Y+ L   H  DY  L+ R+ + L   P+  V  T S  + +D   ++E+      
Sbjct: 499 VADKKYTALLATHQKDYHSLYDRMRLNLGNLPEAPVAPTDSLLKGMDENTNSEQ------ 552

Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
            E+  L  L FQFGRYLLISSSR G+  ANLQG+W E LS  W++  H NIN++MNYW +
Sbjct: 553 -ENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNADYHTNINIQMNYWPT 611

Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADRGK 457
            P NLS C  P+ +++  L   G  TAQ  Y         GWV HH+ +IW  ++  + K
Sbjct: 612 QPTNLSPCHLPMVEYVRSLVPRGKYTAQQYYCKPDGGNVRGWVTHHENNIWGNTAPAK-K 670

Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
                +P G  W+C  +WE+Y + +D+DFL+K    +L+    ++ +   +  DG L  N
Sbjct: 671 STPHHFPAGAIWMCQDIWEYYQFNLDKDFLKKYYDTMLDAALFWVDNLWTDERDGTLVAN 730

Query: 518 PSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
           PS SPEH EF      L C     +   A+I E+F  +I A++ L +++D  + ++  ++
Sbjct: 731 PSHSPEHGEF-----SLGC-----STSQAMICEMFDMMIKASKELGRDKDPEIIEIATAM 780

Query: 577 PRLRPTKIAEDGSIMEW 593
            +L   KI   G  MEW
Sbjct: 781 SKLSGPKIGLGGQFMEW 797



 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 30/63 (47%), Positives = 44/63 (69%), Gaps = 2/63 (3%)

Query: 2  MNAESTSTTNP-LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
          M A S     P LK T+N PAK++ ++A+PIGNG +GAM++G V  + ++ NE TLW+G 
Sbjct: 23 MTACSEQPYQPTLKATYNKPAKNWESEALPIGNGYMGAMIFGDVYVDVIQTNEHTLWSGG 82

Query: 60 PGD 62
          PG+
Sbjct: 83 PGE 85


>gi|365122414|ref|ZP_09339317.1| hypothetical protein HMPREF1033_02663 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363642654|gb|EHL82000.1| hypothetical protein HMPREF1033_02663 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 837

 Score =  248 bits (633), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 185/602 (30%), Positives = 288/602 (47%), Gaps = 76/602 (12%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR-S 77
           + PIGNG  G  + G V +E + LNE +LW G P       Y    N +  K L  +R S
Sbjct: 79  SFPIGNGSFGGNILGSVKTERITLNEKSLWKGGPNVSGGARYYWDANKEGYKVLDQIRHS 138

Query: 78  LVD-SGQYAEATAASVKLF----GHPADV--------YQLLGDIELEFDDSHLKYAE-ET 123
            +  SG  + AT  +   F    G+  D         +  +G+  +   D+ +  +E   
Sbjct: 139 FIQFSGINSVATELTRNNFNGKCGYEPDSEKSFRFGSFTTMGEFHI---DTGIAESEISD 195

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLL 181
           YRR L L++A   V+++ G   F R+ FSS PD +++ +   +  G  +L+F    +   
Sbjct: 196 YRRILSLDSALVVVQFNAGGDCFYRKFFSSYPDSIMIYRFECTRPGRQNLTFRYVANPQA 255

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
                 +G   I+  GR              D  G+QF  ++ ++   + GT++ +E+  
Sbjct: 256 SGSVEADGTAGIVYNGRL-------------DSNGMQF--VIRVRAVAESGTVT-VENGA 299

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINP--SDSKK----DPTSESMSALQSIRNLSYSDLYT 295
           +KV G+D     +   + +   + NP  +D +     DP   + + L       Y  +Y 
Sbjct: 300 IKVIGADNVTFYVAGDTDYKMNY-NPDFNDDRAYVGVDPVMTTQNNLDFALAKGYDAVYN 358

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLF 354
            H  DY  LF RV I L+ S  + V+D         +P+  R+ +++    D  L EL F
Sbjct: 359 AHRADYSALFDRVKIDLNES--NPVSD---------IPTDMRLSNYRNGISDHYLEELYF 407

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           QFGRYLLI+SSR G   ANLQG+W+ ++   W    H NINL+MNYW + P NLSECQ P
Sbjct: 408 QFGRYLLIASSRAGNLPANLQGLWHNNVEGPWRVDYHNNINLQMNYWPACPANLSECQTP 467

Query: 415 LFDFLTYLSINGSKTAQVNYL--ASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLC 471
           L +++  L   G +TA+  Y     GW     ++I+  +S    + + W    + G WL 
Sbjct: 468 LIEYIRTLVKPGERTAKAYYGPDTRGWTTSVSSNIFGFTSPLSSRDMSWNFSFVAGPWLA 527

Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
           TH+WE+Y+YT D DFL    Y L++G A F +D L    DG     PSTSPEH       
Sbjct: 528 THVWEYYDYTRDEDFLRTTGYELIKGSAEFAVDHLWHKPDGSYAAAPSTSPEH------- 580

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
               V   +T   A++RE+    I  +++L+ +     E+  + L +L P +I   G +M
Sbjct: 581 --GPVDQGATFAHAVVREILLDAIETSKILDVDASER-EEWQEVLNKLMPYEIGRYGQLM 637

Query: 592 EW 593
           EW
Sbjct: 638 EW 639


>gi|417920435|ref|ZP_12563942.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
           australis ATCC 700641]
 gi|342829385|gb|EGU63741.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
           australis ATCC 700641]
          Length = 1209

 Score =  247 bits (630), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 192/636 (30%), Positives = 307/636 (48%), Gaps = 107/636 (16%)

Query: 14  KITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------- 60
           ++T+N PA    D     A+P+GNG +GA V+G +  E ++ NE TLW+G P        
Sbjct: 123 QLTYNQPATPSYDGWEKQALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYN 182

Query: 61  -GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDS 115
            G+Y   D  K L+++R  +++G   +A   + +    P +     Y   GDI + F++ 
Sbjct: 183 GGNY--EDRHKVLAEIRKALEAGDRQKAKQLAEENLVGPNNAQYGRYLSFGDIFMVFNNQ 240

Query: 116 HLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF- 173
                  T Y R LD+  A     YS     F RE FSS PD V VT +S     +L F 
Sbjct: 241 KKGLENVTDYHRALDITQAITTTAYSQDGTTFKRETFSSYPDDVTVTHLSKKGDKTLDFT 300

Query: 174 --NVSLDSLLDNHSY------------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQF 219
             N   + LL N  Y               +N I+++G         K N      G+QF
Sbjct: 301 LWNSLTEDLLANGDYSWEYSNYKQGAVTTDSNGILLKGTV-------KDN------GLQF 347

Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSES 278
           ++ L IK     G ++A +D  L V G+ +A LLL A ++F     NP ++ +KD   E+
Sbjct: 348 ASYLGIKTD---GQVTA-QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDVEN 400

Query: 279 M--SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
              S +++ +   Y  L   H++DYQ LF+RV + L  S               T  + E
Sbjct: 401 TVKSIVEAAKAKDYETLKHDHIEDYQSLFNRVQLNLGGSKS-------------TQTTKE 447

Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNI 394
            ++++  ++   L EL FQ+GRYL+ISSSR  T    ANLQG+WN   +P W+S  H+N+
Sbjct: 448 ALQTYNPEKGQQLEELFFQYGRYLIISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNV 507

Query: 395 NLEMNYWQSLPCNLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGWVIHH 443
           NL+MNYW +   NL+E   P+ +++  L   G           SK  Q N    GW++H 
Sbjct: 508 NLQMNYWPAYMSNLAETARPMVNYIDDLRYYGRIAAKEYAGIESKEGQEN----GWLVHT 563

Query: 444 KTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASF 501
           +     W     D     W   P   AW+  +++++Y +T D  +L+++ YP+L+  A F
Sbjct: 564 QATPFGWTTPGWD---YYWGWSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKF 620

Query: 502 LLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
              +L   +  D ++ ++PS SPEH           ++  +T D +++ ++F   + AA 
Sbjct: 621 WNSFLHYDQTSDRWV-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEAAN 670

Query: 560 VLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            L  ++D LV +V     +L+P  I ++G I EW +
Sbjct: 671 HLNVDQD-LVTEVKTKFDKLKPLHINQEGRIKEWYE 705


>gi|319946487|ref|ZP_08020723.1| alpha-L-fucosidase [Streptococcus australis ATCC 700641]
 gi|319747318|gb|EFV99575.1| alpha-L-fucosidase [Streptococcus australis ATCC 700641]
          Length = 1643

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 192/636 (30%), Positives = 307/636 (48%), Gaps = 107/636 (16%)

Query: 14  KITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------- 60
           ++T+N PA    D     A+P+GNG +GA V+G +  E ++ NE TLW+G P        
Sbjct: 148 QLTYNQPATPSYDGWEKQALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYN 207

Query: 61  -GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDS 115
            G+Y   D  K L+++R  +++G   +A   + +    P +     Y   GDI + F++ 
Sbjct: 208 GGNYE--DRHKVLAEIRKALEAGDRQKAKQLAEENLVGPNNAQYGRYLSFGDIFMVFNNQ 265

Query: 116 HLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF- 173
                  T Y R LD+  A     YS     F RE FSS PD V VT +S     +L F 
Sbjct: 266 KKGLENVTDYHRALDITQAITTTAYSQDGTTFKRETFSSYPDDVTVTHLSKKGDKTLDFT 325

Query: 174 --NVSLDSLLDNHSY------------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQF 219
             N   + LL N  Y               +N I+++G         K N      G+QF
Sbjct: 326 LWNSLTEDLLANGDYSWEYSNYKQGAVTTDSNGILLKGTV-------KDN------GLQF 372

Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSES 278
           ++ L IK     G ++A +D  L V G+ +A LLL A ++F     NP ++ +KD   E+
Sbjct: 373 ASYLGIKTD---GQVTA-QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDVEN 425

Query: 279 M--SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
              S +++ +   Y  L   H++DYQ LF+RV + L  S               T  + E
Sbjct: 426 TVKSIVEAAKAKDYETLKHDHIEDYQSLFNRVQLNLGGSKS-------------TQTTKE 472

Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNI 394
            ++++  ++   L EL FQ+GRYL+ISSSR  T    ANLQG+WN   +P W+S  H+N+
Sbjct: 473 ALQTYNPEKGQQLEELFFQYGRYLIISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNV 532

Query: 395 NLEMNYWQSLPCNLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGWVIHH 443
           NL+MNYW +   NL+E   P+ +++  L   G           SK  Q N    GW++H 
Sbjct: 533 NLQMNYWPAYMSNLAETARPMVNYIDDLRYYGRIAAKEYAGIESKEGQEN----GWLVHT 588

Query: 444 KTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASF 501
           +     W     D     W   P   AW+  +++++Y +T D  +L+++ YP+L+  A F
Sbjct: 589 QATPFGWTTPGWD---YYWGWSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKF 645

Query: 502 LLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
              +L   +  D ++ ++PS SPEH           ++  +T D +++ ++F   + AA 
Sbjct: 646 WNSFLHYDQTSDRWV-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEAAN 695

Query: 560 VLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            L  ++D LV +V     +L+P  I ++G I EW +
Sbjct: 696 HLNVDQD-LVTEVKTKFDKLKPLHINQEGRIKEWYE 730


>gi|429740665|ref|ZP_19274345.1| hypothetical protein HMPREF9134_00217 [Porphyromonas catoniae
           F0037]
 gi|429160458|gb|EKY02921.1| hypothetical protein HMPREF9134_00217 [Porphyromonas catoniae
           F0037]
          Length = 837

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 177/597 (29%), Positives = 293/597 (49%), Gaps = 54/597 (9%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKALSDV 75
           F+ PA+   + +P+GNGRLG +  G +  + + LNE ++W+G +     N DA K L  +
Sbjct: 48  FDRPAESMMEELPLGNGRLGMLSDGALRHQRVTLNESSMWSGSIDSLALNRDAAKHLPKI 107

Query: 76  RSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLKYAEE 122
           R L+ +G++ +A     K F               P   Y++ G + L++          
Sbjct: 108 RELLFAGRHKDAEELIYKTFVCGGKGSGQGAGAKVPYGSYEVGGFLHLDWGRD---IPSP 164

Query: 123 TYRRELDLNTATARVKYSV-GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-SL 180
           +Y+R LDL    +       G     + +++S    V V  I      + +  + L  S 
Sbjct: 165 SYKRSLDLTYGISTETIETWGQPYRMKTYYTSYTHDVNVITIYNQAISARTDTLRLSLSR 224

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
            +N +    +  + + G  P  +           +G+ ++ + +  +    G + +  ++
Sbjct: 225 PENGTSTVSDGLLTLSGDLPNGK---------GGEGLHYAIVAKPYLLHG-GKVISRGNE 274

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            L V  S   + +L+A ++    + NP  S   P +  +  +     ++ + L   H   
Sbjct: 275 LLIVNAS--VIQILIAHNTN---YYNPQLS---PIAHGVEQIVKAAGITSAILERDHRAA 326

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGR 358
           +     RVS+++ +            EN+   P  +R++++  D   DP+L  L  QFGR
Sbjct: 327 FSSQMGRVSMRIGKG-------NAKAENL---PIDKRLEAYHKDPQSDPNLASLYMQFGR 376

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLL+SS+R G    NLQGIW   +   W+S  H+NINL+MNYW S   NLSE   PL  +
Sbjct: 377 YLLLSSTRKGALPPNLQGIWTNLIQAPWNSDYHLNINLQMNYWPSEKGNLSETVLPLTSW 436

Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +  L  +G +TA+  Y   GWV H   ++W  ++       W     G AWLC HL+ HY
Sbjct: 437 VEGLLPSGRETARAFYGGKGWVTHILGNVWGFTAPGE-HPSWGATNTGAAWLCQHLFNHY 495

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
            YT DR++L +R YP+L+G + F L  L+ + ++GYL T P+TSPE+ ++APD  +  VS
Sbjct: 496 LYTQDREYL-RRIYPILKGASQFFLSTLVRDPNNGYLVTAPTTSPENHYLAPDSSVVAVS 554

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             STMD  IIRE+F+   ++A  L   E    + ++++L  L PT IA DG IMEW+
Sbjct: 555 AGSTMDNQIIRELFTNTRTSALAL--GERVFADTLVRTLSELMPTTIAPDGRIMEWL 609


>gi|145251710|ref|XP_001397368.1| hypothetical protein ANI_1_1356144 [Aspergillus niger CBS 513.88]
 gi|134082905|emb|CAK46741.1| unnamed protein product [Aspergillus niger]
          Length = 497

 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 168/506 (33%), Positives = 250/506 (49%), Gaps = 51/506 (10%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  PA +FT  +PIGNGRLGA +WG   +E + LNE+++W+G   +  NP +  AL  VR
Sbjct: 28  YTTPANNFTSTLPIGNGRLGAAIWG-TATENVTLNENSIWSGPFINRVNPRSYDALWPVR 86

Query: 77  SLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           SL+  G   E   A++  + G P     Y  LG + L+F   H +     Y R LDL + 
Sbjct: 87  SLLAEGNMTEGNDATLANMVGIPDSPQSYSALGSLVLDF--GHDEAGISNYTRYLDLRSG 144

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A V+Y+   V + RE+ +S+PD V+  ++S SE G L  NV+  S L    YV  NN  
Sbjct: 145 MAVVEYTYRAVRYRREYLASHPDNVVAVRLSSSEPGGL--NVA--SSLVRDRYVVSNNAT 200

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
           +      G  +  +A +N+    IQF+A   + +SD R T +                 L
Sbjct: 201 LSHD---GGLLTLRAYSNNVSNPIQFTAEARV-VSDGRATSNGTS--------------L 242

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRV 308
           +V ++S    FI+   S +    E+  A     L +  +  +  +    + DY  L  RV
Sbjct: 243 VVRNASTIDIFIDTETSYRYSAQENWEAEIKSKLDTACSSGFVAVKKNAIADYSALAQRV 302

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR 366
            + L            S  +   +P+  R+ +++ D   DP LV L+F FGR+ LI+SSR
Sbjct: 303 DLNLG-----------SSGSAGNLPTDSRLVNYRIDPDSDPELVVLMFHFGRHSLIASSR 351

Query: 367 PGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
                A   NLQG+WN+D  P W     ++INLEMNYW +   NL++   P  D L  + 
Sbjct: 352 ATESPALPANLQGLWNQDFDPAWGGRFTIDINLEMNYWPAEVTNLADTFSPFIDLLDVVH 411

Query: 424 INGSKTAQVNYLAS--GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
             G   A+  Y  S  G+V+HH TD+W  ++       W +WPMGGAWL  +L EHY ++
Sbjct: 412 DRGLDVAESMYHCSNGGYVLHHNTDLWGDAAPVDNGTTWTMWPMGGAWLSANLIEHYRFS 471

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI 507
            D   L  R +PLL+  A F   +L 
Sbjct: 472 RDESILRNRIWPLLQSAARFYYCYLF 497


>gi|359404666|ref|ZP_09197493.1| hypothetical protein HMPREF0673_00698 [Prevotella stercorea DSM
           18206]
 gi|357560094|gb|EHJ41501.1| hypothetical protein HMPREF0673_00698 [Prevotella stercorea DSM
           18206]
          Length = 838

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 184/601 (30%), Positives = 277/601 (46%), Gaps = 74/601 (12%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVR 76
           + ++PIGNG +G  V G V +E +  NE TLW G P            N  +   + ++R
Sbjct: 75  SQSLPIGNGNIGGNVLGSVEAERITFNEKTLWRGGPNTARGAAYYWDVNKQSAHVVGEIR 134

Query: 77  SLVDSGQYAEATAASVKLFG----HPADV--------YQLLGDIELEFDDSHLKYAEETY 124
                G + +A   + K F     + AD         +   G+  +E   S +   +  Y
Sbjct: 135 EAFTKGDWQKAELLTRKNFNSVVPYEADAEEPFRFGSFTTAGEFYIETGLSSVGMTD--Y 192

Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
           RREL L++A A+V +    V++ RE+F S+P  V+  + + S+ G  +L F+ + + +  
Sbjct: 193 RRELSLDSALAKVSFCKDGVQYEREYFVSHPANVMAVRFAASQRGKQNLVFSYAPNPVST 252

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
                +G + +    R              D   ++++  + IK     G +S  E  KL
Sbjct: 253 GEMKADGTDALCWLARL-------------DNNSMEYA--VRIKAVAKGGAVSN-EGGKL 296

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK------DPTSESMSALQSIRNLSYSDLYTR 296
            V+ +D  V L+ A + +  P  +P  S        DP   +   L       Y+ L   
Sbjct: 297 TVKDADEVVFLITADTDYK-PNYDPDFSAPKAYVGVDPAQTTADWLAKAATKGYAYLLNE 355

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQ 355
           H  DY +LF+RV + ++ +  D           D +P   R++++ Q   D  L +L +Q
Sbjct: 356 HYADYSELFNRVRLNINNATADA----------DDLPVNRRLEAYRQGKPDYYLEQLYYQ 405

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRYLLISSSR     ANLQG+W+ ++   W    H NINL+MNYW + P  LSEC+ PL
Sbjct: 406 FGRYLLISSSRADNLPANLQGLWHNNVDGPWRIDYHNNINLQMNYWLACPTGLSECELPL 465

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHL 474
           F+F+  L   G  TA+  +   GW      +I+  +S    + + W   P  G WL THL
Sbjct: 466 FNFIRTLVKPGRVTAKSYFGTRGWTTSVSGNIFGFTSPLSSEDMSWNFSPFAGPWLATHL 525

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
           W +Y++T DR FL    Y +L+  A F  D+L    DG     PSTSPEH          
Sbjct: 526 WNYYDFTRDRKFLADN-YEILKESADFASDYLWHRADGVYTAAPSTSPEH---------G 575

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAEDGSIME 592
            V   +T   A+IREV    + A  VL K+  E    E  LK    L P KI   G +ME
Sbjct: 576 PVDEGATFAHAVIREVLLDAVEANRVLGKSAKERRQWEDALK---HLAPYKIGRYGQLME 632

Query: 593 W 593
           W
Sbjct: 633 W 633


>gi|380696427|ref|ZP_09861286.1| hypothetical protein BfaeM_21066 [Bacteroides faecis MAJ27]
          Length = 1014

 Score =  246 bits (627), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 157/494 (31%), Positives = 249/494 (50%), Gaps = 38/494 (7%)

Query: 114 DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI-SGSESGSLS 172
           D+ L+     Y R LD++ A   V Y  G + F RE+F S PD V+V ++ S +  G LS
Sbjct: 328 DASLELPYSDYARTLDIDNAIHTVTYKEGGITFKREYFMSYPDHVMVMRLTSDANEGKLS 387

Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
             +SL+SL  +       N I M G  P      K   +    G++++  L +K  +  G
Sbjct: 388 RIISLESLHTDKVIAADGNTITMTGY-PTPVSGDKRVGDAWKNGLRYAQQLVVK--NKGG 444

Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD------SKKDPTSESMSALQSIR 286
            IS ++  KLKVE +D  ++L+ A++++    +   D      S++DP  +  + L  + 
Sbjct: 445 KISVVDGAKLKVEDADEIIVLMSAATNY----VQCMDDSYCYFSEEDPLDKVRATLHKVA 500

Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED 346
           +  Y+ L   H  DY  L+ R+ + L    +     T      D++       +    ++
Sbjct: 501 DKKYTSLLAAHQKDYHSLYDRMQLNLGEQLEAPAATT------DSLLKGMDANTNSEQDN 554

Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
             L  L FQFGRYLLISSSR G+  ANLQG+W E L+  W++  H NIN++MNYW + P 
Sbjct: 555 QYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLANPWNADYHTNINVQMNYWPTQPT 614

Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADRGKVVW 460
           NLS C  P+ +++  L   G  TAQ  Y         GWV HH+ +IW  ++  + K   
Sbjct: 615 NLSPCHLPMVEYVKSLVPRGKYTAQQYYCKPDGGNVRGWVTHHENNIWGNTAPAK-KSTP 673

Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
             +P G  W+C  +WE+Y + +D+DFL+K    +L+    ++ +   +  DG L  NPS 
Sbjct: 674 HHFPAGAIWMCQDIWEYYQFNLDKDFLKKYYDTMLDAALFWVDNLWTDERDGTLVANPSH 733

Query: 521 SPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
           SPEH EF      L C     +   A+I E+F  +I A++ L + +D  + ++  ++ +L
Sbjct: 734 SPEHGEF-----SLGC-----STSQAMICEMFGMMIKASKELGREKDPEIAEIATAMSKL 783

Query: 580 RPTKIAEDGSIMEW 593
              KI   G  MEW
Sbjct: 784 SGPKIGLGGQFMEW 797



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/63 (49%), Positives = 45/63 (71%), Gaps = 2/63 (3%)

Query: 2  MNAESTSTTNP-LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
          M A S     P LK T+N PAK++ ++A+PIGNG +GAM++GGV  + ++ NE TLW+G 
Sbjct: 23 MTACSGQFHQPALKATYNKPAKNWESEALPIGNGYMGAMIFGGVYVDVIQTNEHTLWSGG 82

Query: 60 PGD 62
          PG+
Sbjct: 83 PGE 85


>gi|391873203|gb|EIT82265.1| hypothetical protein Ao3042_00536 [Aspergillus oryzae 3.042]
          Length = 580

 Score =  246 bits (627), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 174/592 (29%), Positives = 278/592 (46%), Gaps = 88/592 (14%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ P   F  ++P+GNGRLG  ++  +P+E +  NED++W+G   D  N +A      VR
Sbjct: 34  YDTPGTRFNASLPVGNGRLGGTLYC-LPTEIVTWNEDSVWSGTFQDRVNSNALDGFPKVR 92

Query: 77  SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           +L+ +G    A   ++  + G   D   YQ+L ++ ++            Y   L+  TA
Sbjct: 93  NLLVNGNITAAGELALSDMTGSSVDQREYQVLSNLYVDLGQRGDATNLVWYLDTLEGYTA 152

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
               +Y    V +T                                        NG   I
Sbjct: 153 ---CEYGFDGVSYT--------------------------------------VANGIASI 171

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
           +M+ R              +     F+A + + +  D G ++A  DK L V G+   V  
Sbjct: 172 VMKAR------------TGEADYSTFTAGVRVVV--DGGNVTANGDK-LYVTGATTVVFF 216

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
           L A SS+         +  D  +E    L +   L Y  L    + D++ L  RV++ L 
Sbjct: 217 LDAESSYR------YATDSDQETELNRKLDAATELGYEALRKEAITDHKDLAGRVTLDLG 270

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSSRPGTQV 371
            S  D  +          +P  ER+ ++++  D D     L+F +GR+LLI+SSR   + 
Sbjct: 271 SSTDDAAS----------LPPNERMTNYRSSPDHDVQFATLVFNYGRHLLIASSRRTRER 320

Query: 372 A---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           +    LQGIWN+D SP+W +   VNINLEMNYW +   NL+E   PL+D L  +   G  
Sbjct: 321 SLSPGLQGIWNQDYSPSWGAKYTVNINLEMNYWPAETTNLNELTSPLWDLLALIQERGGD 380

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
            A+  +   G+V+HH TD+W  S        +++WPMGGAWL  H+ EHY +T D+ FL+
Sbjct: 381 VAEKMHGCPGFVLHHNTDLWGDSVPVHNGTKYSIWPMGGAWLALHMMEHYRFTGDKTFLK 440

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMD 543
           ++A P+ +    F   +L +  DGYL T PS SPE+ F  P      GK   ++ S T+D
Sbjct: 441 EQACPIFKSAFEFFECYLFD-VDGYLTTGPSCSPENAFQIPSDMTVAGKEEALTMSPTLD 499

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +++ E+ +A+    ++LE + D L   V   L ++RP +I  DG I+EW++
Sbjct: 500 NSMLFELLTALNETHQILEIDND-LSGSVQTYLGKIRPPRIGSDGQILEWIE 550


>gi|298387491|ref|ZP_06997043.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
 gi|298259698|gb|EFI02570.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
          Length = 1036

 Score =  245 bits (626), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 163/497 (32%), Positives = 261/497 (52%), Gaps = 35/497 (7%)

Query: 109 ELEFDDS-HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI-SGS 166
           EL  D S  L Y++  Y R LD++ A   V Y    + F RE+F S PD V+V ++ S S
Sbjct: 341 ELSIDASTELPYSD--YTRTLDIDNAIHTVMYKENGITFKREYFMSYPDNVMVMRLTSDS 398

Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
           + G LS  +SL+SL  + +    ++ I M G  P      K   +    G++++  L +K
Sbjct: 399 KKGKLSRIISLESLHTDKTITADSHTITMTG-YPTPVSGDKRIGDAWKNGLKYAQQLVVK 457

Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKKDPTSESMSALQS 284
             +  G +S ++  KLKVE +D  ++L+ A++++     +  +  S++DP  +  + L  
Sbjct: 458 --NKGGKVSVVDGTKLKVEDADEIIVLMSAATNYVQCMDDSYNYFSQEDPLEKVQATLHK 515

Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE-ENIDTVPSAERVKSFQT 343
           + +  Y+ L   H  DY  L+ R+ + L   P+  V  T S  + +D   ++E+      
Sbjct: 516 VADKKYTALLATHQKDYHSLYDRMRLNLGNLPEAPVAPTDSLLKGMDENTNSEQ------ 569

Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
            E+  L  L FQFGRYLLISSSR G+  ANLQG+W E LS  W++  H NIN++MNYW +
Sbjct: 570 -ENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNADYHTNINIQMNYWPT 628

Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADRGK 457
              NLS C  P+ +++  L   G  TAQ  Y         GWV HH+ +IW  ++  + K
Sbjct: 629 QSTNLSPCHLPMVEYVRSLVPRGKYTAQQYYCKPDGGNVRGWVTHHENNIWGNTAPAK-K 687

Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
                +P G  W+C  +WE+Y + +D+DFL+K    +L+    ++ +   +  DG L  N
Sbjct: 688 STPHHFPAGAIWMCQDIWEYYQFNLDKDFLKKYYDTMLDAVLFWVDNLWTDERDGTLVAN 747

Query: 518 PSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
           PS SPEH EF      L C     +   A+I E+F  +I A++ L +++D  + ++  ++
Sbjct: 748 PSHSPEHGEF-----SLGC-----STSQAMICEMFDMMIKASKELGRDKDPEIIEIATAM 797

Query: 577 PRLRPTKIAEDGSIMEW 593
            +L   KI   G  MEW
Sbjct: 798 SKLSGPKIGLGGQFMEW 814



 Score = 58.9 bits (141), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 42/63 (66%), Gaps = 1/63 (1%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
           MM          LK T+N PAK++ ++A+PIGNG +GAM++G V  + ++ NE TLW+G 
Sbjct: 40  MMACSEQPYQPTLKATYNKPAKNWESEALPIGNGYMGAMIFGDVYVDVIQTNEHTLWSGG 99

Query: 60  PGD 62
           PG+
Sbjct: 100 PGE 102


>gi|421276774|ref|ZP_15727594.1| alpha-L-fucosidase, putative, afc95A [Streptococcus mitis SPAR10]
 gi|395876055|gb|EJG87131.1| alpha-L-fucosidase, putative, afc95A [Streptococcus mitis SPAR10]
          Length = 922

 Score =  244 bits (623), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 188/631 (29%), Positives = 306/631 (48%), Gaps = 101/631 (16%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GD 62
           P   +++G  K    A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+
Sbjct: 125 PTAPSYDGWEKQ---ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGN 181

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLK 118
           Y   D  K LS++R  ++ G   +A   + +    P +     Y   GDI + F++    
Sbjct: 182 YQ--DRYKVLSEIRKALEEGNRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKG 239

Query: 119 YAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---N 174
               T Y R LD++ A +   Y+     F RE FSS PD V VT +S     +L F   N
Sbjct: 240 LENVTDYHRGLDISEAISTTSYTQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWN 299

Query: 175 VSLDSLLDNHSY------------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAI 222
              + L+ N  Y               +N I+++G         K N      G++F++ 
Sbjct: 300 SLTEDLIANGDYSWEYSNYKQGAVTTDSNGILLKGTV-------KDN------GLKFASY 346

Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM-- 279
           L IK     G ++A +D  L V G+ +A LLL A ++F     NP ++ +KD   E    
Sbjct: 347 LGIKTD---GQVTA-QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDLEKTVK 399

Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
           S +++ +   Y  L   H+ DYQ LF+RV + L  S  +  T              E + 
Sbjct: 400 SIVEASKAKDYETLKNNHIKDYQSLFNRVQLNLGGSRSNQTT-------------KEALH 446

Query: 340 SFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLE 397
           ++  ++   L EL FQ+GRYLLISSSR  T    ANLQG+WN   +PTW+S  H+N+NL+
Sbjct: 447 TYNPEKGQKLEELFFQYGRYLLISSSRDRTDALPANLQGVWNAVDNPTWNSDYHLNVNLQ 506

Query: 398 MNYWQSLPCNLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTD 446
           MNYW +   NL+E  +P+ +++  +   G           SK  Q N    GW++H +  
Sbjct: 507 MNYWPAYMNNLAETAKPMINYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQAT 562

Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
            +  ++       W   P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L
Sbjct: 563 PFGWTTPGW-NYYWGWSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFL 621

Query: 507 I--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKN 564
              +  D ++ ++PS SPEH           ++  +T D +++ ++F   + AA  L  +
Sbjct: 622 HYDKASDRWV-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLNVD 671

Query: 565 EDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +D LV +V     +L+P  I +DG I EW +
Sbjct: 672 QD-LVTEVKAKFDKLKPLHINQDGRIKEWYE 701


>gi|333029856|ref|ZP_08457917.1| Alpha-L-fucosidase [Bacteroides coprosuis DSM 18011]
 gi|332740453|gb|EGJ70935.1| Alpha-L-fucosidase [Bacteroides coprosuis DSM 18011]
          Length = 816

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 190/615 (30%), Positives = 287/615 (46%), Gaps = 106/615 (17%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD------YTNPDA------------ 68
           ++PIGNG  GA + G V  + + LNE TLW G P        Y N +             
Sbjct: 62  SLPIGNGSFGANIMGSVSVDRVTLNEKTLWRGGPNTANGASYYWNVNKLSAKYLPIIRQA 121

Query: 69  --PKALSDVRSLVDS---GQYA-EATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE- 121
              K L  VR+L ++   G  A E T  S   FG     +  LG++ LE   + L+  E 
Sbjct: 122 FMDKDLDKVRTLTENNFNGLAAYEETDESPFRFGS----FTTLGELYLE---TGLEEKEI 174

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGS--------------- 166
             Y+R L L++A   V +   N  ++R +F+S PD VIV + +                 
Sbjct: 175 SDYKRALSLDSAVVNVSFKEKNTMYSRSYFASYPDSVIVIRYTSEQKAKQNIKLFYAPNP 234

Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
           ES  +      D +L     +N N Q  +E +C    IP      +   GI         
Sbjct: 235 ESRGVCIKKGSDRILFKRELLNNNQQFALEIKC----IPIGGYYENIENGI--------S 282

Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP--SDSKK----DPTSESMS 280
           I D                 +D  V +L A++ +   F NP  SD K      P  ++  
Sbjct: 283 ICD-----------------ADEVVFVLSAATDYQMNF-NPDFSDPKTYVGLPPEIKTSQ 324

Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
            L  +    Y+ +   HL DYQ LF+RV I L+           S  +  ++P+  R+  
Sbjct: 325 RLLRLNGQDYNQMLNEHLQDYQSLFNRVHIDLN-----------SIHSFSSLPTDLRLAQ 373

Query: 341 FQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
           ++  + D +  EL +Q+GRYLLI+SSR G+  ANLQG+W+ ++   W    H NIN++MN
Sbjct: 374 YKEGKLDKAFEELYYQYGRYLLIASSRIGSMPANLQGLWHNNIDGPWRVDYHNNINIQMN 433

Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-V 458
           YW +   NLSEC  PL DF+  L   G  TAQ  Y A GW     ++I+  ++    K +
Sbjct: 434 YWPASTANLSECIPPLIDFIKTLVKPGKVTAQSYYNARGWTASISSNIFGFTAPLSSKDM 493

Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
            W   PM G WL TH+W++++YT D DFL++  Y L++  A+F +D+L +  +G     P
Sbjct: 494 SWNFNPMAGPWLATHVWDYFDYTQDLDFLKETGYELIKESANFAVDYLWKMPNGVYSAAP 553

Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
           STSPEH           +   +T   A+IR+V S  I A+++L +++D   E +   L  
Sbjct: 554 STSPEH---------GPIDQGATFVHAVIRQVLSNAIEASKLLREDDDNRQEWI-AVLNN 603

Query: 579 LRPTKIAEDGSIMEW 593
           L P ++   G +MEW
Sbjct: 604 LAPYQVGRYGQLMEW 618


>gi|169624315|ref|XP_001805563.1| hypothetical protein SNOG_15415 [Phaeosphaeria nodorum SN15]
 gi|160705148|gb|EAT77080.2| hypothetical protein SNOG_15415 [Phaeosphaeria nodorum SN15]
          Length = 792

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 180/606 (29%), Positives = 293/606 (48%), Gaps = 78/606 (12%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
           + FN P    + ++PIGNGR+ A  +G    E + +NE+++W+G   D  N  +  ALS 
Sbjct: 26  LYFNTPGSSLSSSLPIGNGRVAAAAYG-TTLERITINENSVWSGQWQDRGNSQSLNALSS 84

Query: 75  VRSLVDSGQYAEATAASV-KLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           +R  +  G  + A   ++  + G+P    Q    +++  D  H      +Y R LD    
Sbjct: 85  IRQKLMDGDMSSAGQQTLDAMAGNPQSPKQYHPTVDMTIDFGH-SGTLGSYTRILDTRQG 143

Query: 134 TARVKYSVGNVEFT-----------REHFSSNPDQVIVTKISGSESGSLSFNVSL---DS 179
           TA   Y +G V +T           RE+ +S P  V+  ++  +++G L+ +++L    +
Sbjct: 144 TAMTTYILGGVNYTLMGAAHLTSFRREYVASYPAGVLAFRMMANQAGKLNVDIALARSQN 203

Query: 180 LLDNHSYVNGN-NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
           +  N +  +GN N I ++G                  GI F+A  E ++  D G+IS + 
Sbjct: 204 VASNAASSSGNINSITLKGNG----------------GIPFTA--EARVVSDTGSIS-VN 244

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           +K + V+G+    +   A +S+         S      E  + L +     Y+ + T  +
Sbjct: 245 EKTMSVKGATIVDIFFDAETSYR------YGSASAWELELKNKLDNAVKAGYNAVKTAAV 298

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQF 356
            D + +  RV+I L            S  +  T P   R+ +++ +   DP LV L F +
Sbjct: 299 KDAEGILSRVNINLG-----------SSGSAGTQPIPSRLSNYKKNAGADPELVTLYFNY 347

Query: 357 GRYLLISSSRPG---TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           GR+LL++SSR     +  ANLQGIWN++  P W S   VNIN EMNYW +L  NL E  +
Sbjct: 348 GRHLLLASSRDTGDRSLPANLQGIWNDNYDPPWQSKYTVNINTEMNYWHALTTNLDETHK 407

Query: 414 PLFDFLTYLSINGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
           PLFD +      G   A+  Y  + G+V+HH TD+W  ++           P+      T
Sbjct: 408 PLFDLVDMTRAQGRAMAKKMYGCNDGFVVHHNTDLWGDAA-----------PVDKGTPYT 456

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-- 530
           HL EHY +T D++FL+ RA+P+L+  A+F   +L   ++G   T PS SPE+ F+ P   
Sbjct: 457 HLMEHYRFTQDKNFLQNRAWPVLKDAANFYYCYLFM-YNGSYVTGPSLSPENTFVVPSNM 515

Query: 531 ---GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
              GK   V  + TMD  ++ E+F+ +ISA + L    D  V K    L +++  KI   
Sbjct: 516 RTAGKTEGVDIAPTMDNELLWELFNNVISAGKALGIT-DITVSKAKDYLSKIKEPKIGSK 574

Query: 588 GSIMEW 593
           G ++EW
Sbjct: 575 GQLLEW 580


>gi|418966542|ref|ZP_13518273.1| gram positive anchor [Streptococcus mitis SK616]
 gi|383347120|gb|EID25122.1| gram positive anchor [Streptococcus mitis SK616]
          Length = 1697

 Score =  243 bits (621), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 185/611 (30%), Positives = 300/611 (49%), Gaps = 88/611 (14%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSQDYNGGNYK--DRYKVLAEIRK 198

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD++ 
Sbjct: 199 ALEDGDRQKAKRLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLENVTDYHRGLDISE 258

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV--SLDSLL--------D 182
           A     Y+     F RE FSS PD V VT +S     +L F +  SL   L        D
Sbjct: 259 AITTTSYTQDGTSFKRETFSSFPDDVTVTHLSKKGDKTLDFTLWNSLTEDLIANGQYSRD 318

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
           N +Y  G   +   G      I  K    D+  G++F++ L IK     G ++A +D  L
Sbjct: 319 NSNYKKGTISVDSNG------ILLKGTVKDN--GLKFASYLGIKTD---GQVTA-QDGYL 366

Query: 243 KVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLD 299
            V+G+ +A LLL A ++F     NP ++ +KD   E    S +++ +   Y  L   H+ 
Sbjct: 367 TVKGASYATLLLSAKTNFAQ---NPETNYRKDIDVEKTVKSIVEAAKAKDYETLKNDHIK 423

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DYQ LF+RV + L  S  +  T              E ++++   +   L EL FQ+GRY
Sbjct: 424 DYQSLFNRVQLNLGGSKSNQTT-------------KEALQTYNPTKGQKLEELFFQYGRY 470

Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           LLISSSR  T    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL+E  +P+ +
Sbjct: 471 LLISSSRNRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMSNLAETAKPMIN 530

Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
           ++  +   G           SK  Q N    GW++H +   +  ++       W   P  
Sbjct: 531 YIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 585

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEH 524
            AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS SPEH
Sbjct: 586 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPSYSPEH 644

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
                      ++  +T D +++ ++F   + AA  L+ ++D LV +V     +L+P  I
Sbjct: 645 ---------GNITIGNTFDQSLVWQLFHDYMEAANHLKIDQD-LVTEVKAKFDKLKPLHI 694

Query: 585 AEDGSIMEWVQ 595
            +DG I EW +
Sbjct: 695 NQDGRIKEWYE 705


>gi|322377414|ref|ZP_08051905.1| fibronectin type III domain protein [Streptococcus sp. M334]
 gi|321281614|gb|EFX58623.1| fibronectin type III domain protein [Streptococcus sp. M334]
          Length = 803

 Score =  243 bits (619), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 182/614 (29%), Positives = 288/614 (46%), Gaps = 65/614 (10%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P   T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEF-DDSHLK 118
              D    L+++R  ++   Y  A   + +    P      +Y   GDI +EF +     
Sbjct: 72  NLQDQYVFLAEIRQDLEKRDYNRAKELAEQHLVGPKTSQYGIYLSFGDIHIEFSNQGKTL 131

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y    Y+R+L+++ A A   Y      F RE F+S PD ++V + +   S +L F + L 
Sbjct: 132 YQVTDYQRQLNISKALATTSYVYKGTRFEREVFASFPDDLLVQRFTKEGSETLDFTMDLS 191

Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
              D  S      + +    C        I  K    D+   +QF++ L  K     G I
Sbjct: 192 LTRDLASDGKYEQEKLDYKECQLDISTSHILMKGRVKDND--LQFASCLAWKTD---GDI 246

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
               DK +++ G+ +A L LVA + F     +    K D   +    +++ +   Y+ L 
Sbjct: 247 RVWSDK-VQISGASYANLFLVAKTDFAQNPASNYRKKIDLEQQVKDLVETAKEEGYTQLK 305

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
           +RH++DYQ LF RV + L               N D   + + +K++++ E   L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDLG-------------ANGDISTTDDLLKNYKSQEGQDLEELFF 352

Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW S   NL E  
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPSYVTNLLETA 412

Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
            P+ +++  L + G + A   Y          +GW++H +     W     D     W  
Sbjct: 413 FPVINYIDDLRVYG-RLAAAKYAGIISREGEENGWLVHTQATPFGWTAPGWD---YYWGW 468

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
            P   AW+   ++E Y++  D+D+L ++ YP+L     F  D+L +        ++PS S
Sbjct: 469 SPASNAWMMQTVYEVYSFYRDQDYLREKIYPMLSETVRFWNDFLHKDQQAQRWVSSPSYS 528

Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
           PEH           +S  +T D ++I ++F   I AA+ L  + D L E V +    L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDADLLTE-VKEKFDLLNP 578

Query: 582 TKIAEDGSIMEWVQ 595
            +I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592


>gi|225019389|ref|ZP_03708581.1| hypothetical protein CLOSTMETH_03342 [Clostridium methylpentosum
           DSM 5476]
 gi|224947845|gb|EEG29054.1| hypothetical protein CLOSTMETH_03342 [Clostridium methylpentosum
           DSM 5476]
          Length = 1708

 Score =  242 bits (617), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 160/511 (31%), Positives = 259/511 (50%), Gaps = 49/511 (9%)

Query: 96  GHPADVYQLLGDIELEFD-DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSN 154
           G+  D  QL    EL FD  S    +   Y+R LDL+ ATA+V+Y++ +V FTRE+F SN
Sbjct: 320 GNTTDGVQL---SELSFDLKSSTGPSYTNYKRTLDLDNATAKVEYTLDDVNFTREYFVSN 376

Query: 155 PDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDP 214
           PD  +  +++  + G++S  +S+ +     +     + I M G+   +R           
Sbjct: 377 PDNFMAIRLTADQPGAISKAISITTPQSKKTITAEGDTITMTGQPADQR----------E 426

Query: 215 KGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKK 272
            G++F+   +IK+    G+++A  +  + VEG+D  +LL+ A +++     +  D  + +
Sbjct: 427 DGLKFAQ--QIKVVPQGGSMTA-ANGTITVEGADSVLLLMTAGTNYQQCMDDTFDYFTDE 483

Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
           DP       + ++    Y DL   H+ DYQ LF+ + + L  +P         E+  D +
Sbjct: 484 DPLDAVSQRIATVAAKDYDDLLAAHVADYQSLFNNMKLNLCDAP-------MPEKPTDEL 536

Query: 333 PSAERVKSFQTD---EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
            +A   ++   +   ED  L  L +QFGRYLLI+SSR G+  ANLQGIW + L+P WD+ 
Sbjct: 537 LAAYGGRTSNPNTALEDRYLETLYYQFGRYLLIASSRDGSLPANLQGIWADGLNPPWDAD 596

Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS------GWVIHH 443
            H NIN++MNYW +   NL+EC  P+ D++  L   G  TAQ  +         GW  +H
Sbjct: 597 YHTNINVQMNYWLAESTNLTECHLPIVDYINSLVPRGEITAQRYHCTEDGGDVRGWTTYH 656

Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
           + +IW  ++       +  +P GGAW+   +WE Y +  D++FL +  +  L G A F +
Sbjct: 657 ENNIWGNTAPATSSAFY--FPAGGAWMTQDIWEIYAFNQDKEFLAEN-FDTLLGAALFWV 713

Query: 504 DWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
           D L+ +  DG L ++PS SPEH            S  +  D  II + F   I AAE L 
Sbjct: 714 DNLVTDTRDGTLVSSPSYSPEH---------GPYSLGAACDQGIIWDTFQNTIEAAEALG 764

Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
            +   + E + ++  +L   +I   G  MEW
Sbjct: 765 IDTPEIAE-IREAQSKLAGPQIGLAGQFMEW 794



 Score = 55.5 bits (132), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 46/76 (60%), Gaps = 3/76 (3%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           L+  +  PA  +  +A P+GNG LGAMV+GGV S+ +++NE +LW+G PG   N D    
Sbjct: 42  LQAFYTKPATDWEKEATPLGNGFLGAMVFGGVESDRIQINEHSLWSGGPGANENYDG--G 99

Query: 72  LSDVRSLVDSGQYAEA 87
           +SD  + V+     EA
Sbjct: 100 MSDTPAEVNRQNLMEA 115


>gi|419765946|ref|ZP_14292168.1| Gram-positive signal peptide protein, YSIRK family / gram positive
           anchor multi-domain protein [Streptococcus mitis SK579]
 gi|383354600|gb|EID32158.1| Gram-positive signal peptide protein, YSIRK family / gram positive
           anchor multi-domain protein [Streptococcus mitis SK579]
          Length = 1662

 Score =  241 bits (616), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 183/611 (29%), Positives = 297/611 (48%), Gaps = 88/611 (14%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSQDYNGGNYK--DRYKVLAEIRK 198

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD++ 
Sbjct: 199 ALEDGDRQKAKRLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLESVTDYHRGLDISE 258

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV--SLDSLL--------D 182
           A     Y+     F RE FSS PD V VT +S     +L F +  SL   L        D
Sbjct: 259 AITTTSYTQDGTSFKRETFSSFPDDVTVTHLSKKGDKNLDFTLWNSLTEDLIANGQYSRD 318

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
           N +Y  G   +   G      I  K    D+  G++F++ L IK     G ++A +D  L
Sbjct: 319 NSNYKKGTISVDSNG------ILLKGTVKDN--GLKFASYLGIKTD---GQVTA-QDGYL 366

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK---DPTSESMSALQSIRNLSYSDLYTRHLD 299
            V+G+ +A LLL A ++F     NP  + +   D      S +++ +   Y  L   H+ 
Sbjct: 367 TVKGASYATLLLSAKTNFAQ---NPETNYRKDIDVGKTVKSIVEAAKAKDYETLKNDHIK 423

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DYQ LF+RV + L  S  +  T              E ++++   +   L EL FQ+GRY
Sbjct: 424 DYQSLFNRVQLNLGGSKSNQTT-------------KEALQTYNPTKGQKLEELFFQYGRY 470

Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           LLISSSR  T    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL+E  +P+ +
Sbjct: 471 LLISSSRNRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMSNLAETAKPMIN 530

Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
           ++  +   G           SK  Q N    GW++H +   +  ++       W   P  
Sbjct: 531 YIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 585

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEH 524
            AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS SPEH
Sbjct: 586 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPSYSPEH 644

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
                      ++  +T D +++ ++F   + AA  L+ ++D LV +V     +L+P  I
Sbjct: 645 ---------GNITIGNTFDQSLVWQLFHDYMEAANHLKIDQD-LVTEVKAKFDKLKPLHI 694

Query: 585 AEDGSIMEWVQ 595
            +DG I EW +
Sbjct: 695 NQDGRIKEWYE 705


>gi|417935794|ref|ZP_12579111.1| gram positive anchor [Streptococcus infantis X]
 gi|343402703|gb|EGV15208.1| gram positive anchor [Streptococcus infantis X]
          Length = 1764

 Score =  241 bits (615), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 183/617 (29%), Positives = 301/617 (48%), Gaps = 98/617 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 153 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYQ--DRYKVLAEIRK 210

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD++ 
Sbjct: 211 ALEEGNRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKGLESVTDYHRGLDISE 270

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSY--- 186
           A +   Y+     F RE FSS PD V VT +S     +L F   N   + L+ N  Y   
Sbjct: 271 AISTTSYTQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWNSLTEDLIANGDYSWE 330

Query: 187 ---------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
                       +N I+++G         K N      G++F++ L IK     G ++A 
Sbjct: 331 YSNYKQGAVTTDSNGILLKGTV-------KDN------GLKFASYLGIKTD---GQVTA- 373

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D  L V G+ +A LLL A ++F     NP ++ +KD   E+   S +++ +   Y  L 
Sbjct: 374 QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDLENTVKSIVEAAKAKDYETLK 430

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  S  +  T              E ++++   +   L EL F
Sbjct: 431 NDHIKDYQSLFNRVQLNLGGSKSNQTT-------------KEALQTYNPTKGQKLEELFF 477

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL+E  
Sbjct: 478 QYGRYLLISSSRNRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMSNLAETA 537

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 538 KPMINYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWG 592

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 593 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 651

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   + AA  L+ ++D LV +V     +L
Sbjct: 652 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLKIDQD-LVTEVKAKFNKL 701

Query: 580 RPTKIAEDGSIMEWVQR 596
           +P  I +DG I EW + 
Sbjct: 702 KPLHINQDGRIKEWYEE 718


>gi|307707449|ref|ZP_07643931.1| alpha-fucosidase [Streptococcus mitis NCTC 12261]
 gi|307616401|gb|EFN95592.1| alpha-fucosidase [Streptococcus mitis NCTC 12261]
          Length = 803

 Score =  241 bits (615), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 184/624 (29%), Positives = 291/624 (46%), Gaps = 87/624 (13%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN-- 65
           P   T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY    
Sbjct: 16  PASTTYKGWEE---EALPIGNGSLGAKVFGIIGAERIQFNEKSLWSGGPLPDSSDYQGGN 72

Query: 66  -PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQL---LGDIELEFDDSHLKYA 120
             D    L+++R  ++   Y  A   A   L G     Y      GDI +EF       +
Sbjct: 73  LQDQYGFLAEIRQALEKRDYNRAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLS 132

Query: 121 EET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD- 178
           + T Y+R+L+++ A A   Y     +F RE F+S PD ++V + +   + +L F + L  
Sbjct: 133 QVTDYQRQLNISKALATTSYVYKGTKFEREAFASFPDNLLVQRFTKEGAETLDFTIELSL 192

Query: 179 --SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
              L  +  Y               ++ I+M+GR            ND    +QF++ L 
Sbjct: 193 SRDLASDGKYEEEKSDYKECKLDITDSHILMKGRVKD---------ND----LQFASCLA 239

Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
            +     G I    DK  ++ G+ +A L L A + F     +    K D   +    ++ 
Sbjct: 240 WETD---GDIRVWSDKA-QISGASYANLFLAAKTDFAQNPASNYRKKIDLEKQVKDLVEI 295

Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD 344
            +   Y+ L +RH+ DYQ LF RV + L               ++DT  +   +K+++  
Sbjct: 296 AKEKGYAQLKSRHIQDYQALFQRVQLDLG-------------ADVDTSTTDNLLKNYKPQ 342

Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
           E  +L EL FQ+GRYLLISSSR  +    ANLQG+WN   +P W+S  H+NINL+MNYW 
Sbjct: 343 EGHALEELFFQYGRYLLISSSRDCSDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYWP 402

Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSS 452
           +   NL E   P+ +++  L + G + A   Y          +GW++H +     W    
Sbjct: 403 AYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPG 461

Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
            D     W   P   AW+   ++E Y++  D+D+L ++ YP+L     F  D+L E    
Sbjct: 462 WD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNDFLHEDQQA 518

Query: 513 Y-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEK 571
               ++PS SPEH           +S  +T D ++I ++F   I AA+ LE + D L E 
Sbjct: 519 QRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELELDADLLTE- 568

Query: 572 VLKSLPRLRPTKIAEDGSIMEWVQ 595
           V +    L P +I + G I EW +
Sbjct: 569 VKEKFDLLNPLQITQSGRIREWYE 592


>gi|354604085|ref|ZP_09022078.1| hypothetical protein HMPREF9450_00993 [Alistipes indistinctus YIT
           12060]
 gi|353348517|gb|EHB92789.1| hypothetical protein HMPREF9450_00993 [Alistipes indistinctus YIT
           12060]
          Length = 777

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 174/589 (29%), Positives = 274/589 (46%), Gaps = 101/589 (17%)

Query: 17  FNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDV 75
           +  PA ++ T+A+P+GNGR+GAM++GG+P E ++ N+ TLWTG                 
Sbjct: 42  YTRPATNWMTEALPVGNGRIGAMIFGGLPVERIQFNDKTLWTG----------------- 84

Query: 76  RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFD--DSHLKYAEETYRRELDLNTA 133
            S  + G                   YQ  GDI ++F     +       YRRELDL+ A
Sbjct: 85  -STTERG------------------AYQNFGDIFIDFGAAGGNNPRGPVDYRRELDLDDA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A+V Y    V +TRE+ +S PD VI  + + ++ G + F V +D            N I
Sbjct: 126 LAKVVYKADGVTYTREYLASYPDDVIAMRFTANKKGKIGFTVRMDDAHTGGQRTVTGNSI 185

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
            + G+                     S   ++ + ++ GT+ A  D  L + G+D A LL
Sbjct: 186 TISGKL-----------------TLLSYKAQLTVLNEGGTLQA-GDSTLTLTGADAATLL 227

Query: 254 LVASSSFDGP---FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
           L A + +D     ++  SD K   ++ +  A        Y+ L   HLDDY  L++R+S+
Sbjct: 228 LSAGTDYDPQSPDYLTRSDWKGKVSTVAARAGSK----GYAALRKAHLDDYHALYNRLSL 283

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
            +  +  ++ TD               V+  + + DP+   L FQ+GRYL I+SSRPG  
Sbjct: 284 NVGNTTPELPTDELF------------VRYSKGEYDPAADVLYFQYGRYLTIASSRPGLD 331

Query: 371 V-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS-INGSK 428
           + +NLQG+WN+  +P W S  H NIN++MNYW + P NL+EC EP   ++   S ++ S 
Sbjct: 332 LPSNLQGLWNDSNTPPWQSDIHSNINVQMNYWPAEPTNLAECHEPFTRYIYNESQLHDSW 391

Query: 429 TAQVNYL-ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
                 L   GW +  + +I+  S        W       AW C H+W+ Y +   RD+L
Sbjct: 392 KKMAGELDCGGWALKTQNNIFGYSD-------WNWNRPANAWYCMHVWDKYLFDPQRDYL 444

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA-- 545
           E+ AYP+++    F LD LI   DG L      SPEH             + S +  A  
Sbjct: 445 EQEAYPVMKSACRFWLDRLIVDDDGKLVAPNEWSPEHG-----------PWESGIPYAQQ 493

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
           +I ++F+  + A  +L  ++ A V+++   L RL     +   G + EW
Sbjct: 494 LIWDLFNNTVRAGRILGTDQ-AFVDQLESKLERLDNGLTVGSWGQLREW 541


>gi|385260919|ref|ZP_10039057.1| gram positive anchor [Streptococcus sp. SK140]
 gi|385190192|gb|EIF37641.1| gram positive anchor [Streptococcus sp. SK140]
          Length = 1717

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 181/606 (29%), Positives = 298/606 (49%), Gaps = 78/606 (12%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSQDYNGGNYK--DRYKVLAEIRK 198

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD++ 
Sbjct: 199 ALEDGDRQKAKQLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLENVTDYHRGLDISE 258

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSYVNG 189
           A     Y+     F RE FSS PD V VT ++     +L F   N   + L+ N  Y + 
Sbjct: 259 AITTTSYTQDGTSFKRETFSSYPDDVTVTHLTKKGDKTLDFTLWNSLTEDLIANGDY-SW 317

Query: 190 NNQIIMEGRCP--GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
            N    +G        I  K    D+  G++F++ L IK     G ++A +D  L V G+
Sbjct: 318 ENSKYKQGTVSVDSNGILLKGTVKDN--GLKFASYLGIKTD---GQVTA-QDGYLTVTGA 371

Query: 248 DWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLDDYQKL 304
            +A LLL A ++F     NP ++ +KD   E    S +++ +   Y  L   H+ DYQ L
Sbjct: 372 SYATLLLSAKTNF---AQNPKTNYRKDIDVEKTVKSIVEAAKAKDYETLKNDHIKDYQSL 428

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
           F+RV + L  S  +  T              E ++++   +   L EL FQ+GRYLLISS
Sbjct: 429 FNRVQLNLGGSKSNQTT-------------KEALQTYNPTKGQKLEELFFQYGRYLLISS 475

Query: 365 SRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           SR  T    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL+E  +P+ +++  +
Sbjct: 476 SRNRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMSNLAETAKPMINYIDDM 535

Query: 423 SING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLC 471
              G           SK  Q N    GW++H +   +  ++       W   P   AW+ 
Sbjct: 536 RYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAANAWMM 590

Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAP 529
            +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS SPEH     
Sbjct: 591 QNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPSYSPEH----- 644

Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
                 ++  +T D +++ ++F   + AA  L+ +++ LV +V     +L+P  I +DG 
Sbjct: 645 ----GTITIGNTFDQSLVWQLFHDYMEAANHLKVDQN-LVTEVKAKFDKLKPLHINQDGR 699

Query: 590 IMEWVQ 595
           I EW +
Sbjct: 700 IKEWYE 705


>gi|419527991|ref|ZP_14067534.1| hypothetical protein SPAR51_0989 [Streptococcus pneumoniae GA17719]
 gi|379566144|gb|EHZ31135.1| hypothetical protein SPAR51_0989 [Streptococcus pneumoniae GA17719]
          Length = 803

 Score =  239 bits (610), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 180/614 (29%), Positives = 291/614 (47%), Gaps = 65/614 (10%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTNP 66
            P   T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 67  DAPKA---LSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
           +       L+++R  ++   Y  A   + +    P       Y   GDI +EF       
Sbjct: 72  NLQNQHNFLAEIRQALEKRDYNRAKELAEQHLVGPKTSQYGTYLSFGDIFIEFSQQGTIL 131

Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           ++ T Y+R+L+++ A A   Y+     F RE F+S PD ++V + +   S +L F + L 
Sbjct: 132 SQVTDYQRQLNVSKALATTSYAYKGTRFEREAFASFPDDLLVQRFTKEGSETLDFTIELS 191

Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
              D  S      +      C        I  K    D+   ++F++ L  +     G I
Sbjct: 192 LTRDLASDGKYEQEKTDYKECQLDITASHILMKGWVKDND--LRFASYLAWETD---GDI 246

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
               DK +++ G+ +A L L A + F     +    K D   +  + +++ +   Y+ L 
Sbjct: 247 RVWSDK-VQISGASYANLFLAAKTDFAQNPASNYRKKIDLEQQVKNLVETAKEKGYARLK 305

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
           +RH++DYQ LF RV + L               ++DT  + + +K+++  E  +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDLG-------------SDVDTSTTDDLLKNYKPQEGQALEELFF 352

Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  P    ANLQGIWN   +P W+S  H+NINL+MNYW +   NL E  
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGIWNGVDNPPWNSDYHLNINLQMNYWPAYVTNLLETA 412

Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
            P+ +++  L + G + A   Y+         +GW++H +     W     D     W  
Sbjct: 413 FPVINYVDDLRVYG-RLAATRYVGIVSREGEENGWLVHTQATPFGWTAPGWD---YYWGW 468

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
            P   AW+   ++E Y +  D+D+L ++ YP+L     F   +L E +      ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYLFYRDQDYLREKIYPILRETVRFWNAFLHEDNQAQRWVSSPSYS 528

Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
           PEH           +S  +T D ++I ++F   I AA+ LE + D L E V +    L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELELDADLLTE-VKEKFDLLNP 578

Query: 582 TKIAEDGSIMEWVQ 595
            +I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592


>gi|295085851|emb|CBK67374.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
          Length = 729

 Score =  239 bits (609), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 159/503 (31%), Positives = 250/503 (49%), Gaps = 50/503 (9%)

Query: 101 VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIV 160
            +  +G++ +E   S +  +   YRR L L++A A V++    + + R++F S PD V+V
Sbjct: 71  AFTTMGELYVETGLSEINMS--NYRRILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMV 128

Query: 161 TKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ 218
            K +  + G  +  +S   ++   +H   +GN+ ++  G               +  G++
Sbjct: 129 MKFTADKGGKQNLVLSYCPNNEAKSHLEADGNDGLVYTGVL-------------NNNGMK 175

Query: 219 FSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK-----D 273
           F+    IK     GT+ A E+ ++ V+ +D  V LL A + +   F       K     D
Sbjct: 176 FA--FRIKAIHKGGTLKA-ENDRIIVKDADEVVFLLTADTDYKMNFAPDFKDPKAYVGND 232

Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
           P+  +++ + +     Y +LY  H  DY  LF+RV  +++            E     +P
Sbjct: 233 PSQTTLAMMDNALKKGYDELYRNHEADYTALFNRVRFEIN-----------PEIGTPNLP 281

Query: 334 SAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
           + +R+ S++    D  L +L +QFGRYLLI+SSRPG   ANLQG+W+ +    W    H 
Sbjct: 282 TYKRLASYKKGVPDYQLEQLYYQFGRYLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHN 341

Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
           NIN++MNYW + P NLSEC  PL DF+  L   G KTAQ  + A GW      +I+  ++
Sbjct: 342 NINIQMNYWPACPTNLSECTWPLIDFIRSLVKPGEKTAQAYFNARGWTASISANIFGFTA 401

Query: 453 ADRGK-VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
               K + W L P  G WL TH+WE+Y+YT D  FL++  Y L++  A F +D L    D
Sbjct: 402 PLSSKSMAWNLNPTVGPWLATHIWEYYDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPD 461

Query: 512 GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEK 571
           G     PSTSPEH           V    T   A++RE+    I A++VL    DA   K
Sbjct: 462 GTYTAAPSTSPEH---------GPVDEGVTFAHAVVREILLDAIQASKVL--GTDAKERK 510

Query: 572 VLKS-LPRLRPTKIAEDGSIMEW 593
             ++ L +L P +I   G ++EW
Sbjct: 511 QWENVLTKLVPYRIGRYGQLLEW 533


>gi|423220535|ref|ZP_17207030.1| hypothetical protein HMPREF1061_03803 [Bacteroides caccae
           CL03T12C61]
 gi|392623612|gb|EIY17715.1| hypothetical protein HMPREF1061_03803 [Bacteroides caccae
           CL03T12C61]
          Length = 775

 Score =  239 bits (609), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 191/617 (30%), Positives = 284/617 (46%), Gaps = 113/617 (18%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
           +   E  + T P+++ ++ PA ++ T A+PIGNG LGA+ +GGV SE +  NE TLWTG 
Sbjct: 21  VAGVEQKTETVPMRLWYDRPATNWMTSALPIGNGELGALFFGGVESEQILFNEKTLWTG- 79

Query: 60  PGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKY 119
                            S    G                   YQ  GD+ + FD      
Sbjct: 80  -----------------STTTRG------------------AYQKFGDVWIHFDGQE--- 101

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS-LSFNVSLD 178
               YRREL L+ A  +V Y+     + RE+F+S PD+VIV ++S  ++G  L+F+VSL 
Sbjct: 102 DVREYRRELSLDEAIGKVSYTSAGTHYLREYFASRPDEVIVLRLSTPKAGKKLNFSVSL- 160

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL-------EIKISDDR 231
                            +GR PG R     +      GI F   L       ++K+ ++ 
Sbjct: 161 ----------------ADGR-PGTRQEVTKD------GILFRRKLDLLSYEAQLKVINEG 197

Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNL 288
           GT+ A +  KL V  ++  ++LL A++++D     ++  +  +         A  S +  
Sbjct: 198 GTLVA-DSNKLCVNAANSVLILLTAATNYDLSSATYVGETSGQLHKRLTDRLARASAK-- 254

Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
            Y  L + HL+DYQ LF+RV   L R+          +  I +VP+ E V   +  E   
Sbjct: 255 GYDQLKSTHLNDYQSLFNRVRFDL-RTAAKTGGKIGMKTEIPSVPTNELVHLHK--EALY 311

Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
           L  L FQ+GRYL+I+SSR      NLQGIWN D +P W+   H NIN++MNYW +  CNL
Sbjct: 312 LDMLYFQYGRYLMIASSRGMNLSNNLQGIWNGDNAPPWECDIHSNINIQMNYWPAEVCNL 371

Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLA-----SGWVIHHKTDIWAKSSADRGKVVWALW 463
           SEC EP   ++   ++    + Q   LA      GW ++ + +I+       G   W + 
Sbjct: 372 SECHEPFIRYIATEALRPGGSWQ--QLARSEGLRGWTVNTQNNIF-------GYTDWNIN 422

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
               AW C HLW+HY YT D ++L   AYP++     +  D L    DG L      SPE
Sbjct: 423 RPANAWYCMHLWKHYAYTQDINYLRSVAYPVMRSTCEYWFDRLQLTADGVLLAPAEWSPE 482

Query: 524 HEFIAP--DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL----VEKVLKSLP 577
           H    P  DG    V+Y+  +    + ++FS  + A  VL      L    V K+ + L 
Sbjct: 483 H---GPWEDG----VAYAQQL----VWQLFSETMQAVRVLRGAGIPLDADFVRKLSEKLK 531

Query: 578 RL-RPTKIAEDGSIMEW 593
           RL     +   G I EW
Sbjct: 532 RLDNGVTLGAWGQIREW 548


>gi|167763307|ref|ZP_02435434.1| hypothetical protein BACSTE_01680 [Bacteroides stercoris ATCC
           43183]
 gi|167698601|gb|EDS15180.1| hypothetical protein BACSTE_01680 [Bacteroides stercoris ATCC
           43183]
          Length = 657

 Score =  239 bits (609), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 156/481 (32%), Positives = 239/481 (49%), Gaps = 50/481 (10%)

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLL 181
           YRREL L++A A V++    V++ R  F S P  V+V + S       +L F+ + + + 
Sbjct: 18  YRRELSLDSARAIVQFCKDGVKYKRTSFISYPANVLVMRYSADRPAKQNLRFSYAPNPVS 77

Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
                  G N ++   R              D   +++  ++ +++    GT++   D+ 
Sbjct: 78  AGSLQPEGKNGLVFRARL-------------DNNSMEY--VVRMRVLTQGGTVTNTHDQL 122

Query: 242 LKVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTR 296
           L +EG+D  V L+ A +    +F+  F NP      +P   +   +       Y  LY  
Sbjct: 123 L-IEGADEVVFLITADTDYLINFNPDFTNPKTYVGVNPEETTAYWINEAEKQGYEALYQA 181

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQ 355
           H  DY  LF+RV + L+ S            +   +P  +R+  ++  + D  L +L +Q
Sbjct: 182 HYADYTALFNRVKLNLTNS-----------SDFRDMPITQRLSRYREGQKDFYLEQLYYQ 230

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRYLLI+SSRPG   ANLQGIW+ ++   W    H NINL+MNYW +   NLSEC +PL
Sbjct: 231 FGRYLLIASSRPGNFPANLQGIWHNNVDGPWRVDYHNNINLQMNYWPACSTNLSECMKPL 290

Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHL 474
            DF+  L   G KTAQ  + A GW      +I+  ++  +   + W   PM G WL TH+
Sbjct: 291 IDFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESENMSWNFNPMAGPWLATHI 350

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
           WE+Y+YT D  FL++  Y L++  A+F +D+L    DG     PSTSPEH          
Sbjct: 351 WEYYDYTRDVKFLKEIGYELIKSSANFAVDYLWHKPDGTYTAAPSTSPEH---------G 401

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
            V   +T   A++RE+    I A++VL  +  E    E+VL+   +L P KI   G +ME
Sbjct: 402 PVDQGATFVHAVVREILLDAIDASKVLRVDAKERKYWEQVLE---KLVPYKIGRYGQLME 458

Query: 593 W 593
           W
Sbjct: 459 W 459


>gi|444414515|ref|ZP_21210772.1| hypothetical protein PNI0199_00487 [Streptococcus pneumoniae
           PNI0199]
 gi|444281657|gb|ELU86965.1| hypothetical protein PNI0199_00487 [Streptococcus pneumoniae
           PNI0199]
          Length = 803

 Score =  238 bits (607), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 178/614 (28%), Positives = 289/614 (47%), Gaps = 65/614 (10%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P+  T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
              D    L+++R  ++   Y  A   + +    P       Y   GDI +EF       
Sbjct: 72  NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131

Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           ++ T Y+R+L+++ A     Y      F RE F+S PD ++V + +   + +L F + L 
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191

Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
              D  S      +      C        I  K    D+   ++F++ L  +     G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
               D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L 
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
           +RH++DYQ LF RV + L             E N+D   + + +K+++  E  +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352

Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E  
Sbjct: 353 QYGRYLLISSSRDCPDALSANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412

Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
            P+ +++  L + G + A V Y          +GW++H +     W     D     W  
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
            P   AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528

Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
           PEH           +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNP 578

Query: 582 TKIAEDGSIMEWVQ 595
            +I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592


>gi|225016842|ref|ZP_03706034.1| hypothetical protein CLOSTMETH_00754 [Clostridium methylpentosum
           DSM 5476]
 gi|224950383|gb|EEG31592.1| hypothetical protein CLOSTMETH_00754 [Clostridium methylpentosum
           DSM 5476]
          Length = 1957

 Score =  238 bits (607), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 178/638 (27%), Positives = 312/638 (48%), Gaps = 77/638 (12%)

Query: 4   AESTSTTNPLKITFNGPAKH-----FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG 58
           AE++   N L++ +  PA        T+++PIGNG +G+ V+GGV  E L LNE TLW+G
Sbjct: 37  AEASVNDNDLRLWYTSPAPDTYNGWMTNSLPIGNGYMGSNVFGGVGRERLSLNEKTLWSG 96

Query: 59  VPG---DYTNPDAP------KALSDVRSLVDSGQYAEATAASVKLFGHPAD-------VY 102
            P    DY   +        + +  ++     G  + A +   +L G   D        Y
Sbjct: 97  GPAEGRDYNGGNLESRGKNGETMKQIQQAFAEGNTSLANSLCNQLTGLSDDGGTQGYGYY 156

Query: 103 QLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTK 162
              G++ LEF       A+  Y R+LD+ TA A V Y    V + RE+F+S PD ++V +
Sbjct: 157 LSYGNMYLEFPGMSDGNAQN-YVRDLDMKTAIASVNYDYDGVNYNREYFTSYPDNMMVAR 215

Query: 163 ISGSESGSLSFNVSLDSLLDNHS------YVNGNNQIIMEGRCPGKRIPPKANANDDPKG 216
           ++ SE+G L+FN+S++   DN S        N   Q        G  I  +   +D+   
Sbjct: 216 LTASEAGKLTFNLSVNP--DNTSGKGQGPNTNNGYQRTWIQTADGGLITIQGQLSDNQ-- 271

Query: 217 IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG--PFINPSDSKKDP 274
           ++F++  + K+ +  GT+   ED  + V G+D  V+L+   + +D   P      +  + 
Sbjct: 272 LKFAS--QTKVLNTGGTLVDNEDGTVSVTGADEVVILMTMGTDYDDNYPVYRTGQTDAEL 329

Query: 275 TSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPS 334
            ++    + +   L Y  L   HL DYQ +F RV + L +              I  +P+
Sbjct: 330 LADIQGRIDAATELGYEGLLKSHLADYQGIFDRVHLDLGQE-------------ISQIPT 376

Query: 335 AERVKSFQTDED-PSLVE----LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
            + + +++   + P+L +    LL+Q+GRYL I+SSR G+  +NLQG+W    +  W S 
Sbjct: 377 NQLLTNYKNGSNTPALNQALEVLLYQYGRYLTIASSREGSLPSNLQGVWTGANNSPWHSD 436

Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---------SGWV 440
            H+N+NL+MNYW +   N++EC  PL +++  L   G  TA++ Y           +G++
Sbjct: 437 YHMNVNLQMNYWPTYSTNMAECAIPLIEYVDALRAPGRVTAKI-YAGIESTEENPENGFM 495

Query: 441 IHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCAS 500
            H + + +  +        W   P    W+  + WE+Y YT D D++++  YP+L+  A 
Sbjct: 496 AHTQNNPYGWTCPGW-SFDWGWSPAATPWIIQNCWEYYEYTGDLDYMKENIYPMLKEEAR 554

Query: 501 FLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
                LIE  + G L  +P+ SPEH            +  +T + ++I ++F+  I A +
Sbjct: 555 LYEQMLIEDPETGKLVCSPAYSPEH---------GPRTNGNTYEQSLIWQLFTDAIIAGK 605

Query: 560 VLEKNEDALVEKVLKSLPRLR-PTKIAEDGSIMEWVQR 596
           ++++++ A ++K  + +  L+ P +I + G I EW + 
Sbjct: 606 LVDEDQ-ATLDKWQEIIDNLKGPIEIGDSGQIKEWYEE 642


>gi|335029650|ref|ZP_08523157.1| hypothetical protein HMPREF9967_1785 [Streptococcus infantis
           SK1076]
 gi|334268947|gb|EGL87379.1| hypothetical protein HMPREF9967_1785 [Streptococcus infantis
           SK1076]
          Length = 806

 Score =  238 bits (607), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 183/622 (29%), Positives = 304/622 (48%), Gaps = 83/622 (13%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GD 62
           P   +++G  K    A+P+GNG +GA ++G +  E ++ NE TLW+G P         G+
Sbjct: 14  PTAPSYDGWEKQ---ALPVGNGEMGAKIFGLIGEERIQYNEKTLWSGGPQLDSTDYNGGN 70

Query: 63  YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLK 118
           Y   D  K L+++R  +++G   +A   + +    P +     Y   GDI + F++    
Sbjct: 71  YQ--DRYKVLAEIRKALEAGDRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKG 128

Query: 119 YAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---N 174
               T Y R+LD+  A     YS     F RE FSS PD V VT +S     +L F   N
Sbjct: 129 LENVTDYHRDLDITEAITTTSYSQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWN 188

Query: 175 VSLDSLLDNHSY---VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
              ++LL N  Y    +   Q  +     G  I  K    D+  G++F++ L IK     
Sbjct: 189 SLTENLLANGDYSWEYSNYKQGAVTTDSNG--ILLKGTVKDN--GLKFASYLGIKTD--- 241

Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNL 288
           G ++A +D  L V G+ +A LLL   +++     NP ++ +KD   E+   S +++ +  
Sbjct: 242 GQVTA-QDGYLTVTGASYATLLLSVKTNYAQ---NPKTNYRKDIDVENTVKSIVEAAKAK 297

Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
            Y  L   H+ DYQ LF+RV + L               N  +  + E ++++   +   
Sbjct: 298 DYETLKNNHIKDYQSLFNRVQLNLGG-------------NKSSQTTKEALQTYDPTKGQQ 344

Query: 349 LVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
           L EL FQ+GRYLLISSSR  T    ANLQG+WN   +P W+S  H+N+NL+MNYW +   
Sbjct: 345 LEELFFQYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMN 404

Query: 407 NLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADR 455
           NL+E  +P+ +++  +   G           SK  Q N    GW++H +   +  ++   
Sbjct: 405 NLAETAKPMINYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW 460

Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGY 513
               W   P   AW+  +++++Y +T D  +L+++ YP+L+    F   +L   +  D +
Sbjct: 461 -NYYWGWSPAANAWMMQNVYDYYKFTKDEAYLKEKIYPMLKETTKFWNSFLHYDKSSDRW 519

Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
           + ++PS SPEH           ++  +T D +++ ++F   + AA  L  ++D LV +V 
Sbjct: 520 V-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLNVDQD-LVTEVK 568

Query: 574 KSLPRLRPTKIAEDGSIMEWVQ 595
               +L+P  I +DG I EW +
Sbjct: 569 AKFDKLKPLHINQDGRIKEWYE 590


>gi|444387033|ref|ZP_21185059.1| hypothetical protein PCS125219_00445 [Streptococcus pneumoniae
           PCS125219]
 gi|444389242|ref|ZP_21187159.1| hypothetical protein PCS70012_00258 [Streptococcus pneumoniae
           PCS70012]
 gi|444393004|ref|ZP_21190665.1| hypothetical protein PCS81218_01475 [Streptococcus pneumoniae
           PCS81218]
 gi|444400918|ref|ZP_21198254.1| hypothetical protein PNI0007_02076 [Streptococcus pneumoniae
           PNI0007]
 gi|444418365|ref|ZP_21214349.1| hypothetical protein PNI0360_01769 [Streptococcus pneumoniae
           PNI0360]
 gi|444419893|ref|ZP_21215727.1| hypothetical protein PNI0427_00756 [Streptococcus pneumoniae
           PNI0427]
 gi|444254243|gb|ELU60689.1| hypothetical protein PCS125219_00445 [Streptococcus pneumoniae
           PCS125219]
 gi|444257842|gb|ELU64175.1| hypothetical protein PCS70012_00258 [Streptococcus pneumoniae
           PCS70012]
 gi|444262591|gb|ELU68882.1| hypothetical protein PCS81218_01475 [Streptococcus pneumoniae
           PCS81218]
 gi|444264795|gb|ELU70844.1| hypothetical protein PNI0007_02076 [Streptococcus pneumoniae
           PNI0007]
 gi|444281712|gb|ELU87019.1| hypothetical protein PNI0360_01769 [Streptococcus pneumoniae
           PNI0360]
 gi|444285998|gb|ELU91006.1| hypothetical protein PNI0427_00756 [Streptococcus pneumoniae
           PNI0427]
          Length = 803

 Score =  238 bits (606), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 178/614 (28%), Positives = 289/614 (47%), Gaps = 65/614 (10%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P+  T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
              D    L+++R  ++   Y  A   + +    P       Y   GDI +EF       
Sbjct: 72  NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131

Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           ++ T Y+R+L+++ A     Y      F RE F+S PD ++V + +   + +L F + L 
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191

Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
              D  S      +      C        I  K    D+   ++F++ L  +     G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
               D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L 
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
           +RH++DYQ LF RV + L             E N+D   + + +K+++  E  +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352

Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E  
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412

Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
            P+ +++  L + G + A V Y          +GW++H +     W     D     W  
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
            P   AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528

Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
           PEH           +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNP 578

Query: 582 TKIAEDGSIMEWVQ 595
            +I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592


>gi|221232393|ref|YP_002511546.1| hypothetical protein SPN23F_16560 [Streptococcus pneumoniae ATCC
           700669]
 gi|225857271|ref|YP_002738782.1| alpha-fucosidase [Streptococcus pneumoniae P1031]
 gi|298254439|ref|ZP_06978025.1| alpha-fucosidase [Streptococcus pneumoniae str. Canada MDR_19A]
 gi|298503399|ref|YP_003725339.1| alpha-L-fucosidase [Streptococcus pneumoniae TCH8431/19A]
 gi|410477028|ref|YP_006743787.1| hypothetical protein HMPREF1038_01639 [Streptococcus pneumoniae
           gamPNI0373]
 gi|415700118|ref|ZP_11457832.1| fibronectin type III domain protein [Streptococcus pneumoniae
           459-5]
 gi|415752860|ref|ZP_11479842.1| fibronectin type III domain protein [Streptococcus pneumoniae SV36]
 gi|418079078|ref|ZP_12716300.1| fibronectin type III domain protein [Streptococcus pneumoniae
           4027-06]
 gi|418081275|ref|ZP_12718485.1| fibronectin type III domain protein [Streptococcus pneumoniae
           6735-05]
 gi|418083460|ref|ZP_12720657.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44288]
 gi|418123978|ref|ZP_12760909.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44378]
 gi|418128522|ref|ZP_12765415.1| fibronectin type III domain protein [Streptococcus pneumoniae
           NP170]
 gi|418178700|ref|ZP_12815283.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA41565]
 gi|419427712|ref|ZP_13967893.1| fibronectin type III domain protein [Streptococcus pneumoniae
           5652-06]
 gi|419436452|ref|ZP_13976539.1| fibronectin type III domain protein [Streptococcus pneumoniae
           8190-05]
 gi|419450962|ref|ZP_13990948.1| fibronectin type III domain protein [Streptococcus pneumoniae
           EU-NP02]
 gi|419473709|ref|ZP_14013558.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA13430]
 gi|444394527|ref|ZP_21192078.1| hypothetical protein PNI0002_00522 [Streptococcus pneumoniae
           PNI0002]
 gi|444398107|ref|ZP_21195590.1| hypothetical protein PNI0006_01690 [Streptococcus pneumoniae
           PNI0006]
 gi|444402905|ref|ZP_21200052.1| hypothetical protein PNI0008_01502 [Streptococcus pneumoniae
           PNI0008]
 gi|444404353|ref|ZP_21201309.1| hypothetical protein PNI0009_00413 [Streptococcus pneumoniae
           PNI0009]
 gi|444407726|ref|ZP_21204393.1| hypothetical protein PNI0010_01147 [Streptococcus pneumoniae
           PNI0010]
 gi|444409151|ref|ZP_21205749.1| hypothetical protein PNI0076_00189 [Streptococcus pneumoniae
           PNI0076]
 gi|444412799|ref|ZP_21209118.1| hypothetical protein PNI0153_01181 [Streptococcus pneumoniae
           PNI0153]
 gi|444422007|ref|ZP_21217672.1| hypothetical protein PNI0446_00358 [Streptococcus pneumoniae
           PNI0446]
 gi|220674854|emb|CAR69429.1| conserved hypothetical protein [Streptococcus pneumoniae ATCC
           700669]
 gi|225726032|gb|ACO21884.1| alpha-fucosidase [Streptococcus pneumoniae P1031]
 gi|298238994|gb|ADI70125.1| possible alpha-L-fucosidase [Streptococcus pneumoniae TCH8431/19A]
 gi|353746605|gb|EHD27265.1| fibronectin type III domain protein [Streptococcus pneumoniae
           4027-06]
 gi|353752014|gb|EHD32645.1| fibronectin type III domain protein [Streptococcus pneumoniae
           6735-05]
 gi|353754680|gb|EHD35292.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44288]
 gi|353795798|gb|EHD76144.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44378]
 gi|353799021|gb|EHD79344.1| fibronectin type III domain protein [Streptococcus pneumoniae
           NP170]
 gi|353842759|gb|EHE22805.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA41565]
 gi|379550873|gb|EHZ15969.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA13430]
 gi|379612891|gb|EHZ77606.1| fibronectin type III domain protein [Streptococcus pneumoniae
           8190-05]
 gi|379617905|gb|EHZ82585.1| fibronectin type III domain protein [Streptococcus pneumoniae
           5652-06]
 gi|379622667|gb|EHZ87301.1| fibronectin type III domain protein [Streptococcus pneumoniae
           EU-NP02]
 gi|381308507|gb|EIC49350.1| fibronectin type III domain protein [Streptococcus pneumoniae SV36]
 gi|381314814|gb|EIC55580.1| fibronectin type III domain protein [Streptococcus pneumoniae
           459-5]
 gi|406369973|gb|AFS43663.1| hypothetical protein HMPREF1038_01639 [Streptococcus pneumoniae
           gamPNI0373]
 gi|444259769|gb|ELU66078.1| hypothetical protein PNI0002_00522 [Streptococcus pneumoniae
           PNI0002]
 gi|444260764|gb|ELU67072.1| hypothetical protein PNI0006_01690 [Streptococcus pneumoniae
           PNI0006]
 gi|444265666|gb|ELU71662.1| hypothetical protein PNI0008_01502 [Streptococcus pneumoniae
           PNI0008]
 gi|444271322|gb|ELU77073.1| hypothetical protein PNI0010_01147 [Streptococcus pneumoniae
           PNI0010]
 gi|444274038|gb|ELU79693.1| hypothetical protein PNI0153_01181 [Streptococcus pneumoniae
           PNI0153]
 gi|444276986|gb|ELU82513.1| hypothetical protein PNI0009_00413 [Streptococcus pneumoniae
           PNI0009]
 gi|444280076|gb|ELU85452.1| hypothetical protein PNI0076_00189 [Streptococcus pneumoniae
           PNI0076]
 gi|444288631|gb|ELU93522.1| hypothetical protein PNI0446_00358 [Streptococcus pneumoniae
           PNI0446]
          Length = 803

 Score =  238 bits (606), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 178/614 (28%), Positives = 289/614 (47%), Gaps = 65/614 (10%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P+  T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
              D    L+++R  ++   Y  A   + +    P       Y   GDI +EF       
Sbjct: 72  NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131

Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           ++ T Y+R+L+++ A     Y      F RE F+S PD ++V + +   + +L F + L 
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191

Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
              D  S      +      C        I  K    D+   ++F++ L  +     G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
               D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L 
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
           +RH++DYQ LF RV + L             E N+D   + + +K+++  E  +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352

Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E  
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412

Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
            P+ +++  L + G + A V Y          +GW++H +     W     D     W  
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
            P   AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528

Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
           PEH           +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNP 578

Query: 582 TKIAEDGSIMEWVQ 595
            +I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592


>gi|419767010|ref|ZP_14293181.1| alpha-L-fucosidase [Streptococcus mitis SK579]
 gi|383353528|gb|EID31137.1| alpha-L-fucosidase [Streptococcus mitis SK579]
          Length = 803

 Score =  238 bits (606), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 284/610 (46%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSFDYQGGNLQDQYSFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GD+ +EF        + T Y+R+L+++ A
Sbjct: 87  LEKRDYNTAKELAEQHLVGPKTSQYGTYLSFGDLLIEFSRQGKTLFQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNG- 189
            A   Y+     F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LATTSYAYKGTMFKREAFASFPDDLLVQRFTKEGAETLDFTIELSLTRDLTSDEKYEQKK 206

Query: 190 -----------NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR                  ++F+  L  +     G I    
Sbjct: 207 SDYKECQLEITDSHILMKGRVK-------------DNNLRFAGCLAWQTD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           DK +++ G+ +A L L A + F     +    K D   +    +++ +   Y+ L +RH+
Sbjct: 251 DK-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLVQQVRDLVETAKEKGYAQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
            DYQ LF RV + L               ++DT  + + +K+++  E  +L EL FQ+GR
Sbjct: 310 QDYQALFQRVQLDLG-------------ADVDTSTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQGIWN   +P W+S  H+NINL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGIWNAVDNPPWNSDYHLNINLQMNYWPAYVTNLLETAFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A   Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYIDDLRVYG-RLAAARYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F  D+L E        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNDFLHEDRQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  + D L E V +    L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGMDADLLTE-VKEKFDLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|83765422|dbj|BAE55565.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 546

 Score =  237 bits (605), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 172/588 (29%), Positives = 274/588 (46%), Gaps = 88/588 (14%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           ++ P   F  ++P+GNGRLG  ++  +P+E +  NED++W+G   D  N +A      VR
Sbjct: 34  YDTPGTRFNASLPVGNGRLGGTLYC-LPTEIVTWNEDSVWSGTFQDRVNSNALDGFPKVR 92

Query: 77  SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           +L+ +G    A   ++  + G   D   YQ+L ++ ++            Y   L+  TA
Sbjct: 93  NLLVNGNITAAGELALSDMTGSSVDQREYQVLSNLYVDLGQRGDATNLVWYLDTLEGYTA 152

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
               +Y    V +T                                        NG   I
Sbjct: 153 ---CEYGFDGVSYT--------------------------------------VANGIASI 171

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
           +M+ R              +     F+A + + +  D G ++A  DK L V G+   V  
Sbjct: 172 VMKAR------------TGEADYSTFTAGVRVVV--DGGNVTANGDK-LYVTGATTVVFF 216

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
           L A SS+         +  D  +E    L +   L Y  L    + D++ L  RV++ L 
Sbjct: 217 LDAESSYR------YATDSDQETELNRKLDAATELGYEALRKEAITDHKDLAGRVTLDLG 270

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSSRPGTQV 371
            S  D  +          +P  ER+ ++++  D D     L+F +GR+LLI+SSR   + 
Sbjct: 271 SSTDDAAS----------LPPNERMTNYRSSPDHDVQFATLVFNYGRHLLIASSRRTRER 320

Query: 372 A---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
           +    LQGIWN+D SP+W +   VNINLEMNYW +   NL+E   PL+D L  +   G  
Sbjct: 321 SLSPGLQGIWNQDYSPSWGAKYTVNINLEMNYWPAETTNLNELTSPLWDLLALIQERGGD 380

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
            A+  +   G+V+HH TD+W  S        +++WPMGGAWL  H+ EHY +T D+ FL+
Sbjct: 381 VAEKMHGCPGFVLHHNTDLWGDSVPVHNGTKYSIWPMGGAWLALHMMEHYRFTGDKTFLK 440

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMD 543
           ++A P+ +    F   +L +  DGYL T PS SPE+ F  P      GK   ++ S T+D
Sbjct: 441 EQACPIFKSAFEFFECYLFD-VDGYLTTGPSCSPENAFQIPSDMTVAGKEEALTMSPTLD 499

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
            +++ E+ +A+    ++LE + D L   V   L ++RP +I  DG I+
Sbjct: 500 NSMLFELLTALNETHQILEIDND-LSGSVQTYLGKIRPPRIGSDGQIL 546


>gi|322387111|ref|ZP_08060722.1| alpha-L-fucosidase [Streptococcus infantis ATCC 700779]
 gi|321142098|gb|EFX37592.1| alpha-L-fucosidase [Streptococcus infantis ATCC 700779]
          Length = 1840

 Score =  237 bits (605), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 181/616 (29%), Positives = 298/616 (48%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 230 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYQ--DRYKVLAEIRK 287

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD++ 
Sbjct: 288 ALEEGNRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKGLENVTDYHRGLDISE 347

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSY--- 186
           A +   Y+     F RE FSS PD V VT +S     +L F   N   + L+ N  Y   
Sbjct: 348 AISTTSYTQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWNSLTEDLIANGDYSWE 407

Query: 187 ---------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
                       +N I+++G         K N      G++F++ L IK     G ++A 
Sbjct: 408 YSNYKQGAVTTDSNGILLKGTV-------KDN------GLKFASYLGIKTD---GQVTA- 450

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D  L V G+ +A LLL A ++F     NP ++ +KD   E    + +++ +   Y  L 
Sbjct: 451 QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDLEKTVKNIVETAKAKGYEKLK 507

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV +    S     T              E + ++  ++   L EL F
Sbjct: 508 EDHVKDYQSLFNRVQLNFGGSKSSQTT-------------KEALHTYNPEKGQKLEELFF 554

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL+E  
Sbjct: 555 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMNNLAETA 614

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 615 KPMVNYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWG 669

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 670 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKSSDRWV-SSPS 728

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   + AA  L+ ++D LV +V     +L
Sbjct: 729 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLKVDQD-LVTEVKAKFDKL 778

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I +DG I EW +
Sbjct: 779 KPLHINQDGRIKEWYE 794


>gi|418101115|ref|ZP_12738198.1| hypothetical protein SPAR128_1534 [Streptococcus pneumoniae
           7286-06]
 gi|418183181|ref|ZP_12819739.1| hypothetical protein SPAR78_1589 [Streptococcus pneumoniae GA43380]
 gi|418196314|ref|ZP_12832790.1| hypothetical protein SPAR103_1510 [Streptococcus pneumoniae
           GA47688]
 gi|418223856|ref|ZP_12850496.1| hypothetical protein SPAR127_1557 [Streptococcus pneumoniae
           5185-06]
 gi|419447313|ref|ZP_13987318.1| fibronectin type III domain protein [Streptococcus pneumoniae
           7879-04]
 gi|353770615|gb|EHD51127.1| hypothetical protein SPAR128_1534 [Streptococcus pneumoniae
           7286-06]
 gi|353848164|gb|EHE28181.1| hypothetical protein SPAR78_1589 [Streptococcus pneumoniae GA43380]
 gi|353860325|gb|EHE40270.1| hypothetical protein SPAR103_1510 [Streptococcus pneumoniae
           GA47688]
 gi|353878654|gb|EHE58484.1| hypothetical protein SPAR127_1557 [Streptococcus pneumoniae
           5185-06]
 gi|379614853|gb|EHZ79563.1| fibronectin type III domain protein [Streptococcus pneumoniae
           7879-04]
          Length = 778

 Score =  237 bits (605), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 180/625 (28%), Positives = 294/625 (47%), Gaps = 87/625 (13%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P+  T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
              D    L+++R  ++   Y  A   + +    P       Y   GDI +EF       
Sbjct: 72  NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131

Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           ++ T Y+R+L+++ A     Y      F RE F+S PD ++V + +   + +L F + L 
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191

Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
               L  +  Y               ++ I+M+GR            ND    ++F++ L
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRV---------KDND----LRFASYL 238

Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
             +     G I    D+ +++ G+ +A L L A + F     +    K D   + +  + 
Sbjct: 239 AWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVD 294

Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
           + +   Y+ L +RH++DYQ LF RV + L             E N+D   + + +K+++ 
Sbjct: 295 TAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKP 341

Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
            E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW
Sbjct: 342 QEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYW 401

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
            +   NL E   P+ +++  L + G + A V Y          +GW++H +     W   
Sbjct: 402 PAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAP 460

Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
             D     W   P   AW+   ++E Y++  D+D+L ++ YP+L     F   +L +   
Sbjct: 461 GWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQ 517

Query: 512 GY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
                ++PS SPEH           +S  +T D ++I ++F   I AA+ L  +ED L E
Sbjct: 518 AQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTE 568

Query: 571 KVLKSLPRLRPTKIAEDGSIMEWVQ 595
              KS   L P +I + G I EW +
Sbjct: 569 VKEKS-DLLNPLQITQSGRIREWYE 592


>gi|419844091|ref|ZP_14367392.1| gram positive anchor [Streptococcus infantis ATCC 700779]
 gi|385702207|gb|EIG39356.1| gram positive anchor [Streptococcus infantis ATCC 700779]
          Length = 1757

 Score =  237 bits (605), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 181/617 (29%), Positives = 298/617 (48%), Gaps = 98/617 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 147 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYQ--DRYKVLAEIRK 204

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD++ 
Sbjct: 205 ALEEGNRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKGLENVTDYHRGLDISE 264

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSY--- 186
           A +   Y+     F RE FSS PD V VT +S     +L F   N   + L+ N  Y   
Sbjct: 265 AISTTSYTQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWNSLTEDLIANGDYSWE 324

Query: 187 ---------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
                       +N I+++G         K N      G++F++ L IK     G ++A 
Sbjct: 325 YSNYKQGAVTTDSNGILLKGTV-------KDN------GLKFASYLGIKTD---GQVTA- 367

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D  L V G+ +A LLL A ++F     NP ++ +KD   E    + +++ +   Y  L 
Sbjct: 368 QDGYLTVTGASYATLLLSAKTNF---AQNPKTNYRKDIDLEKTVKNIVETAKAKGYEKLK 424

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV +    S     T              E + ++  ++   L EL F
Sbjct: 425 EDHVKDYQSLFNRVQLNFGGSKSSQTT-------------KEALHTYNPEKGQKLEELFF 471

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL+E  
Sbjct: 472 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMNNLAETA 531

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 532 KPMVNYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWG 586

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 587 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKSSDRWV-SSPS 645

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   + AA  L+ ++D LV +V     +L
Sbjct: 646 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLKVDQD-LVTEVKAKFDKL 695

Query: 580 RPTKIAEDGSIMEWVQR 596
           +P  I +DG I EW + 
Sbjct: 696 KPLHINQDGRIKEWYEE 712


>gi|419425597|ref|ZP_13965793.1| hypothetical protein SPAR131_1527 [Streptococcus pneumoniae
           7533-05]
 gi|379619058|gb|EHZ83732.1| hypothetical protein SPAR131_1527 [Streptococcus pneumoniae
           7533-05]
          Length = 778

 Score =  237 bits (605), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 180/625 (28%), Positives = 294/625 (47%), Gaps = 87/625 (13%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P+  T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
              D    L+++R  ++   Y  A   + +    P       Y   GDI +EF       
Sbjct: 72  NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131

Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           ++ T Y+R+L+++ A     Y      F RE F+S PD ++V + +   + +L F + L 
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191

Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
               L  +  Y               ++ I+M+GR            ND    ++F++ L
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRV---------KDND----LRFASYL 238

Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
             +     G I    D+ +++ G+ +A L L A + F     +    K D   + +  + 
Sbjct: 239 AWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVD 294

Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
           + +   Y+ L +RH++DYQ LF RV + L             E N+D   + + +K+++ 
Sbjct: 295 TAKEEGYTQLKSRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKP 341

Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
            E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW
Sbjct: 342 QEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYW 401

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
            +   NL E   P+ +++  L + G + A V Y          +GW++H +     W   
Sbjct: 402 PAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAP 460

Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
             D     W   P   AW+   ++E Y++  D+D+L ++ YP+L     F   +L +   
Sbjct: 461 GWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQ 517

Query: 512 GY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
                ++PS SPEH           +S  +T D ++I ++F   I AA+ L  +ED L E
Sbjct: 518 AQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTE 568

Query: 571 KVLKSLPRLRPTKIAEDGSIMEWVQ 595
              KS   L P +I + G I EW +
Sbjct: 569 VKEKS-DLLNPLQITQSGRIREWYE 592


>gi|417846683|ref|ZP_12492676.1| hypothetical protein HMPREF9958_1458 [Streptococcus mitis SK1073]
 gi|339458316|gb|EGP70859.1| hypothetical protein HMPREF9958_1458 [Streptococcus mitis SK1073]
          Length = 803

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 179/625 (28%), Positives = 293/625 (46%), Gaps = 87/625 (13%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P   T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
              D    L+++R  ++   Y  A   + +    P       Y   GDI +EF +     
Sbjct: 72  NLQDQYAFLAEIRQALEKRDYNTAKELAEQHLVGPKTSQYGTYLSFGDIHIEFSNQGKTL 131

Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           ++ T Y+R+L+++ A A   Y     +F RE F+S PD  +V + +   + +L F + L 
Sbjct: 132 SQVTDYQRQLNISKALATTSYVYKGTKFERETFASFPDDFLVQRFTKEGAETLDFTIELS 191

Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
               L  +  Y               ++ I+M+GR            ND    +QF++ L
Sbjct: 192 LSRDLASDGKYEQEKSDYKECKLDITDSYILMKGRV---------KDND----LQFASYL 238

Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
             +     G I    DK +++ G+ +A L L A + F     +    K D   +    + 
Sbjct: 239 AWETD---GDIRVWSDK-VQISGASYANLFLAAKTDFAQNPASNYRKKIDLEQQVKDLVD 294

Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
           + +   Y+ L +RH++DYQ LF RV + L               ++DT  + + +K+++ 
Sbjct: 295 TAKEKGYAQLKSRHIEDYQALFQRVQLDLG-------------ADVDTSTTDDLLKNYKP 341

Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
            E  +L E+ FQ+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+NINL+MNYW
Sbjct: 342 QEGQALEEMFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYW 401

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
            +   NL E   P+ +++  L + G + A   Y          +GW++H +     W   
Sbjct: 402 PAYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAP 460

Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
             D     W   P   AW+   ++E Y++  D+D+L ++ YP+L     F   +L +   
Sbjct: 461 GWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQ 517

Query: 512 -GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
                ++PS SPEH           +S  ++ D ++I ++F   I AA+ L  +ED L E
Sbjct: 518 VQRWVSSPSYSPEH---------GPISIGNSYDQSLIWQLFHDFIQAAQELSLDEDLLTE 568

Query: 571 KVLKSLPRLRPTKIAEDGSIMEWVQ 595
            V +    L P +I + G I EW +
Sbjct: 569 -VKEKFDLLNPLQITQSGRIREWYE 592


>gi|419445162|ref|ZP_13985177.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA19923]
 gi|379572855|gb|EHZ37812.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA19923]
          Length = 778

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 180/625 (28%), Positives = 294/625 (47%), Gaps = 87/625 (13%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P+  T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
              D    L+++R  ++   Y  A   + +    P       Y   GDI +EF       
Sbjct: 72  NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131

Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           ++ T Y+R+L+++ A     Y      F RE F+S PD ++V + +   + +L F + L 
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191

Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
               L  +  Y               ++ I+M+GR            ND    ++F++ L
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRV---------KDND----LRFASYL 238

Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
             +     G I    D+ +++ G+ +A L L A + F     +    K D   + +  + 
Sbjct: 239 AWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVD 294

Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
           + +   Y+ L +RH++DYQ LF RV + L             E N+D   + + +K+++ 
Sbjct: 295 TAKEEGYTQLKSRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKP 341

Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
            E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW
Sbjct: 342 QEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYW 401

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
            +   NL E   P+ +++  L + G + A V Y          +GW++H +     W   
Sbjct: 402 PAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAP 460

Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
             D     W   P   AW+   ++E Y++  D+D+L ++ YP+L     F   +L +   
Sbjct: 461 GWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQ 517

Query: 512 GY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
                ++PS SPEH           +S  +T D ++I ++F   I AA+ L  +ED L E
Sbjct: 518 AQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTE 568

Query: 571 KVLKSLPRLRPTKIAEDGSIMEWVQ 595
              KS   L P +I + G I EW +
Sbjct: 569 VKEKS-DLLNPLQITQSGRIREWYE 592


>gi|289167478|ref|YP_003445747.1| hypothetical protein smi_0630 [Streptococcus mitis B6]
 gi|288907045|emb|CBJ21879.1| conserved hypothetical protein [Streptococcus mitis B6]
          Length = 803

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 181/623 (29%), Positives = 288/623 (46%), Gaps = 83/623 (13%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P   T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEF-DDSHLK 118
              D    L+D+R  ++   Y      + +    P       Y   GDI +EF +     
Sbjct: 72  NLQDQHNFLTDIRQALEKRDYNRTKELAEQHLVGPKTSQYGTYLSFGDIHIEFSNQGKTL 131

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y    Y+R+L+++ A A   Y     +F RE F+S PD ++V + +     +L F + L 
Sbjct: 132 YQVTDYQRQLNISKALATASYVYKGTKFERETFASFPDDLLVQRYTKEGLETLDFTIELS 191

Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
               L  +  Y               ++ I+M+GR            ND    +QF++ L
Sbjct: 192 LTHDLASDGKYEQEKSDYKECQLDISDSYILMKGRV---------KDND----LQFTSCL 238

Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
             +   D    S     K+++ G+ +A L L A + F     +    K D   +    ++
Sbjct: 239 AWETDGDIRVWS----NKVQISGASYANLFLAAKTDFAQNPASNYRKKIDLEKQVKDLVE 294

Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
             +   Y+ L +RH+ DYQ LF RV + L               ++DT  + + +K+++ 
Sbjct: 295 IAKEKGYAQLKSRHIQDYQALFQRVQLDLG-------------ADVDTSTTDDLLKNYKP 341

Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
            E   L EL FQ+GRYLLISSSR  P    ANLQGIWN   +P W+S  H+NINL+MNYW
Sbjct: 342 QEGQVLEELFFQYGRYLLISSSRDCPDALPANLQGIWNAVDNPPWNSDYHLNINLQMNYW 401

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDIWAKSSA 453
            +   NL E   P+ +++  L + G + A   Y          +GW++H +   +   +A
Sbjct: 402 PAYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFG-WTA 459

Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
                 W   P   AWL   ++E Y++  D+D+L ++ YP+L     F  D+L E     
Sbjct: 460 PGWNYYWGWSPAANAWLMQTVYEAYSFYSDQDYLREKIYPMLRETVYFWNDFLHEDRQAQ 519

Query: 514 -LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
              ++PS SPEH           +S  +T D ++I ++F   I AA+ L  + D L E V
Sbjct: 520 RWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDGDLLTE-V 569

Query: 573 LKSLPRLRPTKIAEDGSIMEWVQ 595
            +    L P ++ + G I EW +
Sbjct: 570 KEKFDLLNPLQLTQSGRIREWYE 592


>gi|332982836|ref|YP_004464277.1| alpha-L-fucosidase [Mahella australiensis 50-1 BON]
 gi|332700514|gb|AEE97455.1| Alpha-L-fucosidase [Mahella australiensis 50-1 BON]
          Length = 816

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 175/597 (29%), Positives = 282/597 (47%), Gaps = 52/597 (8%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
           PA  + DAIP GNG +GA+V+G + +E + LN + L+        N    + LS +R ++
Sbjct: 13  PAIRWQDAIPCGNGSIGALVYGHIKNEIITLNHEALFLKSQKPQIN-SIYEYLSQLRKML 71

Query: 80  DSGQYAEATAASVKLFGH------PADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
             G+Y E      +            D YQ   DI++   DS    A   Y R LD  T 
Sbjct: 72  MEGKYNEGAQFFERKLKENYIGIARTDPYQPAFDIKI---DSETHEAFTGYCRYLDFETG 128

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A V++S GN  + R+ F S  D  ++ +I+   S  ++  +SL         V G   +
Sbjct: 129 EAVVRWSEGNTNYHRDLFVSRVDDAVILRINAVGSEKVNCVISLVP-----CRVEGATGM 183

Query: 194 IMEGRCPGKRIPPKANANDD----------PKGIQFSAILEIKISDDRGTISALEDKKLK 243
                  G ++P +  A+ +          P G +F  +  + ++   G +  +E +   
Sbjct: 184 GSGKDVKGDKLPFEWQASSEENWISFEAQYPDGNEFGGVARLIVNG--GCMEGIEAQNNC 241

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
           +   D   +L++        F+N    K   T E+  +     ++ Y  L ++H+  +++
Sbjct: 242 IYIKDATEVLMMVKV-----FVN---EKSKTTIENTKSQLEKMDVCYEALLSKHVYQHRE 293

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           L+ RV+I+     +D +      E +        ++S+      +L++ +F FGRYLLIS
Sbjct: 294 LYKRVNIEFHEQREDKLAKQKFNEEL-------LLESYNGQIPTALIQRMFYFGRYLLIS 346

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSRPG   ANLQGIWN D  P W S  H + N+EMNYW +LP NL E   P FD+   + 
Sbjct: 347 SSRPGGLPANLQGIWNGDYVPAWASDYHNDENIEMNYWAALPGNLPETTLPYFDYYMSML 406

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
            +    A+V Y   G +             D    +WA W  G  WL    ++++ +T D
Sbjct: 407 EDFRTNAKVIYGCRGILAPIAQTTHGLVYTDP---IWATWTAGAGWLSQLFYDYWLFTGD 463

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
            DFL+ +A P ++  A F  D+L+EG DG     PS SPE+    P+  L  V+ ++TMD
Sbjct: 464 MDFLKNKAIPFMKEIALFYEDFLVEGEDGKFMFIPSLSPENTPPIPNASL--VTINATMD 521

Query: 544 MAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRL 598
           +AI REV + + +A + L  EK    + + +L  LP     ++ EDG+I EW+   L
Sbjct: 522 IAIAREVLANLCAACKYLGIEKENVKIWKHMLSKLPEY---QVNEDGAIKEWIHSDL 575


>gi|418137717|ref|ZP_12774555.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA11663]
 gi|353900672|gb|EHE76223.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA11663]
          Length = 782

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 175/599 (29%), Positives = 283/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
                Y      F RE F+S PD ++V + +   + +L F + L    D  S      + 
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDGKYEQEK 185

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E N+D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 300 LDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 405

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571


>gi|189207008|ref|XP_001939838.1| hypothetical protein PTRG_09506 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187975931|gb|EDU42557.1| hypothetical protein PTRG_09506 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 742

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 185/606 (30%), Positives = 286/606 (47%), Gaps = 97/606 (16%)

Query: 4   AESTSTTNPLKITFNGPAKH--FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
           A   S ++  ++ +  PA+   +T+A+PIGNGRLGAMV+G    E + LNE+T+W+G   
Sbjct: 14  ASLASASDNTRLWYKTPAQSSAWTNALPIGNGRLGAMVFGIPLQERIALNEETIWSGGQQ 73

Query: 62  DYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLK 118
           D    D+P+ +S+VR L+  G+  +A   A++ + G P     YQ LGD+++ FD +   
Sbjct: 74  DRIGQDSPQTVSEVRDLLAQGRAGDAEKLANMGMMGTPQSCRNYQPLGDMDIFFDGT-TG 132

Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           Y   TY+R LD++TA A V++ V    + RE F S PD V V  +  + SG LSF + + 
Sbjct: 133 YDNATYKRWLDVDTALAGVQFQVNGTLYEREMFVSAPDDVFVHHLKATGSGKLSFQIRV- 191

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
               +     GN     E    G           DP  I F+  L ++ SD  G +  L 
Sbjct: 192 ----HRPDKGGNEAADHEWNANGLAYMTGGAGGIDP--IVFTTALAVQ-SD--GHVKNL- 241

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
              + VE +  A  +  AS+S+            D  +   S +Q  R  +Y +L  RH+
Sbjct: 242 GPFIVVENATEATAIFAASTSY---------RHNDTRAAVESTIQQARQHTYEELRQRHI 292

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFG 357
            DY  L++   + LS           S+    ++P+  R+ + +    DP+L  L + +G
Sbjct: 293 ADYAPLYNASVLDLS----------GSDLKASSLPTDARINATREGASDPALTALSYNYG 342

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLLI+SSR G   +NLQGIWN++ +P W S   VNINL+MNYW +   +LS   EPLFD
Sbjct: 343 RYLLIASSRAGNLPSNLQGIWNKEFAPQWGSKYTVNINLQMNYWPAEVTSLSSLHEPLFD 402

Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
            L  +                     +TD                             EH
Sbjct: 403 LLDLM---------------------RTD-----------------------------EH 412

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAPDGKL 533
           Y YT D+ FL  +   + E  A F LD L    I G   YL TNPS SPE+ ++  D   
Sbjct: 413 YWYTGDKAFLASKLDVVTEAIA-FYLDILQPYSINGTQ-YLVTNPSVSPENSYLDADNNT 470

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAED--GS 589
                + T D+ I+ E+F+  ++A   L     +   + ++  +  +L P + ++   G+
Sbjct: 471 YHFDIAPTCDIEILNELFTNYLNAVATLPNYTVDSTFLTRIRDTQAQLPPYRYSKRYPGT 530

Query: 590 IMEWVQ 595
           + EW+Q
Sbjct: 531 LQEWMQ 536


>gi|418198481|ref|ZP_12834939.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47778]
 gi|353861591|gb|EHE41526.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47778]
          Length = 782

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 175/599 (29%), Positives = 283/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
                Y      F RE F+S PD ++V + +   + +L F + L    D  S      + 
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDGKYEQEK 185

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEEGYTQLKSRHIEDYQALFQRVQ 299

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E N+D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 300 LDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 405

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571


>gi|419504391|ref|ZP_14044059.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47760]
 gi|379605779|gb|EHZ70529.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47760]
          Length = 803

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDILVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|417849512|ref|ZP_12495432.1| hypothetical protein HMPREF9957_1083 [Streptococcus mitis SK1080]
 gi|339456106|gb|EGP68701.1| hypothetical protein HMPREF9957_1083 [Streptococcus mitis SK1080]
          Length = 803

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 181/626 (28%), Positives = 291/626 (46%), Gaps = 87/626 (13%)

Query: 10  TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN 65
           T P   T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY  
Sbjct: 14  TKPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQG 70

Query: 66  ---PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLK 118
               D    L+++R  ++   Y  A   + +    P       Y   GDI +EF +    
Sbjct: 71  GNLQDQYGFLAEIRQALEKRDYNTAKELAEQHLVGPQTSQYGTYLSFGDIFIEFSNQGKT 130

Query: 119 YAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
            ++ T Y+R+L+++ A A   Y     +F RE F+S PD ++V +       +L F + L
Sbjct: 131 LSQVTDYQRQLNISKALATTSYVYKGTKFEREAFASFPDDLLVQRFIKEGLETLDFTIEL 190

Query: 178 --------DSLLDNHSYVNGNNQ-------IIMEGRCPGKRIPPKANANDDPKGIQFSAI 222
                   D   +   Y     Q       I+M+GR            ND    +QF++ 
Sbjct: 191 SLTRDLASDGKYEQEKYDYKECQLNITASHILMKGRV---------KDND----LQFASY 237

Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL 282
           L  +     G I    DK +++ G+ +A L L A + F     +    K D   + +  +
Sbjct: 238 LTWQTD---GDIRVWSDK-IQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLV 293

Query: 283 QSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ 342
            + +   Y+ L +RH++DYQ LF  V + L               ++D   + + +K+++
Sbjct: 294 DTAKEKGYAQLKSRHIEDYQALFQSVQLDLG-------------SDVDASTTDDLLKNYK 340

Query: 343 TDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
             E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+NINL+MNY
Sbjct: 341 PQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNY 400

Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAK 450
           W +   NL E   P+ +++  L + G + A   Y          +GW++H +     W  
Sbjct: 401 WPAYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFGWTA 459

Query: 451 SSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH 510
              D     W   P   AW+   ++E Y++  D+D+L ++ YP+L     F   +L +  
Sbjct: 460 PGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQ 516

Query: 511 DGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALV 569
                 ++PS SPEH           +S  +T D ++I ++F   I AA+ L  +ED L 
Sbjct: 517 QAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELSLDEDLLT 567

Query: 570 EKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           E V +    L P +I + G I EW +
Sbjct: 568 E-VKEKFDLLNPLQITQSGRIREWYE 592


>gi|419443014|ref|ZP_13983041.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA13224]
 gi|379551714|gb|EHZ16808.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA13224]
          Length = 612

 Score =  236 bits (602), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 178/610 (29%), Positives = 289/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATA-ASVKLFGHPADVYQL---LGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   A   L G     Y      GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  + +   G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETN---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWSAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|421236745|ref|ZP_15693342.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2071004]
 gi|395601508|gb|EJG61655.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2071004]
          Length = 803

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 177/614 (28%), Positives = 288/614 (46%), Gaps = 65/614 (10%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P+  T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
              D    L+++R  ++   Y  A   + +    P       Y   GDI +EF       
Sbjct: 72  NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131

Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           ++ T Y+R+L+++ A     Y      F RE F+S PD ++V + +   + +L F + L 
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191

Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
              D  S      +      C        I  K    D+   ++F++ L  +     G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
               D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L 
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
           +RH++DYQ LF RV + L             E N+D   + + +K+++  E  +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352

Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E  
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412

Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
            P+ +++  L + G + A V Y          +GW++H +     W     D     W  
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
            P   AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528

Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
           PEH           +S  +T D ++I ++F   I  A+ L  +ED L E   KS   L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQVAQELGLDEDLLTEVKEKS-DLLNP 578

Query: 582 TKIAEDGSIMEWVQ 595
            +I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592


>gi|358383778|gb|EHK21440.1| glycoside hydrolase family 95 protein [Trichoderma virens Gv29-8]
          Length = 794

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 173/596 (29%), Positives = 274/596 (45%), Gaps = 71/596 (11%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
           T  +PIGN RLGA ++GG  +E + +NEDT+W G   D    +   AL  VR ++ +   
Sbjct: 39  TGVLPIGNSRLGAAIFGG-GNEVVTINEDTIWDGPLQDRIPANGLAALPKVRQMLMANNL 97

Query: 85  AEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
            +A    +     PA      +   G++ L F           Y R LD     + V Y+
Sbjct: 98  TDAGNLVLSQM-TPASCCERQFSYFGNLNLNFGHGS---GISNYIRSLDTRQGNSSVSYT 153

Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---LDSLLDNHSYVNGN-NQIIME 196
              V +TRE+ +SNPD VI  + + S++G+LS + +   ++++L N +  +G  N + ++
Sbjct: 154 FNGVTYTREYVASNPDGVIAARYTASKAGALSVSATFSRINNILSNVASTSGGVNSVTLQ 213

Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
           G   G+   P          I F+   + +      T SA               L +  
Sbjct: 214 GTS-GQSTNP----------ILFTG--KARFVASGATFSA-----------SGGTLTITG 249

Query: 257 SSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           +++ D  F++   + + PT+ +++A     L +  +  +  ++   + D   L  R +I 
Sbjct: 250 ATTID-VFVDVETNYRYPTASALAAEVDNKLNAAVSKGFPAVHNSAIADSSALLGRANIN 308

Query: 312 LSRSPK---DIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRP 367
           L  SP    D+ TD             +RVKS ++   DP L+ L + +GR+LL++SSR 
Sbjct: 309 LGTSPNGLADLSTD-------------QRVKSARSAFNDPQLIVLAWNYGRHLLVASSRD 355

Query: 368 GTQV----ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
            +       NLQG+WN   S  W     +NIN EMN W +   NL E Q PLFD L    
Sbjct: 356 TSAAIDMPPNLQGVWNNATSAPWGGKFTININTEMNLWPAGQTNLIETQLPLFDLLKVAQ 415

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
             G + AQ  Y  +G V HH  D+W   +         +WPMG  WL  H+ E Y +T D
Sbjct: 416 PRGQEMAQKLYGCNGTVFHHNLDVWGDPAPTDNYTSSTMWPMGATWLVQHMMEQYRFTGD 475

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC-----VSY 538
            +FL   AYP L   + FL  +      G   T PS SPE+ ++ P G         +  
Sbjct: 476 LNFLRNTAYPYLLDISKFLQCYTFT-WQGNRVTGPSLSPENTYVVPSGANKAGTQEPMDM 534

Query: 539 SSTMDMAIIREVFSAIISAAEVLE-KNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +  MD  ++R+V ++I+ AA  L   + D+ V+     LP +R  +I   G I+EW
Sbjct: 535 APEMDNQLMRDVMTSILEAAAALGISSSDSNVQAATNFLPLIRTPRIGSYGQILEW 590


>gi|307068282|ref|YP_003877248.1| hypothetical protein SPAP_1662 [Streptococcus pneumoniae AP200]
 gi|306409819|gb|ADM85246.1| hypothetical protein SPAP_1662 [Streptococcus pneumoniae AP200]
          Length = 796

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|148997704|ref|ZP_01825268.1| hypothetical protein CGSSp11BS70_02314 [Streptococcus pneumoniae
           SP11-BS70]
 gi|168491464|ref|ZP_02715607.1| alpha-fucosidase [Streptococcus pneumoniae CDC0288-04]
 gi|168575158|ref|ZP_02721121.1| alpha-fucosidase [Streptococcus pneumoniae MLV-016]
 gi|225861483|ref|YP_002742992.1| alpha-fucosidase [Streptococcus pneumoniae Taiwan19F-14]
 gi|387788703|ref|YP_006253771.1| hypothetical protein MYY_1579 [Streptococcus pneumoniae ST556]
 gi|417313133|ref|ZP_12099845.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA04375]
 gi|418142169|ref|ZP_12778982.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA13455]
 gi|418151161|ref|ZP_12787907.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA14798]
 gi|418164950|ref|ZP_12801618.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA17371]
 gi|418171792|ref|ZP_12808416.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA19451]
 gi|418194221|ref|ZP_12830710.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47439]
 gi|418228162|ref|ZP_12854779.1| fibronectin type III domain protein [Streptococcus pneumoniae
           3063-00]
 gi|419429855|ref|ZP_13970019.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA11856]
 gi|419438693|ref|ZP_13978761.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA13499]
 gi|419449442|ref|ZP_13989438.1| fibronectin type III domain protein [Streptococcus pneumoniae
           4075-00]
 gi|419471542|ref|ZP_14011401.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA07914]
 gi|419502307|ref|ZP_14041991.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47628]
 gi|419506539|ref|ZP_14046200.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA49194]
 gi|419519366|ref|ZP_14058972.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA08825]
 gi|421238983|ref|ZP_15695547.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2071247]
 gi|421245493|ref|ZP_15701991.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2081685]
 gi|421292526|ref|ZP_15743260.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA56348]
 gi|421312462|ref|ZP_15763064.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA58981]
 gi|421314530|ref|ZP_15765117.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA47562]
 gi|147756203|gb|EDK63245.1| hypothetical protein CGSSp11BS70_02314 [Streptococcus pneumoniae
           SP11-BS70]
 gi|183574240|gb|EDT94768.1| alpha-fucosidase [Streptococcus pneumoniae CDC0288-04]
 gi|183578740|gb|EDT99268.1| alpha-fucosidase [Streptococcus pneumoniae MLV-016]
 gi|225727028|gb|ACO22879.1| alpha-fucosidase [Streptococcus pneumoniae Taiwan19F-14]
 gi|327389841|gb|EGE88186.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA04375]
 gi|353806420|gb|EHD86694.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA13455]
 gi|353814371|gb|EHD94597.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA14798]
 gi|353828782|gb|EHE08918.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA17371]
 gi|353835529|gb|EHE15623.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA19451]
 gi|353857799|gb|EHE37761.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47439]
 gi|353880557|gb|EHE60372.1| fibronectin type III domain protein [Streptococcus pneumoniae
           3063-00]
 gi|379138445|gb|AFC95236.1| hypothetical protein MYY_1579 [Streptococcus pneumoniae ST556]
 gi|379537100|gb|EHZ02285.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA13499]
 gi|379546258|gb|EHZ11397.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA07914]
 gi|379550033|gb|EHZ15135.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA11856]
 gi|379600520|gb|EHZ65301.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47628]
 gi|379608453|gb|EHZ73199.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA49194]
 gi|379622060|gb|EHZ86696.1| fibronectin type III domain protein [Streptococcus pneumoniae
           4075-00]
 gi|379641203|gb|EIA05741.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA08825]
 gi|395600626|gb|EJG60781.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2071247]
 gi|395608020|gb|EJG68116.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2081685]
 gi|395891833|gb|EJH02827.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA56348]
 gi|395909316|gb|EJH20192.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA58981]
 gi|395913215|gb|EJH24068.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA47562]
          Length = 803

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|418119103|ref|ZP_12756060.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA18523]
 gi|419453708|ref|ZP_13993678.1| fibronectin type III domain protein [Streptococcus pneumoniae
           EU-NP03]
 gi|353791055|gb|EHD71436.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA18523]
 gi|379625778|gb|EHZ90404.1| fibronectin type III domain protein [Streptococcus pneumoniae
           EU-NP03]
          Length = 782

 Score =  236 bits (601), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 186 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 229

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 452 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 510

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 511 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 561

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 562 QSGRIREWYE 571


>gi|418160358|ref|ZP_12797057.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA17227]
 gi|353822091|gb|EHE02267.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA17227]
          Length = 809

 Score =  236 bits (601), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|419521584|ref|ZP_14061179.1| hypothetical protein SPAR7_1605 [Streptococcus pneumoniae GA05245]
 gi|379538884|gb|EHZ04064.1| hypothetical protein SPAR7_1605 [Streptococcus pneumoniae GA05245]
          Length = 803

 Score =  235 bits (600), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|417687098|ref|ZP_12336372.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA41301]
 gi|332073988|gb|EGI84466.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA41301]
          Length = 782

 Score =  235 bits (600), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 186 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 229

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 452 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 510

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 511 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 561

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 562 QSGRIREWYE 571


>gi|418085647|ref|ZP_12722826.1| hypothetical protein SPAR90_1552 [Streptococcus pneumoniae GA47281]
 gi|418149015|ref|ZP_12785777.1| hypothetical protein SPAR34_1507 [Streptococcus pneumoniae GA13856]
 gi|421207087|ref|ZP_15664139.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2090008]
 gi|353756356|gb|EHD36957.1| hypothetical protein SPAR90_1552 [Streptococcus pneumoniae GA47281]
 gi|353811351|gb|EHD91593.1| hypothetical protein SPAR34_1507 [Streptococcus pneumoniae GA13856]
 gi|395574423|gb|EJG35001.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2090008]
          Length = 778

 Score =  235 bits (600), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|417699038|ref|ZP_12348209.1| hypothetical protein SPAR69_1595 [Streptococcus pneumoniae GA41317]
 gi|332199684|gb|EGJ13759.1| hypothetical protein SPAR69_1595 [Streptococcus pneumoniae GA41317]
          Length = 757

 Score =  235 bits (599), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 186 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 229

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 452 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 510

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 511 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 561

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 562 QSGRIREWYE 571


>gi|418167255|ref|ZP_12803910.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA17971]
 gi|353829247|gb|EHE09381.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA17971]
          Length = 803

 Score =  234 bits (598), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|340520176|gb|EGR50413.1| glycoside hydrolase family 95 [Trichoderma reesei QM6a]
          Length = 794

 Score =  234 bits (598), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 174/594 (29%), Positives = 270/594 (45%), Gaps = 67/594 (11%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
           T  +PIGN RLG  ++GG  +E + +NEDTLW G   +    +   AL  VR ++ +   
Sbjct: 39  TGVLPIGNSRLGGAIFGG-GNEVITINEDTLWDGPLQNRIPANGLAALPKVRQMLLANNL 97

Query: 85  AEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
            +A    +     PA      +   G++ L F           Y R LD     + V Y+
Sbjct: 98  TDAGNLVLSQM-MPAVGGERQFSYFGNLNLNFGHGS---GISNYIRSLDTRQGNSSVSYT 153

Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---LDSLLDNHSYVNGN-NQIIME 196
              V +TRE+ +S P  VI  + + S++G+LS + +   + ++L N +  +G  N + ++
Sbjct: 154 FNGVTYTREYVASAPVGVIAARFTASKAGALSVSATFSRISNILSNVASTSGGVNSVTLQ 213

Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
           G     + P           I F+   + +     G++SA               L +  
Sbjct: 214 GTSGQAQNP-----------ILFTG--KARFVPQGGSVSA-----------SGGTLTITG 249

Query: 257 SSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           +++ D  FI+   + + PT+ +++A     + +  +  +  ++   + D   L  R +I 
Sbjct: 250 ATTID-VFIDVETNYRYPTASALAAEVDNKINTAVSQGFQKVHDDAIADSSALLGRANIN 308

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQ 370
           L  SP  I             P+ +RVKS ++   DP L+ L + +GR+LL++SSR  + 
Sbjct: 309 LGTSPNGIANQ----------PTDQRVKSARSAFNDPQLIVLAWNYGRHLLVASSRDTSA 358

Query: 371 V----ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
                 NLQG+WN   S  W     +NIN EMN W +   NL E Q PLFD L      G
Sbjct: 359 AIDMPPNLQGVWNNATSAPWGGKFTININTEMNLWPAGQTNLIETQLPLFDLLKVAQPRG 418

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
            + AQ  Y  +G V HH  D+W   +        ++WPMG  WL  H+ E Y +T D DF
Sbjct: 419 QEMAQKLYGCNGTVFHHNLDVWGDPAPTDNYPSSSMWPMGATWLVQHMMEQYRFTGDLDF 478

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA- 545
           L   AYP L   + FL  +      G   T PS SPE+ +  P G          MDMA 
Sbjct: 479 LRNTAYPYLLDISKFLQCYTFT-WQGNRVTGPSLSPENTYAVPQGA-NVAGQQEPMDMAP 536

Query: 546 -----IIREVFSAIISAAEVLE-KNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
                ++R+V SAI+ AA  L   + DA V+     LP +R  +I   G I+EW
Sbjct: 537 EMDNQLMRDVMSAIVEAAAALGISSSDANVKAASDFLPLIRTPRIGSYGQILEW 590


>gi|225859410|ref|YP_002740920.1| alpha-fucosidase [Streptococcus pneumoniae 70585]
 gi|225721936|gb|ACO17790.1| alpha-fucosidase [Streptococcus pneumoniae 70585]
          Length = 803

 Score =  234 bits (598), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 178/610 (29%), Positives = 287/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
            A   Y      F RE F+S PD ++V   +     +L F + L     L  N  Y    
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASNGKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQSPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVCFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|149011485|ref|ZP_01832732.1| hypothetical protein CGSSp19BS75_08742 [Streptococcus pneumoniae
           SP19-BS75]
 gi|147764475|gb|EDK71406.1| hypothetical protein CGSSp19BS75_08742 [Streptococcus pneumoniae
           SP19-BS75]
          Length = 803

 Score =  234 bits (598), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|257070006|ref|YP_003156261.1| hypothetical protein Bfae_29100 [Brachybacterium faecium DSM 4810]
 gi|256560824|gb|ACU86671.1| hypothetical protein Bfae_29100 [Brachybacterium faecium DSM 4810]
          Length = 762

 Score =  234 bits (597), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 179/596 (30%), Positives = 267/596 (44%), Gaps = 61/596 (10%)

Query: 9   TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
            T P  +  +GPA+ + +A+P+GNGRLGAM WG        LNE TLW+G PG       
Sbjct: 14  VTPPPALLRHGPAERWLEALPLGNGRLGAMAWGDPGRARFSLNESTLWSGAPGVDLPHRT 73

Query: 69  PK-----ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
           P+     AL   R+L  SG   EA     +L    +  Y  +GD+ +  D        + 
Sbjct: 74  PRAEAAAALERSRALFTSGAVQEAQEEIERLGASWSQAYLPVGDLTVRLDGDAGPEGGDG 133

Query: 124 YRRELDLNTATARVKYSVGNVEFTREH--FSSNPDQVIVTKISGSESGS--LSFNVSLDS 179
            RRELDL     RV  + G      EH  F S  D+V+V  +   E     L  +  L  
Sbjct: 134 -RRELDLQHGEHRVLAADG------EHLSFVSAADEVLVHCLPCPEGARAVLELDSPLVE 186

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
                   +G+  + +  R P          +D P G QF    +I    +  + +A+  
Sbjct: 187 EQREEQPADGDAALTIVLRAP----------SDVPGG-QFRQQEQIAWESEGASRAAVVV 235

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
           +  +  G    V  +V  +++ G    P  +  +   E+ +  ++       +L+ RH D
Sbjct: 236 RTRREAGRLLVVCAIV--TTWQGLGRTPDRAVAEAVQEATAQAETALARGAEELHRRHRD 293

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
             +     V +QL+ S +  +  TC                             F +GRY
Sbjct: 294 RPRPGADAVGLQLTGSEEAELLATC-----------------------------FAYGRY 324

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LL S+SRPG   ANLQG+WN  L   W S   VNINLEMN+W +    + E    L  ++
Sbjct: 325 LLASASRPGLPPANLQGLWNAKLEAPWSSNYTVNINLEMNHWGAAIAQVPEAAGALEQYV 384

Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
             L   G  TA+  Y A GW +HH +D W  +   RG+  WA WPMGG WL   L + + 
Sbjct: 385 EMLREQGRDTARRLYGADGWTVHHNSDPWGYTDPVRGEPSWATWPMGGLWL-EQLLDTFA 443

Query: 480 YTMDRDFLE--KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
                D  E  +  +P L    +F L  L E  DG+L T PSTSPE+ +   DG + C+S
Sbjct: 444 ACSGSDPAEVARDRFPALREAVAFALGLLHESADGHLATFPSTSPENRWRTADGTVVCLS 503

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             + MD  ++RE    ++ AA VL + +D +V++   +L  +   ++  DG I+EW
Sbjct: 504 EGTGMDRWLLRETAQHLVEAAAVLGREDDPVVQQAASALDLVPGPRVGADGRILEW 559


>gi|419495837|ref|ZP_14035554.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47461]
 gi|421302877|ref|ZP_15753541.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA17484]
 gi|379593923|gb|EHZ58734.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47461]
 gi|395901499|gb|EJH12435.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA17484]
          Length = 803

 Score =  234 bits (597), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 178/610 (29%), Positives = 287/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
            A   Y      F RE F+S PD ++V   +     +L F + L     L  N  Y    
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASNGKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|358399331|gb|EHK48674.1| glycoside hydrolase family 95 protein [Trichoderma atroviride IMI
           206040]
          Length = 797

 Score =  234 bits (597), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 168/588 (28%), Positives = 266/588 (45%), Gaps = 52/588 (8%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
           T  +PIGN RLGA ++GG  +E + +NEDTLW G   +    +   AL  VR ++++   
Sbjct: 39  TGVLPIGNSRLGAAIFGGA-NEVVTINEDTLWDGPLQNRIPANGLAALPKVRQMLEANSL 97

Query: 85  AEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV 141
             A    +     P      +   G++ L F   H       Y R LD     + V Y+ 
Sbjct: 98  TAAGNLVLSQMTPPISGERQFSYFGNLNLNF--GHSSGGISNYIRSLDTRQGNSSVSYTY 155

Query: 142 GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---LDSLLDN-HSYVNGNNQIIMEG 197
             V +TRE+ +S P  VI  + + S++G+LS + +   + ++L N  S   G N + ++G
Sbjct: 156 NGVTYTREYVASTPAGVIAARFTASKAGALSVSATFSRISNILSNVASTSGGANTLTLQG 215

Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
                       A+D+P  I F+   +   S   G   +     L + G+    + +   
Sbjct: 216 SS-------GQAASDNP--ILFTGTAQFVAS---GATFSTSGGTLTISGATTIDVFIDVE 263

Query: 258 SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
           +S+  P      S  D  ++  S L +  +  +  ++   + D   L  R +I L  SP 
Sbjct: 264 TSYRYP------SASDLAAQVNSKLSAAVSQGFQKIHDGAIADASALLGRANINLGTSPN 317

Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVA---- 372
            + +          + + +RVK+ ++   DP L  L + +GR+LL++SSR  T  A    
Sbjct: 318 GLAS----------LSTDQRVKNARSSFNDPQLAVLAWNYGRHLLVASSR-NTSAAIDMP 366

Query: 373 -NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
            NLQG+WN   S  W     +NIN EMN W +   NL E Q PLFD +      G + AQ
Sbjct: 367 PNLQGVWNNQTSAPWGGKFTININTEMNLWPAGQTNLIETQLPLFDLMKVAQPRGQQMAQ 426

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
             Y  +G V HH  D+W   +         +WPMG  WL  H+ E Y +  D + L    
Sbjct: 427 DLYGCNGTVFHHNLDVWGDPAPTDNYTSSTMWPMGATWLVQHMIEQYRFGGDLNLLRSAT 486

Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP-----DGKLACVSYSSTMDMAI 546
           YP L   + FL  +      G L T PS SPE+ ++ P      G+   +  +  MD  +
Sbjct: 487 YPYLLDISKFLQCYTFS-WQGNLVTGPSLSPENTYVVPSNATVSGQQEPMDLAPEMDNQL 545

Query: 547 IREVFSAIISAAEVLE-KNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +R+V   II AA  L   + D+ V+     +P++R  +I   G I+EW
Sbjct: 546 MRDVMKGIIEAAAALGISSSDSNVQAATNFIPQIRTPRIGSYGQILEW 593


>gi|429764051|ref|ZP_19296381.1| f5/8 type C domain protein [Clostridium celatum DSM 1785]
 gi|429188824|gb|EKY29689.1| f5/8 type C domain protein [Clostridium celatum DSM 1785]
          Length = 1566

 Score =  234 bits (597), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 174/618 (28%), Positives = 294/618 (47%), Gaps = 85/618 (13%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA------------------P 69
           +P+GNG LG+ V+GGV  E +  N+ TLWTG P    NPD                    
Sbjct: 49  LPLGNGNLGSSVFGGVEKERIHFNDKTLWTGGP---DNPDGTMNDGTQYQGGNRLFEFNE 105

Query: 70  KALSDVRSLVDSGQY---AEATAASVKLFGHPADV--YQLLGDIELEFDD--SHLKYAEE 122
           +  +++ S  DS         T  S  LF +  ++  +Q  GDI L+F +  S+ K  + 
Sbjct: 106 EGYNNLISKFDSNDPLVPTGNTGVSSTLFSNRPNLGSWQDFGDIYLDFSEMGSNSKNVD- 164

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---- 178
            Y R LD+  A + V Y      + REHF S PD V+VT++S    G L F+V L     
Sbjct: 165 NYERSLDIKNAISEVIYDYNETTYLREHFVSYPDNVLVTRLSKDGDGKLDFDVELKKSSA 224

Query: 179 -SLLDNHSYVNGNNQII-MEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
            S  D  + ++ NN  I + G   G ++             ++SA L++ +     T+  
Sbjct: 225 LSSNDATTSIDDNNTTIKLIGTLNGNKM-------------KYSASLKVIVDGKESTVEP 271

Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
             +  +KV  +D  VL+    + +    P     ++ ++ T+     +       Y+ L 
Sbjct: 272 NGNSTIKVRNADEVVLIFSTGTDYKNIYPGYRTGETSEEVTNRVNKVINDAAKKGYNTLL 331

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DY++LF RVS+ L+    ++ TD   E   + + S             +L  L+F
Sbjct: 332 ENHVSDYKELFDRVSLDLNEIAPNVPTDELIENYRNGIYS------------KALEALVF 379

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           Q+GRYL I+SSR G+  +NL G+W+   SP W    H N+N++MNYW +   NL+EC + 
Sbjct: 380 QYGRYLTIASSREGSLPSNLAGLWSIG-SPLWSGDYHFNVNVQMNYWPAFSTNLAECGKV 438

Query: 415 LFDFLTYLSINGSKTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWA 461
             D+++ L I G K+A+++  A             +G++IH   + + K+  + G+  + 
Sbjct: 439 FADYMSSLVIPGRKSAEMSIGAKTDDFETTPIGEGNGFMIHTANNPFGKTCPN-GEEYYG 497

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTS 521
             P G  W   + +++Y +T D+++LE   YP+++  A+   + LIE     ++   ST 
Sbjct: 498 WNPNGATWALQNAFDYYEFTKDKEYLESTIYPMVKEVANMWTNSLIESK---VQKIGSTE 554

Query: 522 PEHEFIAP--DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PR 578
            +   +AP    +   ++  +T D +++ E+F   I AA +LEK+ D +  K+   +  +
Sbjct: 555 EQRLVVAPSTSAEQGPMTVGTTYDQSLVWEIFEKAIKAANILEKDSDEI--KIWTEMQSK 612

Query: 579 LRPTKIAEDGSIMEWVQR 596
           L P  I E G I EW Q 
Sbjct: 613 LDPVIIGEGGQIKEWYQE 630


>gi|148993776|ref|ZP_01823203.1| hypothetical protein CGSSp9BS68_02603 [Streptococcus pneumoniae
           SP9-BS68]
 gi|168488632|ref|ZP_02712831.1| alpha-fucosidase [Streptococcus pneumoniae SP195]
 gi|418234852|ref|ZP_12861428.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA08780]
 gi|421220741|ref|ZP_15677580.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2070425]
 gi|421222994|ref|ZP_15679776.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2070531]
 gi|421279430|ref|ZP_15730236.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA17301]
 gi|421294642|ref|ZP_15745363.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA56113]
 gi|147927732|gb|EDK78756.1| hypothetical protein CGSSp9BS68_02603 [Streptococcus pneumoniae
           SP9-BS68]
 gi|183572723|gb|EDT93251.1| alpha-fucosidase [Streptococcus pneumoniae SP195]
 gi|353886474|gb|EHE66256.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA08780]
 gi|395586651|gb|EJG47018.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2070425]
 gi|395586974|gb|EJG47336.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2070531]
 gi|395878923|gb|EJG89985.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA17301]
 gi|395893211|gb|EJH04198.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA56113]
          Length = 803

 Score =  234 bits (597), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 178/610 (29%), Positives = 287/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
            A   Y      F RE F+S PD ++V   +     +L F + L     L  N  Y    
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASNGKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRYCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|336430063|ref|ZP_08610019.1| hypothetical protein HMPREF0994_06025 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336001234|gb|EGN31379.1| hypothetical protein HMPREF0994_06025 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 782

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 168/601 (27%), Positives = 277/601 (46%), Gaps = 54/601 (8%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKALS 73
           + +  PA ++ +A+P+GNGRLGAM +GG   ETL+L+E T W+G   +  N  D+ + L+
Sbjct: 5   LMYKQPAGNWKEALPLGNGRLGAMDFGGAWRETLQLDESTYWSGEASEENNRADSRELLA 64

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADVYQLL------------GDIELEFDDSHLKYAE 121
            +R  +    Y  A        G+  +    L            G  E E++++      
Sbjct: 65  QIREALLEEDYERADELGHGFVGNKNNYGTNLPVGNFYIDCFPEGRPEKEWEEAAGADTV 124

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
             + R L L  A + V +  G   + RE F SNP Q  V  +        +  +  + + 
Sbjct: 125 TDFVRRLYLEEARSEVSFKAGGSTYRREVFLSNPAQTAVIHMDTRPGKPFALRIRFEGIA 184

Query: 182 DNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
                     Q  ++ G+        +   +D   G+  +    I++  D      L++ 
Sbjct: 185 SRVGITEERQQDYLIRGQA------RETLHSDGFTGVNLAG--RIRVVTD--GYHHLKES 234

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            + VE +  A LL+   +    P         DP   +   L+      Y  L   H+ D
Sbjct: 235 GIWVENATRATLLVDLETDMFQP---------DPEETAGRRLEEAWQKGYEQLRQEHIQD 285

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRY 359
              L++R+ I L              E++  +P+ ER+ K  +  EDP L  LLFQ+GRY
Sbjct: 286 VSALYNRMDISL------------GAEDMRELPTDERLRKQTEGKEDPGLAALLFQYGRY 333

Query: 360 LLISSSRPGTQV-ANLQGIWNEDLSPTWDSAP--HVNINLEMNYWQSLPCNLSECQEPLF 416
           LLISSSR  + +  ++ GIWN+++    D     HV++NL+M YW +  C L EC +P F
Sbjct: 334 LLISSSREDSPLPTHMGGIWNDNIYNNIDCTQDMHVDMNLQMQYWLAALCALPECYQPAF 393

Query: 417 DFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
            ++  + + +G KTA   Y A GW  H  T+ W  +S       W +W +GG W    +W
Sbjct: 394 AYMRDILVPSGEKTAAGVYGARGWTAHVVTNPWGFTSLG-WSYNWGVWSLGGVWCAALIW 452

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLA 534
           ++Y +T D+DFL +  +P+L+G A F  D++  +   G+  T PS SPE+ F + +GK  
Sbjct: 453 DYYEFTGDKDFL-REWWPVLKGAAEFAADYVFPDEKSGFYMTGPSYSPENMF-SVEGKEY 510

Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
            +S S+  D  ++RE+   I    + L    D+ +EK ++    L P +I   G + EW 
Sbjct: 511 FLSLSTACDCILVREILDIIAKGYQELSLERDSFLEKCVEIRENLPPYRIGSRGQLQEWF 570

Query: 595 Q 595
            
Sbjct: 571 H 571


>gi|417679619|ref|ZP_12329015.1| hypothetical protein SPAR50_1655 [Streptococcus pneumoniae GA17570]
 gi|332072484|gb|EGI82967.1| hypothetical protein SPAR50_1655 [Streptococcus pneumoniae GA17570]
          Length = 778

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 178/610 (29%), Positives = 287/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
            A   Y      F RE F+S PD ++V   +     +L F + L     L  N  Y    
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASNGKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRYCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|334337751|ref|YP_004542903.1| alpha-L-fucosidase [Isoptericola variabilis 225]
 gi|334108119|gb|AEG45009.1| Alpha-L-fucosidase [Isoptericola variabilis 225]
          Length = 879

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 190/642 (29%), Positives = 278/642 (43%), Gaps = 90/642 (14%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD----YTNPDAPK 70
           + ++ PA  + +A+P+GNG   AM  G    E L LN+ T W+G P D     T    P+
Sbjct: 49  LRYDRPASKWIEALPVGNGHRAAMCAGRPARERLWLNDVTAWSGPPPDDPLAGTRARGPE 108

Query: 71  ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEF--DDSHLKYAEETYR-RE 127
            L  VR  VD G    A      L       Y  L ++E+     + +    + T+  R 
Sbjct: 109 HLDRVRRAVDEGDVRTAERLLQDLQTPWVQAYLPLAELEVSVVPGEGNGPTDDVTFAGRH 168

Query: 128 LDLNTATARVKY-SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
           LDL TA A   + S G     +E ++     V+V  +       +   V + SLL     
Sbjct: 169 LDLRTAVATHAWTSPGTGRVVQETWADARGGVLVHVVRAERP--VRAEVRVSSLLRRADE 226

Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPK--GIQFSAILEIKISDDRG------------ 232
           V                  P A+    P   G +  A+L++ +    G            
Sbjct: 227 VR-----------------PDADRGAGPADGGARLHAVLDLPVDVAPGHEPVDDPVRYAP 269

Query: 233 -------TISALEDKKLKVE------GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM 279
                   ++AL D +  VE       +    +L VA+++ D P   P+D        +M
Sbjct: 270 DGRQGVVAVAALGDPEAVVEQDVLRTATARCHVLAVATATTDPPGDVPADRSAASRVAAM 329

Query: 280 -----------SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEEN 328
                       A    R     +L   H+  +++L+ R  + L   P+ +         
Sbjct: 330 LREAGSVAVPGPAGDGARTALARELRAAHVAAHRRLYDRCRLVLPTPPEAL--------- 380

Query: 329 IDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
              +P+  RV + Q   DP L  L F  GRYLL +SSR G   A LQGIWN +L   W S
Sbjct: 381 --GLPTDVRVAAAQHRPDPGLAALAFHHGRYLLAASSRDGGLPATLQGIWNAELPGPWSS 438

Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN-GSKTAQVNYLASGWVIHHKTDI 447
           A  +NIN +M YW +    L+EC EPL   +  ++   G   A+  Y   GW  HH +D 
Sbjct: 439 AYTLNINTQMAYWPAEVTGLAECHEPLLRLVARIAAGPGGVVARELYGTDGWTAHHNSDA 498

Query: 448 WAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD---FLEKRAYPLLEGCASF 501
           WA ++   A  G   WA W MGG WL  HL EH+ +  D D   FL   A+P+LEG A F
Sbjct: 499 WAHAAPVGAGHGDASWAAWAMGGLWLAQHLVEHHRFAADTDGDAFLRDVAWPVLEGAARF 558

Query: 502 LLDWLIEGHDG------YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAII 555
            L W+    D          T+PSTSPE+ F A DG  A V+ S TMD+A++R +  A  
Sbjct: 559 ALGWVRTETDADSGRVVRAWTSPSTSPENRFTADDGAPAAVTTSVTMDVALVRWLAEACR 618

Query: 556 SAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRR 597
            AAEVL +  DA V+++++    L   +    G ++EW + R
Sbjct: 619 EAAEVLGRR-DAWVDRLVEVAAALPHPRAGARGELLEWDRER 659


>gi|418076872|ref|ZP_12714105.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47502]
 gi|353747012|gb|EHD27670.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47502]
          Length = 803

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 175/599 (29%), Positives = 283/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y     +F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTQFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|260588898|ref|ZP_05854811.1| putative fibronectin type III domain protein [Blautia hansenii DSM
           20583]
 gi|260540677|gb|EEX21246.1| putative fibronectin type III domain protein [Blautia hansenii DSM
           20583]
          Length = 744

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 163/585 (27%), Positives = 270/585 (46%), Gaps = 62/585 (10%)

Query: 21  AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVD 80
           AK +   +P+GNG+ GA++ GGV  E + LNE++LW G   +       + L  VR L++
Sbjct: 11  AKSWEQGLPVGNGQQGAVLLGGVQQERIVLNEESLWYGGKRERAVEAGKEKLEKVRELLE 70

Query: 81  SGQYAEATAASVKLF-GHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
            G+ ++A     + F G+P   + Y    +  L F+    K  E  Y R +DL    A V
Sbjct: 71  KGEASKAQTLCSRWFVGNPRYTNPYHPAAEAVLNFEPFG-KVKE--YFRGIDLEKGEAGV 127

Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
           K    N +  RE FSS   QV   ++   +   +SF++ L+                   
Sbjct: 128 KICFDNCKTVREIFSSVKYQVTALRMETDKEQGMSFSLGLN------------------- 168

Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
                R P + NA  + + I  +      +  D        D ++ VEG      LLV  
Sbjct: 169 -----RRPFEENAEVEDREISLNGHSGDGVCYDVRCRVGKTDGRVCVEGG----YLLVER 219

Query: 258 SSFDGPF--INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
           +S+   F  +      K+   +    L++   + + ++   H+++Y +L++ + +++  +
Sbjct: 220 ASYVEIFFCVRTDYESKECLDKCSRLLKAAAKVGFEEIKKAHIEEYGRLYNNMRLEIEGA 279

Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS----LVELLFQFGRYLLISSSRPGTQV 371
                      E +  +P+ E +K     E+P     L+ L+F + RYLLISSS      
Sbjct: 280 -----------EELAQIPADELLKRC---EEPKVQGYLIWLMFSYARYLLISSSYGCALP 325

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
           ANLQGIWN   +P W+S   +NINL+MNYW +    L  C E  F+ +  +  NG KTA+
Sbjct: 326 ANLQGIWNGSFTPPWESGYTININLQMNYWMADRAGLGVCYESFFNLIEKMLPNGRKTAK 385

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
             Y   G+V HH T++W  +      +   LWPMGGAW+   L+ H  +  +   + +R 
Sbjct: 386 KVYACRGFVAHHNTNLWGDTDITGLWLPAFLWPMGGAWMANQLYHHSEFEENPKEIRERV 445

Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
            P+++ C  F  D+L    D    + P+ SPE+ +   DG+ A V+    MD  IIRE+ 
Sbjct: 446 LPVMKECILFFYDYLYRKSDKMWISGPTVSPENTYRLLDGQEASVAMGVAMDHQIIRELA 505

Query: 552 SAIISAAEVL-----EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
              +           E   + + +++L+ LP   PTKI + G I+
Sbjct: 506 ENYLEGCRRYNTGSPEYETEKMAQEILEHLP---PTKIGKSGRIL 547


>gi|418096757|ref|ZP_12733868.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA16531]
 gi|353768478|gb|EHD49002.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA16531]
          Length = 782

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 176/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + SE ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 126 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571


>gi|167749996|ref|ZP_02422123.1| hypothetical protein EUBSIR_00964 [Eubacterium siraeum DSM 15702]
 gi|167657017|gb|EDS01147.1| hypothetical protein EUBSIR_00964 [Eubacterium siraeum DSM 15702]
          Length = 796

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 176/612 (28%), Positives = 290/612 (47%), Gaps = 76/612 (12%)

Query: 14  KITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD--- 67
           KI F  P    K      PIGNG +GA  +GG+  E + LNE TLW G P + + PD   
Sbjct: 24  KILFTAPGEFDKWEQQCQPIGNGYMGASFFGGISKEKIVLNEKTLWAGGPSE-SRPDYNS 82

Query: 68  -----APKALSDVRSLVDSGQYAEATAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYA 120
                + + +  V+ L+  G+Y EA A    L G       YQLL D+ L F +     A
Sbjct: 83  GIIEGSYEYVKQVQQLLYDGKYDEAVALLPHLTGATDGFGAYQLLCDMMLTFSNIDETQA 142

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
            + Y R LDL+ +    +++       RE F++ P  VI  K+S  +   +   +SLD+L
Sbjct: 143 TD-YTRTLDLDNSIFTTQFTYQGAVHKREAFANYPSNVICLKLSSDKPRRICVKLSLDNL 201

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
                  NG+  +  EG                  G+++  I   K+ +  G +   +D 
Sbjct: 202 QCGSVTANGDT-LTYEGALW-------------DNGLRYCTIF--KVVNKGGELIDAKDS 245

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPS-DSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
            + VE +D   + L AS+ +   +  P+  +  +P++     +++  +  +  LY  HL 
Sbjct: 246 -IMVEHADEVYIYLTASTDYSNKY--PTFRTGVNPSAAVNQRIENAVSKGFDALYEEHLA 302

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE----LLFQ 355
           DY+ LF RV+++++    DI+            P  + +  ++ +   S+      L FQ
Sbjct: 303 DYKALFDRVTLKINEDTDDII------------PCDKLISEYKENGSRSIANRLETLYFQ 350

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRY+LISSSR G+  ANLQG+WNE   P W    H+N+NL+MNYW +   NLSE   PL
Sbjct: 351 FGRYMLISSSRAGSLPANLQGVWNESNCPPWCCDYHINVNLQMNYWGAYNTNLSETVPPL 410

Query: 416 FDFLTYLSINGSKTAQVNYL--------ASGWVIHHKTDI--WAKSSADRGKVVWALWPM 465
            DFL  +  +G K+A+  Y          +GW  H ++    W     D     W     
Sbjct: 411 VDFLDSMRPSGRKSAEAYYGIKSDEEHPENGWCAHTQSTPFGWTAPGWD---FYWGWSTA 467

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH 524
             AWL  +++EH+ +T D+++  +  YP++     F   WLI +     L ++P+ SPEH
Sbjct: 468 AVAWLMQNIYEHFEFTGDKEYFAEHIYPIMRESVRFYTQWLIYDDKQKRLVSSPTYSPEH 527

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
                      V+  +T + ++I ++++  I+A+E L  +E+ L   V   + +L+P  I
Sbjct: 528 ---------GPVTIGNTYEQSLIEQLYNDFITASEALGTDEE-LRNIVKNQVVQLKPFSI 577

Query: 585 AED-GSIMEWVQ 595
           ++  G + EW +
Sbjct: 578 SKKTGLLKEWFE 589


>gi|418976823|ref|ZP_13524668.1| hypothetical protein HMPREF1048_1234 [Streptococcus mitis SK575]
 gi|383350822|gb|EID28673.1| hypothetical protein HMPREF1048_1234 [Streptococcus mitis SK575]
          Length = 803

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 176/613 (28%), Positives = 286/613 (46%), Gaps = 90/613 (14%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LG  ++G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGVKIFGLIGAERIQFNEKSLWSGGPQPDSSDYQGGNLQDQYSFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF +     ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNTAKELAEQHLVGPKTSQYGTYLSFGDIHIEFSNQGKTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y     +F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVTTSYVYKGTKFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLSRDLSSDGKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    +QF++ L  +     G I    
Sbjct: 207 SDYKECQLDISDSYILMKGRV---------KDND----LQFASCLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK---DPTSESMSALQSIRNLSYSDLYT 295
           DK +++ G+ +A L L A + F     NP+ + +   D   +    +++ +   Y  L +
Sbjct: 251 DK-VQISGASYANLFLAAKTDFAQ---NPASNYRKELDLERQVKDLVETAKEKGYDQLKS 306

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
           RH+ DYQ LF RV + L                +D   + + +K+++  E  +L EL FQ
Sbjct: 307 RHIQDYQALFQRVQLDLG-------------AEVDASNTDDLLKNYKPQEGQALEELFFQ 353

Query: 356 FGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
           +GRYLLISSSR  P    ANLQG+WN   +P W+S  H+NINL+MNYW +   NL E   
Sbjct: 354 YGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYWPAYVTNLLETAF 413

Query: 414 PLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALW 463
           P+ +++  L + G + A   Y          +GW++H +     W     D     W   
Sbjct: 414 PVINYIDDLRVYG-RLAAARYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWS 469

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSP 522
           P   AW+   ++E Y +  D+D+L ++ YP+L     F  D+L E        ++PS SP
Sbjct: 470 PAANAWMMQTVYEGYTFYRDKDYLREKIYPMLRETVRFWNDFLHEDRQAQRWVSSPSYSP 529

Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
           EH           +S  +T D ++I ++F   I AA+ L  +E  L E V +    L P 
Sbjct: 530 EH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDESLLTE-VKEKFDLLNPL 579

Query: 583 KIAEDGSIMEWVQ 595
           +I + G I EW +
Sbjct: 580 QITQSGRIREWYE 592


>gi|149006721|ref|ZP_01830407.1| hypothetical protein CGSSp18BS74_05667 [Streptococcus pneumoniae
           SP18-BS74]
 gi|147761636|gb|EDK68600.1| hypothetical protein CGSSp18BS74_05667 [Streptococcus pneumoniae
           SP18-BS74]
          Length = 803

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 176/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + SE ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|418162701|ref|ZP_12799382.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA17328]
 gi|353826763|gb|EHE06920.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA17328]
          Length = 782

 Score =  232 bits (592), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 175/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 126 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571


>gi|168483476|ref|ZP_02708428.1| alpha-fucosidase [Streptococcus pneumoniae CDC1873-00]
 gi|418169754|ref|ZP_12806395.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA19077]
 gi|418219383|ref|ZP_12846048.1| fibronectin type III domain protein [Streptococcus pneumoniae
           NP127]
 gi|418221685|ref|ZP_12848338.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47751]
 gi|418239181|ref|ZP_12865732.1| fibronectin type III domain protein [Streptococcus pneumoniae
           NorthCarolina6A-23]
 gi|419460465|ref|ZP_14000393.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA02270]
 gi|419462818|ref|ZP_14002721.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA02714]
 gi|419489320|ref|ZP_14029069.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44386]
 gi|419526372|ref|ZP_14065930.1| hypothetical protein SPAR35_1634 [Streptococcus pneumoniae GA14373]
 gi|421273311|ref|ZP_15724151.1| fibronectin type III domain protein [Streptococcus pneumoniae
           SPAR55]
 gi|172043064|gb|EDT51110.1| alpha-fucosidase [Streptococcus pneumoniae CDC1873-00]
 gi|353833733|gb|EHE13841.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA19077]
 gi|353873743|gb|EHE53602.1| fibronectin type III domain protein [Streptococcus pneumoniae
           NP127]
 gi|353874995|gb|EHE54849.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47751]
 gi|353892172|gb|EHE71921.1| fibronectin type III domain protein [Streptococcus pneumoniae
           NorthCarolina6A-23]
 gi|379530250|gb|EHY95490.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA02714]
 gi|379530601|gb|EHY95840.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA02270]
 gi|379557012|gb|EHZ22059.1| hypothetical protein SPAR35_1634 [Streptococcus pneumoniae GA14373]
 gi|379586862|gb|EHZ51712.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44386]
 gi|395873742|gb|EJG84832.1| fibronectin type III domain protein [Streptococcus pneumoniae
           SPAR55]
          Length = 803

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 175/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|418230426|ref|ZP_12857025.1| fibronectin type III domain protein [Streptococcus pneumoniae
           EU-NP01]
 gi|419478291|ref|ZP_14018115.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA18068]
 gi|421271063|ref|ZP_15721917.1| fibronectin type III domain protein [Streptococcus pneumoniae
           SPAR48]
 gi|353885307|gb|EHE65096.1| fibronectin type III domain protein [Streptococcus pneumoniae
           EU-NP01]
 gi|379565727|gb|EHZ30719.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA18068]
 gi|395867277|gb|EJG78401.1| fibronectin type III domain protein [Streptococcus pneumoniae
           SPAR48]
          Length = 803

 Score =  232 bits (591), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 174/599 (29%), Positives = 283/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF+      ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFNQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDYP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL++NYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQLNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|417696816|ref|ZP_12345994.1| hypothetical protein SPAR93_1691 [Streptococcus pneumoniae GA47368]
 gi|418176443|ref|ZP_12813034.1| hypothetical protein SPAR71_1678 [Streptococcus pneumoniae GA41437]
 gi|421232359|ref|ZP_15689000.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2080076]
 gi|332200214|gb|EGJ14287.1| hypothetical protein SPAR93_1691 [Streptococcus pneumoniae GA47368]
 gi|353840514|gb|EHE20578.1| hypothetical protein SPAR71_1678 [Streptococcus pneumoniae GA41437]
 gi|395594862|gb|EJG55097.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2080076]
          Length = 778

 Score =  232 bits (591), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 175/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|421211527|ref|ZP_15668509.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2070035]
 gi|395572635|gb|EJG33230.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2070035]
          Length = 803

 Score =  232 bits (591), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 174/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDAFTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|419480494|ref|ZP_14020298.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA19101]
 gi|419500201|ref|ZP_14039895.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47597]
 gi|379569663|gb|EHZ34630.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA19101]
 gi|379599509|gb|EHZ64292.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47597]
 gi|429316503|emb|CCP36209.1| conserved hypothetical protein [Streptococcus pneumoniae SPN034156]
          Length = 803

 Score =  231 bits (590), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 175/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + SE ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +E+ L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDENLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|418146897|ref|ZP_12783675.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA13637]
 gi|353812472|gb|EHD92707.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA13637]
          Length = 782

 Score =  231 bits (590), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 175/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 126 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDYP 346

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFYDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571


>gi|418130799|ref|ZP_12767682.1| hypothetical protein SPAR14_1598 [Streptococcus pneumoniae GA07643]
 gi|418187633|ref|ZP_12824156.1| hypothetical protein SPAR92_1607 [Streptococcus pneumoniae GA47360]
 gi|353802123|gb|EHD82423.1| hypothetical protein SPAR14_1598 [Streptococcus pneumoniae GA07643]
 gi|353849618|gb|EHE29623.1| hypothetical protein SPAR92_1607 [Streptococcus pneumoniae GA47360]
          Length = 778

 Score =  231 bits (590), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF+      ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFNQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
            A   Y      F RE F+S PD ++V   +     +L F + L     L  +  Y    
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL++NYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDYPDALPANLQGVWNAVDNPPWNSDYHLNVNLQLNYWPAYVTNLLETVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|225855085|ref|YP_002736597.1| alpha-fucosidase [Streptococcus pneumoniae JJA]
 gi|225723201|gb|ACO19054.1| alpha-fucosidase [Streptococcus pneumoniae JJA]
          Length = 803

 Score =  231 bits (590), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 179/625 (28%), Positives = 288/625 (46%), Gaps = 87/625 (13%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P   T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
              D    L+++R  ++   Y  A   + +    P       Y   GDI +EF       
Sbjct: 72  NLQDQYGFLAEIRQALEKRDYNTAKELAEEHLVGPKTSQYGTYLSFGDIFIEFSQQGTIL 131

Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL- 177
           ++ T Y+R+L+++ A A   Y+     F RE F+S PD ++V + +   + +L F + L 
Sbjct: 132 SQVTDYQRQLNISKALATTSYAYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIKLF 191

Query: 178 --DSLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
               L  +  Y               ++ I+M GR            ND    ++F+  L
Sbjct: 192 LTRDLASDGKYDQEKSDYKECQLDITDSHILMNGRV---------KDND----LRFAGCL 238

Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
             +     G I    DK +++ G+ +A L L A + F     +    K D   +    ++
Sbjct: 239 AWQTD---GDIRVWSDK-VQISGASYANLFLAAKTDFAQNPDSNYRKKIDLEKQVKDLVE 294

Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
             +   Y+ L +RH+ DYQ LF RV + L             E ++DT  + + +K+++ 
Sbjct: 295 IAKEKGYAQLKSRHIQDYQALFQRVQLDL-------------EADVDTFTTDDLLKNYKP 341

Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
               +L EL FQ+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+NINL+MNYW
Sbjct: 342 QAGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYW 401

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
            +   NL E   P+ +++  L + G + A   Y          +GW++H +     W   
Sbjct: 402 PAYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSREGEENGWLVHTQATPFGWTAP 460

Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
             D     W   P   AW+   ++E Y++  D+D+L ++ YP+L     F   +L E   
Sbjct: 461 GWD---YYWGWSPATNAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWTGFLHEDQQ 517

Query: 512 GY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
                ++PS SPEH           +S  +T D ++I ++F   I A + L  + D L E
Sbjct: 518 AQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFYDFIQATQELGLDGDLLTE 568

Query: 571 KVLKSLPRLRPTKIAEDGSIMEWVQ 595
            V +    L P +I + G I EW +
Sbjct: 569 -VKEKFDLLNPLQITQSGRIREWYE 592


>gi|405760473|ref|YP_006701069.1| hypothetical protein SPNA45_00586 [Streptococcus pneumoniae SPNA45]
 gi|404277362|emb|CCM07874.1| conserved hypothetical protein [Streptococcus pneumoniae SPNA45]
          Length = 803

 Score =  231 bits (589), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 175/599 (29%), Positives = 281/599 (46%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKMSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y     +F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTQFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D    ++F++ L  K     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD--TDLRFASYLAWKTD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKDYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYL--------ASGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAEIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I  A+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQVAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|418202865|ref|ZP_12839294.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA52306]
 gi|419456006|ref|ZP_13995963.1| fibronectin type III domain protein [Streptococcus pneumoniae
           EU-NP04]
 gi|421285997|ref|ZP_15736773.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA60190]
 gi|421307849|ref|ZP_15758491.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA60132]
 gi|353867422|gb|EHE47317.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA52306]
 gi|379627982|gb|EHZ92588.1| fibronectin type III domain protein [Streptococcus pneumoniae
           EU-NP04]
 gi|395885984|gb|EJG97005.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA60190]
 gi|395907234|gb|EJH18128.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA60132]
          Length = 803

 Score =  231 bits (589), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 174/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|419491555|ref|ZP_14031293.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47179]
 gi|379592917|gb|EHZ57732.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47179]
          Length = 803

 Score =  231 bits (588), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 174/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|149021254|ref|ZP_01835500.1| hypothetical protein CGSSp23BS72_00470 [Streptococcus pneumoniae
           SP23-BS72]
 gi|147930355|gb|EDK81339.1| hypothetical protein CGSSp23BS72_00470 [Streptococcus pneumoniae
           SP23-BS72]
          Length = 803

 Score =  231 bits (588), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 283/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD +++   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A ++F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTNFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|194398489|ref|YP_002038269.1| hypothetical protein SPG_1564 [Streptococcus pneumoniae G54]
 gi|418121711|ref|ZP_12758654.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44194]
 gi|419532855|ref|ZP_14072370.1| hypothetical protein SPAR107_1539 [Streptococcus pneumoniae
           GA47794]
 gi|421275369|ref|ZP_15726198.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA52612]
 gi|194358156|gb|ACF56604.1| conserved hypothetical protein [Streptococcus pneumoniae G54]
 gi|353792547|gb|EHD72919.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44194]
 gi|379605375|gb|EHZ70126.1| hypothetical protein SPAR107_1539 [Streptococcus pneumoniae
           GA47794]
 gi|395873333|gb|EJG84425.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA52612]
          Length = 803

 Score =  231 bits (588), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 174/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|421241117|ref|ZP_15697662.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2080913]
 gi|395607495|gb|EJG67592.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2080913]
          Length = 782

 Score =  231 bits (588), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD +++   +     +L F + L    D  S      + 
Sbjct: 126 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571


>gi|116191887|ref|XP_001221756.1| hypothetical protein CHGG_05661 [Chaetomium globosum CBS 148.51]
 gi|88181574|gb|EAQ89042.1| hypothetical protein CHGG_05661 [Chaetomium globosum CBS 148.51]
          Length = 537

 Score =  231 bits (588), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 157/524 (29%), Positives = 255/524 (48%), Gaps = 45/524 (8%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
           F  A+PIGNGRLGA V+G  P+E L LNE+++W+G   D  N  +  A+  +R ++ +G 
Sbjct: 36  FKSALPIGNGRLGAAVFG-TPTEKLVLNENSVWSGGFLDRANSRSKDAVPKIRQMLIAGD 94

Query: 84  YAEATAASV-KLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
              A  +++  +  +P         + +  D  H      +Y R LD    TA V Y  G
Sbjct: 95  ITGAGQSAMDNMAANPTSPRAYNPLVNMGIDLGH-GSGIGSYTRWLDTLEGTAGVNYLQG 153

Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD------SLLDNHSYVNGNNQIIME 196
              ++RE+ +S P  V+  +++ S  G L+  +SL       S         G +++ + 
Sbjct: 154 GTNYSREYVASYPHGVLAIRLTASAPGKLNAKISLSRSKWVTSQTAKTDSGTGGHKVTLS 213

Query: 197 GRCPGKRIPPKANAN-------DDPKGI-QFSAILEIKISDDRGTISALEDKKLKVEGSD 248
           G      +   + A          PKG+  +S + +  I    GT ++ + K + + G++
Sbjct: 214 GNSGSDALAFWSEARVVNSGGVYHPKGLLPYSMVADSHI----GTATSPDGKSISISGAN 269

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
              +   A +S+   + + + ++    +E    L +     Y  + +  ++D+  L  RV
Sbjct: 270 TVDIFFDAETSYR--YADATAAQ----AELKQKLDAATAAGYPAVRSAAIEDFSSLMSRV 323

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR 366
            + L            S  +    P   R+++F+ +   DP L+ L+F FGR+LL SSSR
Sbjct: 324 KLDLG-----------SSGDAGRQPVTTRLQNFKNNPNADPQLMTLMFNFGRHLLASSSR 372

Query: 367 ---PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
              P +  ANLQG+WN+D  P W S   +NINLEMNYW +L  NL+E Q+P+FD L    
Sbjct: 373 DTGPRSLPANLQGLWNQDYDPAWQSKYTININLEMNYWPALVTNLAETQKPVFDLLNEAI 432

Query: 424 INGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
             G   A+  Y  + G+V+HH TD+W  ++       + +WPMG AWL     EHY +T 
Sbjct: 433 PRGKAVAKTMYGCNDGFVLHHNTDLWGDAAPVDKGTPYTIWPMGAAWLSADAMEHYRFTQ 492

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEF 526
           ++ FL   A+P+L   A F    L +  +G+    PS SPEH F
Sbjct: 493 NKTFLSTTAWPILRDAARFFHCHLFQ-WNGHWTAGPSLSPEHAF 535


>gi|419482693|ref|ZP_14022480.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA40563]
 gi|379579285|gb|EHZ44192.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA40563]
          Length = 803

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 174/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|336321550|ref|YP_004601518.1| alpha-L-fucosidase [[Cellvibrio] gilvus ATCC 13127]
 gi|336105131|gb|AEI12950.1| Alpha-L-fucosidase [[Cellvibrio] gilvus ATCC 13127]
          Length = 792

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 173/618 (27%), Positives = 288/618 (46%), Gaps = 73/618 (11%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNPD 67
           + F GPA  + +A P+GNG +GAMV GG     +++N+ T W+G P        +    D
Sbjct: 5   LRFAGPALRWDEAFPLGNGSVGAMVHGGHRRARVQVNDATAWSGHPAGPGLALAELRRRD 64

Query: 68  -APKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFD------DSHLKYA 120
             P+ LS +RS +  G+  EA   + +  G  A  +Q   D+ +         D  +  A
Sbjct: 65  VGPRTLSALRSAIAEGRDDEAARLAQRFQGPYAQAFQPFVDLLVTLSPADPTGDDDVDAA 124

Query: 121 EETYRRELDLNTATAR--VKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
            E   R LDL        V +           F+S PD  +  +    +   + F++ L+
Sbjct: 125 YEG--RSLDLRDGLVHEAVTFESAGCRVMTTWFTSAPDGCLHARWRAPD---VPFSLELE 179

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRI----------------PPKANANDDPKGIQFSAI 222
                 +   G + +++E    G ++                P +         + ++ +
Sbjct: 180 L---RGAQPGGPSALVVEAGVVGAQVRVELPFDVAPGHEPDRPGRIAVGSHASLVGYATV 236

Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF----DGPFINPSDSKKDPTSES 278
           L    +D R T S      ++V G+ W   +L  +++      GP  +P++++      +
Sbjct: 237 L--VSTDGRATASP---GGVRVAGATWVEAVLATATTTRWPEPGPLAHPAEAEHASRERA 291

Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
            +AL      + +    RH++D++ L     ++L   P D++           +P A   
Sbjct: 292 RAALPP-SPAAGAVAQRRHVEDHRALADATRLELG-EPADLL-----------LPDA--- 335

Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
               T   P+     F FGRYLL+++SRPG    NLQG+WN++  P W S   +NINL+M
Sbjct: 336 --LGTAPLPARARAAFAFGRYLLMAASRPGAPPVNLQGVWNDEARPPWSSGYTLNINLQM 393

Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS---SADR 455
            YW + P  L  C EPL D +  L+  G+  A+  Y  +GWV HH +D+W  +       
Sbjct: 394 AYWPAEPTGLGVCVEPLVDQVRVLAREGAAVARDLYGCAGWVAHHNSDVWGWALPVGDGH 453

Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLE 515
           G   WA W MGGAWLC HLW+ Y Y++D D L +  +PLL G A+F++DWL+    G L 
Sbjct: 454 GDPSWASWWMGGAWLCRHLWDRYEYSLDEDVL-RDVWPLLRGAAAFVVDWLVPDGRGGLV 512

Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
            +PS+SPE+      G+   +   ST+D+A+ R++ S  + A ++L  +E  L  + + +
Sbjct: 513 PSPSSSPEN-VRERAGREVALCAGSTVDVALARDLLSHCLEAVDILGLDE-PLAARWVDA 570

Query: 576 LPRLRPTKIAEDGSIMEW 593
           + RL    +  DG + EW
Sbjct: 571 VARLPRPDVDADGLLREW 588


>gi|291557898|emb|CBL35015.1| hypothetical protein ES1_21610 [Eubacterium siraeum V10Sc8a]
          Length = 796

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 174/612 (28%), Positives = 290/612 (47%), Gaps = 76/612 (12%)

Query: 14  KITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD--- 67
           KI F  P    K      PIGNG +GA  +GG+  E + LNE TLW G P + + PD   
Sbjct: 24  KILFTAPGEFDKWEQQCQPIGNGYMGASFFGGISKEKIVLNEKTLWAGGPSE-SRPDYNS 82

Query: 68  -----APKALSDVRSLVDSGQYAEATAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYA 120
                + + +  V+ L+  G+Y EA A    L G       YQLL D+ L F +     A
Sbjct: 83  GIIEGSYEYVKQVQQLLYDGKYDEAVALLPHLTGATDGFGAYQLLCDMMLTFSNIDETQA 142

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
            + Y R LDL+ +    +++       RE F++ P  VI  K+S  +   +   +SLD+L
Sbjct: 143 TD-YTRTLDLDNSIFTTQFTYQGAVHKREAFANYPSNVICLKLSSDKPRRICVKLSLDNL 201

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
                  NG+  +  EG                  G+++  I   K+ +  G +   +D 
Sbjct: 202 QCGSVTANGDT-LTYEGALW-------------DNGLRYCTIF--KVVNKGGELIDAKDS 245

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPS-DSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
            + VE +D   + L AS+ +   +  P+  +  +P++     +++  +  +  LY  HL 
Sbjct: 246 -IMVEHADEVYIYLTASTDYSNKY--PTFRTGVNPSAAVNQRIENAVSKGFDALYEEHLA 302

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE----LLFQ 355
           DY+ LF RV+++++    DI+            P  + +  ++ +   S+      L FQ
Sbjct: 303 DYKALFDRVTLKINEDTDDII------------PCDKLISEYKENGSRSIANRLETLYFQ 350

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRY+LISSSR G+  ANLQG+WNE   P W    H+N+NL+MNYW +   NLSE   PL
Sbjct: 351 FGRYMLISSSRAGSLPANLQGVWNESNCPPWCCDYHINVNLQMNYWGAYNTNLSETVPPL 410

Query: 416 FDFLTYLSINGSKTAQVNYL--------ASGWVIHHKTDI--WAKSSADRGKVVWALWPM 465
            DFL  +  +G K+A+  Y          +GW  H ++    W     D     W     
Sbjct: 411 VDFLDSMRPSGRKSAEAYYGIKSDEEHPENGWCAHTQSTPFGWTAPGWD---FYWGWSTA 467

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH 524
             AWL  +++E++ +T D+++  +  YP++     F   WLI +     L ++P+ SPEH
Sbjct: 468 AVAWLMQNIYEYFEFTGDKEYFAEHIYPIMRESVRFYTQWLIYDDKQKRLVSSPTYSPEH 527

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
                      V+  +T + ++I ++++  I+A+E L  +E+ L   V   + +L+P  +
Sbjct: 528 ---------GPVTIGNTYEQSLIEQLYNDFITASEALGTDEE-LRNIVKNQVVQLKPYSV 577

Query: 585 AED-GSIMEWVQ 595
           ++  G + EW +
Sbjct: 578 SKKTGLLKEWFE 589


>gi|419487130|ref|ZP_14026892.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44128]
 gi|421209422|ref|ZP_15666435.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2070005]
 gi|379585499|gb|EHZ50355.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44128]
 gi|395573518|gb|EJG34108.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2070005]
          Length = 803

 Score =  230 bits (587), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD +++   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|418092259|ref|ZP_12729399.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44452]
 gi|353762959|gb|EHD43516.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44452]
          Length = 803

 Score =  230 bits (587), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD +++   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|419475985|ref|ZP_14015821.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA14688]
 gi|379558767|gb|EHZ23799.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA14688]
          Length = 778

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 175/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86

Query: 79  VDSGQYAEATA-ASVKLFGHPADVYQL---LGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   A   L G     Y      GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD +++   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|387626851|ref|YP_006063027.1| hypothetical protein INV104_14070 [Streptococcus pneumoniae INV104]
 gi|444382288|ref|ZP_21180491.1| hypothetical protein PCS8106_00679 [Streptococcus pneumoniae
           PCS8106]
 gi|444385525|ref|ZP_21183597.1| hypothetical protein PCS8203_01377 [Streptococcus pneumoniae
           PCS8203]
 gi|301794637|emb|CBW37088.1| conserved hypothetical protein [Streptococcus pneumoniae INV104]
 gi|444249595|gb|ELU56083.1| hypothetical protein PCS8203_01377 [Streptococcus pneumoniae
           PCS8203]
 gi|444252562|gb|ELU59024.1| hypothetical protein PCS8106_00679 [Streptococcus pneumoniae
           PCS8106]
          Length = 803

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD +++   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|418103344|ref|ZP_12740416.1| hypothetical protein SPAR143_1623 [Streptococcus pneumoniae NP070]
 gi|421225485|ref|ZP_15682223.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2070768]
 gi|353774645|gb|EHD55132.1| hypothetical protein SPAR143_1623 [Streptococcus pneumoniae NP070]
 gi|395588972|gb|EJG49294.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2070768]
          Length = 757

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD +++   +     +L F + L    D  S      + 
Sbjct: 126 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571


>gi|331091988|ref|ZP_08340820.1| hypothetical protein HMPREF9477_01463 [Lachnospiraceae bacterium
           2_1_46FAA]
 gi|330402887|gb|EGG82454.1| hypothetical protein HMPREF9477_01463 [Lachnospiraceae bacterium
           2_1_46FAA]
          Length = 1785

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 175/614 (28%), Positives = 290/614 (47%), Gaps = 82/614 (13%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD----------APKALSDVR 76
           ++P+GNG LG +++GG+  E +  NE TLWTG P + T PD            K +   R
Sbjct: 71  SLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSE-TRPDYQFGNKKTAYTDKEIEAYR 129

Query: 77  SLVDSGQY----------AEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAE-E 122
            L+D                  +  +K  G        YQ  GDI ++F ++ ++    +
Sbjct: 130 KLLDDKSKNVFNDDTSLGKPGMSGKIKFPGEDNLNKGSYQDFGDIWIDFSETGIRDDNVK 189

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            YRRELDL T  A   +S   V++ REHF S+PDQV+VT++S S+   L  ++ ++    
Sbjct: 190 NYRRELDLQTGVAATTFSHQGVDYKREHFVSSPDQVMVTELSASKEKKLDVSIKMEL--- 246

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
           N+S + G  +   E       I  K   N    G++F   +  KI    G I+A E  +L
Sbjct: 247 NNSGLEGTAKFDAEQNMY--TIFGKVKDN----GLKFRTTM--KIVQSGGDITADEKNQL 298

Query: 243 -KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
            KVE +D  ++++ A + +   +    D+KKD     +  ++     SY +L   H++D+
Sbjct: 299 YKVENADKIMIVMAAETDYKNDYPTYRDTKKDLEKVVVERVKRASEKSYQELKENHIEDH 358

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL-FQFGRYL 360
           Q LF RVS+ L              EN   +P+ E + +++       +E+L FQ+GRYL
Sbjct: 359 QGLFDRVSLDLG-------------ENRSNIPTNELIDAYRKGSYSKYLEVLAFQYGRYL 405

Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
            I+ SR GT  +NL G+W    S  W    H N+N++MNYW     NL+EC   + D++ 
Sbjct: 406 TIAGSR-GTLPSNLVGLWTMGAS-AWTGDYHFNVNVQMNYWPVYVTNLAECGTTMVDYME 463

Query: 421 YLSINGSKTAQ-------VNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCT 472
            L   G  TA+            +G+ +H + + +  ++  +  +  W   P G AW   
Sbjct: 464 NLREPGRLTAERVHGIEDATTKKNGFTVHTENNPFGMTAPTNNQEYGWN--PTGAAWAIQ 521

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-------IEGHDGYLETNPSTSPEHE 525
           +LW HY +T ++D+L+   YP+++  A F  ++L       +   +   +  P       
Sbjct: 522 NLWAHYEFTQNKDYLKNTIYPIMKEAAQFWDNYLWTSDYQKVHDKNSKYDGQPRLVVVPS 581

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS----LPRLRP 581
           F A  G  A     +T D +++ E+++  I A +++   ED   E VLKS    + RL P
Sbjct: 582 FSAEQGPTAV---GTTYDQSLVWELYNECIKAGKIV--GED---ETVLKSWEEKMQRLDP 633

Query: 582 TKIAEDGSIMEWVQ 595
            ++     I EW +
Sbjct: 634 IEMNATNGIKEWYE 647


>gi|15903541|ref|NP_359091.1| hypothetical protein spr1498 [Streptococcus pneumoniae R6]
 gi|116515332|ref|YP_816923.1| hypothetical protein SPD_1467 [Streptococcus pneumoniae D39]
 gi|421266644|ref|ZP_15717524.1| fibronectin type III domain protein [Streptococcus pneumoniae
           SPAR27]
 gi|15459158|gb|AAL00302.1| Conserved hypothetical protein [Streptococcus pneumoniae R6]
 gi|116075908|gb|ABJ53628.1| conserved hypothetical protein [Streptococcus pneumoniae D39]
 gi|395866712|gb|EJG77840.1| fibronectin type III domain protein [Streptococcus pneumoniae
           SPAR27]
          Length = 803

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 174/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF+      ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFNQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNTFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L   +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNSLQITQSGRIREWYE 592


>gi|418192091|ref|ZP_12828593.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA47388]
 gi|353855177|gb|EHE35147.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA47388]
          Length = 778

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 175/610 (28%), Positives = 286/610 (46%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T  +R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDCQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVMTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    + F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LWFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|417923725|ref|ZP_12567182.1| hypothetical protein HMPREF9959_1543 [Streptococcus mitis SK569]
 gi|342836607|gb|EGU70818.1| hypothetical protein HMPREF9959_1543 [Streptococcus mitis SK569]
          Length = 803

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 279/599 (46%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSFDYQGGNLQDQYSFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GD+ +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNTAEELAEQHLVGPKTSQYGTYLSFGDLLIEFSRQGKTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y+     F RE F+S PD ++V + +   + +L F + L    D  S      + 
Sbjct: 147 LATTSYAYKGTMFKRESFASFPDDLLVQRFTKEGAETLDFTIELSLTRDLASDGKYEQKK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F+  L  +     G I    DK +++ G+ +
Sbjct: 207 SDYKECQLEITDSHILMKGRVKDN--NLRFAGCLAWQTD---GDIRVWSDK-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   +    +++ +   Y+ L +RH++D Q LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLVQQVRDLVETAKEKGYAQLKSRHIEDCQTLFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L                +D   + + +K+++  E  SL EL FQ+GRYLLISSSR  +
Sbjct: 321 LDLG-------------AEVDASTTDDLLKNYKPQEGQSLEELFFQYGRYLLISSSRDCS 367

Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+NINL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNGVDNPPWNSDYHLNINLQMNYWPAYVTNLLETAFPVINYIDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A   Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTIYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L  R YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLRDRIYPILRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E V +    L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTE-VKEKFELLNPLQITQSGRIREWYE 592


>gi|168486978|ref|ZP_02711486.1| alpha-fucosidase [Streptococcus pneumoniae CDC1087-00]
 gi|237650661|ref|ZP_04524913.1| alpha-fucosidase [Streptococcus pneumoniae CCRI 1974]
 gi|237822420|ref|ZP_04598265.1| alpha-fucosidase [Streptococcus pneumoniae CCRI 1974M2]
 gi|418126305|ref|ZP_12763211.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44511]
 gi|418214849|ref|ZP_12841583.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA54644]
 gi|419458244|ref|ZP_13998186.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA02254]
 gi|419484883|ref|ZP_14024658.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA43257]
 gi|419510919|ref|ZP_14050560.1| fibronectin type III domain protein [Streptococcus pneumoniae
           NP141]
 gi|419530550|ref|ZP_14070077.1| hypothetical protein SPAR62_1499 [Streptococcus pneumoniae GA40028]
 gi|421213591|ref|ZP_15670545.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2070108]
 gi|421215753|ref|ZP_15672674.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2070109]
 gi|421301508|ref|ZP_15752178.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA19998]
 gi|183570104|gb|EDT90632.1| alpha-fucosidase [Streptococcus pneumoniae CDC1087-00]
 gi|353796245|gb|EHD76590.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44511]
 gi|353869579|gb|EHE49460.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA54644]
 gi|379529908|gb|EHY95149.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA02254]
 gi|379573458|gb|EHZ38413.1| hypothetical protein SPAR62_1499 [Streptococcus pneumoniae GA40028]
 gi|379581636|gb|EHZ46520.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA43257]
 gi|379631522|gb|EHZ96099.1| fibronectin type III domain protein [Streptococcus pneumoniae
           NP141]
 gi|395578822|gb|EJG39332.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2070108]
 gi|395579960|gb|EJG40455.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2070109]
 gi|395899068|gb|EJH10012.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA19998]
          Length = 803

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 175/610 (28%), Positives = 286/610 (46%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T  +R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDCQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVMTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    + F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LWFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|418144603|ref|ZP_12781398.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA13494]
 gi|418185405|ref|ZP_12821946.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47283]
 gi|353807069|gb|EHD87341.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA13494]
 gi|353848689|gb|EHE28701.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47283]
          Length = 782

 Score =  229 bits (585), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 175/610 (28%), Positives = 286/610 (46%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T  +R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDCQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 126 LVMTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    + F++ L  +     G I    
Sbjct: 186 SDYKECKLDITDSHILMKGRV---------KDND----LWFASYLAWETD---GDIRVWS 229

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 452 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 510

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 511 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 561

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 562 QSGRIREWYE 571


>gi|67902324|ref|XP_681418.1| hypothetical protein AN8149.2 [Aspergillus nidulans FGSC A4]
 gi|74593077|sp|Q5AU81.1|AFCA_EMENI RecName: Full=Alpha-fucosidase A; AltName: Full=Alpha-L-fucoside
           fucohydrolase A; Flags: Precursor
 gi|40739981|gb|EAA59171.1| hypothetical protein AN8149.2 [Aspergillus nidulans FGSC A4]
 gi|95025957|gb|ABF50892.1| alpha-fucosidase [Emericella nidulans]
 gi|259480915|tpe|CBF73981.1| TPA: Alpha-fucosidasePutative uncharacterized protein ;
           [Source:UniProtKB/TrEMBL;Acc:Q5AU81] [Aspergillus
           nidulans FGSC A4]
          Length = 809

 Score =  229 bits (585), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 176/594 (29%), Positives = 289/594 (48%), Gaps = 65/594 (10%)

Query: 30  IGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KALSDVRSLVDSG 82
           IGNG+LG + +G   +E L LN D+LW+G P    +YT  NP +P   AL  +R  +   
Sbjct: 46  IGNGKLGVIPFGPPDTEKLNLNVDSLWSGGPFEVENYTGGNPSSPIYDALPGIRERI--- 102

Query: 83  QYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY 139
            +   T    +L G   + Y   ++LG+I +  D      A   Y+R LDL+    R  +
Sbjct: 103 -FENGTGGMEELLG-SGNHYGSSRVLGNITIALDGVE---AYSKYKRTLDLSDGVHRTSF 157

Query: 140 SVGN---VEFTREHFSSNPDQVIVTKISGSESGSL-SFNVSLDSLLDNHSYVNGNNQIIM 195
           ++ N          F S PDQV V  +  +    L    +S+++LL N S        ++
Sbjct: 158 TIANRTTAALKSSIFCSYPDQVCVYHLESASDARLPKVTISIENLLVNQS--------LL 209

Query: 196 EGRCP--GKRIPPKANA---NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
           +  C    KR   + +       P+G++++A+ E+ ++      + L +  L++      
Sbjct: 210 QTSCESEAKRAVLRHSGVTQAGPPEGMKYAAVAEV-VNPRSSVTTCLGEGALQISSRKKQ 268

Query: 251 VLLLV-ASSSFDGPFINPSD-----SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
           + +++ A++++D    N        + KDP S       +     Y  L  RH+ DY+KL
Sbjct: 269 LTIIIGAATNYDQKAGNAKSGWSFKNAKDPASIVDGIASAAGWKGYQRLLDRHVKDYKKL 328

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
               S++L         DT    + DT    E+        +P L  LL  + R+LL+SS
Sbjct: 329 MGDFSLELP--------DTTDSASKDTSELIEKYSYASATGNPYLENLLLDYARHLLVSS 380

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           SRP +  ANLQG W E L+P+W +  H NINL+MNYW +    L E Q  L++++    +
Sbjct: 381 SRPNSLPANLQGRWTESLTPSWSADYHANINLQMNYWLADQTGLGETQHALWNYMADTWV 440

Query: 425 -NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
             G++TA++ Y ASGWV+H++ +I+   +A +    WA +P   AW+  H+W++++YT D
Sbjct: 441 PRGTETARLLYNASGWVVHNEINIFG-FTAMKEDAGWANYPAAAAWMMQHVWDNFDYTHD 499

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
             +L  + Y LL+G ASF L  L E    +DG L  NP  SPE     P     C  Y  
Sbjct: 500 TAWLVSQGYALLKGIASFWLSSLQEDKFFNDGSLVVNPCNSPE---TGPT-TFGCTHYQQ 555

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
                +I +VF  +++A E + +++   V+ V  +L RL     ++  G + EW
Sbjct: 556 -----LIHQVFETVLAAQEYIHESDTKFVDSVASALERLDTGLHLSSWGGLKEW 604


>gi|307709595|ref|ZP_07646048.1| alpha-fucosidase [Streptococcus mitis SK564]
 gi|307619631|gb|EFN98754.1| alpha-fucosidase [Streptococcus mitis SK564]
          Length = 803

 Score =  229 bits (585), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 177/614 (28%), Positives = 282/614 (45%), Gaps = 65/614 (10%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P   T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLSNSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQL---LGDIELEFDDSHLKY 119
              D    ++++R  ++   Y  A   A   L G     Y      GDI +EF       
Sbjct: 72  NLQDQYAFIAEIRQDLEKRDYNRAKELAEQHLVGSKTSQYGTYLSFGDIHIEFSKQGKTL 131

Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           ++   Y+R+L+++ A A   Y      F RE F+S PD ++V + +     +L F + L 
Sbjct: 132 SQVMDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLLVQRFTKEGLETLDFTIELS 191

Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
              D  S      +      C        I  K    D+   ++F++ L  +     G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
               DK +++ G+ +A L L A + F     +    K D   +    +++ +   Y+ L 
Sbjct: 247 RVWSDK-VQISGASYANLFLAAKTDFAQNPASNYRKKIDLEQQVKDLVETAKEKGYAQLK 305

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
           +RH++DYQ LF RV + L                +D   + + +K++   E  +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDLG-------------AEVDASTTDDLLKNYNPQEGQALEELFF 352

Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+NINL+MNYW +   NL E  
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYWPAYVTNLLEAV 412

Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
            P+ +++  L + G + A   Y          +GW++H +     W     D     W  
Sbjct: 413 FPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPGWD---YYWGW 468

Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
            P   AW+   ++E Y++  D+D+L ++ YP+L     F   +L E        ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHEDRQAQRWVSSPSYS 528

Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
           PEH           +S  +T D ++I ++F   I AA+ L  +E  L E V +    L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDESLLTE-VKEKFDLLNP 578

Query: 582 TKIAEDGSIMEWVQ 595
            +I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592


>gi|418966783|ref|ZP_13518495.1| hypothetical protein HMPREF1045_1205 [Streptococcus mitis SK616]
 gi|383346450|gb|EID24496.1| hypothetical protein HMPREF1045_1205 [Streptococcus mitis SK616]
          Length = 803

 Score =  229 bits (585), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 279/599 (46%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSFDYQGGNLQDQYSFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GD+ +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNTAEELAEQHLVGPKTSQYGTYLSFGDLLIEFSRQGKTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y+     F RE F+S PD ++V + +   + +L F + L    D  S      + 
Sbjct: 147 LATTSYAYKGTMFKRESFASFPDDLLVQRFTKEGAETLDFTIELSLTRDLASDGKYEQKK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F+  L  +     G I    DK +++ G+ +
Sbjct: 207 SDYKECQLEITDSHILMKGRVKDN--NLRFAGCLAWQTD---GDIRVWSDK-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   +    +++ +   Y+ L +RH++D Q LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLVQQVRDLVETAKEKGYAQLKSRHIEDCQTLFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L                +D   + + +K+++  E  SL EL FQ+GRYLLISSSR  +
Sbjct: 321 LDLG-------------AEVDASTTDDLLKNYKPQEGQSLEELFFQYGRYLLISSSRDCS 367

Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+NINL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNGVDNPPWNSDYHLNINLQMNYWPAYVTNLLETAFPVINYIDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A   Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTIYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L  R YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLRDRIYPILRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E V +    L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTE-VKEKFELLNPLQITQSGRIREWYE 592


>gi|169604462|ref|XP_001795652.1| hypothetical protein SNOG_05244 [Phaeosphaeria nodorum SN15]
 gi|160706577|gb|EAT87635.2| hypothetical protein SNOG_05244 [Phaeosphaeria nodorum SN15]
          Length = 771

 Score =  229 bits (585), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 170/606 (28%), Positives = 276/606 (45%), Gaps = 76/606 (12%)

Query: 6   STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
           S STT    I F  P   +TDA+P+GNGRLGA++ GG   E + LNED++W+G      N
Sbjct: 21  SASTT----IWFGKPGVIWTDALPVGNGRLGAVIHGGYGMEQVGLNEDSIWSGGLQKRIN 76

Query: 66  PDAPKALSDVRSLVDSGQYAEATAA---SVKLFGHPADVYQLLGDIELEFDDSHLKYAEE 122
            +A  A   +     +G  ++A      ++K  G     YQ  G++ +EF  +    +  
Sbjct: 77  SNALAAFPGIPEAFTNGNISKADEIWHNNLKGTGTQVRQYQPAGNMMIEFGQN--VSSVS 134

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            Y R LDL T    V Y+  +V + R+  +S P   +  + +  ++G+L   +SL     
Sbjct: 135 GYNRSLDLTTGENHVSYTRNDVTYLRQALASYPHDTLGFRYTADKAGALDMKISLT---- 190

Query: 183 NHSYVNGNN------QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
            +  V G         I M G+            ND    ++F  +  I++  D G    
Sbjct: 191 RNESVTGLKVDLEKLSITMYGQ----------GTNDSS--LKF--VHSIRVVADTG---- 232

Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
              K++++           A ++F    +  +++  +   ++  A+       + +  ++
Sbjct: 233 --GKEVRI--------YYGAETTFRHANVEAAEAAMNAKLDAAVAV------PWEEFKSK 276

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD----EDPSLVEL 352
            ++DY+ L  RV +           D  S   I  + + +R+K++ T      DP L+ L
Sbjct: 277 AIEDYKNLADRVQL-----------DVGSSGEIGRLDTGQRLKNWNTTGNATSDPELMAL 325

Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
            + +GR+LLI SSR G+  +NLQG+WN+   P W S   +NIN EMNYW +   NL+E  
Sbjct: 326 TYNYGRFLLIGSSRIGSLPSNLQGVWNDKFKPPWGSRFTININTEMNYWPAETTNLAETH 385

Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
            P+FD L  +   G   A+  Y  SGWV HH TD+W        +  WA  P+GGAWL  
Sbjct: 386 LPVFDHLLRMQEQGRYVAKGMYNMSGWVCHHNTDLWGDCVPVDDQTYWAANPVGGAWLAL 445

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK 532
           HL EH+ +  +  +    A P+L    +F  D+ I+  D Y      +SPE+ +  P  K
Sbjct: 446 HLIEHFRFNGNTTWASSTALPILSDALTFFYDFSIKKGD-YNALIYDSSPENSYHIPSNK 504

Query: 533 -----LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
                   +   S     ++ E+FS  I  +E     +   V K    L  + P  +A D
Sbjct: 505 QVPNATTGIDQGSAHPRQVLHELFSGFIEMSEATGSIDG--VAKAKDYLAHIEPPNVATD 562

Query: 588 GSIMEW 593
           G ++EW
Sbjct: 563 GHLLEW 568


>gi|421290215|ref|ZP_15740965.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA54354]
 gi|421305607|ref|ZP_15756261.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA62331]
 gi|395887900|gb|EJG98914.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA54354]
 gi|395904565|gb|EJH15479.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA62331]
          Length = 803

 Score =  229 bits (584), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYET 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L +   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTDVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|291530512|emb|CBK96097.1| hypothetical protein EUS_08620 [Eubacterium siraeum 70/3]
          Length = 776

 Score =  229 bits (584), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 172/610 (28%), Positives = 290/610 (47%), Gaps = 72/610 (11%)

Query: 14  KITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
           KI F  P    K      PIGNG +GA  +GG+  E + LNE TLW G P + + PD   
Sbjct: 4   KILFTAPGEFDKWEQQCQPIGNGYMGASFFGGISKEKIVLNEKTLWAGGPSE-SRPDYNG 62

Query: 71  ALSD--------VRSLVDSGQYAEATAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYA 120
            + D        V+ L+  G+Y EA A    L G       YQLL D+ L F +     A
Sbjct: 63  GIIDGSYEYVKQVQQLLYDGKYDEAVALLPHLTGATDGYGAYQLLCDMMLTFSNIDETQA 122

Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
            + Y R LDL+ +    +++       RE F++ P  VI  K+S  +   +   +SLD+L
Sbjct: 123 TD-YTRTLDLDNSIFTTQFTYQGAVHKREAFANYPSNVICIKLSSDKPRRICVKLSLDNL 181

Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
                  NG+  +  EG                  G+++  +   K+ +  G +   +D 
Sbjct: 182 QCGSVTANGDT-LTYEGALW-------------DNGLRYCTVF--KVVNKGGELIDAKDS 225

Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPS-DSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
            + VE +D   + L AS+ +   +  P+  +  +P++     +++  +  ++ LY  HL 
Sbjct: 226 -IMVEHADEVYIYLTASTDYSNKY--PTFRTGVNPSAAVNQRIENAVSKGFNALYEEHLA 282

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE----LLFQ 355
           DY+ LF  V+++++    DI+            P  + ++ ++ +   S+      L FQ
Sbjct: 283 DYKALFDSVTLKINEDTDDII------------PCDKLIREYKENGSRSIANRLETLYFQ 330

Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
           FGRY+LISSSR G+  ANLQG+WNE   P W    H+N+NL+MNYW +   NLSE   PL
Sbjct: 331 FGRYMLISSSRAGSLPANLQGVWNESNCPPWCCDYHINVNLQMNYWGAYNTNLSETVPPL 390

Query: 416 FDFLTYLSINGSKTAQVNYL--------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
            DFL  +  +G K+A+  Y          +GW  H ++  +   +A      W       
Sbjct: 391 VDFLDSMRPSGRKSAEAYYGIKSDEEHPENGWCAHTQSTPFG-WTAPGWNFYWGWSTAAV 449

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEF 526
           AWL  +++E++ +T D+ +  +  YP++     F   WLI +     L ++P+ SPEH  
Sbjct: 450 AWLMQNIYEYFEFTGDKKYFAEHIYPIMRESVRFYTQWLIYDDKQKRLVSSPTYSPEH-- 507

Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
                    V+  +T + ++I ++++  I+A+E L  +E+ L   V   + +L+P  +++
Sbjct: 508 -------GPVTIGNTYEQSLIEQLYNDFITASEALGTDEE-LRNIVKNQVVQLKPFSVSK 559

Query: 587 D-GSIMEWVQ 595
             G + EW +
Sbjct: 560 KTGLLKEWFE 569


>gi|419428224|ref|ZP_13968401.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5652-06]
 gi|379616100|gb|EHZ80800.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5652-06]
          Length = 707

 Score =  229 bits (584), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 168/534 (31%), Positives = 265/534 (49%), Gaps = 56/534 (10%)

Query: 72  LSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRREL 128
           L  +R  +  G+  +A     + +F  P D   Y+LLG++ +E  D     A   Y REL
Sbjct: 3   LKKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYEREL 61

Query: 129 DLNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
           DL+TA + V +  +  N++  RE+F+S    ++  +I  S   +L+ N++L  +   ++ 
Sbjct: 62  DLDTAISNVVFDPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDE 121

Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
                ++ I+M     G+            KG+QF  +   K++D  G +S L  + + +
Sbjct: 122 VSKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVI 166

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQK 303
             +    L L + +++ G                +S+LQ    ++ Y      H+  YQ+
Sbjct: 167 RNATEVFLYLKSMTNYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQE 213

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
            F+RV  +L  S KD ++       I T    E  K +       L  LLF +GRYLLIS
Sbjct: 214 QFNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLIS 261

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SS+P    ANLQGIW ++L+P W S   +NIN +MNYW   PC+L E + PLFD L  + 
Sbjct: 262 SSQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMR 321

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
             G  TA+  Y A G+  HH TD +  ++     +  A+W +   WLCTH+WEHY Y  D
Sbjct: 322 EPGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQD 381

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
              L +  + +++    F  D+L E  DGYL T PS SPE+++   +G       SST+D
Sbjct: 382 ERILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTID 439

Query: 544 MAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             I+R    + I  A+ L  N D +  V+++ K LP+   TKI  +G I EW++
Sbjct: 440 NQILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 490


>gi|256376305|ref|YP_003099965.1| hypothetical protein Amir_2174 [Actinosynnema mirum DSM 43827]
 gi|255920608|gb|ACU36119.1| conserved hypothetical protein [Actinosynnema mirum DSM 43827]
          Length = 646

 Score =  229 bits (583), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 114/266 (42%), Positives = 157/266 (59%), Gaps = 5/266 (1%)

Query: 328 NIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWD 387
            +D  P+ +   S    E P+L  LLFQ GR+LL++SSRPGT  ANLQG+WN    P W 
Sbjct: 199 ELDLGPAPDGPPSTWPREHPALAALLFQHGRHLLVASSRPGTLPANLQGVWNPHAEPPWR 258

Query: 388 SAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDI 447
           S   +NIN EMNYW + P  L+EC EPL +FL  L+ +G++ A+  Y   GW  HH TD 
Sbjct: 259 SNYTLNINTEMNYWPAEPTALAECHEPLLEFLHGLAESGTRVARELYGLPGWCAHHNTDR 318

Query: 448 WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI 507
           W  ++  +G   WA WPM GAWL  HLWE Y +  D  +L  RA+PLL G A F L WL+
Sbjct: 319 WFLATPVQGDPAWANWPMAGAWLSLHLWERYEFGGDAVWLRGRAWPLLLGAAEFCLAWLV 378

Query: 508 EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA 567
           E   G L T PSTSPE+ ++  DG+   V   +TMD+A+  E+   ++ A  VL ++   
Sbjct: 379 E-DRGELTTAPSTSPENHYLTADGREVAVGVGATMDLALTWELLDRVVRAGAVLGED--- 434

Query: 568 LVEKVLKSLPRLRPTKIAEDGSIMEW 593
            V +  ++L R+    +  DG ++EW
Sbjct: 435 -VGRFAEALARIPEPPVGSDGRVLEW 459



 Score = 46.6 bits (109), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 51/114 (44%), Gaps = 12/114 (10%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN---PDAPKALSDVR 76
           PA  + +A PIG+GR GAM WG        LN+D LWT       +     AP+ +   R
Sbjct: 15  PAARWEEAHPIGDGRFGAMCWG---DGRFDLNDDRLWTDPSPPDPSQPAAGAPEVVRAAR 71

Query: 77  SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
           +   +G    A      + G     YQ LG + L +       AE  YRRELDL
Sbjct: 72  AAALAGDPERADELLRSVQGPDTASYQPLGTLVLGY------RAEGGYRRELDL 119


>gi|306824549|ref|ZP_07457895.1| possible alpha-L-fucosidase [Streptococcus sp. oral taxon 071 str.
           73H25AP]
 gi|304433336|gb|EFM36306.1| possible alpha-L-fucosidase [Streptococcus sp. oral taxon 071 str.
           73H25AP]
          Length = 1749

 Score =  229 bits (583), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 183/616 (29%), Positives = 301/616 (48%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 184 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 241

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            +++G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 242 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 301

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSYV-- 187
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N +Y   
Sbjct: 302 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 361

Query: 188 -----NGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
                NG+     N I+++G         K N      G++F++ L IK     G + A+
Sbjct: 362 YSHYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIKTD---GKV-AV 404

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E+     +++ +   Y  L 
Sbjct: 405 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLENTVKGIVEAAKAKDYETLK 461

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  S     T              E ++S+  ++   L EL F
Sbjct: 462 QDHIKDYQSLFNRVKLNLGGSKTAQTT-------------KEALQSYNPEKGQKLEELFF 508

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NLSE  
Sbjct: 509 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLSETA 568

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 569 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 623

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 624 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKVSDRWV-SSPS 682

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L  ++D LV +V     +L
Sbjct: 683 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 732

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I  +G I EW +
Sbjct: 733 KPLHINNEGRIKEWYE 748


>gi|270292150|ref|ZP_06198365.1| fibronectin type III domain protein [Streptococcus sp. M143]
 gi|270279678|gb|EFA25520.1| fibronectin type III domain protein [Streptococcus sp. M143]
          Length = 1747

 Score =  229 bits (583), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 181/616 (29%), Positives = 301/616 (48%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   +  K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYQ--ERYKVLAEIRK 199

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 200 ALEVGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G++F++ L IK +D + T+   
Sbjct: 320 YSNYKNGHVTTDENGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L 
Sbjct: 363 QDETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  +  D  T              E ++ +  D+   L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGNKTDQTT-------------KEALQGYNPDKGQKLEELFF 466

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 527 KPMINYIDDMRYYGRVAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L  ++D LV +V     +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYE 706


>gi|418190394|ref|ZP_12826903.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47373]
 gi|353851653|gb|EHE31644.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47373]
          Length = 682

 Score =  228 bits (582), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 165/511 (32%), Positives = 255/511 (49%), Gaps = 55/511 (10%)

Query: 94  LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTRE 149
           +F  P D   Y+LLG++ +E  D     A   Y RELDL+TA + V +  +  N++  RE
Sbjct: 1   MFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKRE 59

Query: 150 HFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPK 207
           +F+S    ++  +I  S   +L+ N++L  +   ++      ++ I+M     G+     
Sbjct: 60  YFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR----- 114

Query: 208 ANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP 267
                  KG+QF  +   K++D  G +S L  + + +  +    L L + + + G     
Sbjct: 115 -------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI--- 161

Query: 268 SDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE 326
                      +S+LQ    ++ Y      H+  YQ+ F+RV  +L  S KD ++     
Sbjct: 162 ----------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS----- 205

Query: 327 ENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTW 386
             I T    E  K +       L  LLF +GRYLLISSS+P    ANLQGIW ++L+P W
Sbjct: 206 --IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIW 259

Query: 387 DSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTD 446
            S   +NIN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+  HH TD
Sbjct: 260 GSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTD 319

Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
            +  ++     +  A+W +   WLCTH+WEHY Y  D   L +  + +++    F  D+L
Sbjct: 320 GFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYL 378

Query: 507 IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED 566
            E  DGYL T PS SPE+++   +G       SST+D  I+R    + I  A+ L  N D
Sbjct: 379 FEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSD 437

Query: 567 AL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +  V+++ K LPR   TKI  +G I EW++
Sbjct: 438 FISRVKELKKKLPR---TKIGSNGQIQEWLE 465


>gi|293364225|ref|ZP_06610951.1| conserved hypothetical protein [Streptococcus oralis ATCC 35037]
 gi|307702420|ref|ZP_07639376.1| alpha-fucosidase [Streptococcus oralis ATCC 35037]
 gi|291317071|gb|EFE57498.1| conserved hypothetical protein [Streptococcus oralis ATCC 35037]
 gi|307624002|gb|EFO02983.1| alpha-fucosidase [Streptococcus oralis ATCC 35037]
          Length = 1707

 Score =  228 bits (582), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 181/616 (29%), Positives = 304/616 (49%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            +++G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G+QF++ L IK +D + T+   
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLQFASYLGIK-TDGKVTV--- 362

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E      +++ ++  Y  L 
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKSKDYETLK 419

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  +               T  + E ++ +  ++   L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGT-------------KTTQTTKEALQGYNPEKGQKLEELFF 466

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L+ ++D LV +V     +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKL 690

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYE 706


>gi|365119726|ref|ZP_09337619.1| hypothetical protein HMPREF1033_00965 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363648290|gb|EHL87470.1| hypothetical protein HMPREF1033_00965 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1009

 Score =  228 bits (582), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 161/501 (32%), Positives = 244/501 (48%), Gaps = 45/501 (8%)

Query: 105 LGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI 163
           L DIELE++  +      + Y R LD++ A   V Y      FTRE F S PD V+V ++
Sbjct: 317 LSDIELEYEQLYEPLEPYSDYVRMLDIDNAVHSVIYKENGTTFTRECFMSYPDNVMVMRL 376

Query: 164 SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
              + G +S    + S           N + M G+      P     N    G++F+   
Sbjct: 377 KADKGGCISRTFGITSPQPKKRIFASGNTLTMTGQ------PALHKEN----GLKFAQ-- 424

Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKKDPTSESMSA 281
           ++K+ +  G +  +++KK++V+ +D  +LL+ A++++        D  S +DP +     
Sbjct: 425 QVKVLNKGGYLEVIDNKKIRVKDADEVILLMSAATNYQQSMDEKFDYFSDEDPLTTVKRT 484

Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
           L +  + +Y DL + H  DY+ L+ R+S+ L          T        +   +  K  
Sbjct: 485 LMAAESKTYEDLLSSHKKDYKALYDRMSLNLGNI-------TGMSTKTTDILLKDFYKGN 537

Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
             +E+     L +QFGRYLLI+SSR  +  ANLQG+W E LS  W++  H NIN++MNYW
Sbjct: 538 TVEENLYTEMLYYQFGRYLLIASSRENSLPANLQGVWGERLSNPWNADYHTNINVQMNYW 597

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADR 455
            +   NLS C  PL  ++  L   G  TA+  Y         GWV HH+ +IW  ++   
Sbjct: 598 PAQQTNLSPCHIPLISYINSLVPRGKITARHYYCKPDGGDVRGWVTHHENNIWGNTAPGT 657

Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLD--WLIEGHDGY 513
               +   P G AW+C  +WE+Y +  D+ FLE+  Y  L G A F +D  W  E  DG 
Sbjct: 658 SYGAFHF-PAGAAWMCQDIWEYYQFNCDKKFLEQN-YNTLLGAALFWVDNLWTDE-RDGT 714

Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE-KV 572
           L  NPS SPEH     +  L C    ST+  A+I E+F  +I A+E L K+   + E K 
Sbjct: 715 LVANPSHSPEH----GEYSLGC----STV-QAMIAEIFDIVIKASEDLGKDTKEVAEIKA 765

Query: 573 LKSLPRLRPTKIAEDGSIMEW 593
            KS  +L   +I   G  MEW
Sbjct: 766 AKS--KLAGPQIGLGGQFMEW 784



 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 42/56 (75%), Gaps = 3/56 (5%)

Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
          +K  +N PAK + ++A+PIGNG +GAM++G V  + +++NE +LW+G PG+  NPD
Sbjct: 40 MKAVYNKPAKVWESEALPIGNGYMGAMIFGDVYRDVIQVNEHSLWSGGPGE--NPD 93


>gi|417939732|ref|ZP_12583021.1| gram positive anchor [Streptococcus oralis SK313]
 gi|343389927|gb|EGV02511.1| gram positive anchor [Streptococcus oralis SK313]
          Length = 1727

 Score =  228 bits (582), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 180/617 (29%), Positives = 301/617 (48%), Gaps = 98/617 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   +  K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--ERYKVLAEIRK 199

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            +++G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G++F++ L IK +D + T+   
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L 
Sbjct: 363 QDETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKTKDYETLK 419

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  S     T              E ++ +  ++   L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGSKTGQTT-------------KEALQGYNPEKGQKLEELFF 466

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 467 QYGRYLLISSSRDQTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L  ++D LV +V     +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKTKFDKL 690

Query: 580 RPTKIAEDGSIMEWVQR 596
           +P  I ++G I EW + 
Sbjct: 691 KPLHINKEGRIKEWYEE 707


>gi|419778183|ref|ZP_14304079.1| gram positive anchor [Streptococcus oralis SK10]
 gi|383187500|gb|EIC79950.1| gram positive anchor [Streptococcus oralis SK10]
          Length = 1707

 Score =  228 bits (581), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 181/616 (29%), Positives = 304/616 (49%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            +++G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G+QF++ L IK +D + T+   
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLQFASYLGIK-TDGKVTV--- 362

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E      +++ ++  Y  L 
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKSKDYETLK 419

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  +               T  + E ++ +  ++   L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGT-------------KTTQTTKEALQGYNPEKGQKLEELFF 466

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L+ ++D LV +V     +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKL 690

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYE 706


>gi|406576906|ref|ZP_11052529.1| LPXTG cell surface protein, calx-beta domain-containing protein
           [Streptococcus sp. GMD6S]
 gi|404460587|gb|EKA06837.1| LPXTG cell surface protein, calx-beta domain-containing protein
           [Streptococcus sp. GMD6S]
          Length = 1707

 Score =  228 bits (581), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 180/616 (29%), Positives = 302/616 (49%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            +++G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G++F++ L IK +D + T+   
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L 
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ+LF+RV + L               N     + E ++ +  ++   L EL F
Sbjct: 420 KAHIKDYQRLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 466

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L  ++D LV +V     +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYE 706


>gi|400594907|gb|EJP62734.1| alpha-fucosidase [Beauveria bassiana ARSEF 2860]
          Length = 798

 Score =  228 bits (581), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 167/593 (28%), Positives = 272/593 (45%), Gaps = 63/593 (10%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
           T  + IGNGR+GA ++G   +E + LNED++W+G   +       +AL  +R  +     
Sbjct: 42  TGVLAIGNGRIGAAIFGS-GNEVITLNEDSIWSGPLQNRMPTRGLQALPKIRQQLVEDNI 100

Query: 85  AEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
            EAT++ +     P+     VY   G++ L+F           Y R LD     A + Y+
Sbjct: 101 TEATSSIMNDM-MPSVSRERVYSYFGNLHLDFGHER---GMTNYVRWLDTRQGNAGISYT 156

Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL---DSLLDNHSYVNGNNQIIMEG 197
              + +TRE+ +S P  ++  + + S++G+LSFN +     ++L N +    N  ++   
Sbjct: 157 YNGINYTREYIASFPAGILAARFTASKAGALSFNTTFTRESNILANSASATTNGGLLTMR 216

Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
              G+      +  +DP  I F+   +  I+D+  T  ++    L + G+    L     
Sbjct: 217 GSSGQ------STKNDP--ILFTGKGQF-IADNAHT--SVSGSTLSITGATEVDLFFDIE 265

Query: 258 SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
           +S+         +++   +E    L++     Y+D+    + D   L  R SI   +SP 
Sbjct: 266 TSYR------HQTQQKLEAEVDRKLKASIAKGYTDIRDGAIADATALLGRASINFGKSPN 319

Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG----TQVA 372
                         +P+ +R+K  +   +D  L  L + +GR+LL++SSR      +  A
Sbjct: 320 GAAN----------LPTDKRIKMARKGLDDTQLAVLAWNYGRHLLVASSRHNDADVSLPA 369

Query: 373 NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQV 432
           NL G+WN   +  W     +N+NLEMNYW +   N+ E QE +F  L      G + AQ 
Sbjct: 370 NLLGLWNNRTTSAWGGKFTINVNLEMNYWPAGQTNIIETQESMFSLLKIAKPRGEEMAQK 429

Query: 433 NYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
            Y  +G V HH  D+W  ++         +WPMG AW   H+ +HY +T D  FL   AY
Sbjct: 430 LYGCNGTVFHHNLDLWGDAAPSDNNTSATMWPMGAAWTVQHMMDHYRFTGDAGFLLHTAY 489

Query: 493 PLLEGCASFL----LDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMD 543
           P L   ASF      DW      G   T PS SPE+ FI P      G       +  MD
Sbjct: 490 PFLTDVASFYRCYAFDW-----QGSKVTGPSVSPENSFIVPKNASVAGSRKAYDIAPEMD 544

Query: 544 MAIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             ++R+V  +++ AA+ L   + +ED  V++  K LP +R   I   G I+EW
Sbjct: 545 NQLMRDVMESLLEAAKALNIPQTDED--VKEATKFLPLIRRPAIGSYGQILEW 595


>gi|15901489|ref|NP_346093.1| hypothetical protein SP_1654 [Streptococcus pneumoniae TIGR4]
 gi|111658563|ref|ZP_01409226.1| hypothetical protein SpneT_02000319 [Streptococcus pneumoniae
           TIGR4]
 gi|421243582|ref|ZP_15700095.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2081074]
 gi|421247923|ref|ZP_15704402.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2082170]
 gi|14973145|gb|AAK75733.1| conserved hypothetical protein [Streptococcus pneumoniae TIGR4]
 gi|395606587|gb|EJG66691.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2081074]
 gi|395612939|gb|EJG72972.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2082170]
          Length = 803

 Score =  228 bits (580), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 281/599 (46%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL++NYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQLNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y +  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YLFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|336431570|ref|ZP_08611417.1| hypothetical protein HMPREF0991_00536, partial [Lachnospiraceae
           bacterium 2_1_58FAA]
 gi|336011929|gb|EGN41858.1| hypothetical protein HMPREF0991_00536 [Lachnospiraceae bacterium
           2_1_58FAA]
          Length = 1869

 Score =  228 bits (580), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 176/643 (27%), Positives = 302/643 (46%), Gaps = 88/643 (13%)

Query: 6   STSTTNPLKITFNGPAKHFTD----------AIPIGNGRLGAMVWGGVPSETLKLNEDTL 55
           + S +  LK+ +  PA   T           ++P+GNG LG +++GG+  E +  NE TL
Sbjct: 40  TESISQSLKLWYTSPANINTQETNGGEWMQQSLPLGNGNLGNLIFGGISKERIHFNEKTL 99

Query: 56  WTGVP---------GDYTNPDAPKALSDVRSLVDS------------GQYAEATAASVKL 94
           WTG P         G+       + + + R L+D             G Y     A +K 
Sbjct: 100 WTGGPSPSRPGYQFGNKATAYTDEEIENYRKLLDDKSTKVFNDDQSLGGYG--MGAQIKF 157

Query: 95  FGHP---ADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLNTATARVKYSVGNVEFTREH 150
            G        YQ  GDI L+F    L+    + YRRELDL T  A  ++S  +V + REH
Sbjct: 158 PGENNLNKGSYQDFGDIWLDFSKMGLQDQNVKNYRRELDLQTGVASTEFSYEDVNYKREH 217

Query: 151 FSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGNNQIIMEGRCPGKRIPPK 207
           F SNPDQ++VTK+S SESG L  +V ++   + L+  +  +  NQ      C    I  K
Sbjct: 218 FVSNPDQIMVTKLSASESGKLDLSVKMELNNNGLEGKTTFDSENQT-----CT---IEGK 269

Query: 208 ANANDDPKGIQFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPFIN 266
              ND    ++F   +++ +  + G +   E  ++ ++E ++  ++++ A + +   +  
Sbjct: 270 VKDND----LKFYTTMKLVL--EGGDLEVDEKNQVYQIEDANQVMIVMAAETDYKNDYPT 323

Query: 267 PSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE 326
             D +K+        + S    SY  L  +H+ D+QKLF RVS+ L     +I       
Sbjct: 324 YRDKEKNLKKMVDDRVNSNAKKSYQKLKEKHIADHQKLFDRVSLDLGEQRTNI------- 376

Query: 327 ENIDTVPSAERVKSFQTDEDPSLVELL-FQFGRYLLISSSRPGTQVANLQGIWNEDLSPT 385
                 P+ + V  ++       +E+L FQ+GRYL I+ SR GT  +NL G+W    S  
Sbjct: 377 ------PTNQLVDEYRNGTYSHYLEVLAFQYGRYLTIAGSR-GTLPSNLVGLWTVGDSA- 428

Query: 386 WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNYLA 436
           W    H N+N++MNYW     NL+EC     D+         LT   ++G + A  N+  
Sbjct: 429 WTGDYHFNVNVQMNYWPVYTTNLAECGVTFVDYMDKLREPGRLTAERVHGIEGAVENH-- 486

Query: 437 SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
           +G+ +H + + +  ++    +  +   P G AW   +LW HY +T + D+L+   YP+++
Sbjct: 487 TGFTVHTENNPFGMTAPTNAQ-EYGWNPTGAAWAIQNLWWHYEFTQNEDYLKNTIYPIMK 545

Query: 497 GCASFLLD--WLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIREVFS 552
             A F     W  E      E++P    +   +AP    +    +  +T D +++ E++ 
Sbjct: 546 EAAQFWDSYLWTSEYQKINDESSPYNGQDRLVVAPSFSEEQGPTAIGTTYDQSLVWELYK 605

Query: 553 AIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             I A +++ ++E AL++   +++ +L P +I E   I EW +
Sbjct: 606 ECIQAGKIVGEDE-ALLKSWEENMQKLDPIEINETNGIKEWYE 647


>gi|315611778|ref|ZP_07886700.1| alpha-L-fucosidase [Streptococcus sanguinis ATCC 49296]
 gi|315316193|gb|EFU64223.1| alpha-L-fucosidase [Streptococcus sanguinis ATCC 49296]
          Length = 1707

 Score =  228 bits (580), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 180/616 (29%), Positives = 302/616 (49%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            +++G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G++F++ L IK +D + T+   
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L 
Sbjct: 363 QDETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKVKDYETLK 419

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L               N     + E ++ +  ++   L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 466

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L+ ++D LV +V     +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKL 690

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYE 706


>gi|418094446|ref|ZP_12731573.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA49138]
 gi|353764942|gb|EHD45490.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA49138]
          Length = 803

 Score =  228 bits (580), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 176/610 (28%), Positives = 286/610 (46%), Gaps = 84/610 (13%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +L +G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLCSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
            A   Y      F RE F+S PD ++V   +     +L F + L     L  N  Y    
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCYLASNGKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A + Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAALKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
            AW+   ++E Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH 
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     +S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I 
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582

Query: 586 EDGSIMEWVQ 595
           + G I EW +
Sbjct: 583 QSGRIREWYE 592


>gi|148988700|ref|ZP_01820133.1| hypothetical protein CGSSp6BS73_09249 [Streptococcus pneumoniae
           SP6-BS73]
 gi|182684597|ref|YP_001836344.1| hypothetical protein SPCG_1627 [Streptococcus pneumoniae CGSP14]
 gi|303255977|ref|ZP_07342005.1| hypothetical protein CGSSpBS455_10785 [Streptococcus pneumoniae
           BS455]
 gi|303258584|ref|ZP_07344564.1| hypothetical protein CGSSp9vBS293_05634 [Streptococcus pneumoniae
           SP-BS293]
 gi|303262671|ref|ZP_07348611.1| hypothetical protein CGSSp14BS292_00767 [Streptococcus pneumoniae
           SP14-BS292]
 gi|303263611|ref|ZP_07349533.1| hypothetical protein CGSSpBS397_07359 [Streptococcus pneumoniae
           BS397]
 gi|303266372|ref|ZP_07352261.1| hypothetical protein CGSSpBS457_04537 [Streptococcus pneumoniae
           BS457]
 gi|303268245|ref|ZP_07354043.1| hypothetical protein CGSSpBS458_04243 [Streptococcus pneumoniae
           BS458]
 gi|387759769|ref|YP_006066747.1| hypothetical protein SPNINV200_14790 [Streptococcus pneumoniae
           INV200]
 gi|419515161|ref|ZP_14054786.1| fibronectin type III domain protein [Streptococcus pneumoniae
           England14-9]
 gi|421296489|ref|ZP_15747198.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA58581]
 gi|147925901|gb|EDK76976.1| hypothetical protein CGSSp6BS73_09249 [Streptococcus pneumoniae
           SP6-BS73]
 gi|182629931|gb|ACB90879.1| hypothetical protein SPCG_1627 [Streptococcus pneumoniae CGSP14]
 gi|301802358|emb|CBW35112.1| conserved hypothetical protein [Streptococcus pneumoniae INV200]
 gi|302597036|gb|EFL64154.1| hypothetical protein CGSSpBS455_10785 [Streptococcus pneumoniae
           BS455]
 gi|302636227|gb|EFL66722.1| hypothetical protein CGSSp14BS292_00767 [Streptococcus pneumoniae
           SP14-BS292]
 gi|302640085|gb|EFL70540.1| hypothetical protein CGSSpBS293_05634 [Streptococcus pneumoniae
           SP-BS293]
 gi|302642196|gb|EFL72545.1| hypothetical protein CGSSpBS458_04243 [Streptococcus pneumoniae
           BS458]
 gi|302644072|gb|EFL74330.1| hypothetical protein CGSSpBS457_04537 [Streptococcus pneumoniae
           BS457]
 gi|302646649|gb|EFL76874.1| hypothetical protein CGSSpBS397_07359 [Streptococcus pneumoniae
           BS397]
 gi|379635710|gb|EIA00269.1| fibronectin type III domain protein [Streptococcus pneumoniae
           England14-9]
 gi|395895362|gb|EJH06337.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA58581]
          Length = 803

 Score =  228 bits (580), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQD 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L   + +  G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYL---VWETDGDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ Y +L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYTMLRETVRFWNAFLRKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|421488290|ref|ZP_15935682.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
           oralis SK304]
 gi|400368666|gb|EJP21674.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
           oralis SK304]
          Length = 1687

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 181/616 (29%), Positives = 301/616 (48%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 198

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            +++G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 199 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITD 258

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 259 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 318

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G++F++ L IK +D + T+   
Sbjct: 319 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 361

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS-KKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP  S +KD   E      +++ +   Y  L 
Sbjct: 362 QDETLTVTGASYATLYLSAKTNFAQ---NPKTSYRKDIDLEKTVKGIVEAAKAKDYETLK 418

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L               N     + E ++ +  ++   L EL F
Sbjct: 419 KAHIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 465

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 466 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 525

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 526 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 580

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 581 WSPAANAWMMQNVYDYYKFTKDESYLKEKIYPMLKETAKFWNSFLHYDKTSDRWV-SSPS 639

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L+ ++D LV +V     +L
Sbjct: 640 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKL 689

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I ++G I EW +
Sbjct: 690 KPLHINKEGRIKEWYE 705


>gi|418139981|ref|ZP_12776806.1| hypothetical protein SPAR28_1620 [Streptococcus pneumoniae GA13338]
 gi|418181010|ref|ZP_12817579.1| hypothetical protein SPAR74_1621 [Streptococcus pneumoniae GA41688]
 gi|353843082|gb|EHE23127.1| hypothetical protein SPAR74_1621 [Streptococcus pneumoniae GA41688]
 gi|353904760|gb|EHE80210.1| hypothetical protein SPAR28_1620 [Streptococcus pneumoniae GA13338]
          Length = 778

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQD 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L   + +  G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYL---VWETDGDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ Y +L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYTMLRETVRFWNAFLRKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|401683949|ref|ZP_10815833.1| LPXTG cell wall anchor domain protein [Streptococcus sp. BS35b]
 gi|400186628|gb|EJO20836.1| LPXTG cell wall anchor domain protein [Streptococcus sp. BS35b]
          Length = 1687

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 182/616 (29%), Positives = 299/616 (48%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 122 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPRPDSTDYNGGNYK--DRYKVLAEIRK 179

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 180 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 239

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   +  L F   N   + LL N      
Sbjct: 240 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKKLDFTLWNSLTEDLLANGEYSWE 299

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G++F++ L IK +D + T+   
Sbjct: 300 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 342

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L 
Sbjct: 343 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 399

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  S     T              E ++S+   +   L EL F
Sbjct: 400 QDHIKDYQNLFNRVKLNLGGSKTAQTT-------------KEALQSYNPSKGQKLEELFF 446

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 447 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 506

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 507 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 561

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 562 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 620

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L  ++D LV +V     +L
Sbjct: 621 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 670

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I ++G I EW +
Sbjct: 671 KPLHINKEGRIKEWYE 686


>gi|210614863|ref|ZP_03290362.1| hypothetical protein CLONEX_02576 [Clostridium nexile DSM 1787]
 gi|210150505|gb|EEA81514.1| hypothetical protein CLONEX_02576 [Clostridium nexile DSM 1787]
          Length = 1797

 Score =  227 bits (578), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 181/645 (28%), Positives = 302/645 (46%), Gaps = 96/645 (14%)

Query: 8   STTNPLKITFNGPAKHFT----------DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
           S    LK+ +  PAK  T           ++P+GNG LG +++GG+  E +  NE TLWT
Sbjct: 43  SINQELKLWYTSPAKIDTAETNGGEWMQQSLPLGNGNLGNLIFGGIAKERIHFNEKTLWT 102

Query: 58  GVPG----DYTNPDAPKALSDV-----RSLVDS------------GQYAEATAASVKLFG 96
           G P     +Y   +   A +D      R L+D             G Y     A +K  G
Sbjct: 103 GGPSSSRPNYQFGNKATAYTDTEIEEYRKLLDDKSTNVFNDDKSLGGYG--MGAKIKFPG 160

Query: 97  HP---ADVYQLLGDIELEF-----DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTR 148
                   YQ  GDI L+F     +D+++K     YRRELD+ T  A  ++S  +V + R
Sbjct: 161 ENNLNKGSYQDFGDIWLDFSKMGINDNNVK----DYRRELDIQTGIAATEFSCKDVTYKR 216

Query: 149 EHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGNNQIIMEGRCPGKRIP 205
           EHF SNPDQV+VT++S SE G L  NV ++   S L+  +  +  NQ      C    I 
Sbjct: 217 EHFVSNPDQVMVTELSASEKGKLDLNVKMELNNSGLEGKTTFDEKNQT-----CT---IE 268

Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPF 264
            K   ND    ++F   +++ ++   G +SA E  ++ +++ +D  ++++ A + +   +
Sbjct: 269 GKVKDND----LKFCTTMKLVLTG--GKLSADEKNQVYQIQDADCVMIVMAAETDYKNDY 322

Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
               D  KD        + +    SY +L   H+ D+Q LF RVS+ L            
Sbjct: 323 PTYRDKNKDLKKVVADRVNNGTKKSYDELKETHIADHQGLFDRVSLDLG----------- 371

Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELL-FQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
             E   +VP+ + V  ++       +E+L FQ+GRYL I+ SR GT  +NL G+W    S
Sbjct: 372 --EQRTSVPTNQLVDEYRNGNYSHYLEVLAFQYGRYLTIAGSR-GTLPSNLVGLWTVGNS 428

Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNY 434
             W    H N+N++MNYW     NL+EC     D+         LT   ++G + A  N+
Sbjct: 429 A-WTGDYHFNVNVQMNYWPVYATNLAECGTTFVDYMDKLREPGRLTAERVHGIEGAVKNH 487

Query: 435 LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPL 494
             +G+ +H + + +  ++    +  +   P G AW   +LW HY +T D  +L+   YP+
Sbjct: 488 --TGFTVHTENNPFGMTAPTNAQ-EYGWNPTGAAWAIQNLWWHYEFTQDEAYLKNTIYPI 544

Query: 495 LEGCASFLLD--WLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIREV 550
           ++  A F     W  E      E +P        +AP    +    +  +T D +++ E+
Sbjct: 545 MKEAALFWDSYLWTSEYQKINDENSPYNGQNRLVVAPSFSEEQGPTAVGTTYDQSLVWEL 604

Query: 551 FSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           ++  I A +++ ++E AL++   + + +L P +I +   I EW +
Sbjct: 605 YNECIKAGKIVGEDE-ALLKSWEEKMQKLDPIEINDTNGIKEWYE 648


>gi|322375926|ref|ZP_08050437.1| fibronectin type III domain protein [Streptococcus sp. C300]
 gi|321279194|gb|EFX56236.1| fibronectin type III domain protein [Streptococcus sp. C300]
          Length = 1707

 Score =  227 bits (578), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 181/617 (29%), Positives = 302/617 (48%), Gaps = 98/617 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            +++G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G+QF++ L IK +D + T+   
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLQFASYLGIK-TDGKVTV--- 362

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E      +++ ++  Y  L 
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKSKDYETLK 419

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  +               T  + E ++ +  ++   L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGT-------------KTTQTTKEALQGYNPEKGQKLEELFF 466

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L  ++D LV +V     +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690

Query: 580 RPTKIAEDGSIMEWVQR 596
           +P  I  +G I EW + 
Sbjct: 691 KPLHINNEGRIKEWYEE 707


>gi|418975961|ref|ZP_13523855.1| gram positive anchor [Streptococcus oralis SK1074]
 gi|383346616|gb|EID24639.1| gram positive anchor [Streptococcus oralis SK1074]
          Length = 1687

 Score =  227 bits (578), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 183/611 (29%), Positives = 299/611 (48%), Gaps = 88/611 (14%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199

Query: 78  LVDSGQYAEATA-ASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            +++G   +A   A   LFG     Y      GDI + F++        T Y R LD+  
Sbjct: 200 ALEAGDRQKAKQLAEQNLFGPNNAQYGRCLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGDYSWE 319

Query: 184 -HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
             +Y NG+      G      I  K    D+  G++F++ L IK     GT++ ++++ L
Sbjct: 320 YSNYKNGHVTTDEHG------ILLKGTVKDN--GLKFASYLGIKTD---GTVT-VQNETL 367

Query: 243 KVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLD 299
            V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L   H+ 
Sbjct: 368 TVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLKKAHIK 424

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DYQ LF+RV + LS S     T              E ++ +  ++   L EL FQ+GRY
Sbjct: 425 DYQSLFNRVKLNLSGSKTAQTT-------------KEALQGYNPEKGQKLEELFFQYGRY 471

Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           LLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  +P+ +
Sbjct: 472 LLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETAKPMIN 531

Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
           ++  +   G           SK  Q N    GW++H +   +  ++       W   P  
Sbjct: 532 YIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 586

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEH 524
            AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS SPEH
Sbjct: 587 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKTSDRWV-SSPSYSPEH 645

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
                      ++  +T D +++ ++F   +  A  L+ ++D LV +V     +L+P  I
Sbjct: 646 ---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVEAKFDKLKPLHI 695

Query: 585 AEDGSIMEWVQ 595
             +G I EW +
Sbjct: 696 NNEGRIKEWYE 706


>gi|317501845|ref|ZP_07960030.1| hypothetical protein HMPREF1026_01974 [Lachnospiraceae bacterium
           8_1_57FAA]
 gi|336439520|ref|ZP_08619132.1| hypothetical protein HMPREF0990_01526 [Lachnospiraceae bacterium
           1_1_57FAA]
 gi|316896735|gb|EFV18821.1| hypothetical protein HMPREF1026_01974 [Lachnospiraceae bacterium
           8_1_57FAA]
 gi|336015952|gb|EGN45750.1| hypothetical protein HMPREF0990_01526 [Lachnospiraceae bacterium
           1_1_57FAA]
          Length = 1802

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 177/638 (27%), Positives = 300/638 (47%), Gaps = 92/638 (14%)

Query: 13  LKITFNGPAKHFT----------DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-- 60
           LK+ +  PAK  T           ++P+GNG LG +++GG+  E +  NE TLWTG P  
Sbjct: 47  LKLWYGSPAKINTAESSGGEWMQQSLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSS 106

Query: 61  -------GDYTNPDAPKALSDVRSLVDS------------GQYAEATAASVKLFGHP--- 98
                  G+         + + R L+D             G Y     A ++  G     
Sbjct: 107 SRPNYQFGNKATAYTATEIENYRKLLDDKSSNVFNDDQSLGGYG--MGAKIRFPGEDNLN 164

Query: 99  ADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
              YQ  GDI L+F    +     + YRREL+L T  A  ++S  NV + REHF S+PDQ
Sbjct: 165 KGSYQDFGDIWLDFSAMGITDDNVQNYRRELNLQTGIASTEFSYKNVSYKREHFVSSPDQ 224

Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI 217
           V+VT +S SE G L+F+  ++  L+N    N   ++  + R     I  K   ND    +
Sbjct: 225 VMVTNLSASEKGKLNFSAKME--LNND---NLEGKLTFDVRNQTCTIEGKVKDND----L 275

Query: 218 QFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
           +F   +++ ++   G I+A E  ++ +++ +D   +++ A + +   +    D +K+ ++
Sbjct: 276 KFRTTMKLLLTG--GEITADEKNQVYRIKNADQVTIIMAAETDYKNDYPTYRDKEKNLSN 333

Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
              + +      SY +L   H++D+Q LF RVS+ L      + TD      ID   +  
Sbjct: 334 VIDTRINDSSKKSYDELKQTHIEDHQSLFDRVSLDLGEFQTSVPTDQL----IDEYRNGS 389

Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPT-WDSAPHVNIN 395
                +T        L FQ+GRYL I+ SR GT  +NL G+W   + P+ W    H N+N
Sbjct: 390 YSHYLET--------LAFQYGRYLTIAGSR-GTLPSNLVGLWT--VGPSAWTGDYHFNVN 438

Query: 396 LEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNYLASGWVIHHKTD 446
           ++MNYW     NL+EC     D+         LT   ++G K A  N+  +G+ +H + +
Sbjct: 439 VQMNYWPVYSTNLAECGTTFVDYMDKLREPGRLTAERVHGIKGAVDNH--TGFTVHTENN 496

Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
            +  ++    +  +   P G AW   +LW HY +T D  +L+   YP+++  A F   +L
Sbjct: 497 PFGMTAPTNAQ-EYGWNPTGAAWAVQNLWWHYEFTQDEAYLKNTIYPIMKEAAQFWDSYL 555

Query: 507 IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS---------STMDMAIIREVFSAIISA 557
                 Y + N  TSP H     +  +A  S+S         +T D ++I E+++  I A
Sbjct: 556 WTSE--YQKINDETSPYH---GENRLVAAPSFSEEQGPTAIGTTYDQSLIWELYNECIQA 610

Query: 558 AEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +++ ++E A+++   + + +L P +I     I EW +
Sbjct: 611 GKIVGEDE-AVLQSWEEKMQKLDPIEINATNGIKEWYE 647


>gi|331088642|ref|ZP_08337553.1| hypothetical protein HMPREF1025_01136 [Lachnospiraceae bacterium
           3_1_46FAA]
 gi|330407599|gb|EGG87099.1| hypothetical protein HMPREF1025_01136 [Lachnospiraceae bacterium
           3_1_46FAA]
          Length = 1802

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 177/638 (27%), Positives = 300/638 (47%), Gaps = 92/638 (14%)

Query: 13  LKITFNGPAKHFT----------DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-- 60
           LK+ +  PAK  T           ++P+GNG LG +++GG+  E +  NE TLWTG P  
Sbjct: 47  LKLWYGSPAKINTAESSGGEWMQQSLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSS 106

Query: 61  -------GDYTNPDAPKALSDVRSLVDS------------GQYAEATAASVKLFGHP--- 98
                  G+         + + R L+D             G Y     A ++  G     
Sbjct: 107 SRPNYQFGNKATAYTATEIENYRKLLDDKSSNVFNDDQSLGGYG--MGAKIRFPGEDNLN 164

Query: 99  ADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
              YQ  GDI L+F    +     + YRREL+L T  A  ++S  NV + REHF S+PDQ
Sbjct: 165 KGSYQDFGDIWLDFSAMGITDDNVQNYRRELNLQTGIASTEFSYKNVSYKREHFVSSPDQ 224

Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI 217
           V+VT +S SE G L+F+  ++  L+N    N   ++  + R     I  K   ND    +
Sbjct: 225 VMVTNLSASEKGKLNFSAKME--LNND---NLEGKLTFDVRNQTCTIEGKVKDND----L 275

Query: 218 QFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
           +F   +++ ++   G I+A E  ++ +++ +D   +++ A + +   +    D +K+ ++
Sbjct: 276 KFRTTMKLLLTG--GEITADEKNQVYRIKNADQVTIIMAAETDYKNDYPTYRDKEKNLSN 333

Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
              + +      SY +L   H++D+Q LF RVS+ L      + TD      ID   +  
Sbjct: 334 VIDTRINDSSKKSYDELKQTHIEDHQSLFDRVSLDLGEFQTSVPTDQL----IDEYRNGS 389

Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPT-WDSAPHVNIN 395
                +T        L FQ+GRYL I+ SR GT  +NL G+W   + P+ W    H N+N
Sbjct: 390 YSHYLET--------LAFQYGRYLTIAGSR-GTLPSNLVGLWT--VGPSAWTGDYHFNVN 438

Query: 396 LEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNYLASGWVIHHKTD 446
           ++MNYW     NL+EC     D+         LT   ++G K A  N+  +G+ +H + +
Sbjct: 439 VQMNYWPVYSTNLAECGTTFVDYMDKLREPGRLTAERVHGIKGAVDNH--TGFTVHTENN 496

Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
            +  ++    +  +   P G AW   +LW HY +T D  +L+   YP+++  A F   +L
Sbjct: 497 PFGMTAPTNAQ-EYGWNPTGAAWAVQNLWWHYEFTQDEAYLKNTIYPIMKEAAQFWDSYL 555

Query: 507 IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS---------STMDMAIIREVFSAIISA 557
                 Y + N  TSP H     +  +A  S+S         +T D ++I E+++  I A
Sbjct: 556 WTSE--YQKINDETSPYH---GENRLVAAPSFSEEQGPTAIGTTYDQSLIWELYNECIQA 610

Query: 558 AEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +++ ++E A+++   + + +L P +I     I EW +
Sbjct: 611 GKIVGEDE-AVLQSWEEKMQKLDPIEINATNGIKEWYE 647


>gi|306830121|ref|ZP_07463305.1| possible alpha-L-fucosidase [Streptococcus mitis ATCC 6249]
 gi|304427647|gb|EFM30743.1| possible alpha-L-fucosidase [Streptococcus mitis ATCC 6249]
          Length = 1685

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 179/610 (29%), Positives = 295/610 (48%), Gaps = 86/610 (14%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 198

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 199 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 258

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 259 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGDYSWE 318

Query: 184 -HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
             +Y NG+      G      I  K    D+  G++F++ L IK     GT++ ++++ L
Sbjct: 319 YSNYKNGHVTTDEHG------ILLKGTVKDN--GLKFASYLGIKTD---GTVT-VQNETL 366

Query: 243 KVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLD 299
            V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L   H+ 
Sbjct: 367 TVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKAKDYETLKKDHIK 423

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DYQ LF+RV + L               N  T  + E ++S+   +   L EL FQ+GRY
Sbjct: 424 DYQSLFNRVKLNLGG-------------NKTTQTTKEALQSYNPSKGQKLEELFFQYGRY 470

Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           LLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  +P+ +
Sbjct: 471 LLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETAKPMIN 530

Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
           ++  +   G           SK  Q N    GW++H +   +  ++       W   P  
Sbjct: 531 YIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 585

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHE 525
            AW+  +++++Y +T D  +L+++ YP+L+  A F   +L  +       ++PS SPEH 
Sbjct: 586 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKESDRWVSSPSYSPEH- 644

Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
                     ++  +T D +++ ++F   +  A  L+ ++D LV +V     +L+P  I 
Sbjct: 645 --------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLKPLHIN 695

Query: 586 EDGSIMEWVQ 595
            +G I EW +
Sbjct: 696 NEGRIKEWYE 705


>gi|187735615|ref|YP_001877727.1| hypothetical protein Amuc_1120 [Akkermansia muciniphila ATCC
           BAA-835]
 gi|187425667|gb|ACD04946.1| conserved hypothetical protein [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 796

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 182/608 (29%), Positives = 275/608 (45%), Gaps = 100/608 (16%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
            +  PIGNGR+GAM++     E L LNE +LW+                        G Y
Sbjct: 65  AEGYPIGNGRVGAMIFSAPGRERLALNEISLWS------------------GGANPGGGY 106

Query: 85  AEATAASVKLFGHPADVYQLLGDIELEFD--DSHLKYAEETYRRELDLNTATARVKYSVG 142
                A    FG+    Y   GD+ ++F   D     + E + R LDL     +V Y   
Sbjct: 107 GYGPDAGTNQFGN----YLPFGDLFVDFKKGDQPASLSVEDFTRSLDLRDGIHKVNYKAD 162

Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGK 202
            V + RE FSS P  V+V     S+ G  S + S++S L       G+  I  +G     
Sbjct: 163 GVTYDREAFSSTPANVLVLNYKASKPGQFSADFSVNSQLGADISAKGS-VITWKGMLK-- 219

Query: 203 RIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
                        G+ +     + I    GT+SA  DK + V+ +D  ++++   + +  
Sbjct: 220 ------------NGMNYEG--RVLIRPKGGTLSASGDK-ISVKNADSCMVVIAMETDY-- 262

Query: 263 PFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
                 D KKD   ES S           +  Y+ L   H+  Y+ +F RV +   ++  
Sbjct: 263 ----LMDYKKDWKGESPSRKLDRYAAKAASADYAALKQAHISQYKSMFDRVKVNFGKT-- 316

Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQG 376
                   EE++  +P+ +R+++++ +  DP L E +FQFGRYLL+SSSRPGT  ANLQG
Sbjct: 317 --------EEDVAKLPTPKRLEAYKKNPADPDLEETMFQFGRYLLLSSSRPGTLPANLQG 368

Query: 377 IWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN--- 433
           +WN+ + P W    H NIN++M YW + P NLSEC E L +++  ++      +Q N   
Sbjct: 369 LWNDYVKPPWACDYHNNINVQMAYWGAEPANLSECHEALVNYVEAMAPGCRDASQANKGF 428

Query: 434 -----YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
                    GW +    +I+  +        W     G AW   H+WEHY +T DR +LE
Sbjct: 429 NTKDGKPVRGWTVRTSQNIFGGNG-------WQWNIPGAAWYALHIWEHYAFTGDRKYLE 481

Query: 489 KRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHE-----------FIAPDG--- 531
           K+AYPL++    F  D L E   G +G+ +TN     E E            +AP+G   
Sbjct: 482 KQAYPLMKEICHFWEDHLKELGAGGEGF-KTNGKDPSEEEKKDLADVKAGTLVAPNGWSP 540

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSI 590
           +          D  +I E+FS  I AA +L K  DA   K L+  L RL   KI ++G++
Sbjct: 541 EHGPREDGVMHDQQLIAELFSNTIKAARILGK--DAAWAKSLEGKLKRLAGNKIGKEGNL 598

Query: 591 MEWVQRRL 598
            EW+  R+
Sbjct: 599 QEWMIDRI 606


>gi|419781688|ref|ZP_14307504.1| gram positive anchor [Streptococcus oralis SK610]
 gi|383183996|gb|EIC76526.1| gram positive anchor [Streptococcus oralis SK610]
          Length = 1707

 Score =  226 bits (576), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 179/616 (29%), Positives = 299/616 (48%), Gaps = 96/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   +  K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--ERYKVLAEIRK 199

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            +++G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G++F++ L IK +D + T+   
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L 
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  S     T              E ++ +  ++   L EL F
Sbjct: 420 NAHIKDYQSLFNRVKLNLGGSKTAQTT-------------KEALQGYNPEKGQKLEELFF 466

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPST 520
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L  +       ++PS 
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKESDRWVSSPSY 641

Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
           SPEH           ++  +T D +++ ++F   +  A  L+ ++D LV +V     +L+
Sbjct: 642 SPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLK 691

Query: 581 PTKIAEDGSIMEWVQR 596
           P  I ++G I EW + 
Sbjct: 692 PLHINKEGRIKEWYEE 707


>gi|153815077|ref|ZP_01967745.1| hypothetical protein RUMTOR_01294 [Ruminococcus torques ATCC 27756]
 gi|145847645|gb|EDK24563.1| F5/8 type C domain protein [Ruminococcus torques ATCC 27756]
          Length = 1812

 Score =  226 bits (576), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 177/638 (27%), Positives = 300/638 (47%), Gaps = 92/638 (14%)

Query: 13  LKITFNGPAKHFT----------DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-- 60
           LK+ +  PAK  T           ++P+GNG LG +++GG+  E +  NE TLWTG P  
Sbjct: 57  LKLWYGSPAKINTAESSGGEWMQQSLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSS 116

Query: 61  -------GDYTNPDAPKALSDVRSLVDS------------GQYAEATAASVKLFGHP--- 98
                  G+         + + R L+D             G Y     A ++  G     
Sbjct: 117 SRPNYQFGNKATAYTATEIENYRKLLDDKSSNVFNDDQSLGGYG--MGAKIRFPGEDNLN 174

Query: 99  ADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
              YQ  GDI L+F    +     + YRREL+L T  A  ++S  NV + REHF S+PDQ
Sbjct: 175 KGSYQDFGDIWLDFSAMGITDDNVQNYRRELNLQTGIASTEFSYKNVSYKREHFVSSPDQ 234

Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI 217
           V+VT +S SE G L+F+  ++  L+N    N   ++  + R     I  K   ND    +
Sbjct: 235 VMVTNLSASEKGKLNFSAKME--LNND---NLEGKLTFDVRNQTCTIEGKVKDND----L 285

Query: 218 QFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
           +F   +++ ++   G I+A E  ++ +++ +D   +++ A + +   +    D +K+ ++
Sbjct: 286 KFRTTMKLLLTG--GEITADEKNQVYRIKNADQVTIIMAAETDYKNDYPTYRDKEKNLSN 343

Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
              + +      SY +L   H++D+Q LF RVS+ L      + TD      ID   +  
Sbjct: 344 VIDTRINDSSKKSYDELKQTHIEDHQSLFDRVSLDLGEFQTSVPTDQL----IDEYRNGS 399

Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPT-WDSAPHVNIN 395
                +T        L FQ+GRYL I+ SR GT  +NL G+W   + P+ W    H N+N
Sbjct: 400 YSHYLET--------LAFQYGRYLTIAGSR-GTLPSNLVGLWT--VGPSAWTGDYHFNVN 448

Query: 396 LEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNYLASGWVIHHKTD 446
           ++MNYW     NL+EC     D+         LT   ++G K A  N+  +G+ +H + +
Sbjct: 449 VQMNYWPVYSTNLAECGTTFVDYMDKLREPGRLTAERVHGIKGAVDNH--TGFTVHTENN 506

Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
            +  ++    +  +   P G AW   +LW HY +T D  +L+   YP+++  A F   +L
Sbjct: 507 PFGMTAPTNAQ-EYGWNPTGAAWAVQNLWWHYEFTQDEAYLKNTIYPIMKEAAQFWDSYL 565

Query: 507 IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS---------STMDMAIIREVFSAIISA 557
                 Y + N  TSP H     +  +A  S+S         +T D ++I E+++  I A
Sbjct: 566 WTSE--YQKINDETSPYH---GENRLVAAPSFSEEQGPTAIGTTYDQSLIWELYNECIQA 620

Query: 558 AEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +++ ++E A+++   + + +L P +I     I EW +
Sbjct: 621 GKIVGEDE-AVLQSWEEKMQKLDPIEINATNGIKEWYE 657


>gi|154505582|ref|ZP_02042320.1| hypothetical protein RUMGNA_03121 [Ruminococcus gnavus ATCC 29149]
 gi|153794240|gb|EDN76660.1| hypothetical protein RUMGNA_03121, partial [Ruminococcus gnavus
           ATCC 29149]
          Length = 1873

 Score =  226 bits (576), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 170/612 (27%), Positives = 292/612 (47%), Gaps = 78/612 (12%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           ++P+GNG LG +++GG+  E +  NE TLWTG P         G+       + + + R 
Sbjct: 4   SLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSPSRPGYQFGNKATAYTDEEIENYRK 63

Query: 78  LVDS------------GQYAEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAE- 121
           L+D             G Y     A +K  G        YQ  GDI L+F    L+    
Sbjct: 64  LLDDKSTKVFNDDQSLGGYG--MGAQIKFPGENNLNKGSYQDFGDIWLDFSKMGLQDQNV 121

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD--- 178
           + YRRELDL T  A  ++S  +V + REHF SNPDQ++VTK+S SESG L  +V ++   
Sbjct: 122 KNYRRELDLQTGVASTEFSYEDVNYKREHFVSNPDQIMVTKLSASESGKLDLSVKMELNN 181

Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
           + L+  +  +  NQ      C    I  K   ND    ++F   +++ +  + G +   E
Sbjct: 182 NGLEGKTTFDPENQT-----CT---IEGKVKDND----LKFYTTMKLVL--EGGDLEVDE 227

Query: 239 DKKL-KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
             ++ ++E ++  ++++ A + +   +    D +K+        + S    SY  L  +H
Sbjct: 228 KNQVYQIEDANQVMIVMAAETDYKNDYPTYRDKEKNLKKMVDDRVNSNAKKSYQKLKEKH 287

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL-FQF 356
           + D+QKLF RVS+ L     +I             P+ + V  ++       +E+L FQ+
Sbjct: 288 IADHQKLFDRVSLDLGEQRTNI-------------PTNQLVDEYRNGTYSHYLEVLAFQY 334

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GRYL I+ SR GT  +NL G+W    S  W    H N+N++MNYW     NL+EC     
Sbjct: 335 GRYLTIAGSR-GTLPSNLVGLWTVGDSA-WTGDYHFNVNVQMNYWPVYTTNLAECGVTFV 392

Query: 417 DF---------LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
           D+         LT   ++G + A  N+  +G+ +H + + +  ++    +  +   P G 
Sbjct: 393 DYMDKLREPGRLTAERVHGIEGAVENH--TGFTVHTENNPFGMTAPTNAQ-EYGWNPTGA 449

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLD--WLIEGHDGYLETNPSTSPEHE 525
           AW   +LW HY +T + D+L+   YP+++  A F     W  E      E++P    +  
Sbjct: 450 AWAIQNLWWHYEFTQNEDYLKNTIYPIMKEAAQFWDSYLWTSEYQKINDESSPYNGQDRL 509

Query: 526 FIAP--DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
            +AP    +    +  +T D +++ E++   I A +++ ++E AL++   +++ +L P +
Sbjct: 510 VVAPSFSEEQGPTAIGTTYDQSLVWELYKECIQAGKIVGEDE-ALLKSWEENMQKLDPIE 568

Query: 584 IAEDGSIMEWVQ 595
           I E   I EW +
Sbjct: 569 INETNGIKEWYE 580


>gi|418098974|ref|ZP_12736071.1| fibronectin type III domain protein [Streptococcus pneumoniae
           6901-05]
 gi|353768956|gb|EHD49478.1| fibronectin type III domain protein [Streptococcus pneumoniae
           6901-05]
          Length = 795

 Score =  226 bits (576), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 174/599 (29%), Positives = 279/599 (46%), Gaps = 70/599 (11%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + SE ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P      +Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGIYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D ++++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSD-RVQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN D         H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNSDY--------HLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 418

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 419 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 475

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 476 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 526

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 527 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 584


>gi|415750047|ref|ZP_11477991.1| fibronectin type III domain protein [Streptococcus pneumoniae SV35]
 gi|381318341|gb|EIC59066.1| fibronectin type III domain protein [Streptococcus pneumoniae SV35]
          Length = 803

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 173/599 (28%), Positives = 281/599 (46%), Gaps = 62/599 (10%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+ IGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALLIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQD 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F R+ F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592


>gi|331265740|ref|YP_004325370.1| LPXTG cell surface protein, calx-beta domain-containing protein
           [Streptococcus oralis Uo5]
 gi|326682412|emb|CBZ00029.1| LPXTG cell surface protein, calx-beta domain protein [Streptococcus
           oralis Uo5]
          Length = 1707

 Score =  226 bits (575), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 181/617 (29%), Positives = 300/617 (48%), Gaps = 98/617 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            +++G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G+QF++ L IK +D + T+   
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLQFASYLGIK-TDGKVTV--- 362

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS-KKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E      ++  +   Y  L 
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKNNYRKDIDLEKTVKGIVEVAKAKDYETLK 419

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  +               T  + E ++ +  ++   L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGT-------------KTTQTTKEALQGYNPEKGQKLEELFF 466

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L  ++D LV +V     +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690

Query: 580 RPTKIAEDGSIMEWVQR 596
           +P  I  +G I EW + 
Sbjct: 691 KPLHINNEGRIKEWYEE 707


>gi|417794521|ref|ZP_12441772.1| gram positive anchor [Streptococcus oralis SK255]
 gi|334269196|gb|EGL87623.1| gram positive anchor [Streptococcus oralis SK255]
          Length = 1707

 Score =  226 bits (575), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 179/614 (29%), Positives = 300/614 (48%), Gaps = 94/614 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSLV 79
           A+P+GNG +GA V+G +  E ++ NE TLW+G P     DY      D  K L+++R  +
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSSDYNGGNYKDRYKVLAEIRKAL 201

Query: 80  DSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNTAT 134
           + G   +A   + +    P +     Y   GDI + F++        T Y R LD+  AT
Sbjct: 202 EDGDRQKAKRLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYYRGLDITEAT 261

Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN-------H 184
               Y+     F RE FSS PD V VT ++   + +L F   N   + LL N        
Sbjct: 262 TTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWEYS 321

Query: 185 SYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
           +Y NG+     N I+++G         K N      G++F++ L IK     GT++ +++
Sbjct: 322 NYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIKTD---GTVT-VQN 364

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTR 296
           + L V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L   
Sbjct: 365 ETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKAKDYETLKKA 421

Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
           H+ DYQ LF+RV + L               N     + E ++ +  ++   L EL FQ+
Sbjct: 422 HIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFFQY 468

Query: 357 GRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  +P
Sbjct: 469 GRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMNNLAETAKP 528

Query: 415 LFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
           + +++  +   G           SK  Q N    GW++H +   +  ++       W   
Sbjct: 529 MINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWGWS 583

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTS 521
           P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS S
Sbjct: 584 PAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPSYS 642

Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
           PEH           ++  +T D +++ ++F   +  A  L+ ++D LV +V     +L+P
Sbjct: 643 PEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLKP 692

Query: 582 TKIAEDGSIMEWVQ 595
             I ++G I EW +
Sbjct: 693 LHINKEGRIKEWYE 706


>gi|385261489|ref|ZP_10039611.1| Gram-positive signal peptide protein, YSIRK family, partial
           [Streptococcus sp. SK643]
 gi|385193017|gb|EIF40405.1| Gram-positive signal peptide protein, YSIRK family, partial
           [Streptococcus sp. SK643]
          Length = 1474

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 181/615 (29%), Positives = 293/615 (47%), Gaps = 96/615 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   +  K L+++R 
Sbjct: 152 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPRPDSTDYNGGNYQ--ERYKVLAEIRK 209

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 210 ALEEGDRQKAKRLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLENVTDYHRGLDITE 269

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV--SL-DSLLDNHSY--- 186
           AT    Y+     F RE FSS PD V VT ++      L F V  SL + LL N +Y   
Sbjct: 270 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTQKGDKKLDFTVWNSLTEDLLANGNYSAE 329

Query: 187 ---------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
                        N I+++G         K N      G++F++ L IK     G ++  
Sbjct: 330 YSHYKSGHVTTDPNGILLKGTV-------KDN------GLRFASYLGIKTD---GKVTVH 373

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           ED  L V G+ +A LLL + ++F     NP ++ +KD   E      +++ R   Y  L 
Sbjct: 374 EDS-LTVTGASYATLLLSSKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAARGKDYETLK 429

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  S     T              E ++++   +   L EL F
Sbjct: 430 KNHIKDYQSLFNRVKLNLGGSNTAQTT-------------KEALQTYNPTKGQKLEELFF 476

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 477 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 536

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 537 KPMINYIDDMRYYGRIAAKEYAGIKSKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 591

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPST 520
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L    D     ++PS 
Sbjct: 592 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKDSDRWVSSPSY 651

Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
           SPEH           ++  +T D +++ ++F   +  A  L+ ++D LV +V     +L+
Sbjct: 652 SPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLK 701

Query: 581 PTKIAEDGSIMEWVQ 595
           P  I ++G I EW +
Sbjct: 702 PLHINKEGRIKEWYE 716


>gi|418134701|ref|ZP_12771558.1| hypothetical protein SPAR23_0971 [Streptococcus pneumoniae GA11426]
 gi|419535112|ref|ZP_14074611.1| hypothetical protein SPAR46_1654 [Streptococcus pneumoniae GA17457]
 gi|353901938|gb|EHE77468.1| hypothetical protein SPAR23_0971 [Streptococcus pneumoniae GA11426]
 gi|379563273|gb|EHZ28277.1| hypothetical protein SPAR46_1654 [Streptococcus pneumoniae GA17457]
          Length = 770

 Score =  225 bits (573), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 174/599 (29%), Positives = 278/599 (46%), Gaps = 70/599 (11%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + SE ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D ++++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSD-RVQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN D         H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNSDY--------HLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 418

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 419 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 475

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 476 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 526

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 527 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 584


>gi|331092304|ref|ZP_08341132.1| hypothetical protein HMPREF9477_01775 [Lachnospiraceae bacterium
           2_1_46FAA]
 gi|330401736|gb|EGG81315.1| hypothetical protein HMPREF9477_01775 [Lachnospiraceae bacterium
           2_1_46FAA]
          Length = 1730

 Score =  225 bits (573), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 169/596 (28%), Positives = 273/596 (45%), Gaps = 62/596 (10%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS- 77
           +PIGN  +GA V+G +  E L  N+ TLW G P         G+    D  K +SDV   
Sbjct: 76  LPIGNSFMGANVYGEIGKERLTFNQKTLWNGGPSTSRPNYKGGNKDTADNGKKMSDVYKE 135

Query: 78  ---LVDSGQYAEATAASVKLFGHPAD--VYQLLGDI--ELEFDDSHLKYAEETYRRELDL 130
              L   G+ A+A   + KL G  A    YQ  GDI  + +FD+S  K     Y R+L++
Sbjct: 136 IIELYKKGEDAKANELAKKLTGEVAGYGAYQSWGDIYVDFKFDESQAK----NYVRDLNM 191

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
             A A V +   N +  RE+F S PD V+  K +   +  L+ ++S    +DN   V G 
Sbjct: 192 ENAVASVDFDYKNTKMHREYFVSYPDNVLAMKFTADGNEKLNLDISFP--IDNAEGVTG- 248

Query: 191 NQIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
                  +  GK +      N      + +  Q     ++K+  + GT+ A +  KL V 
Sbjct: 249 -------KKLGKNVQTTVKDNTITVAGEMQDNQLKLNGKLKVETENGTVEAKDGDKLHVA 301

Query: 246 GSDWAVLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
            +    + + A + +  D P     ++K+         +       Y  +   H+ DY +
Sbjct: 302 NASEVTVYVSADTDYKNDYPKYRTGETKEQLNDSVQKTIDKASKKGYEKVKEDHIADYTE 361

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           +F RV + L +S   + T T      D + +  + K     ED +L  +LFQ+GRYL I+
Sbjct: 362 IFDRVDLDLGQS---VPTKTT-----DVLLNDYKAKKNTAAEDRALEVMLFQYGRYLTIA 413

Query: 364 SSRPGTQVANLQGIWNEDLSPT----WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           SSR G   +NLQG+W   +       W S  H+N+NL+MNYW +   N++EC  PL D++
Sbjct: 414 SSRAGDLPSNLQGVWQNRVGDHNRVPWASDYHMNVNLQMNYWPTYSTNMAECATPLVDYI 473

Query: 420 TYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
             L   G  TA+  + + +G    H  +     +       W   P    W+  + WE+Y
Sbjct: 474 NSLVEPGKVTAKTYFGVENGGFTAHTQNTPFGWTCPGWNFSWGWSPAALPWILQNCWEYY 533

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
            YT D  ++E+  YP+L+  A      LIE    G L + P+ SPEH           V+
Sbjct: 534 EYTGDVKYMEEHIYPMLKEAALLYDQILIEDTKTGRLVSAPAYSPEH---------GPVT 584

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             +T + ++I +++    +AAE+L  ++D   +   +   +L+P +I + G I EW
Sbjct: 585 AGNTYEQSLIWQLYEDAATAAEILNVDKDKAAQ-WRERQAKLKPIEIGDSGQIKEW 639


>gi|168493554|ref|ZP_02717697.1| alpha-fucosidase [Streptococcus pneumoniae CDC3059-06]
 gi|418074476|ref|ZP_12711729.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA11184]
 gi|418087331|ref|ZP_12724500.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47033]
 gi|418090009|ref|ZP_12727163.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA43265]
 gi|418105755|ref|ZP_12742811.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44500]
 gi|418115170|ref|ZP_12752156.1| fibronectin type III domain protein [Streptococcus pneumoniae
           5787-06]
 gi|418117327|ref|ZP_12754296.1| fibronectin type III domain protein [Streptococcus pneumoniae
           6963-05]
 gi|418217095|ref|ZP_12843775.1| fibronectin type III domain protein [Streptococcus pneumoniae
           Netherlands15B-37]
 gi|419432029|ref|ZP_13972162.1| fibronectin type III domain protein [Streptococcus pneumoniae
           EU-NP05]
 gi|419433932|ref|ZP_13974050.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA40183]
 gi|419440837|ref|ZP_13980882.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA40410]
 gi|419464830|ref|ZP_14004721.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA04175]
 gi|419469454|ref|ZP_14009322.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA06083]
 gi|419498019|ref|ZP_14037726.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47522]
 gi|421281641|ref|ZP_15732438.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA04672]
 gi|183576395|gb|EDT96923.1| alpha-fucosidase [Streptococcus pneumoniae CDC3059-06]
 gi|353748545|gb|EHD29197.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA11184]
 gi|353758347|gb|EHD38939.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47033]
 gi|353761200|gb|EHD41772.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA43265]
 gi|353775931|gb|EHD56410.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA44500]
 gi|353785254|gb|EHD65673.1| fibronectin type III domain protein [Streptococcus pneumoniae
           5787-06]
 gi|353788008|gb|EHD68406.1| fibronectin type III domain protein [Streptococcus pneumoniae
           6963-05]
 gi|353870368|gb|EHE50241.1| fibronectin type III domain protein [Streptococcus pneumoniae
           Netherlands15B-37]
 gi|379536430|gb|EHZ01616.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA04175]
 gi|379544258|gb|EHZ09403.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA06083]
 gi|379576933|gb|EHZ41857.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA40183]
 gi|379577907|gb|EHZ42824.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA40410]
 gi|379598852|gb|EHZ63637.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47522]
 gi|379629110|gb|EHZ93711.1| fibronectin type III domain protein [Streptococcus pneumoniae
           EU-NP05]
 gi|395880906|gb|EJG91957.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA04672]
          Length = 795

 Score =  224 bits (572), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 174/599 (29%), Positives = 278/599 (46%), Gaps = 70/599 (11%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + SE ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D ++++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSD-RVQISGASY 260

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN D         H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 368 DALPANLQGVWNSDY--------HLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 418

Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 419 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 475

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 476 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 526

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 527 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 584


>gi|358463765|ref|ZP_09173746.1| signal peptide protein, YSIRK family [Streptococcus sp. oral taxon
           058 str. F0407]
 gi|357067821|gb|EHI77905.1| signal peptide protein, YSIRK family [Streptococcus sp. oral taxon
           058 str. F0407]
          Length = 1707

 Score =  224 bits (572), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 180/616 (29%), Positives = 299/616 (48%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 200 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRSLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G++F++ L IK +D + T+   
Sbjct: 320 YSNYKNGHVTTDENGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +++ L V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L 
Sbjct: 363 QNETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  S     T              E ++ +   +   L EL F
Sbjct: 420 KDHIKDYQSLFNRVKLNLGGSKTAQTT-------------KEALQGYNPSKGQKLEELFF 466

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L  ++D LV +V     +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYE 706


>gi|418174043|ref|ZP_12810655.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA41277]
 gi|353837999|gb|EHE18080.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA41277]
          Length = 774

 Score =  224 bits (572), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 174/599 (29%), Positives = 278/599 (46%), Gaps = 70/599 (11%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + SE ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
            A   Y      F RE F+S PD ++V   +     +L F + L    D  S      + 
Sbjct: 126 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185

Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
                C        I  K    D+   ++F++ L  +     G I    D ++++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSD-RVQISGASY 239

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
           A L L A + F     +    K D   + +  + + +   Y+ L +RH++DYQ LF RV 
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
           + L             E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
               ANLQG+WN D         H+N+NL+MNYW +   NL E   P+ +++  L + G 
Sbjct: 347 DALPANLQGVWNSDY--------HLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 397

Query: 428 KTAQVNYLA--------SGWVIHHKTD--IWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
           + A V Y          +GW++H +     W     D     W   P   AW+   ++E 
Sbjct: 398 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 454

Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
           Y++  D+D+L ++ YP+L     F   +L +        ++PS SPEH           +
Sbjct: 455 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 505

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           S  +T D ++I ++F   I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 506 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 563


>gi|414159134|ref|ZP_11415425.1| YSIRK family Gram-positive signal peptide [Streptococcus sp. F0441]
 gi|410868266|gb|EKS16233.1| YSIRK family Gram-positive signal peptide [Streptococcus sp. F0441]
          Length = 1707

 Score =  224 bits (572), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 178/612 (29%), Positives = 297/612 (48%), Gaps = 88/612 (14%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   D  K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 200 ALEGGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF----NVSLDSLLDN----- 183
           AT    Y+     F RE FSS PD V VT ++   + +L F    N++ D L +      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNNLTEDLLANGDYSWE 319

Query: 184 -HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
             +Y NG+      G      I  K    D+  G++F++ L IK     GT++ ++++ L
Sbjct: 320 YSNYKNGHVTTDEHG------ILLKGTVKDN--GLKFASYLGIKTD---GTVT-VQNETL 367

Query: 243 KVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLD 299
            V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L   H+ 
Sbjct: 368 TVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLKQDHIK 424

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DYQ LF+RV + L  S     T              E ++S+   +   L EL FQ+GRY
Sbjct: 425 DYQSLFNRVKLNLGGSKTAQTT-------------KEALQSYNPSKGQKLEELFFQYGRY 471

Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           LLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  +P+ +
Sbjct: 472 LLISSSRDKTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETAKPMIN 531

Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
           ++  +   G           SK  Q N    GW++H +   +  ++       W   P  
Sbjct: 532 YIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 586

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEH 524
            AW+  +++++Y +T D  +L+++ YP+L+    F   +L   +  D ++ ++PS SPEH
Sbjct: 587 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETTKFWNSFLHYDQASDRWV-SSPSYSPEH 645

Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
                      ++  +T D +++ ++F   +  A  L+ ++D LV +V     +L+P  I
Sbjct: 646 ---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLKPLHI 695

Query: 585 AEDGSIMEWVQR 596
             +G I EW + 
Sbjct: 696 NNEGRIKEWYEE 707


>gi|269955992|ref|YP_003325781.1| hypothetical protein Xcel_1192 [Xylanimonas cellulosilytica DSM
           15894]
 gi|269304673|gb|ACZ30223.1| conserved hypothetical protein [Xylanimonas cellulosilytica DSM
           15894]
          Length = 837

 Score =  224 bits (572), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 185/641 (28%), Positives = 281/641 (43%), Gaps = 84/641 (13%)

Query: 15  ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNPD 67
           + ++ PA  + +A+P+GNG   AM  G    E L LN+   W+G  G       D   P 
Sbjct: 4   LRYDSPATCWDEALPVGNGVRAAMCEGRAGGERLWLNDLRAWSGPVGAGPRGDVDAPVPA 63

Query: 68  A-----------------------PKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQL 104
           A                       P+ L+ VR+ +D G    A     +        Y  
Sbjct: 64  AQDSASQDPAAEDPAAASRRAAAGPEHLAAVRAAIDDGDVRTAERLLQESQSPWVQAYLP 123

Query: 105 LGDIELEFD--DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTK 162
           LG++E+        L      + R LDL TA A   Y++G      E ++      +V  
Sbjct: 124 LGELEVTVTAVGDELAAPGGAHARSLDLRTAVAAHSYALGAARVRHETWADAAGGALVHV 183

Query: 163 ISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGR-------CPGKR-------IPP-- 206
           ++      +       SLL   S                     P  R       +PP  
Sbjct: 184 VTADRP--VRLTARFTSLLRAESDAGAVPVAAAAPDAAAPGVDAPAPRDVLLHRLVPPVD 241

Query: 207 -KANANDDPKGIQFSA-----ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
                   P+ +++       ++ ++ + D   +  +ED +L+  G+  A LLL+ +++ 
Sbjct: 242 VAPGHESAPEPVRYGPTTARLVVAVRAAGDPDAV--VEDGELRT-GAATAHLLLIGTATT 298

Query: 261 DGPFINPSDSKKDPTSESMSALQSIRNLS-YSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
             P    + ++  PT    +AL  +      S     H   ++ L+ RV + L       
Sbjct: 299 HDPA---AGTQATPTEAVAAALALVTGPEPASPRRAAHEAAHRALYDRVELTLP------ 349

Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
                S    DT+P+  R+ +    +DP L  L F +GRYLL++SSRPG   A LQGIWN
Sbjct: 350 -----SSSGADTLPTDARIAAAADVDDPGLTALAFHYGRYLLLASSRPGGLPATLQGIWN 404

Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS-INGSKTAQVNYLASG 438
             L   W SA   NINL+M YW +    L EC EPL  F+  L+   G + A+  Y A G
Sbjct: 405 PLLPGPWSSAYTTNINLQMAYWPAETTALPECHEPLLAFVERLATTTGPEAARRLYGARG 464

Query: 439 WVIHHKTDIWAKS---SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
           WV HH +D W  +    A  G   WA W +GG WL  HLWE + +  D  FL +RA+P+L
Sbjct: 465 WVAHHNSDAWGHADPVGAGHGDPAWASWALGGVWLAHHLWERWLFGGDATFLRERAWPVL 524

Query: 496 EGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAII 555
            G   F LDW ++       T+PSTSPE+ ++APDG+   V  S+TMD  ++R + +A  
Sbjct: 525 RGAGLFALDW-VQSDGTRAWTSPSTSPENHYVAPDGRPTGVGTSATMDGELLRWLAAACR 583

Query: 556 SAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWV 594
           +AA+ L  +ED L  + KV   LP     ++   G ++EW 
Sbjct: 584 AAADALGVSEDWLDDLAKVTALLPA---PEVGPRGELLEWA 621


>gi|419779913|ref|ZP_14305766.1| gram positive anchor [Streptococcus oralis SK100]
 gi|383185738|gb|EIC78231.1| gram positive anchor [Streptococcus oralis SK100]
          Length = 1707

 Score =  224 bits (571), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 180/616 (29%), Positives = 298/616 (48%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y N    K L+++R 
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYKN--RYKVLAEIRK 199

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 200 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G++ ++ L IK +D + T+   
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKLASYLGIK-TDGKVTV--- 362

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L 
Sbjct: 363 QDETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L  S     T              E ++ +  ++   L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGSKTAQTT-------------KEALQGYNPEKGQKLEELFF 466

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L  ++D LV +V     +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDKD-LVTEVKAKFDKL 690

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I  +G I EW +
Sbjct: 691 KPLHINNEGRIKEWYE 706


>gi|417934856|ref|ZP_12578176.1| gram positive anchor [Streptococcus mitis bv. 2 str. F0392]
 gi|340771426|gb|EGR93941.1| gram positive anchor [Streptococcus mitis bv. 2 str. F0392]
          Length = 1668

 Score =  224 bits (570), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 178/616 (28%), Positives = 300/616 (48%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   +  K L+++R 
Sbjct: 103 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--ERYKVLAEIRK 160

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            +++G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 161 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 220

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 221 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 280

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G++F++ L IK +D + T+   
Sbjct: 281 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 323

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +D+ L V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L 
Sbjct: 324 QDETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 380

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L               N     + E ++ +  ++   L EL F
Sbjct: 381 KDHIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 427

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 428 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 487

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 488 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 542

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+    F   +L   +  D ++ ++PS
Sbjct: 543 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETTKFWNSFLHYDQASDRWV-SSPS 601

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L  ++D LV +V     +L
Sbjct: 602 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 651

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I ++G I EW +
Sbjct: 652 KPLHINKEGRIKEWYE 667


>gi|417915380|ref|ZP_12558993.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
           mitis bv. 2 str. SK95]
 gi|342834366|gb|EGU68637.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
           mitis bv. 2 str. SK95]
          Length = 1686

 Score =  223 bits (568), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 177/616 (28%), Positives = 300/616 (48%), Gaps = 98/616 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
           A+P+GNG +GA V+G +  E ++ NE TLW+G P         G+Y   +  K L+++R 
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--ERYKVLAEIRK 198

Query: 78  LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
            ++ G   +A   + +    P +     Y   GDI + F++        T Y R LD+  
Sbjct: 199 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRSLDITE 258

Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
           AT    Y+     F RE FSS PD V VT ++   + +L F   N   + LL N      
Sbjct: 259 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 318

Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
             +Y NG+     N I+++G         K N      G++F++ L IK +D + T+   
Sbjct: 319 YSNYKNGHVTTDENGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 361

Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
           +++ L V G+ +A L L A ++F     NP ++ +KD   E      +++ +   Y  L 
Sbjct: 362 QNETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYKTLK 418

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
             H+ DYQ LF+RV + L               N     + E ++ +  ++   L EL F
Sbjct: 419 KAHIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 465

Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           Q+GRYLLISSSR  T    ANLQG+WN   +P W++  H+N+NL+MNYW +   NL+E  
Sbjct: 466 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 525

Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
           +P+ +++  +   G           SK  Q N    GW++H +   +  ++       W 
Sbjct: 526 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 580

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
             P   AW+  +++++Y +T D  +L+++ YP+L+  A F   +L   +  D ++ ++PS
Sbjct: 581 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 639

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            SPEH           ++  +T D +++ ++F   +  A  L  ++D LV ++     +L
Sbjct: 640 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDKD-LVTEIKAKFDKL 689

Query: 580 RPTKIAEDGSIMEWVQ 595
           +P  I ++G I EW +
Sbjct: 690 KPLHINKEGRIKEWYE 705


>gi|298351514|sp|A2R797.1|AFCA_ASPNC RecName: Full=Probable alpha-fucosidase A; AltName:
           Full=Alpha-L-fucoside fucohydrolase A; Flags: Precursor
 gi|134083134|emb|CAK48586.1| unnamed protein product [Aspergillus niger]
          Length = 793

 Score =  223 bits (567), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 175/593 (29%), Positives = 273/593 (46%), Gaps = 63/593 (10%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YT--NPDAPKA--LSDVRS 77
           T A P+GNGRLGAM  G    E + LN D+LW G P +   Y+  NP+  KA  L  +R 
Sbjct: 36  TTAFPLGNGRLGAMPIGSYDKEIVNLNVDSLWRGGPFESPTYSGGNPNVSKAGALPGIRE 95

Query: 78  LVDSGQYAEATAASVKLFG-HPA-DVYQLLGDIELEFDD-SHLKYAEETYRRELDLNTAT 134
            +    +   T     L G +P    YQ+L ++ ++  + S +    + YRR LDL++A 
Sbjct: 96  WI----FQNGTGNVSALLGEYPYYGSYQVLANLTIDMGELSDI----DGYRRNLDLDSAV 147

Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNV--SLDSLLDNHSYVNGNN 191
               +S G     RE F S PD V V ++S + S   ++F +   L S   N S  +GN+
Sbjct: 148 YSDHFSTGETYIEREAFCSYPDNVCVYRLSSNSSLPEITFGLENQLTSPAPNVS-CHGNS 206

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV-EGSDWA 250
             +      G+  P          G+ ++A + + +     T        +KV EG    
Sbjct: 207 ISLY-----GQTYPVI--------GMIYNARVTVVVPGSSNTTDLCSSSTVKVPEGEKEV 253

Query: 251 VLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
            L+  A ++++    N   S     ++P  + +    +    SYS L + H+ DYQ +F+
Sbjct: 254 FLVFAADTNYEASNGNSKASFSFKGENPYMKVLQTATNAAKKSYSALKSSHVKDYQGVFN 313

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           + ++ L                    P+ E + S+    DP +  LLF +GRYL ISSSR
Sbjct: 314 KFTLTLP-----------DPNGSADRPTTELLSSYSQPGDPYVENLLFDYGRYLFISSSR 362

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-N 425
           PG+   NLQG+W E  SP W    H NINL+MN+W      L E  EPL+ ++    +  
Sbjct: 363 PGSLPPNLQGLWTESYSPAWSGDYHANINLQMNHWAVDQTGLGELTEPLWTYMAETWMPR 422

Query: 426 GSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           G++TA++ Y  S GWV H + + +   +A +    WA +P   AW+  H+W+H++Y+ D 
Sbjct: 423 GAETAELLYGTSEGWVTHDEMNTFGH-TAMKDVAQWADYPATNAWMSHHVWDHFDYSQDS 481

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
            +  +  YP+L+G A F L  L++     DG L  NP  SPEH          C  Y   
Sbjct: 482 AWYRETGYPILKGAAQFWLSQLVKDEYFKDGTLVVNPCNSPEHGPTLTPQTFGCTHYQQ- 540

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
               +I E+F  ++        ++ +    +      L P   I   G I EW
Sbjct: 541 ----LIWELFDHVLQGWTASGDDDTSFKNAITSKFSTLDPGIHIGSWGQIQEW 589


>gi|429766026|ref|ZP_19298301.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
           celatum DSM 1785]
 gi|429185266|gb|EKY26251.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
           celatum DSM 1785]
          Length = 1927

 Score =  222 bits (566), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 170/611 (27%), Positives = 295/611 (48%), Gaps = 85/611 (13%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----------YTNPDAP---KALS 73
           +PIGNG +G  V+G +  E +  NE TLWTG P D           Y N       + L 
Sbjct: 70  LPIGNGDIGGNVYGEIVHERITFNEKTLWTGGPSDKRPNYNGGNKEYANDGITPMYEILQ 129

Query: 74  DVRS----LVDSGQYAEATAASV--KLFG--HPADVYQLLGDIELEF---DDSHLKYAEE 122
            VR       D G   +ATA+S+  +L G       YQ  G+I L+F   D++++     
Sbjct: 130 QVRENFALHTDEG---DATASSLCNQLVGISDGYGAYQAWGEINLDFIGIDENNVT---- 182

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            Y R+L+L  A + V Y+ G+ E+ RE+F S+PD V+V ++  +    L+F+VS  S   
Sbjct: 183 DYVRDLNLRNAISSVNYTYGDTEYIRENFVSHPDDVMVIRVEANGENKLNFDVSFPSKQG 242

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
             + V  N+ I +EG     ++  K N+             ++KI  D G ++   DK L
Sbjct: 243 ATTIVE-NDTITLEGEVSDNQL--KYNS-------------QLKIVSDDGEVTEGTDK-L 285

Query: 243 KVEGSDWAVLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
            VE +  A + + A++ +  D P     ++ ++  +     ++++   SY ++   H+ D
Sbjct: 286 TVENATSATIYISAATDYKNDYPEYRTGETAEELDARVGDVIEALDGKSYEEVKADHIAD 345

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
           Y+ +F RV + L ++  +I TD       +   S E  ++ +         + FQ+GRYL
Sbjct: 346 YKSIFDRVDLDLGQALPNIPTDELLSGYGNNTVSEEARRALEV--------MFFQYGRYL 397

Query: 361 LISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
            I+SSR  +Q+ +NLQG+WN   +P W S  H+N+NL+MNYW +   N++EC  PL +++
Sbjct: 398 TIASSREDSQLPSNLQGVWNNKNNPAWSSDYHMNVNLQMNYWPTYSTNMAECATPLVEYI 457

Query: 420 TYLSINGSKTAQV------------NYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
             L   G +TA++             Y+ +   + H  +     +       W   P   
Sbjct: 458 DSLREPGRETARIYAGVESAKDENGEYIEANGFMAHTQNTPFGWTCPGWSFDWGWSPAAV 517

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEF 526
            W+  ++WE Y YT D +++    YP+++   +   + L+ +     + ++P+ SPEH  
Sbjct: 518 PWILQNVWEMYEYTGDVEYMRDVIYPMMKEEVNLYENMLVWDEVQQRMVSSPTYSPEH-- 575

Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIA 585
                     +  +T +  +I +++   I+AAE L  + D +VE K  +S  +L P +I 
Sbjct: 576 -------GPRTVGNTYEQTLIWQLYEDTITAAETLGVDADLVVEWKDTQS--KLDPIQIG 626

Query: 586 EDGSIMEWVQR 596
           +DG I EW + 
Sbjct: 627 DDGQIKEWFEE 637


>gi|242815430|ref|XP_002486567.1| alpha-L-fucosidase 2 precursor, putative [Talaromyces stipitatus
           ATCC 10500]
 gi|218714906|gb|EED14329.1| alpha-L-fucosidase 2 precursor, putative [Talaromyces stipitatus
           ATCC 10500]
          Length = 773

 Score =  222 bits (566), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 169/600 (28%), Positives = 283/600 (47%), Gaps = 58/600 (9%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--YTNPDAPKA 71
           K+ ++ PA+ + D +PIGNG +GA++     SE    N  + W+G             +A
Sbjct: 5   KLWYDQPAQKWQDGLPIGNGHMGAVIISQPSSEIWSFNNISFWSGRSESTPVIEYGGREA 64

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEETYRREL 128
           L  +R    +  Y      + K        Y    ++  I L  +    + +   +RREL
Sbjct: 65  LDKIRKEYFADNYEHGKRLTEKYLQPEKGNYGTNLMVARIYLALEHGGEEPSFTDFRREL 124

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           +L+ A  R +Y   +V F RE F+S P QV++ ++       ++  + +  +    S  +
Sbjct: 125 NLDEAIVRTEYKSKSVLFRREVFASYPHQVLMARLRTECLEGMNLKLGVSGVTKEFSISD 184

Query: 189 GNNQ--IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           G     ++ E +   + I          +GI       ++     G++  + D +L+V+ 
Sbjct: 185 GETTDCLVFETQAV-EEIHSNGTCGVRGRGI-------VQAHTVGGSVHIV-DGELRVKN 235

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +   ++ +    SF   F + +D   D      + L ++ + SY +L   H+ DYQ L+ 
Sbjct: 236 ASEVIIKV----SFQTDFRSLND---DWKLRVQTLLDNVWDTSYEELRALHVRDYQSLYR 288

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISS 364
           RV I L  +                 P  +R  SFQ     DPSL         YL IS 
Sbjct: 289 RVHIDLGHTEDS------------NFPLNKRKASFQKSGYNDPSL---------YLTISG 327

Query: 365 SRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           +R  + +  +LQGIWN  E  +  W    H++IN +MNY+ +   NL + Q PL  +  Y
Sbjct: 328 TRATSPLPLHLQGIWNDGEANAMNWSCDYHLDINTQMNYFPTETTNLGDLQGPLMRYCEY 387

Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCTHLWEHYNY 480
           L+ +G K+A+  Y A GWV H  +++W  +  D G +  W L   GG W+ TH+ EHY Y
Sbjct: 388 LASSGKKSARNFYGAGGWVAHVFSNVWGYT--DPGWETSWGLNITGGLWMATHMIEHYEY 445

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFI----APDGKLAC 535
           ++DR+FL  +AYP+L   A F LD++ I+   GYL T PS SPE+ F     +P  K   
Sbjct: 446 SLDRNFLTTQAYPVLREAAEFFLDYMTIDPRTGYLVTGPSNSPENSFYPSTQSPREKQE- 504

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           +S   T+D+ ++R++F   I + + L  NE     +V ++L +L P +I + G + EW +
Sbjct: 505 LSLGPTIDITLVRDLFKFCIFSVDELGLNESEFAARVHEALAKLPPFRIGKRGQLQEWFE 564


>gi|336433106|ref|ZP_08612933.1| hypothetical protein HMPREF0991_02052, partial [Lachnospiraceae
           bacterium 2_1_58FAA]
 gi|336017272|gb|EGN47037.1| hypothetical protein HMPREF0991_02052 [Lachnospiraceae bacterium
           2_1_58FAA]
          Length = 1786

 Score =  222 bits (565), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 190/649 (29%), Positives = 303/649 (46%), Gaps = 87/649 (13%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTL 55
           +++AE +   + LK+ +   A    D     ++PIGN  +GA V+GGV +E ++LNE +L
Sbjct: 32  VVHAEESQDRSELKLRYTSAAPDSYDGWEKWSLPIGNSGIGASVFGGVQTERIQLNEKSL 91

Query: 56  WTGVPGDYTNPD-----------APKALSDVRSLVDSGQYAEATAASVKLFGHPADV--- 101
           W+G P + + PD             + + +++ L  +G    A++   +L G   D    
Sbjct: 92  WSGGPSE-SRPDYNGGNLEEKGRNGQTVKEIQQLFANGDNDAASSKCGELVGLSDDAGVN 150

Query: 102 ----YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
               Y   G++ L+F     K  E  Y R LDLNTA A V+Y  G+  +TRE+F S PD 
Sbjct: 151 GYGYYLSYGNMYLDFKGISDKDVE-NYERTLDLNTAIAGVEYDNGDTHYTRENFVSYPDN 209

Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM--------EGRCPGKRIPPKAN 209
           V+VT+++      L+ +V ++   DN +    N   I         E       I     
Sbjct: 210 VLVTRLTAEGGDKLNLDVRVEP--DNEAGGGSNKNTIQAQSYQREWETTVKDALISIDGQ 267

Query: 210 ANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG----PFI 265
             D+   ++FS+  + K+  + GT    ED   KV   D   + ++ S   D     P  
Sbjct: 268 LKDNQ--MRFSS--QTKVLTEGGTT---EDGDEKVTVKDAKAVTIITSIGTDYKNDYPVY 320

Query: 266 NPSDSKKDPTSESMS----ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVT 321
              +S++   S   +    A  ++ N SY  L   H+DDY  +F RV++ L + P     
Sbjct: 321 RTGESQEQVASRVRAYVDKAADTVVNDSYDTLKQAHVDDYSSIFGRVNLDLGQVP----- 375

Query: 322 DTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP--------GTQVAN 373
              SE+  D +  A    S    E   L  +LFQ+GRYL I SSR          T  +N
Sbjct: 376 ---SEKTTDKLLKAYNDGSASEQERRYLEVILFQYGRYLTIESSRETPEDDPSRATLPSN 432

Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
           LQGIW    S  W S  H+N+NL+MNYW +   N++EC +PL  ++  L   G  TA++ 
Sbjct: 433 LQGIWVGANSSAWHSDYHMNVNLQMNYWPTYSTNMAECAQPLISYVDSLREPGRVTAKIY 492

Query: 434 Y-LASGWVIHHKTDIWAKS----SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
             +  G++ H + + +  +    S D     W   P    W+  + WE+Y +T D  +++
Sbjct: 493 AGVDQGFMAHTQNNPFGWTCPGWSFD-----WGWSPAAVPWILQNCWEYYEFTGDVSYMQ 547

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
              YP+++  A F  + LI+   G+L ++PS SPEH    P  + A  +Y  T+    I 
Sbjct: 548 NYIYPMMKEEAIFYDNILIDDGTGHLVSSPSYSPEH---GP--RTAGNTYEQTL----IW 598

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PTKIAEDGSIMEWVQR 596
           +++   I AAE L  + D LV        RL+ P +I + G I EW + 
Sbjct: 599 QLYEDTIKAAETLGVDAD-LVATWKDHQSRLKGPIEIGDSGQIKEWYEE 646


>gi|154503020|ref|ZP_02040080.1| hypothetical protein RUMGNA_00842 [Ruminococcus gnavus ATCC 29149]
 gi|153796374|gb|EDN78794.1| fibronectin type III domain protein [Ruminococcus gnavus ATCC
           29149]
          Length = 2168

 Score =  222 bits (565), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 190/649 (29%), Positives = 303/649 (46%), Gaps = 87/649 (13%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTL 55
           +++AE +   + LK+ +   A    D     ++PIGN  +GA V+GGV +E ++LNE +L
Sbjct: 32  VVHAEESQDRSELKLRYTSAAPDSYDGWEKWSLPIGNSGIGASVFGGVQTERIQLNEKSL 91

Query: 56  WTGVPGDYTNPD-----------APKALSDVRSLVDSGQYAEATAASVKLFGHPADV--- 101
           W+G P + + PD             + + +++ L  +G    A++   +L G   D    
Sbjct: 92  WSGGPSE-SRPDYNGGNLEEKGRNGQTVKEIQQLFANGDNDAASSKCGELVGLSDDAGVN 150

Query: 102 ----YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
               Y   G++ L+F     K  E  Y R LDLNTA A V+Y  G+  +TRE+F S PD 
Sbjct: 151 GYGYYLSYGNMYLDFKGISDKDVE-NYERTLDLNTAIAGVEYDNGDTHYTRENFVSYPDN 209

Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM--------EGRCPGKRIPPKAN 209
           V+VT+++      L+ +V ++   DN +    N   I         E       I     
Sbjct: 210 VLVTRLTAEGGDKLNLDVRVEP--DNEAGGGSNKNTIQAQSYQREWETTVKDALISIDGQ 267

Query: 210 ANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG----PFI 265
             D+   ++FS+  + K+  + GT    ED   KV   D   + ++ S   D     P  
Sbjct: 268 LKDNQ--MRFSS--QTKVLTEGGTT---EDGDEKVTVKDAKAVTIITSIGTDYKNDYPVY 320

Query: 266 NPSDSKKDPTSESMS----ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVT 321
              +S++   S   +    A  ++ N SY  L   H+DDY  +F RV++ L + P     
Sbjct: 321 RTGESQEQVASRVRAYVDKAADTVVNDSYDTLKQAHVDDYSSIFGRVNLDLGQVP----- 375

Query: 322 DTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP--------GTQVAN 373
              SE+  D +  A    S    E   L  +LFQ+GRYL I SSR          T  +N
Sbjct: 376 ---SEKTTDKLLKAYNDGSASEQERRYLEVMLFQYGRYLTIESSRETPEDDPSRATLPSN 432

Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
           LQGIW    S  W S  H+N+NL+MNYW +   N++EC +PL  ++  L   G  TA++ 
Sbjct: 433 LQGIWVGANSSAWHSDYHMNVNLQMNYWPTYSTNMAECAQPLISYVDSLREPGRVTAKIY 492

Query: 434 Y-LASGWVIHHKTDIWAKS----SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
             +  G++ H + + +  +    S D     W   P    W+  + WE+Y +T D  +++
Sbjct: 493 AGVDQGFMAHTQNNPFGWTCPGWSFD-----WGWSPAAVPWILQNCWEYYEFTGDVSYMQ 547

Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
              YP+++  A F  + LI+   G+L ++PS SPEH    P  + A  +Y  T+    I 
Sbjct: 548 NYIYPMMKEEAIFYDNILIDDGTGHLVSSPSYSPEH---GP--RTAGNTYEQTL----IW 598

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PTKIAEDGSIMEWVQR 596
           +++   I AAE L  + D LV        RL+ P +I + G I EW + 
Sbjct: 599 QLYEDTIKAAETLGVDAD-LVATWKDHQSRLKGPIEIGDSGQIKEWYEE 646


>gi|197302981|ref|ZP_03168031.1| hypothetical protein RUMLAC_01709 [Ruminococcus lactaris ATCC
           29176]
 gi|197297976|gb|EDY32526.1| LPXTG-motif cell wall anchor domain protein [Ruminococcus lactaris
           ATCC 29176]
          Length = 1960

 Score =  221 bits (564), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 177/650 (27%), Positives = 303/650 (46%), Gaps = 83/650 (12%)

Query: 1   MMNAESTSTT-----NPLKITFNGPA---KHFT----DAIPIGNGRLGAMVWGGVPSETL 48
            +NAE  + T     N LK+ +  PA   K++      ++PIGNG +G  V+GG+  E +
Sbjct: 29  QVNAEPAAVTQQTGDNDLKLWYTSPADITKYYEGWQEKSLPIGNGAIGGTVFGGITRERI 88

Query: 49  KLNEDTLWTGVP---------GDYTNPDAPKA-LSDVRSLVDSGQYAEATA-ASVKLFGH 97
           +LN+ +LW+G P         G+  N     A ++ + +   +GQ + A + A+  L G 
Sbjct: 89  QLNDKSLWSGGPSTSRPNYNGGNLENKGNNGATMTSIHNYFANGQDSSAISLANSNLVGV 148

Query: 98  PADV-------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREH 150
             D        Y   G++ ++F +         Y R+LDL TA A V Y  G+  ++RE+
Sbjct: 149 SDDAGTNGYGYYLSWGNMYIDFKNVSSNNDVTNYTRDLDLKTAIAGVNYDKGSTHYSREN 208

Query: 151 FSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG----NNQIIMEGRCPGKRIPP 206
           F+S PD VIVT I+   S  +S +VS++      S +NG    + Q   +      RI  
Sbjct: 209 FTSYPDNVIVTHITADGSEKISLDVSVEPDNSRGSAINGIGDSSYQRTWDTTVSDGRISI 268

Query: 207 KANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFIN 266
                D+   ++FS+  ++ I+D+ GT++   D K+ V G+    ++    + +   +  
Sbjct: 269 NGQLTDNQ--MKFSSQTQV-ITDNAGTVTD-GDGKVSVSGASEVTIITSMGTDYKDEY-- 322

Query: 267 PSDSKKDPTSESMSALQ------SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
           PS    +  SE  + ++      +++  +Y +L   H+ DYQ++F+RV + L +      
Sbjct: 323 PSYRTGETASELTNRVKWYVDQAAVK--TYEELKANHVSDYQEIFNRVDLNLGQ------ 374

Query: 321 TDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG----------TQ 370
             T S +  D + SA +  +    E   L  +LFQ+GR++ I SSR            T 
Sbjct: 375 --TVSTKTTDALLSAYKAGTASEAERRQLEVMLFQYGRFMTIESSRETKTDGNGYVRETL 432

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
            +NLQG+W    +  W S  H+N+NL+MNYW +   N++EC +PL D++  L   G  TA
Sbjct: 433 PSNLQGLWVGANNSPWHSDYHMNVNLQMNYWPTYSTNMAECAQPLVDYIDALREPGRVTA 492

Query: 431 QVNYLAS-------GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
            +    S       G++ H + + +  +        W   P    W+  + W +Y YT D
Sbjct: 493 AIYAGVSSADGEENGFMAHTQNNPFGWTCPGW-SFSWGWSPAAVPWILQNCWAYYEYTGD 551

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
             +L    YP+++  A      L+   DG L ++P+ SPEH           V+  +T +
Sbjct: 552 TSYLRDNIYPMMKEEAKLYDRMLVRDSDGKLVSSPAYSPEH---------GPVTSGNTYE 602

Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             +I +++   I AAEVL  + D +            P ++ + G I EW
Sbjct: 603 QTLIWQLYEDTIKAAEVLGTDADLVATWKANQADLKGPIEVGDSGQIKEW 652


>gi|302346987|ref|YP_003815285.1| hypothetical protein HMPREF0659_A7263 [Prevotella melaninogenica
           ATCC 25845]
 gi|302151004|gb|ADK97265.1| conserved hypothetical protein [Prevotella melaninogenica ATCC
           25845]
          Length = 1163

 Score =  221 bits (563), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 155/525 (29%), Positives = 248/525 (47%), Gaps = 70/525 (13%)

Query: 11  NPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           N   + +  PA ++ T  +PIGNG+ GA + G V  + ++ N+ TLW+G  G  T     
Sbjct: 341 NKYTLWYTKPATNWMTSCLPIGNGQFGATLMGDVAIDDVQFNDKTLWSGKLGGLT----- 395

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
                            +TAA    +G+    Y   G++ +    S        Y R LD
Sbjct: 396 -----------------STAA----YGY----YLNFGNLYIR---SRGMSKVTDYVRYLD 427

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD-NHSY-V 187
           +N A A VKY++  V ++R +F+SNPD  +V + + S++G ++  ++L +    N SY V
Sbjct: 428 INDAVAGVKYTMDGVAYSRTYFASNPDSCVVVRYTASQNGKINTTLTLKNQNGRNVSYTV 487

Query: 188 NGNNQ--IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
           + NNQ  I  +G+         A  +D       S     +I  D GTI+      ++V 
Sbjct: 488 DNNNQATITFDGQV--------ARQDDHGATTPESYYCAARIVTDGGTITKNAKGIIEVN 539

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G++   + L   + FD                + + +   +N  Y  L   H  DY+ LF
Sbjct: 540 GANSMTVYLRGLTDFDPDAPTYVSGANLLAGRAAATVNDAQNKGYDALLAAHKADYKSLF 599

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLIS 363
            R  + LS    +I             P+ + + S++ ++  +L   EL F +GRYLLIS
Sbjct: 600 DRCQLTLSDVKNNI-------------PTPQLISSYRDNQHDNLFLEELYFNYGRYLLIS 646

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL---T 420
           SSR  +  ANLQGIWN++ +P W S  H NIN++MNYW + P NLSE   P  D++    
Sbjct: 647 SSRGVSLPANLQGIWNDNNTPAWHSDIHANINVQMNYWPAEPTNLSELHRPFLDYIYREA 706

Query: 421 YLSINGSKTAQ-VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
            +     + AQ + ++ +GW +  + +I+       G      + +  AW C HLW+HY 
Sbjct: 707 CVKPTWRRFAQDMGHVNTGWTLPTENNIYGS-----GTTFANTYTVANAWYCQHLWQHYT 761

Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
           YTMD+DFL  +A+P ++    +    L++  DG  E     SPEH
Sbjct: 762 YTMDKDFLRAKAFPAMKSAVDYWFKKLVKAADGTYECPNEWSPEH 806


>gi|350633541|gb|EHA21906.1| hypothetical protein ASPNIDRAFT_184037 [Aspergillus niger ATCC
           1015]
          Length = 758

 Score =  221 bits (563), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 176/593 (29%), Positives = 275/593 (46%), Gaps = 67/593 (11%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YT--NPDAPKA--LSDVRS 77
           T A P+GNGRLGAM  G    E + LN D+LW G P +   Y+  NP+  KA  L  +R 
Sbjct: 36  TTAFPLGNGRLGAMPIGSYDKEIVNLNVDSLWRGGPFESPTYSGGNPNVSKAGALPGIRE 95

Query: 78  LVDSGQYAEATAASVKLFG-HPA-DVYQLLGDIELEFDD-SHLKYAEETYRRELDLNTAT 134
            +    +   T     L G +P    YQ+L ++ ++  + S +    + YRR LDL++A 
Sbjct: 96  WI----FQNGTGNVSALLGEYPYYGSYQVLANLTIDMGELSDI----DGYRRNLDLDSAV 147

Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNV--SLDSLLDNHSYVNGNN 191
               +S G     RE F S PD V V ++S + S   ++F +   L S   N S  +GN+
Sbjct: 148 YSDHFSTGETYIEREAFCSYPDNVCVYRLSSNSSLPEITFGLENQLTSPAPNVS-CHGNS 206

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV-EGSDWA 250
             +      G+  P          G+ ++A + + +     T        +KV EG    
Sbjct: 207 ISLY-----GQTYPVI--------GMIYNARVTVVVPGSSNTTDLCSSSTVKVPEGEKEV 253

Query: 251 VLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
            L+  A ++++    N   S     ++P  + +    +    SYS L + H+ DYQ +F+
Sbjct: 254 FLVFAADTNYEASNGNSKASFSFKGENPYMKVLQTATNAAKKSYSALKSSHVKDYQGVFN 313

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
           + ++ L                    P+ E + S+    DP++  LLF +GRYL ISSSR
Sbjct: 314 KFTLTLP-----------DPNGSADRPTTELLSSYSQPGDPNVENLLFDYGRYLFISSSR 362

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-N 425
           PG+   NLQG+W E  SP W    H NINL+MN+W      L E  EPL+ ++    +  
Sbjct: 363 PGSLPPNLQGLWTESYSPAWSGDYHANINLQMNHWAVDQTGLGELTEPLWTYMAETWMPR 422

Query: 426 GSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           G++TA++ Y  S GWV H + + +   +A +    WA +P   AW+  H+W+H++Y+ D 
Sbjct: 423 GAETAELLYGTSKGWVTHDEMNTFGH-TAMKDVAQWADYPATNAWMSHHVWDHFDYSQDS 481

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
            +  +  YP+L+G A F L  L++     DG L  NP  SPEH    P     C  Y   
Sbjct: 482 AWYRETGYPILKGAAQFWLSQLVKDEYFKDGTLVVNPCNSPEH---GPT-TFGCTHYQQ- 536

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
               +I E+F  ++        ++ +    +      L P   I   G I EW
Sbjct: 537 ----LIWELFDHVLQGWTASGDDDTSFKNAITSKFSTLDPGIHIGSWGQIQEW 585


>gi|340514861|gb|EGR45120.1| glycoside hydrolase family 95 [Trichoderma reesei QM6a]
          Length = 795

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 172/630 (27%), Positives = 297/630 (47%), Gaps = 68/630 (10%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV-----PGDYTNPDA 68
           ++ +  P+  F  ++P+GNGR  A V      E L LNE + W+G       G    P+ 
Sbjct: 6   RLFYTTPSTAFPTSLPLGNGRFAASVLSSPSKEVLILNEVSFWSGKEQPAGAGLSHKPER 65

Query: 69  PK-ALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSH-LKY 119
            K  L + +    SG YA+    + +        FG    V    G +E+  +    +  
Sbjct: 66  AKDELRETQRCYLSGDYAQGKKRAERFLESRKTNFGTNLGV----GRLEIAVNGQETIDG 121

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
               + REL L+ A    +Y++   +F R  F S+P QV+V ++ G +   L   V +  
Sbjct: 122 VVSGFERELRLDEAVTETRYTLSGRQFKRRCFLSHPHQVLVVQLEGDDLQGLEIEVDVQG 181

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
             +N ++ +  N    +G+        +   +D   G++   ++   +  D G +    +
Sbjct: 182 --ENEAFTSNVN---ADGKLEFNVQALETVHSDGTCGVKGYGLIAATV--DEGKVQR-RN 233

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
            KL +       +L+    +F+  +  P D+ +  T   M A      LS SDL+  HL 
Sbjct: 234 GKLVISAKKSITILV----TFNTDYAEPGDAWRRRTVAQMDA---ALELSASDLFQAHLQ 286

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFG 357
           D+Q L+ RVSI L        +++CS     + P+ +R +SF+     D  +  L F + 
Sbjct: 287 DFQPLYRRVSISLG-------SESCS---TASAPTDQRRQSFEASGYADAGMFALYFHYA 336

Query: 358 RYLLISSSRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           RYL I+ +R  + +  +LQG+WN  E     W    H++IN +MNY+  +   LS+  +P
Sbjct: 337 RYLTIAGTRHDSPLPLHLQGLWNDGEACKMGWSCDYHLDINTQMNYFAIMNSGLSDLMQP 396

Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCTH 473
           L ++L  L  +G  TA+V Y   GWV H  +++W  +  D G +V + L   GG WL +H
Sbjct: 397 LINYLVRLGESGQDTARVCYGCPGWVAHVFSNVWGFT--DPGWEVSYGLNVTGGLWLASH 454

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF--IAPD 530
           L E + Y++D  F    A+ +L G + F LD++IE    G+L T PS SPE+ F  +  D
Sbjct: 455 LIEMFEYSLDDSFTRNEAWSVLLGASKFFLDYMIEDPKTGWLLTGPSVSPENSFFVVKED 514

Query: 531 GKLA--CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV---LKSLPRLRPTKIA 585
           G+      + + T+D+ ++R++F+    A   L+  E    E V    ++L +L P +I 
Sbjct: 515 GEKEEHYAALAPTLDIVLVRDLFAFCEYALTKLDCQESNYKEDVRMYREALAKLPPFQIG 574

Query: 586 EDGSIMEWV---------QRRLNTSFSTCK 606
           ++G + EW+          R L+ + + C+
Sbjct: 575 KNGQLQEWLHDFEEAQPYHRHLSHTMALCR 604


>gi|302523529|ref|ZP_07275871.1| fibronectin type III domain-containing protein [Streptomyces sp.
           SPB78]
 gi|302432424|gb|EFL04240.1| fibronectin type III domain-containing protein [Streptomyces sp.
           SPB78]
          Length = 661

 Score =  220 bits (561), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 155/496 (31%), Positives = 237/496 (47%), Gaps = 46/496 (9%)

Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
           +Q  GD+ ++ D +    + E Y R LDL  A A V Y      F R  F+S PD+V+V 
Sbjct: 20  HQTFGDLLIDVDGA--PGSAEGYTRTLDLAQALATVSYPHDGATFHRTVFTSCPDKVLVG 77

Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
             +    GS+  N+   S   + +     +++ + G                  G++F A
Sbjct: 78  HFTADRGGSVGLNLRYTSPRQDFTATTDGDRLTVRGAL-------------QDNGMRFEA 124

Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
             +I++  + GT++A  D+ L V G+D A  +L A + +   +  P     DP     +A
Sbjct: 125 --QIRLLSEGGTVTANGDR-LAVSGADSAWFVLSAGTDYADTY--PDYRGADPHDRVATA 179

Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
           +       Y +L  RH  D+  LF RV + L +       D+  +   D +  A    S 
Sbjct: 180 VDQAAARPYRELLDRHTSDHAALFSRVVLDLGQ-------DSAPDRTTDALLKAYTGGS- 231

Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
            + +D +L  L FQ+GRYLLI+SSR G+  ANLQG WN   +P W +  HVNINL+MNYW
Sbjct: 232 -SADDRALEALFFQYGRYLLIASSRAGSLPANLQGAWNNSTAPPWSADYHVNINLQMNYW 290

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVW 460
            +   NL+E   P   F+  L   G  TA+  + A GWV+H +T  +  +   D     W
Sbjct: 291 PAEATNLAETTAPYDRFVEALRAPGRTTARSMFDARGWVVHDETTPFGFTGVHDWPTSFW 350

Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPS 519
             +P   AWL + L+EHY +    D+L   AYP ++  A F +D L  +  D  L   PS
Sbjct: 351 --FPEAAAWLTSQLYEHYRFDGSTDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPS 408

Query: 520 TSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
            SPEH +F A           + M   I+RE+F   + AA+ L  ++ A    + ++L R
Sbjct: 409 FSPEHGDFTA----------GAAMSQQIVRELFLNTLEAAQTL-GDDPAFRATLKETLDR 457

Query: 579 LRPT-KIAEDGSIMEW 593
           + P  +I   G +MEW
Sbjct: 458 IDPGLRIGSWGQLMEW 473


>gi|336429327|ref|ZP_08609294.1| hypothetical protein HMPREF0994_05300 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336002938|gb|EGN33035.1| hypothetical protein HMPREF0994_05300 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 779

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 178/594 (29%), Positives = 275/594 (46%), Gaps = 53/594 (8%)

Query: 21  AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKALSDVRSLV 79
           A+ + +A  +GNGR+GA V+GGV  ET+ L+E T ++G      N   A  A  ++RSL+
Sbjct: 11  AERWQEAYLLGNGRMGAAVYGGVFEETVDLSEITFFSGSSSSENNQKGAALAFQEMRSLL 70

Query: 80  DSGQYAEATAASVKLFGHPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
             G+   A   +    G   +    L  G +++  ++S  K   + Y R LDL T    +
Sbjct: 71  QEGKEEAAMERASDFIGIRENYGTNLPVGRLKIMLENSGEK--PDGYVRRLDLQTGLFSM 128

Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
           +Y        R  F S PDQV   +I   +  SLS  + ++          G N      
Sbjct: 129 EYRQEGSTVVRNAFVSWPDQVFCYEIKTGKPESLSGRIWVE---------GGENPFSART 179

Query: 198 RCPGKRIPPKANA---NDDPKGIQFSAILEI-----KISDDRGTISALEDKKLKVEGSDW 249
                R   +A     +D   G+  S +++      KIS   GTI+     +L +     
Sbjct: 180 EEEEYRFQVQAREKLHSDGSCGVDLSGMVKAWCEDGKISCSGGTIAFTGCSRLLIG---- 235

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
             L +              D K     +S+          Y  + +RH++D +    RVS
Sbjct: 236 --LWMETDYEEKAGLTACKDKKAGCAKQSLPK-------EYDRIRSRHMEDVKSRMERVS 286

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
           + L    +        +E+   VP+ ERV  S Q  EDP L  L FQFGRYLL  SSR  
Sbjct: 287 LCLGTKEE--------QEDAAAVPTDERVLASRQGKEDPLLFALAFQFGRYLLQCSSRED 338

Query: 369 TQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI- 424
           + + A+LQG+WN++++    W    H++IN +MNYW S P NL EC+ PLF ++  L I 
Sbjct: 339 SPLPAHLQGVWNDNVACRIGWTCDMHLDINTQMNYWLSGPGNLPECRRPLFAWMEKLLIP 398

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
           +G  +A+ +Y   GW     ++ W  S+    + + +  P GG W  +   EHY YT D 
Sbjct: 399 SGRISARESYGRKGWSADLVSNAWGFSAPYWSRTI-SPCPTGGIWQASDYMEHYRYTRDE 457

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
            F  + AYP++     F   ++ EG DG   + PS SPE+ +I  +G+    S   T ++
Sbjct: 458 AFAREHAYPVIREAVEFFTGYVFEGEDGCYLSGPSISPENAYIK-EGEKRFFSNGCTYEI 516

Query: 545 AIIREVFSAIISAAEV---LEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +IRE+    +  A     L + + ALV +  K LPRL P +I  DG++ EW  
Sbjct: 517 LMIRELLEEFLELASFLPDLAEKDRALVMQAEKILPRLLPYRILPDGTLAEWAH 570


>gi|345882387|ref|ZP_08833873.1| hypothetical protein HMPREF0666_00049 [Prevotella sp. C561]
 gi|345044169|gb|EGW48214.1| hypothetical protein HMPREF0666_00049 [Prevotella sp. C561]
          Length = 1163

 Score =  219 bits (558), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 152/530 (28%), Positives = 248/530 (46%), Gaps = 80/530 (15%)

Query: 11  NPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           N   + +  PA ++ T  +PIGNG+ GA + G V  + ++ N+ TLW+G  G  T     
Sbjct: 341 NKYTLWYTKPATNWMTSCLPIGNGQFGATLMGDVAIDDVQFNDKTLWSGKLGGLT----- 395

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET----YR 125
                            +TAA    +G+            L F + +++  E T    Y 
Sbjct: 396 -----------------STAA----YGY-----------YLNFGNLYIRSRELTKVTDYV 423

Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD-NH 184
           R LD+N A A V+Y++  V + R +F++NPD  +V + + SE G ++  ++L +    N 
Sbjct: 424 RYLDINDAVAGVRYTMDGVAYDRTYFATNPDSCLVIRYTASEKGRINTTLTLKNQNGRNV 483

Query: 185 SY-VNGNNQ--IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
           +Y V+ NNQ  I  EG+         A  ND       S     +I  D G+++      
Sbjct: 484 NYTVDNNNQATITFEGKV--------ARQNDKGATTPESYYCAARIVTDGGSVTKNAKGL 535

Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           ++V G++   + L   + FD                + + + +  N  Y  L   H  DY
Sbjct: 536 IEVSGANSMTVYLRGLTDFDPDAAEYVSGADRLAGRATATVNNAENKGYDALLAAHKADY 595

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRY 359
           + LF R  + L+ S              +T+P+ + + +++ ++  +L   EL F +GRY
Sbjct: 596 KSLFDRCQLTLADSK-------------NTIPTPQLISNYRDNQHDNLFLEELYFNYGRY 642

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LLISSSR  +  ANLQGIWN++ +P W S  H NIN++MNYW + P NLSE   P  D++
Sbjct: 643 LLISSSRGVSLPANLQGIWNDNNTPAWHSDIHANINVQMNYWPAEPTNLSELHRPFLDYI 702

Query: 420 TYLSINGSKT-----AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
            Y       T       + ++ +GW +  + +I+       G      + +  AW C HL
Sbjct: 703 -YREACVKPTWRRFAKDMGHVNTGWTLPTENNIYGS-----GTTFANTYTVANAWYCQHL 756

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
           W+HY YTMD++FL  +A+P ++    +    L++  DG  E     SPEH
Sbjct: 757 WQHYTYTMDKEFLRTKAFPAMKTAVDYWFKKLVKAADGTYECPNEWSPEH 806


>gi|427404601|ref|ZP_18895341.1| hypothetical protein HMPREF9710_04937 [Massilia timonae CCUG 45783]
 gi|425716772|gb|EKU79741.1| hypothetical protein HMPREF9710_04937 [Massilia timonae CCUG 45783]
          Length = 764

 Score =  219 bits (557), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 166/591 (28%), Positives = 271/591 (45%), Gaps = 87/591 (14%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYA 85
           +A+PIGNGR+GAMV+G    E L+ N+ TLWTG               D +++       
Sbjct: 46  EALPIGNGRIGAMVFGQPGREHLQFNDITLWTG---------------DDKTM------- 83

Query: 86  EATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVE 145
                           +Q  GD+ +E         +  YRR LDL      V Y+ G V 
Sbjct: 84  --------------GAFQPFGDLLVELPGHESGVTD--YRRTLDLGRGVHTVTYTHGGVR 127

Query: 146 FTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
           + RE ++S P QVIV +++    G  S  VSL      H  V  N ++   G   G  +P
Sbjct: 128 YRREAWASFPAQVIVLRLTADRPGRYSGAVSLTDRHGAHLAV-ANGRLHATGTLAGFALP 186

Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
            +A     P G   S   + ++  D G ++A + +++   G+D   L+L A +S+    +
Sbjct: 187 DQA-----PSGNVMSYASQAQVISDGGKLTA-DGQRIAFAGADGLTLILGAGTSY---VL 237

Query: 266 NPSD--SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT 323
           + +       P +   + +      + + L   H++D+++L  RV+I L  +P       
Sbjct: 238 DAARRFEGGHPLARVTAQVDQAAARAPAALLEEHVEDFRRLMQRVAIDLGETPA------ 291

Query: 324 CSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDL 382
                   +P+  R+ ++ +   DP L    FQ+GRYLL SSSR G+  ANLQG+WN  L
Sbjct: 292 ----ARRALPTDARLLAYTKAGGDPELEAQYFQYGRYLLASSSR-GSLPANLQGLWNNSL 346

Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS----- 437
           +P W++  H NIN++MNYW +   NL E   P FDF+  ++    +     +  +     
Sbjct: 347 TPPWNADYHTNINVQMNYWPAEVTNLGESALPFFDFVNGMAPVWRRATTEEFRRADGQPV 406

Query: 438 -GWVIHHKTDIWAKSSADRGKVVWALW-PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
            GW +  +++ +             LW   G AW   H WEHY +  D  FL + AYP++
Sbjct: 407 RGWTLRTESNPFGAMD--------YLWNKTGNAWYAQHFWEHYAFNRDERFLREVAYPVM 458

Query: 496 EGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAII 555
           +  ++F  D+L    DG L      SPEH  +  DG    V+Y    D  I+ ++F+  +
Sbjct: 459 KEASAFWQDYLKALPDGRLVAPQGWSPEHGPVE-DG----VAY----DQQIVWDLFNNTV 509

Query: 556 SAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRLNTSFSTCK 606
            AA +L  + D L  ++     RL   +I   G ++EW++ + +    T +
Sbjct: 510 EAAGILRVDPD-LRAQLAAMRDRLAGPRIGSWGQLLEWLEEKKDPVLDTPR 559


>gi|451852884|gb|EMD66178.1| glycoside hydrolase family 95 protein [Cochliobolus sativus ND90Pr]
          Length = 805

 Score =  219 bits (557), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 173/597 (28%), Positives = 277/597 (46%), Gaps = 63/597 (10%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVRSLVDSGQ 83
           A+P+GNGRL AM  G   +ETL LN D+LW+G P    +YT  +   ++      +    
Sbjct: 38  ALPVGNGRLAAMPIGSPSAETLTLNLDSLWSGGPFEASNYTGGNPESSIDSTLPGIRDWI 97

Query: 84  YAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV 141
           +   T    KL G   +   Y++L ++ +    S +      Y R+LDL        ++ 
Sbjct: 98  FTNGTGNVTKLLGTNDNYGSYRVLANLTVTIP-SLVGIQVSNYTRKLDLTNGLHSTSFNT 156

Query: 142 GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-----DSLLDNHSYV-NGNNQIIM 195
            + +     F S PDQV V  I  S S   +F + L     D+ L+N + V NG      
Sbjct: 157 NDTQLESTVFCSYPDQVCVYTIQSSRSLP-AFELKLGNELVDAKLENITCVANGTGADSG 215

Query: 196 EGRCPG-KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV---EGSDWAV 251
             R  G  ++ P       P+G+ +  I  +  + D  T        LKV    G+  A 
Sbjct: 216 HVRLRGVTQLGP-------PEGMLYDTIARLLPNSDVKTTCDSNTGILKVTPENGAKSAT 268

Query: 252 LLLVASSSFDGPFINP----SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           +++ A +++D          S    DP       +Q +   +  +L + HL+D+  L  R
Sbjct: 269 VIIGAETNYDMKKGTAEHQYSFRGNDPGPAVEETIQKVSMKTLEELKSSHLEDFTSLTGR 328

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ---TDEDPSLVELLFQFGRYLLISS 364
               L   P  +        N   VP+ E + S+    T  DP +  LLF + +YLLISS
Sbjct: 329 FEFHL---PDPL--------NSAQVPTPELIASYDSNVTSGDPFVESLLFDYAQYLLISS 377

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           SRPG+   NLQG W E ++P W +  H NINL+MNYW +    L+E Q PL+D++    +
Sbjct: 378 SRPGSLPTNLQGRWTEQMAPDWSADYHANINLQMNYWTADQTGLTETQTPLWDYMINTWV 437

Query: 425 -NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
             G +TA + Y A GWV+H++ +I+  +    G+  WA +P   AW+  H++++++YT D
Sbjct: 438 PRGHETAMLLYGAPGWVVHNEMNIFGHTGMKDGE-GWANYPAAPAWMMLHVFDYWDYTRD 496

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGH------DGYLETNPSTSPEHEFIAPDGKLACVS 537
             +L  + YPL++  A F   WL + H      D  L  NP +SPEH    P     C  
Sbjct: 497 TTWLRTQGYPLIKSVAQF---WLSQLHADSFTNDNTLVVNPCSSPEH---GPT-TFGCAH 549

Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
           Y       +I +VF A+++   +  +++ +    +  +L RL +   +     I EW
Sbjct: 550 YQQ-----LIHQVFEAVLTTHSLAGESDTSFTSNISSTLSRLDKGFHVGSWSQIKEW 601


>gi|331082986|ref|ZP_08332105.1| hypothetical protein HMPREF0992_01029 [Lachnospiraceae bacterium
           6_1_63FAA]
 gi|330399723|gb|EGG79384.1| hypothetical protein HMPREF0992_01029 [Lachnospiraceae bacterium
           6_1_63FAA]
          Length = 1760

 Score =  218 bits (555), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 170/594 (28%), Positives = 273/594 (45%), Gaps = 58/594 (9%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS- 77
           +PIGN  +GA V+G +  E L  N+ TLW G P         G+    D  + +SDV   
Sbjct: 75  LPIGNSFMGANVYGEIGQERLTFNQKTLWNGGPSENRPDYDGGNKETADNGQKMSDVYKE 134

Query: 78  ---LVDSGQYAEATAASVKLFG--HPADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLN 131
              L   G  A+A   + KL G  +    YQ  GDI ++F    LK  + E Y R+L+L 
Sbjct: 135 IIELYKEGNDAQANELAKKLTGEVNGYGAYQSWGDIYVDFG---LKEEQAENYVRDLNLE 191

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
            A A V +   + +  RE+F S PD V+  K +   S  L F++S    +DN   V    
Sbjct: 192 NAVASVDFDYQDTKMHREYFISYPDNVLAMKFTAEGSEKLDFDISFP--IDNAEGVADKK 249

Query: 192 -QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
               +E       I       D+    Q     ++K+  + G +   +  KL V G+  A
Sbjct: 250 LGKSVETTVEDDTITVSGEMQDN----QLQLNGKLKVETEGGKVQEKDGDKLHVSGASEA 305

Query: 251 VLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
           V+ + A + +    P     ++ ++  +    A+       Y  +   H+ DY ++F RV
Sbjct: 306 VVYVSADTDYLNKYPDYRTGETAQELDASVERAVDKASKKGYEKVKKEHIKDYSEIFSRV 365

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
            + L ++  D  TD      +    + +  ++    E+ +L  +LFQ+GRYL I+SSR G
Sbjct: 366 QLDLGQNVPDKTTDIL----LKDYNAGKNTEA----ENRALEVILFQYGRYLTIASSRAG 417

Query: 369 TQVANLQGIWNEDLSPT----WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
              +NLQG+W   +       W S  H+N+NL+MNYW +   N++EC  PL D++  L  
Sbjct: 418 DLPSNLQGVWQNRVGDHNRIPWASDYHMNVNLQMNYWPTYSTNMAECATPLIDYINSLVE 477

Query: 425 NGSKTAQVNY-LASGWVIHHKTDI---WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            G  TA+  + + +G    H  +    W     D     W   P    W+  + WE+Y Y
Sbjct: 478 PGKVTAKTYFGVENGGFTAHTQNTPFGWTCPGWD---FSWGWSPAALPWILQNCWEYYEY 534

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYS 539
           T D  ++E+  YP+L+  A      LIE    G L + P+ SPEH           V+  
Sbjct: 535 TGDVKYMEEHIYPMLKEAALLYDQILIEDEKTGRLVSAPAYSPEH---------GPVTAG 585

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +T + ++I +++    +AAE+L K+E+   E   +   +L+P +I E G I EW
Sbjct: 586 NTYEQSLIWQLYEDAATAAEILSKDEEKAKEWRQRQ-QKLKPIEIGESGQIKEW 638


>gi|225018139|ref|ZP_03707331.1| hypothetical protein CLOSTMETH_02076 [Clostridium methylpentosum
           DSM 5476]
 gi|224949136|gb|EEG30345.1| hypothetical protein CLOSTMETH_02076 [Clostridium methylpentosum
           DSM 5476]
          Length = 1556

 Score =  218 bits (554), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 165/617 (26%), Positives = 288/617 (46%), Gaps = 72/617 (11%)

Query: 11  NPLKITFNGPAKHFT-DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN---- 65
           N L++ +  PA ++T D + IGNG  G +++ GV  + +  NE TLW G PG  +N    
Sbjct: 57  NTLRMWYTKPASNWTNDCLVIGNGSTGGVLFSGVGRDRVHFNEKTLWNGGPGSVSNYNGG 116

Query: 66  ----PDAPKALSDVRSLVD---SGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSH 116
               P   + L  +R   D   +  +   T       G+ + +  YQ  GD+ L+F  + 
Sbjct: 117 NRTIPTTKEQLDAIREQADDHSTSVFPLGTGGVRDFMGNGSGMGQYQDFGDLYLDFSKTG 176

Query: 117 LKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
           +  A  T Y R+LD+ TA + + Y    V + RE+F S+PD+V+  +++ SE+G L+F+ 
Sbjct: 177 MTDANATNYVRDLDMRTAVSSLNYDYDGVHYEREYFVSHPDKVMAVRLTASEAGKLTFDA 236

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
           S          V   + +         RI       ++    +  A    ++ ++ GT++
Sbjct: 237 S----------VAAASGLTTTATAQDGRITLAGTVRNNGMKCEMQA----QVINEGGTLT 282

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
           + +D  + VEG+D   ++L   + +   +  P+    DP  E  + + +    SY +L  
Sbjct: 283 SNDDGTVSVEGADAVTIVLTTGTDYANDW--PTYRTDDPHDELTATVDAAAAKSYQELKD 340

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV-ELLF 354
            HL DYQ+LF R+ I L           C +     VP+ E +K+++  E      E+++
Sbjct: 341 AHLADYQELFSRLEIDLGGE--------CPQ-----VPTDEMMKAYRRGETSHAAEEMVY 387

Query: 355 QFGRYLLISSSRPGTQV-ANLQGIW-NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           QFGRYL I+ SR G ++  NL G+W        W +  H N+N++MNYW +   NL+EC 
Sbjct: 388 QFGRYLTIAGSREGDELPTNLCGLWLIGSAGSYWGADFHFNVNVQMNYWPAYQTNLAECG 447

Query: 413 EPLFDFLTYLSINGSKTAQVNYL-----------ASGWVIHHKTDIWAKSSADRGKVVWA 461
               D++  L   G  TA  +              +G++++ + + +   +A  G   + 
Sbjct: 448 SVFTDYMESLVEPGRVTAGASAALPTEPGTPIGEGNGFLVNTQNNPFG-CTAPFGSQEYG 506

Query: 462 LWPMGG-AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPS 519
            W +GG +W   ++++ Y YT D++ L+ + YP+L+  A+F   +L    + G L   PS
Sbjct: 507 -WNIGGSSWALQNVYDQYLYTGDKELLKNKIYPMLKEQANFWNQFLWYSDYQGRLVVGPS 565

Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
            S E                +T D +I+ E++   I A+E+L  +ED       K   +L
Sbjct: 566 VSAEQ---------GPTVNGTTYDQSIVWELYKMAIEASEILGVDEDQRAVWEDKQ-SQL 615

Query: 580 RPTKIAEDGSIMEWVQR 596
            P  I   G + EW + 
Sbjct: 616 NPIIIGSQGQVKEWYEE 632


>gi|288803110|ref|ZP_06408545.1| fibronectin type III domain protein [Prevotella melaninogenica D18]
 gi|288334371|gb|EFC72811.1| fibronectin type III domain protein [Prevotella melaninogenica D18]
          Length = 1163

 Score =  217 bits (553), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 152/526 (28%), Positives = 248/526 (47%), Gaps = 72/526 (13%)

Query: 11  NPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           N   + +  PA ++ T  +PIGNG+ GA + G V  + ++ N+ TLW+G  G  T     
Sbjct: 341 NKYTLWYTKPATNWMTSCLPIGNGQFGATLMGDVAIDDVQFNDKTLWSGKLGGLT----- 395

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
                            +TAA    +G+    Y   G++ +    S        Y R LD
Sbjct: 396 -----------------STAA----YGY----YLNFGNLYIR---SRGMSKVTDYVRYLD 427

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD-NHSY-V 187
           +N A A V+Y++  V ++R +F+SNPD  +V + + S++G ++  ++L +    N SY V
Sbjct: 428 INDAVAGVRYTMDGVAYSRTYFASNPDSCVVVRYTASQNGKINTTLTLKNQNGRNVSYTV 487

Query: 188 NGNNQ--IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
           + NNQ  I  +G+         A  +D       S     +I  D GTI+      ++V 
Sbjct: 488 DNNNQATITFDGQI--------ARQDDHGATTPESYYCVARIVTDGGTITKNAKGVIEVN 539

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
           G++   + L   + FD              + + + +   +N  Y  L+  H  DY+ LF
Sbjct: 540 GANSMTVYLRGLTDFDPDAPTYVSGANLLAARAAATVNGAQNKGYDALFAAHKTDYKSLF 599

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLIS 363
            R  + L     +I             P+ + + S++ ++  +L   EL F +GRYLLIS
Sbjct: 600 DRCQLTLGDVKNNI-------------PTPQLISSYRNNQHDNLFLEELYFNYGRYLLIS 646

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSR  +  ANLQGIWN++ +P W +  H NIN++MNYW + P NLSE   P  D++ Y  
Sbjct: 647 SSRGISLPANLQGIWNDNNTPAWHADIHANINVQMNYWPAEPTNLSELHRPFLDYI-YRE 705

Query: 424 INGSKTAQ-----VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
                T +     + ++ +GW +  + +I+       G      + +  AW C HLW+HY
Sbjct: 706 ACVKPTWRRFAPDMGHVNTGWTLPTENNIYGS-----GTTFANTYTVANAWYCQHLWQHY 760

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
            YTMD+DFL  +A+P ++    +    L++  DG  E     SPEH
Sbjct: 761 TYTMDKDFLRTKAFPAMKSAVDYWFKKLVKAADGTYECPNEWSPEH 806


>gi|429860747|gb|ELA35469.1| glycoside hydrolase family 95 protein [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 797

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 177/588 (30%), Positives = 274/588 (46%), Gaps = 64/588 (10%)

Query: 29  PIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAPK--ALSDVRSLVDS 81
           P+GNG+LGA+ +G   SE + LN D+LW G P    +YT  NP  PK  AL ++R+ +  
Sbjct: 44  PVGNGKLGAIPFGPPGSEKVNLNIDSLWAGGPFGASNYTGGNPTEPKYEALPEIRATI-- 101

Query: 82  GQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY 139
             +   T     L G   D    ++L ++ +        Y++  YRR LDL T     K+
Sbjct: 102 --FENGTGDVSPLLGVGDDYGSNRVLANLTVNIQGIS-DYSD--YRRTLDLKTGVHTTKF 156

Query: 140 SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN---GNNQIIME 196
           +     F   HF S PDQV V  I+ SE    +  V  ++ L      N   G++ +   
Sbjct: 157 TANGAAFEISHFCSYPDQVCVYHIA-SEGALPAVEVGYENQLVEQDTFNVSCGDDHVRFA 215

Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
           G    +  PP+    D    I   A +    S +  T++  +D+K          +++  
Sbjct: 216 GLT--QLGPPEGMKFDSIARINKGAAITNCTSANFLTVTPEKDQKA-------LTIIIGG 266

Query: 257 SSSFDGPFINP----SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
            +++D    N     S    DP            + S+  +   H+ DYQKL     + L
Sbjct: 267 ETNYDQKNGNAESDYSFKGGDPGPIVEKTTSDAASKSFHTILKDHIADYQKLESACELNL 326

Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDE-DPSLVELLFQFGRYLLISSSRPGTQ 370
                    DT   E  +T    + +  +  TD  DP +  LLF + RYLLI+SSR  + 
Sbjct: 327 P--------DTQGSEEKET---GQLISDYVYTDGGDPYVEALLFDYSRYLLITSSRANSL 375

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-NGSKT 429
            ANLQG W E L P W +  H NIN++MNYW +    L E Q  L+D++    +  G++T
Sbjct: 376 PANLQGRWTEQLWPAWSADYHANINIQMNYWAADQTGLGETQTALWDYMEDTWVPRGAET 435

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A++ Y ASGWV+H++ + +  ++   G   WA +P   AW+  H+W+++ YT D ++  +
Sbjct: 436 AKLLYNASGWVVHNEMNTFGHTAMKEGS-SWANYPAAAAWMMQHVWDNFEYTQDLEWFIR 494

Query: 490 RAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           + YPL++G A F L  L E    +DG L  NP  SPEH    P     C  Y       +
Sbjct: 495 QGYPLIKGVAEFWLSQLQEDLYFNDGTLVVNPCNSPEH---GPT-TFGCTHYHQ-----M 545

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
           I +VF A++  A  +       +E V  +L RL +   + E G + EW
Sbjct: 546 IHQVFEAVLHGATFVSTK---FIEDVPPNLNRLDKGVHVTEWGGLKEW 590


>gi|359406206|ref|ZP_09198915.1| hypothetical protein HMPREF0673_02149 [Prevotella stercorea DSM
           18206]
 gi|357556624|gb|EHJ38213.1| hypothetical protein HMPREF0673_02149 [Prevotella stercorea DSM
           18206]
          Length = 1013

 Score =  216 bits (550), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 172/568 (30%), Positives = 264/568 (46%), Gaps = 91/568 (16%)

Query: 8   STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
           +TT  L     G +     A+PIG+G+ GA ++GGV  + ++ NE TLW+G P       
Sbjct: 216 ATTAKLYSGGQGYSNWMEYALPIGDGQFGACLFGGVYRDEIQFNEKTLWSGTP------- 268

Query: 68  APKALSDVRSLVDSGQYAEATAASVKLFGHPADVY--QLLGDIELEFDDSHLKYAEETYR 125
                   RS      Y +        + +   +Y   L G+  L  D      A   Y 
Sbjct: 269 -------ARSSQGGKGYGK--------YENFGSIYAKDLSGEFGLTTDK-----AASNYV 308

Query: 126 RELDLNTATARVKY-SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLD 182
           R LDL TAT +  + S   VE+TRE+ +SNP +V+V   + S+ G LSF  ++   S+  
Sbjct: 309 RLLDLTTATGKTMFKSAAGVEYTREYIASNPARVVVAHYTASKGGKLSFRFTMAAGSITA 368

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
           + +Y +G      EG   GK      NA              +K+    GT++  +D+ +
Sbjct: 369 DPTYADG------EGTFSGKLETISYNA-------------RMKVVPVGGTMTT-DDEGI 408

Query: 243 KVEGSDWAVLLLVASSSFDG---PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
           +V G+D  +++L   + FD     +   + +     S+ ++A  +    S+ DLY  H+ 
Sbjct: 409 EVIGADEIMVVLGGGTDFDAYESTYTKNTSALAQTISDRVAAAAA---KSWKDLYAEHVA 465

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           DYQ  F+R    L+ +  D+ T+      IDT  S     +        L +L F +GRY
Sbjct: 466 DYQSFFNRCEFDLAGTKNDMTTNRL----IDTYNSGRGADALM------LEQLYFAYGRY 515

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           L ISSSR     +NLQGIWN      W+S  H NIN++MNYW + P NLSE   P   FL
Sbjct: 516 LEISSSRGVDSPSNLQGIWNNINGVAWNSDIHSNINVQMNYWPAEPTNLSEMHLP---FL 572

Query: 420 TYLSINGSKTAQVNYLAS------GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
            Y+     K  Q    A       GW    + +I+   SA +   V     +  AW  TH
Sbjct: 573 NYIWAMAEKQPQWKQWAKLQGQDRGWTCFTENNIFGGVSAFKNNYV-----IANAWYTTH 627

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
           LW+HY YT+DR++L KR +P +   + F +D L    DG  E     SPEH   + +G  
Sbjct: 628 LWQHYRYTLDREYL-KRVFPAMLSASQFWMDRLKLASDGTYECPNEWSPEHGPESENG-- 684

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVL 561
             V+++  +    + ++FS  ++A +VL
Sbjct: 685 --VAHAQQL----VYDLFSNTLAAIDVL 706


>gi|317500980|ref|ZP_07959190.1| hypothetical protein HMPREF1026_01133 [Lachnospiraceae bacterium
           8_1_57FAA]
 gi|316897683|gb|EFV19744.1| hypothetical protein HMPREF1026_01133 [Lachnospiraceae bacterium
           8_1_57FAA]
          Length = 1966

 Score =  216 bits (549), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 180/632 (28%), Positives = 300/632 (47%), Gaps = 87/632 (13%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY----------TNPDAPKALSDVR 76
           A+P+GN  +GA V+GGV +E ++LNE +LW+G P D           +     K ++ ++
Sbjct: 68  ALPLGNSAIGASVFGGVQTERIQLNEKSLWSGGPSDSRPEYNGGNIESKGQNGKVMAQLK 127

Query: 77  SLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEETYRRELD 129
             + SGQ  ++  A  +L G   D        Y   G++ L+F +   K     Y R+LD
Sbjct: 128 EKLKSGQGFDSNLAG-QLIGVSDDAGVQGYGYYLSYGNMYLDFKNV-TKNNVSGYSRDLD 185

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L TA A V Y +    +TRE+F S PD V+VT+++ ++ G+L F+V ++    +      
Sbjct: 186 LRTAVAGVNYDLNGAHYTRENFVSYPDNVLVTRLTATDGGTLDFDVRVEP---DEEKGGS 242

Query: 190 NNQIIME--GRCPGKRIPPKANANDDP---KGIQFSAILEIKISDDRGTISALED--KKL 242
            NQ   +   R   K++   A A D       ++FS+  ++ I DD GT   ++D  K  
Sbjct: 243 QNQPGADSYARTFDKKVSDNAIAIDGQLTDNQLKFSSYTKV-IKDD-GTAGQIKDDSKNG 300

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL---------QSIRNLSYSDL 293
           K+  S    + ++ S   D     P   +   T E ++AL           ++   Y  L
Sbjct: 301 KITVSGAKAITIITSIGTDYKNDYPK-YRTGETKEQLAALVKGYVSGAEAKVKAGGYETL 359

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H++DY  +F R+ + + ++  D  TD   E        A +  +    E   L  +L
Sbjct: 360 KEDHVNDYDHIFGRLDLNIGQAVSDKTTDKLLE--------AYKKGTASETEKRYLELML 411

Query: 354 FQFGRYLLISSSRP-------------GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
           FQ+GRYL + SSR               T  +NLQGIW    +  W S  H+N+NL+MNY
Sbjct: 412 FQYGRYLTMGSSRETPVNEDGTKNERRATLPSNLQGIWVGANNSAWHSDYHMNVNLQMNY 471

Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---------SGWVIHHKTDIWAKS 451
           W +   N++EC EPL D++  L   G  TA++ Y           +G++ H + + +  +
Sbjct: 472 WPTYTTNMAECAEPLIDYVDSLREPGRITAKI-YAGVESTEANPENGFMAHTQNNPYGWT 530

Query: 452 SADRGKVV-WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH 510
           +   G V  W   P G  W+  + WE+Y +T D ++++   YP+++  A+     L+  +
Sbjct: 531 NP--GWVFDWGWSPAGVPWILQNCWEYYEFTGDTEYMQTHIYPMMKEEATLYDQMLMRDN 588

Query: 511 DGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
           DG L + PS SPEH            +  +T + ++I +++   I+AAE L  +E A V 
Sbjct: 589 DGKLVSVPSYSPEH---------GPRTAGNTYEHSLIWQLYEDTITAAETLGVDE-AKVA 638

Query: 571 KVLKSLPRLR-PTKIAEDGSIMEWV-QRRLNT 600
           +  K+   L+ P ++   G I EW  +  LNT
Sbjct: 639 QWKKNQADLKGPIEVGASGQIKEWYNETTLNT 670


>gi|336439275|ref|ZP_08618890.1| hypothetical protein HMPREF0990_01284 [Lachnospiraceae bacterium
           1_1_57FAA]
 gi|336016192|gb|EGN45981.1| hypothetical protein HMPREF0990_01284 [Lachnospiraceae bacterium
           1_1_57FAA]
          Length = 1977

 Score =  215 bits (548), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 180/632 (28%), Positives = 300/632 (47%), Gaps = 87/632 (13%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY----------TNPDAPKALSDVR 76
           A+P+GN  +GA V+GGV +E ++LNE +LW+G P D           +     K ++ ++
Sbjct: 68  ALPLGNSAIGASVFGGVQTERIQLNEKSLWSGGPSDSRPEYNGGNIESKGQNGKVMAQLK 127

Query: 77  SLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEETYRRELD 129
             + SGQ  ++  A  +L G   D        Y   G++ L+F +   K     Y R+LD
Sbjct: 128 EKLKSGQGFDSNLAG-QLIGVSDDAGVQGYGYYLSYGNMYLDFKNV-TKNNVSGYSRDLD 185

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L TA A V Y +    +TRE+F S PD V+VT+++ ++ G+L F+V ++    +      
Sbjct: 186 LRTAVAGVNYDLNGAHYTRENFVSYPDNVLVTRLTATDGGTLDFDVRVEP---DEEKGGS 242

Query: 190 NNQIIME--GRCPGKRIPPKANANDDP---KGIQFSAILEIKISDDRGTISALED--KKL 242
            NQ   +   R   K++   A A D       ++FS+  ++ I DD GT   ++D  K  
Sbjct: 243 QNQPGADSYARTFDKKVSDNAIAIDGQLTDNQLKFSSYTKV-IKDD-GTAGQIKDDSKNG 300

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL---------QSIRNLSYSDL 293
           K+  S    + ++ S   D     P   +   T E ++AL           ++   Y  L
Sbjct: 301 KITVSGAKAITIITSIGTDYKNDYPK-YRTGETKEQLAALVKGYVSGAEAKVKAGGYETL 359

Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
              H++DY  +F R+ + + ++  D  TD   E        A +  +    E   L  +L
Sbjct: 360 KEDHVNDYDHIFGRLDLNIGQAVSDKTTDKLLE--------AYKKGTASETEKRYLELML 411

Query: 354 FQFGRYLLISSSRP-------------GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
           FQ+GRYL + SSR               T  +NLQGIW    +  W S  H+N+NL+MNY
Sbjct: 412 FQYGRYLTMGSSRETPVNEDGTKNERRATLPSNLQGIWVGANNSAWHSDYHMNVNLQMNY 471

Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---------SGWVIHHKTDIWAKS 451
           W +   N++EC EPL D++  L   G  TA++ Y           +G++ H + + +  +
Sbjct: 472 WPTYTTNMAECAEPLIDYVDSLREPGRITAKI-YAGVESTEANPENGFMAHTQNNPYGWT 530

Query: 452 SADRGKVV-WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH 510
           +   G V  W   P G  W+  + WE+Y +T D ++++   YP+++  A+     L+  +
Sbjct: 531 NP--GWVFDWGWSPAGVPWILQNCWEYYEFTGDTEYMQTHIYPMMKEEATLYDQMLMRDN 588

Query: 511 DGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
           DG L + PS SPEH            +  +T + ++I +++   I+AAE L  +E A V 
Sbjct: 589 DGKLVSVPSYSPEH---------GPRTAGNTYEHSLIWQLYEDTITAAETLGVDE-AKVA 638

Query: 571 KVLKSLPRLR-PTKIAEDGSIMEWV-QRRLNT 600
           +  K+   L+ P ++   G I EW  +  LNT
Sbjct: 639 QWKKNQADLKGPIEVGASGQIKEWYNETTLNT 670


>gi|260589559|ref|ZP_05855472.1| putative fibronectin type III domain protein [Blautia hansenii DSM
           20583]
 gi|260540127|gb|EEX20696.1| putative fibronectin type III domain protein [Blautia hansenii DSM
           20583]
          Length = 1719

 Score =  214 bits (546), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 169/600 (28%), Positives = 275/600 (45%), Gaps = 70/600 (11%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS- 77
           +PIGN  +GA V+G +  E L  N+ TLW G P         G+    D  + +S+V   
Sbjct: 75  LPIGNSFMGANVYGEIGEERLTFNQKTLWNGGPSESRPNYDGGNKETADNGQKMSEVYKE 134

Query: 78  ---LVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE-ETYRRELDLN 131
              L   G   +A   + KL G       YQ  GDI ++F    LK  + E Y R+L+L 
Sbjct: 135 IIKLYKEGNDTQANELAKKLTGEVEGYGAYQSWGDIYVDFG---LKEEQAENYVRDLNLE 191

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
            A A V +   + +  RE+F S PD V+  K +   +  L F++S    +DN   V    
Sbjct: 192 NAVASVDFDYQDTKMHREYFISYPDNVLAMKFTADGNEKLDFDISFP--IDNAEGV---- 245

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGI-------QFSAILEIKISDDRGTISALEDKKLKV 244
                 +  GK +  K    DD   +       Q     ++K+  + G +   +  KL V
Sbjct: 246 ----ADKKLGKSV--KTTVEDDMITVSGEMQDNQLKLNGKLKVETEGGKVQEKDGDKLHV 299

Query: 245 EGSDWAVLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            G+  AV+ + A + +    P     ++ ++  +    A+       Y  +   H+ DY 
Sbjct: 300 SGASEAVVYVSADTDYLNKYPDYRTGETAQELDASVEKAVDKASKKGYEKVKKEHIKDYS 359

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
           ++F RV + L ++  +  TD      ++   + +  ++    E+ +L  +LFQ+GRYL I
Sbjct: 360 EIFSRVQLDLGQNVPEKTTDIL----LNDYNAGKNTEA----ENRALEVILFQYGRYLTI 411

Query: 363 SSSRPGTQVANLQGIWNEDLSPT----WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           +SSR G   +NLQG+W   +       W S  H+N+NL+MNYW +   N++EC  PL D+
Sbjct: 412 ASSRAGDLPSNLQGVWQNRVGDHNRIPWASDYHMNVNLQMNYWPTYSTNMAECATPLIDY 471

Query: 419 LTYLSINGSKTAQVNY-LASGWVIHHKTDI---WAKSSADRGKVVWALWPMGGAWLCTHL 474
           +  L   G  TA+  + + +G    H  +    W     D     W   P    W+  + 
Sbjct: 472 INSLVEPGKVTAKTYFGVENGGFTAHTQNTPFGWTCPGWD---FSWGWSPAALPWILQNC 528

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKL 533
           WE+Y YT D  ++E+  YP+L+  A      LIE    G L + P+ SPEH         
Sbjct: 529 WEYYEYTGDVKYMEEHIYPMLKEAALLYDQILIEDEKTGRLVSAPAYSPEH--------- 579

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
             V+  +T + ++I +++    +AAE+L K+ED   E   +   +L+P +I E G I EW
Sbjct: 580 GPVTAGNTYEQSLIWQLYEDAATAAEILGKDEDKAKEWRQRQ-EKLKPIEIGESGQIKEW 638


>gi|153816042|ref|ZP_01968710.1| hypothetical protein RUMTOR_02288 [Ruminococcus torques ATCC 27756]
 gi|331089120|ref|ZP_08338023.1| hypothetical protein HMPREF1025_01606 [Lachnospiraceae bacterium
           3_1_46FAA]
 gi|145846689|gb|EDK23607.1| LPXTG-motif cell wall anchor domain protein [Ruminococcus torques
           ATCC 27756]
 gi|330405897|gb|EGG85423.1| hypothetical protein HMPREF1025_01606 [Lachnospiraceae bacterium
           3_1_46FAA]
          Length = 1966

 Score =  214 bits (545), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 176/636 (27%), Positives = 297/636 (46%), Gaps = 81/636 (12%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY----------TNPDAPKALSDVR 76
           A+P+GN  +GA V+GGV +E ++LNE +LW+G P D           +     K ++ ++
Sbjct: 68  ALPLGNSAIGASVFGGVQTERIQLNEKSLWSGGPSDSRPEYNGGNIESKGQNGKVMAQLK 127

Query: 77  SLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEETYRRELD 129
             + SGQ  ++  A  +L G   D        Y   G++ L+F +   K     Y R+LD
Sbjct: 128 EKLKSGQGFDSNLAG-QLIGVSDDAGVQGYGYYLSYGNMYLDFKNV-TKNNVSGYSRDLD 185

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L TA A V Y +    +TRE+F S PD V+VT+++ ++ G+L F+V ++   +     N 
Sbjct: 186 LRTAVAGVNYDLNGAHYTRENFVSYPDNVLVTRLTATDGGTLDFDVRVEPDEEKGGSQN- 244

Query: 190 NNQIIMEGRCPGKRIPPKANANDDP---KGIQFSAILEIKISDDRGTISALED--KKLKV 244
             +     R   K++   A A D       ++FS+  ++ I DD GT   ++D  K  K+
Sbjct: 245 KPEADSYARTFDKKVSDNAIAIDGQLTDNQLKFSSYTKV-IKDD-GTAGQIKDDSKNGKI 302

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL---------QSIRNLSYSDLYT 295
             S    + ++ S   D     P   +   T E ++AL           ++   Y  L  
Sbjct: 303 TVSGAKAITIITSIGTDYKNDYPK-YRTGETKEQLAALVKGYVSGAEAKVKAGGYETLKE 361

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
            H++DY  +F R+ + + ++  D  TD   E        A +  +    E   L  +LFQ
Sbjct: 362 DHVNDYDHIFGRLDLNIGQAVSDKTTDKLLE--------AYKKGTASETEKRYLELMLFQ 413

Query: 356 FGRYLLISSSRP-------------GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
           +GRYL + SSR               T  +NLQGIW    +  W S  H+N+NL+MNYW 
Sbjct: 414 YGRYLTMGSSRETPVNEDGTKNERRATLPSNLQGIWVGANNSAWHSDYHMNVNLQMNYWP 473

Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---------SGWVIHHKTDIWAKSSA 453
           +   N++EC EPL D++  L   G  TA++ Y           +G++ H + + +  ++ 
Sbjct: 474 TYTTNMAECAEPLIDYVDSLREPGRITAKI-YAGVESTEANPENGFMAHTQNNPYGWTNP 532

Query: 454 DRGKVV-WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
             G V  W   P G  W+  + WE+Y +T D ++++   YP+++  A+     L+   +G
Sbjct: 533 --GWVFDWGWSPAGVPWILQNCWEYYEFTGDTEYMQTHIYPMMKEEATLYDQMLMRDSEG 590

Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
            L + PS SPEH            +  +T + ++I +++   I+AAE L  +E  + +  
Sbjct: 591 KLVSVPSYSPEH---------GPRTAGNTYEHSLIWQLYEDTITAAETLGVDEAKVAQWK 641

Query: 573 LKSLPRLRPTKIAEDGSIMEWV-QRRLNTSFSTCKL 607
                   P +I + G I EW  +  LNT  +  K+
Sbjct: 642 QNQADLKGPIEIGDSGQIKEWYNETTLNTDENGQKM 677


>gi|325269425|ref|ZP_08136042.1| fibronectin type III domain protein [Prevotella multiformis DSM
           16608]
 gi|324988346|gb|EGC20312.1| fibronectin type III domain protein [Prevotella multiformis DSM
           16608]
          Length = 847

 Score =  214 bits (545), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 141/511 (27%), Positives = 230/511 (45%), Gaps = 69/511 (13%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
            T  +P+GNG+ GA V G +  + ++ N+ TLW+G  G  T                   
Sbjct: 85  MTSCLPVGNGQFGATVMGQIVVDDVQFNDKTLWSGKLGGLT------------------- 125

Query: 84  YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
                  S   +G     Y   G++ +    S        Y R LD+N A A V++S+  
Sbjct: 126 -------STAAYGS----YLNFGNLLIR---SRGMKGVTDYVRYLDINDAVAGVRFSMDG 171

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH-SYV---NGNNQIIMEGRC 199
           V ++R +F+SNPD  +V + + +  G ++  ++L     +H SY     G   I  +G+ 
Sbjct: 172 VGYSRTYFASNPDSCVVIRYTATRGGMINTTLALKDQNGSHVSYTVDGPGRATITFDGQV 231

Query: 200 PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSS 259
                      ND+ +    S     +I  D GT++   +  ++V  ++   + L   + 
Sbjct: 232 --------GRQNDEGEATPESYCCAARIVADGGTVTKNAEGLVEVSDANSMTVYLRGLTD 283

Query: 260 FDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
           FD          +     +M+A+   R   Y  L   H  DY+ LF R  + L  +  D 
Sbjct: 284 FDAAAPEYVSGTEQLAGRAMAAVDGARRKGYDALLAAHKADYKSLFDRCLLTLCSTGSD- 342

Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLISSSRPGTQVANLQGI 377
                       VP+ + +  ++ D   +L   EL F +GRYLLISSSR  +  ANLQGI
Sbjct: 343 ------------VPTPQLISGYRADPQGNLFLEELYFSYGRYLLISSSRGVSLPANLQGI 390

Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK----TAQVN 433
           WN   +P W +  H NIN++MNYW + P NLSE   P  D++   +            + 
Sbjct: 391 WNNSNAPAWHADIHANINVQMNYWPAEPTNLSELHRPFLDYIYREACVKPAWRRFARDMG 450

Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
            + +GW +  + +I+       G      + +  AW C HLW+HY YT+DR++L ++A+P
Sbjct: 451 KVDAGWTLPTENNIYGS-----GTTFANTYTVANAWYCQHLWQHYAYTLDREYLRRQAFP 505

Query: 494 LLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
           +++    + L  L++G DG  E     SPEH
Sbjct: 506 VMKSAVDYWLRKLVKGADGTYECPEEWSPEH 536


>gi|46140003|ref|XP_391692.1| hypothetical protein FG11516.1 [Gibberella zeae PH-1]
          Length = 798

 Score =  214 bits (545), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 172/618 (27%), Positives = 308/618 (49%), Gaps = 64/618 (10%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
           +++A+   ++ P   T  G A++      P+GNG+LGA+ +G    E + LN D+LW+G 
Sbjct: 16  LVSAKELWSSKPASYTKQGSAEYLLRTGYPVGNGKLGAIHFGPPGREKINLNVDSLWSGG 75

Query: 60  PGD---YT--NPDAPK--ALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIEL 110
           P +   YT  NP +PK   L  +R  +    +  AT    +L G  +     ++LG++ +
Sbjct: 76  PFEVDGYTGGNPSSPKFQYLPAIRDRI----FTNATGEMEELMGSGSHFGSNRVLGNLTI 131

Query: 111 EFDDSHLKYAEETYRRELDLNTATARVKYSV--GNVEFTREHFSSNPDQVIVTKISGSES 168
           +FD    +Y++  YRR LD+ T      ++   G  +F    F S  DQV V  +  + +
Sbjct: 132 QFDGLD-EYSD--YRRSLDMKTGIYETSFASKDGGSKFISSVFCSYSDQVCVYFLK-ANT 187

Query: 169 GSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP-GKRIPPKANANDDPKGIQFSAILEIKI 227
              +  + +++ L          Q +++  C  G  +         P+G++++A L +  
Sbjct: 188 RLPNIKIGIENKL--------VKQDLIKTTCKNGMALHTGMTQTGPPEGMKYAAALSVDR 239

Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDS----KKDPTSESMSAL 282
           S   GT++ L D ++ V+  +  + +   A +++D    N  D       DP      A 
Sbjct: 240 S--LGTVTCLNDGQIIVKPKNKRMAIFWAAETNYDQKAGNTDDGWAFKGPDPVPRVKKAS 297

Query: 283 QSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ 342
           ++     Y+ L   H++D++KL    ++ L         DT + ++++T   A+ +++++
Sbjct: 298 KTAATKGYAKLRKVHVEDFKKLEEAFTLNLP--------DTQNSKDVET---ADLIQAYK 346

Query: 343 TDE--DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
            D   DP L  +LF   RYLLI+SSR  +  ANLQG W E L   W +  H NINL+MNY
Sbjct: 347 YDGPGDPFLEGILFDLSRYLLITSSRENSLPANLQGRWTELLQAAWGADYHANINLQMNY 406

Query: 401 WQSLPCNLSECQEPLFDFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV 459
           W +    L+  Q+ +++++T   +  G++TA++ Y A+GWV+H++ +I+   +A +    
Sbjct: 407 WVADQTGLAATQKSVWNYMTDTWVPRGTETAKLLYNATGWVVHNEMNIFGH-TAMKEVAG 465

Query: 460 WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLET 516
           WA +P+  AW+  H+W+ ++YT D+ +L  + YPL++G A F +  L E     DG L  
Sbjct: 466 WANYPVAPAWMMQHVWDAFDYTQDKKWLSSQGYPLIKGVAEFWVSQLQEDAYTEDGSLVA 525

Query: 517 NPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
            P  S E     P     CV Y       +I +V  + + AA+++ + +   V+ V  +L
Sbjct: 526 IPCNSAE---TGPT-TFGCVHYQQ-----LIHQVLDSTLIAADIVSEPDSDFVDSVSSTL 576

Query: 577 PRL-RPTKIAEDGSIMEW 593
            RL +    A  G + EW
Sbjct: 577 KRLDKGLHFASWGGLKEW 594


>gi|374373770|ref|ZP_09631430.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
 gi|373234743|gb|EHP54536.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
          Length = 733

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 172/589 (29%), Positives = 261/589 (44%), Gaps = 84/589 (14%)

Query: 15  ITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           I F  P  K   + +PIGNGRLGAM+ GGV ++T++ NE +LW+G      N D      
Sbjct: 27  IWFAKPGLKWDAEGLPIGNGRLGAMMMGGVANDTIQFNEQSLWSGD----NNWDGAYETG 82

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
           D                      H    Y+  G + + FD      +   YRR L+L   
Sbjct: 83  D----------------------HGFGSYRNFGALVVNFDGDK---SSSGYRRGLNLTDG 117

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
                 ++   ++ RE F+S+PDQV+V + + +++G LS  +SL S     +   GN+  
Sbjct: 118 IYTASLTINKTQYKREAFASHPDQVMVFRYT-AQNGRLSGRISLHSAQGASARATGNSLQ 176

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
                           A   P  +Q++A  ++ +  + GT++ L D +L   G     L 
Sbjct: 177 F---------------AGTMPNQLQYAA--KMLLQQEGGTVTTL-DSQLVFTGCKTLTLY 218

Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
           L A +++  P          P       L +    +Y  L   H+ D+  L     I + 
Sbjct: 219 LDARTNYK-PDYTADWRGAAPRPVIEKELAAALRKTYEQLRAAHIKDFTALAAAAHIDVG 277

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVA 372
            +P  +            +P+  R++ +     DP L E +FQFGRYLLISSSRPG   A
Sbjct: 278 TTPVAL----------RALPTDLRLQKYAAGGADPDLEETVFQFGRYLLISSSRPGGLPA 327

Query: 373 NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQV 432
           NLQG+WN   +P W S  H NIN++MNYW +   NLS C  PL D++   +       + 
Sbjct: 328 NLQGLWNNSNTPPWASDYHNNINIQMNYWAAENTNLSACHIPLIDYIVAQAEPCRIATRK 387

Query: 433 NYLAS--GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
            + A+  GW       I+  +        W       AW   H++EH+ +T DRD+L+K 
Sbjct: 388 AFGAATRGWTARTSQSIFGGNG-------WEWNIPASAWYAHHVFEHWAFTKDRDYLKKT 440

Query: 491 AYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIR 548
           AYP+L+   +F  D L +  DG L      SPEH    P  DG +         D  ++ 
Sbjct: 441 AYPVLKEICNFWEDRLKQLPDGSLVVPNGWSPEH---GPREDGVM--------HDQQLVW 489

Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRR 597
           ++F   + AA+ L   + A   KV     RL P KI + G + EW + R
Sbjct: 490 DLFQNYLDAAKALN-TDPAYQLKVADMQRRLAPNKIGKWGQLQEWQEDR 537


>gi|189208288|ref|XP_001940477.1| alpha-fucosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187976570|gb|EDU43196.1| alpha-fucosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 814

 Score =  213 bits (542), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 170/579 (29%), Positives = 269/579 (46%), Gaps = 59/579 (10%)

Query: 29  PIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDA--PKALSDVRSLVDS 81
           P+GNGRLGAM  G   +ETL LN D+LW+G P    +YT  NP      AL  +R  +  
Sbjct: 41  PLGNGRLGAMPVGPPAAETLTLNLDSLWSGGPFNISNYTGGNPHTLIASALPGIRDWI-- 98

Query: 82  GQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY 139
             +   T     L G   +   YQ+LG++ ++            Y R+LD++T T    +
Sbjct: 99  --FTNGTGNVSALLGSNDNYGSYQVLGNLTVKIPSLSSDIVSN-YTRKLDMSTGTHTTTF 155

Query: 140 SVGNVEFTREHFSSNPDQVIVTKISGSESGSLS-FNVSLDSLL-----DNHSYVNGNNQI 193
                +     F S PDQV V  +  + +G +    V+LD++L      N + V G+   
Sbjct: 156 IANGNDLETTGFCSFPDQVCVYTVQSTGAGDVPPLEVTLDNVLVSPQLQNVTCVEGDTTK 215

Query: 194 IMEGRCPG-KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL----KVEGSD 248
               R  G  ++ P       P+G+++ +I  + +S+    +S  E+  L       G+ 
Sbjct: 216 PAHLRLRGVTQLGP-------PEGMRYDSIARV-VSNSNTDVSCDENTGLLSIAPRSGTK 267

Query: 249 WAVLLLVASSSFDGPF----INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
              +++ A +++D        N S   +DP     +        +   L  RH+DD+  L
Sbjct: 268 SVSIVIGAGTNYDAKKGTAEHNYSFRGEDPALIVEATTLKAATKTLDQLRGRHIDDFTAL 327

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
                + L         D  +     T     R     T  DP L  LL +  RYL ISS
Sbjct: 328 TGLFELSLP--------DPLNSSQTQTSELINRYTVNNTSGDPYLESLLMENSRYLFISS 379

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           SRPG+   NLQG W+E L   W +  H NIN +MN+W S    L++ Q PL+D++T   +
Sbjct: 380 SRPGSLPPNLQGRWSEGLETDWSADYHANINFQMNHWTSDQTGLTDLQSPLWDYMTDTWM 439

Query: 425 -NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
             G++TA + Y A GWV+H++ +I+   +A +    WA +P+  AW+  H+++H++Y+ +
Sbjct: 440 PRGAETATLLYNAPGWVVHNEMNIFGH-TAMKSAAEWANYPIAAAWMMQHVFDHWDYSRN 498

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
             +L K+ YPLL+G A F LD L +     DG L  NP  SPEH          C  Y  
Sbjct: 499 ATWLLKQGYPLLKGVAMFWLDQLQQDGYYKDGSLVVNPCNSPEHGGTT----FGCAHYQQ 554

Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
                +I +VF +I++    +   +   +  +  SL RL
Sbjct: 555 -----LIHQVFHSILAVQPTVADPDTVFLTNLTSSLHRL 588


>gi|323345397|ref|ZP_08085620.1| fibronectin type III domain protein [Prevotella oralis ATCC 33269]
 gi|323093511|gb|EFZ36089.1| fibronectin type III domain protein [Prevotella oralis ATCC 33269]
          Length = 801

 Score =  212 bits (540), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 161/562 (28%), Positives = 259/562 (46%), Gaps = 89/562 (15%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           A+PIGNG+LGAM++GG+  + ++ NE TLWTG                  S  + G Y  
Sbjct: 49  ALPIGNGQLGAMIYGGIRQDIVQFNEKTLWTG------------------SAEERGSYQN 90

Query: 87  ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV--GNV 144
             A  ++  G   D                 +     Y R LDL+ ATA   +S   G+ 
Sbjct: 91  FGALVIENIGGSYD-----------------RRGVYNYYRNLDLSNATAVASWSTADGDT 133

Query: 145 EFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
            +TRE+ +SNP Q +V  +  S   +++    L+ +    +Y  G      EG   GK  
Sbjct: 134 VYTREYIASNPAQCVVIHMKASVPRAINNRFYLNDVHGRETYYQGK-----EGMFAGKLT 188

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
                          S    +K++   GT++   D  + V+ +D  +++L A + ++   
Sbjct: 189 T-------------VSYCARMKVAAVGGTVTTTNDG-IVVKHADEVMVILAAGTDYNA-- 232

Query: 265 INPSDSKKDPT--SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
           + PS         S   + + S  ++ +  LY+RH++DY+  + R  +QL      I TD
Sbjct: 233 VAPSYISHTTLLPSRIKNTVDSAVSMGWQALYSRHVEDYKAFYDRTDLQLGGVTNTIPTD 292

Query: 323 TCSEENIDTVPSAERVKSFQTDEDPSLVE-LLFQFGRYLLISSSRPGTQVANLQGIWNED 381
                 ID        ++++ D    L+E L FQ+GRYLLISSSR      NLQGIWN  
Sbjct: 293 KL----IDGY-----AENYEHDNRYRLIEQLYFQYGRYLLISSSRGIDLPNNLQGIWNNS 343

Query: 382 LSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI---NGSKTAQVNYL-AS 437
             P W    H +IN++MNYW +   NLSE  E L +++  +++        A+V     +
Sbjct: 344 NEPAWQCDMHADINVQMNYWLANSTNLSEMNEKLLNYIYNMALVQPQWKSYARVRLRQQN 403

Query: 438 GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEG 497
           GW    + +I+   +A +     A     GAWLC HLW+HY YT+DR+FL  +A P++  
Sbjct: 404 GWACFTENNIFGHCTAWQNNYCAA-----GAWLCAHLWQHYRYTLDREFLLHKALPVMVS 458

Query: 498 CASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY------SSTMDMAIIREVF 551
              F L+ L++  DG  E     SPEH    P  + A   Y      ++     +++ +F
Sbjct: 459 QCEFWLERLVKATDGTYECPDEYSPEH---GPGTESAPGVYAIKPENATAHAQQLVKYLF 515

Query: 552 SAIISAAEVLEKNEDALVEKVL 573
           SA + A  ++  N+ A V+++ 
Sbjct: 516 SATLKAISIV-GNKAACVDRMF 536


>gi|358368279|dbj|GAA84896.1| similar to glycoside hydrolase family 95 protein [Aspergillus
           kawachii IFO 4308]
          Length = 810

 Score =  211 bits (537), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 178/611 (29%), Positives = 274/611 (44%), Gaps = 87/611 (14%)

Query: 27  AIPIGNGRLG--------------------AMVWGGVPSETLKLNEDTLWTGVPGD---Y 63
           A P+GNGRLG                    AM  G    E + LN D+LW G P +   Y
Sbjct: 38  AFPLGNGRLGGSYFDQTSKGYYGRILKCSLAMPVGSYDKEIVNLNVDSLWRGGPFESPTY 97

Query: 64  T--NPDAPKA--LSDVRSLVDSGQYAEATAASVKLFG-HPA-DVYQLLGDIELEFDD-SH 116
           +  NP+  KA  L  +R  +    +   T     L G +P    YQ+L ++ ++    S 
Sbjct: 98  SGGNPNVSKAGALPGIREWI----FQNGTGNVSALLGEYPYYGSYQVLANLTIDMGQLSD 153

Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNV 175
           +    + YRR LDL++A     +S G     RE F S PD V V K+S + S   ++F +
Sbjct: 154 I----DGYRRNLDLSSAVYSDHFSTGETYIEREAFCSYPDNVCVYKLSSNSSLPGITFGL 209

Query: 176 --SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
              L S   N S  +GN+  +      G+  P          G+ ++A + + +      
Sbjct: 210 ENQLTSPAPNVS-CHGNSISLY-----GQTYPVI--------GMIYNARVTVVVPGSSNA 255

Query: 234 ISALEDKKLKV-EGSDWAVLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNL 288
                   +KV EG     L+  A +++D    N   S     ++P ++ + A  +    
Sbjct: 256 SDLCSSLTIKVPEGEKEVFLVFAADTNYDASNGNSKASFSFKGENPYTKVLQAATNAAKK 315

Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
           +YS L + H+ DYQ +F+  ++ L                    P+ E + S+    DP 
Sbjct: 316 TYSALKSSHVKDYQGVFNEFTLTLP-----------DPNGSADRPTTELLSSYSQPGDPY 364

Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
           +  LLF +GRYL ISSSRPG+   NLQG+W E  SP W    H NINL+MN+W      L
Sbjct: 365 VENLLFDYGRYLFISSSRPGSLPPNLQGLWTESYSPAWSGDYHANINLQMNHWAVEQTGL 424

Query: 409 SECQEPLFDFLTYLSI-NGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMG 466
            E  EPL+ ++    +  G++TA++ Y  S GWV H + + +   +A +    WA +P  
Sbjct: 425 GELTEPLWTYMAETWMPRGAETAELLYGTSEGWVTHDEMNTFGH-TAMKDVAQWADYPAT 483

Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPE 523
            AW+  H+W+H++Y+ D  +  ++ YP+L+G A F L  L++     DG L  NP  SPE
Sbjct: 484 NAWMSHHVWDHFDYSQDSTWYREKGYPILKGAAQFWLSQLVKDEYFKDGTLVVNPCNSPE 543

Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-T 582
           H    P     C  Y       +I EVF  ++        ++ +    +   L  L P  
Sbjct: 544 H---GPT-TFGCTHYQQ-----LIWEVFGHVLQGWTASGDDDTSFKNAITSKLSTLDPGI 594

Query: 583 KIAEDGSIMEW 593
            I   G I EW
Sbjct: 595 HIGSWGQIQEW 605


>gi|452002453|gb|EMD94911.1| glycoside hydrolase family 95 protein [Cochliobolus heterostrophus
           C5]
          Length = 805

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 176/602 (29%), Positives = 281/602 (46%), Gaps = 73/602 (12%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KALSDVRSLV 79
           A+P+GNGRL AM  G   +ETL LN D+LW+G P    +YT  NP +    AL  +R  +
Sbjct: 38  ALPVGNGRLAAMPIGPPSAETLTLNLDSLWSGGPFEASNYTGGNPQSSIDSALPGIRDWI 97

Query: 80  DSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
               +   T    KL G   +   Y++L ++ +    S +      Y R+LDL       
Sbjct: 98  ----FTNGTGNVTKLLGTNDNYGSYRVLANLTVAIP-SLVGSQVSNYTRKLDLANGLHST 152

Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL-SFNVSL-----DSLLDNHSYV-NGN 190
            ++  + +     F S PDQ+ V  +    SGSL +F + L     D+ L+N + V NG 
Sbjct: 153 SFNTNDTQLETTVFCSYPDQICVYTVQ--SSGSLPAFELKLGNELVDAKLENKTCVANGT 210

Query: 191 NQIIMEGRCPG-KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV---EG 246
                  R  G  ++ P       P+G+ +  I  +  + D           L V   +G
Sbjct: 211 GADSGHLRLRGVTQLGP-------PEGMLYDTIARLLPNSDVKATCDSNTGILTVTPGDG 263

Query: 247 SDWAVLLLVASSSFDGPFINP----SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
           +  A +++ A +++D          S    DP       ++     +  +L + HL+D+ 
Sbjct: 264 AKSATVIIGAETNYDMKKGTAEHQYSFRGNDPGPVVEETIRKASTKTLEELKSSHLEDFT 323

Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ---TDEDPSLVELLFQFGRY 359
            L  R    L   P  +        N   VP+ E + S+    T  DP +  LLF + +Y
Sbjct: 324 SLTGRFEFLL---PDPL--------NSAQVPTPELMASYDSNVTSGDPFVENLLFDYAQY 372

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           LLISSSRPG+   NLQG W E ++P W +  H NINL+MNYW +    L+E Q PL+D++
Sbjct: 373 LLISSSRPGSLPTNLQGRWTEQMAPDWSADYHANINLQMNYWTADQTGLTETQTPLWDYM 432

Query: 420 TYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
               +  G +TA + Y A GWV+H++ +I+  ++   G+  WA +P   AW+  H+++++
Sbjct: 433 INTWVPRGHETAMLLYGAPGWVVHNEMNIFGHTAMKDGE-GWANYPAAPAWMMLHVFDYW 491

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH------DGYLETNPSTSPEHEFIAPDGK 532
           +YT D  +L  + YPL+   A F   WL + H      D  L  NP +SPEH    P   
Sbjct: 492 DYTRDTTWLRTQGYPLIRSVAQF---WLSQLHADSFTNDNTLVVNPCSSPEH---GPT-T 544

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIM 591
             C  Y       +I +VF A+++   ++ +++      V  +L RL +   +     I 
Sbjct: 545 FGCAHYQQ-----LIHQVFEAVLTTHSLVGESDTEFTSNVSSTLSRLDKGFHVGSWSQIK 599

Query: 592 EW 593
           EW
Sbjct: 600 EW 601


>gi|358388157|gb|EHK25751.1| glycoside hydrolase family 95 protein [Trichoderma virens Gv29-8]
          Length = 794

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 156/609 (25%), Positives = 285/609 (46%), Gaps = 55/609 (9%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDYTNPDAPKA-LSDVRSLVDS 81
           +P+GNGR  A V      ET  LNE + W+G       G    P+ PKA L + +    +
Sbjct: 20  LPLGNGRFAASVLSSPAKETFILNEVSFWSGETQKAGGGLAERPEDPKAELRETQKCYLN 79

Query: 82  GQYAEATAASVKLFGHPADVYQL---LGDIELEFDDSHLKYAEETYRRELDLNTATARVK 138
           G YA+    + K        +     +G +++  +          + REL L+ A A  +
Sbjct: 80  GDYAKGKKRAEKYLESKKRNFGTNLGVGTLDIVVNGHESIGQVNGFERELRLDEAVAETR 139

Query: 139 YSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGR 198
           Y++   +F R  F S+P+QV+V +  G +   L   V +    +N ++ +  N    +G+
Sbjct: 140 YTIDGRQFKRRSFLSHPNQVLVVQFDGDDLSGLEVVVGVQG--ENEAFTSKIND---DGK 194

Query: 199 CPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
                   +   +D   G++   I+   +  D G +    D KL +       +L+    
Sbjct: 195 LEFNAQALETVHSDGTCGVKGYGIIAATV--DEGKVEH-RDTKLVISAKKNITILV---- 247

Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
           +F+  +  P++  +  T+     L+    LS +DL   HL+D+Q L+ R+SI L      
Sbjct: 248 TFNTDYSEPNEEWRKRTTLQ---LEEALKLSAADLLKAHLEDFQPLYRRMSISLGSKSST 304

Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVA-NLQGI 377
             +    +   +  PS           DPS+  L F + RYL I+ +R  + +  +LQG+
Sbjct: 305 TASIRTDQRRQNFEPSGY--------ADPSMFALYFHYARYLTIAGTRHDSPLPLHLQGL 356

Query: 378 WN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL 435
           WN  E     W    H++IN +MNY+  L    S+  +PL ++L  L+ +G   A+  Y 
Sbjct: 357 WNDGEACKMGWSCDYHLDINTQMNYFAILNGGFSDLMQPLINYLIRLAASGQHAARACYG 416

Query: 436 ASGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPL 494
           + GWV H  +++W    AD G +V + L   GG W+  HL E + Y++D  F+   A+PL
Sbjct: 417 SEGWVAHVFSNVWG--FADPGWEVSYGLNVTGGLWMANHLIEMFEYSLDEGFMANDAWPL 474

Query: 495 LEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDG----KLACVSYSSTMDMAIIRE 549
           L G + F L++++E    G+L T PS SPE+ F   +G    +    + + T+D+ ++R+
Sbjct: 475 LAGASKFFLNYMVEDPKTGWLLTGPSVSPENSFFVVNGDGEKEEHYAALAPTLDVVLVRD 534

Query: 550 VFS---AIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV---------QRR 597
           + +    +++     + N +  +++  ++  +L P +I ++G + EW+          R 
Sbjct: 535 LLAFCEYVVTKFNAGKSNWEDDIQQYQEAQAKLPPFQIGKNGQLQEWLHDFEEAQPYHRH 594

Query: 598 LNTSFSTCK 606
           L+ + + C+
Sbjct: 595 LSHTMALCR 603


>gi|421218935|ref|ZP_15675822.1| large secreted protein [Streptococcus pneumoniae 2070335]
 gi|395581532|gb|EJG42003.1| large secreted protein [Streptococcus pneumoniae 2070335]
          Length = 458

 Score =  210 bits (535), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 156/496 (31%), Positives = 241/496 (48%), Gaps = 56/496 (11%)

Query: 92  VKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFT 147
           + +F  P D   Y+LLG++ +E  D     A   Y RELDL+TA + V +  +  N++  
Sbjct: 4   LTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIK 62

Query: 148 REHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
           RE+F+S    ++  +I  S   +L+ N++L  +   ++      ++ I+M     G+   
Sbjct: 63  REYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR--- 119

Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
                    KG+QF  +   K++D  G +S L  + + +  +    L L + + + G   
Sbjct: 120 ---------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI- 166

Query: 266 NPSDSKKDPTSESMSALQSIRNLSYSDLYTR---HLDDYQKLFHRVSIQLSRSPKDIVTD 322
                        +S+LQ     S  D +T    H+  YQ+ F+RV  +L  S   +   
Sbjct: 167 ------------DISSLQG--EFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCL--- 209

Query: 323 TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDL 382
                +I T    E  K +       L  LLF +GRYLLISSS+P    ANLQGIW ++L
Sbjct: 210 -----SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDEL 260

Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIH 442
           +P W S   +NIN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+  H
Sbjct: 261 NPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAH 320

Query: 443 HKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
           H TD +  ++     +  A+W +   WLCTH+WEHY Y  D   L +  + +++    F 
Sbjct: 321 HNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFF 379

Query: 503 LDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
            D+L E  DGYL T PS SPE+++   +G       SST+D  I+R    + I  A+ L 
Sbjct: 380 EDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLG 438

Query: 563 KNEDALVEKVLKSLPR 578
            N D +    +K L R
Sbjct: 439 DNSDFISR--VKELKR 452


>gi|330819167|ref|YP_004348029.1| hypothetical protein bgla_2g00360 [Burkholderia gladioli BSR3]
 gi|327371162|gb|AEA62517.1| hypothetical protein bgla_2g00360 [Burkholderia gladioli BSR3]
          Length = 796

 Score =  209 bits (531), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 175/614 (28%), Positives = 283/614 (46%), Gaps = 103/614 (16%)

Query: 13  LKITFN---GPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
           L+++++   G +    + +P+GNGRLGA+  G    E L LNE TLW+G   D  +P   
Sbjct: 65  LRLSYSQAAGESNILFEGLPLGNGRLGALTGGSPVREALYLNEITLWSGQK-DAVDP--- 120

Query: 70  KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
                         Y  A   S          YQ+LG + +E    H +     Y R LD
Sbjct: 121 -------------AYTAAGMGS----------YQMLGKLYVELP-GHAQ--ASGYSRSLD 154

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           ++ A AR +Y  G   + RE F S+PD+V+V ++S S+ GS    +SL  +    + V G
Sbjct: 155 ISNAVARTQYVAGGHTYRREVFCSHPDKVLVMRLS-SDGGSHDGTISL--VDGQGASVTG 211

Query: 190 NNQIIM-EGRCPG--KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
           +N I++ +G+  G  +R      A  D   +++ A         +G ++      L    
Sbjct: 212 SNGILLAQGKLDGVGERYATHVLAMPDSGTVKYDA--------SKGVLTMSRCPAL---- 259

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
                L++ A +++ G          DP + + +      +L Y +L  RHL DY  LF 
Sbjct: 260 ----TLIIAARTNYSGIEAEGYLGATDPAALARADASGAAHLPYRNLLERHLRDYTALFG 315

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSS 365
           R S+ L +S           +   T+P   + ++   D  DP L  L  QFGRYL I+SS
Sbjct: 316 RFSLDLGKS--------SDAQRAMTIPDRLKARTASPDIADPELEALYVQFGRYLTIASS 367

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
           R G   ANLQG+W+ + +P W +  H +IN++MNYW +    L ECQ+P  D++     +
Sbjct: 368 R-GPLPANLQGLWSVNNTPPWMADYHTDINVQMNYWLADRAGLPECQKPFADYVLSQLPS 426

Query: 426 GSKTAQVNY-------------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
            +++ Q ++               +GW I   T I+       G + W   P   AW C 
Sbjct: 427 WARSTQAHFNDAANSNYSNSSGKVAGWTIAISTGIY-------GGIGWDWSPPASAWYCR 479

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
            LW HY YT+DRD+L +  YP+L+    F    LI +   G L  +   SPEH     D 
Sbjct: 480 TLWNHYQYTLDRDYL-RAIYPVLKSACEFWQARLIVDPASGLLVDDRDWSPEHG----DH 534

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED-ALVEKVLKS---LPRLRPTKIAED 587
           +   ++Y+  +    + ++F+   +A+  L  + D A     L+S   LP++ PT     
Sbjct: 535 QELGITYAQEL----VWDLFTNYGTASGTLNLDTDFAATIAGLRSRLYLPKISPTT---- 586

Query: 588 GSIMEWVQRRLNTS 601
           G + EW++ +++T 
Sbjct: 587 GQLQEWMEDKVDTG 600


>gi|330915124|ref|XP_003296910.1| hypothetical protein PTT_07143 [Pyrenophora teres f. teres 0-1]
 gi|311330715|gb|EFQ94998.1| hypothetical protein PTT_07143 [Pyrenophora teres f. teres 0-1]
          Length = 755

 Score =  209 bits (531), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 163/588 (27%), Positives = 272/588 (46%), Gaps = 48/588 (8%)

Query: 29  PIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVRSLVDSGQYA 85
           P+GNGRLGAM  G   +ETL LN D+LW+G P    +YT  +   +++     +    + 
Sbjct: 41  PLGNGRLGAMPVGPAAAETLTLNLDSLWSGGPFNISNYTGGNPHTSIASALPGIRDWIFI 100

Query: 86  EATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
             T     L G   +   YQ+LG++ ++            Y RELD++T      ++   
Sbjct: 101 NGTGNVSALLGSNDNYGSYQVLGNLTVKIPSLESSIISN-YTRELDISTGIHTTTFTANG 159

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLS-FNVSLDSLL-----DNHSYVNGNNQIIMEG 197
            +     F S PDQV V  +  + +G +    V+LD++L      N + V+ N+      
Sbjct: 160 NQLETTGFCSFPDQVCVYTVQSTGAGDIPPLEVTLDNVLVLPQLQNVTCVDRNSTQPAYL 219

Query: 198 RCPG--KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
           R  G  +  PP+    D    +  +A +++  + + G +S          G+    +++ 
Sbjct: 220 RLRGVTQLGPPEGMRYDSIARVVSNAKIDMSCNHNAGLLSIAPRS-----GAKSVSIVVG 274

Query: 256 ASSSFDGPF----INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
           A +++D        N S   +DP              +   L +RH+DD+  L     + 
Sbjct: 275 AGTNYDAKKGRAEHNYSFRGEDPAPIVEVTTLKAAAKTLDQLRSRHVDDFTALTGLFELS 334

Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           L         D  +     T     R     T  DP L  LL +  RYL ISSSRPG+  
Sbjct: 335 LP--------DPLNSSQTQTSELVNRYTVNNTGGDPYLESLLMENSRYLFISSSRPGSLP 386

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL--TYLSINGSKT 429
            NLQG W+E L   W +  H NIN++MN+W +    L++ Q PL+D++  T++   G++T
Sbjct: 387 PNLQGRWSEGLETDWSADYHANINIQMNHWTADQTGLTDLQSPLWDYMADTWMP-RGAET 445

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
           A + Y A GWV+H++ +I+   +A +    WA +P+  AW+  H+++H++Y+ +  +L  
Sbjct: 446 ALLEYNAPGWVVHNEMNIFGH-TAMKSAAEWANYPISAAWMMQHVFDHWDYSRNATWLRT 504

Query: 490 RAYPLLEGCASFLLDWL---IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
           +AYP+L+G A+F L+ L   +  +D  L  NP  SPEH          C  Y       +
Sbjct: 505 QAYPMLKGVATFWLNQLQPDLYYNDNSLVVNPCNSPEHG----QTTFGCAHYQQ-----L 555

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
           I +VF +I++    +   + + +  +  SL RL     I     I EW
Sbjct: 556 IHQVFHSILAVQPTVADPDTSFLTTLTSSLARLDTGFHIGSFAQIKEW 603


>gi|225017021|ref|ZP_03706213.1| hypothetical protein CLOSTMETH_00943 [Clostridium methylpentosum
           DSM 5476]
 gi|224950188|gb|EEG31397.1| hypothetical protein CLOSTMETH_00943 [Clostridium methylpentosum
           DSM 5476]
          Length = 1158

 Score =  209 bits (531), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 175/660 (26%), Positives = 303/660 (45%), Gaps = 101/660 (15%)

Query: 4   AESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
           ++S++  N L+I ++ PA  + T+A+ IGNG +G MV+GGV  + + +NE T+W G P +
Sbjct: 35  SQSSANDNLLRIWYDEPATDWQTEALAIGNGYMGGMVFGGVKRDKVHINEKTVWNGGPTE 94

Query: 63  ------YTNPDAPKALSDVRSLVD--SGQYAEATAASVKLFGHPADVYQ----------- 103
                 Y N +  +   D++ + D  +    +    S  +FG   D YQ           
Sbjct: 95  NNNRYNYGNTNPTETEEDLQKIKDDLNAIREKLDDKSEFVFGFDEDSYQSSGTSTRGEAM 154

Query: 104 -----LLGDIE-----LEFDDSHL------KYAEETYRRELDLNTATARVKYSVGNVEFT 147
                L+GD+       ++ D  +      + A   Y R+LD+ T  A V Y    V +T
Sbjct: 155 DWLNKLMGDLTGYSAPQDYADLFITNNAIDESAVTNYIRDLDMRTGLATVSYDYDGVHYT 214

Query: 148 REHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPK 207
           RE+F+S PD V+V +++  + G ++FN +L           GNN   +     G  I  K
Sbjct: 215 REYFNSYPDNVLVVRLTADQGGKINFNTNL------TDKTRGNN---LTNTAEGDTITMK 265

Query: 208 ANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP 267
           ++   +  G++  A  ++K+  + G IS ++   + V  +D A L+L   + +      P
Sbjct: 266 SSLRSN--GLKVEA--QLKVVPEGGDIS-VDGSSINVANADAATLILACGTDYKMEL--P 318

Query: 268 SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEE 327
           +   +DP +     + +     Y+DL   H+ D+  LF R+ I  +             E
Sbjct: 319 TFRGEDPHAAVTGRISAAAEKGYADLKEDHVADHSALFSRMEIGFN-------------E 365

Query: 328 NIDTVPSAERVKSFQ-----------TDEDPSLVELL-FQFGRYLLISSSRPGTQVANLQ 375
            I  +P+ E +K ++           T+ +   +E++ +QFGRYL I+ SR G+   NLQ
Sbjct: 366 EIPQIPTDELIKKYRNMVDNNGGEVPTEAEQRALEIICYQFGRYLTIAGSREGSLPTNLQ 425

Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL 435
           G+W E  S  W    H NIN++MNYW ++  NL+EC  P  D+L  L   G   A   + 
Sbjct: 426 GVWGEG-SFAWGGDYHFNINVQMNYWPTMASNLAECHVPYNDYLNVLREAGRGAAAAAFG 484

Query: 436 -------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
                   +GW++   +  +  ++  +        P G AW   + +E+Y ++ D ++L+
Sbjct: 485 IKSEPGEENGWLVGCFSTPYMFATMGQKNNAAGWNPTGSAWALLNSYEYYLFSGDTEYLK 544

Query: 489 KRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
              YP ++  A+F  + L   E    Y+ + PS SPE+           +   ++ D   
Sbjct: 545 NELYPSMKEVANFWNEALYWSEYQQRYV-SGPSYSPEN---------GPIVNGASYDQQF 594

Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRLNTSFSTCK 606
           I + F   I AAE L  +ED LV    +   +L P  + +DG + EW +    T+F   +
Sbjct: 595 IWQHFENTIQAAETLGVDED-LVATWREKQSKLDPVIVGDDGQVKEWFEE---TTFGKAQ 650


>gi|291549437|emb|CBL25699.1| Uncharacterised Sugar-binding Domain [Ruminococcus torques L2-14]
          Length = 1637

 Score =  208 bits (530), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 172/655 (26%), Positives = 290/655 (44%), Gaps = 108/655 (16%)

Query: 5   ESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
           E+    N L++ ++ PA  + T ++ IGNG +G++V+GG+  + + +NE T+W G P  Y
Sbjct: 38  ETAKNDNLLRVWYDEPATDWQTQSLAIGNGYMGSLVFGGINKDKIHINEKTVWEGGPTSY 97

Query: 64  ------------TNPDAPKALSDVRS----LVDSGQYA--------EATAASVKLFGHPA 99
                       T+ D  K   D+ +    L D  +Y         EA+  + K  G   
Sbjct: 98  NGYSYGTTNKTETDADLQKIKDDLNAIREKLDDKSEYVFGFNEDSYEASGTNTK--GEAM 155

Query: 100 D-VYQLLGDI----------ELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTR 148
           D + +L+GD+           L   ++        Y R+LD+ TA A V Y    V +TR
Sbjct: 156 DWLNKLMGDLVGYSAPKDYANLYISNNQDSSKVSNYVRDLDMRTALATVNYDYEGVHYTR 215

Query: 149 EHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY---VNGNNQIIMEGRCPGKRIP 205
           E+F S PD V+  ++S  + G ++F+ +L SL+   ++   V+G+  I M     G  + 
Sbjct: 216 EYFDSYPDNVMAVRLSADQKGKINFDTNLQSLIGGRTHKSTVDGDT-ITMRDALGGNGLN 274

Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISA---LEDKKLKVEGSDWAVLLLVASSSFDG 262
            +A               ++K+ ++ G++S+     +  + V  +D   L+    + +  
Sbjct: 275 IEA---------------QLKVINEGGSLSSNTNGSNPSITVSDADAVTLIFACGTDYKM 319

Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
               PS   +DP     + + +     Y  L   H+ D+  LF R+ +  +         
Sbjct: 320 EL--PSFRGEDPHDAVTARINAAAKKGYEALKKDHVADHDALFSRMELGFN--------- 368

Query: 323 TCSEENIDTVPSAERVKSFQT------------DEDPSLVELLFQFGRYLLISSSRPGTQ 370
               E + T+P+ E +K ++              E  +L  + +QFGRYL I+ SR G  
Sbjct: 369 ----EEVPTIPTDELIKKYRNMVDNNGGEVPTESEQRALEVICYQFGRYLTIAGSREGAL 424

Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
             NLQG+W E     W    H NIN++MNYW +L  NL+ECQ    D+L  L   G   A
Sbjct: 425 PTNLQGVWGEGYFQ-WGGDYHFNINVQMNYWPTLASNLAECQTAYNDYLNVLKEAGRYAA 483

Query: 431 QVNY-------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
              +         +GW++   +  +  S+  +        P+G AW   + +E+Y YT D
Sbjct: 484 AAAFGIKSDEGEENGWLVGCFSTPYMFSALGQKNNAAGWNPIGSAWALLNAYEYYLYTED 543

Query: 484 RDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
            D+L+   YP L+  A+F  + L   E    Y+   PS SPE+           +   ++
Sbjct: 544 TDYLKNELYPSLKEVANFWNEALYWSEYQQRYVSA-PSYSPEN---------GPIVNGAS 593

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQR 596
            D   I + F   I AAE L  + D LVE+  +   +L P  + +DG + EW + 
Sbjct: 594 YDQQFIWQHFENTIQAAETLGVDAD-LVEQWKEKQSKLDPVLVGDDGQVKEWYEE 647


>gi|187734699|ref|YP_001876811.1| glycoside hydrolase family protein [Akkermansia muciniphila ATCC
           BAA-835]
 gi|187424751|gb|ACD04030.1| glycoside hydrolase family 95 [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 788

 Score =  208 bits (529), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 165/596 (27%), Positives = 260/596 (43%), Gaps = 50/596 (8%)

Query: 12  PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           P+++T + PA+ +T+    GNGRLG + +G  P ET+ LNE +++           A +A
Sbjct: 28  PMQVTASTPARVWTEGYGTGNGRLGILSFGVFPKETVVLNEGSIFA-KKNFQMREGAAEA 86

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDDSHLKYAEETYRREL 128
           L   R L   G+Y  A     K    P ++   YQ  G +++EF       +  +Y+R L
Sbjct: 87  LDKARELCKEGKYRSADQLFRKNILPPGNIAGDYQQGGRLQVEFQGLP---SPSSYQRTL 143

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           D+    A  +   G  E T E  ++         I+ +       +++L+    +   V 
Sbjct: 144 DMRRGKATTRAQFGTGELTTEILAAPSSDCAAYHIACTMPSGCRVSLNLEHPDPSARIVA 203

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
             N  ++EG+           +N   +      IL    S  R   + + D   +V    
Sbjct: 204 QPNGWVLEGQ----------GSNGGTRFENTVVILAPGASVTRKGSTIILDSAREV---- 249

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQK 303
               ++++S S D     P    + P + S++A     L   +   +  L     D + +
Sbjct: 250 ----MVLSSISTDYNIRKP----EAPLTHSLAAKNARILAKAQKAGWKKLAAETEDYFSR 301

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           L  R  + L  SP  +   T ++         ERVK  Q  +DP L+E LFQFGR+  I+
Sbjct: 302 LMTRCQVDLGDSPAGVSAMTTAQR-------LERVK--QGKKDPDLLEQLFQFGRFCTIA 352

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
            +RPG     LQG+WN +L   W     +NIN +MN W S    L E Q    DF+  L 
Sbjct: 353 HTRPGQLPCGLQGLWNPELRAAWMGCYFLNINSQMNQWPSHVTGLGEFQSSYLDFVRSLR 412

Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
            +G + A+      G+   H TD W ++        W    M GAW C HL + Y +T D
Sbjct: 413 PHGEEFARF-IKRDGFCFGHYTDCWKRTYFSGNNPEWGASLMNGAWACAHLVDSYRFTGD 471

Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK----LACVSYS 539
           R+ L K++ P+LE  A F++ W  +  +G   + P  SPE  F APDG     L+ VS  
Sbjct: 472 REDL-KKSLPILESNARFIMSWFEDDGEGRYLSGPGVSPETGFYAPDGTGPNVLSYVSNG 530

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           ++ D  + RE     I A   L      L+ K ++ L ++    I  DG + EW Q
Sbjct: 531 TSHDQLLGREALRNYIYACGELGIRTPTLL-KAVQFLRKIPQPAIGPDGRVQEWRQ 585


>gi|393222962|gb|EJD08446.1| glycoside hydrolase family 95 protein [Fomitiporia mediterranea
           MF3/22]
          Length = 842

 Score =  207 bits (528), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 170/603 (28%), Positives = 280/603 (46%), Gaps = 74/603 (12%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP-------------DAPKALSD 74
           +P+GNG LGAM+ GG   E+ +LN ++LW+G P  + +P             +  +A+  
Sbjct: 56  LPVGNGFLGAMISGGTTQESTQLNIESLWSGGP--FADPGYNGGNKQLDEQSEIGQAMRS 113

Query: 75  VRSLVDSGQYA-----EATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
           +R  +   ++      +A  A +  +G+ +    L+  +      +    A   Y R LD
Sbjct: 114 IRQKIFKSKHGTIDNVDALMAPIGAYGNYSSAGFLVSTLT-----NTPSSAISDYARFLD 168

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD---NHSY 186
           L T  AR  ++ GN +FTRE F S P Q      S +     S   +L +++     +  
Sbjct: 169 LETGVARTIWTHGNYQFTRETFCSYPAQACAQNTSSTNPSGFSQTYALGAIIGLPPPNVT 228

Query: 187 VNGNNQIIMEGRC--PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
              N+ +   G    PG      A  +  P GI     +E     +        +  L +
Sbjct: 229 CADNSTLRSSGLVSNPGMAYEILATVSVSPGGI-----IECNTVPNVNHTRKASNATLTI 283

Query: 245 EGSDWAVLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
             +    ++ V  +++D    + + S      DP     S L S    SYS+    H+ D
Sbjct: 284 SNATSMSIMWVGGTNYDAGAGDAAHSFSFRGSDPHEGLSSLLISASEKSYSEFVAEHISD 343

Query: 301 YQKLFH-RVSIQLSRSPKDIVTDTCSEENID-TVPSAERVKSFQTDE-DPSLVELLFQFG 357
           ++   +   S+ L              +NI+  VP+ +    ++ D+ DP L  LLF +G
Sbjct: 344 FKSALNPSFSLNLG-------------QNINLKVPTDKLKDVYRVDKGDPYLEWLLFNYG 390

Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
           RYLL+SS+R G   ANLQG W  D    W +  HVNINL+MNYW +   NL +  + LFD
Sbjct: 391 RYLLVSSAR-GALPANLQGKWARDAGNPWSADYHVNINLQMNYWFAESTNL-DVTKSLFD 448

Query: 418 FL--TYLSINGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
           F+  T++S  G+ TAQV Y ++ GWV+H++ +I+  +   +G   WA +P   AW+  H+
Sbjct: 449 FIEETWVS-RGTYTAQVLYNSTQGWVLHNEINIFGHTGMKQGDAEWADYPESNAWMMIHV 507

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDG 531
           W+H+++T D  + + + YPL++G ASF L+ LI      DG L   P  SPE     P  
Sbjct: 508 WDHFDFTNDVAWWKAQGYPLVKGAASFHLNKLIPDERFKDGTLVVAPCNSPEQ----PPI 563

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSI 590
            LAC          +I ++F+A+   A    + ++A + ++     R+ +   I   G +
Sbjct: 564 TLACAHAQQ-----VIWQLFNAVEKGAAAAGETDEAFLNEIKSKKGRMDKGIHIGSWGQL 618

Query: 591 MEW 593
            EW
Sbjct: 619 QEW 621


>gi|309798858|ref|ZP_07693119.1| alpha-fucosidase [Streptococcus infantis SK1302]
 gi|308117507|gb|EFO54922.1| alpha-fucosidase [Streptococcus infantis SK1302]
          Length = 627

 Score =  207 bits (528), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 157/518 (30%), Positives = 254/518 (49%), Gaps = 63/518 (12%)

Query: 102 YQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIV 160
           Y   GDI + F++        T Y R LD++ A     Y+     F RE FSS PD V V
Sbjct: 12  YLAFGDIFMVFNNQKKGLENVTDYHRGLDISEAITTTSYTQDGTSFKRETFSSYPDDVTV 71

Query: 161 TKISGSESGSLSF---NVSLDSLLDNHSYVNGNNQIIMEGRCP--GKRIPPKANANDDPK 215
           T ++     +L F   N   + L+ N  Y +  N    +G        I  K    D+  
Sbjct: 72  THLTKKGDKTLDFTLWNSLTEDLIANGDY-SWENSKYKQGTVSVDSNGILLKGTVKDN-- 128

Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDP 274
           G+QF++ L IK     G ++A +D  L V G+ +A LLL A ++F     NP ++ +KD 
Sbjct: 129 GLQFASYLGIKTD---GQVTA-QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDI 181

Query: 275 TSESM--SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
             E    S +++ +   Y  L   H+ DYQ LF+RV + L  S  +  T           
Sbjct: 182 DVEKTVKSIVEAAKAKDYETLKNDHIKDYQSLFNRVQLNLGGSKSNQTT----------- 230

Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAP 390
              E ++++   +   L EL FQ+GRYLLISSSR  T    ANLQG+WN   +P W+S  
Sbjct: 231 --KEALQTYNPTKGQKLEELFFQYGRYLLISSSRNRTDALPANLQGVWNAVDNPPWNSDY 288

Query: 391 HVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGW 439
           H+N+NL+MNYW +   NL+E  +P+ +++  +   G           SK  Q N    GW
Sbjct: 289 HLNVNLQMNYWPAYMSNLAETAKPMINYIDDMRYYGRIAAKEYAGIESKEGQEN----GW 344

Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
           ++H +   +  ++       W   P   AW+  +++++Y +T D  +L+++ YP+L+  A
Sbjct: 345 LVHTQATPFGWTTPGW-NYYWGWSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETA 403

Query: 500 SFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISA 557
            F   +L   +  D ++ ++PS SPEH           ++  +T D +++ ++F   + A
Sbjct: 404 KFWNSFLHYDKASDRWV-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEA 453

Query: 558 AEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
           A  L+ ++D LV +V     +L+P  I +DG I EW +
Sbjct: 454 ANHLKVDQD-LVTEVKTKFDKLKPLHINQDGRIKEWYE 490


>gi|358400122|gb|EHK49453.1| glycoside hydrolase family 95 protein [Trichoderma atroviride IMI
           206040]
          Length = 788

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 171/604 (28%), Positives = 274/604 (45%), Gaps = 75/604 (12%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KAL 72
           PA     A P+GNG+LGAM  G V  + + LNE +LW+G P    DY   NP  P   AL
Sbjct: 29  PANIIMTAYPLGNGKLGAMPLGLVGEDIVVLNEHSLWSGGPFQNPDYIGGNPPGPVYTAL 88

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADVY----QLLGDIELEFDDSHLKYAEETYRREL 128
             +R  +   Q     +    L+G PAD Y    + LG++ ++      +Y   +Y R L
Sbjct: 89  PGIRDTIWQTQINNDIS---PLYGDPADYYYGNYETLGNLTVKIAGLS-QYT--SYNRAL 142

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-------------GSLSFNV 175
           DL T   +  +      FT   F + PDQV V  +  +++              S + N+
Sbjct: 143 DLETGIHQTVFRSNGASFTTTTFCTFPDQVCVHNVQSTKALPAITIGLQDNARSSPASNL 202

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
           S D+   N  ++ G  Q  +     G     +      PKG   +A  EI I  D  T S
Sbjct: 203 SCDA---NGVHLRGQTQQDI-----GMIFDARVQVLSRPKGAACTASHEIVIPADSKTKS 254

Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
                 +   G+D+       +S++       S    DP    +S +++    SY+ LY 
Sbjct: 255 V---TVIYAAGTDYDQKKGTKASNY-------SFKGVDPAPAVLSTIKAAAKESYNSLYN 304

Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE-LLF 354
            H+ D+  LF + ++ L  S           +N  ++P+A+ ++ +  D   + +E LLF
Sbjct: 305 SHVKDHNALFSQFTLNLPDS-----------DNSASIPTAKLMEDYDDDIGNTFIENLLF 353

Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
            +GRYL I S RPG+   NLQGIW E L+P W +  HV++N++MN+W +    L + Q P
Sbjct: 354 DYGRYLFIGSCRPGSLPPNLQGIWTESLTPAWSADYHVDVNVQMNHWHTEQTGLGDIQGP 413

Query: 415 LFDFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           L+DF+T   +  G++TA + Y A G+V     + +   +      VW+ +P   AWL  +
Sbjct: 414 LWDFITDTWVPRGTETAALLYDAPGFVGFSNLNTFG-FTGQMNAAVWSDYPASAAWLMQN 472

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPD 530
           +W+ Y+Y  D  +     YPL++  A + +  ++     +DG L   P  SPEH +    
Sbjct: 473 VWDRYDYGRDTTWYRATGYPLMKAVAEYWIHEMVPDLYSNDGTLVAAPCNSPEHGWTT-- 530

Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGS 589
               C  Y       ++ E+F  II + +         +E V ++  +L P   I   G 
Sbjct: 531 --FGCTHYQQ-----LVWELFDHIIQSWDATGDKNTTFLETVKETQAKLSPGIIIGWFGQ 583

Query: 590 IMEW 593
           I EW
Sbjct: 584 IQEW 587


>gi|396466146|ref|XP_003837624.1| similar to glycoside hydrolase family 95 protein [Leptosphaeria
           maculans JN3]
 gi|312214186|emb|CBX94180.1| similar to glycoside hydrolase family 95 protein [Leptosphaeria
           maculans JN3]
          Length = 807

 Score =  207 bits (526), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 172/595 (28%), Positives = 271/595 (45%), Gaps = 64/595 (10%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YTNPDAPKALSDVRSLVDSGQ 83
           A P+GNGRLGAM +G    ET+ LN D+LW+G P +   YT  +   A++     +    
Sbjct: 46  AYPLGNGRLGAMPFGPAGQETVNLNLDSLWSGGPFETVSYTGGNPTSAVAQALPGIRDWI 105

Query: 84  YAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYS 140
           +   T    +L G   +   Y++LG++ +      +     T + R LD+       +Y 
Sbjct: 106 FTNGTGNVTELLGEDGNFGSYRVLGNLSVSIPSLQIGNVSITGFTRTLDIVNGIHTTRYK 165

Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLS-FNVSLDSLLD-----------NHSYVN 188
           V   E     F S PDQV V   S   SG L    +SLD+ L            +H  + 
Sbjct: 166 VDENEINTTVFCSYPDQVCV--YSAQSSGQLPVLQLSLDNELVTSELKTRTCEVDHVRMR 223

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFS-----AILEIKISDDRGTISALEDKKLK 243
           G  Q+   G   G R    A     P+GI+ S     AIL I  ++   +++ +   +  
Sbjct: 224 GVTQV---GPPEGMRYDAIARVAS-PEGIKMSCINGTAILNITPNNGTNSVTVILGAETD 279

Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
            +           ++ FD  F       +DP     +  Q     +  +L   H++D+  
Sbjct: 280 YDQKK-------GTAEFDYSF-----RGEDPGPTVEATTQKAAAKTSVELVGAHVEDFTS 327

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
           L  R  + L        TDT +     T+   ER  S  T+ DP L  LLF +  YL IS
Sbjct: 328 LSERFKLSL--------TDTLNSLQTPTLDLIERYDSEDTNGDPYLESLLFDYSNYLFIS 379

Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
           SSR G+   NLQG W+E L   W    H NINL+MN+W +    L++ Q PL+D++    
Sbjct: 380 SSRAGSLPPNLQGRWSEGLYAAWSGDYHANINLQMNHWTADQTGLTDLQSPLWDYMADTW 439

Query: 424 I-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
           +  G++TA++ Y A GWV+H++ +I+  +    G    A +    AW+  H+++H++Y+ 
Sbjct: 440 VPRGTETAELLYDAPGWVVHNEMNIFGHTGMKSGASW-ANYAAAAAWMMQHVYDHWDYSR 498

Query: 483 DRDFLEKRAYPLLEGCASFLLDWL---IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           D  +L+ + YPLL+G A F L  L   +  +D  L   P  SPEH    P    AC  + 
Sbjct: 499 DTAWLKSQGYPLLKGVAKFWLHQLQLDMFSNDNSLVVIPCNSPEH---GPT-TFACAHFQ 554

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
                 +I ++F AI++ + ++ +++ A    +  SL  L     I   G I EW
Sbjct: 555 Q-----VIHQLFDAILTLSPIVSESDTAFTTNISSSLKFLDTGFHIGSFGQIKEW 604


>gi|358396613|gb|EHK45994.1| glycoside hydrolase family 95 protein [Trichoderma atroviride IMI
           206040]
          Length = 793

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 158/595 (26%), Positives = 274/595 (46%), Gaps = 52/595 (8%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVR 76
           P         IGNGR G +  G    + L LN+D++W G P     YT  +   +L+   
Sbjct: 28  PGNVLMTGYTIGNGRQGGLPLGIPGDDLLCLNDDSVWRGGPFSNSSYTGGNPSSSLAHFL 87

Query: 77  SLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
             +    +   T     L+G  +D   Y+ L ++ +       KY+   Y+R LDL TA 
Sbjct: 88  PGIQEFIFQNGTGDESALYGGSSDYGSYEALANLTVSIAGV-TKYSN--YKRTLDLETAL 144

Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNVSLDSLLDNHSYVNGNNQI 193
              +++     F    F + PDQV V  +S ++    ++F      L+DN+     N   
Sbjct: 145 HSAEFTANGASFQTVQFCTFPDQVCVYHVSSNKPLPDITF-----GLVDNYRT---NPAS 196

Query: 194 IMEGRCPGKRIPPKANANDDPK--GIQFSAILE-IKISDDRGTISALEDKKLKVEGSDWA 250
            ++    G  +  +  A+D     G++  A    +  S  + T ++     L  +    A
Sbjct: 197 TVQCSSSGIWLSGRTVADDGEGLIGMKIDAQASALSSSGLKATCNSRGQTVLSTKSVKSA 256

Query: 251 VLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
            +++ + + +D    N +++      DP    +  + ++   SY+ +  RH+ D+ + F+
Sbjct: 257 TIVVASGTEYDAEKGNAANNYSFRGADPHPGVVKTINAVSKKSYNAILQRHVADHGEWFN 316

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
           + ++ L               N   V S E + ++ TD+ DP +  LL  +G+Y+ I+SS
Sbjct: 317 KFTLDLP-----------DPNNSAEVDSMELLTNYSTDKGDPFVEGLLIDYGKYMFIASS 365

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI- 424
           RPG+   NLQG W  D +P W S  H+++N++MN+W      L    +PL+DF+TY  + 
Sbjct: 366 RPGSLPPNLQGSWAPDGNPAWSSDYHIDVNVQMNHWHVEKMGLGGLTDPLWDFMTYTWVP 425

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G++TA++ Y ASGWV    T+I+   +A      W+      AW+  H+W+ Y+Y  D+
Sbjct: 426 RGTETARLWYNASGWVAFTNTNIFGH-TAQENDATWSDVAHDIAWMMAHVWDRYDYGRDK 484

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDG--KLACVSYS 539
           ++     YPL++G ASF +D L++     DG L  NP  SPEH    P G     C  + 
Sbjct: 485 NWYASVGYPLMKGVASFWMDLLVQDDYFKDGTLVANPCNSPEH---GPTGFQTFGCAQFQ 541

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
                 +I E+F  II         + + ++++ +S  +L P   +   G I EW
Sbjct: 542 Q-----VIWELFDHIIKDWNASGDRDASFLKRLKESYGKLDPGVHVGSWGQIQEW 591


>gi|282881164|ref|ZP_06289851.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1]
 gi|281304968|gb|EFA97041.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1]
          Length = 1008

 Score =  206 bits (523), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 156/561 (27%), Positives = 257/561 (45%), Gaps = 91/561 (16%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
            T  +PIGNG+ G  V GGV  + ++ N+ TLW G                V ++V +  
Sbjct: 206 MTSTLPIGNGQFGGCVMGGVKRDEVQFNDKTLWKG---------------HVGAVVGNPN 250

Query: 84  YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
           Y                 Y   G++ +   DS L  A   YRR LD++ A A V Y+   
Sbjct: 251 YGS---------------YLNFGNLYITSTDSRLN-AATNYRRWLDIDQAKAGVAYTANG 294

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLLDNHSY-VNGNNQII-MEGRCP 200
           V++ RE+  S PD+VI      SE G +S N+ L +      +Y +NG   +I  +G  P
Sbjct: 295 VDYQREYICSFPDKVIAIHYKASEKGKISNNIILFNQNGKTPTYNMNGTTGVITFQGEVP 354

Query: 201 GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
                        PKG  +    +  ++   GTI+  +D  + V+ +D   + L  +++F
Sbjct: 355 ---------RTGTPKGESY--YCKAYVTAKGGTIAVGKDGGIDVKNADEMFIYLYGTTNF 403

Query: 261 DGP---FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
           D     +I  SD+   P S     + +  +  Y+ +   H++DY+ L+ R  + ++++  
Sbjct: 404 DASNDEYI--SDAALLP-SHVTGVVDAALSKGYAAICDAHVEDYKALYDRCQLNITKA-- 458

Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLISSSRPGTQVANLQ 375
                      + +V + + +  F      +L+  E+ F +GRYL+ISSSR     +NLQ
Sbjct: 459 -----------MPSVTTRKLIADFAISPADNLLLEEIYFCYGRYLMISSSRGVDLPSNLQ 507

Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING-------SK 428
           GIWN   +P W+S  H NIN++MNYW +   NLSE   P   FL Y+           + 
Sbjct: 508 GIWNNVNNPAWNSDIHSNINVQMNYWPAEITNLSELHLP---FLKYIHREACERPQWRAN 564

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL-WPMGGAWLCTHLWEHYNYTMDRDFL 487
             Q+     GW +  + +I+   S       W   + +  AW C HLW+HY +T+D+++L
Sbjct: 565 ARQIAGQTVGWTLTTENNIYGSGSN------WMQNYTIANAWYCMHLWQHYRFTLDKEYL 618

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
           +  AYP +  CA + L  L++  DG  E     SPEH    P  + A     +     ++
Sbjct: 619 KNIAYPAMRSCAEYWLQRLVKAADGTYECPNEFSPEH---GPGSENA-----TAHSQQLV 670

Query: 548 REVFSAIISAAEVLEKNEDAL 568
            ++F+  + A   L  +EDA+
Sbjct: 671 WDLFNNTLQAIAELGISEDAI 691


>gi|340514441|gb|EGR44703.1| glycoside hydrolase family 95 [Trichoderma reesei QM6a]
          Length = 755

 Score =  205 bits (521), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 166/589 (28%), Positives = 263/589 (44%), Gaps = 59/589 (10%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KALSDVRSLV 79
           A P+GNG+LGAM  G V  + + LNE +LW G P    DY   NP AP   AL  +R  +
Sbjct: 3   AYPLGNGKLGAMPLGVVGEDIVVLNEHSLWAGGPFQSPDYIGGNPPAPVYTALPGIRETI 62

Query: 80  DSGQYAEATAASVKLFGHPADVY----QLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
              Q     +A   L+G PA  Y    + LG++ +       KY   +Y R LDL T   
Sbjct: 63  WKTQINNDISA---LYGDPAYYYYGNYETLGNLTVNIAGVS-KYT--SYNRALDLETGIH 116

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
             ++     +FT   F + PDQV    I  S+          DSL  N +          
Sbjct: 117 TTEFKANGAKFTITTFCTFPDQVCAYNIQSSKPLPAVTIGLRDSLRSNPA---------S 167

Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV-LLL 254
              C    +  +     D  G+ F A  ++     R T ++     +  +G   ++ ++ 
Sbjct: 168 NLTCDANGVHLRGQTQQD-IGMIFDARAQLINRPKRATCTSSHGLSVPSDGRTTSLTVVY 226

Query: 255 VASSSFD----GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            A +++D        N S    DP    +S ++ +   S++ +Y  H+ D+  LF + S+
Sbjct: 227 AAGTNYDQKKGTKASNYSFKGVDPAPAVLSTIKKVSQKSFNSMYNAHIKDHNGLFSQFSL 286

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
            L    K             +VP+A  ++++  D  DP +  LLF +GRYL I S R G+
Sbjct: 287 DLPDPEKSA-----------SVPTATLMENYDYDLGDPFVENLLFDYGRYLFIGSCRDGS 335

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-NGSK 428
              NLQGIW E L+P W +  HV++N++MN+W +    L E Q PL+DF+    +  G++
Sbjct: 336 LPPNLQGIWTESLTPAWSADYHVDVNVQMNHWHTEQTGLGEIQGPLWDFIIDTWVPRGTE 395

Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
           TA + Y A G+V     + +   +      VW+ +P   AWL  ++W  Y+Y+ D  + +
Sbjct: 396 TAALLYDAPGFVGFSNLNTFG-FTGQMNAAVWSNYPASAAWLMQNVWNRYDYSRDTHWWK 454

Query: 489 KRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
              YPL++  A + +  ++     +DG L   P  SPEH +        C  Y       
Sbjct: 455 TVGYPLMKSIAEYWIHEMVPDLYSNDGTLVAAPCNSPEHGWTT----FGCTHYQQ----- 505

Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
           ++ EVF  +I   E         +E V ++  +L P   I   G I EW
Sbjct: 506 LVWEVFDHVIEGWEASGDKNTTFLETVKETQSKLSPGIIIGWFGQIQEW 554


>gi|229829382|ref|ZP_04455451.1| hypothetical protein GCWU000342_01471 [Shuttleworthia satelles DSM
           14600]
 gi|229792545|gb|EEP28659.1| hypothetical protein GCWU000342_01471 [Shuttleworthia satelles DSM
           14600]
          Length = 1622

 Score =  205 bits (521), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 175/665 (26%), Positives = 293/665 (44%), Gaps = 114/665 (17%)

Query: 6   STSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY- 63
           +  + N L++ ++ PA  + T ++ IGNG +G +V+GG+  + + +NE T+W G P    
Sbjct: 39  NAKSDNLLRLWYDKPASDWQTQSLAIGNGYMGGLVFGGINQDRIHINEKTVWEGGPDGKS 98

Query: 64  ------TNPDAPKA--------LSDVRSLVDSGQYAEATAASVKLFGHPADVYQ------ 103
                 TNP + +         L+++R  +D          S  +FG   + YQ      
Sbjct: 99  TYSYGTTNPISTEEDLQKIKDNLNEIRQKLDD--------KSEHVFGFDENSYQASGTDT 150

Query: 104 ----------LLGDIELEFDDSHLKYAE------------ETYRRELDLNTATARVKYSV 141
                     L+GD  L+  D+   YA               Y R+LD+ TA A V Y  
Sbjct: 151 KGEAMDALNKLMGD--LKGYDAPTDYANLYISNDQDPSKVTNYVRDLDMRTALATVSYDY 208

Query: 142 GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG 201
             V + RE+F+S PD ++  ++S  + G +SF  +L++L+   +Y N     ++ G    
Sbjct: 209 EGVHYCREYFNSYPDNIMAVRLSADKDGKISFKTNLENLIGGDAYTN-----VVRGDTIT 263

Query: 202 KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK---KLKVEGSDWAVLLLVASS 258
            R        D  +G    A  ++K+ ++ G+IS+ E+     ++V G++   L+    +
Sbjct: 264 MR--------DALRGNGLKAEAQLKVINEGGSISSDENDGKPAIRVSGANAVTLIFACGT 315

Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
            +      P+   +DP       +Q+     Y  L   H++D+  LF R+ +        
Sbjct: 316 DYKMEL--PNFRGEDPHKAVKKRIQAAAKKGYQVLKKDHVEDHSALFSRMELGFDEEIPQ 373

Query: 319 IVTD-------TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
           I TD          E N   +P         + E  +L  + +QFGRYL I+ SR G+  
Sbjct: 374 IPTDELIRRYRNMVENNGGQIP--------MSAEQRALEVMCYQFGRYLTIAGSREGSLP 425

Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
            NLQG+W E    TW    H NIN++MNYW ++  NL EC +P  DFL  L   G   A 
Sbjct: 426 TNLQGVWGEGFF-TWYGDYHFNINVQMNYWPTMASNLGECMKPYNDFLNVLKEAGRNAAA 484

Query: 432 VNY-------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            +Y         +GW++   +  +  S+  +        P+G AW   + +E+Y YT D 
Sbjct: 485 ASYGIKSREGEENGWLVGCFSTPYMFSALGQKNNAAGWNPIGSAWALLNSYEYYLYTGDT 544

Query: 485 DFLEKRAYPLLEGCASF---LLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
            +L ++ YP ++  A+F    L W  E    Y+ + PS SPE+           +   ++
Sbjct: 545 QYL-RQLYPSMKEVANFWNKALYW-SEYQQRYV-SAPSYSPEN---------GPIVNGAS 592

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRLNTS 601
            D   I +     I AAE L  + D LV +  +   +L P  + + G + EW +    TS
Sbjct: 593 YDQQFIWQHLENTIHAAETLGLDGD-LVAEWKEKQSKLDPVIVGKSGQVKEWFEE---TS 648

Query: 602 FSTCK 606
           F   +
Sbjct: 649 FGKAQ 653


>gi|383113206|ref|ZP_09933980.1| hypothetical protein BSGG_4923 [Bacteroides sp. D2]
 gi|313697388|gb|EFS34223.1| hypothetical protein BSGG_4923 [Bacteroides sp. D2]
          Length = 765

 Score =  204 bits (520), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 179/612 (29%), Positives = 285/612 (46%), Gaps = 120/612 (19%)

Query: 13  LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
           +K+ +  PA+++ T A+PIGNG LG + +GG+  E L+ NE TLWTG             
Sbjct: 32  MKLWYTRPAQNWMTSALPIGNGELGGLFFGGIACERLQFNEKTLWTG------------- 78

Query: 72  LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
            S+ +                         YQ  G++ ++F + + +  +  Y REL L+
Sbjct: 79  -SETKR----------------------GAYQSFGNLYIDFAEHNGEAVD--YCRELCLD 113

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISG-SESGSLSFNVSLDSLLDNHSYVNGN 190
            A   V Y +  V++ RE+F+S PD+VIV +I+     G L+ +V L+   D+H      
Sbjct: 114 NAIGSVSYEMNGVKYRREYFASYPDRVIVMRITTPGMKGRLNLSVRLE---DSHF----- 165

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQ-----FSAILEIKISDDRGTISALEDKKLKVE 245
                           + + N +  GIQ      S   ++K+ +++G +S + D +L V 
Sbjct: 166 ---------------GQLSVNKNILGIQGQLDLLSYDAQVKVLNEKGQLSVV-DNRLTVC 209

Query: 246 GSDWAVLLLVASSSFDGPFINPSD----SKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
            +D   +LLVA ++F+   I+ +D    S +D   E  + L +    +Y+ L   HL DY
Sbjct: 210 DADAVTILLVAGTNFN---ISATDYLGTSSEDLHKELYTRLSNASRKNYAALKNIHLKDY 266

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
           Q LF RV + L             + ++   P+ E V++ +  E   L  L FQ+GRYL+
Sbjct: 267 QSLFSRVKLDL-------------QADMPEYPTDELVRNHK--ESRYLDMLYFQYGRYLM 311

Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
           + SSR      NLQGIWN D +P W+   H NIN++MNYW +   NL EC  P   FL Y
Sbjct: 312 LGSSRGMNLPNNLQGIWNADNTPPWECDIHSNINIQMNYWPAEITNLPECHLP---FLQY 368

Query: 422 LSI------NGS--KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
           +++      NGS  + AQ   L  GW I  + +I+  S        W +     AW CTH
Sbjct: 369 IAVEAVGKPNGSWRRIAQGEGL-RGWTIKTQNNIFGYSD-------WNINRPANAWYCTH 420

Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DG 531
           LW+HY Y  D ++L   A+P+++    +  D L E  DG L      SPE     P  DG
Sbjct: 421 LWQHYAYNNDLEYLRNIAFPVMQSTCKYWFDRLKENKDGKLVAPDEWSPEQ---GPWEDG 477

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSI 590
               V+Y+  +   +  E   A+ +  +V  + ++  V ++     +L     +   G I
Sbjct: 478 ----VAYAQQLVWQLFNETLHAVEALKKVDIQIDNVFVSELADKFRKLDNGVSVGSWGQI 533

Query: 591 MEWVQRRLNTSF 602
            EW + +    F
Sbjct: 534 KEWKEDKGKLDF 545


>gi|325855022|ref|ZP_08171738.1| hypothetical protein HMPREF9303_0271 [Prevotella denticola CRIS
           18C-A]
 gi|325484000|gb|EGC86940.1| hypothetical protein HMPREF9303_0271 [Prevotella denticola CRIS
           18C-A]
          Length = 753

 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 148/513 (28%), Positives = 228/513 (44%), Gaps = 73/513 (14%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
            T  +PIGNG+ GA + G V  + ++ N+ TLW+G  G  T            S  D G 
Sbjct: 1   MTSCLPIGNGQFGATLMGQVAVDDVQFNDKTLWSGKLGGLT------------STADYGS 48

Query: 84  YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
           Y          FG+              F  SH       Y R LD+N A A V++ +  
Sbjct: 49  YLN--------FGNL-------------FISSHGMKKVTDYVRYLDINNAVAGVQFCMDG 87

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY----VNGNNQ--IIMEG 197
           V + R +F+SNPD  IV + + S+ G +S  ++L  +  N  Y    V+  NQ  I  +G
Sbjct: 88  VAYRRTYFASNPDSCIVIRYTASQRGKISTTLAL--MDQNGGYVRYVVDKVNQATITFDG 145

Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
           +         A   D       S     ++  + G +       ++V  +D   + L   
Sbjct: 146 QI--------ARQKDGGAATPESYCCTARVVTEGGKVRKNAKGLIEVSNADCMTIYLRGL 197

Query: 258 SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
           + FD              S + + + S +   Y+ L   H  DY+ LF R    L  S  
Sbjct: 198 TDFDPDAPEYVAGSGRLASRAAATVDSAQRKGYAALLAAHKADYRSLFDRCQFTLGDSKA 257

Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLISSSRPGTQVANLQ 375
           DI T              + + S++ +   +L   EL F +GRYLLISSSR  +  ANLQ
Sbjct: 258 DIST-------------PQLISSYRDNPHDNLFLEELYFSYGRYLLISSSRGISLPANLQ 304

Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL---TYLSINGSKTAQ- 431
           GIWN   +P W +  H NIN++MNYW + P NLSE   P  D++     +  +  + A+ 
Sbjct: 305 GIWNNSNTPAWHADIHANINVQMNYWPAEPTNLSELHRPFLDYIYREACVRPSWHRFAKD 364

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
           + ++ +GW +  + +I+       G      + +  AW C HLW+HY YTMDR++L  RA
Sbjct: 365 MGHVDAGWTLPTENNIYGS-----GTTFADTYTVANAWYCQHLWQHYMYTMDREYLRTRA 419

Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
           + +++    + L  L++  DG  E     SPEH
Sbjct: 420 FSVMKSAVDYWLRKLVKASDGTYECPDEWSPEH 452


>gi|358383160|gb|EHK20828.1| glycoside hydrolase family 95 protein [Trichoderma virens Gv29-8]
          Length = 791

 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 154/593 (25%), Positives = 273/593 (46%), Gaps = 51/593 (8%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVR 76
           P         IGNGR G +  G   ++ L LN+D++W G P     YT  +   +L+   
Sbjct: 29  PGNVLMTGYTIGNGRQGGLPLGIPGNDLLCLNDDSIWRGGPFANSSYTGGNPSSSLAHFL 88

Query: 77  SLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
             +    +   T    +L+G  AD   Y+ L ++ +        Y++  Y+R LDL TA 
Sbjct: 89  PGIQEAIFQNGTGDESELYGGTADYGSYEALANLTVSIAGV-TNYSK--YKRTLDLETAL 145

Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNVSLDSLLDNHSYVNGNNQI 193
              +++     F+   F S PDQV V  +S ++    ++F      L+DN+     N   
Sbjct: 146 HSAEFTANGATFSTVQFCSFPDQVCVYHVSSNKPLPQITF-----GLVDNYRT---NPPS 197

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK---LKVEGSDWA 250
            ++    G  +  +  AND    I      + +     G  +    +    L  + +  A
Sbjct: 198 TVKCSSSGIWLSGRTVANDGEGLIGMKIDAQARALPSAGLKAICNSQGQTVLSTKSAKSA 257

Query: 251 VLLLVASSSFDGPFINPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
            +++ + + +D    N + +      DP    +  + ++   SY+ +   H+ D+ + F+
Sbjct: 258 TIVVASGTEYDATKGNAAHNYSFRGVDPYPGVVKTINAVSKKSYNTILQSHVKDHGEWFN 317

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
           + ++ L         D  +  ++DT+   E + ++ T++ DP +  LL ++G+Y+ I+SS
Sbjct: 318 KFTLDLP--------DPHNSADVDTM---ELLTNYTTEKGDPFVENLLIEYGQYMFIASS 366

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI- 424
           RPG+   NLQG W  D +P W S  H+++N++MN+W      L    +PL+DF+TY  + 
Sbjct: 367 RPGSLPPNLQGSWAPDGNPAWSSDYHIDVNVQMNHWHVEKMGLGGLTDPLWDFMTYTWVP 426

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
            G++TA + Y  SGWV    T+I+   +A      W+      AW+  H+W+ Y+Y  D+
Sbjct: 427 RGTETASLWYNVSGWVAFTNTNIFGH-TAQENDATWSNVAHDIAWMMAHVWDRYDYGRDK 485

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
            +     YPL++G ASF +D ++      DG L  NP  SPEH    P     C  +   
Sbjct: 486 KWYASVGYPLMKGVASFWVDMMVPDEYFKDGTLVANPCNSPEH---GPT-TFGCAQFQQ- 540

Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
               ++ E+F  II   +     + A +++V +S  +L P   +   G I EW
Sbjct: 541 ----VVWELFDHIIKDWDASGDTDTAFLKRVKESYSKLDPGVHVGSWGQIQEW 589


>gi|421287924|ref|ZP_15738687.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA58771]
 gi|395886487|gb|EJG97503.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA58771]
          Length = 717

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 149/513 (29%), Positives = 241/513 (46%), Gaps = 51/513 (9%)

Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
            Y   GDI +EF       ++ T Y+R+L+++ A     Y      F RE F+S PD ++
Sbjct: 27  TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLL 86

Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
           V + +   + +L F + L    D  S      +      C        I  K    D+  
Sbjct: 87  VQRFTKEGAETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145

Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
            ++F++ L  +     G I    D+ +++ G+ +A L L A + F     +    K D  
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200

Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
            + +  + + +   Y+ L +RH++DYQ LF RV + L             E N+D   + 
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EANVDAFTTD 247

Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
           + +K+++  E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLN 307

Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
           +NL+MNYW +   NL E   P+ +++  L + G + A V Y          +GW++H + 
Sbjct: 308 VNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 366

Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
               W     D     W   P   AW+   ++E Y++  D+D+L ++ YP+L     F  
Sbjct: 367 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 423

Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
            +L +        ++PS SPEH           +S  +T D ++I ++F   I AA+ L 
Sbjct: 424 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELG 474

Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +ED L E   KS   L P +I + G I EW +
Sbjct: 475 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506


>gi|358381765|gb|EHK19439.1| glycoside hydrolase family 95 protein [Trichoderma virens Gv29-8]
          Length = 788

 Score =  202 bits (515), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 163/596 (27%), Positives = 271/596 (45%), Gaps = 59/596 (9%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KAL 72
           PA     A P+GNG+LGAM  G V  + + LNE +LW+G P    DY   NP AP   AL
Sbjct: 29  PANIIMTAYPLGNGKLGAMPLGLVGEDIVVLNEHSLWSGGPFESPDYIGGNPPAPVYTAL 88

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEETYRREL 128
             +R  + + Q     +A   L+G P       Y+ LG++ ++      +Y+  +Y R L
Sbjct: 89  PGIRETIWNTQINNDISA---LYGDPTYYHYGNYETLGNLTVKIAGVS-RYS--SYNRAL 142

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           DL T   +  ++    +FT   F + PDQV    +  ++            L DN     
Sbjct: 143 DLETGIHQTAFTSNGAKFTITTFCTFPDQVCAYNVQSNKP----LPAVTIGLQDNQ---- 194

Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
             +       C    +  +     D  G+ F A  ++     + T ++  +  +  +G  
Sbjct: 195 -RSSPSSNSSCDANGVRLRGQTQQD-IGMIFDARAQVLNRPRKATCTSSHELLVPSDGKT 252

Query: 249 WAV-LLLVASSSFD----GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
            +V ++  A +++D        N S    DP    +S +Q++   S+S +Y  H+ D+  
Sbjct: 253 ASVTVVYAAGTNYDQKKGTKASNYSFKGVDPAPAVVSTIQAVEKKSFSSMYNAHVKDHNT 312

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLI 362
           LF + ++ L  S   +           +VP+A  ++++  +  DP +  LLF +GRYL I
Sbjct: 313 LFSQFTLNLPDSEHSV-----------SVPTATLMENYDYNVGDPFVENLLFDYGRYLFI 361

Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
            S R G+   NLQGIW E+  P W S  HV++N++MN+W +    L + Q PL+DF+   
Sbjct: 362 GSCRDGSLPPNLQGIWTENQFPAWSSDYHVDVNVQMNHWHTEQTGLGDIQGPLWDFIIDT 421

Query: 423 SI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
            +  G++TA++ Y A G+V     + +   +      VW+ +P   AWL  ++W  Y+Y 
Sbjct: 422 WVPRGTETAELLYDAPGFVGFSNLNTFG-FTGQMNSAVWSNYPASAAWLMQNVWNRYDYG 480

Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
            D  + +   YPL++  A + +  ++     +DG L   P  SPEH +        C  Y
Sbjct: 481 RDTHWWKTVGYPLMKSVAEYWIHEMVPDLYSNDGTLVAAPCNSPEHGWTT----FGCTHY 536

Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
                  ++ EVF  II + E         +E V ++  +L P   I   G I EW
Sbjct: 537 QQ-----LVWEVFDHIIDSWEDSGDTNTTFLETVKETQSKLSPGIIIGWFGQIQEW 587


>gi|327313293|ref|YP_004328730.1| hypothetical protein HMPREF9137_1029 [Prevotella denticola F0289]
 gi|326946180|gb|AEA22065.1| conserved hypothetical protein [Prevotella denticola F0289]
          Length = 753

 Score =  202 bits (515), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 147/513 (28%), Positives = 229/513 (44%), Gaps = 73/513 (14%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
            T  +PIGNG+ GA + G V  + ++ N+ TLW+G  G  T            S  D G 
Sbjct: 1   MTSCLPIGNGQFGATLMGQVAVDDVQFNDKTLWSGKLGGLT------------STADYGS 48

Query: 84  YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
           Y          FG+              F  SH       Y R LD+N A A V++ +  
Sbjct: 49  YLN--------FGNL-------------FISSHGMRKVTDYVRYLDINNAVAGVQFCIDG 87

Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY----VNGNNQ--IIMEG 197
           V + R +F+S+PD  IV + + S+ G +S  ++L  +  N  Y    V+  NQ  I  +G
Sbjct: 88  VAYRRTYFASSPDSCIVIRYTASQRGKISTTLAL--MDQNGGYVRYVVDKVNQATITFDG 145

Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
           +         A   D       S     ++  + G +       ++V  +D   + L   
Sbjct: 146 QI--------ARQKDGGAATPESYCCTARVVTEGGKVRKNARGLIEVINADCMTVYLRGL 197

Query: 258 SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
           + FD                + + + S +   Y+ L   H  DY+ LF R  + L  S  
Sbjct: 198 TDFDPDAPEYVAGAGRLAGRAAATVDSAQRRGYAALLAAHKADYRSLFDRCQLTLGDSKA 257

Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLISSSRPGTQVANLQ 375
           DI T              + + S++ +   +L   EL F +GRYLLISSSR  +  ANLQ
Sbjct: 258 DIST-------------PQLISSYRDNPHDNLFLEELYFSYGRYLLISSSRGVSLPANLQ 304

Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL---TYLSINGSKTAQ- 431
           GIWN   +P W +  H NIN++MNYW + P NLSE   P  D++     +  +  + A+ 
Sbjct: 305 GIWNNSNTPAWHADIHANINVQMNYWPAEPTNLSELHRPFLDYIYREACVRPSWHRFAKD 364

Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
           + ++ +GW +  + +I+       G      + +  AW C HLW+HY YTMDR++L  RA
Sbjct: 365 MGHVDAGWTLPTENNIYGS-----GTTFADTYTVANAWYCQHLWQHYMYTMDREYLRTRA 419

Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
           +P+++    + L  L++  DG  E     SPEH
Sbjct: 420 FPVMKSAVDYWLRKLVKASDGTYECPDEWSPEH 452


>gi|358390062|gb|EHK39468.1| glycoside hydrolase family 95 protein [Trichoderma atroviride IMI
           206040]
          Length = 797

 Score =  202 bits (515), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 171/637 (26%), Positives = 293/637 (45%), Gaps = 81/637 (12%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDYTNPDA 68
           ++ +  P+  F  ++ +GNGR  A V      ET  LNE T W+G       G    P+ 
Sbjct: 6   RLYYTTPSTSFPTSLALGNGRFAASVLSSPEHETFLLNEVTFWSGEARNAGEGLAERPED 65

Query: 69  PKA-LSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLKYA 120
           PKA L   ++   +G YA+    + K        FG    V +L  DI +     H   A
Sbjct: 66  PKAELRKTQNCYLNGDYAQGKKRAEKYLESKKNNFGTNLGVGKL--DIAV---TGHGNPA 120

Query: 121 E-ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
           + + + REL  + A    +Y V   ++ R  F S+P QV+V +  G +   L   VS   
Sbjct: 121 DIQDFERELRFDEAITETRYKVNGHQYKRRAFLSHPHQVLVIQFDGDDLSGLEVAVS--- 177

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANA-----NDDPKGIQFSAILEIKISDDRGTI 234
                  V G N+          R+   A A     +D   G++   I+  K+++ +   
Sbjct: 178 -------VQGENEAFTSKVNSESRLEFDAQALETVHSDGTCGVKGFGIVAAKVNEGK--- 227

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
              +D KL +       + +  ++ ++       +S+ +    ++  ++ +  L   DL 
Sbjct: 228 VEQKDGKLTISAQKSITIFVAFNTDYN-------ESRNEWRERTLLQIEDVLQLPIDDLL 280

Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVEL 352
             HL DYQ L+ R+ I+L   PK       S  N   +P+ +R  +F++    DP +  L
Sbjct: 281 KEHLGDYQPLYRRMDIRLG--PK-------SNPN-SNIPTDQRRGNFESSGYADPGMFAL 330

Query: 353 LFQFGRYLLISSSRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
            F + RYL I+ +R  + +  +LQG+WN  E     W    H++IN +MNY+  L   L+
Sbjct: 331 YFHYSRYLTIAGTREDSPLPLHLQGLWNDGEACKMGWSCDYHLDINTQMNYFAILNSGLA 390

Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLA-SGWVIHHKTDIWAKSSADRG-KVVWALWPMGG 467
           +  +PL+ ++  L++ G +TA+  Y +  GWV H  ++ W  +  D G ++ + L   GG
Sbjct: 391 DLMKPLYKYIFKLAVKGQQTARTCYGSREGWVAHVFSNAWGFT--DPGWEISYGLNVTGG 448

Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
            W+   L E Y YT+D   +    +PLL G   F LD++IE    G+L T PS SPE+ F
Sbjct: 449 LWMAAPLIEMYEYTLDDGLMMTNLWPLLFGATKFWLDYMIEDPKTGWLLTGPSVSPENSF 508

Query: 527 --IAPDG--KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE----DALVEKVLKSLPR 578
             +  DG  +      S T+D+ ++R++F+     A  L+       D  +++  K L +
Sbjct: 509 FVVNEDGTKEEHSADLSPTLDVVLLRDLFAFCEYFAGKLKTMTGFPWDEDIKEYQKVLAK 568

Query: 579 LRPTKIAEDGSIMEWV---------QRRLNTSFSTCK 606
           L P +I ++G + EW+          R L+ + + C+
Sbjct: 569 LPPLQIGKNGQLQEWLHDYEEAQPYHRHLSHTMALCR 605


>gi|421230262|ref|ZP_15686926.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2061376]
 gi|395593788|gb|EJG54030.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2061376]
          Length = 717

 Score =  202 bits (513), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 150/524 (28%), Positives = 246/524 (46%), Gaps = 73/524 (13%)

Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
            Y   GDI +EF       ++ T Y+R+L+++ A     Y      F RE F+S PD ++
Sbjct: 27  TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLL 86

Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
           V + +   + +L F + L     L  +  Y               ++ I+M+GR      
Sbjct: 87  VQRFTKEGAETLDFTIELSLTCDLASDEKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
                 ND    ++F++ L  +     G I    D+ +++ G+ +A L L A + F    
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189

Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
            +    K D   + +  + + +   Y+ L +RH++DYQ LF RV + L            
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237

Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
            E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296

Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
           +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G + A V Y        
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355

Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
             +GW++H +     W     D     W   P   AW+   ++E Y++  D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412

Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
           P+L     F   +L +        ++PS SPEH           +S  +T D ++I ++F
Sbjct: 413 PMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463

Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
              I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506


>gi|210632036|ref|ZP_03297176.1| hypothetical protein COLSTE_01069 [Collinsella stercoris DSM 13279]
 gi|210159752|gb|EEA90723.1| F5/8 type C domain protein [Collinsella stercoris DSM 13279]
          Length = 1203

 Score =  201 bits (511), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 162/607 (26%), Positives = 276/607 (45%), Gaps = 78/607 (12%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG------DYTNPD---APKALSDVR 76
           DA+ IGNG+ GA+++G V  + +  NE TLWTG P       D  N D       L  +R
Sbjct: 72  DALVIGNGKTGAILFGQVAQDKVHFNEKTLWTGGPSKSRPNYDGGNKDQAVTKHQLDALR 131

Query: 77  SLVDSGQ---YAEATAASVKLFG--HPADVYQLLGDIELEFDDSHLKYAE-ETYRRELDL 130
           + +D      +   T    +++G  +    YQ  GD+E +F       +  + Y R+LD+
Sbjct: 132 AKMDDHSKDVFPMGTQIPTEVWGDGNGMGAYQDFGDLEFDFSPMGATNSNIQNYERDLDM 191

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
            TA + V Y    V +TRE+ +S+P  V+  ++  S+ G +SF++ + S    +   + +
Sbjct: 192 RTAVSTVSYDFNGVHYTREYLASHPAGVVAVRLDASKDGEISFDLGVGSAKGLNVRASAD 251

Query: 191 -NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
              +++ G      +  +  A   P+G               G+I A E     V  +D 
Sbjct: 252 AGDLVLAGNVADNGMLCEMRARVLPEG---------------GSIKASESGGFSVRDADA 296

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS----IRNLSYSDLYTRHLDDYQKLF 305
             +L    + ++  +  PS        +  +AL+        +SY +L  +H+DD++ LF
Sbjct: 297 VTVLYATETDYENAY--PSYRSGQTLEQVDAALKEKLDVAAGISYDELKKQHIDDHRSLF 354

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISS 364
            RV I L   P    TD             + +K ++  + DP + E+LFQFGRYL I+S
Sbjct: 355 ERVEIDLGGVPAQKPTD-------------QMMKDYRAGNNDPFIEEMLFQFGRYLTIAS 401

Query: 365 SRPGTQV-ANLQGIW-NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
           SR G ++ +NL GIW   D    W    H N+N++MNYW +   NLSEC     D++  L
Sbjct: 402 SREGDELPSNLCGIWMMGDAGRFWGGDFHFNVNVQMNYWPAYMTNLSECGSVFTDYMESL 461

Query: 423 SINGSKTAQVNYL-------------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
            + G  TA+ +                 G++++ + + +   +A  G   +     G +W
Sbjct: 462 VVPGRVTAERSAAMKTENHATTPVGQGKGFLVNTQNNPFG-CTAPFGSQEYGWNVTGSSW 520

Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIA 528
              ++++ Y +T D + L  R YP+L+   +F   +L    +   L   PS S E     
Sbjct: 521 ALQNVYDEYLFTRDENLLRTRIYPMLKEMTTFWDGFLWWSDYQKRLVVGPSFSAEQ---- 576

Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
                      ST D +++ E+++  I A+E L  +ED L  +  K+  +L P  I E+G
Sbjct: 577 -----GPTVNGSTYDQSLVWELYTMAIDASERLGVDED-LRAEWKKTRDKLNPIIIGEEG 630

Query: 589 SIMEWVQ 595
            + EW +
Sbjct: 631 QVKEWFE 637


>gi|148984088|ref|ZP_01817383.1| hypothetical protein CGSSp3BS71_02667 [Streptococcus pneumoniae
           SP3-BS71]
 gi|418232655|ref|ZP_12859241.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA07228]
 gi|418237110|ref|ZP_12863676.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA19690]
 gi|147923377|gb|EDK74490.1| hypothetical protein CGSSp3BS71_02667 [Streptococcus pneumoniae
           SP3-BS71]
 gi|353885968|gb|EHE65752.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA07228]
 gi|353891548|gb|EHE71302.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA19690]
          Length = 717

 Score =  201 bits (511), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 152/524 (29%), Positives = 245/524 (46%), Gaps = 73/524 (13%)

Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
            Y   GDI +EF       ++ T Y+R+L+++ A A   Y      F RE F+S PD ++
Sbjct: 27  TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVCKGTRFEREAFASFPDDLL 86

Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
           V   +     +L F + L     L  N  Y               ++ I+M+GR      
Sbjct: 87  VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
                 ND    ++F++ L  +     G I    D+ +++ G+ +A L L A + F    
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189

Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
            +    K D   + +  + + +   Y+ L +RH++DYQ LF RV + L            
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237

Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
            E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296

Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
           +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G + A V Y        
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355

Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
             +GW++H +     W     D     W   P   AW+   ++E Y++  D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412

Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
           P+L     F   +L +        ++PS SPEH           +S  +T D ++I ++F
Sbjct: 413 PMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463

Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
              I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506


>gi|418157949|ref|ZP_12794665.1| hypothetical protein SPAR41_1732 [Streptococcus pneumoniae GA16833]
 gi|353824397|gb|EHE04571.1| hypothetical protein SPAR41_1732 [Streptococcus pneumoniae GA16833]
          Length = 692

 Score =  201 bits (511), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 152/524 (29%), Positives = 245/524 (46%), Gaps = 73/524 (13%)

Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
            Y   GDI +EF       ++ T Y+R+L+++ A A   Y      F RE F+S PD ++
Sbjct: 27  TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86

Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
           V   +     +L F + L     L  N  Y               ++ I+M+GR      
Sbjct: 87  VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
                 ND    ++F++ L  +     G I    D+ +++ G+ +A L L A + F    
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189

Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
            +    K D   + +  + + +   Y+ L +RH++DYQ LF RV + L            
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237

Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
            E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296

Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
           +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G + A V Y        
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355

Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
             +GW++H +     W     D     W   P   AW+   ++E Y++  D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412

Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
           P+L     F   +L +        ++PS SPEH           +S  +T D ++I ++F
Sbjct: 413 PMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463

Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
              I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506


>gi|421299116|ref|ZP_15749803.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA60080]
 gi|395900587|gb|EJH11525.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA60080]
          Length = 717

 Score =  201 bits (511), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 153/524 (29%), Positives = 245/524 (46%), Gaps = 73/524 (13%)

Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
            Y   GDI +EF       ++ T Y+R+L+++ A A   Y      F RE F+S PD ++
Sbjct: 27  TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86

Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
           V   +     +L F + L     L  N  Y               ++ I+M+GR      
Sbjct: 87  VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
                 ND    ++F++ L  +     G I    D+ +++ G+ +A L L A + F    
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189

Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
            +    K D   + +  + + +   Y+ L +RH++DYQ LF RV + L            
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237

Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
            E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296

Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
           +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G + A V Y        
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355

Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
             +GW++H +     W     D     W   P   AW+   ++E Y++  D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412

Query: 493 PLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
           P+L     F   +L  E       ++PS SPEH           +S  +T D ++I ++F
Sbjct: 413 PMLRETVCFWNAFLHKEQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463

Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
              I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506


>gi|418112994|ref|ZP_12749994.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA41538]
 gi|418153393|ref|ZP_12790131.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA16121]
 gi|418155639|ref|ZP_12792366.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA16242]
 gi|419513045|ref|ZP_14052677.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA05578]
 gi|419517252|ref|ZP_14056868.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA02506]
 gi|421283791|ref|ZP_15734577.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA04216]
 gi|353783356|gb|EHD63785.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA41538]
 gi|353816944|gb|EHD97152.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA16121]
 gi|353819888|gb|EHE00077.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA16242]
 gi|379634210|gb|EHZ98775.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA05578]
 gi|379639325|gb|EIA03869.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA02506]
 gi|395880477|gb|EJG91529.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA04216]
          Length = 717

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 152/524 (29%), Positives = 245/524 (46%), Gaps = 73/524 (13%)

Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
            Y   GDI +EF       ++ T Y+R+L+++ A A   Y      F RE F+S PD ++
Sbjct: 27  TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86

Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
           V   +     +L F + L     L  N  Y               ++ I+M+GR      
Sbjct: 87  VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
                 ND    ++F++ L  +     G I    D+ +++ G+ +A L L A + F    
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189

Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
            +    K D   + +  + + +   Y+ L +RH++DYQ LF RV + L            
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237

Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
            E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296

Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
           +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G + A V Y        
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355

Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
             +GW++H +     W     D     W   P   AW+   ++E Y++  D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412

Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
           P+L     F   +L +        ++PS SPEH           +S  +T D ++I ++F
Sbjct: 413 PMLRETVCFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463

Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
              I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506


>gi|320537187|ref|ZP_08037155.1| hypothetical protein HMPREF9554_01893 [Treponema phagedenis F0421]
 gi|320145965|gb|EFW37613.1| hypothetical protein HMPREF9554_01893 [Treponema phagedenis F0421]
          Length = 735

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 175/594 (29%), Positives = 267/594 (44%), Gaps = 74/594 (12%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDA-----PKAL 72
           ++PIGNG +GA ++GG+  E L LNE TLWTG P         G+ T  D          
Sbjct: 57  SLPIGNGFIGASIFGGIRREYLHLNEKTLWTGGPCKKRPNYSGGNKTGVDENGYTPADYF 116

Query: 73  SDVRSLVDSGQYAEATAASVKLFGHPAD----VYQLLGDIELEFDDS-HLKYAE-----E 122
           + +R+L   G+ AEA A   KL G  A      YQ  G   ++F  S H   +E     +
Sbjct: 117 AKIRTLFSEGKDAEAAALCDKLVGEKASEGYGAYQSFGKFFIDFYYSAHTALSEPPAEIK 176

Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
            YRRELDLN A   V+Y     E+ R +F++ P  V+  KI+ S    L  +V  +S   
Sbjct: 177 AYRRELDLNQALVEVRYQYNTTEYRRMYFANYPSNVLAGKITASNP-VLHCSVHFESD-Q 234

Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
             S     N   + G         K   ND    ++F  +L  +I  D   I+   DK +
Sbjct: 235 GGSISYTQNGFTLSG---------KVEDND----LEF--LLRCRIRTD--GITTCSDKGI 277

Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
            +  + +    L +++ +   +  P      P     + L    N S+  L   H+ DY 
Sbjct: 278 SITQASFLEFFLCSATDYSDSY--PKYRTGFPPHIDEANL----NKSFDALLAEHIKDYC 331

Query: 303 KLFHRVSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
            LF R  + + + S  D+ TD    E  +   S +            L +LLFQ+GRYLL
Sbjct: 332 PLFDRCRLNIGQDSEPDMPTDVLLSEYKNGKFSRK------------LEDLLFQYGRYLL 379

Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
           +SSSR    + ANLQG+WN   SP W S  H+NINL+MNYW +    L EC  PL  ++ 
Sbjct: 380 LSSSREKNILPANLQGMWNNSNSPPWASDYHLNINLQMNYWLACVTGLPECCIPLVKYVA 439

Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
            L     +TA+      G ++ H  +     +       W   P    W+  +LW++Y  
Sbjct: 440 ALEKPAERTAKAYTGLDGGLMIHTQNTPFGWTCPGWSFDWGWSPAAFPWILQNLWQYYCA 499

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
           + D   L++  YPL +    F    L+ +     L ++P+ SPEH    P       +  
Sbjct: 500 SGDFTRLKEIIYPLFKKEIQFYTAVLVFDKKQNRLVSSPTYSPEH---GPR------TNG 550

Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           +T + ++I E+F   I AA++  + + AL+ +  K    L+P  I +   I+EW
Sbjct: 551 NTYEQSLIWELFKQGIEAAKLCGEKK-ALIAQWKKVQENLKPIVIGKSRQILEW 603


>gi|386346135|ref|YP_006044384.1| alpha-L-fucosidase [Spirochaeta thermophila DSM 6578]
 gi|339411102|gb|AEJ60667.1| alpha-L-fucosidase 2 precursor [Spirochaeta thermophila DSM 6578]
          Length = 784

 Score =  200 bits (509), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 166/586 (28%), Positives = 259/586 (44%), Gaps = 74/586 (12%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
           PA  + D  P+GNGRL A+V GGV  E + LN + LW G   D    +    +  VR   
Sbjct: 13  PAGVWRDGYPVGNGRLAALVLGGVGEERIHLNHEWLWRGWYRDRVAEERAHLVGWVREAF 72

Query: 80  DSGQYAEATAASVKLFGHPADV---------YQLLGDIELEFDDSHLKYAEETYRRELDL 130
            +G + E T  + + FG    V         YQ  G + L ++       E  YRRELDL
Sbjct: 73  FTGDWEEGTRRANEAFGGGGGVSGRTCRVGAYQPAGTLVLRWEGME----EAEYRRELDL 128

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
                RV+    ++E         P   +  ++SG   G +     +   ++      G+
Sbjct: 129 EEGVVRVRRGE-SLEEVMAVLGGGP---VGVRVSGWGKGWVGLGREVQEGVEVRVEC-GD 183

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFS--AILEIKISDDRGTISALEDKKLKVEGSD 248
            ++ +EGR                +GI +   A++E  +  + G    +E +++ V    
Sbjct: 184 GRVRLEGRFE--------------EGIVWEVLAVVEGGVCREEGKGVWVEGEEVVVWVVV 229

Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
                +  S         PS    +   E   A++            RH++ Y +LF RV
Sbjct: 230 DVWEEVGGSRRR-----LPSYGPPEVPGEGWEAVRR-----------RHVEAYGQLFGRV 273

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
            + +             EE +  +P+  R    + D DP L  LLF +GRYLLISSS PG
Sbjct: 274 RLVVE-----------GEEPL--LPTGRR----RGDPDPLLPVLLFDYGRYLLISSSAPG 316

Query: 369 TQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
             + ANLQG WN  L P WD+  H++INL+MNYW +    L EC  PL  ++  +  +  
Sbjct: 317 CDLPANLQGKWNPLLEPPWDADYHMDINLQMNYWLAEGAGLGECVTPLVRYVVRMMPSAR 376

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           + A+  +   G      +D WA+++ +     W +W    AW+  HL   Y Y+ D  FL
Sbjct: 377 EAARRLFGCRGIWFPLTSDAWARATPE--AYGWDVWVGAAAWMAQHLVWRYLYSGDEGFL 434

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
            +  YP LE  A F  D+L+E  +G L+  PS SPEH +   +G    +  SS +D+ ++
Sbjct: 435 RETVYPFLEEVALFFEDFLVEDGEGVLQVVPSQSPEHRWEGLEGFPVGLCVSSAVDVQLV 494

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           R V    +     L  +E +   ++   L RLR   +  DG ++EW
Sbjct: 495 RWVLRMAVELGGRL-GDEVSRWREMEGRLARLR---VGRDGVLLEW 536


>gi|417677381|ref|ZP_12326788.1| hypothetical protein SPAR148_1583 [Streptococcus pneumoniae
           GA17545]
 gi|418226036|ref|ZP_12852664.1| hypothetical protein SPAR141_1569 [Streptococcus pneumoniae NP112]
 gi|419467267|ref|ZP_14007148.1| BH0842-like protein [Streptococcus pneumoniae GA05248]
 gi|332072822|gb|EGI83303.1| hypothetical protein SPAR148_1583 [Streptococcus pneumoniae
           GA17545]
 gi|353881233|gb|EHE61047.1| hypothetical protein SPAR141_1569 [Streptococcus pneumoniae NP112]
 gi|379543014|gb|EHZ08166.1| BH0842-like protein [Streptococcus pneumoniae GA05248]
          Length = 692

 Score =  200 bits (509), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 152/524 (29%), Positives = 245/524 (46%), Gaps = 73/524 (13%)

Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
            Y   GDI +EF       ++ T Y+R+L+++ A A   Y      F RE F+S PD ++
Sbjct: 27  TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86

Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
           V   +     +L F + L     L  N  Y               ++ I+M+GR      
Sbjct: 87  VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
                 ND    ++F++ L  +     G I    D+ +++ G+ +A L L A + F    
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189

Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
            +    K D   + +  + + +   Y+ L +RH++DYQ LF RV + L            
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237

Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
            E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296

Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
           +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G + A V Y        
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355

Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
             +GW++H +     W     D     W   P   AW+   ++E Y++  D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412

Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
           P+L     F   +L +        ++PS SPEH           +S  +T D ++I ++F
Sbjct: 413 PMLRETVCFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463

Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
              I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506


>gi|391873884|gb|EIT82888.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus oryzae 3.042]
          Length = 513

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 119/304 (39%), Positives = 163/304 (53%), Gaps = 21/304 (6%)

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           D++ L  RV + L+ S         +  N+ T    ER K+   D DP LV L+FQFGRY
Sbjct: 15  DHEALAGRVHLDLASS--------GAAGNLPTDVRLERYKT-HPDADPELVTLMFQFGRY 65

Query: 360 LLISSSR-PGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
            LI+SSR  GT     NLQG+WNED  P W     VNINLEMNYW +   NL+E   PL 
Sbjct: 66  SLIASSRETGTSPLPPNLQGLWNEDYEPAWGGRYTVNINLEMNYWPAGVTNLAETLGPLI 125

Query: 417 DFLTYLSINGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             L  +   G   A+  Y     G+V+HH TDIW  +        W +WPMGGAWL  +L
Sbjct: 126 FLLETVKPRGQDIARRMYNCDNGGYVLHHNTDIWGDAVPVNNGTKWTMWPMGGAWLSANL 185

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD---- 530
            E+Y +T D + L++R +PLL   A F   ++    +GYL T PS+SPE+ F+ P+    
Sbjct: 186 MEYYRFTQDTNLLKERIWPLLRSAAQFYHCYVFS-FNGYLSTGPSSSPENAFVVPNDMSK 244

Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
            G    +  + TMD  ++ E+F +II   +VL  N +    K   SLP ++  +I   G 
Sbjct: 245 SGNEEGIDIAPTMDNTLLSELFHSIIETGKVLGIN-NTDTTKAASSLPLIKLPQIGSYGQ 303

Query: 590 IMEW 593
           I+EW
Sbjct: 304 ILEW 307


>gi|83764453|dbj|BAE54597.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 513

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 119/304 (39%), Positives = 163/304 (53%), Gaps = 21/304 (6%)

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           D++ L  RV + L+ S         +  N+ T    ER K+   D DP LV L+FQFGRY
Sbjct: 15  DHEALAGRVHLDLASS--------GAAGNLPTDVRLERYKT-HPDADPELVTLMFQFGRY 65

Query: 360 LLISSSRP-GTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
            LI+SSR  GT     NLQG+WNED  P W     VNINLEMNYW +   NL+E   PL 
Sbjct: 66  SLIASSRKTGTSPLPPNLQGLWNEDYEPAWGGRYTVNINLEMNYWPAGVTNLAETLGPLI 125

Query: 417 DFLTYLSINGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             L  +   G   A+  Y     G+V+HH TDIW  +        W +WPMGGAWL  +L
Sbjct: 126 FLLETVKPRGQDIARRMYNCDNGGYVLHHNTDIWGDAVPVNNGTKWTMWPMGGAWLSANL 185

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD---- 530
            E+Y +T D + L++R +PLL   A F   ++    +GYL T PS+SPE+ F+ P+    
Sbjct: 186 MEYYRFTQDTNLLKERIWPLLRSAAQFYHCYVFS-FNGYLSTGPSSSPENAFVVPNDMSE 244

Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
            G    +  + TMD  ++ E+F +II   +VL  N +    K   SLP ++  +I   G 
Sbjct: 245 SGNEEGIDIAPTMDNTLLSELFHSIIETGKVLGIN-NTDTTKAASSLPLIKLPQIGSYGQ 303

Query: 590 IMEW 593
           I+EW
Sbjct: 304 ILEW 307


>gi|418189889|ref|ZP_12826401.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47373]
 gi|419493782|ref|ZP_14033507.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47210]
 gi|353853616|gb|EHE33597.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47373]
 gi|379592355|gb|EHZ57171.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47210]
          Length = 717

 Score =  199 bits (507), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 151/524 (28%), Positives = 243/524 (46%), Gaps = 73/524 (13%)

Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
            Y   GDI +EF       ++ T Y+R+L+++ A A   Y      F RE F+S PD ++
Sbjct: 27  TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86

Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
           V   +     +L F + L     L  N  Y               ++ I+M+GR      
Sbjct: 87  VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143

Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
                 ND    ++F++ L  +   D    S     ++++ G+ +A L L A + F    
Sbjct: 144 ------ND----LRFASYLAWETDGDIRVWSY----RVQISGASYANLFLAAKTDFAQNP 189

Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
            +    K D   +    + + +   Y+ L +RH++DYQ LF RV + L            
Sbjct: 190 ASNYRKKLDLEQQVRDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237

Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
            E ++D   + + +K+++  E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296

Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
           +P W+S  H+N+NL+MNYW +   NL E   P+ +++  L + G + A V Y        
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355

Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
             +GW++H +     W     D     W   P   AW+   ++E Y++  D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412

Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
           P+L     F   +L +        ++PS SPEH           +S  +T D ++I ++F
Sbjct: 413 PMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463

Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
              I AA+ L  +ED L E   KS   L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506


>gi|336427815|ref|ZP_08607806.1| hypothetical protein HMPREF0994_03812 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336008564|gb|EGN38577.1| hypothetical protein HMPREF0994_03812 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 377

 Score =  199 bits (507), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 135/409 (33%), Positives = 206/409 (50%), Gaps = 46/409 (11%)

Query: 13  LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
           +K+ ++ PA+ + +A+PIGNGR+G MV GG+  E ++LNED++W+G   +  NPDA + L
Sbjct: 1   MKLWYDKPARFWHEALPIGNGRMGGMVHGGITRELIQLNEDSVWSGKHLNRINPDAKENL 60

Query: 73  SDVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
             +R L+  G+  EA   A   L G P     YQ  G+  L+    H     + YRREL+
Sbjct: 61  PVIRKLIREGRVEEAQQLAMYALSGVPNSQRSYQTAGECCLQM---HHGDEVQDYRRELE 117

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L    +RV Y+V  V + RE + S P+  +V  +   +  + SF+  L      H+  + 
Sbjct: 118 LAEGISRVAYTVQGVRYIRESYVSYPENCMVMVLKTEDGTAFSFDCLLGRC---HNATDE 174

Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
             ++     C           +   +GI F+A L  K     G    +  + L V     
Sbjct: 175 VEKVDEHTIC--------FTVDGGQEGISFAAALCAKAV---GGFVRVIGEHLLVRDVQE 223

Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY---SDLYTRHLDDYQKLFH 306
           A L L   +SF       +D +K         L  IR  +    +D+   H +D+  +F+
Sbjct: 224 AYLYLDIETSF-----READYRK-------VCLDRIRTAAVKEEADIRALHKEDFGSVFN 271

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
           R+++    +  D+          + +P+ ER++  Q  E D  L+EL FQ+GRYLL+SSS
Sbjct: 272 RLALSFELTDADL----------EQIPTDERLRRVQAGERDMGLMELYFQYGRYLLMSSS 321

Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
           R G+  ANLQGIWN+ L P W+S   +NIN EMNYW +   NLSECQ P
Sbjct: 322 RKGSLPANLQGIWNDKLYPVWESKFTININTEMNYWIAGSGNLSECQLP 370


>gi|418108082|ref|ZP_12745119.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA41410]
 gi|419423496|ref|ZP_13963709.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA43264]
 gi|353778359|gb|EHD58827.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA41410]
 gi|379586068|gb|EHZ50922.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA43264]
          Length = 717

 Score =  199 bits (506), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 149/513 (29%), Positives = 240/513 (46%), Gaps = 51/513 (9%)

Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
            Y   GDI +EF       ++ T Y+R+L+++ A A   Y      F RE F+S PD ++
Sbjct: 27  TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86

Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
           V   +     +L F + L    D  S      +      C        I  K    D+  
Sbjct: 87  VQCFTKEGLETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145

Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
            ++F++ L  +     G I    D+ +++ G+ +A L L A + F     +    K D  
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200

Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
            + +  + + +   Y+ L +RH++DYQ LF RV + L             E ++D   + 
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EADVDASTTD 247

Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
           + +K+++  E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLN 307

Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
           +NL+MNYW +   NL E   P+ +++  L + G + A V Y          +GW++H + 
Sbjct: 308 VNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 366

Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
               W     D     W   P   AW+   ++E Y++  D+D+L ++ YP+L     F  
Sbjct: 367 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 423

Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
            +L +        ++PS SPEH           +S  +T D ++I ++F   I AA+ L 
Sbjct: 424 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELG 474

Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +ED L E   KS   L P +I + G I EW +
Sbjct: 475 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506


>gi|417694531|ref|ZP_12343718.1| hypothetical protein SPAR120_1586 [Streptococcus pneumoniae
           GA47901]
 gi|418110621|ref|ZP_12747640.1| hypothetical protein SPAR113_1694 [Streptococcus pneumoniae
           GA49447]
 gi|332201080|gb|EGJ15151.1| hypothetical protein SPAR120_1586 [Streptococcus pneumoniae
           GA47901]
 gi|353781242|gb|EHD61687.1| hypothetical protein SPAR113_1694 [Streptococcus pneumoniae
           GA49447]
          Length = 692

 Score =  199 bits (506), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 149/513 (29%), Positives = 240/513 (46%), Gaps = 51/513 (9%)

Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
            Y   GDI +EF       ++ T Y+R+L+++ A A   Y      F RE F+S PD ++
Sbjct: 27  TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86

Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
           V   +     +L F + L    D  S      +      C        I  K    D+  
Sbjct: 87  VQCFTKEGLETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145

Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
            ++F++ L  +     G I    D+ +++ G+ +A L L A + F     +    K D  
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200

Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
            + +  + + +   Y+ L +RH++DYQ LF RV + L             E ++D   + 
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EADVDASTTD 247

Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
           + +K+++  E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLN 307

Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
           +NL+MNYW +   NL E   P+ +++  L + G + A V Y          +GW++H + 
Sbjct: 308 VNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 366

Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
               W     D     W   P   AW+   ++E Y++  D+D+L ++ YP+L     F  
Sbjct: 367 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 423

Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
            +L +        ++PS SPEH           +S  +T D ++I ++F   I AA+ L 
Sbjct: 424 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELG 474

Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +ED L E   KS   L P +I + G I EW +
Sbjct: 475 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506


>gi|421218284|ref|ZP_15675178.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2070335]
 gi|395583053|gb|EJG43502.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2070335]
          Length = 692

 Score =  199 bits (505), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 149/513 (29%), Positives = 240/513 (46%), Gaps = 51/513 (9%)

Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
            Y   GDI +EF       ++ T Y+R+L+++ A A   Y      F RE F+S PD ++
Sbjct: 27  TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86

Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
           V   +     +L F + L    D  S      +      C        I  K    D+  
Sbjct: 87  VQCFTKEGLETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145

Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
            ++F++ L  +     G I    D+ +++ G+ +A L L A + F     +    K D  
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200

Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
            + +  + + +   Y+ L +RH++DYQ LF RV + L             E ++D   + 
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EADVDASTTD 247

Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
           + +K+++  E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDYPDALPANLQGVWNAVDNPPWNSDYHLN 307

Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
           +NL+MNYW +   NL E   P+ +++  L + G + A V Y          +GW++H + 
Sbjct: 308 VNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 366

Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
               W     D     W   P   AW+   ++E Y++  D+D+L ++ YP+L     F  
Sbjct: 367 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 423

Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
            +L +        ++PS SPEH           +S  +T D ++I ++F   I AA+ L 
Sbjct: 424 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFYDFIQAAQELG 474

Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +ED L E   KS   L P +I + G I EW +
Sbjct: 475 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506


>gi|336399821|ref|ZP_08580621.1| Alpha-L-fucosidase [Prevotella multisaccharivorax DSM 17128]
 gi|336069557|gb|EGN58191.1| Alpha-L-fucosidase [Prevotella multisaccharivorax DSM 17128]
          Length = 1111

 Score =  199 bits (505), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 140/541 (25%), Positives = 247/541 (45%), Gaps = 81/541 (14%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
           +++  S +  N   + +  PA+++ T  +PIG+G+ GA + G +  + ++ N+ TLW+G 
Sbjct: 334 VISIASYTPKNKYTLWYTQPAENWMTSCLPIGDGQFGATLMGQIAVDDIQFNDKTLWSGK 393

Query: 60  PGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKY 119
            G  T+ D                           +G     Y   G++ +     H   
Sbjct: 394 LGARTSSDN--------------------------YG----FYLNFGNLYIMSKGMH--- 420

Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-- 177
           +   Y R LD+N A A V ++   V++ R +F+SNPD  IV +   S++G ++  + L  
Sbjct: 421 SATNYVRYLDINDAIAGVNFTSDGVDYQRSYFASNPDSCIVIRYKASQNGHINAVLRLKN 480

Query: 178 ----DSL--LDN--HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
               DS   +DN   + ++ N  I  +G   G  + P+            S +   ++  
Sbjct: 481 QNGKDSCYNIDNSQQATISFNGTIARQGD-SGVTVEPE------------SYVCSARVVI 527

Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
           D G++       ++V G++  ++ L   + +D              +   + +Q  +   
Sbjct: 528 DGGSLKKNSAGLIEVIGANSMIIYLRGLTDYDPDAPQYVSGAALLPTRVAAIVQKAQKKG 587

Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL 349
           Y  L   H  DY++ F R  + LS +  +I             P+   + +++ D   +L
Sbjct: 588 YETLLAAHKADYKQWFDRCQLTLSNAKNNI-------------PTPTLIANYKNDPKANL 634

Query: 350 V--ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
              EL F +GRYLLISSSR  +  ANLQGIWN + +P W +  H NIN++MNYW + P N
Sbjct: 635 FLEELYFSYGRYLLISSSRGVSLPANLQGIWNNNNTPAWHADIHSNINVQMNYWPAEPTN 694

Query: 408 LSECQEPLFDFLTYLSINGSKTAQ----VNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
           LSE   P  +++   +       Q    +  + +GW +  + +I+       G      +
Sbjct: 695 LSELHMPFLNYIYREACVKPTWRQYAKDMGGVNAGWTLPTENNIYGS-----GTTFAPTY 749

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
            +  AW C HLW+HY YT+D+D+L ++A+P ++ C  +    L++ +DG  E     SPE
Sbjct: 750 TIANAWYCQHLWQHYQYTLDKDYLRRQAFPAMKSCVEYWFQKLVKANDGTYECPDEWSPE 809

Query: 524 H 524
           H
Sbjct: 810 H 810


>gi|225019012|ref|ZP_03708204.1| hypothetical protein CLOSTMETH_02963 [Clostridium methylpentosum
           DSM 5476]
 gi|224948237|gb|EEG29446.1| hypothetical protein CLOSTMETH_02963 [Clostridium methylpentosum
           DSM 5476]
          Length = 1657

 Score =  198 bits (503), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 172/621 (27%), Positives = 270/621 (43%), Gaps = 120/621 (19%)

Query: 13  LKITFNGPAKHFTDA------IPIGNGRLGAMVWGGVPSETLKLNEDTLW--TGVPGDYT 64
           LK+ ++ PA + +DA      +P+G G +GA V+G   +E ++L E++L    G  G   
Sbjct: 53  LKLWYDEPAPN-SDAGWEQWSLPLGCGYMGANVFGITDTERIQLTENSLCGNNGFEGGLN 111

Query: 65  NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE-- 122
           N                                              F +++L +  +  
Sbjct: 112 N----------------------------------------------FSETYLDFGHDYS 125

Query: 123 ---TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--- 176
               Y R+L LN ATA V+Y  G V ++RE+F+S PD+V+  K+S SESG LSF +    
Sbjct: 126 GVSNYTRDLILNDATAHVRYDYGGVTYSREYFTSYPDKVMAIKLSASESGKLSFTLRPTI 185

Query: 177 --LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
             L+          G+  I + GR  G  +  +      P G   S         D GTI
Sbjct: 186 PYLNEKKSGTVSAQGDT-ITLSGRMHGYEVDFEGQYKVIPSGGSASMQAANDADGDNGTI 244

Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFD---GPFINPSDSK----KDPTSESMSALQSIRN 287
                   +V G+D AV+L+   ++++     F+NP  +K    + P ++    ++    
Sbjct: 245 --------QVTGADSAVILIAIGTNYEFDPQVFLNPDATKLEGFEHPHAKVTERIEQASA 296

Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DED 346
            SY  L + H  DYQ LF R    L  +   + TD             E + +++    D
Sbjct: 297 QSYEQLRSNHTADYQNLFDRTRFDLGGAVPQLTTD-------------ELMNAYKAGSND 343

Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
             L EL FQ+GRYLLISSSR G    NLQG+WN      W +    NIN++MNYW     
Sbjct: 344 RYLEELYFQYGRYLLISSSRKGALPPNLQGVWNMYEQAPWTAGYWHNINIQMNYWPVFST 403

Query: 407 NLSECQEPLFDFL-TYLSINGSKTAQV-------NYLASGWVIHHKTDIWAKSSADRGKV 458
           NL+E  +   D+   YL    + + Q        NY   G       + W+  +      
Sbjct: 404 NLAELFDSYIDYYNAYLPAVRNSSNQFIAQQHPDNYDPGG------DNGWSIGTGAGPYS 457

Query: 459 VWALWPMG------GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
           V+A    G      GA +    WE+Y++T D D LE   YP + G A+F +  ++E H  
Sbjct: 458 VYAPNGQGTDGNGTGALMAQVFWEYYDFTRDPDILENITYPAVSGAANF-MSRVMEPHGD 516

Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
           YL  +PS SPE      +G    V+  +  D  +  E+    + AAE+L + ++AL +++
Sbjct: 517 YLLADPSASPEQ---MENGNY-VVTVGTAWDQQLAYEMEQNTLEAAELLGRQDEALPQRL 572

Query: 573 LKSLPRLRPTKIAEDGSIMEW 593
              + +L P ++   G I E+
Sbjct: 573 ADQIDKLDPVQVGFSGQIKEF 593


>gi|302405797|ref|XP_003000735.1| alpha-L-fucosidase [Verticillium albo-atrum VaMs.102]
 gi|261360692|gb|EEY23120.1| alpha-L-fucosidase [Verticillium albo-atrum VaMs.102]
          Length = 652

 Score =  197 bits (500), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 153/489 (31%), Positives = 229/489 (46%), Gaps = 50/489 (10%)

Query: 17  FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
           +  P +     +PIGNGRLGA+V+G    E + LNE+++W+G   D  NP +  A   VR
Sbjct: 30  YETPGQDLKSGLPIGNGRLGALVYGSAI-EKITLNENSVWSGPFQDRANPGSLSAFPVVR 88

Query: 77  SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
            L+  G+Y EA   +++ + G P D   Y +  D+ L+F   H +     Y R LD  T 
Sbjct: 89  DLLTKGKYTEAGQLTLRNMTGIPTDTQWYSVTADLFLDF--GHREEGWSGYERWLDTQTG 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL---DSLLDNHSYVNGN 190
                ++   V +TRE  +      I  +++ S+ G+LSFN S      +L N S    +
Sbjct: 147 ITGTVFNWNGVNYTREAVAGADGGAIAMRLTASQHGALSFNTSWYREKGILKNTSSSCAS 206

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
             ++  G              DD   I FS  + +   D  G+I    D  + VEG+   
Sbjct: 207 TLVLDIG-------------GDDAGSIPFSTAVRLVAED--GSIRKGNDSMISVEGATTV 251

Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            + +   +SF   + +    K++ T +   A+++     +  + ++   D+Q L  RV +
Sbjct: 252 DIFVNVETSFR--WASTDKIKEELTRQLDVAVKT----GFDTIKSQAAKDHQSLMKRVEL 305

Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP--- 367
            L  S +  +  T      D   +A RV +     DP  + L F FGR+LLISSSR    
Sbjct: 306 DLGSSSEAGLLTT------DKRIAAYRVNA---TADPEFLTLNFNFGRHLLISSSRASAS 356

Query: 368 -GTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
            G  V ANLQGIWN+   P W S   VNIN EMNYW +   +L E   PL+D L+     
Sbjct: 357 SGMGVPANLQGIWNDMYFPPWGSKYSVNINTEMNYWLAEVTDLPETLPPLWDLLSRTRDK 416

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD-- 483
           G  TA+  Y   GWV HH  DIW  S  +     ++LWP    W+   L E Y ++ D  
Sbjct: 417 GLITAKEMYGCPGWVSHHNLDIWGDSCPNANGTAYSLWPSSNLWMSQQLMERYRFSNDKI 476

Query: 484 ----RDFLE 488
               RD++E
Sbjct: 477 QEWRRDYVE 485


>gi|152968134|ref|YP_001363918.1| twin-arginine translocation pathway signal [Kineococcus
           radiotolerans SRS30216]
 gi|151362651|gb|ABS05654.1| twin-arginine translocation pathway signal [Kineococcus
           radiotolerans SRS30216]
          Length = 808

 Score =  197 bits (500), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 183/617 (29%), Positives = 265/617 (42%), Gaps = 72/617 (11%)

Query: 14  KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
           ++ + GPA  + +A+P+G+GRLGA+ WG    E L LN+D  W+G          P    
Sbjct: 5   RLRYEGPATTWLEALPVGDGRLGAVCWGLADGERLSLNDDRAWSG----------PVGGP 54

Query: 74  DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE----------ET 123
              +  D     EA  A+V L G P    +LL  + +    + L   +            
Sbjct: 55  HHPTPPDHPDRVEAARAAV-LAGDPTRAGELLEPV-VHHTQAFLPVGDLLVTTAAAAAPG 112

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
             R LDL TATA  +  V     T  H +S    V+V +++   +G+    ++L S L  
Sbjct: 113 VVRGLDLGTATAWSQRPVPG--GTVRHETSVGAGVLVHRVTAPGAGT-GLRLALASPLRP 169

Query: 184 HS---YVNGNNQIIMEGRC----PGKRIPPKANANDDP-----KGIQFSAILEIKISDDR 231
                 V   +   +E R     P    P   + ++DP      G     +  +      
Sbjct: 170 AGSTLRVPDGDPGGLEWRTLLDLPEDVHPWHPDQHEDPVRWAAPGTPSRQVAVVVRVRCD 229

Query: 232 GTISALEDKKLKVEGSDWAVL----LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN 287
           GT  A  D    VEG  W  +    ++VA  + D P  +P+     P  E+ +A  +   
Sbjct: 230 GTPRAAPDPAGPVEGPAWDGVREAHVVVAVETPD-PATDPTGR---PDVEAAAARAAAAV 285

Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKDIVTDTCSEENIDTVPSAERVKSFQTDED 346
                +  RH  ++ +LF R  + L  R P    TD               V   + DED
Sbjct: 286 ADPGAVRERHRREHAELFGRSDLDLGGRVPAGTTTDAL-------------VGLAEHDED 332

Query: 347 PSLVELLFQFG--RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
            + V         RYLL++ SRPGT    LQGIWNE+L P W S   +N+NL M YW   
Sbjct: 333 AARVLAALAVAHARYLLVTGSRPGTLPLTLQGIWNEELQPPWSSNYTLNVNLPMAYWPVQ 392

Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK---VVWA 461
           P  L EC EPL  F   L+  G+ TA   Y A GWV HH +D WA++ +  G      W+
Sbjct: 393 PWGLPECAEPLLAFAERLAAAGTATAAEMYGARGWVAHHNSDGWAQTRSVGGGWNDPAWS 452

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTS 521
            WP GG WL  +L +  ++  D   L +R  P++EG   F LD L+   DG L T PSTS
Sbjct: 453 AWPYGGVWLSLNLLDALDFAADPGPLARRVLPVVEGAVRFCLDRLVVLPDGTLGTAPSTS 512

Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA-----EVLEKNEDALVEKVLKSL 576
           PE+ ++   G    V  SST D+ + R + +     A       +  +  A VE  L  L
Sbjct: 513 PENHWLDAAGNAQAVERSSTCDLELTRGLLTGWSRWAGRQTHAPVPADLRAEVEAALAGL 572

Query: 577 PRLRPTKIAEDGSIMEW 593
           P          G ++EW
Sbjct: 573 PH---PGTGARGELLEW 586


>gi|317036568|ref|XP_001397589.2| alpha-fucosidase A [Aspergillus niger CBS 513.88]
          Length = 768

 Score =  196 bits (497), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 167/592 (28%), Positives = 262/592 (44%), Gaps = 86/592 (14%)

Query: 25  TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YT--NPDAPKA--LSDVRS 77
           T A P+GNGRLGAM  G    E + LN D+LW G P +   Y+  NP+  KA  L  +R 
Sbjct: 36  TTAFPLGNGRLGAMPIGSYDKEIVNLNVDSLWRGGPFESPTYSGGNPNVSKAGALPGIRE 95

Query: 78  LVDSGQYAEATAASVKLFG-HPA-DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
            +    +   T     L G +P    YQ+L ++ ++  +            ++D      
Sbjct: 96  WI----FQNGTGNVSALLGEYPYYGSYQVLANLTIDMGE----------LSDID------ 135

Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNV--SLDSLLDNHSYVNGNNQ 192
                       RE F S PD V V ++S + S   ++F +   L S   N S  +GN+ 
Sbjct: 136 --------GYHNREAFCSYPDNVCVYRLSSNSSLPEITFGLENQLTSPAPNVS-CHGNSI 186

Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV-EGSDWAV 251
            +      G+  P          G+ ++A + + +     T        +KV EG     
Sbjct: 187 SLY-----GQTYPVI--------GMIYNARVTVVVPGSSNTTDLCSSSTVKVPEGEKEVF 233

Query: 252 LLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
           L+  A ++++    N   S     ++P  + +    +    SYS L + H+ DYQ +F++
Sbjct: 234 LVFAADTNYEASNGNSKASFSFKGENPYMKVLQTATNAAKKSYSALKSSHVKDYQGVFNK 293

Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
            ++ L                    P+ E + S+    DP +  LLF +GRYL ISSSRP
Sbjct: 294 FTLTLP-----------DPNGSADRPTTELLSSYSQPGDPYVENLLFDYGRYLFISSSRP 342

Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-NG 426
           G+   NLQG+W E  SP W    H NINL+MN+W      L E  EPL+ ++    +  G
Sbjct: 343 GSLPPNLQGLWTESYSPAWSGDYHANINLQMNHWAVDQTGLGELTEPLWTYMAETWMPRG 402

Query: 427 SKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           ++TA++ Y  S GWV H + + +   +A +    WA +P   AW+  H+W+H++Y+ D  
Sbjct: 403 AETAELLYGTSEGWVTHDEMNTFGH-TAMKDVAQWADYPATNAWMSHHVWDHFDYSQDSA 461

Query: 486 FLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           +  +  YP+L+G A F L  L++     DG L  NP  SPEH    P     C  Y    
Sbjct: 462 WYRETGYPILKGAAQFWLSQLVKDEYFKDGTLVVNPCNSPEH---GPT-TFGCTHYQQ-- 515

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
              +I E+F  ++        ++ +    +      L P   I   G I EW
Sbjct: 516 ---LIWELFDHVLQGWTASGDDDTSFKNAITSKFSTLDPGIHIGSWGQIQEW 564


>gi|423281387|ref|ZP_17260298.1| hypothetical protein HMPREF1203_04515 [Bacteroides fragilis HMW
           610]
 gi|404583091|gb|EKA87774.1| hypothetical protein HMPREF1203_04515 [Bacteroides fragilis HMW
           610]
          Length = 402

 Score =  195 bits (496), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 135/400 (33%), Positives = 202/400 (50%), Gaps = 51/400 (12%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAP 69
           N L + +  PA ++ +A+P+GNG LGAMV+G    E L+LNE TL++G P      P   
Sbjct: 22  NNLSLWYRQPAANWNEALPLGNGYLGAMVFGDAGREHLQLNESTLYSGEPFSGVGVPSIG 81

Query: 70  KALSDVRSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
              ++V +L++ G YA A     + + G  +  YQ L D+ L FD   ++   E Y REL
Sbjct: 82  SVYNEVLALLNHGDYAGAHRLITRNWQGRLSQSYQPLADLFLSFD---VQGKVENYVREL 138

Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
           +L  A   ++Y  G + +TRE+F SNPD+V+V +IS S    ++  VS  S         
Sbjct: 139 NLQDAVHTIRYQAGGIRYTREYFISNPDRVMVIRISASRRSPVNVAVSYTSEHPTAKVDG 198

Query: 189 GNNQIIMEGRCP---------------------------GKRIPPKANANDDP---KGIQ 218
              ++I+ G+ P                           G+R   K     D    KG+ 
Sbjct: 199 TGEELILSGQAPGCVERRTLDFLEKNRLTDRHPELFDSHGRRKTDKQVLYADEVGGKGMF 258

Query: 219 FSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSES 278
           F +   +K+     T   L+D +LKV G    +LL+ A++S++G   +PS    D  ++ 
Sbjct: 259 FQS--RVKVLKGNAT---LQDNQLKVSGEGEIILLVAAATSYNGFDRSPSQDGSDYQAKL 313

Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
            + L     L Y DL  RHL DYQ+LF RV++ L            SE++   +P+  R+
Sbjct: 314 DTILSVSGQLPYEDLKKRHLADYQRLFGRVALTLK-----------SEKDYSGLPTDRRI 362

Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIW 378
             F+ + D +L  LLFQ+GRYLLI+SSR G Q ANLQGIW
Sbjct: 363 IGFRDNPDNALAALLFQYGRYLLIASSREGGQPANLQGIW 402


>gi|225019811|ref|ZP_03709003.1| hypothetical protein CLOSTMETH_03764, partial [Clostridium
           methylpentosum DSM 5476]
 gi|224947447|gb|EEG28656.1| hypothetical protein CLOSTMETH_03764 [Clostridium methylpentosum
           DSM 5476]
          Length = 1411

 Score =  195 bits (496), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 177/644 (27%), Positives = 289/644 (44%), Gaps = 130/644 (20%)

Query: 4   AESTSTTNPLKITFNGPAKHFTD------AIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
           AE  +    LK+ ++ PA   +D      +IP+GNG +G  ++GGV +E +++ E++L  
Sbjct: 38  AEPLAAAKQLKLWYDEPAPS-SDIGWREWSIPMGNGYMGVNLFGGVQTERIQITENSL-- 94

Query: 58  GVPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHL 117
                                       + +  SV    + ++ Y     I+ E  D   
Sbjct: 95  ----------------------------QDSNTSVGGLNNFSETY-----IDFEHSDP-- 119

Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV-- 175
               + Y+REL+L+   A V Y    V + R++F+  PD+V+V ++S SE+G LSF +  
Sbjct: 120 ----QNYQRELNLSEGVASVVYDSDGVRYERQYFTDYPDKVMVIRLSASEAGKLSFTLRP 175

Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAND-------DPKGIQFSAILEIKIS 228
           ++  L D H         +  G   GK    KA  +        +   ++F    + K+ 
Sbjct: 176 TIPYLCDYH---------VEPGDNRGKHGTVKAEGDTITLAGAMEYYNVEFEG--QYKVL 224

Query: 229 DDRGTISALEDKK-----LKVEGSDWAVLLLVASSSFD-GPFINPSDSKKD-------PT 275
              GT++A  D+      + V+ +D AV+L+   ++++    +  ++++ D       P 
Sbjct: 225 PTGGTMTAQNDQNGDNGTISVQNADSAVILIGIGTNYELKSSVFTANNRLDKLKGNAHPH 284

Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
           ++    +Q     SY +L   H +DY+ LF RVS+        + TD             
Sbjct: 285 AKVTKIIQDASAKSYDELLASHQEDYKGLFDRVSVDFGGQMPTVTTD------------- 331

Query: 336 ERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNI 394
           E +K++Q  + DP L EL +QFGRY+LI SSR G    NLQG+WN    P W S    NI
Sbjct: 332 ELLKNYQNGQSDPYLEELFYQFGRYMLICSSRKGALPPNLQGVWNVFNDPPWRSGYWHNI 391

Query: 395 NLEMNYWQSLPCNLSECQEPLFDFL-TYLSI------------NGSKTAQVNYLASGWVI 441
           NL+MNYW +   NL E  E   D+   YL              N S   +VN   +GW +
Sbjct: 392 NLQMNYWPAFTGNLPELFEAYADYQKAYLEKAEQYAVSNIQKNNPSALDKVNTKENGWAL 451

Query: 442 HHKTDIW----AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEG 497
            + T  W    + S++  G          GA+     W++Y+YT D   LE  AYP + G
Sbjct: 452 GNST--WPYNISGSASHSGFGT-------GAFTSIMFWDYYDYTRDASVLEDTAYPAVSG 502

Query: 498 CASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISA 557
            A F L  +++  DGYL  +PS SPE++      K    ++    D  +I E     + A
Sbjct: 503 MAKF-LSKIVQPIDGYLLASPSYSPENQHNGGSYKTVGCAF----DQQMIYENHLDTLKA 557

Query: 558 AEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRL 598
           A+ L    ++E AL   + + LP L P ++   G I E+ + + 
Sbjct: 558 ADALGLTAEDEPALA-TLEQQLPLLDPVQVGASGQIKEYREEKF 600


>gi|302883112|ref|XP_003040458.1| hypothetical protein NECHADRAFT_122680 [Nectria haematococca mpVI
           77-13-4]
 gi|256721342|gb|EEU34745.1| hypothetical protein NECHADRAFT_122680 [Nectria haematococca mpVI
           77-13-4]
          Length = 812

 Score =  195 bits (495), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 159/592 (26%), Positives = 263/592 (44%), Gaps = 50/592 (8%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YT--NPDAPK--ALSDVRSLV 79
             P+GNG L    +G    E +  N D+LW+G P +   YT  NP   K  AL  +R  +
Sbjct: 45  GYPVGNGILAGTHFGDPGHEKIVFNVDSLWSGGPFENSAYTGGNPTTSKSTALPGIREYI 104

Query: 80  DSGQYAEATAASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
               + + T     L G  +    Y++LG++ +    +   Y    Y R LD +T     
Sbjct: 105 ----FDQGTGNVSALLGSGNYYGSYRVLGNLSIIIGHA-TDYTN--YTRSLDPSTGVHTT 157

Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
            Y   +V +T   F SNP    V +++  E    + N+  ++L  + S  N         
Sbjct: 158 TYLADSVNYTTTLFCSNPADACVYRVTSDED-LPNINIQFENLAVSSSLAN------PSC 210

Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE---GSDWAVLLL 254
             P  R        D P+G+++ AI     + D   +S   +  L +    G     +++
Sbjct: 211 NHPYTRFRGVTQLGD-PEGMKYEAIARFVDNRDGDGVSCATNGSLTIARSPGFKTVDVII 269

Query: 255 VASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
            A +++D    N  +       DP      +  S     Y  L   H++DYQ LF   ++
Sbjct: 270 SAGTNYDATKGNAENDYSFRGDDPAEAVQRSTSSGAQQGYDKLLKAHIEDYQSLFGTFTL 329

Query: 311 QLSRSPKDIVTDTC---SEENIDTVPSAE-RVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
            L  + K    +T    S  + + +     R+       DP L  LLF + RYLLI+SSR
Sbjct: 330 TLPDAQKSAGHETAVLISNYSSNGIGDPYIRIYYISKSRDPYLESLLFDYSRYLLIASSR 389

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-N 425
             +  ANLQG W E ++P+W S  H NIN++MNYW +    L +    L++++    +  
Sbjct: 390 ENSLPANLQGKWTEQMNPSWSSDYHANINIQMNYWAADQTGLGKTSVALWNYMRNTWVPR 449

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
           G++TA++ Y A GWV+H++ +I+  +   +G   WA +P+  AW+  H+W++Y Y     
Sbjct: 450 GTETAKLLYDAPGWVVHNEMNIFGHTGM-KGSATWANYPVAAAWMMQHVWDNYEYGRSLT 508

Query: 486 FLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
           +L +  YPLL+  A F +  L E    +DG L  NP  S EH    P     C  Y    
Sbjct: 509 WLRQEGYPLLKEVAQFWISQLQEDEFNNDGTLVVNPCNSAEH---GPT-TFGCTHYQQ-- 562

Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
              +I +V  A +++   + +++     ++   L +L +       G I EW
Sbjct: 563 ---LIHQVLEATLNSITYIGEDDQDFTSELKTVLKKLDKGLHYTSWGGIKEW 611


>gi|325261390|ref|ZP_08128128.1| fibronectin type III domain protein [Clostridium sp. D5]
 gi|324032844|gb|EGB94121.1| fibronectin type III domain protein [Clostridium sp. D5]
          Length = 1783

 Score =  195 bits (495), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 167/614 (27%), Positives = 273/614 (44%), Gaps = 74/614 (12%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD----------YTNPDAPKALSDVR 76
           ++PIGN  +GA V+GGV  E ++LNE +LW+G P D            N      +  ++
Sbjct: 73  SLPIGNSAIGASVFGGVDIERIQLNEKSLWSGGPSDSRPDYNGGNIQQNGQDGATMKQIQ 132

Query: 77  SLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEETYRRELD 129
            L   G  + A+A   KL G   D        Y   G++ L+F D       E Y R+L+
Sbjct: 133 ELFKEGNNSAASALCNKLIGVSDDAGDKGYGYYLSYGNMYLDFQDGASPDNVENYSRDLN 192

Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
           L  A + V Y      + RE+F S PD V+VT+++ +E G+L F+V ++   D+      
Sbjct: 193 LRNAVSSVDYDYKGTHYHREYFVSYPDNVLVTRLT-AEGGTLDFDVRVEP--DDQKGGGS 249

Query: 190 NNQIIME-GRCPGKRIPPKA---NANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
           NN      GR     +       N       ++FS+    K+  D G       +K+ V 
Sbjct: 250 NNPSAESYGRSWDTDVKDGVISINGELTDNQMKFSS--HTKVVADEGGKVKDGTEKVSVS 307

Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDD 300
           G+    +     + +   +    + +   T+E +SA     +       Y  +   H  D
Sbjct: 308 GAKEVTIYTSIGTDYKNEY---PEYRTGQTAEEVSARIKAYVDQAAVKGYEAVKEAHTKD 364

Query: 301 YQKLFHRVSIQLSRSPKDIVTDTC-SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
           +  +F RV + L ++  D  TD+  +  N       ER +         L  +LFQ+GRY
Sbjct: 365 FDSIFGRVDLNLGQTVSDRATDSLLAAYNSGKASEGERRQ---------LEVMLFQYGRY 415

Query: 360 LLISSSR------PGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
           L I SSR      P  +   +NLQGIW    +  W +  H+N+NL+MNYW +   N++EC
Sbjct: 416 LTIESSRETPDDDPSRETLPSNLQGIWVGANNSAWHADYHMNVNLQMNYWPTYSTNMAEC 475

Query: 412 QEPLFDFLTYLSINGSKTAQV------NYLASGWVIHHKTD--IWAKSSADRGKVVWALW 463
            +PL  ++  L   G  TA++          +G++ H + +   W     D     W   
Sbjct: 476 AQPLISYVDSLREPGRVTAKIYAGIGDGKSETGFMAHTQNNPFGWTCPGWD---FSWGWS 532

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
           P    W+  + W++Y++T D ++L    YP++   A      L++   G L ++PS SPE
Sbjct: 533 PAAVPWILQNCWDYYDFTGDTEYLRNVIYPIMREEALLYDQMLVDDGTGKLVSSPSFSPE 592

Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PT 582
           H    P  + A  +Y  T+    I +++   I AAE+L  + +  VE       RL+ P 
Sbjct: 593 H---GP--RTAGNTYEQTL----IWQLYEDTIQAAEILGTDAEQ-VEVWKDKQSRLKGPI 642

Query: 583 KIAEDGSIMEWVQR 596
           +I + G I EW + 
Sbjct: 643 EIGDSGQIKEWYEE 656


>gi|111658272|ref|ZP_01408963.1| hypothetical protein SpneT_02000541 [Streptococcus pneumoniae
           TIGR4]
          Length = 576

 Score =  195 bits (495), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 130/384 (33%), Positives = 194/384 (50%), Gaps = 36/384 (9%)

Query: 215 KGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDP 274
           KG+QF  +   K++D  G +S L  + + +  +    L L + + + G            
Sbjct: 9   KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI---------- 55

Query: 275 TSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
               +S+LQ    ++ Y      H+  YQ+ F+RV  +L  S   +        +I T  
Sbjct: 56  ---DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCL--------SIPTNL 104

Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
             E  K +       L  LLF +GRYLLISSS+P    ANLQGIW ++L+P W S   +N
Sbjct: 105 LLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSKYTIN 160

Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
           IN +MNYW   PC+L E + PLFD L  +   G  TA+  Y A G+  HH TD +  ++ 
Sbjct: 161 INTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFGDTAP 220

Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
               +  A+W +   WLCTH+WEHY Y  D   L +  + +++    F  D+L E  DGY
Sbjct: 221 QSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV-DGY 278

Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEK 571
           L T PS SPE+++   +G       SST+D  I+R    + I  A+ L  N D +  V++
Sbjct: 279 LMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFISRVKE 338

Query: 572 VLKSLPRLRPTKIAEDGSIMEWVQ 595
           + K LP+   TKI  +G I EW++
Sbjct: 339 LKKKLPK---TKIGSNGQIQEWLE 359


>gi|146386783|pdb|2EAE|A Chain A, Crystal Structure Of 1,2-A-L-Fucosidase From
           Bifidobacterium Bifidum In Complexes With Products
          Length = 898

 Score =  194 bits (494), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 166/625 (26%), Positives = 287/625 (45%), Gaps = 82/625 (13%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
           +P GNG++G  VWG V  E +  NE+TLWTG PG  T      N    +  + +R+L   
Sbjct: 51  LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 110

Query: 82  GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
                 T     L G  + A+    L  GDI L++  +     E  YRR+L+L+   A V
Sbjct: 111 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 168

Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
            +    V +TRE+F+SNPD V+V +++ S++G L+FNVS+ +   N +Y        ++G
Sbjct: 169 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 225

Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
                 +  K    ++  G+ +++ +++ + +  GT+S   D   LKV  +    L + A
Sbjct: 226 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 279

Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
           ++ +    P     ++  +  +     +Q   N  Y+ +   H+DD+  ++ RV I L +
Sbjct: 280 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIDDHSAIYDRVKIDLGQ 339

Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
           S         +    D +  A +  S  T +   L  L++++GRYL I SSR  +Q+ +N
Sbjct: 340 SGHSSDGAVAT----DALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 395

Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           LQGIW      N   +  W S  H+N+NL+MNYW +   N+ E  EPL +++  L   G 
Sbjct: 396 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 455

Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
            TA+V   A              G++ H +   +  ++  +    W   P    W+  ++
Sbjct: 456 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 514

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
           +E Y Y+ D   L+ R Y LL+  + F +++++          L T  + SPE   +  D
Sbjct: 515 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 573

Query: 531 GKLACVSYSSTMDMAIIREVFSAIIS--------------AAEVLEKNE-----DALVEK 571
           G     +Y S++   ++ +   A  +              +A+   KN+     DA   +
Sbjct: 574 GN----TYESSLVWQMLNDAIEAAKAKGDPDGLVGNTTDCSADNWAKNDSGNFTDANANR 629

Query: 572 ---VLKSLPRLRPTKIAEDGSIMEW 593
                KSL  L+P ++ + G I EW
Sbjct: 630 SWSCAKSL--LKPIEVGDSGQIKEW 652


>gi|146386777|pdb|2EAB|A Chain A, Crystal Structure Of 1,2-A-L-Fucosidase From
           Bifidobacterium Bifidum (Apo Form)
 gi|146386778|pdb|2EAB|B Chain B, Crystal Structure Of 1,2-A-L-Fucosidase From
           Bifidobacterium Bifidum (Apo Form)
 gi|146386779|pdb|2EAC|A Chain A, Crystal Structure Of 1,2-A-L-Fucosidase From
           Bifidobacterium Bifidum In Complex With
           Deoxyfuconojirimycin
 gi|146386780|pdb|2EAC|B Chain B, Crystal Structure Of 1,2-A-L-Fucosidase From
           Bifidobacterium Bifidum In Complex With
           Deoxyfuconojirimycin
          Length = 899

 Score =  194 bits (493), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 166/625 (26%), Positives = 287/625 (45%), Gaps = 82/625 (13%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
           +P GNG++G  VWG V  E +  NE+TLWTG PG  T      N    +  + +R+L   
Sbjct: 52  LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 111

Query: 82  GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
                 T     L G  + A+    L  GDI L++  +     E  YRR+L+L+   A V
Sbjct: 112 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 169

Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
            +    V +TRE+F+SNPD V+V +++ S++G L+FNVS+ +   N +Y        ++G
Sbjct: 170 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 226

Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
                 +  K    ++  G+ +++ +++ + +  GT+S   D   LKV  +    L + A
Sbjct: 227 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 280

Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
           ++ +    P     ++  +  +     +Q   N  Y+ +   H+DD+  ++ RV I L +
Sbjct: 281 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIDDHSAIYDRVKIDLGQ 340

Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
           S         +    D +  A +  S  T +   L  L++++GRYL I SSR  +Q+ +N
Sbjct: 341 SGHSSDGAVAT----DALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 396

Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           LQGIW      N   +  W S  H+N+NL+MNYW +   N+ E  EPL +++  L   G 
Sbjct: 397 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 456

Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
            TA+V   A              G++ H +   +  ++  +    W   P    W+  ++
Sbjct: 457 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 515

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
           +E Y Y+ D   L+ R Y LL+  + F +++++          L T  + SPE   +  D
Sbjct: 516 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 574

Query: 531 GKLACVSYSSTMDMAIIREVFSAIIS--------------AAEVLEKNE-----DALVEK 571
           G     +Y S++   ++ +   A  +              +A+   KN+     DA   +
Sbjct: 575 GN----TYESSLVWQMLNDAIEAAKAKGDPDGLVGNTTDCSADNWAKNDSGNFTDANANR 630

Query: 572 ---VLKSLPRLRPTKIAEDGSIMEW 593
                KSL  L+P ++ + G I EW
Sbjct: 631 SWSCAKSL--LKPIEVGDSGQIKEW 653


>gi|34451973|gb|AAQ72464.1| alpha-fucosidase [Bifidobacterium bifidum JCM 1254]
          Length = 1959

 Score =  194 bits (493), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 166/625 (26%), Positives = 287/625 (45%), Gaps = 82/625 (13%)

Query: 28   IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
            +P GNG++G  VWG V  E +  NE+TLWTG PG  T      N    +  + +R+L   
Sbjct: 627  LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 686

Query: 82   GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
                  T     L G  + A+    L  GDI L++  +     E  YRR+L+L+   A V
Sbjct: 687  LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744

Query: 138  KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
             +    V +TRE+F+SNPD V+V +++ S++G L+FNVS+ +   N +Y        ++G
Sbjct: 745  TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801

Query: 198  RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
                  +  K    ++  G+ +++ +++ + +  GT+S   D   LKV  +    L + A
Sbjct: 802  DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 855

Query: 257  SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
            ++ +    P     ++  +  +     +Q   N  Y+ +   H+DD+  ++ RV I L +
Sbjct: 856  ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIDDHSAIYDRVKIDLGQ 915

Query: 315  SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
            S       +      D +  A +  S  T +   L  L++++GRYL I SSR  +Q+ +N
Sbjct: 916  SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971

Query: 374  LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
            LQGIW      N   +  W S  H+N+NL+MNYW +   N+ E  EPL +++  L   G 
Sbjct: 972  LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031

Query: 428  KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             TA+V   A              G++ H +   +  ++  +    W   P    W+  ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090

Query: 475  WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
            +E Y Y+ D   L+ R Y LL+  + F +++++          L T  + SPE   +  D
Sbjct: 1091 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149

Query: 531  GKLACVSYSSTMDMAIIREVFSAIIS--------------AAEVLEKNE-----DALVEK 571
            G     +Y S++   ++ +   A  +              +A+   KN+     DA   +
Sbjct: 1150 GN----TYESSLVWQMLNDAIEAAKAKGDPDGLVGNTTDCSADNWAKNDSGNFTDANANR 1205

Query: 572  ---VLKSLPRLRPTKIAEDGSIMEW 593
                 KSL  L+P ++ + G I EW
Sbjct: 1206 SWSCAKSL--LKPIEVGDSGQIKEW 1228


>gi|310286736|ref|YP_003937994.1| cell wall protein containing Ig-like domains (group2 and 3)
            [Bifidobacterium bifidum S17]
 gi|309250672|gb|ADO52420.1| cell wall protein containing Ig-like domains (group2 and 3)
            [Bifidobacterium bifidum S17]
          Length = 1959

 Score =  194 bits (493), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 165/630 (26%), Positives = 286/630 (45%), Gaps = 92/630 (14%)

Query: 28   IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
            +P GNG++G  VWG V  E +  NE+TLWTG PG  T      N    +  + +R+L   
Sbjct: 627  LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 686

Query: 82   GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
                  T     L G  + A+    L  GDI L++  +     E  YRR+L+L+   A V
Sbjct: 687  LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744

Query: 138  KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
             +    V +TRE+F+SNPD V+V +++ S++G L+FNVS+ +   N +Y        ++G
Sbjct: 745  TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801

Query: 198  RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
                  +  K    ++  G+ +++ +++ + +  GT+S   D   LKV  +    L + A
Sbjct: 802  DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 855

Query: 257  SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
            ++ +    P     ++  +  +     +Q+  N  Y+ +   H+DD+  ++ RV I L +
Sbjct: 856  ATDYKQKYPSYRTGETAAEVNTRVAKVVQAAANKGYTAVKKAHIDDHSAIYDRVKINLGQ 915

Query: 315  SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
            S       +      D +  A +  S  T +   L  L++++GRYL I SSR  +Q+ +N
Sbjct: 916  SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971

Query: 374  LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
            LQGIW      N   +  W S  H+N+NL+MNYW +   N+ E  EPL +++  L   G 
Sbjct: 972  LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031

Query: 428  KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             TA+V   A              G++ H +   +  ++  +    W   P    W+  ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090

Query: 475  WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
            +E Y Y+ D   L  R Y LL+  + F +++++          L T  + SPE   +  D
Sbjct: 1091 YEAYEYSGDPALLN-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149

Query: 531  GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
            G        +T + +++ ++ +  I AA+  + + D LV                     
Sbjct: 1150 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGNTTDCSADNWAKGDNGNFTD 1200

Query: 574  ----------KSLPRLRPTKIAEDGSIMEW 593
                      KSL  L+P ++ + G I EW
Sbjct: 1201 ANANRSWSCAKSL--LKPIEVGDSGQIKEW 1228


>gi|393222468|gb|EJD07952.1| glycoside hydrolase family 95 protein [Fomitiporia mediterranea
           MF3/22]
          Length = 835

 Score =  192 bits (489), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 170/612 (27%), Positives = 281/612 (45%), Gaps = 68/612 (11%)

Query: 17  FNGPAKHFTDA-IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------GDYTNPDA 68
           ++ P + +T   +P+GNG L AM  GG   E+ +LN ++LW+G P       G    PD 
Sbjct: 36  YDAPGQIWTQHYLPLGNGFLAAMTPGGTLQESTQLNIESLWSGGPFADPAYNGGNKQPDE 95

Query: 69  PKALSDVRSLVDSGQYAEATAAS--VKLFGHPADVYQLL---GDIELEFDDSHLKYAEET 123
             A++     +    +  +T  +  V +   P D Y      G +     +S L    + 
Sbjct: 96  QAAMAQAMQSIRQSIFNSSTGITDNVDVLMTPIDAYGSYSGAGFLVSTLQNSSLSNISD- 154

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS---L 180
           + R LDL++   +  ++  N +F+RE F S+P Q  V   S + S   +   +L +   L
Sbjct: 155 FGRFLDLDSGLTKTIWNEDNAQFSRETFCSHPTQACVQNTSTAASSGFTQTYALAAASGL 214

Query: 181 LDNHSYVNGNNQIIMEGRC--PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
              +     N  + + G    PG      A     P G      L+  +  +  T   + 
Sbjct: 215 PAPNVTCTDNATLRLNGLVAEPGMAYELLATVAVPPGGT-----LKCTVVPNMDTTDNVV 269

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDS-------KKDPTSESMSALQSIRNLSYS 291
           +  + V     A ++ V  +++D   IN  D+         DP  + +  L S    SYS
Sbjct: 270 NATITVSNVTSASVVWVGGTNYD---INAGDAVHNFSFRGPDPHDDLVPLLSSASKKSYS 326

Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE 351
           +L + H+ DY+   H  S+ L +           + ++DT  + + + ++  D+    VE
Sbjct: 327 ELLSDHVADYEATLHAFSLDLGQ-----------KADLDT-STDKLINAYTVDKGDVYVE 374

Query: 352 -LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
            LLF +GR+LL SSSR G   ANLQG W  D  P W +  H++IN+EMNYW +   NL +
Sbjct: 375 WLLFNYGRHLLASSSR-GILPANLQGKWAVDAFPAWGADYHLDINVEMNYWLAEMTNL-D 432

Query: 411 CQEPLFDFL--TYLSINGSKTAQVNY-LASGWVIHHKT--DIWAKSSADRGKVVWALWPM 465
             +PLF+++  TY +  G+ TAQV Y +  GWV+H +    I+  +    G+  W  +P 
Sbjct: 433 VSKPLFNYIAKTY-APRGAYTAQVLYNITQGWVVHTEVMFKIFGYTGMKVGEAEWYDYPE 491

Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSP 522
             AWL  ++W+H++YT D  + + + YPLL+G A F L+ LI      DG L   P  SP
Sbjct: 492 PNAWLMLNVWDHFDYTNDVAWWKAQGYPLLKGVALFHLEKLIPDEHFLDGTLVVAPCNSP 551

Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RP 581
           E   I     LAC          +I ++ +AI   A    + +++ +  V   + ++ + 
Sbjct: 552 EQAPI----TLACAH-----SQQLIWQLLNAIEKGAAAAGETDESFLNDVRAKIAQMDKG 602

Query: 582 TKIAEDGSIMEW 593
             I   G + EW
Sbjct: 603 IHIGSWGQLQEW 614


>gi|146386781|pdb|2EAD|A Chain A, Crystal Structure Of 1,2-A-L-Fucosidase From
           Bifidobacterium Bifidum In Complex With Substrate
 gi|146386782|pdb|2EAD|B Chain B, Crystal Structure Of 1,2-A-L-Fucosidase From
           Bifidobacterium Bifidum In Complex With Substrate
          Length = 899

 Score =  192 bits (487), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 165/625 (26%), Positives = 286/625 (45%), Gaps = 82/625 (13%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
           +P GNG++G  VWG V  E +  NE+TLWTG PG  T      N    +  + +R+L   
Sbjct: 52  LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 111

Query: 82  GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
                 T     L G  + A+    L  GDI L++  +     E  YRR+L+L+   A V
Sbjct: 112 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 169

Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
            +    V +TRE+F+SNPD V+V +++ S++G L+FNVS+ +   N +Y        ++G
Sbjct: 170 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 226

Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
                 +  K    ++  G+ +++ +++ + +  GT+S   D   LKV  +    L + A
Sbjct: 227 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 280

Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
           ++ +    P     ++  +  +     +Q   N  Y+ +   H+DD+  ++ RV I L +
Sbjct: 281 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIDDHSAIYDRVKIDLGQ 340

Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
           S         +    D +  A +  S  T +   L  L++++GRYL I SSR  +Q+ +N
Sbjct: 341 SGHSSDGAVAT----DALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 396

Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
           LQGIW      N   +  W S  H+N+NL+MNYW +   N+ E  EPL +++  L   G 
Sbjct: 397 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 456

Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
            TA+V   A              G++ H +   +  ++  +    W   P    W+  ++
Sbjct: 457 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 515

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
           +E Y Y+ D   L+ R Y LL+  + F +++++          L T  + SP    +  D
Sbjct: 516 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPAQGPLGTD 574

Query: 531 GKLACVSYSSTMDMAIIREVFSAIIS--------------AAEVLEKNE-----DALVEK 571
           G     +Y S++   ++ +   A  +              +A+   KN+     DA   +
Sbjct: 575 GN----TYESSLVWQMLNDAIEAAKAKGDPDGLVGNTTDCSADNWAKNDSGNFTDANANR 630

Query: 572 ---VLKSLPRLRPTKIAEDGSIMEW 593
                KSL  L+P ++ + G I EW
Sbjct: 631 SWSCAKSL--LKPIEVGDSGQIKEW 653


>gi|383115618|ref|ZP_09936374.1| hypothetical protein BSGG_2513 [Bacteroides sp. D2]
 gi|313694978|gb|EFS31813.1| hypothetical protein BSGG_2513 [Bacteroides sp. D2]
          Length = 793

 Score =  191 bits (485), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 166/604 (27%), Positives = 259/604 (42%), Gaps = 108/604 (17%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           ++PIGNG +GA ++G   +E ++L E T   GV G Y                       
Sbjct: 58  SLPIGNGYMGACIFGRTDTERIQLTEKTF--GVKGPYKKGG------------------- 96

Query: 87  ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
                    G+ A++Y     IE    D  L      Y+R L LN A +RV Y    V +
Sbjct: 97  --------IGNFAEIY-----IEGIHHDQPL-----NYKRSLRLNDAISRVNYQYEGVNY 138

Query: 147 TREHFSSNPDQVIVTKISGSESGSLSFNVS-----LDSLLDNHSYVNG-----NNQIIME 196
           TRE+F++ P  VIV K+   + G +SF +      L    D  +   G     N+ I + 
Sbjct: 139 TREYFANYPSNVIVVKLKADQPGKISFTLRPVLPYLHEYNDEGTGRTGKVSAQNDLITLT 198

Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
           G     R+P +A     P G Q  A+     +D+ G      +  ++++ +D  VLL+ A
Sbjct: 199 GDIQFFRLPYEAQIKVIPSGGQLKAM-----NDELGN-----NGTIRIQQADSVVLLINA 248

Query: 257 -------SSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
                  SS F     N     + P       +Q   +  Y  L   H+ DYQ LF RV 
Sbjct: 249 QTAYQLKSSVFTASPENKFTGNEHPHRAVSQCIQKAADKGYEALCKEHIADYQSLFSRVD 308

Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
           + L      I TD+   +        +R K     E   + ELLFQ+GRYLLI+SSR G+
Sbjct: 309 LHLCNETPGIPTDSLLHD-------YQRGK-----ESLYMDELLFQYGRYLLIASSRKGS 356

Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
              +LQG W++     W      NIN++MNYW +   NL+E       F+ Y+  N +  
Sbjct: 357 LPPHLQGAWSQYEYAPWSGGYWHNINIQMNYWAAFNTNLAEV------FIPYVEYNEAFR 410

Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW---------------LCTHL 474
              N  A+G++  +  D  +    + G   W +     A+                 T L
Sbjct: 411 QSANEKATGYIKKNNPDALSAIPEENG---WTIGTGANAFSIDSPGGHSGPGTGGFTTKL 467

Query: 475 -WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
            W++Y++T D D L+K +YP + G A FL   L    + YL  +PS+SPE        + 
Sbjct: 468 FWDYYDFTRDEDILKKHSYPAMLGMAKFLSKTLKPTEEEYLLADPSSSPEQYHNGTTYQT 527

Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
              ++    D  +I E F  ++ AA++L K E   +  + + + +L   +I E G I E+
Sbjct: 528 KGCAF----DQGMIWESFHDVLKAADIL-KEESPFLRTIKEQIGKLDAIQIGESGQIKEY 582

Query: 594 VQRR 597
            + +
Sbjct: 583 REEK 586


>gi|149199701|ref|ZP_01876733.1| hypothetical protein LNTAR_25135 [Lentisphaera araneosa HTCC2155]
 gi|149137218|gb|EDM25639.1| hypothetical protein LNTAR_25135 [Lentisphaera araneosa HTCC2155]
          Length = 1754

 Score =  191 bits (485), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 166/603 (27%), Positives = 270/603 (44%), Gaps = 117/603 (19%)

Query: 29  PIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEAT 88
           PIGNG  GA ++G   +E +++ + TL                        + G+Y +  
Sbjct: 63  PIGNGYTGANIFGRTDTERIQITDKTL-----------------------HNRGKYNKGG 99

Query: 89  AASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTR 148
             S                 E++ D  H K+++  YRR L+LN   A V Y+   V +TR
Sbjct: 100 LTSF---------------AEIKLDFRHHKFSK--YRRSLNLNEGIAHVAYNYRGVNYTR 142

Query: 149 EHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKA 208
           E+F+S PD VIV +++  +  +LSF +  +         +G+                  
Sbjct: 143 EYFASYPDNVIVIRLTADKKAALSFEIRPEIPYLERKERSGS-----------------I 185

Query: 209 NANDDPKGIQFSAIL-------EIKISDDRGTISA-LEDKKLKVEGSDWAVLLLVASSSF 260
           +A DD   ++ S  L       +IK+ ++ GT+ A  +   ++V  +D   +L+   +++
Sbjct: 186 SAKDDLLTLKGSIALFSCNFDGQIKVLNEGGTLKANAKQGSIEVSKADAVTILIATGTNY 245

Query: 261 ---DGPFINPSDSKKDPT----SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
              +  F N S  K +P     +E  + +Q+ +N  Y  L  RHL DYQ LF RV++ L+
Sbjct: 246 RLHEDTFRNTSAKKLNPKEFPHNEVSARIQAAQNRGYEQLKERHLKDYQNLFGRVAVNLN 305

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
             P +  T              E+ K+ +T+    L EL+FQ+GRYLLISSSR  +  AN
Sbjct: 306 SRPSNDPTHIL----------LEKYKAGKTNN--WLEELMFQYGRYLLISSSREKSLPAN 353

Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL-TYLSINGSKTAQV 432
           LQG W++D    W      NIN++MNYW S+  NL+EC +   +F   YL I  ++    
Sbjct: 354 LQGAWSQDYYTPWSGGFWHNINVQMNYWGSMSTNLAECFQSYTNFYKAYLPI--AREHAT 411

Query: 433 NYLA------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
           +Y+             +GW+I    + +   SA             G +    L ++Y +
Sbjct: 412 DYVQKYNPSQVTKGGDNGWIIGTGANAYYIPSAGGHSGP-----GTGGFTAKLLMDYYLF 466

Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD------GKLA 534
           T D+ +LE+ AYP +   + F    LI  H   L   PS SPE +   P+      GKL 
Sbjct: 467 TQDKQYLEEVAYPAMLSLSKFYSKVLIP-HGDKLLVEPSASPE-QLAKPEQVKNMPGKLK 524

Query: 535 CVSY----SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
              Y      T D   + E F+  ++ A+ L  +ED  ++ + + + +L P  I  DG I
Sbjct: 525 GGKYYVTAGCTFDQGFVWESFADTLTLADAL-GSEDPFLDTIREQITKLDPILIGADGQI 583

Query: 591 MEW 593
            E+
Sbjct: 584 KEY 586


>gi|311063634|ref|YP_003970359.1| 1,2-A-L-fucosidase [Bifidobacterium bifidum PRL2010]
 gi|310865953|gb|ADP35322.1| 1,2-A-L-Fucosidase [Bifidobacterium bifidum PRL2010]
          Length = 1959

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 164/630 (26%), Positives = 285/630 (45%), Gaps = 92/630 (14%)

Query: 28   IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
            +P GNG++G  VWG V  E +  NE+TLWTG PG  T      N    +  + +R+L   
Sbjct: 627  LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTRYNGGNNETKGQNGATLRALNKQ 686

Query: 82   GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
                  T     L G  + A+    L  GDI L++  +     E  YRR+L+L+   A V
Sbjct: 687  LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744

Query: 138  KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
             +    V +TRE+F+SNPD V+V +++ S++G L+FNVS+ +   N +Y        ++G
Sbjct: 745  TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801

Query: 198  RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
                  +  K    ++  G+ +++ +++ + +  GT+S   D   LKV  +    L + A
Sbjct: 802  DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGADGASLKVSDAKAVTLYIAA 855

Query: 257  SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
            ++ +    P     ++  +  +     +Q   N  Y+ +   H+ D+  ++ RV I L +
Sbjct: 856  ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 915

Query: 315  SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
            S       +      D +  A +  S  T +   L  L++++GRYL I SSR  +Q+ +N
Sbjct: 916  SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971

Query: 374  LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
            LQGIW      N   +  W S  H+N+NL+MNYW +   N+ E  EPL +++  L   G 
Sbjct: 972  LQGIWSVTADDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031

Query: 428  KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             TA+V   A              G++ H +   +  ++  +    W   P    W+  ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGKGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090

Query: 475  WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
            +E Y Y+ D   L+ R Y LL+  + F +++++          L T  + SPE   +  D
Sbjct: 1091 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149

Query: 531  GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
            G        +T + +++ ++ +  I AA+  + + D LV                     
Sbjct: 1150 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGDTTDCSANNWAKGDNGNFTD 1200

Query: 574  ----------KSLPRLRPTKIAEDGSIMEW 593
                      KSL  L+P ++ + G I EW
Sbjct: 1201 ANANRSWSCAKSL--LKPIEVGDSGQIKEW 1228


>gi|115384756|ref|XP_001208925.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114196617|gb|EAU38317.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 1276

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 163/595 (27%), Positives = 263/595 (44%), Gaps = 96/595 (16%)

Query: 24   FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
             T A P+GNGRLG   + G                  G+  N  A +AL  +R  +    
Sbjct: 556  ITTAFPLGNGRLGEKAYAG------------------GNPNNCRA-EALPGIRDFI---- 592

Query: 84   YAEATAASVKLFGH-PA-DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV 141
            +   T     L G  P+   YQ+LG++ ++  +         YRR LD+ +      ++V
Sbjct: 593  FQNGTGNVSALLGEFPSYGSYQVLGNLTIDLGELE---NVRGYRRRLDMKSGVYTDGFAV 649

Query: 142  GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG 201
            GN  + R  F S PDQV V  IS + +   S  + L+            NQ++     P 
Sbjct: 650  GNALYNRTAFCSYPDQVCVYHISSANASLPSVEIGLE------------NQVV----SPA 693

Query: 202  KRIPPKANA-----NDDPK-GIQFSA----ILEIKISDD--RGTISALEDKKLKVEGSDW 249
              +   AN+        P  G+ ++A    ++  K S D   GT+  +   + +V     
Sbjct: 694  PNVTCHANSISLYGQTFPTIGMIYNARATVVVPGKSSGDFCAGTVVRVPSGQKEV----- 748

Query: 250  AVLLLVASSSFDGPFINP----SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
              ++L A +++D    N     S    DP  + +         SY+ L + H+ D++ + 
Sbjct: 749  -YIVLAADTNYDASKGNAAAKFSFRGSDPYEKVLQTASKAAKKSYAQLKSSHVKDFRAIS 807

Query: 306  HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
               ++ L         D+  +      P+ E + ++    DP +  LLF +GRYL +SSS
Sbjct: 808  DGFTLTLPDR-----RDSAGK------PTTELIAAYTQPGDPFIEGLLFDYGRYLFMSSS 856

Query: 366  RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL--TYLS 423
            R G+   NLQG+W E  SP W +  H NINL+MN+W      L E  EPL+ ++  T+L 
Sbjct: 857  RAGSLPPNLQGLWTEQASPAWSADYHANINLQMNHWAVEQVGLGELTEPLWKYMADTWLP 916

Query: 424  INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
              G +TA++ Y   GWV H + +++   +A +    WA +P   AW+  H+W+H++YT D
Sbjct: 917  -RGQETARLLYGGEGWVTHDEMNVFGH-TAMKNDAQWANYPAVNAWMSQHVWDHFDYTQD 974

Query: 484  RDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
              + +   YP+L+G A F L  L++    +DG    NP  SPEH    P     C +Y  
Sbjct: 975  AAWYQSMGYPILKGAAQFWLSQLVQDEHFNDGTWVVNPCNSPEH---GPT-TFGCTNYQQ 1030

Query: 541  TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRL-RPTKIAEDGSIMEW 593
                 +I E+F  ++        ++D L  + + S    L     I   G I EW
Sbjct: 1031 -----LIWELFDHVLRGWTA-SGDKDRLFRRAIASKFAALDNGIHIGSWGQIQEW 1079


>gi|149197418|ref|ZP_01874469.1| hypothetical protein LNTAR_00515 [Lentisphaera araneosa HTCC2155]
 gi|149139436|gb|EDM27838.1| hypothetical protein LNTAR_00515 [Lentisphaera araneosa HTCC2155]
          Length = 980

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 169/598 (28%), Positives = 272/598 (45%), Gaps = 65/598 (10%)

Query: 23  HFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSG 82
           H+ DA PIG+GRLG MV+G V      L +   W       T PD    L  VR L   G
Sbjct: 197 HWRDAYPIGSGRLGGMVYGDVNEARFMLQDARHWFNHASSSTMPDLSGLLQQVRDLQKQG 256

Query: 83  QYAEATAASVKLF---GHPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
           +YA+A       F    + A++   L  GD+ +  +  ++      Y+R LDL  +    
Sbjct: 257 KYADANVLYRNAFKGKNYRANIGSPLSIGDLVIRSNAKNI----SQYQRTLDLKKSETHT 312

Query: 138 KYSVGNVEFTREHFSS---NPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN--- 191
            +S   V++TR+ F S   +   V+  +++  ++ +L  +V L   L N     G+    
Sbjct: 313 AWSNEGVDYTRKAFISRIGDSKDVLFVQLNAKQAKALDISVHLG--LHNPDKARGSRPKA 370

Query: 192 -----QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
                 +   G    K + P    N       F A+  + IS D G +   E   +K++G
Sbjct: 371 FRPSVNVDFAGHIQYKALNP----NTTSALKDFGAVARV-ISHD-GELKE-EIDHVKIKG 423

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
              A  +L+A  +F+      +D   D  +  +  L+     +Y      H+  +Q LF+
Sbjct: 424 ---ASQILIAVKTFNSA---DADEAIDRITRELYKLKG----TYQTYLNPHVKAHQGLFN 473

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
             S+ L  S  D      +EE I         K+ + D + + VE L+  GRYL I  SR
Sbjct: 474 AASVDLKASKDD--RALSNEELI--------AKARKLDLENAFVERLWAMGRYLSIVGSR 523

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHV-NINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
            G    +L G+WN D +PTW  A H+ NIN+ M +W  +  NLSE   P FD        
Sbjct: 524 KGGHPVHLTGLWNGDYNPTW--AIHLMNINMPMIHWHLMDGNLSELMLPFFDMFDRQLPA 581

Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV--WALWPMGG-AWLCTHLWEHYNYTM 482
             + A+  Y   G  I+    +   +     KV+    +  MG  AW+  H W++Y +T+
Sbjct: 582 SRENARKLYGLDGIYIN---PLLGNNEDGLLKVISPHLIHMMGNNAWVAQHYWDYYTFTL 638

Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC------V 536
           D+ FL +RA PL+E  A+F   +LIE  DG+ +  PS SPE+  +  +G           
Sbjct: 639 DKKFLAERAVPLMEEAATFYEGFLIENEDGFYDITPSNSPENSPLNAEGHRLIPNRHIDT 698

Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
             ++T + A IRE+F+ +I A+  L  N+  + +   + + +LRP +I  +G + EW+
Sbjct: 699 HINATWEYAAIREMFTNLIEASNTLAINQSKIAD-WKEVIAKLRPYEINAEGGVREWL 755


>gi|421310055|ref|ZP_15760680.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA62681]
 gi|395909670|gb|EJH20545.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
           GA62681]
          Length = 709

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 147/513 (28%), Positives = 236/513 (46%), Gaps = 59/513 (11%)

Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
            Y   GDI +EF       ++ T Y+R+L+++ A A   Y      F RE F+S PD ++
Sbjct: 27  TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86

Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
           V   +     +L F + L    D  S      +      C        I  K    D+  
Sbjct: 87  VQCFTKEGLETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145

Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
            ++F++ L  +     G I    D+ +++ G+ +A L L A + F     +    K D  
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200

Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
            + +  + + +   Y+ L +RH++DYQ LF RV + L             E ++D   + 
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EADVDASTTD 247

Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
           + +K+++  E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN D         H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNSDY--------HLN 299

Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
           +NL+MNYW +   NL E   P+ +++  L + G + A V Y          +GW++H + 
Sbjct: 300 VNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 358

Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
               W     D     W   P   AW+   ++E Y++  D+D+L ++ YP+L     F  
Sbjct: 359 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 415

Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
            +L +        ++PS SPEH           +S  +T D ++I ++F   I AA+ L 
Sbjct: 416 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELG 466

Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
            +ED L E   KS   L P +I + G I EW +
Sbjct: 467 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 498


>gi|313139434|ref|ZP_07801627.1| alpha-fucosidase [Bifidobacterium bifidum NCIMB 41171]
 gi|313131944|gb|EFR49561.1| alpha-fucosidase [Bifidobacterium bifidum NCIMB 41171]
          Length = 1959

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 164/630 (26%), Positives = 284/630 (45%), Gaps = 92/630 (14%)

Query: 28   IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
            +P GNG++G  VWG V  E +  NE+TLWTG PG  T      N    +  + +R+L   
Sbjct: 627  LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 686

Query: 82   GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
                  T     L G  + A+    L  GDI L++  +     E  YRR+L+L+   A V
Sbjct: 687  LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744

Query: 138  KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
             +    V +TRE+F+SNPD V+V +++ S++G L+FNVS+ +   N +Y        ++G
Sbjct: 745  TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801

Query: 198  RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
                  +  K    ++  G+ +++ +++ + +  GT+S   D   LKV  +    L + A
Sbjct: 802  DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 855

Query: 257  SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
            ++ +    P     ++  +  +     +Q   N  Y+ +   H+ D+  ++ RV I L +
Sbjct: 856  ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 915

Query: 315  SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
            S       +      D +  A +  S  T +   L  L++++GRYL I SSR  +Q+ +N
Sbjct: 916  SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971

Query: 374  LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
            LQGIW      N   +  W S  H+N+NL+MNYW +   N+ E  EPL +++  L   G 
Sbjct: 972  LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031

Query: 428  KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             TA+V   A              G++ H +   +  ++  +    W   P    W+  ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090

Query: 475  WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
            +E Y Y+ D   L+ R Y LL+  + F +++++          L T  + SPE   +  D
Sbjct: 1091 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149

Query: 531  GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
            G        +T + +++ ++ +  I AA+  + + D LV                     
Sbjct: 1150 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGDTTDCSTDNWAKGDNGNFAD 1200

Query: 574  ----------KSLPRLRPTKIAEDGSIMEW 593
                      KSL  L+P ++   G I EW
Sbjct: 1201 ANANRSWSCAKSL--LKPIEVGNSGQIKEW 1228


>gi|238482581|ref|XP_002372529.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus flavus
           NRRL3357]
 gi|220700579|gb|EED56917.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus flavus
           NRRL3357]
          Length = 785

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 162/614 (26%), Positives = 268/614 (43%), Gaps = 59/614 (9%)

Query: 1   MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
           ++   S +T N L  +         +A  +GNG+LG M +G   +E L  N D LW G P
Sbjct: 6   LLGMSSFATANSLWSSKAASWDTTNEAYTLGNGKLGVMPFGEPGAEKLNYNHDELWEGGP 65

Query: 61  -------GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELE 111
                  G   N    + LS+VR  +    + + T    +L G       +  L ++ + 
Sbjct: 66  FEVDGYRGGNPNSSMTEILSEVRDEI----WKKGTGNDSRLHGDTDGYGSFHSLANLTIA 121

Query: 112 FDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
            D  H K ++  Y R LDL T      YS G  ++T + + S P QV + K++ + + S 
Sbjct: 122 IDGIH-KVSD--YTRSLDLGTGIHTTTYSTGKGKYTTDVYCSYPAQVCIYKLNSTAALS- 177

Query: 172 SFNVSLDSLLDNHSYVNGN-NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD 230
              +  D L++  S  N   +      R   +  PP+        G+ +  I    I   
Sbjct: 178 KVTIYFDQLVEESSLWNATCDSDFARLRGVTQEGPPR--------GMTYDTIARSSIPGR 229

Query: 231 RGTISALEDKKLKVEGSDWAVLLLV--ASSSFDG----PFINPSDSKKDPTSESMSALQS 284
             + +     KL +   + + L +V  A + FDG       + +   +DP         S
Sbjct: 230 CDSSTG----KLAINARNSSSLTIVIGAGTDFDGTKGTAATDYTFKGEDPAEYVEKITSS 285

Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD 344
             + S S L T H++DY  L    ++ L         DT      +         + +TD
Sbjct: 286 ALSQSESKLRTEHIEDYSGLMSAFTLDLP--------DTQDSTGTELSTLITNYNANKTD 337

Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
            DP L +LLF +GR+L ISSSR  +   NLQG+W+   +  W    H NINL+MN W + 
Sbjct: 338 GDPYLEKLLFDYGRHLFISSSRANSLPPNLQGVWSPTKNAAWSGDYHANINLQMNLWGAE 397

Query: 405 PCNLSECQEPLFDFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
              + E    +F+++    +  G++TA++ Y  +GWV H + +I+  +     +   A +
Sbjct: 398 ATGIGELTVAVFNYMEQNWMPRGAETAELLYGGAGWVTHDEMNIFGHTGMKTYQTS-ANY 456

Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPST 520
           P   AW+  H+W+ Y+Y+ ++ +  K+ +PLL+G A F    L      +D  L  NP T
Sbjct: 457 PAAPAWMMQHVWDRYDYSHNKTWFIKQGWPLLKGVAEFWASQLQVDKFNNDSSLVVNPCT 516

Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL- 579
           SPE             ++  T    +I +V+   I  AE+  + +  L++ +   LPRL 
Sbjct: 517 SPEQ---------GPTTFGCTHWQQLIHQVYENAIQGAEIAGETDSTLLKDIKDQLPRLD 567

Query: 580 RPTKIAEDGSIMEW 593
           +   I   G I EW
Sbjct: 568 KGLHIGTWGQIKEW 581


>gi|421735948|ref|ZP_16174814.1| alpha-L-fucosidase, partial [Bifidobacterium bifidum IPLA 20015]
 gi|407296769|gb|EKF16285.1| alpha-L-fucosidase, partial [Bifidobacterium bifidum IPLA 20015]
          Length = 1935

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 164/630 (26%), Positives = 284/630 (45%), Gaps = 92/630 (14%)

Query: 28   IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
            +P GNG++G  VWG V  E +  NE+TLWTG PG  T      N    +  + +R+L   
Sbjct: 622  LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 681

Query: 82   GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
                  T     L G  + A+    L  GDI L++  +     E  YRR+L+L+   A V
Sbjct: 682  LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 739

Query: 138  KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
             +    V +TRE+F+SNPD V+V +++ S++G L+FNVS+ +   N +Y        ++G
Sbjct: 740  TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 796

Query: 198  RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
                  +  K    ++  G+ +++ +++ + +  GT+S   D   LKV  +    L + A
Sbjct: 797  DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGADGASLKVSDAKAVTLYIAA 850

Query: 257  SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
            ++ +    P     ++  +  +     +Q   N  Y+ +   H+ D+  ++ RV I L +
Sbjct: 851  ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 910

Query: 315  SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
            S       +      D +  A +  S  T +   L  L++++GRYL I SSR  +Q+ +N
Sbjct: 911  SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 966

Query: 374  LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
            LQGIW      N   +  W S  H+N+NL+MNYW +   N+ E  EPL +++  L   G 
Sbjct: 967  LQGIWSVTADDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1026

Query: 428  KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             TA+V   A              G++ H +   +  ++  +    W   P    W+  ++
Sbjct: 1027 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1085

Query: 475  WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
            +E Y Y+ D   L  R Y LL+  + F +++++          L T  + SPE   +  D
Sbjct: 1086 YEAYEYSGDPALLN-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1144

Query: 531  GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
            G        +T + +++ ++ +  I AA+  + + D LV                     
Sbjct: 1145 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGNTTDCSADNWAKGDNGNFTD 1195

Query: 574  ----------KSLPRLRPTKIAEDGSIMEW 593
                      KSL  L+P ++ + G I EW
Sbjct: 1196 ANANRSWSCAKSL--LKPIEVGDSGQIKEW 1223


>gi|390936092|ref|YP_006393651.1| alpha-L-fucosidase [Bifidobacterium bifidum BGN4]
 gi|389889705|gb|AFL03772.1| alpha-L-fucosidase [Bifidobacterium bifidum BGN4]
          Length = 1959

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 164/630 (26%), Positives = 284/630 (45%), Gaps = 92/630 (14%)

Query: 28   IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
            +P GNG++G  VWG V  E +  NE+TLWTG PG  T      N    +  + +R+L   
Sbjct: 627  LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 686

Query: 82   GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
                  T     L G  + A+    L  GDI L++  +     E  YRR+L+L+   A V
Sbjct: 687  LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744

Query: 138  KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
             +    V +TRE+F+SNPD V+V +++ S++G L+FNVS+ +   N +Y        ++G
Sbjct: 745  TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801

Query: 198  RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
                  +  K    ++  G+ +++ +++ + +  GT+S   D   LKV  +    L + A
Sbjct: 802  DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 855

Query: 257  SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
            ++ +    P     ++  +  +     +Q   N  Y+ +   H+ D+  ++ RV I L +
Sbjct: 856  ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 915

Query: 315  SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
            S       +      D +  A +  S  T +   L  L++++GRYL I SSR  +Q+ +N
Sbjct: 916  SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971

Query: 374  LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
            LQGIW      N   +  W S  H+N+NL+MNYW +   N+ E  EPL +++  L   G 
Sbjct: 972  LQGIWSVTADDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031

Query: 428  KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             TA+V   A              G++ H +   +  ++  +    W   P    W+  ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090

Query: 475  WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
            +E Y Y+ D   L  R Y LL+  + F +++++          L T  + SPE   +  D
Sbjct: 1091 YEAYEYSGDPALLN-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149

Query: 531  GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
            G        +T + +++ ++ +  I AA+  + + D LV                     
Sbjct: 1150 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGDTTDCSADNWAKGDNGNFTD 1200

Query: 574  ----------KSLPRLRPTKIAEDGSIMEW 593
                      KSL  L+P ++ + G I EW
Sbjct: 1201 ANANRSWSCAKSL--LKPIEVGDSGQIKEW 1228


>gi|154305361|ref|XP_001553083.1| hypothetical protein BC1G_08975 [Botryotinia fuckeliana B05.10]
          Length = 792

 Score =  189 bits (480), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 167/600 (27%), Positives = 271/600 (45%), Gaps = 78/600 (13%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------GDYTNPDAPKALSDVRSLV 79
           A PIGNG+L A+ +G   SE L LN+D+LW G P       G   N     +LS +R  +
Sbjct: 39  AYPIGNGQLAALPFGTPGSEKLNLNKDSLWNGGPFGDASYIGGNPNSSVSSSLSGIRDFI 98

Query: 80  DSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
               +   T     L G   +   YQ+L ++ +      +  A E Y+R LDLNT     
Sbjct: 99  ----FQNGTGNVTALMGSDDNYGSYQVLANLSVSLQG--ISGATE-YKRSLDLNTGIHTT 151

Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGS---LSF-NVSLDSLLDNHSYVNGNNQI 193
            +   N  +T   F S PD V V +++ + + S   + F NV  DS L   S    +   
Sbjct: 152 TFKTSNSSYTTAVFCSYPDSVCVYQVNSTTTLSKIDVHFDNVLTDSSLIKSSCSKSSKSA 211

Query: 194 IMEGRCP---GKRIPPKANANDDPKGIQFS---AILEIKISDDRGTISALEDKKLKVEGS 247
           +  G      G     +A   +  K +  S    IL I  S D+ ++S            
Sbjct: 212 LFSGITQADIGMIYKAEARVLESTKSVSCSNTTGILSITPSHDQKSLS------------ 259

Query: 248 DWAVLLLVASSSFDGPFINPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
               L++ A +++D      +D+      DPT+   S +    + +   L  +H+ D+  
Sbjct: 260 ----LVISAGTNYDATKGTAADNYSFKGVDPTAYVSSTIAKAASKTVKTLRNKHVSDFSA 315

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE----DPSLVELLFQFGRY 359
           L +  ++ L         D     N +T   A  + ++ T +    DP +  LLF + RY
Sbjct: 316 LMNSFTLSLP--------DPLGSANKET---AAVIAAYNTTDNTHTDPWVENLLFDYSRY 364

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           L ISSSR  +   NLQG W   L   W +  H NIN++MN+W ++   L + Q  L+ ++
Sbjct: 365 LFISSSRDNSLPPNLQGKWAYGLYNAWGADYHANINIQMNHWGAVQTGLGDLQSALWTYM 424

Query: 420 T-YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +   +  G++TA++ Y A GWV+H + +I+  +    G   WA +P   +WL  H+ ++Y
Sbjct: 425 SETWAPRGAETAKLLYNAPGWVVHDEMNIFGHTGMKTGDEYWADYPAAASWLMQHVADYY 484

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLAC 535
           +Y+ D+++L +  YPLL+  + F L  L +    +DG L  NP +SPEH    P     C
Sbjct: 485 DYSRDKNWLRETGYPLLKAVSEFWLSQLQKDEYFNDGTLVVNPCSSPEH---GPT-TFGC 540

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS--LPRLRPTKIAEDGSIMEW 593
             Y       +I  +F+  + AA  L    D+ ++K L +  L   +   I+    I EW
Sbjct: 541 THYQQ-----LIHSLFTTTLQAARTLSL--DSTLQKSLTTSLLSLDKGLHISPTTQIQEW 593


>gi|347826700|emb|CCD42397.1| glycoside hydrolase family 95 protein [Botryotinia fuckeliana]
          Length = 792

 Score =  189 bits (480), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 167/600 (27%), Positives = 271/600 (45%), Gaps = 78/600 (13%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------GDYTNPDAPKALSDVRSLV 79
           A PIGNG+L A+ +G   SE L LN+D+LW G P       G   N     +LS +R  +
Sbjct: 39  AYPIGNGQLAALPFGTPGSEKLNLNKDSLWNGGPFGDASYIGGNPNSSVSSSLSGIRDFI 98

Query: 80  DSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
               +   T     L G   +   YQ+L ++ +      +  A E Y+R LDLNT     
Sbjct: 99  ----FQNGTGNVTALMGSDDNYGSYQVLANLSVSLQG--ISGATE-YKRSLDLNTGIHTT 151

Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGS---LSF-NVSLDSLLDNHSYVNGNNQI 193
            +   N  +T   F S PD V V +++ + + S   + F NV  DS L   S    +   
Sbjct: 152 TFKTSNSSYTTAVFCSYPDSVCVYQVNSTTTLSKIDVHFDNVLTDSSLIKSSCSKSSKSA 211

Query: 194 IMEGRCP---GKRIPPKANANDDPKGIQFS---AILEIKISDDRGTISALEDKKLKVEGS 247
           +  G      G     +A   +  K +  S    IL I  S D+ ++S            
Sbjct: 212 LFSGITQADIGMIYKAEARVLESTKSVSCSNTTGILSITPSHDQKSLS------------ 259

Query: 248 DWAVLLLVASSSFDGPFINPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
               L++ A +++D      +D+      DPT+   S +    + +   L  +H+ D+  
Sbjct: 260 ----LVISAGTNYDATKGTAADNYSFKGVDPTAYVSSTIAKAASKTVKTLRNKHVSDFSA 315

Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE----DPSLVELLFQFGRY 359
           L +  ++ L         D     N +T   A  + ++ T +    DP +  LLF + RY
Sbjct: 316 LMNSFTLSLP--------DPLGSANKET---AAVIAAYNTTDNTHTDPWVENLLFDYSRY 364

Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
           L ISSSR  +   NLQG W   L   W +  H NIN++MN+W ++   L + Q  L+ ++
Sbjct: 365 LFISSSRDNSLPPNLQGKWAYGLYNAWGADYHANINIQMNHWGAVQTGLGDLQSALWTYM 424

Query: 420 T-YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
           +   +  G++TA++ Y A GWV+H + +I+  +    G   WA +P   +WL  H+ ++Y
Sbjct: 425 SETWAPRGAETAKLLYNAPGWVVHDEMNIFGHTGMKTGDEYWADYPAAASWLMQHVADYY 484

Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLAC 535
           +Y+ D+++L +  YPLL+  + F L  L +    +DG L  NP +SPEH    P     C
Sbjct: 485 DYSRDKNWLRETGYPLLKAVSEFWLSQLQKDEYFNDGTLVVNPCSSPEH---GPT-TFGC 540

Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS--LPRLRPTKIAEDGSIMEW 593
             Y       +I  +F+  + AA  L    D+ ++K L +  L   +   I+    I EW
Sbjct: 541 THYQQ-----LIHSLFTTTLQAARALSL--DSTLQKSLTTSLLSLDKGLHISPTTQIQEW 593


>gi|421734699|ref|ZP_16173762.1| alpha-L-fucosidase [Bifidobacterium bifidum LMG 13195]
 gi|407077388|gb|EKE50231.1| alpha-L-fucosidase [Bifidobacterium bifidum LMG 13195]
          Length = 1954

 Score =  189 bits (480), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 164/630 (26%), Positives = 283/630 (44%), Gaps = 92/630 (14%)

Query: 28   IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
            +P GNG++G  VWG V  E +  NE+TLWTG PG  T      N    +  + +R+L   
Sbjct: 622  LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 681

Query: 82   GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
                  T     L G  + A+    L  GDI L++  +     E  YRR+L+L+   A V
Sbjct: 682  LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 739

Query: 138  KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
             +    V +TRE+F+SNPD V+V +++ S++G L+FNVS+ +   N +Y        ++G
Sbjct: 740  TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 796

Query: 198  RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
                  +  K    ++  G+ +++ +++ + +  GT+S   D   LKV  +    L + A
Sbjct: 797  DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGADGASLKVSDAKAVTLYIAA 850

Query: 257  SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
            ++ +    P     ++  +  +     +Q   N  Y+ +   H+ D+  ++ RV I L +
Sbjct: 851  ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 910

Query: 315  SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
            S       +      D +  A +  S  T +   L  L++++GRYL I SSR  +Q+ +N
Sbjct: 911  SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 966

Query: 374  LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
            LQGIW      N   +  W S  H+N+NL+MNYW +   N+ E  EPL +++  L   G 
Sbjct: 967  LQGIWSVTADDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1026

Query: 428  KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
             TA+V   A              G++ H +   +  ++  +    W   P    W+  ++
Sbjct: 1027 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1085

Query: 475  WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
            +E Y Y+ D   L  R Y LL+  + F +++++          L T  + SPE   +  D
Sbjct: 1086 YEAYEYSGDPALLN-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1144

Query: 531  GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
            G        +T + +++ ++ +  I AA+  + + D LV                     
Sbjct: 1145 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGDTTDCSADNWAKGDNGNFTD 1195

Query: 574  ----------KSLPRLRPTKIAEDGSIMEW 593
                      KSL  L+P ++   G I EW
Sbjct: 1196 ANANRSWSCAKSL--LKPIEVGNSGQIKEW 1223


>gi|379719129|ref|YP_005311260.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
 gi|378567801|gb|AFC28111.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
          Length = 913

 Score =  189 bits (480), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 167/600 (27%), Positives = 270/600 (45%), Gaps = 71/600 (11%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
           + +A+P GNG +GA V G + SET+ L    LWTG       P+    L+++R L+D G 
Sbjct: 120 WREALPSGNGLIGAAVHGAIGSETVLLTHAELWTGGT-KQELPEVSGTLAEIRRLMDEGA 178

Query: 84  YAEATAASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEET----YRRELDLNTATARVK 138
           Y EA      L G   +  Y+ + +  L   D  +    +     YRRELDL T    V+
Sbjct: 179 YREANGL---LEGRLREAGYEPVRETPLPLADLKVVRTAQAGFRRYRRELDLETGEVSVR 235

Query: 139 YSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-------SLLDNHSYVNGNN 191
           +  G   + R+ F S  D +IV ++ GS  G +   + L        S  D  SYV+ + 
Sbjct: 236 WEEGAAAYERKLFVSRSDDLIVYEL-GSRGGCVDVALLLQPHEKGTASRPDMPSYVSESL 294

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI-----KISDDRGTISALEDKKLKVEG 246
           +I              A  NDD  G  F A+L       ++ +D+G        +L V G
Sbjct: 295 EI-----TAADGFLRYAARNDD--GRDFGAVLRAVPAGGRLGEDQG--------RLSVTG 339

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS--YSDLYTRHLDDYQKL 304
           +D  VL+LV    F G          D + E       +R ++  YS+L  RH   +  L
Sbjct: 340 AD-KVLILV--KVFAG---------GDRSQEWTRLEAELREVAWTYSELLDRHTALHGPL 387

Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
                + L  + ++  + T ++E +         ++++    P+L EL++ +GRYL IS 
Sbjct: 388 MRSADVHLGGAGEE-ASCTYTDELLQ--------EAYEGGLSPALAELMWAYGRYLFISG 438

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
           +RPG     L G+W  D    W S    N N++M YW +    LSE   P+ D+      
Sbjct: 439 TRPGGLPFGLYGLWCGDYKAVW-SHFMANENVQMMYWHAAAGGLSELILPMLDYYESRLE 497

Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
                A+  Y   G  I   T            V+   W     WL  H +E+Y +T D 
Sbjct: 498 IFRDNARKLYGCRGIFIPAGTTPGMAEPFQTVPVI-MHWTGAAGWLARHFYEYYRFTGDL 556

Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH---EFIAPDGKLACVSYS-- 539
           +FL +RA P ++  A F  D+L+EG DG L + PS SPE+    +I+ +G    ++++  
Sbjct: 557 EFLRRRALPFMKEAALFYEDFLVEGEDGRLVSYPSVSPENTPGNYISEEGVFGAMAHAMP 616

Query: 540 ----STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
               + +D AI++E+ + ++ A E+  + E   V +    L R+   ++  DG++ EW+ 
Sbjct: 617 TAVNALLDFAILKELLTDLLEAVELTGEGEPEAVRRWSVLLERIPAYEVNGDGAVREWLH 676


>gi|156041112|ref|XP_001587542.1| hypothetical protein SS1G_11535 [Sclerotinia sclerotiorum 1980]
 gi|154695918|gb|EDN95656.1| hypothetical protein SS1G_11535 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 796

 Score =  189 bits (480), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 162/590 (27%), Positives = 257/590 (43%), Gaps = 58/590 (9%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------GDYTNPDAPKALSDVRSLV 79
           A PIGNG+L A+ +G   SE L LN D+LW G P       G   N      L  +R  +
Sbjct: 39  AYPIGNGQLAALPFGEPGSEKLNLNRDSLWNGGPFENASYNGGNPNFSVASTLPGIRDWI 98

Query: 80  DSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET-YRRELDLNTATAR 136
               +   T     L G   +   YQ+LG++ +         ++ T Y+R LDL T    
Sbjct: 99  ----FRNGTGNVTTLMGSDDNYGSYQVLGNLSVSLQG----ISDATGYKRSLDLGTGIHT 150

Query: 137 VKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIME 196
             ++  NV FT   F S PD V V +++ S +     ++  D+L  + S V  +     +
Sbjct: 151 TTFNTANVSFTTAVFCSYPDSVCVYQVN-STATLPRIDIYFDNLQADSSLVKSSCSTSSK 209

Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
                             +     +   +  S+  GT+S +              L++ A
Sbjct: 210 SALFSGITQADIGMIYKAEARVIESAKSVSCSNTTGTLSIIPSNNQHS-----LSLVISA 264

Query: 257 SSSFDG----PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
            +++D        N S   +DP++     +    + ++  L   HL D+  L +  ++ L
Sbjct: 265 GTNYDATKGTAAHNYSFKGEDPSNYVSKTVAKAASKTFKTLRKNHLADFSALINTFTLSL 324

Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE----DPSLVELLFQFGRYLLISSSRPG 368
                    D     N +T   A  + ++ T E    DP L  LLF + RYL ISSSR  
Sbjct: 325 P--------DPLGSANKET---ATVISAYNTTENSHTDPWLESLLFDYSRYLFISSSRDN 373

Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT-YLSINGS 427
           +   NLQG W   LS  W    H NINL+MN+W +    L + Q  L+ ++    +  GS
Sbjct: 374 SLPPNLQGKWAYGLSNAWGGDYHSNINLQMNHWVADQTGLGDLQSALWSYMAETWAPRGS 433

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           +TA++ Y A GWV+H + +I+  +    G   WA +P   +WL  H+ ++Y+Y+ D  +L
Sbjct: 434 ETAKLLYNAPGWVVHDEMNIFGHTGMKTGDEYWADYPAAASWLMQHVADYYDYSRDETWL 493

Query: 488 EKRAYPLLEGCASFLLDWL---IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
           +   YPLL+  + F L  L   +  +DG L  NP +SPEH    P     C  Y      
Sbjct: 494 KNTGYPLLKAISEFWLSQLQKDVYFNDGTLVVNPCSSPEH---GPT-TFGCTHYQQ---- 545

Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
            +I  VF++ + AA  L   ++ L   +  +L  L +   I+    I EW
Sbjct: 546 -LIHAVFTSTLQAARTLST-DNTLQNTLQSTLTTLDKGLHISPLTQIQEW 593


>gi|429725254|ref|ZP_19260100.1| hypothetical protein HMPREF9999_00368 [Prevotella sp. oral taxon
           473 str. F0040]
 gi|429150389|gb|EKX93300.1| hypothetical protein HMPREF9999_00368 [Prevotella sp. oral taxon
           473 str. F0040]
          Length = 1038

 Score =  188 bits (477), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 183/618 (29%), Positives = 277/618 (44%), Gaps = 107/618 (17%)

Query: 11  NPLKITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT- 64
           NPL + +  PA    +     ++PIGNG+LGA ++GGV ++ ++ NE TLW G P D   
Sbjct: 201 NPLTLWYPSPANAGPNPWMEYSLPIGNGQLGACIFGGVKTDEIQFNEKTLWWGTPKDMQR 260

Query: 65  -NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
            N D P                      V  FG     Y   G + ++  +++L   ++ 
Sbjct: 261 QNGDGP----------------------VSGFG----CYLNFGGLFVQNLNANLSQVKD- 293

Query: 124 YRRELDLNTATARVKYS-VGNVEFTREHFSSNPDQVIVT--KISGSESGSLSFN-VSLDS 179
           Y R LD+ TA A VK++     ++TR + SS PD VI    + +G     L F  +S D+
Sbjct: 294 YVRYLDIQTAVAGVKFTDEAGTQYTRRYLSSQPDGVIAALYEANGKNKLDLQFTLISGDT 353

Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
           L    +    +      G+ P   I   A     P G               GT++A  D
Sbjct: 354 LKTKKTEYTADGSGWFAGKLP--TIFHNARFKVVPVG---------------GTLTATAD 396

Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL-QSIRNLSYSDLYTRHL 298
             + V+G++  +++L   +SF       +    D  +  ++AL  +    S+  +   ++
Sbjct: 397 G-IVVKGAEKVMVILAGGTSFAPTLPERTKGTADDLNARITALVDNAAKKSFEAIEAANI 455

Query: 299 DDYQKLFHRVSIQL-----SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
            D+Q    RV+  L      R+ KD+V    +  N           +  T +   L +L 
Sbjct: 456 ADHQSYMSRVAFHLEGAASQRNTKDLVDYYSAAPN-----------NRNTADGLFLEQLY 504

Query: 354 FQFGRYLLISSSRPGTQVAN-LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
           F FGRYL ISSSR    V N LQGIWN      W+S  H NIN++MNYW + P NLS+C 
Sbjct: 505 FNFGRYLSISSSRGSMPVPNNLQGIWNNRHDAPWNSDVHNNINVQMNYWPAEPTNLSDCH 564

Query: 413 EPLFDFLTYLSINGSKTAQVNYLA-----------SGWVIHHKTDIWAKSSADRGKVVWA 461
            P   FL Y+ IN S++      A            GW +  +++I+       G   W+
Sbjct: 565 MP---FLNYI-INNSQSEGWQRAAREFNKINGKSNKGWTVFTESNIFG------GMSTWS 614

Query: 462 L-WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
             + +  AWL  HLW+HY YT+D+DFL +RA+P + G A F +  L + +DG  E     
Sbjct: 615 SNYCVANAWLVYHLWQHYRYTLDQDFL-RRAWPAIWGSAEFWIHRLKKANDGTYEAPNEW 673

Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED-ALVEKVLKSLPR- 578
           SPE+     DG +A      T ++ I  +V   I+ A  V   +ED  L+   L  L + 
Sbjct: 674 SPEYG-PKQDG-VAHAQQLITENLQIAHDVVE-ILGAKNVGISDEDLKLLNDRLTHLDKG 730

Query: 579 LRPTKIAEDGSIMEWVQR 596
           LR  K   D     W QR
Sbjct: 731 LRIEKYRND-----WAQR 743


>gi|427386362|ref|ZP_18882559.1| hypothetical protein HMPREF9447_03592 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726402|gb|EKU89267.1| hypothetical protein HMPREF9447_03592 [Bacteroides oleiciplenus YIT
           12058]
          Length = 817

 Score =  188 bits (477), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 162/605 (26%), Positives = 266/605 (43%), Gaps = 112/605 (18%)

Query: 27  AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
           ++PIGNG +GA ++G    E ++L E T+  G  G Y                       
Sbjct: 84  SLPIGNGAMGACIFGRTDVERIQLAEKTM--GNKGAY----------------------- 118

Query: 87  ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
               S+  F + A++Y           D H  YA+  Y+R L LN A + V Y     E+
Sbjct: 119 ----SMGGFTNFAEIYL----------DIHHNYAQ-NYKRTLRLNDAISTVSYIHEGTEY 163

Query: 147 TREHFSSNPDQVIVTKISGSESGSLSFNVS-----LDSLLDNHSYVNGNNQ-----IIME 196
            RE+F+SNP  VI  K+  S+ G +SF V      L S  +  +  +G+ Q     I +E
Sbjct: 164 NREYFASNPANVIAVKLKASQPGMISFTVRPVLPYLHSFNNEQTGRSGHVQAEKDLITLE 223

Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE----DKKLKVEGSDWAVL 252
           G      +P +                +IKI +  GT+S++     +  + V  +D  +L
Sbjct: 224 GEIQYFHLPYEG---------------QIKIINYGGTLSSVNKGDNNSFINVSKADSVIL 268

Query: 253 LLVASSSF---DGPFINPSDSK----KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
            +  ++S+   D  F+ P+  K      P  +    ++      Y  L ++H+ DYQ  F
Sbjct: 269 YITVATSYELKDSVFLLPNAEKFKGNAHPHGQVSKRIREAIEKGYECLRSKHIADYQHFF 328

Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISS 364
           +RV +QL+             E+  ++P+ + +  ++  + D  L EL FQ+GRYLLISS
Sbjct: 329 NRVDLQLT-------------EHTPSIPTDKLLNQYRNGKHDTYLEELFFQYGRYLLISS 375

Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF------ 418
           SR G+  ANLQG+WN+     W      N+N++MNYW +   NL+E   P  D+      
Sbjct: 376 SRQGSLPANLQGVWNQYEFAPWSGGYWHNVNVQMNYWPAFNTNLAELFIPYMDYNEAFRK 435

Query: 419 ------LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
                 + Y++ N  +        +GW I      +  S               G +   
Sbjct: 436 AATGKAVDYITQNNPEALDPTVEENGWTIGTGATAFGISGPGGHSGP-----GTGGFTTK 490

Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK 532
             W++Y++T D+  L+   YP L G A FL   L    DG L  +PS SPE   I   G 
Sbjct: 491 LFWDYYDFTRDKQLLKDHVYPALMGMAKFLSKTLKPQPDGTLLVDPSFSPEQ--IHQQGY 548

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
               S     D ++I E +  ++ AA++L  +++  ++ V + + +L   +I E G I E
Sbjct: 549 YR--SKGCIFDQSMILETYRDLLIAAKILN-DKNPFLKTVKEQIGKLDAIQIGESGQIKE 605

Query: 593 WVQRR 597
           + + +
Sbjct: 606 FREEK 610


>gi|317139357|ref|XP_001817454.2| alpha-fucosidase A [Aspergillus oryzae RIB40]
          Length = 777

 Score =  187 bits (476), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 155/585 (26%), Positives = 260/585 (44%), Gaps = 51/585 (8%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVRSLVDSG 82
           +A  +GNG+LG M +G   +E L LN D LW G P     Y   +   +++++ S V   
Sbjct: 23  EAYTLGNGKLGVMPFGEPGAEKLNLNHDELWEGGPFEVNGYRGGNPNSSMTEILSEVRDE 82

Query: 83  QYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
            + + T    +L G       +  L ++ +  D    K ++  Y R LDL T      YS
Sbjct: 83  IWKKGTGNDSRLHGDTDGYGSFHSLANLTIAIDGID-KVSD--YTRSLDLGTGIHTTTYS 139

Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN-NQIIMEGRC 199
            G  ++T + + S P QV + K++ + + S    +  D L++  S  N   +      R 
Sbjct: 140 TGKGKYTTDVYCSYPAQVCIYKLNSTATLS-KVTIYFDQLVEESSLWNATCDSDFARLRG 198

Query: 200 PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV--AS 257
             +  PP+        G+ +  I    I     + +     KL +   + + L +V  A 
Sbjct: 199 VTQEGPPR--------GMTYDTIARSSIPGRCDSSTG----KLAINARNSSSLTIVIGAG 246

Query: 258 SSFDG----PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
           + FDG       + +   +DP         S  + S S L T H++DY  L    ++ L 
Sbjct: 247 TDFDGTKGTAATDYTFKGEDPAEYVEKITSSALSQSESKLRTEHIEDYSGLMSAFTLDLP 306

Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
                   DT      +         + +TD DP L +LLF +GR+L ISSSR  +   N
Sbjct: 307 --------DTQDSTGTELSTLITNYNANKTDGDPYLEKLLFDYGRHLFISSSRANSLPPN 358

Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-NGSKTAQV 432
           LQG+W+   +  W    H NINL+MN W +    L E    +F+++    +  G++TA++
Sbjct: 359 LQGVWSPTKNAAWSGDYHANINLQMNLWGAEATGLGELTVAVFNYMEQNWMPRGAETAEL 418

Query: 433 NYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
            Y  +GWV H + +I+  +     +   A +P   AW+  H+W+ Y+Y+ ++ +  ++ +
Sbjct: 419 LYGGAGWVTHDEMNIFGHTGMKTYQTS-ANYPAAPAWMMQHVWDRYDYSHNKTWFIEQGW 477

Query: 493 PLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
           PLL+G A F    L      +D  L  NP TSPE             ++  T    +I +
Sbjct: 478 PLLKGVAEFWASQLQVDKFNNDSSLVVNPCTSPEQ---------GPTTFGCTHWQQLIHQ 528

Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
           V+   I  AE+  + +  L++ +   LPRL +   I   G I EW
Sbjct: 529 VYENAIQGAEIAGETDSTLLKDIKDQLPRLDKGLHIGTWGQIKEW 573


>gi|225019386|ref|ZP_03708578.1| hypothetical protein CLOSTMETH_03339 [Clostridium methylpentosum
           DSM 5476]
 gi|224947849|gb|EEG29058.1| hypothetical protein CLOSTMETH_03339 [Clostridium methylpentosum
           DSM 5476]
          Length = 1796

 Score =  187 bits (476), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 143/499 (28%), Positives = 240/499 (48%), Gaps = 57/499 (11%)

Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
           Y+R LDLNTA   V Y +  V +TR+ F++ PD V+V K+  S+ G+L F V  + + D 
Sbjct: 185 YQRYLDLNTAVTGVSYDIDGVTYTRQMFANFPDNVMVYKMDASKEGALDFTVRPE-IPDM 243

Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAIL---EIKISDDRGTISALED 239
            S  +GN      G+     +  + N     +G ++ + +L   + K+  D GT++A  D
Sbjct: 244 VSKASGNYDKTTMGKE--GTVFAEENGLITLRGTLKHNGMLFEGQYKVIPDGGTMTASND 301

Query: 240 K-----KLKVEGSDWAVLLLVASSSFDGPFINPSDSK---KDPTSESMSALQSIRNLSYS 291
           +     ++ V G++ A +++   +++    +N  D     +DP  +  + + +   L + 
Sbjct: 302 ENNDHGQITVSGANSAYIIIALGTNY----VNDYDKDYVGEDPHDDVTARIANAEALGFD 357

Query: 292 DLYTRHLDDYQKLFHRVSIQLSRS--PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL 349
           +LY+RH  DY  LF R ++ L+ +  P D  TD   +E      +  R +  +       
Sbjct: 358 ELYSRHKADYTALFDRATLSLNGATFPADKTTDQLLKE----YKAGSRSQYLE------- 406

Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
            +L FQFGRYLLI++SR  T   NLQG+WN+  +P+W S  H NINL+MNYW ++  NLS
Sbjct: 407 -QLYFQFGRYLLIAASRGDTLPTNLQGVWNDSETPSWQSDYHTNINLQMNYWPAMETNLS 465

Query: 410 ECQEPLFDFLTYLSINGSKTAQVNY--------LASGWVIHHKTDIWAKSSADRGKVVWA 461
           E   PL +++  L   G  T Q  +          SGW+++        +         +
Sbjct: 466 ETAIPLVEYIDSLRKPGRVTFQKTWGIEPAEGDEESGWIVNCSNGPMGFTGNINSNA--S 523

Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL----IEGHDGYLETN 517
               G A++  +L+++Y +T D+D+L    YP+L+  +   +  L     E     L   
Sbjct: 524 FTATGAAFINQNLFDYYQFTQDKDYLRSTIYPILKESSKTYMQILEPGRTEADKDKLYMV 583

Query: 518 PSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP 577
           PS S E       G     +Y    D  +I + F+    AA+ L  + D   E + + +P
Sbjct: 584 PSYSSEQ------GPWTVGAY---FDQQLIYQCFNDTALAADELGIDSDFAAE-LRELMP 633

Query: 578 RLRPTKIAEDGSIMEWVQR 596
           +L P +I + G I EW Q 
Sbjct: 634 KLDPIQIGDSGQIKEWQQE 652


>gi|402084812|gb|EJT79830.1| hypothetical protein GGTG_04913 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 819

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 171/602 (28%), Positives = 272/602 (45%), Gaps = 71/602 (11%)

Query: 30  IGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDY--TNPDAPKALSDVRSLVDSGQY 84
           +GNGRLGAM +G   +E L  N D+LW+G P    DY   NP A KA  D    +    +
Sbjct: 50  LGNGRLGAMPFGPPGAERLVFNVDSLWSGGPFQSADYRGGNPVASKA--DALPAIRDQIW 107

Query: 85  AEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSV 141
              T     L G  A+   Y++LG+  +  D + +  A  T YRR LDL T      +  
Sbjct: 108 KNGTGDLSPLLGSSANYGSYRVLGNFTV--DIAGVADAPYTDYRRSLDLTTGVHTTTFKT 165

Query: 142 GNVEFTREHFSSNPDQVIV--TKISGSESGSL-SFNVSLD-SLLDNHSYVNGNNQIIMEG 197
           GN  F+   +   PDQV V    ++G    +L   +V  D +L+   ++           
Sbjct: 166 GNSSFSTWVYCGFPDQVCVYTVAVTGDRPAALPDVSVRFDNALVPAETFTRSCGDAFTRV 225

Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEI---------KISDDRGTISALEDKKLKV---E 245
           R   +  PP+        G+++ A+  +           S    T    +D  L +   E
Sbjct: 226 RGVTQVGPPE--------GLRYDAMARVVSSGGGGGGGGSAASTTTRCGDDGTLVISTPE 277

Query: 246 GSDWAVLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
           G     +++ A + FD    N +        DP     +   +    + ++L   HLDDY
Sbjct: 278 GQRSVSVVIGAGTDFDQTKGNAASGYSFRGDDPAPLVEATTAAAAAKTQAELLKAHLDDY 337

Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE---DPSLVELLFQFGR 358
             L      QL         D    +    V + + + S++ D+   DP L   LF + R
Sbjct: 338 AALMG--GFQL---------DIADAKGSAAVETRKLIASYRADDVTGDPYLEAALFDYSR 386

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP-LFD 417
           +L +SSSR  +   NL G W E+L P W +  H NINL+MNYW +    L     P L+D
Sbjct: 387 HLAVSSSRANSLPTNLAGRWTEELEPAWSADHHANINLQMNYWVNDQTGLGPATTPALWD 446

Query: 418 FLTY-LSINGSKTAQVNYLA-SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
           ++    +  G++TA++ Y A +GWV+H++ +++   SA + +  WA +P   AW+  H+W
Sbjct: 447 YMELNWAPRGAETARLLYGADAGWVVHNEMNVFG-FSAMKEEASWANYPAANAWMMQHVW 505

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGK 532
           + + Y +D  +  ++ YPL++G A F L  L E    +DG L  NP  SPEH    P   
Sbjct: 506 DRWEYGLDAAWFRRQGYPLIKGTAQFWLSQLQEDKWFNDGSLVVNPCNSPEH---GPT-T 561

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIM 591
             C  +       I R +F+A ++ AE   + + A +  V  +L RL +    ++ G + 
Sbjct: 562 FGCTHFHQE----IHRTLFTA-LAGAEAGGETDAAFLGSVRAALARLDKGVHRSDFGGLK 616

Query: 592 EW 593
           EW
Sbjct: 617 EW 618


>gi|393247026|gb|EJD54534.1| glycoside hydrolase family 95 protein [Auricularia delicata
           TFB-10046 SS5]
          Length = 861

 Score =  187 bits (475), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 158/602 (26%), Positives = 262/602 (43%), Gaps = 91/602 (15%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKALSDVRSLVDSGQYAE 86
           +P+GNG +G M       + + LN ++LWTG P     N +    L+ V + V      E
Sbjct: 103 LPVGNGYMGMMQSSRPDFDDVVLNLESLWTGGPYNSANNYNGGNPLTAVNASVR-----E 157

Query: 87  ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE---------------TYRRELDLN 131
              A++   G P        D+    D SH                      Y R LD N
Sbjct: 158 NIRATIWANGSP--------DLTPLVDGSHYGSLSSPGSLHISRSIGNDVTGYERALDFN 209

Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL-DNHSYVNGN 190
             T    +  G+  + R +F S PDQV V    G+ + +  +  SLD+L   +++ V   
Sbjct: 210 DGTISATWKEGSNSYLRTYFCSFPDQVCVVNTEGTGNDTAIY--SLDTLRPRDYASVACL 267

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE-IKISDDRGTISALEDKKLKVEGSDW 249
           ++  +  R              +  G+ +  ++  I  S D  T S   +  L   G+  
Sbjct: 268 DKSTLAYR-----------GLAESSGMTYEILVRLISSSPDSVTCSGAGNATLTGSGARQ 316

Query: 250 AVLLLVASS------------SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
            VL+  A++            SF GP         DP + ++++L      SY  L +RH
Sbjct: 317 MVLITGATNYNIDAGTRAHNFSFAGP---------DPHASALNSLSKASRSSYEALLSRH 367

Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE-LLFQF 356
           +DDY  LFH   + L + P D+V            P+ + V  + T      +E LLF  
Sbjct: 368 IDDYSALFHGFELDLGQKP-DVVK-----------PTDQLVAEYVTGTGNVYLEWLLFNL 415

Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           GR+++I+ +R G   + LQ +W   L   W    H NINL+MNYW +   NL     PL+
Sbjct: 416 GRFMMITGAR-GVLPSGLQSVWTTGLEAPWGGDYHANINLQMNYWGAEETNLGAVTGPLW 474

Query: 417 DFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
           +++    +  GS+TAQ+ Y + G+V+H++ +I+  +    G   WA +P    W+  H+W
Sbjct: 475 NYMRKTWVPRGSETAQLVYGSRGFVVHNEMNIFGHTGMKLGDPQWADYPAAATWMMLHVW 534

Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGK 532
           +H+++T D ++   + + LL+  A F LD L E     DG L   P  SPE+  + P   
Sbjct: 535 DHFDFTGDLNWFRSQGWSLLKAQAEFWLDNLFEDSASKDGTLVAVPCNSPENGIVGP--- 591

Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIM 591
               +Y       +I E+F  I    ++    + + ++++   L +L R  +I   G + 
Sbjct: 592 ----TYGCAHFQQLIWELFHNIQKGFKLSGDADQSFLKEIEAKLSKLDRGVRIGSWGQMQ 647

Query: 592 EW 593
           EW
Sbjct: 648 EW 649


>gi|418200759|ref|ZP_12837202.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47976]
 gi|353864300|gb|EHE44218.1| fibronectin type III domain protein [Streptococcus pneumoniae
           GA47976]
          Length = 477

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 142/499 (28%), Positives = 232/499 (46%), Gaps = 76/499 (15%)

Query: 11  NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
            P+  T+ G  +   +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY   
Sbjct: 15  QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71

Query: 66  --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
              D    L+++R  ++   Y  A   + +    P       Y   GDI +EF       
Sbjct: 72  NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131

Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
           ++ T Y+R+L+++ A     Y      F RE F+S PD ++V + +   + +L F + L 
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191

Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
               L  +  Y               ++ I+M+GR            ND    ++F++ L
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKD---------ND----LRFASYL 238

Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
             +     G I    D+ +++ G+ +A L L A + F     +    K D   + +  + 
Sbjct: 239 AWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVD 294

Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
           + +   Y+ L +RH++DYQ LF RV + L             E N+D   + + +K+++ 
Sbjct: 295 TAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKP 341

Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
            E  +L EL FQ+GRYLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW
Sbjct: 342 QEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYW 401

Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
            +   NL E   P+ +++  L + G + A V Y          +GW++H +     W   
Sbjct: 402 PAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAP 460

Query: 452 SADRGKVVWALWPMGGAWL 470
             D     W   P   AW+
Sbjct: 461 GWD---YYWGWSPAANAWM 476


>gi|336378685|gb|EGO19842.1| glycoside hydrolase family 95 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 864

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 162/581 (27%), Positives = 267/581 (45%), Gaps = 76/581 (13%)

Query: 28  IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------GDYTNPDAPKALSDVRSLVD 80
           +PIGNG L AM+ GG+  E  +LN ++LW G P       G    P     ++     + 
Sbjct: 79  LPIGNGYLAAMIPGGIFQEVTQLNIESLWQGGPLQDPSYNGGNNLPSQQAQMAQDMQSIR 138

Query: 81  SGQYA--EATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVK 138
              +A    T  +++    P   Y             +       Y R LDL+   AR  
Sbjct: 139 QSIFASPNGTINNIEEICTPPGDYGSYSGAGYFISTLNNTGTTSNYGRWLDLDEGVARTT 198

Query: 139 YSVGNVEFTREHFSSNPDQVIVTKISGSESGSL-----SFNVSLDSLLDNHSYVNGNNQI 193
           +S G+  F+RE F S+P Q  V  ++ S   SL     +F+VS ++ L   +    +N  
Sbjct: 199 WSQGSSIFSREAFCSHPAQACVQYVNTSGQASLPTVTYAFSVSQETGLPAPNVTCLDNAT 258

Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA-------LEDKKLKVEG 246
           +      G    P         G+ +  I  ++ S+  GT+S          +  + V G
Sbjct: 259 L---NIRGYVTNP---------GMMYEIIGRVQASN--GTVSCNVVSGSTPTNATVSVSG 304

Query: 247 SDWAVLLLVASSSFDGPFINPSD-------SKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
           +  A +  V  +++D   I+  D          DP S  +S + S  + SY++L + H+ 
Sbjct: 305 ASEAWITWVGGTNYD---IDAGDLAHNFTFQGVDPHSNLVSLVSSATSNSYTELLSEHIA 361

Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE-LLFQFGR 358
           DY  L    S+ L ++P D+ T           P+ + V S+QT    + +E +LF FGR
Sbjct: 362 DYTSLISPFSLSLGQTP-DLST-----------PTDQIVASYQTYVGNAYLEWVLFNFGR 409

Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
           YLL SS+R G   ANLQG W +  S +W +  H NINL+MNYW +   NL+  Q  LFD+
Sbjct: 410 YLLTSSAR-GILPANLQGKWADGQSNSWGADYHANINLQMNYWFAEMANLNVTQS-LFDY 467

Query: 419 L-TYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSA--DRGKVVWALWPMGGAWLCTHL 474
           +    +  G++TA + Y ++ GWV H + +I+  +    +     WA +P   AW+  H 
Sbjct: 468 MEKTWAPRGAETALILYNISQGWVTHDEMNIFGHTGMKLEGNSAQWADYPESNAWMMIHA 527

Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI---EGHDGYLETNPSTSPEHEFIAPDG 531
           W+H++YT D ++ + + +PL++  ASF L+ LI     +DG L T P  SPE        
Sbjct: 528 WDHFDYTNDVEWWKAQGWPLVKAVASFHLEKLIPDLHFNDGTLVTAPCNSPEQ------- 580

Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
               +++       +I ++F+A+    E     + A ++ +
Sbjct: 581 --VPITFGCAHAQQLIWQLFNAVEKGYEAAGDTDTAFIQAI 619


>gi|337748035|ref|YP_004642197.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
 gi|336299224|gb|AEI42327.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
          Length = 913

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 167/598 (27%), Positives = 268/598 (44%), Gaps = 67/598 (11%)

Query: 24  FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
           + +A+P GNG +GA V G + SET+ L    LWTG       P+    L+++R L+D G 
Sbjct: 120 WREALPSGNGLIGAAVHGAIGSETVLLTHAELWTGGT-KQELPEVSGTLAEIRRLMDEGA 178

Query: 84  YAEATAASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEET----YRRELDLNTATARVK 138
           Y EA      L G   +  Y+ + +  L   D  +    +     YRRELDL T    V+
Sbjct: 179 YREANGL---LEGRLREAGYEPVRETPLPLADLKVVRTAQAGFRRYRRELDLETGEVSVR 235

Query: 139 YSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-------SLLDNHSYVNGNN 191
           +  G   + R+ F S  D +IV ++  S  GS+  ++ L        S  D  SYV+ + 
Sbjct: 236 WEEGAAAYERKLFVSRSDDLIVYELE-SRGGSVDVDLLLQLHEKGTASRPDIPSYVSESL 294

Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI-----KISDDRGTISALEDKKLKVEG 246
           QI              A  NDD  G  F A+L       ++ +D+G        +L V G
Sbjct: 295 QI-----TAADGFLRYAARNDD--GRDFGAVLRAVPAGGRLGEDQG--------RLSVTG 339

Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
           +D  VL+LV    F G      D  ++ T   + A       +YS+L  RH   +  L  
Sbjct: 340 AD-KVLILV--KVFAG-----GDRSQEWTR--LEAELREAAWTYSELLDRHTALHGPLMR 389

Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
              + L  + ++    T ++E +         ++++    P+L EL++ +GRYL IS +R
Sbjct: 390 SADLHLGGAGEEAAC-TYTDELLQ--------EAYEGGLSPALAELMWAYGRYLFISGTR 440

Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
           PG     L G+W  D    W S    N N++M YW +    LSE   P+ D+        
Sbjct: 441 PGGLPFGLYGLWCGDYKAVW-SHFMANENVQMMYWHAAAGGLSELILPMLDYYESRLEIF 499

Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
              A+  Y   G  I   T            V+   W     WL  H +E+Y +T D +F
Sbjct: 500 RDNARKLYDCRGIFIPAGTTPGMAEPFQTVPVI-MHWTGAAGWLARHFYEYYRFTGDLEF 558

Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH---EFIAPDGKLACVSYS---- 539
           L +RA P ++  A F  D+L+ G DG L + PS SPE+    +I+ +G    ++++    
Sbjct: 559 LRRRALPFMKEAALFYEDFLVAGEDGRLVSYPSVSPENTPGNYISEEGVFGAMAHAMPTA 618

Query: 540 --STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
             + +D AI++E+ + ++ A E+  + E   V +    L R+   +   DG++ EW+ 
Sbjct: 619 VNALLDFAILKELLTGLLEAVELTGEGEPEAVRRWSVLLERIPAYEANGDGAVREWLH 676


>gi|307718131|ref|YP_003873663.1| alpha-L-fucosidase [Spirochaeta thermophila DSM 6192]
 gi|306531856|gb|ADN01390.1| alpha-L-fucosidase 2 precursor [Spirochaeta thermophila DSM 6192]
          Length = 758

 Score =  186 bits (471), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 168/586 (28%), Positives = 253/586 (43%), Gaps = 74/586 (12%)

Query: 20  PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
           PA  + D  P+GNGRL A+V GG+  E + LN + LW G   D         L  VR   
Sbjct: 15  PAGVWRDGYPVGNGRLAALVVGGLGEERIHLNHEWLWRGRYRDRVAEGRAHLLGWVREAF 74

Query: 80  DSGQYAEATAASVKLF-------GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
             G + E T  + + F       G P  V  YQ  G + L ++        E Y RELDL
Sbjct: 75  FRGDWEEGTRRANEAFGGGGGVSGRPCRVGAYQPAGTLVLWWEGMD----GEGYERELDL 130

Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
                RV+      E         P   +  ++SG   G +     +   +       G 
Sbjct: 131 EEGVVRVRRGRSVEEVM-AVMGGGP---VGVRVSGWGRGWVGLEREVQEGVAVRVGAKGG 186

Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
             + +EGR             ++  G +  A++       RG +   E  ++ VEG +  
Sbjct: 187 -MVRLEGRF------------EEGIGWEVRAVV-------RGGVCRGEGGRVWVEGEEVV 226

Query: 251 VLLLVASSSFDGPFIN--PSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
           V ++V      G      PS    +   E   A++            RH++ Y  LF RV
Sbjct: 227 VWVVVDVWEEVGGSRRRLPSYGPPEVPGEGWEAVRR-----------RHVEAYGGLFGRV 275

Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
            + +             EE +  +P+  R    + D DP L  LLF +GRYLLI+SS PG
Sbjct: 276 RLVVE-----------GEEPL--LPTGRR----REDPDPLLPALLFDYGRYLLIASSAPG 318

Query: 369 TQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
             + ANLQG WN  L P WD+  H++INL+MNYW +    L EC  PL  ++  +  +  
Sbjct: 319 CDLPANLQGKWNPLLEPPWDADYHMDINLQMNYWLAEGAGLGECVRPLVRYVLRMVPSAR 378

Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
           + A+  +   G      +D WA+++ +     W +W    AW+  HL   Y Y  D  FL
Sbjct: 379 EAARRLFGCRGIWFPLTSDAWARATPE--AYGWDVWVGAAAWMAQHLVWRYLYGGDEGFL 436

Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
            + AYP L+  A F  D+L+E  +G L+  PS SPEH +   +G    +  SS +D+ ++
Sbjct: 437 RETAYPFLKEVALFFEDFLVEDGEGVLQVVPSQSPEHRWEGLEGFPVGLCVSSAVDVQLV 496

Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
           R V    +     L  +E     ++   L RLR   +  DG ++EW
Sbjct: 497 RWVLRMAVELGGRL-GDELGRWREMEGRLARLR---VGGDGVLLEW 538


>gi|395326583|gb|EJF58991.1| hypothetical protein DICSQDRAFT_65986 [Dichomitus squalens LYAD-421
           SS1]
          Length = 831

 Score =  184 bits (467), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 171/619 (27%), Positives = 272/619 (43%), Gaps = 81/619 (13%)

Query: 15  ITFNGPAKHFT----DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAP 69
           I +  P + F     D +P+GNG L AMV G    E  +LN ++LW+G P  D T     
Sbjct: 33  IWYTQPGRDFDFWADDWLPVGNGYLAAMVNGQAAQEVTQLNIESLWSGGPFQDPTYNGGN 92

Query: 70  KALSDVRSLVDSGQYAE--------ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE 121
           KA SD  ++    Q            T  S    G P  +   +G   L      L    
Sbjct: 93  KAASDQATVAQEMQVIRQAIFQSPNGTIDSASTSGGPLSIGSYVGAGYL-LATLDLNGGF 151

Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL---SFNVSLD 178
             + R LDL+ A  R  ++ GN  F RE F S+P Q  V +I+ +++ +L   ++  S+D
Sbjct: 152 SDFVRWLDLDAAVQRTSWTQGNASFFRETFCSHPTQACVQRINTTDASTLPALTYAYSVD 211

Query: 179 S----LLDNHS-YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
           +    L+   S + N   QI      PG               + F  +  +  S    +
Sbjct: 212 AESGILIPTVSCFDNSTLQITGTASSPG---------------MAFEILARVSASGTNTS 256

Query: 234 I----SALEDKKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSAL---Q 283
           I    +   +  + V G+  A +  V  + +D   G  ++    K     +++ AL    
Sbjct: 257 IVCAPTGTNNATISVSGASDAFITWVGGTDYDADAGDAVHSFSFKGADPHDALVALIEPA 316

Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
           +    +Y      H+ DY  L  +  + L ++P D  T           P+ +   ++QT
Sbjct: 317 TASATTYDGALAAHIADYAGLITKFELDLDQTP-DFAT-----------PTDQLHDAYQT 364

Query: 344 D-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
           D  +P L  LLF FGRYLL  S+R GT  ANLQG W +D S  W +  H NIN++MNYW 
Sbjct: 365 DVGNPYLEWLLFNFGRYLLAGSAR-GTLPANLQGKWAKDDSNPWSADYHSNINIQMNYWF 423

Query: 403 SLPCNLSECQEPLFDFL-TYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRG--KV 458
           +    + +   PLFD+     +  G+ TAQ  Y ++ GWV H+  +I+  +    G    
Sbjct: 424 AELTGM-DVVTPLFDYFEKTWAPRGALTAQYLYNISEGWVTHN--EIFGHTGMKGGGNTA 480

Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI---EGHDGYLE 515
            WA +P   AW+  H+W+H+++T D D+ + + +PLL+  A F L  L+     +D  L 
Sbjct: 481 SWADYPESNAWMMLHVWDHFDFTQDSDWFKAQGWPLLKSVAQFHLQKLVPDERFNDSTLV 540

Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
            NP  SPE   I     L C          +I ++F+AI     +    + A +++V   
Sbjct: 541 VNPCNSPEQVPI----TLGCAHAQQ-----LIWQLFNAIDKGFAISGDTDTAFLDEVRAK 591

Query: 576 LPRL-RPTKIAEDGSIMEW 593
             ++ +   I   G + EW
Sbjct: 592 REQMDKGIHIGSWGQLQEW 610


>gi|421234544|ref|ZP_15691162.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2061617]
 gi|395600398|gb|EJG60555.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2061617]
          Length = 477

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 140/484 (28%), Positives = 226/484 (46%), Gaps = 73/484 (15%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 27  EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86

Query: 79  VDSGQYAEATA-ASVKLFGHPADVYQL---LGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   A   L G     Y      GDI +EF       ++ T Y+R+L+++ A
Sbjct: 87  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472

Query: 467 GAWL 470
            AW+
Sbjct: 473 NAWM 476


>gi|421249885|ref|ZP_15706342.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2082239]
 gi|395613579|gb|EJG73607.1| fibronectin type III domain protein [Streptococcus pneumoniae
           2082239]
          Length = 456

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 138/484 (28%), Positives = 226/484 (46%), Gaps = 73/484 (15%)

Query: 26  DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
           +A+PIGNG LGA V+G + +E ++ NE +LW+G P     DY      D    L+++R  
Sbjct: 6   EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65

Query: 79  VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
           ++   Y  A   + +    P       Y   GDI +EF       ++ T Y+R+L+++ A
Sbjct: 66  LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125

Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
                Y      F RE F+S PD ++V + +   + +L F + L     L  +  Y    
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185

Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
                      ++ I+M+GR            ND    ++F++ L  +     G I    
Sbjct: 186 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 229

Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
           D+ +++ G+ +A L L A + F     +    K D   + +  + + +   Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288

Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
           +DYQ LF RV + L             E ++D   + + +K+++  E  +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335

Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
           YLLISSSR  P    ANLQG+WN   +P W+S  H+N+NL+MNYW +   NL E   P+ 
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395

Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
           +++  L + G + A V Y          +GW++H +     W     D     W   P  
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451

Query: 467 GAWL 470
            AW+
Sbjct: 452 NAWM 455


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.132    0.395 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,012,891,469
Number of Sequences: 23463169
Number of extensions: 430173124
Number of successful extensions: 1042170
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1262
Number of HSP's successfully gapped in prelim test: 112
Number of HSP's that attempted gapping in prelim test: 1034349
Number of HSP's gapped (non-prelim): 2012
length of query: 613
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 464
effective length of database: 8,863,183,186
effective search space: 4112516998304
effective search space used: 4112516998304
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 80 (35.4 bits)